Train batch generic by HosseinKaviani-H · Pull Request #724 · meta-pytorch/torchforge

HosseinKaviani-H · 2026-01-22T23:59:05Z

Summary

Adds TrainBatch dataclass that separates model_inputs from loss_inputs, enabling any training paradigm without type changes.

Motivation

The current TextTrainBatch has limitations:

Hardcoded fields require changes for each new training mode
Text-only naming doesn't support multimodal
Every new paradigm (DPO, distillation, etc.) needs type updates

Solution

@dataclass
class TrainBatch:
    model_inputs: dict[str, Any]
    loss_inputs: dict[str, Any]
    meta: dict[str, Any] = field(default_factory=dict)

# Usage:
logits = model(**batch.model_inputs)
loss = loss_fn(logits, **batch.loss_inputs)

Files Changed

File: src/forge/types.py
Change: Added TrainBatch dataclass
────────────────────────────────────────
File: src/forge/rl/collate.py
Change: Updated to return list[TrainBatch] with model_inputs/loss_inputs
────────────────────────────────────────
File: src/forge/actors/trainer/titan.py
Change: Updated train_step() to accept list[TrainBatch] and unpack fields
────────────────────────────────────────
File: apps/grpo/main.py
Change: Updated to pass batch directly: trainer.train_step.call(batch)
────────────────────────────────────────
File: tests/sandbox/rl_trainer/main.py
Change: Updated to pass batch directly: trainer.train_step.call(batch)
────────────────────────────────────────
File: tests/sandbox/weight_sync/main.py
Change: Updated to pass batch directly: trainer.train_step.call(batch)

Test Plan

Core implementation: types.py, collate.py, titan.py, main.py
Update test files (tests/sandbox/)

Rewards and Losses

Tested the GRPO for 100 steps:

felipemello1

i dont think that this class should be in trainer.py. Probably in types.py or something like that. Are you also going to add it to collate and test it in this PR?

joecummings · 2026-01-23T18:08:33Z

i dont think that this class should be in trainer.py. Probably in types.py or something like that. Are you also going to add it to collate and test it in this PR?

Why wouldn't this be in the trainer.py file under api? It defines the training API of which this is part. I would vote to keep it in the trainer API.

felipemello1 · 2026-01-23T20:01:15Z

Why wouldn't this be in the trainer.py file under api?

this is also used collate_fn. Not sure if it may be used in other places. I think we would be exposed to circular dependencies.

e.g. collate imports from train
train imports from X
X imports from collate

Also, thats what other frameworks do, like tinker: https://github.com/thinking-machines-lab/tinker/blob/ad03d44978096b1dcae662e469293e70f509d5a8/src/tinker/types/datum.py#L25

joecummings · 2026-01-23T20:30:43Z

e.g. collate imports from train
train imports from X
X imports from collate

What would X be here? I will not hold up the PR on this point but am curious b/c I have a hard time imagining what that would be.

felipemello1 · 2026-01-23T21:18:35Z

What would X be here?

I will leave that as an exercise for the reader

jk, i guess it cannot happen if collate is its own file and doesnt really import from anywhere. It just makes more sense to me, given the patterns i have seen. But no big deal either way. Worst case we refactor later.

felipemello1

LGTM, ty! but i would like to see main.py for 25-50 steps. Could you run it and share the rewards and loss sections?

Co-authored-by: Felipe Mello <fmellomascarenhas@gmail.com>

Co-authored-by: Hossein Kavianihamedani <hosseinkh@fb.com> Co-authored-by: Felipe Mello <fmellomascarenhas@gmail.com>

meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 22, 2026

HosseinKaviani-H requested a review from felipemello1 January 23, 2026 00:42

felipemello1 reviewed Jan 23, 2026

View reviewed changes

Comment thread src/forge/api/trainer.py Outdated

felipemello1 reviewed Jan 23, 2026

View reviewed changes

Hossein Kavianihamedani added 3 commits January 23, 2026 09:37

Add TrainBatch dataclass for universal training batches

f6e493c

More consice examples

8eb1f77

Move TrainBatch to types.py and update collate imports

e6f1ff8

Update TitanTrainer and GRPO main to use TrainBatch

34af55b

HosseinKaviani-H force-pushed the TrainBatch_Generic branch from 81e475d to 34af55b Compare January 23, 2026 18:32

Hossein Kavianihamedani added 2 commits January 23, 2026 16:11

Update test scripts to use TrainBatch

8cf0f1b

Update test scripts to use TrainBatch

0a988c2

felipemello1 reviewed Jan 26, 2026

View reviewed changes

Comment thread src/forge/types.py Outdated

felipemello1 reviewed Jan 26, 2026

View reviewed changes

Comment thread src/forge/types.py Outdated

felipemello1 reviewed Jan 26, 2026

View reviewed changes

HosseinKaviani-H and others added 2 commits January 26, 2026 08:55

Update src/forge/types.py

4244d96

Co-authored-by: Felipe Mello <fmellomascarenhas@gmail.com>

Update src/forge/types.py

1490f6e

Co-authored-by: Felipe Mello <fmellomascarenhas@gmail.com>

felipemello1 approved these changes Jan 26, 2026

View reviewed changes

felipemello1 merged commit a3ae18b into meta-pytorch:main Jan 26, 2026
10 checks passed

HosseinKaviani-H added a commit to HosseinKaviani-H/forge that referenced this pull request Feb 9, 2026

Train batch generic (meta-pytorch#724)

e49e526

Co-authored-by: Hossein Kavianihamedani <hosseinkh@fb.com> Co-authored-by: Felipe Mello <fmellomascarenhas@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Train batch generic#724

Train batch generic#724
felipemello1 merged 8 commits into
meta-pytorch:mainfrom
HosseinKaviani-H:TrainBatch_Generic

HosseinKaviani-H commented Jan 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

felipemello1 left a comment

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

felipemello1 left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

HosseinKaviani-H commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Solution

Files Changed

Test Plan

Rewards and Losses

Uh oh!

Uh oh!

felipemello1 left a comment

Choose a reason for hiding this comment

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

felipemello1 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HosseinKaviani-H commented Jan 22, 2026 •

edited

Loading

felipemello1 commented Jan 23, 2026 •

edited

Loading

felipemello1 left a comment •

edited

Loading