File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change 11# CHANGELOG
22
33
4+ ## v0.81.1 (2026-03-29)
5+
6+ ### Bug Fixes
7+
8+ - Set per_device_train_batch_size to match dataset size
9+ ([ #240 ] ( https://github.com/OpenAdaptAI/openadapt-evals/pull/240 ) ,
10+ [ ` 048796c ` ] ( https://github.com/OpenAdaptAI/openadapt-evals/commit/048796c020a474293758ff8a95ed6ef520f41fbf ) )
11+
12+ * fix: set per_device_train_batch_size to match dataset size
13+
14+ TRL's default per_device_train_batch_size=8, but with 1-3 tasks the dataset is too small to form a
15+ single batch. TRL computes 0 steps and exits with "There seems not to be a single sample in your
16+ epoch_iterator".
17+
18+ Fix: set batch_size=n_tasks when building default GRPOConfig. When the
19+
20+ user provides their own trl_config, warn if batch_size > dataset size.
21+
22+ Co-Authored-By: Claude Opus 4.6 (1M context) < noreply@anthropic.com >
23+
24+ * fix: use batch_size=1 instead of n_tasks to avoid OOM with many tasks
25+
26+ batch_size=n_tasks could OOM on GPU with many tasks. batch_size=1 is safer and matches the
27+ standalone trainer behavior (one task per step, rotating through tasks via epochs). Each step
28+ still does num_generations rollouts, so learning signal is preserved.
29+
30+ ---------
31+
32+ Co-authored-by: Claude Opus 4.6 (1M context) < noreply@anthropic.com >
33+
34+
435## v0.81.0 (2026-03-29)
536
637### Features
Original file line number Diff line number Diff line change @@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44
55[project ]
66name = " openadapt-evals"
7- version = " 0.81.0 "
7+ version = " 0.81.1 "
88description = " Evaluation infrastructure for GUI agent benchmarks"
99readme = " README.md"
1010requires-python = " >=3.10"
You can’t perform that action at this time.
0 commit comments