 # CHANGELOG


+## v0.70.0 (2026-03-24)
+
+### Features
+
+- Add --task-ids, --max-steps-per-episode, --max-new-tokens to standalone GRPO CLI
+([`6ecd136`](https://github.com/OpenAdaptAI/openadapt-evals/commit/6ecd1369fd834fce34e8c7f1f43a5d1f358aeba0))
+
+Without --task-ids, the trainer cycles through ALL tasks in --task-dir including hard ones
+(calc-formula) that base models can't complete. Now you can filter: --task-ids
+custom-notepad-hello
+
+Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
+
+
 ## v0.69.1 (2026-03-24)

 ### Bug Fixes
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"

 [project]
 name = "openadapt-evals"
-version = "0.69.1"
+version = "0.70.0"
 description = "Evaluation infrastructure for GUI agent benchmarks"
 readme = "README.md"
 requires-python = ">=3.10"
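
The v0.70.0 changelog entry describes filtering the training tasks with `--task-ids` so the trainer no longer cycles through every task in `--task-dir`. The sketch below shows one way such a filter could behave. It is not the project's actual implementation: the flag names come from the changelog, but the helper names (`build_parser`, `select_tasks`), the space-separated value syntax, and the `None` defaults are all assumptions made for illustration.

```python
import argparse
from typing import Optional, Sequence


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser; only the flag names are taken from the changelog."""
    parser = argparse.ArgumentParser(description="Sketch of a standalone GRPO CLI")
    parser.add_argument("--task-dir", required=True, help="Directory holding task definitions")
    # Assumption: task IDs are passed space-separated, e.g. --task-ids a b c
    parser.add_argument("--task-ids", nargs="+", default=None, help="Only train on these task IDs")
    # Assumption: None means "use the trainer's own default"
    parser.add_argument("--max-steps-per-episode", type=int, default=None)
    parser.add_argument("--max-new-tokens", type=int, default=None)
    return parser


def select_tasks(all_task_ids: Sequence[str], wanted: Optional[Sequence[str]]) -> list[str]:
    """Return the tasks to train on, keeping the original ordering.

    With no filter, every task is kept (the pre-0.70.0 behaviour described
    in the changelog). Unknown IDs raise instead of being silently dropped.
    """
    if not wanted:
        return list(all_task_ids)
    unknown = [task_id for task_id in wanted if task_id not in all_task_ids]
    if unknown:
        raise ValueError(f"Unknown task IDs: {unknown}")
    wanted_set = set(wanted)
    return [task_id for task_id in all_task_ids if task_id in wanted_set]
```

Under these assumptions, parsing `--task-dir tasks --task-ids custom-notepad-hello` and passing the result to `select_tasks` would keep `custom-notepad-hello` and skip harder tasks such as `calc-formula`.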