Skip to content

Commit 2d24e5f

Browse files
author
semantic-release
committed
chore: release 0.70.0
1 parent 6ecd136 commit 2d24e5f

2 files changed

Lines changed: 15 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,20 @@
11
# CHANGELOG
22

33

4+
## v0.70.0 (2026-03-24)
5+
6+
### Features
7+
8+
- Add --task-ids, --max-steps-per-episode, --max-new-tokens to standalone GRPO CLI
9+
([`6ecd136`](https://github.com/OpenAdaptAI/openadapt-evals/commit/6ecd1369fd834fce34e8c7f1f43a5d1f358aeba0))
10+
11+
Without --task-ids, the trainer cycles through ALL tasks in --task-dir including hard ones
12+
(calc-formula) that base models can't complete. Now you can filter: --task-ids
13+
custom-notepad-hello
14+
15+
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
16+
17+
418
## v0.69.1 (2026-03-24)
519

620
### Bug Fixes

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.69.1"
7+
version = "0.70.0"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)