Skip to content

Commit 4c000c5

Browse files
author
semantic-release
committed
chore: release 0.41.0
1 parent 578985a commit 4c000c5

2 files changed

Lines changed: 21 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,26 @@
11
# CHANGELOG
22

33

4+
## v0.41.0 (2026-03-18)
5+
6+
### Features
7+
8+
- Add TRL GRPOTrainer rollout_func for WAA environments
9+
([#127](https://github.com/OpenAdaptAI/openadapt-evals/pull/127),
10+
[`578985a`](https://github.com/OpenAdaptAI/openadapt-evals/commit/578985a1b599ecdc230701b4512b950669a7f8dc))
11+
12+
make_waa_rollout_func() wraps WAADesktopEnv into TRL's experimental rollout_func API. Handles VLM
13+
multimodal generation (screenshot → action tokens), dense rewards via milestones, and action JSON
14+
parsing with thinking-token tolerance.
15+
16+
Includes parse_action_json() that handles common VLM quirks (markdown fences, thinking prefixes,
17+
unknown action types).
18+
19+
15 tests passing (10 parser + 5 integration with mock adapter).
20+
21+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
22+
23+
424
## v0.40.0 (2026-03-18)
525

626
### Bug Fixes

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.40.0"
7+
version = "0.41.0"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)