11# CHANGELOG
22
33
4+ ## v0.69.1 (2026-03-24)
5+
6+ ### Bug Fixes
7+
8+ - Align standalone GRPO with WAA API format and add retry logic
9+ ([#193](https://github.com/OpenAdaptAI/openadapt-evals/pull/193),
10+ [`43cac1c`](https://github.com/OpenAdaptAI/openadapt-evals/commit/43cac1ca9708dbfaa858c89fdea2b3af40df5ebb))
11+
12+ The standalone GRPO trainer produced zero rewards due to two API format bugs in WAADirect:
13+
14+ 1. screenshot() tried resp.json() expecting base64-encoded JSON, but WAA's /screenshot returns raw
15+ PNG bytes via Flask's send_file(). Fixed to use resp.content (matching WAALiveAdapter).
16+
17+ 2. execute_action() wrapped commands in `python -c "..."`, but WAA's /execute_windows uses exec()
18+ directly -- the wrapper caused SyntaxError inside the VM. Fixed to send bare Python statements
19+ (matching WAALiveAdapter._build_pixel_command).
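The two format bugs above can be illustrated with a minimal, network-free sketch. The helper names (`extract_png`, `build_click_command`, `wrap_in_python_c`, `is_valid_exec_payload`) and the use of `pyautogui` in the command string are assumptions for illustration, not the actual WAADirect or WAALiveAdapter code. The sketch only shows why a raw PNG body should be taken as bytes and why a `python -c "..."` wrapper fails when the server passes the payload straight to `exec()`.

```python
# Hypothetical helpers; names and the pyautogui command are illustrative assumptions.
PNG_MAGIC = b"\x89PNG\r\n\x1a\n"  # the standard 8-byte PNG file signature


def extract_png(body: bytes) -> bytes:
    """Treat the response body as raw PNG bytes (like resp.content), not JSON."""
    if not body.startswith(PNG_MAGIC):
        raise ValueError("response body is not a raw PNG")
    return body


def build_click_command(x: int, y: int) -> str:
    """Build bare Python statements that a server-side exec() could run."""
    return f"import pyautogui; pyautogui.click({x}, {y})"


def wrap_in_python_c(command: str) -> str:
    """Reproduce the buggy shell-style wrapper for comparison."""
    return f'python -c "{command}"'


def is_valid_exec_payload(payload: str) -> bool:
    """Check that the payload parses as Python source; nothing is executed."""
    try:
        compile(payload, "<payload>", "exec")
        return True
    except SyntaxError:
        return False
```

Under these assumptions, `is_valid_exec_payload(build_click_command(10, 20))` is true, while the wrapped form is rejected because `python -c "..."` is shell syntax rather than Python source.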
20+
21+ Additional improvements: - Add probe() method for structured health checking - Add screenshot retry
22+ logic (3 attempts with 2s delay) - Add double_click, right_click, scroll action types - Fix type
23+ action to click target first then type (match WAALiveAdapter) - Add pre-rollout health check in
24+ trainer._collect_group() - Handle empty rollouts gracefully in training loop - Fix train script to
25+ bypass openadapt_evals/__init__.py eager imports (open_clip -> numpy ABI crash in minimal training
26+ environments)
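The screenshot retry behavior listed above (3 attempts with a 2s delay) could look roughly like the following sketch. The function name `fetch_with_retry`, its parameters, and the injectable `sleep` argument are hypothetical and chosen here so the example runs without a VM. The actual implementation in the trainer may differ.

```python
import time
from typing import Callable, Optional


def fetch_with_retry(
    fetch: Callable[[], bytes],
    attempts: int = 3,
    delay_s: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bytes:
    """Call fetch() up to `attempts` times, sleeping `delay_s` between failures."""
    last_error: Optional[Exception] = None
    for attempt in range(1, attempts + 1):
        try:
            data = fetch()
            if data:
                return data
            # An empty body is treated as a failed attempt.
            last_error = ValueError("empty screenshot")
        except Exception as exc:  # broad on purpose in this sketch
            last_error = exc
        if attempt < attempts:
            sleep(delay_s)  # no sleep after the final attempt
    raise RuntimeError(f"screenshot failed after {attempts} attempts") from last_error
```

Passing `sleep` as a parameter keeps the helper easy to test. A caller would typically supply a `fetch` closure that performs the HTTP request and returns the response body.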
27+
28+ Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
29+
30+
431## v0.69.0 (2026-03-24)
532
633### Features
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44
55[project]
66name = "openadapt-evals"
7- version = "0.69.0"
7+ version = "0.69.1"
88description = "Evaluation infrastructure for GUI agent benchmarks"
99readme = "README.md"
1010requires-python = ">=3.10"