Commit 25eb2f2

Author: semantic-release
chore: release 0.69.1
1 parent 43cac1c commit 25eb2f2

2 files changed

Lines changed: 28 additions & 1 deletion

CHANGELOG.md

Lines changed: 27 additions & 0 deletions
@@ -1,6 +1,33 @@
 # CHANGELOG
 
 
+## v0.69.1 (2026-03-24)
+
+### Bug Fixes
+
+- Align standalone GRPO with WAA API format and add retry logic
+([#193](https://github.com/OpenAdaptAI/openadapt-evals/pull/193),
+[`43cac1c`](https://github.com/OpenAdaptAI/openadapt-evals/commit/43cac1ca9708dbfaa858c89fdea2b3af40df5ebb))
+
+The standalone GRPO trainer produced zero rewards due to two API format bugs in WAADirect:
+
+1. screenshot() tried resp.json() expecting base64-encoded JSON, but WAA's /screenshot returns raw
+PNG bytes via Flask's send_file(). Fixed to use resp.content (matching WAALiveAdapter).
+
+2. execute_action() wrapped commands in `python -c "..."`, but WAA's /execute_windows uses exec()
+directly -- the wrapper caused SyntaxError inside the VM. Fixed to send bare Python statements
+(matching WAALiveAdapter._build_pixel_command).
+
+Additional improvements: - Add probe() method for structured health checking - Add screenshot retry
+logic (3 attempts with 2s delay) - Add double_click, right_click, scroll action types - Fix type
+action to click target first then type (match WAALiveAdapter) - Add pre-rollout health check in
+trainer._collect_group() - Handle empty rollouts gracefully in training loop - Fix train script to
+bypass openadapt_evals/__init__.py eager imports (open_clip -> numpy ABI crash in minimal training
+environments)
+
+Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
+
+
 ## v0.69.0 (2026-03-24)
 
 ### Features
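
The screenshot fix and the retry logic described in the v0.69.1 entry can be illustrated with a small sketch. This is not the code from PR #193: `fetch_screenshot`, `get_response`, and the PNG signature check are hypothetical names and choices introduced here for illustration. The only details taken from the changelog are that the body is read as raw bytes (`resp.content`) rather than parsed with `resp.json()`, and that a failed fetch is retried up to 3 times with a 2 second delay.

```python
import time
from typing import Any, Callable, Optional

# Every PNG file starts with this 8-byte signature.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def fetch_screenshot(
    get_response: Callable[[], Any],
    attempts: int = 3,
    delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bytes:
    """Return raw PNG bytes, retrying on failure.

    `get_response` is a hypothetical callable that performs the HTTP GET and
    returns an object with a `.content` attribute (for example a
    `requests.Response`). `sleep` is injectable so tests do not have to wait.
    """
    last_error: Optional[Exception] = None
    for attempt in range(1, attempts + 1):
        try:
            resp = get_response()
            data = resp.content  # raw bytes; the endpoint does not return JSON
            if data.startswith(PNG_SIGNATURE):
                return data
            last_error = ValueError("response body is not a PNG image")
        except Exception as err:  # network errors, timeouts, and so on
            last_error = err
        if attempt < attempts:
            sleep(delay)
    raise RuntimeError(f"screenshot failed after {attempts} attempts") from last_error
```

With `requests`, a caller might pass something like `lambda: requests.get(url, timeout=30)` as `get_response`, where `url` points at the server's `/screenshot` route; the timeout value is an assumption.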

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "openadapt-evals"
-version = "0.69.0"
+version = "0.69.1"
 description = "Evaluation infrastructure for GUI agent benchmarks"
 readme = "README.md"
 requires-python = ">=3.10"
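
As a footnote to the second bug in the changelog entry, the sketch below shows why a `python -c "..."` wrapper fails when the receiving side hands the payload to `exec()`. `run_like_exec_endpoint` is a hypothetical local stand-in, not WAA's actual handler; it only assumes, as the changelog states, that the payload is executed with `exec()` directly.

```python
def run_like_exec_endpoint(command: str) -> dict:
    """Hypothetical stand-in for a handler that passes its payload to exec()."""
    namespace: dict = {}
    try:
        exec(command, namespace)
    except SyntaxError as err:
        return {"ok": False, "error": f"SyntaxError: {err.msg}"}
    return {"ok": True, "result": namespace.get("result")}


# A shell-style wrapper is not valid Python source, so exec() rejects it.
wrapped = 'python -c "result = 1 + 1"'
# A bare statement is what exec() expects.
bare = "result = 1 + 1"

wrapped_outcome = run_like_exec_endpoint(wrapped)
bare_outcome = run_like_exec_endpoint(bare)
```

Here `wrapped_outcome["ok"]` is `False` because the wrapper string is parsed as Python and fails, while `bare_outcome` reports success with `result` set to 2.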
