Skip to content

Commit 23a1a50

Browse files
author
semantic-release
committed
chore: release 0.71.2
1 parent 1a8ad93 commit 23a1a50

2 files changed

Lines changed: 33 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,38 @@
11
# CHANGELOG
22

33

4+
## v0.71.2 (2026-03-28)
5+
6+
### Bug Fixes
7+
8+
- Demo guidance plan overview, anti-loop recovery, GRPO trainer fixes
9+
([#196](https://github.com/OpenAdaptAI/openadapt-evals/pull/196),
10+
[`1a8ad93`](https://github.com/OpenAdaptAI/openadapt-evals/commit/1a8ad939368a14a85d6ba1c9d921b3e065dead77))
11+
12+
Flywheel guidance: - Add get_plan_overview() to DemoLibrary: injects full demo strategy (all steps +
13+
keyboard shortcuts) instead of one-step-at-a-time guidance - Fix _build_enriched_instruction()
14+
producing "Click on Double-click..." and omit fake coordinates from manual demos - DemoGuidedAgent
15+
sets demo_guidance on base agent for anti-loop recovery
16+
17+
Anti-loop recovery: - New _ANTI_LOOP_WARNING_WITH_DEMO directs planner to use keyboard shortcuts
18+
from demo strategy when stuck clicking unresponsive elements - _check_action_loop() uses
19+
demo-aware warning when demo_guidance is set
20+
21+
Chrome popup suppression: - Add registry policy key disabling SpeedComparison in task setup - Launch
22+
Chrome with --no-first-run --disable-features=SpeedComparison - Send Escape to dismiss residual
23+
popups before closing Chrome
24+
25+
GRPO trainer (standalone): - Add vision_loss_mode config: "exclude" (default), "include",
26+
"checkpoint" with warning log when vision tensors stripped from loss computation - Add VRAM
27+
recommendations for max_new_tokens (L40S: 512, A100: 1024-2048) - Add truncation warning when
28+
output hits max_new_tokens without action - Fix float parsing crash on CLICK(x=..., y=...) literal
29+
dots
30+
31+
Validated: flywheel 0.00 -> 0.25 (+0.25) on clear-browsing-data-chrome
32+
33+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
34+
35+
436
## v0.71.1 (2026-03-27)
537

638
### Bug Fixes

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.71.1"
7+
version = "0.71.2"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)