File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change 11# CHANGELOG
22
33
4+ ## v0.71.2 (2026-03-28)
5+
6+ ### Bug Fixes
7+
8+ - Demo guidance plan overview, anti-loop recovery, GRPO trainer fixes
9+ ([ #196 ] ( https://github.com/OpenAdaptAI/openadapt-evals/pull/196 ) ,
10+ [ ` 1a8ad93 ` ] ( https://github.com/OpenAdaptAI/openadapt-evals/commit/1a8ad939368a14a85d6ba1c9d921b3e065dead77 ) )
11+
12+ Flywheel guidance: - Add get_plan_overview() to DemoLibrary: injects full demo strategy (all steps +
13+ keyboard shortcuts) instead of one-step-at-a-time guidance - Fix _ build_enriched_instruction()
14+ producing "Click on Double-click..." and omit fake coordinates from manual demos - DemoGuidedAgent
15+ sets demo_guidance on base agent for anti-loop recovery
16+
17+ Anti-loop recovery: - New _ ANTI_LOOP_WARNING_WITH_DEMO directs planner to use keyboard shortcuts
18+ from demo strategy when stuck clicking unresponsive elements - _ check_action_loop() uses
19+ demo-aware warning when demo_guidance is set
20+
21+ Chrome popup suppression: - Add registry policy key disabling SpeedComparison in task setup - Launch
22+ Chrome with --no-first-run --disable-features=SpeedComparison - Send Escape to dismiss residual
23+ popups before closing Chrome
24+
25+ GRPO trainer (standalone): - Add vision_loss_mode config: "exclude" (default), "include",
26+ "checkpoint" with warning log when vision tensors stripped from loss computation - Add VRAM
27+ recommendations for max_new_tokens (L40S: 512, A100: 1024-2048) - Add truncation warning when
28+ output hits max_new_tokens without action - Fix float parsing crash on CLICK(x=..., y=...) literal
29+ dots
30+
31+ Validated: flywheel 0.00 -> 0.25 (+0.25) on clear-browsing-data-chrome
32+
33+ Co-authored-by: Claude Opus 4.6 (1M context) < noreply@anthropic.com >
34+
35+
436## v0.71.1 (2026-03-27)
537
638### Bug Fixes
Original file line number Diff line number Diff line change @@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44
55[project ]
66name = " openadapt-evals"
7- version = " 0.71.1 "
7+ version = " 0.71.2 "
88description = " Evaluation infrastructure for GUI agent benchmarks"
99readme = " README.md"
1010requires-python = " >=3.10"
You can’t perform that action at this time.
0 commit comments