File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change 11# CHANGELOG
22
33
4+ ## v0.34.2 (2026-03-06)
5+
6+ ### Bug Fixes
7+
8+ - Default strict_setup_readiness to False to avoid false infra failures
9+ ([ #113 ] ( https://github.com/OpenAdaptAI/openadapt-evals/pull/113 ) ,
10+ [ ` 73111d3 ` ] ( https://github.com/OpenAdaptAI/openadapt-evals/commit/73111d3a059c50b54293c2e9fecd043aad0861b7 ) )
11+
12+ The post-setup focus check (PR #107 ) defaults to strict mode, which marks tasks as infrastructure
13+ failures when the a11y window enumeration can't find the expected app title. In practice,
14+ LibreOffice windows take longer to render titles than the check allows, causing ALL LibreOffice
15+ tasks to fail as infra — even though the app IS open.
16+
17+ Changing default to False: focus check still runs and logs warnings, but doesn't abort the task. The
18+ agent can recover from focus issues on its own (it did in all prior trials without this check).
19+
20+ Use --strict-setup-readiness to opt into the fatal behavior when the a11y detection is more
21+ reliable.
22+
23+ Co-authored-by: Claude Opus 4.6 < noreply@anthropic.com >
24+
25+
426## v0.34.1 (2026-03-06)
527
628### Bug Fixes
Original file line number Diff line number Diff line change @@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44
55[project ]
66name = " openadapt-evals"
7- version = " 0.34.1 "
7+ version = " 0.34.2 "
88description = " Evaluation infrastructure for GUI agent benchmarks"
99readme = " README.md"
1010requires-python = " >=3.10"
You can’t perform that action at this time.
0 commit comments