
Commit 06fe650

chore: release 0.34.2
Author: semantic-release
1 parent 73111d3 commit 06fe650

2 files changed

Lines changed: 23 additions & 1 deletion

CHANGELOG.md

Lines changed: 22 additions & 0 deletions
@@ -1,6 +1,28 @@
 # CHANGELOG
 
 
+## v0.34.2 (2026-03-06)
+
+### Bug Fixes
+
+- Default strict_setup_readiness to False to avoid false infra failures
+([#113](https://github.com/OpenAdaptAI/openadapt-evals/pull/113),
+[`73111d3`](https://github.com/OpenAdaptAI/openadapt-evals/commit/73111d3a059c50b54293c2e9fecd043aad0861b7))
+
+The post-setup focus check (PR #107) defaults to strict mode, which marks tasks as infrastructure
+failures when the a11y window enumeration can't find the expected app title. In practice,
+LibreOffice windows take longer to render titles than the check allows, causing ALL LibreOffice
+tasks to fail as infra — even though the app IS open.
+
+Changing default to False: focus check still runs and logs warnings, but doesn't abort the task. The
+agent can recover from focus issues on its own (it did in all prior trials without this check).
+
+Use --strict-setup-readiness to opt into the fatal behavior when the a11y detection is more
+reliable.
+
+Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
+
+
 ## v0.34.1 (2026-03-06)
 
 ### Bug Fixes
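
The behavior described in the v0.34.2 entry can be illustrated with a minimal sketch. Only the option names `strict_setup_readiness` and `--strict-setup-readiness` come from the changelog. The `InfraFailure` class, the `check_focus` helper, its parameters, and the logger name are hypothetical and are not taken from openadapt-evals. The sketch assumes the focus check compares an expected title against a list of window titles obtained elsewhere.

```python
import argparse
import logging

logger = logging.getLogger("setup_readiness")  # hypothetical logger name


class InfraFailure(RuntimeError):
    """Hypothetical error for a task aborted as an infrastructure failure."""


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # store_true makes the option default to False, so strict mode is opt-in.
    parser.add_argument(
        "--strict-setup-readiness",
        action="store_true",
        help="Treat a failed post-setup focus check as fatal.",
    )
    return parser


def check_focus(window_titles: list[str], expected_title: str, strict: bool) -> bool:
    """Return True if any window title contains expected_title.

    On a miss, raise InfraFailure in strict mode; otherwise log a warning
    and return False so the task can continue.
    """
    if any(expected_title in title for title in window_titles):
        return True
    message = f"No window title contains {expected_title!r}"
    if strict:
        raise InfraFailure(message)
    logger.warning(message)
    return False


if __name__ == "__main__":
    args = build_parser().parse_args()
    # An empty title list stands in for a window whose title has not rendered yet.
    check_focus([], "LibreOffice Writer", strict=args.strict_setup_readiness)
```

Run without arguments, the script only logs a warning. Run with `--strict-setup-readiness`, the same miss raises the hypothetical `InfraFailure`.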

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "openadapt-evals"
-version = "0.34.1"
+version = "0.34.2"
 description = "Evaluation infrastructure for GUI agent benchmarks"
 readme = "README.md"
 requires-python = ">=3.10"
