Skip to content

Commit 9e9bbc8

Browse files
author
semantic-release
committed
chore: release 0.72.1
1 parent 4ec7d51 commit 9e9bbc8

2 files changed

Lines changed: 25 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,30 @@
11
# CHANGELOG
22

33

4+
## v0.72.1 (2026-03-28)
5+
6+
### Bug Fixes
7+
8+
- Constrained decoding cache bug, task rotation, add trainer tests
9+
([#199](https://github.com/OpenAdaptAI/openadapt-evals/pull/199),
10+
[`4ec7d51`](https://github.com/OpenAdaptAI/openadapt-evals/commit/4ec7d5174e1c8420d436af5cc6810b211a85de61))
11+
12+
Constrained decoding: - Remove (.|\n)* prefix from action regex — Outlines can't compile it into a
13+
DFA efficiently. Model must output action directly. - Fix cache sentinel: use False for failure
14+
(not []) so subsequent calls correctly return None instead of empty logits_processor list. Prior
15+
bug: [] cached as "success" → model generated unconstrained. - Upgrade warning to error level for
16+
visibility.
17+
18+
Task rotation: - Fix _load_task_configs: check `not task_ids` once BEFORE the loop (was checking
19+
inside loop — only first task ever appended).
20+
21+
Tests (21 new): - TestActionRegex: 8 valid actions match, 6 invalid texts rejected -
22+
TestConstrainedDecodingCache: sentinel logic, regression for [] bug - TestTaskRotation: all tasks
23+
loaded, explicit ids preserved, rotation
24+
25+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
26+
27+
428
## v0.72.0 (2026-03-28)
529

630
### Features

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.72.0"
7+
version = "0.72.1"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)