Commit b634e8f

Author: semantic-release
chore: release 0.77.4
1 parent 5413864 commit b634e8f

2 files changed

Lines changed: 24 additions & 1 deletion

CHANGELOG.md

Lines changed: 23 additions & 0 deletions
@@ -1,6 +1,29 @@
 # CHANGELOG
 
 
+## v0.77.4 (2026-03-29)
+
+### Bug Fixes
+
+- Proper vision-safe loss — process full text as one unit
+([#224](https://github.com/OpenAdaptAI/openadapt-evals/pull/224),
+[`5413864`](https://github.com/OpenAdaptAI/openadapt-evals/commit/5413864342a71f3cead106d640e5ba6adc5fae95))
+
+Root cause: manually concatenating action_ids onto prompt input_ids created inconsistent input
+(pixel_values sized for prompt, input_ids includes action tokens). Qwen3's vision merge changes
+internal sequence length, crashing with attention mask mismatches.
+
+Fix: process prompt_text + action_text as a SINGLE string through the
+
+processor. Produces consistent input_ids, pixel_values, attention_mask. The model handles vision
+merge correctly on processor output.
+
+Replaces the silent fallback from PR #223 with a proper solution that gives correct vision-aware
+gradients for ALL steps in ALL modes.
+
+Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
+
+
 ## v0.77.3 (2026-03-29)
 
 ### Bug Fixes

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "openadapt-evals"
-version = "0.77.3"
+version = "0.77.4"
 description = "Evaluation infrastructure for GUI agent benchmarks"
 readme = "README.md"
 requires-python = ">=3.10"
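
Note on the v0.77.4 changelog entry: the fix it describes (one processor call over prompt_text + action_text instead of appending action_ids by hand) can be illustrated with a small, dependency-free sketch. Everything below is hypothetical and is not the project's code: `toy_processor` is a stand-in for a real vision-language processor, `PATCHES_PER_IMAGE` is an arbitrary toy value, and masking prompt positions by a separately measured prompt length is one possible way to restrict the loss to action tokens. The changelog does not say how the project builds its labels.

```python
from dataclasses import dataclass

IGNORE_INDEX = -100       # label value that PyTorch's CrossEntropyLoss ignores by default
IMAGE_TOKEN = "<image>"   # hypothetical image placeholder
PATCHES_PER_IMAGE = 4     # toy value; a real processor derives this from the image size

# Toy vocabulary: the image placeholder gets id 0, other words get ids on first use.
VOCAB: dict[str, int] = {IMAGE_TOKEN: 0}


@dataclass
class Batch:
    input_ids: list[int]
    attention_mask: list[int]
    num_image_patches: int  # stands in for the size of pixel_values


def toy_processor(text: str, num_images: int) -> Batch:
    """Hypothetical stand-in for a vision-language processor.

    Tokenizes on whitespace and expands every image placeholder into
    PATCHES_PER_IMAGE tokens, so all outputs come from a single call.
    """
    ids: list[int] = []
    for word in text.split():
        token_id = VOCAB.setdefault(word, len(VOCAB))
        repeat = PATCHES_PER_IMAGE if word == IMAGE_TOKEN else 1
        ids.extend([token_id] * repeat)
    return Batch(ids, [1] * len(ids), num_images * PATCHES_PER_IMAGE)


def build_training_example(prompt_text: str, action_text: str, num_images: int):
    # One call over the full string keeps ids, mask, and image features consistent.
    full = toy_processor(prompt_text + action_text, num_images)
    # Assumption: the prompt tokenizes to a prefix of the full sequence, so its
    # length marks where the action tokens begin.
    prompt_len = len(toy_processor(prompt_text, num_images).input_ids)
    # Only action tokens contribute to the loss; prompt positions are masked out.
    labels = [IGNORE_INDEX] * prompt_len + full.input_ids[prompt_len:]
    return full, labels


full, labels = build_training_example(
    "<image> Task: press OK . Action: ", "click 120 45", num_images=1
)
```

In this sketch the sequence length, the attention mask, and the image patch count all come from the same call, which is the consistency property the changelog entry attributes to the fix. With a real tokenizer the prefix assumption can fail at the prompt/action boundary, so it would need to be checked before relying on it.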
