 # CHANGELOG


+## v0.77.4 (2026-03-29)
+
+### Bug Fixes
+
+- Proper vision-safe loss: process full text as one unit
+([#224](https://github.com/OpenAdaptAI/openadapt-evals/pull/224),
+[`5413864`](https://github.com/OpenAdaptAI/openadapt-evals/commit/5413864342a71f3cead106d640e5ba6adc5fae95))
+
+Root cause: manually concatenating action_ids onto prompt input_ids created inconsistent input
+(pixel_values sized for prompt, input_ids includes action tokens). Qwen3's vision merge changes
+internal sequence length, crashing with attention mask mismatches.
+
+Fix: process prompt_text + action_text as a SINGLE string through the processor. Produces
+consistent input_ids, pixel_values, attention_mask. The model handles vision merge correctly on
+processor output.
+
+Replaces the silent fallback from PR #223 with a proper solution that gives correct vision-aware
+gradients for ALL steps in ALL modes.
+
+Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
+
+
 ## v0.77.3 (2026-03-29)

 ### Bug Fixes
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"

 [project]
 name = "openadapt-evals"
-version = "0.77.3"
+version = "0.77.4"
 description = "Evaluation infrastructure for GUI agent benchmarks"
 readme = "README.md"
 requires-python = ">=3.10"
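
The v0.77.4 changelog entry describes the fix as passing prompt_text + action_text through the processor as a single string, so that every returned field describes the same sequence. The following is a minimal sketch of that idea and not the repository's implementation. `toy_processor`, `build_loss_inputs`, and the `<image>` placeholder are hypothetical stand-ins. A real vision-language processor would also return `pixel_values` and integer token ids, and tokenizing the action text on its own may not match its in-context tokenization exactly.

```python
IGNORE_INDEX = -100  # label value that PyTorch's cross-entropy loss ignores by default


def toy_processor(text, n_image_patches=4):
    """Hypothetical stand-in for a vision-language processor.

    Splits on whitespace and expands each "<image>" placeholder into
    n_image_patches tokens, mimicking how image tokens change the
    sequence length relative to the raw prompt text.
    """
    input_ids = []
    for token in text.split():
        if token == "<image>":
            input_ids.extend(["<image>"] * n_image_patches)
        else:
            input_ids.append(token)
    return {"input_ids": input_ids, "attention_mask": [1] * len(input_ids)}


def build_loss_inputs(prompt_text, action_text, processor=toy_processor):
    """Process prompt and action as ONE string, then mask the prompt in the labels."""
    # One processor call, so input_ids and attention_mask always agree in length.
    full = processor(prompt_text + " " + action_text)
    # Assumption: the action occupies the last n_action positions of the sequence.
    n_action = len(processor(action_text)["input_ids"])
    n_prompt = len(full["input_ids"]) - n_action
    # Only action tokens contribute to the loss; prompt positions are ignored.
    full["labels"] = [IGNORE_INDEX] * n_prompt + full["input_ids"][n_prompt:]
    return full


batch = build_loss_inputs("<image> click the save button", "CLICK 120 48")
```

Because the labels are derived from the processor output rather than appended by hand, their length tracks the expanded sequence, which is the consistency property the changelog entry attributes to the fix.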