File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change 11# CHANGELOG
22
33
4+ ## v0.61.0 (2026-03-22)
5+
6+ ### Features
7+
8+ - Add checkpoint evaluation script for GRPO before/after comparison
9+ ([ #183 ] ( https://github.com/OpenAdaptAI/openadapt-evals/pull/183 ) ,
10+ [ ` 075472b ` ] ( https://github.com/OpenAdaptAI/openadapt-evals/commit/075472bd3b8dd4bb1eb41fb2aaa13f64068ba77c ) )
11+
12+ Evaluates LoRA checkpoints against WAA tasks with milestone scoring. Same model loading, prompt, and
13+ action parsing as the GRPO trainer. Supports baseline vs checkpoint comparison via TraceAnalyzer.
14+
15+ Co-authored-by: Claude Opus 4.6 (1M context) < noreply@anthropic.com >
16+
17+
418## v0.60.0 (2026-03-22)
519
620### Features
Original file line number Diff line number Diff line change @@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44
55[project ]
66name = " openadapt-evals"
7- version = " 0.60 .0"
7+ version = " 0.61 .0"
88description = " Evaluation infrastructure for GUI agent benchmarks"
99readme = " README.md"
1010requires-python = " >=3.10"
You can’t perform that action at this time.
0 commit comments