Skip to content

Commit 47f1126

Browse files
author
semantic-release
committed
chore: release 0.69.0
1 parent 43021a7 commit 47f1126

2 files changed

Lines changed: 24 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,29 @@
11
# CHANGELOG
22

33

4+
## v0.69.0 (2026-03-24)
5+
6+
### Features
7+
8+
- Add comprehensive API and infrastructure cost tracking
9+
([#192](https://github.com/OpenAdaptAI/openadapt-evals/pull/192),
10+
[`43021a7`](https://github.com/OpenAdaptAI/openadapt-evals/commit/43021a74ede54200df9e87d83dc32bcf48d0455b))
11+
12+
Add a centralized, thread-safe CostTracker that records token usage from every VLM/LLM API call and
13+
infrastructure time (GPU/VM hours).
14+
15+
The tracker is integrated at the vlm_call() level so all 15+ callers automatically get cost tracking
16+
without any changes. Key integration points have cost_label tags for per-component breakdown
17+
(planner, grounder, vlm_judge, demo_verify, etc.).
18+
19+
- New openadapt_evals/cost_tracker.py with global singleton, pricing tables, JSON persistence, and
20+
human-readable summary output - vlm.py extracts response.usage tokens from both OpenAI and
21+
Anthropic responses and reports to the tracker - 18 unit tests covering pricing lookup,
22+
aggregation, thread safety, persistence, and vlm.py integration
23+
24+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
25+
26+
427
## v0.68.0 (2026-03-23)
528

629
### Features

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.68.0"
7+
version = "0.69.0"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)