Skip to content

Commit 0824a36

Browse files
author
semantic-release
committed
chore: release 0.78.0
1 parent 6d9fcb7 commit 0824a36

2 files changed

Lines changed: 25 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,30 @@
11
# CHANGELOG
22

33

4+
## v0.78.0 (2026-03-29)
5+
6+
### Features
7+
8+
- Weave (W&B) integration for LLM/agent tracing
9+
([#228](https://github.com/OpenAdaptAI/openadapt-evals/pull/228),
10+
[`6d9fcb7`](https://github.com/OpenAdaptAI/openadapt-evals/commit/6d9fcb704e0ea0d4f1e7d2beff927a64e2676d48))
11+
12+
Weave auto-patches OpenAI and Anthropic clients after weave.init(), giving automatic tracing of
13+
every VLM call with prompts, responses, costs, and latency in hierarchical trace trees.
14+
15+
Integration points: - vlm_call() — @weave_op: all planner/grounder/evaluator calls traced -
16+
vlm_judge() — @weave_op: milestone evaluation traced - DemoExecutor.run() + _execute_step() —
17+
@weave_op: episode trace tree - PlannerGrounderAgent.act() + _call_planner() — @weave_op: agent
18+
decisions - WandbLogger.init() — calls weave_init() alongside wandb.init()
19+
20+
When weave is not installed, all decorators are zero-cost passthrough. weave>=0.50.0 added to
21+
[wandb] optional extra.
22+
23+
76/76 tests pass.
24+
25+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
26+
27+
428
## v0.77.5 (2026-03-29)
529

630
### Bug Fixes

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.77.5"
7+
version = "0.78.0"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)