Skip to content

Commit bd1df33

Browse files
author
semantic-release
committed
chore: release 0.67.0
1 parent 6454bc9 commit bd1df33

2 files changed

Lines changed: 23 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,28 @@
11
# CHANGELOG
22

33

4+
## v0.67.0 (2026-03-23)
5+
6+
### Features
7+
8+
- Add SGLang local model serving to comparison framework
9+
([#190](https://github.com/OpenAdaptAI/openadapt-evals/pull/190),
10+
[`6454bc9`](https://github.com/OpenAdaptAI/openadapt-evals/commit/6454bc9057bf5bea9abd368ae7b0819e85c0a5b2))
11+
12+
Add support for serving models via SGLang on remote GPU hosts, enabling comparison of API models
13+
(GPT, Claude) against locally-served models (e.g. Qwen3.5-9B) that vLLM cannot serve.
14+
15+
Key changes: - New scripts/sglang_server.py: SGLangServerManager handles full lifecycle (SSH
16+
install, server start, readiness polling, SSH tunnel, cleanup) - Extended ModelConfig with
17+
provider="sglang", serve config, max_new_tokens - New --gpu-host and --ssh-key CLI flags
18+
(optional; sglang models skipped without --gpu-host) - SGLang server auto-starts per model,
19+
tunneled as OpenAI-compatible API - Environment variables (OPENAI_BASE_URL, OPENAI_API_KEY)
20+
saved/restored between models so API models remain unaffected - New
21+
example_comparisons/unified_agents.yaml demonstrating mixed config
22+
23+
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
24+
25+
426
## v0.66.0 (2026-03-23)
527

628
### Features

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "openadapt-evals"
7-
version = "0.66.0"
7+
version = "0.67.0"
88
description = "Evaluation infrastructure for GUI agent benchmarks"
99
readme = "README.md"
1010
requires-python = ">=3.10"

0 commit comments

Comments
 (0)