Commit 3b5d733

[Release] v0.3.0 (#427)
1 parent 158f5df commit 3b5d733

4 files changed

Lines changed: 97 additions & 6 deletions

README.md

Lines changed: 2 additions & 1 deletion
@@ -37,7 +37,7 @@ pip install agentlightning
 For the latest nightly build (cutting-edge features), you can install from Test PyPI:

 ```bash
-pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ agentlightning
+pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ --pre agentlightning
 ```

 Please refer to our [installation guide](https://microsoft.github.io/agent-lightning/stable/tutorials/installation/) for more details.
@@ -46,6 +46,7 @@ To start using Agent-lightning, check out our [documentation](https://microsoft.

 ## ⚡ Articles

+- 12/17/2025 [Adopting the Trajectory Level Aggregation for Faster Training](https://agent-lightning.github.io/posts/trajectory_level_aggregation/) Agent-lightning blog.
 - 11/4/2025 [Tuning ANY AI agent with Tinker ✕ Agent-lightning](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-1-1d8c9a397f0e) Medium. See also [Part 2](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-2-332c5437f0dc).
 - 10/22/2025 [No More Retokenization Drift: Returning Token IDs via the OpenAI Compatible API Matters in Agent RL](https://blog.vllm.ai/2025/10/22/agent-lightning.html) vLLM blog. See also [Zhihu writeup](https://zhuanlan.zhihu.com/p/1965067274642785725).
 - 8/11/2025 [Training AI Agents to Write and Self-correct SQL with Reinforcement Learning](https://medium.com/@yugez/training-ai-agents-to-write-and-self-correct-sql-with-reinforcement-learning-571ed31281ad) Medium.

docs/changelog.md

Lines changed: 92 additions & 0 deletions
@@ -1,5 +1,97 @@
 # Changelog

+## Agent-lightning v0.3.0 (12/24/2025)
+
+Agent-lightning v0.3.0 is a major release that introduces several new features and bug fixes. The release is a collaborative effort between Agent-lightning core teams and the community. Thanks to all the contributors who made this release possible.
+
+### Highlights
+
+* **Tinker integration**: Support Tinker as an alternative backend for Reinforcement Learning (#226 #245 #264 #269 #327). See [example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/tinker), [blog 1](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-1-1d8c9a397f0e) and [blog 2](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-2-332c5437f0dc).
+* **Azure OpenAI integration**: Support Azure OpenAI as a backend for LLM inference and supervised fine-tuning (#256 #327). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/azure).
+* **MongoDB-based Lightning Store** is added as an alternative backend for Lightning Store (#323). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/parallelize/#parallelizing-lightningstore).
+* **Contrib package**: Add contrib package for community projects. Search-R1 is integrated as a contrib recipe. More coming. (#239 #396 #410 #412 #417).
+* **RESTful API**: Stabilize and document RESTful API for Lightning Store (#241 #275). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/reference/restful/).
+* **OTel Semantic Conventions** that are specifically designed for Agent-optimization areas (#340). [Documentation](http://microsoft.github.io/agent-lightning/v0.3.0/reference/semconv/).
+* *[Preview]* **Agent-lightning Dashboard** is now available (#288 #289 #291 #296 #371 #375). It's the official web application for inspecting and debugging Agent-lightning experiments. See details [here](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/debug/).
+* *[Preview]* **Multi-modality example** featuring VERL and a LangGraph agent on ChartQA dataset (#379). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/chartqa).
+* *[Preview]* Integrate **Claude Code** as a LitAgent and support training on SWE-Bench (#332 #346 #348). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/claude_code).
+* *[Preview]* **Weave tracer** as a substitute for AgentOps tracer (#277 #411 #420 #423). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/traces/#weave-tracer-experimental).
+* *[Preview]* **Trajectory Level Aggregation** for more efficient training with VERL. See [blog](https://agent-lightning.github.io/posts/trajectory_level_aggregation/) and [documentation](https://microsoft.github.io/agent-lightning/v0.3.0/algorithm-zoo/verl/).
+
+### Store Benchmark
+
+In this release, the Lightning Store core was redesigned for significantly greater efficiency and scalability (#315 #318 #328 #342 #344 #356 #380 #388 #418 #421). The benchmark results below demonstrate the impact: with large numbers of concurrent runners, v0.3.0 delivers up to a 15x increase in throughput compared to v0.2.2.
+
+| Throughput (\#rollout/sec) | v0.2.2 | v0.3.0 (in-memory) | v0.3.0 (Mongo) |
+| :---- | :---- | :---- | :---- |
+| Minimal (batch, #runners=32, #turns=6) | 8.73 | 9.06 | 8.71 |
+| Medium (batch, #runners=100, #turns=10) | 12.03 | 23.26 | 32.79 |
+| Mid-high (batch, #runners=300, #turns=6) | 10.61 | 24.42 | 40.24 |
+| Large (batch, #runners=1000, #turns=3) | 3.36 | 14.60 | 50.05 |
+| Long queue (queue, #runners=256, #turns=4) | 7.42 | 30.86 | 57.01 |
+| Heavy trace (queue, #runners=512, #turns=20) | 5.93 | 13.28 | 29.41 |
+
+*Notes:*
+
+1. Benchmarks were run on a single Standard_D32as_v4 Azure VM (Large and heavy trace tests used Standard_D64ads_v5), executed via GitHub Actions.
+2. Two algorithm patterns are evaluated: the batch pattern submits a group of rollouts and waits for all to finish before starting the next group, while the queue pattern maintains a set number of in-flight rollouts, submitting new ones as soon as capacity frees up. Configuration details are available [here](https://github.com/microsoft/agent-lightning/blob/v0.3.0/.github/workflows/benchmark.yml).
+3. The number of turns is directly proportional to the number of spans each rollout generates.
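
Editorial sketch (not part of the commit): the two submission patterns described in note 2 can be illustrated with plain `asyncio`. This is a minimal sketch under stated assumptions, not the benchmark's actual code. `run_rollout`, `batch_pattern`, and `queue_pattern` are hypothetical names, and the sleep stands in for real rollout work.

```python
import asyncio
import random


async def run_rollout(rollout_id):
    """Hypothetical stand-in for one rollout: sleeps briefly instead of doing real work."""
    await asyncio.sleep(random.uniform(0.001, 0.005))
    return rollout_id


async def batch_pattern(total, batch_size):
    """Submit a group of rollouts, wait for all of them, then start the next group."""
    finished = []
    for start in range(0, total, batch_size):
        stop = min(start + batch_size, total)
        group = [run_rollout(i) for i in range(start, stop)]
        # gather() returns only once every rollout in the group has completed.
        finished.extend(await asyncio.gather(*group))
    return finished


async def queue_pattern(total, max_in_flight):
    """Keep up to max_in_flight rollouts running; refill as soon as a slot frees up."""
    finished = []
    pending = set()
    next_id = 0
    while next_id < total or pending:
        # Top up to the in-flight limit before waiting.
        while next_id < total and len(pending) < max_in_flight:
            pending.add(asyncio.create_task(run_rollout(next_id)))
            next_id += 1
        # Wake up when at least one rollout completes, not when all of them do.
        done, pending = await asyncio.wait(pending, return_when=asyncio.FIRST_COMPLETED)
        finished.extend(task.result() for task in done)
    return finished


if __name__ == "__main__":
    print(len(asyncio.run(batch_pattern(total=20, batch_size=5))))
    print(len(asyncio.run(queue_pattern(total=20, max_in_flight=5))))
```

In the batch version a single slow rollout holds up its whole group, while the queue version keeps the remaining slots busy. That difference in how load arrives is presumably why the benchmark evaluates the two patterns separately.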
+
+### Maintenance and Bug fixes
+
+#### Core (Store, Interfaces, etc.)
+
+* Add Trainer port option for client-server strategies (#198)
+* Fix store port conflict handling (#227)
+* Unified PythonServerLauncher (#286 #292 #303)
+* Make health timeout configurable (#305)
+* Refactor logging (#306)
+* Support OTLP in LightningStore (#313)
+* Centralized metrics helper (#368)
+* Fix redundant cancel tracebacks on Ctrl+C (#370)
+
+#### Proxy, Adapters and Algorithms
+
+* Fix training metrics before and after processing in VERL (#145)
+* Forward streaming requests for Anthropic and OpenAI APIs (as non-streaming requests) (#299)
+* Check traces with reward for VERL (#317)
+* Patch LiteLLM root span (#341)
+* Handle ref_in_actor flag for LoRA compatibility (#386)
+* Support `with_llm_proxy` and `with_store` in algorithms (#398)
+* Support image URL export in TracerTraceToTriplets (#400)
+* Fix match_rewards assign_to elements in TraceTree (#403)
+* Support customizing trainer and daemon in VERL (#407)
+
+#### Runners, Tracers and Agents
+
+* Refactor tracer initialization (#321)
+* Fix OpenAI Agents 0.6 compatibility (#322)
+* `emit_operation`, `emit_annotation`, tags and links (#359)
+* Sunset HTTP tracer (#402)
+
+#### Examples
+
+* Fix typos in train-first-agent.md (#263)
+* Fix room_selector example which always runs the first task (#270)
+* Fix typo in SQL agent example (#285)
+* Add the README and script files for training SQL agent on NPU (#272)
+* Examples Catalog and Refine Contribution Guide (#331)
+* Upgrade LangChain to 1.x (#364)
+* Update RAG example to Agent-lightning v0.2.x (#349)
+
+#### Miscellaneous
+
+* DeepWiki Badge (#263)
+* Add AGENTS.md (#374)
+
+### New Contributors
+
+Warm welcome to our first-time contributors: @cptnm3, @TerryChan, @genji970, @zxgx, @xiaochulaoban, @lspinheiro, @Kwanghoon-Choi, @Vasuk12, @totoluo, @jinghuan-Chen 🎉
+
+**Full Changelog**: https://github.com/microsoft/agent-lightning/compare/v0.2.0...v0.3.0
+
+---
+
 ## Agent-lightning v0.2.2 (11/12/2025)

 Agent-lightning v0.2.2 is a stabilization release for v0.2.1. It introduces several bug fixes.
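
Editorial sketch (not part of the commit): the "up to a 15x increase" claim in the v0.3.0 Store Benchmark section can be checked directly against the table. The short script below copies the throughput figures from that table and divides each v0.3.0 column by the v0.2.2 baseline. The names `THROUGHPUT` and `speedups` are illustrative only.

```python
# Throughput in rollouts/sec, copied from the Store Benchmark table in the changelog diff.
# Tuple order: (v0.2.2, v0.3.0 in-memory, v0.3.0 Mongo)
THROUGHPUT = {
    "Minimal": (8.73, 9.06, 8.71),
    "Medium": (12.03, 23.26, 32.79),
    "Mid-high": (10.61, 24.42, 40.24),
    "Large": (3.36, 14.60, 50.05),
    "Long queue": (7.42, 30.86, 57.01),
    "Heavy trace": (5.93, 13.28, 29.41),
}


def speedups(table):
    """Return {scenario: (in-memory speedup, Mongo speedup)} relative to v0.2.2."""
    return {
        name: (in_memory / baseline, mongo / baseline)
        for name, (baseline, in_memory, mongo) in table.items()
    }


if __name__ == "__main__":
    for name, (mem_x, mongo_x) in speedups(THROUGHPUT).items():
        print(f"{name:<12} in-memory {mem_x:5.2f}x   Mongo {mongo_x:5.2f}x")
```

The largest ratio is the Large scenario on the Mongo backend (50.05 / 3.36, roughly 14.9x), which is where the "up to 15x" figure comes from. In the Minimal scenario all three columns are within a few percent of each other, so the gains show up only as the number of concurrent runners grows.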

docs/reference/restful.md

Lines changed: 2 additions & 4 deletions
@@ -1,10 +1,8 @@
 # RESTful API References

-!!! warning
+!!! note

-The following contents are still under construction.
-
-## Store RESTful API
+Shown in the following is the RESTful API for Lightning Store.

 <div id="swagger-ui"></div>
 <link rel="stylesheet" href="https://unpkg.com/swagger-ui-dist/swagger-ui.css" />

docs/tutorials/installation.md

Lines changed: 1 addition & 1 deletion
@@ -34,7 +34,7 @@ This installs or upgrades Agent-Lightning to the newest stable version.
 Agent-Lightning also publishes **nightly builds**, which contain the latest experimental features and improvements from the main branch. These are available via **Test PyPI**.

 ```bash
-pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ agentlightning
+pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ --pre agentlightning
 ```

 !!! warning
