|
1 | 1 | # Changelog |
2 | 2 |
|
| 3 | +## Agent-lightning v0.3.0 (12/24/2025) |
| 4 | + |
| 5 | +Agent-lightning v0.3.0 is a major release that introduces several new features and bug fixes. The release is a collaborative effort between Agent-lightning core teams and the community. Thanks to all the contributors who made this release possible. |
| 6 | + |
| 7 | +### Highlights |
| 8 | + |
| 9 | +* **Tinker integration**: Support Tinker as an alternative backend for Reinforcement Learning (#226 #245 #264 #269 #327). See [example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/tinker), [blog 1](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-1-1d8c9a397f0e) and [blog 2](https://medium.com/@yugez/tuning-any-ai-agent-with-tinker-agent-lightning-part-2-332c5437f0dc). |
| 10 | +* **Azure OpenAI integration**: Support Azure OpenAI as a backend for LLM inference and supervised fine-tuning (#256 #327). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/azure). |
| 11 | +* **MongoDB-based Lightning Store** is added as an alternative backend for Lightning Store (#323). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/parallelize/#parallelizing-lightningstore). |
| 12 | +* **Contrib package**: Add contrib package for community projects. Search-R1 is integrated as a contrib recipe. More coming. (#239 #396 #410 #412 #417). |
| 13 | +* **RESTful API**: Stabilize and document RESTful API for Lightning Store (#241 #275). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/reference/restful/). |
| 14 | +* **OTel Semantic Conventions** that are specifically designed for Agent-optimization areas (#340). [Documentation](http://microsoft.github.io/agent-lightning/v0.3.0/reference/semconv/). |
| 15 | +* *[Preview]* **Agent-lightning Dashboard** is now available (#288 #289 #291 #296 #371 #375). It's the official web application for inspecting and debugging Agent-lightning experiments. See details [here](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/debug/). |
| 16 | +* *[Preview]* **Multi-modality example** featuring VERL and a LangGraph agent on ChartQA dataset (#379). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/chartqa). |
| 17 | +* *[Preview]* Integrate **Claude Code** as a LitAgent and support training on SWE-Bench (#332 #346 #348). [Example code](https://github.com/microsoft/agent-lightning/tree/v0.3.0/examples/claude_code). |
| 18 | +* *[Preview]* **Weave tracer** as a substitute for AgentOps tracer (#277 #411 #420 #423). [Documentation](https://microsoft.github.io/agent-lightning/v0.3.0/tutorials/traces/#weave-tracer-experimental). |
| 19 | +* *[Preview]* **Trajectory Level Aggregation** for more efficient training with VERL. See [blog](https://agent-lightning.github.io/posts/trajectory_level_aggregation/) and [documentation](https://microsoft.github.io/agent-lightning/v0.3.0/algorithm-zoo/verl/). |
| 20 | + |
| 21 | +### Store Benchmark |
| 22 | + |
| 23 | +In this release, the Lightning Store core was redesigned for significantly greater efficiency and scalability (#315 #318 #328 #342 #344 #356 #380 #388 #418 #421). The benchmark results below demonstrate the impact: with large numbers of concurrent runners, v0.3.0 delivers up to a 15x increase in throughput compared to v0.2.2. |
| 24 | + |
| 25 | +| Throughput (\#rollout/sec) | v0.2.2 | v0.3.0 (in-memory) | v0.3.0 (Mongo) | |
| 26 | +| :---- | :---- | :---- | :---- | |
| 27 | +| Minimal (batch, #runner=32, #turns=6) | 8.73 | 9.06 | 8.71 | |
| 28 | +| Medium (batch, #runners=100, #turns=10) | 12.03 | 23.26 | 32.79 | |
| 29 | +| Mid-high (batch, #runners=300, #turns=6) | 10.61 | 24.42 | 40.24 | |
| 30 | +| Large (batch, #runners=1000, #turns=3) | 3.36 | 14.60 | 50.05 | |
| 31 | +| Long queue (queue, #runners=256, #turns=4) | 7.42 | 30.86 | 57.01 | |
| 32 | +| Heavy trace (queue, #runners=512, #turns=20) | 5.93 | 13.28 | 29.41 | |
| 33 | + |
| 34 | +*Notes:* |
| 35 | + |
| 36 | +1. Benchmarks were run on a single Standard_D32as_v4 Azure VM (Large and heavy trace tests used Standard_D64ads_v5), executed via GitHub Actions. |
| 37 | +2. Two algorithm patterns are evaluated: the batch pattern submits a group of rollouts and waits for all to finish before starting the next group, while the queue pattern maintains a set number of in-flight rollouts, submitting new ones as soon as capacity frees up. Configuration details are available [here](https://github.com/microsoft/agent-lightning/blob/v0.3.0/.github/workflows/benchmark.yml). |
| 38 | +3. The number of turns is directly proportional to the number of spans each rollout generates. |
| 39 | + |
| 40 | +### Maintenance and Bug fixes |
| 41 | + |
| 42 | +#### Core (Store, Interfaces, etc.) |
| 43 | + |
| 44 | +* Add Trainer port option for client-server strategies (#198) |
| 45 | +* Fix store port conflict handling (#227) |
| 46 | +* Unified PythonServerLauncher (#286 #292 #303) |
| 47 | +* Make health timeout configurable (#305) |
| 48 | +* Refactor logging (#306) |
| 49 | +* Support OTLP in LightningStore (#313) |
| 50 | +* Centralized metrics helper (#368) |
| 51 | +* Fix redundant cancel tracebacks on Ctrl+C (#370) |
| 52 | + |
| 53 | +#### Proxy, Adapters and Algorithms |
| 54 | + |
| 55 | +* Fix training metrics before and after processing in VERL (#145) |
| 56 | +* Forward streaming requests for Anthropic and OpenAI APIs (as non-streaming requests) (#299) |
| 57 | +* Check traces with reward for VERL (#317) |
| 58 | +* Patch LiteLLM root span (#341) |
| 59 | +* Handle ref_in_actor flag for LoRA compatibility (#386) |
| 60 | +* Support `with_llm_proxy` and `with_store` in algorithms (#398) |
| 61 | +* Support image URL export in TracerTraceToTriplets (#400) |
| 62 | +* Fix match_rewards assign_to elements in TraceTree (#403) |
| 63 | +* Support customizing trainer and daemon in VERL (#407) |
| 64 | + |
| 65 | +#### Runners, Tracers and Agents |
| 66 | + |
| 67 | +* Refactor tracer initialization (#321) |
| 68 | +* Fix OpenAI Agents 0.6 compatibility (#322) |
| 69 | +* `emit_operation`, `emit_annotation`, tags and links (#359) |
| 70 | +* Sunset HTTP tracer (#402) |
| 71 | + |
| 72 | +#### Examples |
| 73 | + |
| 74 | +* Fix typos in train-first-agent.md (#263) |
| 75 | +* Fix room_selector example which always runs the first task (#270) |
| 76 | +* Fix typo in SQL agent example (#285) |
| 77 | +* Add the README and script files for training SQL agent on NPU (#272) |
| 78 | +* Examples Catalog and Refine Contribution Guide (#331) |
| 79 | +* Upgrade LangChain to 1.x (#364) |
| 80 | +* Update RAG example to Agent-lightning v0.2.x (#349) |
| 81 | + |
| 82 | +#### Miscellaneous |
| 83 | + |
| 84 | +* DeepWiki Badge (#263) |
| 85 | +* Add AGENTS.md (#374) |
| 86 | + |
| 87 | +### New Contributors |
| 88 | + |
| 89 | +Warm welcome to our first-time contributors: @cptnm3, @TerryChan, @genji970, @zxgx, @xiaochulaoban, @lspinheiro, @Kwanghoon-Choi, @Vasuk12, @totoluo, @jinghuan-Chen 🎉 |
| 90 | + |
| 91 | +**Full Changelog**: https://github.com/microsoft/agent-lightning/compare/v0.2.0...v0.3.0 |
| 92 | + |
| 93 | +--- |
| 94 | + |
3 | 95 | ## Agent-lightning v0.2.2 (11/12/2025) |
4 | 96 |
|
5 | 97 | Agent-lightning v0.2.2 is a stabilization release for v0.2.1. It introduces several bug fixes. |
|
0 commit comments