33</h1 >
44
55<p align =" center " >
6- | <a href =" https://arxiv.org/pdf/2505.24298 " ><b >Paper</b ></a > | <a href =" https://inclusionai.github.io/AReaL/ " ><b >Documentation</b ></a > | <a href =" https://deepwiki.com/inclusionAI/AReaL " ><b >Ask DeepWiki</b ></a > | <a href =" https://huggingface.co/collections/inclusionAI/ " ><b >🤗 Models & Data</b ></a > |
6+ | <a href =" https://arxiv.org/pdf/2505.24298 " ><b >Paper</b ></a > | <a href =" https://inclusionai.github.io/AReaL/ " ><b >Documentation</b ></a > | <a href =" https://inclusionai.github.io/AReaL/zh/ " >< b >中文文档</ b ></ a > | < a href = " https:// deepwiki.com/inclusionAI/AReaL" ><b >Ask DeepWiki</b ></a > | <a href =" https://huggingface.co/collections/inclusionAI/ " ><b >🤗 Models & Data</b ></a > |
77<a href =" ./assets/wechat_qrcode.png " target =" _blank " ><img src =" ./assets/wechat_icon.png " width =" 20 " style =" vertical-align : middle ;" > <b >WeChat (微信) Group</b ></a > |
88</p >
99
@@ -22,7 +22,7 @@ as much as you'd enjoy real milk tea. Cheers!
2222** AReaL Highlights**
2323
2424- ⚡ ** Flexibility** : Seamless customization for
25- [ agentic RL] ( https://inclusionai.github.io/AReaL/tutorial/agentic_rl.html ) and
25+ [ agentic RL] ( https://inclusionai.github.io/AReaL/en/ tutorial/agentic_rl.html ) and
2626 [ online RL training] ( ./examples/openclaw/ ) by simply replacing the ` base_url ` .
2727- 📈 ** Scalability** : ** Stable** fully asynchronous RL training with ** industry-leading
2828 speed** .
@@ -57,7 +57,7 @@ and the [announcement on X](https://x.com/guohao_li/status/2009678513574408636).
5757@HwVanICI , we are excited to officially announce stable support for AReaL training on
5858** Ascend NPU devices** ! The code is actively maintained and continuously updated in the
5959[ ` ascend ` branch] ( https://github.com/inclusionAI/AReaL/tree/ascend ) . Check out
60- [ our documentation] ( https://inclusionai.github.io/AReaL/tutorial/installation_npu.html )
60+ [ our documentation] ( https://inclusionai.github.io/AReaL/en/ tutorial/installation_npu.html )
6161to get started, and feel free to report any issues!
6262
6363** \[ 2025/08/30\] ** Introducing ASearcher, a state-of-the-art search agent built with
@@ -70,8 +70,8 @@ features an **algorithm-first** API design that prioritizes ease of use and algo
7070development, while natively supporting ** fully asynchronous agentic RL** . With 80% fewer
7171lines of code, AReaL-lite maintains 90% of AReaL's performance and core functionality.
7272Check out [ our AReaL-lite design documentation] ( /areal/README.md ) and
73- [ the quickstart guide] ( https://inclusionai.github.io/AReaL/tutorial/quickstart.html ) to
74- begin your journey with ** AReaL-lite** !
73+ [ the quickstart guide] ( https://inclusionai.github.io/AReaL/en/ tutorial/quickstart.html )
74+ to begin your journey with ** AReaL-lite** !
7575
7676** \[ 2025/06/03\] (v0.3, boba²)** We release ** boba²** (double-boba) for fully
7777asynchronous RL training, which achieves ** 2.77× speedup while delivering comparable or
@@ -119,7 +119,7 @@ python3 examples/math/gsm8k_rl.py --config examples/math/gsm8k_grpo.yaml \
119119```
120120
121121For comprehensive setup instructions, see
122- [ our quickstart guide] ( https://inclusionai.github.io/AReaL/tutorial/quickstart.html ) .
122+ [ our quickstart guide] ( https://inclusionai.github.io/AReaL/en/ tutorial/quickstart.html ) .
123123
124124## 📚 Examples
125125
@@ -212,45 +212,45 @@ Check the [AI Coding Assistant Guide](docs/reference/ai_assisted_dev.md) and
212212
213213### Tutorial
214214
215- - [ Installation] ( https://inclusionai.github.io/AReaL/ tutorial/installation.html )
216- - [ Quickstart] ( https://inclusionai.github.io/AReaL/ tutorial/quickstart.html )
217- - [ Agentic RL] ( https://inclusionai.github.io/AReaL/ tutorial/agentic_rl.html )
218- - [ Evaluation] ( https://inclusionai.github.io/AReaL/ tutorial/eval.html )
219- - [ Large MoE with Megatron] ( https://inclusionai.github.io/AReaL/ tutorial/megatron.html )
220- - [ Large MoE with PyTorch Archon] ( https://inclusionai.github.io/AReaL/ tutorial/archon.html )
215+ - [ Installation] ( docs/en/ tutorial/installation.md )
216+ - [ Quickstart] ( docs/en/ tutorial/quickstart.md )
217+ - [ Agentic RL] ( docs/en/ tutorial/agentic_rl.md )
218+ - [ Evaluation] ( docs/en/ tutorial/eval.md )
219+ - [ Large MoE with Megatron] ( docs/en/ tutorial/megatron.md )
220+ - [ Large MoE with PyTorch Archon] ( docs/en/ tutorial/archon.md )
221221
222222### Code Walkthrough
223223
224- - [ Running GRPO on GSM8K dataset] ( https://inclusionai.github.io/AReaL/ tutorial/gsm8k_grpo.html )
224+ - [ Running GRPO on GSM8K dataset] ( docs/en/ tutorial/gsm8k_grpo.md )
225225
226226### Best Practices
227227
228- - [ Improving Algorithm Performance] ( https://inclusionai.github.io/AReaL/ best_practices/algo_perf.html )
229- - [ Agent Workflow Best Practices] ( https://inclusionai.github.io/AReaL/ best_practices/workflow.html )
230- - [ Debugging] ( https://inclusionai.github.io/AReaL/ best_practices/debugging.html )
231- - [ Handling OOM Issues] ( https://inclusionai.github.io/AReaL/ best_practices/handling_oom.html )
232- - [ Performance Profiling] ( https://inclusionai.github.io/AReaL/ best_practices/perf_profiling.html )
228+ - [ Improving Algorithm Performance] ( docs/en/ best_practices/algo_perf.md )
229+ - [ Agent Workflow Best Practices] ( docs/en/ best_practices/workflow.md )
230+ - [ Debugging] ( docs/en/ best_practices/debugging.md )
231+ - [ Handling OOM Issues] ( docs/en/ best_practices/handling_oom.md )
232+ - [ Performance Profiling] ( docs/en/ best_practices/perf_profiling.md )
233233
234234### Customization
235235
236- - [ Customize Dataset] ( https://inclusionai.github.io/AReaL/ customization/dataset.html )
237- - [ Customize Agentic/RVLR Rollout Workflows] ( https://inclusionai.github.io/AReaL/ customization/agent.html )
236+ - [ Customize Dataset] ( docs/en/ customization/dataset.md )
237+ - [ Customize Agentic/RVLR Rollout Workflows] ( docs/en/ customization/agent.md )
238238
239239### Algorithms
240240
241- - [ Asynchronous RL Explained] ( https://inclusionai.github.io/AReaL/ algorithms/async.html )
242- - [ PPO, GRPO, and Related Algorithms] ( https://inclusionai.github.io/AReaL/ algorithms/grpo_series.html )
243- - [ M2PO] ( https://inclusionai.github.io/AReaL/ algorithms/m2po.html )
241+ - [ Asynchronous RL Explained] ( docs/en/ algorithms/async.md )
242+ - [ PPO, GRPO, and Related Algorithms] ( docs/en/ algorithms/grpo_series.md )
243+ - [ M2PO] ( docs/en/ algorithms/m2po.md )
244244
245245### Reference
246246
247- - [ CLI Configurations] ( https://inclusionai.github.io/AReaL/ cli_reference.html )
248- - [ Checkpointing] ( https://inclusionai.github.io/AReaL/ reference/checkpointing.html )
249- - [ Metrics Tracking] ( https://inclusionai.github.io/AReaL/ reference/metrics_tracking.html )
250- - [ Allocation Mode] ( https://inclusionai.github.io/AReaL/ reference/alloc_mode.html )
251- - [ Rollout Workflow] ( https://inclusionai.github.io/AReaL/ reference/rollout_workflow.html )
252- - [ Agent Workflow] ( https://inclusionai.github.io/AReaL/ reference/agent_workflow.html )
253- - [ AI-Assisted Development] ( https://inclusionai.github.io/AReaL/ reference/ai_assisted_dev.html )
247+ - [ CLI Configurations] ( docs/en/ cli_reference.md )
248+ - [ Checkpointing] ( docs/en/ reference/checkpointing.md )
249+ - [ Metrics Tracking] ( docs/en/ reference/metrics_tracking.md )
250+ - [ Allocation Mode] ( docs/en/ reference/alloc_mode.md )
251+ - [ Rollout Workflow] ( docs/en/ reference/rollout_workflow.md )
252+ - [ Agent Workflow] ( docs/en/ reference/agent_workflow.md )
253+ - [ AI-Assisted Development] ( docs/en/ reference/ai_assisted_dev.md )
254254
255255## 🤝 Contributing
256256
0 commit comments