You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,7 +29,7 @@ We aim to build a easy-to-learn Agent tuner that unlock more possibilities for a
29
29
-**Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
30
30
-**Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
31
31
-**Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support [trinity](https://github.com/modelscope/Trinity-RFT/) as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
32
-
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](docs/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 20x when the workflow involves multi-turn (or multi-agent) conversations.
32
+
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](docs/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
33
33
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, comming soon).
34
34
35
35
For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:
@@ -110,7 +110,7 @@ The internal system orchestrates several specialized modules to handle the compl
110
110
***Task Rollout**: Bridges LLM engines and manages the Gym environment lifecycle.
111
111
***Task Runner**: Executes the Agent workflow and calculates rewards.
112
112
***Model Tuner**: Forwards inference requests from the workflow to the LLM engine.
113
-
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **3x to 10x**.
113
+
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **1.5x to 10x**.
Copy file name to clipboardExpand all lines: docs/en/installation.md
+46-63Lines changed: 46 additions & 63 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,19 +2,17 @@
2
2
3
3
This document provides a step-by-step guide to installing AgentJet.
4
4
5
-
!!! tip "Latest Version Recommended"
5
+
!!! tip "Latest Version Recommended:"
6
+
6
7
AgentJet is under active development and iteration. We recommend installing from source to get the latest features and bug fixes.
7
8
8
-
---
9
9
10
10
## Prerequisites
11
11
12
12
| Requirement | Version |
13
13
|-------------|---------|
14
14
|**Python**| 3.10 |
15
-
|**CUDA**| 12.8 or higher |
16
15
17
-
---
18
16
19
17
## Install from Source
20
18
@@ -34,98 +32,83 @@ AgentJet supports multiple backbones. Currently we have `verl` and `trinity` (re
34
32
!!! info "Package Manager"
35
33
We recommend using `uv` to manage your Python environment as it is incredibly fast. See also [`uv` installation document](https://docs.astral.sh/uv/getting-started/installation/).
36
34
37
-
If you prefer `conda`, you can also install via conda and pip (simply change `uv pip` to `pip`).
38
-
39
-
=== "Verl (Recommended)"
35
+
And of course, if you prefer `conda`, you can also install via conda and pip (simply change `uv pip` to `pip`).
`flash-attn` must be installed after other dependencies. To build faster, export `MAX_JOBS=${N_CPU}`, or ensure a healthy connection to GitHub to install pre-compiled wheels.
52
52
53
-
54
-
=== "Trinity"
55
-
56
-
Install with `trinity` training backbone for fully asynchronous RFT:
Before proceeding, ensure you have **nvidia docker** installed on your system. CUDA is needed inside our docker container, which requires toolkits from Nvidia for GPU support.
66
+
!!! warning "flash-attn Installation"
67
+
`flash-attn` must be installed after other dependencies. To build faster, export `MAX_JOBS=${N_CPU}`, or ensure a healthy connection to GitHub to install pre-compiled wheels.
73
68
74
-
### Setup Nvidia Docker Runtime
75
69
76
-
Please install nvidia docker runtime on the host Ubuntu system. For details, refer to:
70
+
=== "VERL (aliyun)"
77
71
78
-
-[Nvidia Official Document](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#installing-on-ubuntu-and-debian)
This command mounts your current working directory (the root directory of agentjet) to `/workspace` and your data directory to `/data` inside the container.
`flash-attn` must be installed after other dependencies. To build faster, export `MAX_JOBS=${N_CPU}`, or ensure a healthy connection to GitHub to install pre-compiled wheels.
96
86
97
-
## Verify Installation
98
87
99
-
After installation, verify that everything is working correctly:
88
+
=== "Trinity"
100
89
101
-
```python
102
-
import ajet
103
-
print(ajet.__version__)
104
-
```
90
+
```bash
91
+
# Install with `trinity` training backbone for fully asynchronous RFT:
Copy file name to clipboardExpand all lines: docs/en/intro.md
+15-25Lines changed: 15 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,37 +1,28 @@
1
1
# Introduction
2
2
3
-
**AgentJet (AgentJet)** is a cutting-edge, user-friendly training framework designed to optimize AgentScope agents and workflows, fine-tuning language model weights behind the scenes.
3
+
**AgentJet (AgentJet)** is a cutting-edge, user-friendly agent tunning framework designed to optimize LLM models and agent workflows.
4
4
5
-
Simply provide your AgentScope workflow, training data, and reward function, and we will be ready to enhance your agents to their optimal performance!
5
+
Simply provide your workflow (built from AgentScope, OpenAI SDK, Langchain, raw HTTP requests, or hybrid of all of them), training data, and reward function, and we will be ready to enhance your agents to their optimal performance!
6
6
7
-
---
8
7
9
8
## Features
10
9
11
-
We aim to build an easy-to-learn AgentJet that unlocks more possibilities for agent developers:
12
-
13
-
<divclass="card-grid">
14
-
<ahref="./configuration/"class="feature-card"><divclass="card-header"><imgsrc="https://api.iconify.design/lucide:rocket.svg"class="card-icon card-icon-agent"alt=""><h3>Easy and Friendly</h3></div><pclass="card-desc">AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.</p></a>
15
-
<ahref="#example-library"class="feature-card"><divclass="card-header"><imgsrc="https://api.iconify.design/lucide:book-open.svg"class="card-icon card-icon-general"alt=""><h3>Rich Tutorial Library</h3></div><pclass="card-desc">Rich library of examples as tutorials: Math Agent, Werewolves Game, AppWorld and more.</p></a>
16
-
<ahref="./installation/"class="feature-card"><divclass="card-header"><imgsrc="https://api.iconify.design/lucide:zap.svg"class="card-icon card-icon-tool"alt=""><h3>Efficient and Scalable</h3></div><pclass="card-desc">Uses <ahref="https://github.com/modelscope/Trinity-RFT/">Trinity</a> as the default backbone with fully asynchronous RFT. Support for verl backbone as fallback.</p></a>
17
-
</div>
10
+
AgentJet aims to build a state-of-the-art agent tuning platform for both developers and researchers
18
11
19
-
!!! tip "Multi-Agent Support"
20
-
AgentJet supports [multi-agent workflows](./workflow.md) and adopts a context merging technique, accelerating training by **1.5x to 20x** when the workflow involves multi-turn (or multi-agent) conversations.
12
+
-**Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
13
+
-**Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
14
+
-**Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support [trinity](https://github.com/modelscope/Trinity-RFT/) as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
15
+
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](docs/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
16
+
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, comming soon).
21
17
22
-
!!! info "Reliability & Reproducibility"
23
-
Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/).
18
+
For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:
19
+
<!-- For advanced researchers, AgentJet provides high-resolution logging and debugging solutions that are, to our knowledge, unprecedented in other prior projects. -->
24
20
25
-
### For Advanced Researchers
21
+
-**High-Resolution Logging**: AgentJet allows users to save and inspect token-level rollout details, recording token IDs, token loss masks, and even token logprobs to facilitate workflow development and agent diagnostics.
22
+
-**Fast Debugging**: AgentJet also provides the `--backbone=debug` option for the best debugging experience, shortening your wait period from minutes to seconds after code changes and enabling breakpoint debugging in IDEs.
26
23
27
-
AgentJet also provides high-resolution logging and debugging solutions:
28
24
29
-
| Feature | Description |
30
-
|---------|-------------|
31
-
|**High-Resolution Logging**| Save and inspect token-level rollout details, recording token IDs, token loss masks, and even token logprobs |
32
-
|**Fast Debugging**| Use `--backbone=debug` option, shortening wait time from minutes to seconds after code changes |
33
25
34
-
---
35
26
36
27
## Quick Start
37
28
@@ -52,10 +43,10 @@ We recommend using `uv` for dependency management.
Copy file name to clipboardExpand all lines: docs/en/introduction.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,7 +13,7 @@ We aim to build a easy-to-learn AgentJet that unlock more possibilities for agen
13
13
-**Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
14
14
-**Rich Tutorial Library**. AgentJet provides a rich library of [examples](#example-library) as tutorials.
15
15
-**Efficient and Scalable**. AgentJet uses [trinity](https://github.com/modelscope/Trinity-RFT/) as the default backbone (`--backbone=trinity`), accelerating your tuning process via fully asynchronous RFT. Nevertheless, if actor colocating is your preference, you can still fall back to the [verl](./installation.md) backbone.
16
-
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](./workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 20x when the workflow involves multi-turn (or multi-agent) conversations.
16
+
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](./workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
17
17
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, comming soon).
18
18
19
19
For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:
@@ -94,7 +94,7 @@ The internal system orchestrates several specialized modules to handle the compl
94
94
***Task Rollout**: Bridges LLM engines and manages the Gym environment lifecycle.
95
95
***Task Runner**: Executes the AgentScope workflow and calculates rewards.
96
96
***Model Tuner**: Forwards inference requests from the workflow to the LLM engine.
97
-
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **3x to 10x**.
97
+
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **1.5x to 10x**.
0 commit comments