You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,7 +37,7 @@ Let's begin with the simplest example: a math agent with a tool call. This is a
37
37
38
38
Let's begin with the simplest AgentJet Swarm example: also a math agent. In this case, you can use any GPU-less laptop to train the model remotely.
39
39
40
-
1. Start swarm server and begin swarm overwatch: `ajet-swarm start`and`ajet-swarm overwatch`.
40
+
1. Start swarm server and begin swarm overwatch: `ajet-swarm start`and`ajet-swarm overwatch`. (Alternative: if you are a fan of docker, use our [prebuilt docker image here](docs/en/ajet-swarm-docker.md) without setting up dependencies)
41
41
2. From your laptop (or swarm server localhost), run [this simple script](https://github.com/modelscope/AgentJet/blob/main/tutorial/example_math_swarm/math.py) to begin training:
@@ -49,12 +49,12 @@ Let's begin with the simplest AgentJet Swarm example: also a math agent. In this
49
49
50
50
## ✈️ Features
51
51
52
-
We aim to build a easy-to-learn Agent tuner that unlock more possibilities for agent developers:
52
+
We aim to build an easy-to-learn Agent tuner that unlocks more possibilities for agent developers:
53
53
54
54
-**Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
55
55
-**Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
56
-
-**Swarm Training**. [This unique feature](https://modelscope.github.io/AgentJet/en/swarm_intro_blog_english/) of AgentJet opens many possibilities: deploying distributed &self-healing rollout workers, **non-shared-parameter multi-agent** training, **multi-runtime & multi-task cocktail** training. And just like Tinker, you can use AgentJet Swarm to train **models even on **GPU-less laptop(s)**.
57
-
-**Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support trinity as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
56
+
-**Swarm Training**. [This unique feature](https://modelscope.github.io/AgentJet/en/swarm_intro_blog_english/) of AgentJet opens many possibilities: deploying distributed &self-healing rollout workers, **non-shared-parameter multi-agent** training, **multi-runtime & multi-task cocktail** training. And just like Tinker, you can use AgentJet Swarm to train models even on **GPU-less laptop(s)**.
57
+
-**Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support trinity asan alternative backbone, accelerating your tuning process via fully asynchronous RFT.
58
58
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](https://modelscope.github.io/AgentJet/en/workflow/) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
59
59
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agentjet.top/) (under construction, still gathering data, coming soon).
60
60
@@ -129,13 +129,13 @@ The internal system orchestrates several specialized modules to handle the compl
129
129
***Task Runner**: Executes the Agent workflow and calculates rewards.
130
130
***Model Tuner**: Forwards inference requests from the workflow to the LLM engine.
131
131
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **1.5x to 10x**.
132
-
***Swarm Server**: A data interchange center that accept OpenAI-like requests and engine instructions, activated only in AgentJet Swarm mode.
132
+
***Swarm Server**: A data interchange center that accepts OpenAI-like requests and engine instructions, activated only in AgentJet Swarm mode.
133
133
134
134
#### 3. Swarm Architecture
135
135
136
-
When enabled swarm training mode, an additional component will be activated:
136
+
When swarm training modeis enabled, an additional component will be activated:
137
137
138
-
***Swarm Data Interchange Server**: Maintains HTTP service, listen to swarm instructions andopenai compatible requests. Establishing high-speed zmq communication channel to coordinate other modules.
138
+
***Swarm Data Interchange Server**: Maintains HTTP service, listens to swarm instructions andOpenAI compatible requests. Establishes a high-speed zmq communication channel to coordinate other modules.
@@ -48,7 +48,7 @@ And when completed, you will see a interface like this, which means the deployme
48
48
|`-v /path/to/host/Qwen/Qwen2.5-7B-Instruct:/Qwen/Qwen2.5-7B-Instruct`|**Model mount** — mounts your local model weights directory into the container. The path inside the container must match the `model` field you configure in your training job. |
49
49
|`-v ./swarmlog:/workspace/log`|**Log mount** — mounts a local `./swarmlog` directory to persist server logs outside the container. The VERL training log is written here. |
50
50
|`-p 10086:10086`|**Port mapping** — exposes port `10086` so that Swarm Clients on other machines can reach the server via `http://<server-ip>:10086`. |
51
-
|`ajet:latest`| The AgentJet Docker image. |
51
+
|`ghcr.io/modelscope/agentjet:main`| The AgentJet Docker image. |
52
52
|`bash -c "..."`| Runs two processes concurrently inside the container (see below). |
Copy file name to clipboardExpand all lines: docs/index.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@
17
17
<h3>Get Started with Ease</h3>
18
18
</div>
19
19
<p class="card-desc">
20
-
AgentJet simplifies the process of tuning the models that power your agent workflows. It supports nearly all major agent frameworks (e.g. <b>agentscope</b>, <b>langchain</b>), as well as <b>framwork-less</b> agents built from HTTP requests.
20
+
AgentJet simplifies the process of tuning the models that power your agent workflows. It supports nearly all major agent frameworks (e.g. <b>agentscope</b>, <b>langchain</b>), as well as <b>framework-less</b> agents built from HTTP requests.
21
21
</p>
22
22
</a>
23
23
<a href="#example-library" class="feature-card">
@@ -47,7 +47,7 @@
47
47
</div>
48
48
<p class="card-desc">
49
49
Built to support advanced <b>multi-agent</b> and <b>multi-turn</b> LLM workflows,
50
-
AgentJet intergrates timeline-merging algorithms that
50
+
AgentJet integrates timeline-merging algorithms that
51
51
automatically analyze and consolidate each agent's LLM timeline,
52
52
<b>accelerating</b> training speed 1.5x ~ 10x.
53
53
</p>
@@ -58,7 +58,7 @@
58
58
<h3>High Resolution Logging</h3>
59
59
</div>
60
60
<p class="card-desc">
61
-
Log <b>token-level</b> rollout details, capturing token IDs, token <b>loss masks</b>, and token <b>log probabilities</b> with <b>web UI display</b>. This Support workflow development, agent diagnostics, and facilitate research on advanced LLM algorithm studies.
61
+
Log <b>token-level</b> rollout details, capturing token IDs, token <b>loss masks</b>, and token <b>log probabilities</b> with <b>web UI display</b>. This supports workflow development, agent diagnostics, and facilitates research on advanced LLM algorithm studies.
62
62
</p>
63
63
</a>
64
64
<a href="en/installation/" class="feature-card">
@@ -67,7 +67,7 @@
67
67
<h3>Any Training Engine</h3>
68
68
</div>
69
69
<p class="card-desc">
70
-
Support <b>multiple training engines</b> as backbone (<b>VERL</b> and <b>Trinity-RFT</b>). Swarm backbone support will be released soon.
70
+
Supports <b>multiple training engines</b> as backbone (<b>VERL</b> and <b>Trinity-RFT</b>). Swarm backbone support will be released soon.
71
71
Choose from <b>vLLM</b> and <b>SGLang</b> as you wish. Say goodbye to training engine gaps.
72
72
</p>
73
73
</a>
@@ -149,7 +149,7 @@ The internal system orchestrates several specialized modules to handle the compl
149
149
|--------|-------------|
150
150
| **Launcher** | Manages background service processes (Ray, vLLM) and routes the backbone |
151
151
| **Task Rollout** | Bridges LLM engines and manages the Gym environment lifecycle |
152
-
| **Task Runner** | Executes the AgentScope workflow and calculates rewards |
152
+
| **Task Runner** | Executes the agent workflow and calculates rewards |
153
153
| **Model Tuner** | Forwards inference requests from the workflow to the LLM engine |
0 commit comments