docs/en/intro.md
+1 -1 lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@ AgentJet aims to build a state-of-the-art agent tuning platform for both develop
12
12
- **Easy and Friendly**. AgentJet helps you tune the models behind your agent workflows, optimizing your agents for top performance with minimal effort.
13
13
- **Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
14
14
- **Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). It also supports [trinity](https://github.com/modelscope/Trinity-RFT/) as an alternative backbone, which accelerates tuning via fully asynchronous RFT.
15
-
- **Flexible and Fast**. AgentJet supports [multi-agent workflows](docs/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
15
+
- **Flexible and Fast**. AgentJet supports [multi-agent workflows](workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
16
16
- **Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, coming soon).
17
17
18
18
For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:
Let's begin with the simplest example: a math agent with a tool call.
14
16
15
-
<div class="workflow-single">
16
-
<div class="workflow-header">Getting Started Flow</div>
17
-
18
-
<div class="workflow">
19
-
<ol class="workflow-steps">
20
-
<li><strong>Set up Environment</strong>
21
-
22
-
Check out the [installation guide](./installation.md) to set up the training environment.</li>
23
-
<li><strong>Define Your Workflow</strong>
24
-
25
-
Write an Agent class (e.g., `MathToolWorkflow`) that inherits from the base Workflow class.</li>
26
-
<li><strong>Configure and Run</strong>
27
-
28
-
Use the `AgentJetJob` API to configure and start training.</li>
29
-
</ol>
30
-
</div>
31
-
</div>
32
-
33
-
### Code Example
34
-
35
17
```python title="train_math_agent.py"
36
18
from ajet import AgentJetJob
19
+
20
+
# See `https://doc.agentjet.top/AgentJet/en/tune_your_first_agent/` for how to write your own workflow
37
21
from tutorial.example_math_agent.math_agent_simplify import MathToolWorkflow
38
22
39
23
model_path = "YOUR_MODEL_PATH"
@@ -57,9 +41,17 @@ tuned_model = job.tune()
57
41
ajet --conf ./saved_experiments/math.yaml
58
42
```
59
43
60
-
---
61
44
62
-
## Explore Examples
45
+
## 3. Compare with Community Training Curves
46
+
47
+
<div class="card-grid">
48
+
<a href="https://benchmark.agent-matrix.com/examples" class="feature-card"><div class="card-header"><img src="https://api.iconify.design/mdi:calculator-variant.svg" class="card-icon card-icon-math" alt=""><h3>AgentJet Benchmark Tracking System (Development in Progress)</h3></div><p class="card-desc">Compare training curves with the community. Investigate the influence of versions, backbones, hyperparameters, etc.</p></a>
49
+
</div>
50
+
51
+
52
+
53
+
54
+
## 4. Explore Example Gallery
63
55
64
56
Explore our rich library of examples to kickstart your journey:
65
57
@@ -72,11 +64,8 @@ Explore our rich library of examples to kickstart your journey:
72
64
<a href="./example_frozenlake/" class="feature-card"><div class="card-header"><img src="https://api.iconify.design/mdi:snowflake.svg" class="card-icon card-icon-data" alt=""><h3>Frozen Lake</h3></div><p class="card-desc">Solving a frozen lake walking puzzle.</p></a>
73
65
</div>
74
66
75
-
---
76
-
77
-
---
78
67
79
-
## Next Steps
68
+
## 5. Next Steps
80
69
81
70
<div class="card-grid">
82
71
<a href="../tune_your_first_agent/" class="feature-card"><div class="card-header"><img src="https://api.iconify.design/mdi:rocket-launch.svg" class="card-icon card-icon-agent" alt=""><h3>Tune Your First Agent</h3></div><p class="card-desc">Complete step-by-step guide to building your own agent from scratch.</p></a>
docs/en/tune_your_first_agent.md
+28 -2 lines changed: 28 additions & 2 deletions
@@ -51,7 +51,7 @@ tutorial/example_math_agent
51
51
└── math_agent.yaml
52
52
```
53
53
54
-
Next, define your workflow (or convert an existing one). Here we use AgentScope to implement this agent. You can toggle between the code before and after conversion to see the difference. If you prefer LangChain or the OpenAI SDK, [please refer to this article](../agent_framework_support).
54
+
Next, define your workflow (or convert an existing one). Here we use AgentScope to implement this agent. You can toggle between the code before and after conversion to see the difference. If you prefer LangChain or the OpenAI SDK, [please refer to this article](agent_framework_support.md).
| **Project** | `project_name` | Name of the training project | `example_math_agent` |
207
+
| **Task Reader** | `type` | Type of data source to read tasks from | `huggingface_dat_repo` (options: `env_service`, `dataset_file`, `huggingface_dat_repo`) |
208
+
| | `dataset_path` | Path or identifier of the dataset | `openai/gsm8k` |
209
+
| | `training_split` | Dataset split used for training | `train` |
210
+
| | `validation_split` | Dataset split used for validation/testing | `test` |
211
+
| **Model** | `path` | Path or identifier of the model to be trained | `Qwen/Qwen2.5-7B` |
212
+
| **Rollout** | `user_workflow` | Python module path to the workflow class | `tutorial.example_math_agent.math_agent->ExampleMathLearn` |
213
+
| | `num_repeat` | Number of rollout repeats per task (GRPO `n` parameter) | `6` |
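
The `user_workflow` value in the rows above pairs a Python module path with a class name, separated by `->`. As a rough illustration of how such a string could be split and checked, here is a minimal sketch. `split_workflow_spec` and `WorkflowSpec` are hypothetical names written for this note, not part of AgentJet, and the framework's own loader may behave differently.

```python
from typing import NamedTuple


class WorkflowSpec(NamedTuple):
    """Two halves of a 'module.path->ClassName' string (hypothetical type)."""
    module_path: str
    class_name: str


def split_workflow_spec(spec: str) -> WorkflowSpec:
    """Split 'pkg.module->ClassName' into module path and class name.

    Hypothetical helper for illustration only; not AgentJet's loader.
    """
    module_path, sep, class_name = spec.partition("->")
    module_path, class_name = module_path.strip(), class_name.strip()
    if not sep or not module_path or not class_name:
        raise ValueError(f"expected 'module.path->ClassName', got {spec!r}")
    return WorkflowSpec(module_path, class_name)


# The example value from the configuration table.
spec = split_workflow_spec(
    "tutorial.example_math_agent.math_agent->ExampleMathLearn"
)
print(spec.module_path)
print(spec.class_name)
```

Validating the string early like this would surface a malformed `user_workflow` value before any training resources are allocated.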
docs/index.md
+1 -1 lines changed: 1 addition & 1 deletion
@@ -82,7 +82,7 @@
82
82
</div>
83
83
84
84
85
-
We recommend using `uv` for dependency management. [Click here](./installation/) for details and for other training backbone options (e.g. Trinity-RFT).
85
+
We recommend using `uv` for dependency management. [Click here](en/installation.md) for details and for other training backbone options (e.g. Trinity-RFT).
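
The link changes in these three files all replace a target with the path of a Markdown source file, written relative to the page that contains the link. The sketch below shows why the old targets miss. It assumes links are resolved against the directory of the source file, which is how MkDocs treats links to `.md` files; whether these docs are built with MkDocs is an assumption, and `resolve_link` is a hypothetical helper, not part of any tool.

```python
import posixpath


def resolve_link(source_file: str, link: str) -> str:
    """Resolve a relative link against the directory of the file containing it."""
    base_dir = posixpath.dirname(source_file)
    return posixpath.normpath(posixpath.join(base_dir, link))


# (source file, old link, new link) for each change in this commit.
changes = [
    ("docs/en/intro.md", "docs/en/workflow.md", "workflow.md"),
    ("docs/en/tune_your_first_agent.md", "../agent_framework_support",
     "agent_framework_support.md"),
    ("docs/index.md", "./installation/", "en/installation.md"),
]

for source, old, new in changes:
    print(f"{source}: {resolve_link(source, old)} -> {resolve_link(source, new)}")
```

Under that assumption, each old link resolves to a path outside `docs/en/` or to a doubled `docs/en/docs/en/` prefix, while each new link resolves to a file under `docs/en/`.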