You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**AgentJet (AJet)** is a cutting-edge, user-friendly training framework designed to optimize agents and workflows (built with OpenAI SDK, AgentScope, and even vllm http requests), fine-tuning language model weights behind the scenes.
9
14
10
-
Simply provide your Agent workflow, training data, and reward function, and we will be ready to enhance your agents to their optimal performance!
15
+
**AgentJet (AJet)** is a cutting-edge, user-friendly training framework designed to optimize agents and workflows (built with OpenAI SDK, AgentScope, Langchain, or just HTTP requests), fine-tuning language model weights behind the scenes.
11
16
17
+
Simply provide your agent **workflow**, training **dataset**, and **reward** function, and **AgentJet** will be ready to enhance your agents to their optimal performance!
12
18
13
19
14
-
## 💡 Minimum Example
20
+
21
+
## 🛩️ Minimum Example
15
22
16
23
Let's begin with the simplest example: a math agent with a tool call.
17
24
18
-
- First, please check out the [installation guide](docs/en/installation.md) to set up the training environment.
25
+
- First, please check out the [installation guide](https://doc.agentjet.top/AgentJet/en/installation/) to set up the training environment.
19
26
- Then, tune your first model using the minimum example.
# change to --backbone='trinity' if you want to switch to trinity training engine;
31
+
# or --backbone='debug' if you want to debug with only vLLM
22
32
```
23
33
24
34
25
-
## Features
35
+
## 🛩️ Features
26
36
27
37
We aim to build a easy-to-learn Agent tuner that unlock more possibilities for agent developers:
28
38
29
39
-**Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
30
40
-**Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
31
41
-**Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support [trinity](https://github.com/modelscope/Trinity-RFT/) as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
32
-
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](docs/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 20x when the workflow involves multi-turn (or multi-agent) conversations.
33
-
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, comming soon).
42
+
-**Flexible and Fast**. AgentJet supports [multi-agent workflows](https://doc.agentjet.top/AgentJet/en/workflow.md) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
43
+
-**Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agent-matrix.com/) (under construction, still gathering data, coming soon).
34
44
35
45
For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:
36
46
<!-- For advanced researchers, AgentJet provides high-resolution logging and debugging solutions that are, to our knowledge, unprecedented in other prior projects. -->
@@ -40,51 +50,35 @@ For advanced researchers, AgentJet also provides high-resolution logging and deb
40
50
41
51
---
42
52
43
-
### 🚀 Quick Start
53
+
### 🛩️ Quick Start
44
54
45
55
#### Installation
46
56
47
-
We recommend using `uv` for dependency management.
-**Click here to read the**[**installation guide**](https://doc.agentjet.top/AgentJet/en/installation/).
64
58
65
59
#### Run Training
66
60
67
-
You can start training your first agent with a single command using a pre-configured YAML file. Take the [Math agent](docs/en/example_math_agent.md) as an example:
61
+
-You can start training your first agent with a single command using a pre-configured YAML file. Take the [Math agent](https://doc.agentjet.top/AgentJet/en/example_math_agent/) as an example:
Explore our rich library of examples to kickstart your journey:
76
70
77
-
- 🔢 [**Training a math agent that can write python code**](docs/en/example_math_agent.md).
78
-
- 📱 [**Creating an AppWorld agent using AgentScope and training it**](docs/en/example_app_world.md).
79
-
- 🐺 [**Developing Werewolves RPG agents and training them**](docs/en/example_werewolves.md).
80
-
- 👩🏻⚕️ [**Learning to ask questions like a doctor**](docs/en/example_learning_to_ask.md).
81
-
- 🎴 [**Writing a countdown game using AgentScope and solving it**](docs/en/example_countdown.md).
82
-
- 🚶 [**Solving a frozen lake walking puzzle using AgentJet**](docs/en/example_frozenlake.md).
71
+
- 🔢 [**Training a math agent that can write python code**](https://doc.agentjet.top/AgentJet/en/example_math_agent).
72
+
- 📱 [**Creating an AppWorld agent using AgentScope and training it**](https://doc.agentjet.top/AgentJet/en/example_app_world).
73
+
- 🐺 [**Developing Werewolves RPG agents and training them**](https://doc.agentjet.top/AgentJet/en/example_werewolves).
74
+
- 👩🏻⚕️ [**Learning to ask questions like a doctor**](https://doc.agentjet.top/AgentJet/en/example_learning_to_ask).
75
+
- 🎴 [**Writing a countdown game using AgentScope and solving it**](https://doc.agentjet.top/AgentJet/en/example_countdown).
76
+
- 🚶 [**Solving a frozen lake walking puzzle using AgentJet**](https://doc.agentjet.top/AgentJet/en/example_frozenlake).
83
77
84
78
85
79
---
86
80
87
-
### 🧩 Core Concepts
81
+
### 🛩️ Core Concepts
88
82
89
83
AgentJet makes agent fine-tuning straightforward by separating the developer interface from the internal execution logic.
90
84
@@ -97,9 +91,9 @@ AgentJet makes agent fine-tuning straightforward by separating the developer int
97
91
98
92
To optimize an agent, you provide three core inputs:
99
93
100
-
*[**Trainable Workflow**](docs/en/workflow.md): Define your agent logic by inheriting the Workflow class, supporting both simple agent setups and advanced multi-agent collaborations.
101
-
*[**Task Reader**](docs/en/data_pipeline.md): Load training tasks from JSONL files, HuggingFace datasets, interactive environments, or auto-generate them from documents.
102
-
*[**Task Judger**](docs/en/task_judger.md): Evaluates agent outputs and assigns rewards to guide training.
94
+
*[**Trainable Workflow**](https://doc.agentjet.top/AgentJet/en/workflow): Define your agent logic by inheriting the Workflow class, supporting both simple agent setups and advanced multi-agent collaborations.
95
+
*[**Task Reader**](https://doc.agentjet.top/AgentJet/en/data_pipeline): Load training tasks from JSONL files, HuggingFace datasets, interactive environments, or auto-generate them from documents.
96
+
*[**Task Judger**](https://doc.agentjet.top/AgentJet/en/task_judger): Evaluates agent outputs and assigns rewards to guide training.
103
97
104
98
#### 2. Internal System Architecture
105
99
@@ -110,28 +104,29 @@ The internal system orchestrates several specialized modules to handle the compl
110
104
***Task Rollout**: Bridges LLM engines and manages the Gym environment lifecycle.
111
105
***Task Runner**: Executes the Agent workflow and calculates rewards.
112
106
***Model Tuner**: Forwards inference requests from the workflow to the LLM engine.
113
-
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **3x to 10x**.
107
+
***Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **1.5x to 10x**.
108
+
114
109
115
110
116
-
---
117
111
118
-
### 🚦 Navigation
112
+
### 🛩️ Navigation
119
113
120
-
*📖 **Tutorials**: From [Installation](docs/en/installation.md) to [Tuning your first agent](docs/en/tutorial.md) — the essential path for beginners.
121
-
*🛠️ **Core Components**: Define your [Trainable Workflow](docs/en/workflow.md) and manage [Data](docs/en/data_pipeline.md) and [Reward](docs/en/tune_your_first_agent.md).
122
-
*💡 **Example**: Check the [Example Library](#example-library) above for real-world cases like [Math](docs/en/example_math_agent.md), [Werewolves game](docs/en/example_werewolves.md) and [Learning to ask task](docs/en/example_learning_to_ask.md).
***Tutorials**: From [Installation](https://doc.agentjet.top/AgentJet/en/installation) to [Tuning your first agent](https://doc.agentjet.top/AgentJet/en/tune_your_first_agent) — the essential path for beginners.
115
+
***Core Components**: Define your [Trainable Workflow](https://doc.agentjet.top/AgentJet/en/workflow) and manage [Data](https://doc.agentjet.top/AgentJet/en/data_pipeline) and [Reward](https://doc.agentjet.top/AgentJet/en/task_judger).
116
+
***Example**: Check the [Example Library](#example-library) above for real-world cases like [Math](https://doc.agentjet.top/AgentJet/en/example_math_agent), [Werewolves game](https://doc.agentjet.top/AgentJet/en/example_werewolves) and [Learning to ask task](https://doc.agentjet.top/AgentJet/en/example_learning_to_ask).
0 commit comments