Commit 9e73442

Merge remote-tracking branch 'origin/main' into dev/shuchang_newjudge
2 parents efa7fac + 6133583

57 files changed

Lines changed: 4185 additions & 550 deletions


README.md

Lines changed: 42 additions & 24 deletions
@@ -12,24 +12,39 @@
 </div>
 
 
-**AgentJet (AJet)** is a cutting-edge, user-friendly training framework designed to optimize agents and workflows (built with OpenAI SDK, AgentScope, Langchain, or just HTTP requests), fine-tuning language model weights behind the scenes.
+**AgentJet (AJet)** is a cutting-edge, user-friendly agent RL training framework designed to optimize agents and agentic workflows (supporting any agent built with the OpenAI SDK, AgentScope, LangChain, or raw HTTP requests), fine-tuning LLM weights to enhance model performance.
 
-Simply provide your agent **workflow**, training **dataset**, and **reward** function, and **AgentJet** will be ready to enhance your agents to their optimal performance!
+**AgentJet (AJet)** has a fully-distributed **swarm training** capability, which means you can **run `ajet-swarm start` on your GPU server(s) and then train agents from your laptop(s)**! Simply provide your agent workflow, training dataset, and reward function, and AgentJet will be ready to go!
 
 
 
-## ✈️ Minimum Example
+## ✈️ Fast Introduction
 
-Let's begin with the simplest example: a math agent with a tool call.
+### Classic Mode
 
-- First, please check out the [installation guide](https://modelscope.github.io/AgentJet/en/installation/) to set up the training environment.
-- Then, tune your first model using the minimum example.
-```python
-ajet --conf tutorial/example_math_agent/math_agent.yaml --backbone='verl'
+Let's begin with the simplest example: a math agent with a tool call. This is a simple, centralized training setup.
 
-# change to --backbone='trinity' if you want to switch to trinity training engine;
-# or --backbone='debug' if you want to debug with only vLLM
-```
+1. Please check out the [installation guide](https://modelscope.github.io/AgentJet/en/installation/) to set up the training environment.
+2. Tune your first model using the minimum example:
+```bash
+ajet --conf ./tutorial/example_math_agent/math_agent.yaml --backbone='verl'
+```
+<div align="center">
+<img width="640" alt="image" src="https://serve.gptacademic.cn/publish/shared/Image/classic+swarm+revise.jpg"/>
+</div>
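For orientation, the referenced `math_agent.yaml` is not reproduced in this diff; a minimal config in its spirit might look like the sketch below. The key paths are inferred from the fields that `ajet/copilot/job.py` assigns (`ajet.model.path`, `ajet.trainer_common.n_gpus_per_node`, `ajet.trainer_common.algorithm.adv_estimator`, `ajet.rollout.num_repeat`, `ajet.data.train_batch_size`, `ajet.rollout.user_workflow`); the concrete values are placeholders, not the tutorial's actual content.

```yaml
# Hypothetical sketch; key paths inferred from ajet/copilot/job.py, values invented.
ajet:
  model:
    path: Qwen/Qwen2.5-7B-Instruct        # placeholder model
  trainer_common:
    n_gpus_per_node: 8
    algorithm:
      adv_estimator: grpo                 # placeholder estimator name
  rollout:
    num_repeat: 8                         # rollout group size (renamed from grpo_n in this commit)
    user_workflow: path.to.math_workflow  # placeholder workflow reference
  data:
    train_batch_size: 32
```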
+
+### Swarm Mode
+
+Let's begin with the simplest AgentJet Swarm example: also a math agent. In this case, you can use any GPU-less laptop to train the model remotely.
+
+1. Start the swarm server and begin swarm overwatch: `ajet-swarm start` and `ajet-swarm overwatch`.
+2. From your laptop (or the swarm server's localhost), run [this simple script](https://github.com/modelscope/AgentJet/blob/main/tutorial/example_math_swarm/math.py) to begin training:
+```bash
+AJET_SWARM_URL="http://swarm-server-ip:10086" python ./tutorial/example_math_swarm/math.py
+```
+<div align="center">
+<img width="600" alt="image" src="https://github.com/user-attachments/assets/41ed1e71-8b18-4c4c-b5e2-833399317337"/>
+</div>
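The command above passes the swarm endpoint through the `AJET_SWARM_URL` environment variable. A self-contained sketch of how a client script might resolve it (the variable name and port come from the README example; the function name and the localhost fallback are illustrative assumptions):

```python
import os

def resolve_swarm_url(default: str = "http://localhost:10086") -> str:
    """Return the swarm server endpoint, preferring the AJET_SWARM_URL env var."""
    return os.environ.get("AJET_SWARM_URL", default)

# Mimic the README invocation: AJET_SWARM_URL="http://swarm-server-ip:10086" python math.py
os.environ["AJET_SWARM_URL"] = "http://swarm-server-ip:10086"
print(resolve_swarm_url())  # http://swarm-server-ip:10086
```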
 
 
 ## ✈️ Features
@@ -38,7 +53,8 @@ We aim to build a easy-to-learn Agent tuner that unlock more possibilities for a
 
 - **Easy and Friendly**. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
 - **Rich Tutorial Library**. AgentJet provides a rich library of [examples](https://github.com/modelscope/AgentJet/tree/main/tutorial) as tutorials.
-- **Efficient and Scalable**. AgentJet uses [verl] as the default backbone (`--backbone=verl`). However, we also support [trinity](https://github.com/modelscope/Trinity-RFT/) as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
+- **Swarm Training**. [This unique feature](https://modelscope.github.io/AgentJet/en/swarm_intro_blog_english/) of AgentJet opens many possibilities: deploying distributed and self-healing rollout workers, **non-shared-parameter multi-agent** training, and **multi-runtime, multi-task cocktail** training. And just like Tinker, you can use AgentJet Swarm to train models even on **GPU-less laptop(s)**.
+- **Efficient and Scalable**. AgentJet uses [verl](https://github.com/volcengine/verl) as the default backbone (`--backbone=verl`). However, we also support [trinity](https://github.com/modelscope/Trinity-RFT/) as an alternative backbone, accelerating your tuning process via fully asynchronous RFT.
 - **Flexible and Fast**. AgentJet supports [multi-agent workflows](https://modelscope.github.io/AgentJet/en/workflow/) and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
 - **Reliability and Reproducibility**. Our team keeps track of framework performance across multiple [tasks + major-git-version + training-backbones](https://benchmark.agentjet.top/) (under construction, still gathering data, coming soon).
 
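The 1.5x-to-10x figure attributed to context merging can be sanity-checked with a toy cost model: in a multi-turn rollout, each LLM call's context is a prefix of the next call's context, so one merged timeline with per-token loss masks can replace all the per-call training samples. The code below is my illustration of that accounting, not AgentJet's actual implementation:

```python
def is_prefix(short, long):
    """True if `short` is a prefix of `long` (token-ID lists)."""
    return len(short) <= len(long) and long[: len(short)] == short

def merge_savings(timelines):
    """Return (naive_tokens, merged_tokens) for prefix-chained call contexts."""
    timelines = sorted(timelines, key=len)
    # Context merging only applies when the calls share history.
    assert all(is_prefix(a, b) for a, b in zip(timelines, timelines[1:]))
    naive = sum(len(t) for t in timelines)   # encode every call separately
    merged = len(timelines[-1])              # encode the longest timeline once
    return naive, merged

# Three calls of a 3-turn conversation with 100, 220, and 350 context tokens.
calls = [list(range(100)), list(range(220)), list(range(350))]
naive, merged = merge_savings(calls)
print(naive, merged, round(naive / merged, 2))  # 670 350 1.91
```

The ratio grows with the number of turns, which is consistent with deeply multi-turn (or multi-agent) workflows benefiting the most.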
@@ -48,6 +64,11 @@ For advanced researchers, AgentJet also provides high-resolution logging and deb
 - **High-Resolution Logging**: AgentJet allows users to save and inspect token-level rollout details, recording token IDs, token loss masks, and even token logprobs to facilitate workflow development and agent diagnostics.
 - **Fast Debugging**: AgentJet also provides the `--backbone=debug` option for the best debugging experience, shortening your wait period from minutes to seconds after code changes and enabling breakpoint debugging in IDEs.
 
+<div align="center">
+<img width="600" alt="image" src="https://serve.gptacademic.cn/publish/shared/Image/ai-generated-1771873242388.jpg"/>
+</div>
+
+
 ---
 
 ### ✈️ Quick Start
@@ -56,13 +77,6 @@ For advanced researchers, AgentJet also provides high-resolution logging and deb
 
 - **Click here to read the** [**installation guide**](https://modelscope.github.io/AgentJet/en/installation/).
 
-#### Run Training
-
-- You can start training your first agent with a single command using a pre-configured YAML file. Take the [Math agent](https://modelscope.github.io/AgentJet/en/example_math_agent/) as an example:
-
-```bash
-ajet --conf tutorial/example_math_agent/math_agent.yaml
-```
 
 #### Example Library
 
@@ -75,6 +89,11 @@ Explore our rich library of examples to kickstart your journey:
 - 🎴 [**Writing a countdown game using AgentScope and solving it**](https://modelscope.github.io/AgentJet/en/example_countdown).
 - 🚶 [**Solving a frozen lake walking puzzle using AgentJet**](https://modelscope.github.io/AgentJet/en/example_frozenlake).
 
+Explore our automated benchmarking system at [https://benchmark.agentjet.top/](https://benchmark.agentjet.top/):
+<div align="center">
+<img width="600" alt="image" src="https://serve.gptacademic.cn/publish/shared/Image/benchmark.gif"/>
+</div>
+
 
 ---
 
@@ -105,6 +124,7 @@ The internal system orchestrates several specialized modules to handle the compl
 * **Task Runner**: Executes the Agent workflow and calculates rewards.
 * **Model Tuner**: Forwards inference requests from the workflow to the LLM engine.
 * **Context Tracker**: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by **1.5x to 10x**.
+* **Swarm Server**: A data interchange center that accepts OpenAI-like requests and engine instructions, activated only in AgentJet Swarm mode.
 
 
 
@@ -122,14 +142,11 @@ AgentJet is a constantly evolving project. We are planning to add the following
 
 | Category | Feature | Status |
 | :--- | :--- | :--- |
-| **Examples** | Covering LangGraph and AutoGen frameworks | Done & Verifying |
 | **Examples** | Add LoRA training examples | Todo |
-| **Infra** | Cross-process Tuner wrapper to pass though process forking | Done & Verifying |
 | **Infra** | Optimize configurations for long-context adaptation on smaller GPUs | In Progress |
-| **Capability** | Prompt tuning | In Progress |
 | **Capability** | Multi-modal training support | Todo |
 | **Capability** | MARL Credit assignment | Todo |
-| **Capability** | Training dataset generation from few-shot samples | Done & Verifying |
+| **Capability** | Training dataset generation from few-shot samples | Todo |
 
 
 ## ✈️ Citation
@@ -152,8 +169,9 @@ If you use AgentJet in your research, please cite:
 
 ---
 <div align="center">
+This project is under active development; we need your help to make it shine! <br/>
 
-[⭐ Star Us](https://github.com/modelscope/AgentJet) · [Report Bug](https://github.com/modelscope/AgentJet/issues) · [Request Feature](https://github.com/modelscope/AgentJet/issues)
+[⭐ Star Us](https://github.com/modelscope/AgentJet) · [✈️ Report Bug](https://github.com/modelscope/AgentJet/issues) · [✈️ Request Feature](https://github.com/modelscope/AgentJet/issues)
 </div>
 
 
ajet/copilot/job.py

Lines changed: 7 additions & 3 deletions
@@ -12,7 +12,6 @@
 from types import SimpleNamespace
 from typing import Any, Callable, Union
 
-import ray
 import yaml
 from loguru import logger
 
@@ -45,15 +44,17 @@ def __init__(
         project_name="ajet-swarm",
         experiment_name="test",
         n_gpu_for_infer: int | None = None,  # only for trinity backbone
-        grpo_n: int = 8,
+        num_repeat: int = 8,
         batch_size: int = 32,
         swarm_mode: bool = True,
+        sample_collection_method: str = "rollout_until_finish_enough_tasks",
         *kwargs,
     ) -> None:
         self.backbone = backbone
         self.exp_dir = DEFAULT_DIR
         self.project_name = project_name
         self.exp_name = experiment_name
+        self.sample_collection_method = sample_collection_method
         if swarm_mode:
             default_yaml = os.path.abspath(os.path.join(os.path.dirname(__file__), '..', "default_config/ajet_ts_default.yaml"))
         else:
@@ -66,8 +67,10 @@ def __init__(
         self.config.ajet.model.path = model
         self.config.ajet.trainer_common.n_gpus_per_node = n_gpu
         self.config.ajet.trainer_common.algorithm.adv_estimator = algorithm
-        self.config.ajet.rollout.num_repeat = grpo_n
+        self.config.ajet.rollout.num_repeat = num_repeat
         self.config.ajet.data.train_batch_size = batch_size
+        self.config.ajet.enable_swarm_mode = swarm_mode
+        self.config.ajet.swarm_mode_sample_collection_method = sample_collection_method
         if n_gpu_for_infer is None and backbone == "trinity":
             raise ValueError("Please specify `n_gpu_for_infer` (n_gpu_for_infer < n_gpu) for trinity backbone.")
         if (n_gpu_for_infer is not None) and backbone == "verl":
@@ -134,6 +137,7 @@ def set_data(
         return self
 
     def tune(self, *args, **kwargs) -> "AgentJetJob":
+        import ray
         ast_cfg = self.config.ajet
         if not ast_cfg.rollout or not ast_cfg.rollout.user_workflow:
             raise ValueError("Workflow must be set via set_workflow before tuning.")
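The diff above also moves `import ray` from module scope into `tune()`. This is the deferred-import pattern: merely importing the module stays fast and Ray-free, and only processes that actually call `tune()` pay the import cost. A generic, self-contained sketch of the pattern, with a stdlib module standing in for the heavy dependency:

```python
class Job:
    """Minimal stand-in showing the deferred-import pattern from this commit."""

    def tune(self):
        # Import the heavy dependency only when it is actually needed,
        # so `import this_module` never pulls it in.
        import json  # stand-in for a heavy library such as ray
        return json.dumps({"status": "tuning"})

print(Job().tune())  # {"status": "tuning"}
```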
