gipplab
diff --git a/.gitignore b/.gitignore
Lines changed: 9 additions & 0 deletions
diff --git a/.idea/deployment.xml b/.idea/deployment.xml
Lines changed: 8 additions & 1 deletion
diff --git a/.idea/sshConfigs.xml b/.idea/sshConfigs.xml
Lines changed: 1 addition & 0 deletions
diff --git a/.idea/webServers.xml b/.idea/webServers.xml
Lines changed: 7 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 5 additions & 3 deletions
diff --git a/examples/baselines/qwen2_5_vl_3b_clevr.sh b/examples/baselines/qwen2_5_vl_3b_clevr.sh
Lines changed: 0 additions & 14 deletions
diff --git a/examples/baselines/qwen2_5_vl_3b_geoqa8k.sh b/examples/baselines/qwen2_5_vl_3b_geoqa8k.sh
Lines changed: 0 additions & 88 deletions
diff --git a/examples/baselines/qwen2_5_vl_7b_doc_agent.sh b/examples/baselines/qwen2_5_vl_7b_doc_agent.sh
Lines changed: 0 additions & 126 deletions
@@ -178,3 +178,12 @@ wandb/
 dataset/
 generation_results/
 backups/
+
+examples/baselines/qwen2_5_vl_3b_clevr.sh
+examples/baselines/qwen2_5_vl_3b_geoqa8k.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent_generation_SCC.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent_NHR.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent_ppo_NHR.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent_ppo_SCC.sh
+examples/baselines/qwen2_5_vl_7b_doc_agent_SCC.sh
@@ -40,7 +40,7 @@ pip install -e .
 
 ### 1. Corpus Building
 
-We provide the processed training corpus on Hugging Face: **[SkyFishQ/ALDEN](https://www.google.com/search?q=https://huggingface.co/SkyFishQ/ALDEN)**.
+We provide the processed training corpus on Hugging Face: **[SkyFishQ/ALDEN](https://huggingface.co/datasets/SkyFishQ/ALDEN/tree/main)**.
 
 If you wish to build the corpus from scratch using your own data:
 
@@ -147,6 +147,8 @@ First, launch the RAG environment server which handles the `<search>` and `<fetc
         --port 42354
     ```
 
+    *We initially set two retrievers for each GPU. Adapt the number of GPU and retriever according to the specific devices in the yaml file.*
+
 ### Step 2: RL Training
 
 Once the tool server is running, start the training. Ensure the server URL in the training script points to the IP obtained in Step 1.
@@ -168,7 +170,7 @@ bash examples/baselines/qwen2_5_vl_7b_doc_agent_generation.sh
 ### Merge Checkpoints in the Hugging Face Format
 
 ```bash
-python3 scripts/model_merger.py \
+python scripts/model_merger.py \
     --local_dir checkpoints/easy_r1/exp_name/global_step_1/actor
 ```
 
@@ -194,4 +196,4 @@ This work is built upon the following excellent open-source projects:
 - [verl](https://github.com/volcengine/verl): For efficient RL training.
 - [ReCall](https://github.com/Agent-RL/ReCall): For RAG integration concepts.
 
-We greatly appreciate their valuable contributions to the community.
+We greatly appreciate their valuable contributions to the community.