We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 4217765 commit e97a243Copy full SHA for e97a243
1 file changed
README.md
@@ -103,7 +103,7 @@ Here is post-training result which **over 50% SFT data** comes from GraphGen and
103
We applied reinforcement learning directly to the Qwen2.5-7B base model without any prior SFT. Here are the results.
104
| Domain | Dataset | Ours | Qwen2.5-7B-Instruct (baseline) |
105
|:---------:|:---------------------------------------------------------:|:--------:|:------------------------------:|
106
-| Plant | SeedBench | **66.8** | 51.5 |
+| Plant | [SeedBench](https://github.com/open-sciencelab/SeedBench) | **66.8** | 51.5 |
107
| law | LawBench | **55.2** | 54.76 |
108
| Medicine | MedQA | **87.1** | 80.7 |
109
| General | BBH | **55.3** | 49.6 |
0 commit comments