File tree Expand file tree Collapse file tree 2 files changed +8
-8
lines changed
Expand file tree Collapse file tree 2 files changed +8
-8
lines changed Original file line number Diff line number Diff line change @@ -81,11 +81,11 @@ Inspired by Kimi-K2's [technical report](https://arxiv.org/pdf/2507.20534) (Impr
8181
8282| Method | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | ** Average** |
8383| :---| :---:| :---:| :---:| :---:| :---:| :---:| :---:|
84- | SlimPajama-6B (baseline) | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
85- | Executive-Summary Rephrase | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (1.32↑ ) |
86- | Cross-Domain Rephrase | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(0.9↑ ) |
84+ | SlimPajama-6B trained for 2 epochs | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
85+ | SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (↑ 1.32) |
86+ | SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑ 0.9) |
8787
88- Both rephrase strategies lift the average by ~ 1 point over the baseline with ** zero additional data** — all gains come from how the same knowledge is expressed.
88+ Both rephrase methods lift the average by ~ 1 point over the baseline with ** zero additional data** — all gains come from how the same knowledge is expressed.
8989
9090
9191### SFT
Original file line number Diff line number Diff line change @@ -84,11 +84,11 @@ GraphGen 首先根据源文本构建细粒度的知识图谱,然后利用期
8484
8585| 方法 | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | ** 平均值** |
8686| :---| :---:| :---:| :---:| :---:| :---:| :---:| :---:|
87- | SlimPajama-6B (baseline)) | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
88- | Executive-Summary Rephrase | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (1.32↑ ) |
89- | Cross-Domain Rephrase | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(0.9↑ ) |
87+ | SlimPajama-6B 训练 2 epoch | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
88+ | SlimPajama-6B + Executive-Summary Rephrase 训练 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (↑ 1.32) |
89+ | SlimPajama-6B + Cross-Domain Rephrase 训练 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑ 0.9) |
9090
91- 两种重述策略均在 ** 零额外数据** 的情况下,将平均性能较基线提升约 1 个百分点——所有增益均来自于对相同知识的不同表达方式。
91+ 两种重述方法均在 ** 零额外数据** 的情况下,将平均性能较基线提升约 1 个百分点——所有增益均来自于对相同知识的不同表达方式。
9292
9393
9494### SFT
You can’t perform that action at this time.
0 commit comments