File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -82,8 +82,8 @@ Inspired by Kimi-K2's [technical report](https://arxiv.org/pdf/2507.20534) (Impr
8282| Method | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | ** Average** |
8383| :---| :---:| :---:| :---:| :---:| :---:| :---:| :---:|
8484| SlimPajama-6B trained for 2 epochs | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
85- | SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (↑1.32) |
86- | SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑0.9) |
85+ | SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | 51.90 | ** 25.56** (↑1.32) |
86+ | SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | ** 52.41** | 25.14(↑0.9) |
8787
8888Both rephrase methods lift the average by ~ 1 point over the baseline with ** zero additional data** — all gains come from how the same knowledge is expressed.
8989
Original file line number Diff line number Diff line change @@ -85,8 +85,8 @@ GraphGen 首先根据源文本构建细粒度的知识图谱,然后利用期
8585| 方法 | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | ** 平均值** |
8686| :---| :---:| :---:| :---:| :---:| :---:| :---:| :---:|
8787| SlimPajama-6B 训练 2 epoch | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
88- | SlimPajama-6B + Executive-Summary Rephrase 训练 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | ** 51.90** | ** 25.56** (↑1.32) |
89- | SlimPajama-6B + Cross-Domain Rephrase 训练 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑0.9) |
88+ | SlimPajama-6B + Executive-Summary Rephrase 训练 1 epoch | 26.43 | ** 22.70** | ** 24.75** | ** 1.36** | ** 26.19** | 51.90 | ** 25.56** (↑1.32) |
89+ | SlimPajama-6B + Cross-Domain Rephrase 训练 1 epoch | ** 28.79** | 20.22 | 24.46 | 0.00 | 24.97 | ** 52.41** | 25.14(↑0.9) |
9090
9191两种重述方法均在** 零额外数据** 的情况下,将平均性能较基线提升约 1 个百分点——所有增益均来自于对相同知识的不同表达方式。
9292
You can’t perform that action at this time.
0 commit comments