
Commit c474f3b

docs: update README
1 parent 016a73b commit c474f3b

2 files changed

Lines changed: 4 additions & 4 deletions

README.md

Lines changed: 2 additions & 2 deletions
@@ -82,8 +82,8 @@ Inspired by Kimi-K2's [technical report](https://arxiv.org/pdf/2507.20534) (Impr
 | Method | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | **Average** |
 |:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
 | SlimPajama-6B trained for 2 epochs | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
-| SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | **22.70** | **24.75** | **1.36** | **26.19** | **51.90** | **25.56**(↑1.32) |
-| SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | **28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑0.9) |
+| SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | **22.70** | **24.75** | **1.36** | **26.19** | 51.90 | **25.56**(↑1.32) |
+| SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | **28.79** | 20.22 | 24.46 | 0.00 | 24.97 | **52.41** | 25.14(↑0.9) |

 Both rephrase methods lift the average by ~1 point over the baseline with **zero additional data** — all gains come from how the same knowledge is expressed.

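
The hunk above moves the bold marker in the TruthfulQA-MC2 column from 51.90 to 52.41. A minimal sketch for sanity-checking that kind of change is shown below: it recomputes the Average column, the gain over the baseline, and the best method per benchmark from the scores in the table. It is not part of the commit; the names `SCORES`, `average`, `gain_over_baseline`, and `best_method` are illustrative. It assumes the reported gains are differences of the already-rounded averages, which is the reading that reproduces ↑1.32 and ↑0.9.

```python
from decimal import Decimal, ROUND_HALF_UP

# Column order of the table in the hunk above.
BENCHMARKS = ["ARC-E", "ARC-C", "HellaSwag", "GSM8K",
              "TruthfulQA-MC1", "TruthfulQA-MC2"]

# Scores copied from the table; kept as strings so Decimal parses them exactly.
# The dictionary keys are shorthand labels, not names used by the README.
SCORES = {
    "baseline": ["25.55", "21.08", "24.48", "0.08", "24.36", "49.90"],
    "executive-summary": ["26.43", "22.70", "24.75", "1.36", "26.19", "51.90"],
    "cross-domain": ["28.79", "20.22", "24.46", "0.00", "24.97", "52.41"],
}


def average(values):
    """Mean of the scores, rounded half-up to two decimals."""
    total = sum(Decimal(v) for v in values)
    return (total / len(values)).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


def gain_over_baseline(method):
    """Difference between the rounded averages of `method` and the baseline."""
    return average(SCORES[method]) - average(SCORES["baseline"])


def best_method(benchmark):
    """Label of the method with the highest score on `benchmark`."""
    column = BENCHMARKS.index(benchmark)
    return max(SCORES, key=lambda m: Decimal(SCORES[m][column]))


for name, values in SCORES.items():
    print(f"{name}: average={average(values)} gain={gain_over_baseline(name)}")

for benchmark in BENCHMARKS:
    print(f"{benchmark}: best={best_method(benchmark)}")
```

Under these assumptions the best TruthfulQA-MC2 score belongs to the Cross-Domain row, which agrees with the bolding on the `+` side of the diff.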

README_zh.md

Lines changed: 2 additions & 2 deletions
@@ -85,8 +85,8 @@ GraphGen first builds a fine-grained knowledge graph from the source text, then uses
 | Method | ARC-E | ARC-C | HellaSwag | GSM8K | TruthfulQA-MC1 | TruthfulQA-MC2 | **Average** |
 |:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
 | SlimPajama-6B trained for 2 epochs | 25.55 | 21.08 | 24.48 | 0.08 | 24.36 | 49.90 | 24.24 |
-| SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | **22.70** | **24.75** | **1.36** | **26.19** | **51.90** | **25.56**(↑1.32) |
-| SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | **28.79** | 20.22 | 24.46 | 0.00 | 24.97 | 52.41 | 25.14(↑0.9) |
+| SlimPajama-6B + Executive-Summary Rephrase trained for 1 epoch | 26.43 | **22.70** | **24.75** | **1.36** | **26.19** | 51.90 | **25.56**(↑1.32) |
+| SlimPajama-6B + Cross-Domain Rephrase trained for 1 epoch | **28.79** | 20.22 | 24.46 | 0.00 | 24.97 | **52.41** | 25.14(↑0.9) |

 Both rephrase methods raise average performance by about 1 percentage point over the baseline with **zero additional data**: all gains come from expressing the same knowledge in a different way.

