Skip to content

Commit 3e0cda0

Browse files
authored
updated benchmark docs (#340)
* updated benchmark docs * polish * polish
1 parent 5c43694 commit 3e0cda0

4 files changed

Lines changed: 8 additions & 11 deletions

File tree

benchmarks/.gitignore

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
11
*.jsonl
2-
benchmark_results/
2+
results/

benchmarks/README.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,6 @@ python3 bench_eagle3.py \
3030
--attention-backend fa3 \
3131
--config-list 1,0,0,0 1,3,1,4 \
3232
--benchmark-list mtbench gsm8k:5 ceval:5:accountant \
33-
--output test.jsonl \
3433
--dtype bfloat16
3534
```
3635

benchmarks/bench_eagle3.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -70,9 +70,7 @@ def parse_args():
7070
"--skip-launch-server", action="store_true", default=False
7171
)
7272
benchmark_group.add_argument("--num-prompts", type=int, default=80)
73-
benchmark_group.add_argument(
74-
"--output-dir", type=str, default="./bernchmark_results"
75-
)
73+
benchmark_group.add_argument("--output-dir", type=str, default="./results")
7674
benchmark_group.add_argument(
7775
"--config-list", type=str, nargs="+", default=["1,0,0,0", "1,3,1,4"]
7876
)

docs/examples/llama3-eagle3-online.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -43,20 +43,20 @@ The four numbers in the config represent: `batch_size, num_steps, topk, num_veri
4343
A pre-trained EAGLE model is available at [zhuyksir/EAGLE3-Llama-3.1-8B-Instruct](https://huggingface.co/zhuyksir/EAGLE3-Llama-3.1-8B-Instruct) for reference.
4444

4545
```shell
46+
cd benchmarks
47+
4648
config_list=(
4749
"4,3,1,4"
4850
"4,7,10,60"
4951
)
50-
CUDA_VISIBLE_DEVICES=4,5,6,7 python3 bench_model_speedup.py \
52+
python3 bench_eagle3.py \
5153
--model-path meta-llama/Llama-3.1-8B-Instruct \
5254
--speculative-draft-model-path /YOUR/PATH/Llama-3.1-8B-Instruct/dev_outputs/epoch_0 \
53-
--port 20001 \
54-
--trust-remote-code \
55+
--port 30000 \
5556
--mem-fraction-static 0.8 \
56-
--tp-size 4 \
57+
--tp-size 1 \
5758
--config-list "${config_list[@]}" \
58-
--benchmark-list mtbench:80 gsm8k:200 humaneval:200 math500:200 \
59-
--output output.jsonl
59+
--benchmark-list mtbench gsm8k humaneval math500
6060
```
6161

6262

0 commit comments

Comments
 (0)