Skip to content

Commit 4e2658e

Browse files
Oseltamivirclaude
andcommitted
bench: bump dsv4 gb300 sglang mtp 8p1d conc to 40960x65536
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent cdf21c3 commit 4e2658e

2 files changed

Lines changed: 2 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8726,7 +8726,7 @@ dsv4-fp4-gb300-dynamo-sglang-mtp3:
87268726
dp-attn: true
87278727
# Mid curve 8p1d-dep8-dep8. 18 nodes.
87288728
- spec-decoding: mtp
8729-
conc-list: [32768, 65536]
8729+
conc-list: [40960, 65536]
87308730
prefill:
87318731
num-worker: 8
87328732
tp: 8

benchmarks/multi_node/srt-slurm-recipes/sglang/deepseek-v4/8k1k/disagg-mid-curve-8p1d-dep8-dep8-mtp-c32768.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -151,7 +151,7 @@ benchmark:
151151
isl: 8192
152152
osl: 256
153153
random_range_ratio: 1.0
154-
concurrencies: "32768x65536"
154+
concurrencies: "40960x65536"
155155
req_rate: "inf"
156156
use_chat_template: true
157157
custom_tokenizer: "sa_bench_tokenizers.sglang_deepseek_v4.SGLangDeepseekV4Tokenizer"

0 commit comments

Comments
 (0)