Skip to content

Commit e060d41

Browse files
Merge main into GB200 FP8 MiniMax PR
2 parents fd35ba3 + eb8350e commit e060d41

2 files changed

Lines changed: 15 additions & 1 deletion

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9067,7 +9067,7 @@ glm5-fp8-b200-dynamo-sglang:
90679067

90689068
# MTP variant of dsv4-fp4-gb300-dynamo-sglang.
90699069
dsv4-fp4-gb300-dynamo-sglang-mtp:
9070-
image: lmsysorg/sglang:nightly-dev-cu13-20260509-9ee83034
9070+
image: lmsysorg/sglang:nightly-dev-20260527-14f81a67
90719071
model: deepseek-ai/DeepSeek-V4-Pro
90729072
model-prefix: dsv4
90739073
runner: gb300-cw

perf-changelog.yaml

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3404,6 +3404,13 @@
34043404
- "Add DeepSeek-V4-Pro FP4 MI355X ATOM MTP3 benchmark; image rocm/atom:rocm7.2.4_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom0.1.3"
34053405
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1627
34063406

3407+
- config-keys:
3408+
- dsv4-fp4-gb300-dynamo-sglang-mtp
3409+
description:
3410+
- "Update SGLang image from nightly-dev-cu13-20260509-9ee83034 to nightly-dev-20260527-14f81a67"
3411+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1637
3412+
3413+
34073414
- config-keys:
34083415
- minimaxm2.5-fp4-gb200-dynamo-vllm
34093416
description:
@@ -3431,6 +3438,13 @@
34313438
- "Same 1k/1k and 8k/1k search space as gb300, plus a new tp8-1p1d at low concurrencies for both ISLs"
34323439
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1652
34333440

3441+
- config-keys:
3442+
- minimaxm2.5-fp4-gb300-dynamo-vllm
3443+
description:
3444+
- "Add MiniMax-M2.5 NVFP4 GB300 disaggregated multinode vLLM benchmarks via Dynamo"
3445+
- "Add 1k1k/8k1k minimax recipe set under benchmarks/multi_node/srt-slurm-recipes/vllm/minimax-m2.5/"
3446+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1641
3447+
34343448
- config-keys:
34353449
- minimaxm2.5-fp8-gb200-dynamo-vllm
34363450
description:

0 commit comments

Comments
 (0)