@@ -9067,7 +9067,7 @@ glm5-fp8-b200-dynamo-sglang:
 
 # MTP variant of dsv4-fp4-gb300-dynamo-sglang.
 dsv4-fp4-gb300-dynamo-sglang-mtp:
-  image: lmsysorg/sglang:nightly-dev-cu13-20260509-9ee83034
+  image: lmsysorg/sglang:nightly-dev-20260527-14f81a67
   model: deepseek-ai/DeepSeek-V4-Pro
   model-prefix: dsv4
   runner: gb300-cw
     - "Add DeepSeek-V4-Pro FP4 MI355X ATOM MTP3 benchmark; image rocm/atom:rocm7.2.4_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom0.1.3"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1627
 
+- config-keys:
+    - dsv4-fp4-gb300-dynamo-sglang-mtp
+  description:
+    - "Update SGLang image from nightly-dev-cu13-20260509-9ee83034 to nightly-dev-20260527-14f81a67"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1637
+
+
 - config-keys:
     - minimaxm2.5-fp4-gb200-dynamo-vllm
   description:
     - "Same 1k/1k and 8k/1k search space as gb300, plus a new tp8-1p1d at low concurrencies for both ISLs"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1652
 
+- config-keys:
+    - minimaxm2.5-fp4-gb300-dynamo-vllm
+  description:
+    - "Add MiniMax-M2.5 NVFP4 GB300 disaggregated multinode vLLM benchmarks via Dynamo"
+    - "Add 1k1k/8k1k minimax recipe set under benchmarks/multi_node/srt-slurm-recipes/vllm/minimax-m2.5/"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1641
+
 - config-keys:
     - minimaxm2.5-fp8-gb200-dynamo-vllm
   description: