Commit 82914ea

[Klaud Cold] Update minimaxm2.5-fp4-b200-vllm vLLM image to v0.21.0 (#1448)
* Update minimaxm2.5-fp4-b200-vllm vLLM image to v0.21.0
* chore: fill pr-link for #1448
1 parent cc01884 commit 82914ea

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -4353,7 +4353,7 @@ minimaxm2.5-fp8-b300-vllm-agentic:
     - { tp: 4, offloading: cpu, conc-list: [48, 64, 96, 100, 104, 108, 112, 116, 120, 124, 128, 192] }
 
 minimaxm2.5-fp4-b200-vllm:
-  image: vllm/vllm-openai:v0.19.0-cu130
+  image: vllm/vllm-openai:v0.21.0
   model: nvidia/MiniMax-M2.5-NVFP4
   model-prefix: minimaxm2.5
   runner: b200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -2698,3 +2698,9 @@
   description:
     - "Update vLLM image from v0.19.0-cu130 (25d old) to v0.21.0"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1449
+
+- config-keys:
+    - minimaxm2.5-fp4-b200-vllm
+  description:
+    - "Update vLLM image from v0.19.0-cu130 (25d old) to v0.21.0"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1448
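
The two files in this commit have to agree: the changelog entry names a config key and describes the image tag that the config now uses. A minimal sketch of a consistency check is shown here, assuming both YAML files have already been parsed into plain Python dicts. The helpers `image_tag` and `check_entry` are hypothetical and not part of the repository; the sample data is copied from the diff above.

```python
def image_tag(image: str) -> str:
    """Return the tag of an image reference such as 'repo/name:tag'."""
    # Assumption: the reference always carries an explicit tag after the last ':'.
    _, sep, tag = image.rpartition(":")
    if not sep or not tag:
        raise ValueError(f"image reference has no tag: {image!r}")
    return tag


def check_entry(entry: dict, configs: dict) -> list[str]:
    """Return a list of problems found in one changelog entry (empty if none)."""
    problems = []
    for key in entry.get("config-keys", []):
        if key not in configs:
            problems.append(f"unknown config key: {key}")
            continue
        tag = image_tag(configs[key]["image"])
        # The description should mention the tag the config now points at.
        if not any(tag in line for line in entry.get("description", [])):
            problems.append(f"{key}: description does not mention {tag}")
    if not entry.get("pr-link"):
        problems.append("missing pr-link")
    return problems


# Sample data taken from the two hunks in this commit.
configs = {
    "minimaxm2.5-fp4-b200-vllm": {"image": "vllm/vllm-openai:v0.21.0"},
}
entry = {
    "config-keys": ["minimaxm2.5-fp4-b200-vllm"],
    "description": ["Update vLLM image from v0.19.0-cu130 (25d old) to v0.21.0"],
    "pr-link": "https://github.com/SemiAnalysisAI/InferenceX/pull/1448",
}

problems = check_entry(entry, configs)
print(problems or "entry is consistent with the config")
```

A check like this would flag an entry whose `config-keys` item is misspelled or whose description still names the old tag only; it does not verify that the image exists in a registry.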
