Skip to content

Commit cc01884

Browse files
[Klaud Cold] Update minimaxm2.5-fp8-b200-vllm vLLM image to v0.21.0 (#1449)
* Update minimaxm2.5-fp8-b200-vllm vLLM image to v0.21.0 * chore: fill pr-link for #1449
1 parent 8a928f6 commit cc01884

2 files changed

Lines changed: 7 additions & 1 deletion

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4252,7 +4252,7 @@ gptoss-fp4-b200-vllm-agentic:
42524252
- { tp: 8, offloading: cpu, conc-list: [64, 96, 128, 192, 256] }
42534253

42544254
minimaxm2.5-fp8-b200-vllm:
4255-
image: vllm/vllm-openai:v0.19.0-cu130
4255+
image: vllm/vllm-openai:v0.21.0
42564256
model: MiniMaxAI/MiniMax-M2.5
42574257
model-prefix: minimaxm2.5
42584258
runner: b200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2692,3 +2692,9 @@
26922692
description:
26932693
- "Update vLLM ROCm image from v0.18.0 (50d old) to v0.21.0"
26942694
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1469
2695+
2696+
- config-keys:
2697+
- minimaxm2.5-fp8-b200-vllm
2698+
description:
2699+
- "Update vLLM image from v0.19.0-cu130 (25d old) to v0.21.0"
2700+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1449

0 commit comments

Comments
 (0)