Skip to content

Commit 1a91193

Browse files
[Klaud Cold] Update dsv4-fp4-b200-vllm (+mtp) vLLM image to v0.21.0 (#1476)
* Update dsv4-fp4-b200-vllm (+mtp) vLLM image to v0.21.0 Update vLLM image from v0.20.0-cu130 (20d/18d old) to v0.21.0 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: fill pr-link for #1476 --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 8c5128a commit 1a91193

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1737,7 +1737,7 @@ dsv4-fp4-b200-sglang:
17371737
- { tp: 8, ep: 8, dp-attn: true, conc-start: 256, conc-end: 512 }
17381738

17391739
dsv4-fp4-b200-vllm:
1740-
image: vllm/vllm-openai:v0.20.0-cu130
1740+
image: vllm/vllm-openai:v0.21.0
17411741
model: deepseek-ai/DeepSeek-V4-Pro
17421742
model-prefix: dsv4
17431743
runner: b200-dsv4
@@ -1825,7 +1825,7 @@ dsv4-fp4-b200-trt-mtp:
18251825
# MTP variant of dsv4-fp4-b200-vllm. Mirrors the base search space and adds
18261826
# --speculative-config '{"method":"mtp","num_speculative_tokens":2}'.
18271827
dsv4-fp4-b200-vllm-mtp:
1828-
image: vllm/vllm-openai:v0.20.0-cu130
1828+
image: vllm/vllm-openai:v0.21.0
18291829
model: deepseek-ai/DeepSeek-V4-Pro
18301830
model-prefix: dsv4
18311831
runner: b200-dsv4

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2769,3 +2769,10 @@
27692769
description:
27702770
- "Update TensorRT-LLM image from v1.1.0rc2.post2 (154d/124d old) to v1.3.0rc14 (latest pre-release)"
27712771
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1487
2772+
2773+
- config-keys:
2774+
- dsv4-fp4-b200-vllm
2775+
- dsv4-fp4-b200-vllm-mtp
2776+
description:
2777+
- "Update vLLM image from v0.20.0-cu130 (20d/18d old) to v0.21.0"
2778+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1476

0 commit comments

Comments
 (0)