Commit 006bfab

[Klaud Cold] Update kimik2.5-fp4-b300-vllm vLLM image to v0.21.0 (#1452)
* Update kimik2.5-fp4-b300-vllm vLLM image to v0.21.0
* chore: fill pr-link for #1452
1 parent e7262bb commit 006bfab

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -2685,7 +2685,7 @@ kimik2.5-fp4-b200-vllm-agentic:
 # Kimi-K2.5 FP4 B200 vLLM recipe as-is until B300-specific tuning is available.

 kimik2.5-fp4-b300-vllm:
-  image: vllm/vllm-openai:v0.19.0-cu130
+  image: vllm/vllm-openai:v0.21.0
   model: nvidia/Kimi-K2.5-NVFP4
   model-prefix: kimik2.5
   runner: b300

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -2729,3 +2729,9 @@
   description:
     - "Add MTP/EAGLE speculative-decoding sibling of qwen3.5-bf16-mi325x-sglang"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1483
+
+- config-keys:
+    - kimik2.5-fp4-b300-vllm
+  description:
+    - "Update vLLM image from v0.19.0-cu130 (27d old) to v0.21.0"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1452
