Commit bd3726a

Update kimik2.5-fp4-b200-vllm vLLM image to v0.20.2

Ref #1154

Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>
1 parent ed5867f commit bd3726a

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -2500,7 +2500,7 @@ kimik2.5-int4-h200-vllm:
     - { tp: 8, conc-start: 4, conc-end: 64 }

 kimik2.5-fp4-b200-vllm:
-  image: vllm/vllm-openai:v0.17.0
+  image: vllm/vllm-openai:v0.20.2
   model: nvidia/Kimi-K2.5-NVFP4
   model-prefix: kimik2.5
   runner: b200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -2343,3 +2343,9 @@
   description:
     - "Add Qwen3.5-397B-A17B FP8 MI355X ATOM benchmark configs with and without MTP"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310
+
+- config-keys:
+    - kimik2.5-fp4-b200-vllm
+  description:
+    - "Update vLLM image from v0.17.0 to v0.20.2"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
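
The new changelog entry pairs a config key with a `pr-link`, and the link still ends in the placeholder `XXX`. As an illustration only, here is a minimal Python sketch of a check that would flag such an entry. The helper name `check_changelog_entry`, the URL pattern, and the in-memory dict layout are assumptions for this example and are not part of the repository; the sketch also assumes the YAML has already been parsed into plain dicts and lists.

```python
import re

# Hypothetical pattern: a GitHub pull request URL that ends in a numeric ID.
PR_LINK_RE = re.compile(r"^https://github\.com/[^/]+/[^/]+/pull/(\d+)$")


def check_changelog_entry(entry, known_config_keys):
    """Return a list of human-readable problems found in one changelog entry."""
    problems = []
    # Every config key named in the changelog should exist in the master config.
    for key in entry.get("config-keys", []):
        if key not in known_config_keys:
            problems.append(f"unknown config key: {key}")
    # The pr-link should point at a numbered pull request, not a placeholder.
    link = entry.get("pr-link", "")
    if not PR_LINK_RE.match(link):
        problems.append(f"pr-link is not a numbered pull request URL: {link}")
    return problems


# The entry added in this commit, written out as already-parsed data.
entry = {
    "config-keys": ["kimik2.5-fp4-b200-vllm"],
    "description": ["Update vLLM image from v0.17.0 to v0.20.2"],
    "pr-link": "https://github.com/SemiAnalysisAI/InferenceX/pull/XXX",
}
# Config keys visible in the nvidia-master.yaml hunk of this commit.
known = {"kimik2.5-int4-h200-vllm", "kimik2.5-fp4-b200-vllm"}

for problem in check_changelog_entry(entry, known):
    print(problem)
```

Run against the entry as committed, the sketch reports only the placeholder `pr-link`; the config key itself matches the key whose image was updated in `.github/configs/nvidia-master.yaml`.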
