@@ -2685,7 +2685,7 @@ kimik2.5-fp4-b200-vllm-agentic:
 # Kimi-K2.5 FP4 B200 vLLM recipe as-is until B300-specific tuning is available.
 
 kimik2.5-fp4-b300-vllm:
-  image: vllm/vllm-openai:v0.19.0-cu130
+  image: vllm/vllm-openai:v0.21.0
   model: nvidia/Kimi-K2.5-NVFP4
   model-prefix: kimik2.5
   runner: b300
   description:
     - "Add MTP/EAGLE speculative-decoding sibling of qwen3.5-bf16-mi325x-sglang"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1483
+
+- config-keys:
+    - kimik2.5-fp4-b300-vllm
+  description:
+    - "Update vLLM image from v0.19.0-cu130 (27d old) to v0.21.0"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1452