@@ -2964,7 +2964,7 @@ dsv4-fp8-h200-sglang-mtp:
 # field, so dp-attn=true is used as the existing vLLM script switch for DP4
 # layouts on 4 allocated GPUs.
 dsv4-fp4-b300-vllm:
-  image: vllm/vllm-openai:v0.20.0-cu130
+  image: vllm/vllm-openai:v0.21.0
   model: deepseek-ai/DeepSeek-V4-Pro
   model-prefix: dsv4
   runner: b300
   description:
     - "Update vLLM image from v0.20.0-cu130 (14d old) to v0.21.0"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1457
+
+- config-keys:
+    - dsv4-fp4-b300-vllm
+  description:
+    - "Update vLLM image from v0.20.0-cu130 (18d old) to v0.21.0"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1456