Commit 29a0b95

Update gptoss-fp4-h200-vllm vLLM image to v0.20.2

Ref #1154
1 parent 0552ead commit 29a0b95

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -4306,7 +4306,7 @@ gptoss-fp4-h200-trt:
     - { tp: 8, ep: 8, dp-attn: false, conc-start: 4, conc-end: 8 }
 
 gptoss-fp4-h200-vllm:
-  image: vllm/vllm-openai:v0.18.0
+  image: vllm/vllm-openai:v0.20.2
   model: openai/gpt-oss-120b
   model-prefix: gptoss
   runner: h200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -2329,6 +2329,12 @@
     - "Add Qwen3.5-397B-A17B FP8 MI355X ATOM benchmark configs with and without MTP"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310
 
+- config-keys:
+    - gptoss-fp4-h200-vllm
+  description:
+    - "Update vLLM image from v0.18.0 to v0.20.2"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1335
+
 - config-keys:
     - gptoss-fp4-h100-vllm
   description:
