Skip to content

Commit 7afb3e9

Browse files
$Update gptoss-fp4-b200-vllm vLLM image to v0.20.2\n\nRef #1154\n\nCo-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>
1 parent ed5867f commit 7afb3e9

2 files changed

Lines changed: 7 additions & 1 deletion

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3969,7 +3969,7 @@ gptoss-fp4-b200-trt:
39693969
- { tp: 8, conc-start: 4, conc-end: 4}
39703970

39713971
gptoss-fp4-b200-vllm:
3972-
image: vllm/vllm-openai:v0.15.1
3972+
image: vllm/vllm-openai:v0.20.2
39733973
model: openai/gpt-oss-120b
39743974
model-prefix: gptoss
39753975
runner: b200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2343,3 +2343,9 @@
23432343
description:
23442344
- "Add Qwen3.5-397B-A17B FP8 MI355X ATOM benchmark configs with and without MTP"
23452345
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310
2346+
2347+
- config-keys:
2348+
- gptoss-fp4-b200-vllm
2349+
description:
2350+
- "Update vLLM image from v0.15.1 to v0.20.2"
2351+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX

0 commit comments

Comments
 (0)