Commit 2813d07

github-actions[bot] authored and claude-fix-bot committed

Update gptoss-fp4-b200-vllm vLLM image to v0.20.2

Ref #1154

Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>
1 parent 4151de7 commit 2813d07

2 files changed

Lines changed: 9 additions & 3 deletions

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -3971,7 +3971,7 @@ gptoss-fp4-b200-trt:
     - { tp: 8, conc-start: 4, conc-end: 4}

 gptoss-fp4-b200-vllm:
-  image: vllm/vllm-openai:v0.15.1
+  image: vllm/vllm-openai:v0.20.2
   model: openai/gpt-oss-120b
   model-prefix: gptoss
   runner: b200

perf-changelog.yaml

Lines changed: 8 additions & 2 deletions
@@ -2465,7 +2465,7 @@
     - dsv4-fp4-mi355x-atom
   description:
     - "Add DeepSeek-V4-Pro FP4 MI355X ATOM benchmark config; bump image to rocm/atom-dev:nightly_202605101539, expand concurrency range (conc 4–1024), and simplify runtime script"
-  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1311
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1311

 - config-keys:
     - glm5-fp8-mi355x-sglang
@@ -2486,7 +2486,7 @@
   description:
     - "Update SGLang image from v0.5.9-cu130 to v0.5.11-cu130"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1322
-
+

 - config-keys:
     - dsv4-fp4-b300-vllm-mtp
@@ -2574,3 +2574,9 @@
   description:
     - "Update SGLang image from v0.5.10.post1-cu130 to v0.5.12-cu130"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1417
+
+- config-keys:
+    - gptoss-fp4-b200-vllm
+  description:
+    - "Update vLLM image from v0.15.1 to v0.20.2"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1394
