Skip to content

Commit 16d3a92

Browse files
[Klaud Cold] Update gptoss-fp4-b200-vllm vLLM image to v0.21.0 (#1466)
* Update gptoss-fp4-b200-vllm vLLM image to v0.21.0 Bumps the gpt-oss-120b FP4 B200 vLLM recipe from v0.20.2 (1d stale) to the latest v0.21.0. Touches only gptoss-fp4-b200-vllm — the agentic sibling stays pinned to v0.19.1 on its own track. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: fill pr-link for #1466 --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 1492b0e commit 16d3a92

2 files changed

Lines changed: 7 additions & 1 deletion

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4225,7 +4225,7 @@ gptoss-fp4-b200-trt:
42254225
- { tp: 8, conc-start: 4, conc-end: 4}
42264226

42274227
gptoss-fp4-b200-vllm:
4228-
image: vllm/vllm-openai:v0.20.2
4228+
image: vllm/vllm-openai:v0.21.0
42294229
model: openai/gpt-oss-120b
42304230
model-prefix: gptoss
42314231
runner: b200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2790,3 +2790,9 @@
27902790
description:
27912791
- "Update SGLang image from nightly-dev-20260422-de962f32 (18d/12d old) to v0.5.12-cu130"
27922792
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1473
2793+
2794+
- config-keys:
2795+
- gptoss-fp4-b200-vllm
2796+
description:
2797+
- "Update vLLM image from v0.20.2 (1d old) to v0.21.0"
2798+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1466

0 commit comments

Comments
 (0)