-
Notifications
You must be signed in to change notification settings - Fork 204
Update gptoss-fp4-b200-vllm vLLM image to v0.21.0 #1394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+7
−1
Merged
Changes from all commits
Commits
Show all changes
4 commits
Select commit
Hold shift + click to select a range
2813d07
$Update gptoss-fp4-b200-vllm vLLM image to v0.20.2\n\nRef #1154\n\nCo…
github-actions[bot] 1c05fb0
Merge remote-tracking branch 'origin/main' into HEAD
5095498
fix(perf-changelog): restore trailing whitespace dropped by prior rebase
0413184
Merge branch 'main' into claude/issue-1154-gptoss-fp4-b200-vllm
functionstackx File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🟡 The PR title and description say the vLLM image is being updated to v0.21.0, but the actual YAML change in
.github/configs/nvidia-master.yaml(line 3972) bumps it tovllm/vllm-openai:v0.20.2, and theperf-changelog.yamlentry also says v0.20.2. Please reconcile by either bumping the image tag to v0.21.0 (matching the title/description) or updating the PR title/description/Ref to v0.20.2.Extended reasoning...
What the bug is
The PR title is "Update gptoss-fp4-b200-vllm vLLM image to v0.21.0" and the description states it "Updates the vLLM image tag for
gptoss-fp4-b200-vllmfrom v0.15.1 to v0.21.0". However, the actual diff updates the image to a different version:.github/configs/nvidia-master.yamlline 3972:image: vllm/vllm-openai:v0.20.2perf-changelog.yamlline 2460:"Update vLLM image from v0.15.1 to v0.20.2"The two code-side artifacts are internally consistent at v0.20.2, but they disagree with the PR metadata (title + description), which both claim v0.21.0.
Step-by-step proof
Update gptoss-fp4-b200-vllm vLLM image to v0.21.0→ claims target = v0.21.0.Updates the vLLM image tag for gptoss-fp4-b200-vllm from v0.15.1 to v0.21.0→ claims target = v0.21.0..github/configs/nvidia-master.yaml:perf-changelog.yaml:vllm/vllm-openai:v0.20.2, notv0.21.0as the title claims.Impact
The code side is internally consistent (YAML + changelog both v0.20.2), so runtime behavior is well-defined — this is a metadata/communication defect, not a runtime defect. However, this PR carries the
full-sweep-enabledlabel, which triggers a full performance-benchmark sweep for this config. Reviewers and downstream consumers reading the title or changelog/release notes will be misled about which vLLM version was actually benchmarked, and the perf delta will be attributed to the wrong version. It also makes the linked tracking issue (Ref #1154) ambiguous about which version was actually shipped.How to fix
The author needs to decide which side is canonical and align the other:
.github/configs/nvidia-master.yaml:3972toimage: vllm/vllm-openai:v0.21.0and update theperf-changelog.yamldescription string to"Update vLLM image from v0.15.1 to v0.21.0".Update gptoss-fp4-b200-vllm vLLM image to v0.20.2and amend the description accordingly. The code does not need to change in this case.Severity rationale
Filing as nit because the YAML and changelog are internally consistent — the deployment will not be broken, only the human-facing metadata is wrong. But it should be reconciled before merge so the benchmark results are correctly attributed.