Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .github/configs/nvidia-master.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2500,7 +2500,7 @@
- { tp: 8, conc-start: 4, conc-end: 64 }

kimik2.5-fp4-b200-vllm:
image: vllm/vllm-openai:v0.17.0
image: vllm/vllm-openai:v0.20.2

Check warning on line 2503 in .github/configs/nvidia-master.yaml

View check run for this annotation

Claude / Claude Code Review

PR title/description claim v0.21.0 but diff bumps to v0.20.2

The PR title and description state the vLLM image is being bumped to v0.21.0, but the actual diff updates the image to v0.20.2 in both nvidia-master.yaml (line 2503) and the perf-changelog.yaml entry. This is a metadata-only mismatch — please reconcile before merge by either updating the title/description to say v0.20.2, or bumping the YAML/changelog to v0.21.0 if that was the intended target.
model: nvidia/Kimi-K2.5-NVFP4
model-prefix: kimik2.5
runner: b200
Expand Down
6 changes: 6 additions & 0 deletions perf-changelog.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2329,6 +2329,12 @@
- "Add Qwen3.5-397B-A17B FP8 MI355X ATOM benchmark configs with and without MTP"
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310

- config-keys:
- kimik2.5-fp4-b200-vllm
description:

Check warning on line 2334 in perf-changelog.yaml

View check run for this annotation

Claude / Claude Code Review

perf-changelog pr-link points to wrong PR (#1336 instead of #1395)

The new perf-changelog entry for `kimik2.5-fp4-b200-vllm` sets `pr-link` to https://github.com/SemiAnalysisAI/InferenceX/pull/1336, but this is PR #1395 — readers following the link will land on an unrelated PR. Please update the link to https://github.com/SemiAnalysisAI/InferenceX/pull/1395 to match the convention used by surrounding entries.
- "Update vLLM image from v0.17.0 to v0.20.2"
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1336

- config-keys:
- gptoss-fp4-h100-vllm
description:
Expand Down
Loading