Skip to content
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .github/configs/amd-master.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -1134,7 +1134,7 @@ gptoss-fp4-mi325x-vllm:

gptoss-fp4-mi355x-vllm:
image: vllm/vllm-openai-rocm:v0.22.0
model: amd/gpt-oss-120b-w-mxfp4-a-fp8
model: openai/gpt-oss-120b
model-prefix: gptoss
runner: mi355x
precision: fp4
Expand Down
6 changes: 6 additions & 0 deletions perf-changelog.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3474,3 +3474,9 @@
- "Use scheduler-recv-interval values 2/60/30/1200/600/1920 for conc 1-4/8/16/32/64/128-256"
- "Set max-running-requests=256, chunked-prefill-size=16384, mem-fraction-static=0.8, cuda-graph-max-bs=CONC, and enable symm-mem"
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1544

- config-keys:
- gptoss-fp4-mi355x-vllm
description:
- "Update GPT-OSS model for MI355X vLLM from amd/gpt-oss-120b-w-mxfp4-a-fp8 to openai/gpt-oss-120b"
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1638

Check warning on line 3482 in perf-changelog.yaml

View check run for this annotation

Claude / Claude Code Review

perf-changelog pr-link points to wrong PR (#1638 instead of #1670)

The new perf-changelog entry added by this PR sets `pr-link` to `https://github.com/SemiAnalysisAI/InferenceX/pull/1638`, but this PR is #1670 — readers tracing the gpt-oss model swap via the changelog will land on an unrelated PR. Update the link to `/pull/1670` before merge.
Comment thread
chunfangamd marked this conversation as resolved.
Outdated