Commit 3fa6c61

Add perf-changelog entry placeholder
Record the GPT-OSS MI355X vLLM model update (amd/gpt-oss-120b-w-mxfp4-a-fp8 -> openai/gpt-oss-120b).
1 parent 0ab344e commit 3fa6c61

1 file changed

Lines changed: 6 additions & 0 deletions

perf-changelog.yaml

@@ -3474,3 +3474,9 @@
     - "Use scheduler-recv-interval values 2/60/30/1200/600/1920 for conc 1-4/8/16/32/64/128-256"
     - "Set max-running-requests=256, chunked-prefill-size=16384, mem-fraction-static=0.8, cuda-graph-max-bs=CONC, and enable symm-mem"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1544
+
+- config-keys:
+    - gptoss-fp4-mi355x-vllm
+  description:
+    - "Update GPT-OSS model for MI355X vLLM from amd/gpt-oss-120b-w-mxfp4-a-fp8 to openai/gpt-oss-120b"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1638
