Commit 97ac477

Klaud-Cold, github-actions[bot], claude-fix-bot, and functionstackx authored
Update qwen3.5-bf16-mi300x-sglang SGLang image to v0.5.12-rocm720-mi30x (#1426)
* Update qwen3.5-bf16-mi300x-sglang SGLang image to v0.5.12-rocm720-mi30x

  Ref #1154

  Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>

* fix(perf-changelog): restore from main + reappend PR entry

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>
Co-authored-by: claude-fix-bot <claude-fix-bot@local>
Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent 975194f commit 97ac477

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/amd-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -162,7 +162,7 @@ qwen3.5-bf16-mi355x-sglang-mtp:
     - { tp: 8, ep: 1, conc-start: 4, conc-end: 256, spec-decoding: mtp }
 
 qwen3.5-bf16-mi300x-sglang:
-  image: lmsysorg/sglang:v0.5.10-rocm720-mi30x
+  image: lmsysorg/sglang:v0.5.12-rocm720-mi30x
   model: Qwen/Qwen3.5-397B-A17B
   model-prefix: qwen3.5
   runner: mi300x
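
The hunk above changes only the SGLang version in the image tag; the repository and the `rocm720-mi30x` platform suffix are unchanged. A minimal Python sketch of that check follows. It assumes tags have the `<repo>:v<major>.<minor>.<patch>-<suffix>` shape seen in this diff, and the helper name `parse_image_tag` is hypothetical, not something defined in the repository.

```python
import re

# Assumed tag shape, inferred only from the two tags in this diff:
#   <repo>:v<major>.<minor>.<patch>-<suffix>
TAG_RE = re.compile(r"^(?P<repo>[^:]+):v(?P<ver>\d+\.\d+\.\d+)-(?P<suffix>.+)$")


def parse_image_tag(image):
    """Split an image reference into (repo, version tuple, suffix)."""
    m = TAG_RE.match(image)
    if m is None:
        raise ValueError(f"unrecognized image tag: {image!r}")
    version = tuple(int(part) for part in m.group("ver").split("."))
    return m.group("repo"), version, m.group("suffix")


old = parse_image_tag("lmsysorg/sglang:v0.5.10-rocm720-mi30x")
new = parse_image_tag("lmsysorg/sglang:v0.5.12-rocm720-mi30x")

# Same repository and platform suffix; only the version moves forward.
assert old[0] == new[0] and old[2] == new[2]
assert new[1] > old[1]
```

Comparing versions as integer tuples rather than strings keeps the ordering correct when a component gains a digit, for example 0.5.9 versus 0.5.10.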

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -2711,3 +2711,9 @@
     - "Update vLLM image from v0.20.2 to v0.21.0"
     - "Add VLLM_MEMORY_PROFILER_ESTIMATE_CUDAGRAPHS=0 to disable aggressive CUDA-graph memory profiler that OOMs the KV cache"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1395
+
+- config-keys:
+    - qwen3.5-bf16-mi300x-sglang
+  description:
+    - "Update SGLang image from v0.5.10-rocm720-mi30x to v0.5.12-rocm720-mi30x"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1426
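
Each changelog entry in this hunk carries three fields: `config-keys`, `description`, and `pr-link`. The sketch below checks that shape on an entry that has already been loaded into a Python dict. The set of required keys is an assumption inferred from the entries visible in this diff, and `check_entry` is a hypothetical helper; the repository may enforce different rules.

```python
# Assumption: required fields inferred from the entries shown in this diff.
REQUIRED_KEYS = {"config-keys", "description", "pr-link"}


def check_entry(entry):
    """Return a list of problems found in one changelog entry (empty if none)."""
    problems = []
    missing = REQUIRED_KEYS - entry.keys()
    if missing:
        problems.append(f"missing keys: {sorted(missing)}")
    # Both list-valued fields should be present as non-empty lists.
    for key in ("config-keys", "description"):
        if key in entry and not (isinstance(entry[key], list) and entry[key]):
            problems.append(f"{key} must be a non-empty list")
    if "pr-link" in entry and not str(entry["pr-link"]).startswith("https://"):
        problems.append("pr-link must be an https URL")
    return problems


# The entry added by this commit, written out as the dict a YAML loader would produce.
entry = {
    "config-keys": ["qwen3.5-bf16-mi300x-sglang"],
    "description": [
        "Update SGLang image from v0.5.10-rocm720-mi30x to v0.5.12-rocm720-mi30x"
    ],
    "pr-link": "https://github.com/SemiAnalysisAI/InferenceX/pull/1426",
}

assert check_entry(entry) == []
```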
