Skip to content

Commit 8a20938

Browse files
Klaud-Coldgithub-actions[bot]claude-fix-botclaude-rebase-botfunctionstackx
authored
Update dsr1-fp4-mi355x-sglang SGLang image to v0.5.12-rocm700-mi35x (#1431)
* Update dsr1-fp4-mi355x-sglang SGLang image to v0.5.12-rocm700-mi35x Ref #1154 Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com> * amd-master: comment out agentic-coding sweep for dsr1-fp4-mi355x-sglang The agentic-coding matrix for dsr1-fp4-mi355x-sglang fans out into 10 matrix jobs (conc-list spans 1..256), which roughly doubles this image bump PR's sweep cost without adding signal — the bump only needs the fixed-seq-len throughput shape to validate. Re-enable after the PR merges; the next agentic cron PR will pick it up. --------- Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com> Co-authored-by: claude-fix-bot <claude-fix-bot@local> Co-authored-by: claude-rebase-bot <claude-rebase-bot@local> Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent 9acd530 commit 8a20938

2 files changed

Lines changed: 15 additions & 5 deletions

File tree

.github/configs/amd-master.yaml

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
dsr1-fp4-mi355x-sglang:
2-
image: lmsysorg/sglang:v0.5.9-rocm700-mi35x
2+
image: lmsysorg/sglang:v0.5.12-rocm700-mi35x
33
model: amd/DeepSeek-R1-0528-MXFP4-Preview
44
model-prefix: dsr1
55
runner: mi355x
@@ -17,10 +17,14 @@ dsr1-fp4-mi355x-sglang:
1717
osl: 1024
1818
search-space:
1919
- { tp: 8, conc-start: 4, conc-end: 64 }
20-
agentic-coding:
21-
- duration: 1800
22-
search-space:
23-
- { tp: 8, offloading: none, conc-list: [1, 2, 4, 8, 12, 16, 32, 64, 128, 256] }
20+
# Agentic-coding sweep commented out for this image-bump PR — the
21+
# 10-conc agentic matrix amplifies sweep cost and the bump validation
22+
# only needs the fixed-seq-len throughput shape. Re-enable once the
23+
# bump merges; the next agentic cron PR will pick it up.
24+
# agentic-coding:
25+
# - duration: 1800
26+
# search-space:
27+
# - { tp: 8, offloading: none, conc-list: [1, 2, 4, 8, 12, 16, 32, 64, 128, 256] }
2428

2529
dsr1-fp4-mi355x-atom:
2630
image: rocm/atom:rocm7.1.1-ubuntu24.04-pytorch2.9-atom0.1.1-MI350x

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2901,3 +2901,9 @@
29012901
description:
29022902
- "Add MI355X config: qwen3.5-fp4-sglang-mtp using lmsysorg/sglang-rocm:v0.5.12-rocm720-mi35x-20260517"
29032903
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1445
2904+
2905+
- config-keys:
2906+
- dsr1-fp4-mi355x-sglang
2907+
description:
2908+
- "Update SGLang image to v0.5.12-rocm700-mi35x"
2909+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1431

0 commit comments

Comments
 (0)