Skip to content
27 changes: 23 additions & 4 deletions .github/configs/amd-master.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -1862,7 +1862,7 @@
- "DECODE_MTP_SIZE=0"

dsr1-fp4-mi355x-sglang-disagg-mtp:
image: lmsysorg/sglang-rocm:v0.5.12-rocm720-mi35x-20260519
image: rocm/sgl-dev:v0.5.12.post1-rocm720-mi35x-20260526-sync
model: amd/DeepSeek-R1-0528-MXFP4-v2
model-prefix: dsr1
runner: mi355x-disagg
Expand Down Expand Up @@ -2030,7 +2030,7 @@
dp-attn: false
additional-settings:
- "DECODE_NODES=2"
- "DECODE_MTP_SIZE=2"
- "DECODE_MTP_SIZE=3"

# 1*DEP8 + 1*DEP8
- spec-decoding: "mtp"
Expand All @@ -2049,7 +2049,7 @@
dp-attn: true
additional-settings:
- "DECODE_NODES=1"
- "DECODE_MTP_SIZE=1"
- "DECODE_MTP_SIZE=3"

# 1*DEP8 + 1*DEP8
- spec-decoding: "mtp"
Expand All @@ -2068,7 +2068,26 @@
dp-attn: true
additional-settings:
- "DECODE_NODES=1"
- "DECODE_MTP_SIZE=1"
- "DECODE_MTP_SIZE=3"

# 1*DEP8 + 1*DEP8
- spec-decoding: "mtp"
conc-list: [ 64, 128 ]
prefill:
num-worker: 1
tp: 8
ep: 8
dp-attn: true
additional-settings:
- "PREFILL_NODES=1"
decode:
num-worker: 1
tp: 8
ep: 8
dp-attn: true
additional-settings:
- "DECODE_NODES=1"
- "DECODE_MTP_SIZE=3"

Check failure on line 2090 in .github/configs/amd-master.yaml

View check run for this annotation

Claude / Claude Code Review

Duplicate sweep points in new 1*DEP8+1*DEP8 ISL=8192 block

This PR introduces duplicate benchmark sweep points: after bumping the two pre-existing `1*DEP8 + 1*DEP8` blocks in the ISL=8192 search-space from `DECODE_MTP_SIZE=1` to `DECODE_MTP_SIZE=3` (lines 2052, 2071), and then adding a new block with conc-list `[64, 128]` and the same MTP=3 (lines 2073-2090), all three blocks now share byte-identical topology. `conc=64` and `conc=128` will each be benchmarked twice on the expensive `mi355x-disagg` multinode runner. Fix by either dropping 64 and 128 from
Comment thread
Oseltamivir marked this conversation as resolved.

# 2*DEP8 + 1*DEP8
- spec-decoding: "mtp"
Expand Down