Commit 73d649a

ichbinblau and claude committed
reduce vllm-disagg concurrency sweep to single point for faster iteration
Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>
1 parent 3569b0a commit 73d649a

1 file changed

Lines changed: 3 additions & 3 deletions

.github/configs/amd-master.yaml

@@ -1159,9 +1159,9 @@ kimik2.5-fp4-mi355x-vllm-disagg:
   - isl: 1024
     osl: 1024
     search-space:
-    # 1P2D: 1 prefill node (co-located with proxy) + 2 decode nodes = 3 nodes total
+    # 1P2D: 1 prefill node (co-located with proxy) + 2 decode nodes = 3 nodes total , 16, 32, 64, 128, 256, 512
     - spec-decoding: "none"
-      conc-list: [ 8, 16, 32, 64, 128, 256, 512 ]
+      conc-list: [ 8 ]
       prefill:
         num-worker: 1
         tp: 8
@@ -1182,7 +1182,7 @@ kimik2.5-fp4-mi355x-vllm-disagg:
     osl: 1024
     search-space:
     - spec-decoding: "none"
-      conc-list: [ 8, 16, 32, 64, 128, 256, 512 ]
+      conc-list: [ 8 ]
       prefill:
         num-worker: 1
         tp: 8
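
To make the effect of the two conc-list edits concrete, here is a rough sketch of how the number of benchmark points changes. It assumes one benchmark run per concurrency value per search-space entry, which this commit does not confirm. The names `sweep_points` and `ENTRIES_CHANGED` are hypothetical and do not come from the repository.

```python
# Hypothetical illustration only: these names are not from the repository.
OLD_CONC_LIST = [8, 16, 32, 64, 128, 256, 512]  # conc-list before this commit
NEW_CONC_LIST = [8]                             # conc-list after this commit
ENTRIES_CHANGED = 2  # the two search-space entries touched by the diff


def sweep_points(conc_list, entries):
    """Count benchmark points, assuming one run per concurrency value per entry."""
    return len(conc_list) * entries


before = sweep_points(OLD_CONC_LIST, ENTRIES_CHANGED)
after = sweep_points(NEW_CONC_LIST, ENTRIES_CHANGED)
print(f"sweep points before: {before}, after: {after}")
```

Under that assumption, the two edited entries go from 14 sweep points to 2. The removed values are still recorded in the trailing comment on the 1P2D line of the first hunk, so the full sweep can be restored from there.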
