Skip to content

Commit a6821dc

Browse files
cquil11claude
andcommitted
test: DSv4-Pro B300 vLLM - widen conc to [1,2,4,8,16,32,48,64,70]
Full power-of-2 sweep up to 32, then 48, 64, 70 to walk through the likely cliff region with finer steps. Same 4 parallelism configs, still no offload, v0.20.2 image. 36 jobs total. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
1 parent 2bd132c commit a6821dc

1 file changed

Lines changed: 4 additions & 4 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2844,10 +2844,10 @@ dsv4-fp4-b300-vllm:
28442844
agentic-coding:
28452845
- duration: 1800
28462846
search-space:
2847-
- { tp: 4, offloading: none, conc-list: [16, 32, 64, 70, 76] }
2848-
- { tp: 8, offloading: none, conc-list: [16, 32, 64, 70, 76] }
2849-
- { tp: 4, ep: 4, dp-attn: true, offloading: none, conc-list: [16, 32, 64, 70, 76] }
2850-
- { tp: 8, ep: 8, dp-attn: true, offloading: none, conc-list: [16, 32, 64, 70, 76] }
2847+
- { tp: 4, offloading: none, conc-list: [1, 2, 4, 8, 16, 32, 48, 64, 70] }
2848+
- { tp: 8, offloading: none, conc-list: [1, 2, 4, 8, 16, 32, 48, 64, 70] }
2849+
- { tp: 4, ep: 4, dp-attn: true, offloading: none, conc-list: [1, 2, 4, 8, 16, 32, 48, 64, 70] }
2850+
- { tp: 8, ep: 8, dp-attn: true, offloading: none, conc-list: [1, 2, 4, 8, 16, 32, 48, 64, 70] }
28512851

28522852
dsv4-fp4-b300-trt:
28532853
image: ghcr.io#semianalysisai/trtllm-deepseek-v4:feat-deepseek_v4-9aa3715

0 commit comments

Comments
 (0)