Commit 6d10eaf
b200/b300 vllm-agentic: no-offload curves vs new cc-traces 051826
Replaces the cpu-offload-only search-space on both single-node configs
with no-offload curves at the user-requested conc points, against the
freshly-bumped cc-traces-weka-no-subagents-051826 dataset (98 traces,
v5-only + CC ≥ 2.1.139).
B300 (15 shards):
- TP=8 offload=none conc=[1,2,4]
- TP=4 offload=none conc=[1,2,4,8,10,12,16]
- DEP=4 (tp4 ep4 dp-attn) offload=none conc=[16,24,32,40,48]
B200 (14 shards):
- TP=8 offload=none conc=[1,2,4,8,12,16]
- DEP=8 (tp8 ep8 dp-attn) offload=none conc=[12,16,24,32,48,64,96,128]
Dispatched as two separate workflow runs per
[[feedback_separate_b200_b300_runs]] (cascade-cancel hazard if bundled).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>1 parent 21f71b6 commit 6d10eaf
1 file changed
Lines changed: 13 additions & 11 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1773 | 1773 | | |
1774 | 1774 | | |
1775 | 1775 | | |
1776 | | - | |
1777 | | - | |
1778 | | - | |
1779 | | - | |
1780 | | - | |
| 1776 | + | |
| 1777 | + | |
| 1778 | + | |
| 1779 | + | |
| 1780 | + | |
| 1781 | + | |
1781 | 1782 | | |
1782 | 1783 | | |
1783 | 1784 | | |
| |||
3007 | 3008 | | |
3008 | 3009 | | |
3009 | 3010 | | |
3010 | | - | |
3011 | | - | |
3012 | | - | |
3013 | | - | |
3014 | | - | |
3015 | | - | |
| 3011 | + | |
| 3012 | + | |
| 3013 | + | |
| 3014 | + | |
| 3015 | + | |
| 3016 | + | |
| 3017 | + | |
3016 | 3018 | | |
3017 | 3019 | | |
3018 | 3020 | | |
| |||
0 commit comments