Commit d4e9e43
fix(blog): reframe NVL72 bullet — disagg still applies, wide EP doesn't
Wider expert parallelism doesn't compound on a 10B-active / 256-small-expert
model the way it does on DeepSeek R1 or Kimi K2.5, but disaggregated
prefill + decode on NVL72 is still a valid next lever for MiniMax-M2.5 (KV
between pools over NVLink 5, decode pool absorbs more concurrency past the
single-node saturation knee). Drops the speculative FP4 KV cache and
"see MTP bullet" trailers.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent 66a1670 commit d4e9e43
1 file changed
Lines changed: 1 addition & 1 deletion
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
188 | 188 | | |
189 | 189 | | |
190 | 190 | | |
191 | | - | |
| 191 | + | |
192 | 192 | | |
193 | 193 | | |
194 | 194 | | |
| |||
0 commit comments