Commit 14a1bb3
Point DSv4 B200/B300 TRT (non-MTP) at the SWA-scratch-fix image
Bump dsv4-fp4-b200-trt and dsv4-fp4-b300-trt to
ghcr.io#semianalysisai/trtllm-deepseek-v4:fix-dsv4-swa-scratch-revert-shrink-c914d6d
(TRT-LLM feat/deepseek_v4 @ 084cf2ba + kv_cache_manager_v2 fix). This resolves
the engine crash on attention-DP context/generation reverts at high concurrency
(the b300 8k1k conc>=512 "LLM is shutting down" hang). The -mtp variants stay on
feat-deepseek_v4-9aa3715.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

1 parent 1b0afeb commit 14a1bb3
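
The image reference in the commit message separates the registry from the repository with `#` rather than `/`. A minimal sketch of splitting such a reference into its parts follows, assuming the `#` follows the enroot-style `[REGISTRY#]IMAGE[:TAG]` convention. The `ImageRef` type and `parse_image_ref` helper are hypothetical names for illustration and are not part of this repository.

```python
from typing import NamedTuple, Optional


class ImageRef(NamedTuple):
    """Parts of a container image reference (hypothetical helper type)."""
    registry: Optional[str]
    repository: str
    tag: Optional[str]


def parse_image_ref(ref: str) -> ImageRef:
    """Split '[REGISTRY#]IMAGE[:TAG]' into registry, repository, and tag.

    Assumption: '#' separates the registry from the image path, so the
    remainder never contains a host:port colon and the last ':' (if any)
    introduces the tag.
    """
    registry, sep, rest = ref.partition("#")
    if not sep:
        # No '#': the whole string is the image path, registry unspecified.
        registry, rest = None, ref
    repository, sep, tag = rest.rpartition(":")
    if not sep:
        # No ':': the image path has no explicit tag.
        repository, tag = rest, None
    return ImageRef(registry, repository, tag)


image = parse_image_ref(
    "ghcr.io#semianalysisai/trtllm-deepseek-v4:"
    "fix-dsv4-swa-scratch-revert-shrink-c914d6d"
)
print(image.registry)    # ghcr.io
print(image.repository)  # semianalysisai/trtllm-deepseek-v4
print(image.tag)         # fix-dsv4-swa-scratch-revert-shrink-c914d6d
```

Splitting the reference this way makes it easy to check that two config entries, such as the B200 and B300 ones named above, point at the same repository and tag.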
2 files changed
Lines changed: 3 additions & 3 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 1801 | 1801 | |
| 1802 | 1802 | |
| 1803 | 1803 | |
| 1804 | | - |
| | 1804 | + |
| 1805 | 1805 | |
| 1806 | 1806 | |
| 1807 | 1807 | |
| ... | ... | |
| 3049 | 3049 | |
| 3050 | 3050 | |
| 3051 | 3051 | |
| 3052 | | - |
| | 3052 | + |
| 3053 | 3053 | |
| 3054 | 3054 | |
| 3055 | 3055 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 3372 | 3372 | |
| 3373 | 3373 | |
| 3374 | 3374 | |
| 3375 | | - |
| | 3375 | + |
| 3376 | 3376 | |
| 3377 | 3377 | |
| 3378 | 3378 | |