Commit 4f63034
[Klaud Cold] Update qwen3.5-fp4-b300-sglang (+mtp) SGLang image to v0.5.12-cu130 (#1475)
* Update qwen3.5-fp4-b300-sglang (+mtp) SGLang image to v0.5.12-cu130
Update SGLang image from v0.5.11-cu130 (5d old) to v0.5.12-cu130
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
* fix(qwen3.5_fp4_b300): use --mm-attention-backend triton_attn
Same workaround as #1422 (bf16) and #1451 (fp8) — bypass the broken
flash-attn cute kernel sm_103 assertion in the Qwen-3.5-VL vision
encoder by switching only the multi-modal attention path to triton_attn.
Text decoder still uses --attention-backend trtllm_mha.
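A minimal sketch of how the two backend flags might sit side by side in a launch command. The flag names and values come from the message above; the `build_sglang_cmd` helper and the model path are hypothetical, and the actual benchmark scripts are not shown in this commit view.

```python
import shlex


def build_sglang_cmd(model_path: str) -> list[str]:
    """Assemble an SGLang launch argv that uses separate attention backends.

    Hypothetical helper for illustration; it only builds the argument list
    and never starts a server.
    """
    return [
        "python3", "-m", "sglang.launch_server",
        "--model-path", model_path,               # placeholder path (assumption)
        "--attention-backend", "trtllm_mha",      # text decoder path, unchanged
        "--mm-attention-backend", "triton_attn",  # vision encoder path (the workaround)
    ]


cmd = build_sglang_cmd("/models/qwen3.5-fp4")  # hypothetical model location
print(shlex.join(cmd))
```

Only the multi-modal flag differs from the previous configuration, so the text decoder keeps its existing backend.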
See sgl-project/sglang#25564 (root cause: cutedsl Arch enum aliasing on
non-cu13 path collapses sm_100..sm_110f range to exclude sm_103) and
Dao-AILab/flash-attention#2572 for the upstream fix in flight.
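As a toy illustration of how enum aliasing can collapse a range check, the sketch below uses Python's `IntEnum`. It is not the cutedsl code: the `Arch` class, its numeric values, and `in_supported_range` are invented for this example, and the real mechanism is described in the linked issue.

```python
from enum import IntEnum


class Arch(IntEnum):
    # Invented values, for illustration only.
    sm_100 = 100
    sm_103 = 103
    sm_110f = 100  # duplicate value: Python turns this name into an alias of sm_100


def in_supported_range(arch: Arch) -> bool:
    """Range check intended to cover sm_100 through sm_110f."""
    return Arch.sm_100 <= arch <= Arch.sm_110f


# The upper bound is the same object as the lower bound,
# so the range shrinks to a single value.
print(Arch.sm_110f is Arch.sm_100)      # True
print(in_supported_range(Arch.sm_100))  # True
print(in_supported_range(Arch.sm_103))  # False
```

In this toy model, any architecture strictly between the two aliased bounds fails the check, which mirrors the symptom described above for sm_103.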
* Re-trigger sweep (previous Run Sweep run stuck pending with 0 jobs)
---------
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Co-authored-by: claude-fix-bot <claude-fix-bot@local>
1 parent 8d76685 commit 4f63034
4 files changed
Lines changed: 11 additions & 4 deletions
File tree
- .github/configs
- benchmarks/single_node
Diff content was not captured; only the hunk positions survive:

| File (in diff order) | Original file line number | Diff line number | Diff line change |
|---|---|---|---|
| 1 | 2457, 2478 | 2457, 2478 | 1 line replaced at each position |
| 2 | 76 | 76 | 1 line replaced |
| 3 | 76 | 76 | 1 line replaced |
| 4 | (none) | 3039-3045 | 7 lines added |