Commit 1c85bdc
feat(loader): warn loudly when arch=deepseek2 (MLA not yet supported)
Phase 1 of the MLA work confirmed DeepSeek V2/V3 / Coder-V2 models
load through our generic GGUF reader without complaint, but the
forward pass produces multilingual garbage because attn_kv_a_mqa /
attn_kv_b are treated as standard wk / wv. Add a one-time loud
warning at load time so users do not mistake the garbage output for
a quantization artifact. Points at the Phase 2 entry plan in
docs/research/mla_support_plan.md.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent 2ddd289 commit 1c85bdc
1 file changed
Lines changed: 17 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3975 | 3975 | | |
3976 | 3976 | | |
3977 | 3977 | | |
| 3978 | + | |
| 3979 | + | |
| 3980 | + | |
| 3981 | + | |
| 3982 | + | |
| 3983 | + | |
| 3984 | + | |
| 3985 | + | |
| 3986 | + | |
| 3987 | + | |
| 3988 | + | |
| 3989 | + | |
| 3990 | + | |
| 3991 | + | |
| 3992 | + | |
| 3993 | + | |
| 3994 | + | |
3978 | 3995 | | |
3979 | 3996 | | |
3980 | 3997 | | |
| |||
0 commit comments