Commit acf7052
committed
Upgrade llama.cpp from b9621 to b9637
No breaking API changes: none of the project's include surface (common.h,
chat.h, speculative.h, mtmd.h, llama-cpp.h, arg.h, llama.h, download.h) is
touched. The upgrade is purely additive.
New capabilities gained automatically (no project code needed):
- Cohere2 MoE ("North Code") model arch (MoE + MTP/NextN) with a dedicated
chat parser, auto-detected via the existing specialized-template path.
- Jinja chat-template engine fixes (count/d/e filter aliases, negative-step
slicing, empty-separator split guard, empty-old_str replace).
Vulkan unary-shader consolidation + EXPM1, WebUI gzip serving, CLI/Docker/CI/
Python-converter changes are all in TUs the project does not compile or ship.
Verified: CMake configures cleanly against b9637 (ggml 0.15.1, CPU backend).
docs/history/llama-cpp-breaking-changes.md gains the b9621-b9637 rows.
https://claude.ai/code/session_01EQJCrQGmxCBf8WTCDuFE3X1 parent 69a7ab0 commit acf7052
4 files changed
Lines changed: 9 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
9 | | - | |
| 9 | + | |
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
139 | 139 | | |
140 | 140 | | |
141 | 141 | | |
142 | | - | |
| 142 | + | |
143 | 143 | | |
144 | 144 | | |
145 | 145 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
4 | | - | |
| 4 | + | |
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
346 | 346 | | |
347 | 347 | | |
348 | 348 | | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
0 commit comments