Commit 5a098a3

jesus-talavera-ibm authored and kashif committed
docs: Update documentation with Granite 4.0/4.1 (ggml-org#23404)
1 parent 977fea5 commit 5a098a3

2 files changed

Lines changed: 2 additions & 0 deletions

docs/autoparser.md

Lines changed: 1 addition & 0 deletions
@@ -489,6 +489,7 @@ The following templates have active tests in `tests/test-chat.cpp`:
 | Qwen-QwQ-32B | Reasoning | Forced-open thinking |
 | NousResearch Hermes 2 Pro | JSON_NATIVE | `<tool_call>` wrapper |
 | IBM Granite 3.3 | JSON_NATIVE | `<think></think>` + `<response></response>` |
+| IBM Granite 4.0 | JSON_NATIVE | `<tool_call>` wrapper (same template used by 4.1) |
 | ByteDance Seed-OSS | TAG_WITH_TAGGED | Custom `<seed:think>` and `<seed:tool_call>` tags |
 | Qwen3-Coder | TAG_WITH_TAGGED | XML-style tool format |
 | DeepSeek V3.1 | JSON_NATIVE | Forced thinking mode |

docs/function-calling.md

Lines changed: 1 addition & 0 deletions
@@ -291,6 +291,7 @@ Here are some models known to work (w/ chat template override when needed):
 llama-server --jinja -fa -hf bartowski/Qwen2.5-7B-Instruct-GGUF:Q4_K_M
 llama-server --jinja -fa -hf bartowski/Mistral-Nemo-Instruct-2407-GGUF:Q6_K_L
 llama-server --jinja -fa -hf bartowski/Llama-3.3-70B-Instruct-GGUF:Q4_K_M
+llama-server --jinja -fa -hf ibm-granite/granite-4.1-3b-GGUF:Q4_K_M

 # Native support for DeepSeek R1 works best w/ our template override (official template is buggy, although we do work around it)
