Download the [MiniCPM-V-4_6](https://huggingface.co/openbmb/MiniCPM-V-4_6) PyTorch model from Hugging Face into a folder named `MiniCPM-V-4_6`.

The checkpoint must be in the standard `transformers` format (v5.7.0 or later), loadable without `trust_remote_code`. Its `config.json` declares the architecture `MiniCPMV4_6ForConditionalGeneration`, with a `qwen3_5_text` text model, a SigLIP-based vision tower, and a window-attention `vit_merger`.
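As a quick sanity check before converting, the architecture string can be looked up in the downloaded `config.json`. The helper below is a minimal sketch, not part of llama.cpp: the function name `check_minicpmv46_arch` is hypothetical, and it does a plain text match rather than a full JSON parse.

```bash
# Hypothetical helper (not shipped with llama.cpp): checks that the
# checkpoint directory given as $1 declares the expected architecture.
# This is a plain text match on config.json, not a full JSON parse.
check_minicpmv46_arch() {
    local config="$1/config.json"
    if grep -q '"MiniCPMV4_6ForConditionalGeneration"' "$config"; then
        echo "ok: $config declares MiniCPMV4_6ForConditionalGeneration"
    else
        echo "unexpected or missing architecture in $config" >&2
        return 1
    fi
}
```

For example, `check_minicpmv46_arch MiniCPM-V-4_6` returns a non-zero status if the folder holds a different checkpoint or `config.json` is missing.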
### Build llama.cpp
If the steps below differ for your platform or backend, refer to the official build [documentation](https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md).

Clone llama.cpp:
```bash
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
```
Build llama.cpp using `CMake`:
```bash
cmake -B build
cmake --build build --config Release
```
### Usage of MiniCPM-V 4.6
Unlike older MiniCPM-V variants, MiniCPM-V 4.6 is converted directly through `convert_hf_to_gguf.py`. The same script is invoked twice on the original Hugging Face directory: once to produce the language-model GGUF and once with `--mmproj` to produce the multimodal projector GGUF.
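The two passes described above can be sketched as a small shell function. This is an illustration under stated assumptions rather than the official procedure: the function name `convert_minicpmv46`, the output file names, and the `f16` output type are all hypothetical choices, and the function expects to be run from the llama.cpp repository root, where `convert_hf_to_gguf.py` lives.

```bash
# Hypothetical wrapper: runs convert_hf_to_gguf.py twice on the same
# Hugging Face directory. Output names and the f16 type are assumptions.
convert_minicpmv46() {
    local model_dir="$1"   # original Hugging Face checkpoint directory
    local out_dir="$2"     # directory that receives the two GGUF files

    mkdir -p "$out_dir" || return 1

    # Pass 1: language-model GGUF.
    python ./convert_hf_to_gguf.py "$model_dir" \
        --outfile "$out_dir/model-f16.gguf" --outtype f16 || return 1

    # Pass 2: same directory, with --mmproj for the multimodal projector GGUF.
    python ./convert_hf_to_gguf.py "$model_dir" --mmproj \
        --outfile "$out_dir/mmproj-model-f16.gguf" --outtype f16
}
```

A call such as `convert_minicpmv46 ../MiniCPM-V-4_6 ../MiniCPM-V-4_6/gguf` would then leave both files in one place; adjust the paths to wherever the checkpoint was downloaded.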