mtmd: refactor mtmd_decode_use_mrope by ngxson · Pull Request #22161 · ggml-org/llama.cpp

ngxson · 2026-04-20T11:08:57Z

Overview

Traditionally, the use_mrope status is determined by the mmproj arch name. However, more and more models pair a qwen-style vision encoder with a non-qwen text model that doesn't use m-rope.

This PR allows handling these use cases.
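
The mismatch described above can be illustrated with a small self-contained sketch. Every type and function name here is hypothetical and none is taken from the PR diff or from the mtmd API. The idea of deciding from the text model's RoPE type is an assumption made for illustration, not a description of the actual change.

```cpp
#include <string>

// Hypothetical stand-ins for illustration only; NOT the real mtmd/llama.cpp types.
enum class rope_kind { norm, neox, mrope };

struct projector_info {
    std::string arch; // mmproj arch name (illustrative value, e.g. "qwen2vl_merger")
};

struct text_model_info {
    rope_kind rope;   // RoPE variant the text model actually uses
};

// Name-based heuristic: infer m-rope from the projector arch name alone.
// This is wrong when a qwen-style encoder is paired with a non-qwen text model.
bool use_mrope_from_projector(const projector_info & proj) {
    return proj.arch.rfind("qwen", 0) == 0; // true if the name starts with "qwen"
}

// Assumed alternative: decide from the text model, independent of the encoder.
bool use_mrope_from_text_model(const text_model_info & txt) {
    return txt.rope == rope_kind::mrope;
}
```

For a qwen-style projector attached to a text model that uses a non-m-rope variant, the first function returns true while the second returns false, which is the case the overview says the old approach could not handle.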

Test results:

[vision] OK:   ggml-org/SmolVLM-500M-Instruct-GGUF:Q8_0
[vision] OK:   ggml-org/SmolVLM2-2.2B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/SmolVLM2-500M-Video-Instruct-GGUF:Q8_0
[vision] OK:   ggml-org/gemma-3-4b-it-GGUF:Q4_K_M
[vision] OK:   THUDM/glm-edge-v-5b-gguf:Q4_K_M
[vision] OK:   second-state/Llava-v1.5-7B-GGUF:Q2_K
[vision] OK:   cjpais/llava-1.6-mistral-7b-gguf:Q3_K_M
[vision] OK:   ibm-research/granite-vision-3.2-2b-GGUF:Q4_K_M
[vision] OK:   second-state/MiniCPM-Llama3-V-2_5-GGUF:Q2_K
[vision] OK:   openbmb/MiniCPM-V-2_6-gguf:Q2_K
[vision] OK:   openbmb/MiniCPM-o-2_6-gguf:Q4_0
[vision] OK:   bartowski/Qwen2-VL-2B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen2.5-VL-3B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/InternVL2_5-1B-GGUF:Q8_0
[vision] OK:   ggml-org/InternVL3-1B-Instruct-GGUF:Q8_0
[vision] OK:   ggml-org/Qwen2.5-Omni-3B-GGUF:Q4_K_M
[vision] OK:   ggml-org/LFM2-VL-450M-GGUF:Q8_0
[vision] OK:   ggml-org/granite-docling-258M-GGUF:Q8_0
[vision] OK:   ggml-org/LightOnOCR-1B-1025-GGUF:Q8_0
[vision] OK:   ggml-org/DeepSeek-OCR-GGUF:Q8_0
[vision] OK:   ggml-org/dots.ocr-GGUF:Q8_0
[vision] OK:   ggml-org/HunyuanOCR-GGUF:Q8_0
[vision] OK:   ggml-org/gemma-4-E2B-it-GGUF:Q8_0
[audio]  OK:   ggml-org/ultravox-v0_5-llama-3_2-1b-GGUF:Q8_0
[audio]  OK:   ggml-org/Qwen2.5-Omni-3B-GGUF:Q4_K_M
[audio]  OK:   ggml-org/Voxtral-Mini-3B-2507-GGUF:Q4_K_M
[audio]  OK:   ggml-org/LFM2-Audio-1.5B-GGUF:Q8_0
[audio]  OK:   ggml-org/gemma-4-E2B-it-GGUF:Q8_0
[audio]  OK:   ggml-org/Qwen3-ASR-0.6B-GGUF:Q8_0
[vision] OK:   ggml-org/pixtral-12b-GGUF:Q4_K_M
[vision] OK:   ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF
[vision] OK:   ggml-org/Qwen2-VL-2B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen2-VL-7B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen2.5-VL-3B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen2.5-VL-7B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen3-VL-2B-Instruct-GGUF:Q8_0
[vision] OK:   ggml-org/InternVL3-8B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/InternVL3-14B-Instruct-GGUF:Q4_K_M
[vision] OK:   ggml-org/Qwen2.5-Omni-7B-GGUF:Q4_K_M
[vision] OK:   ggml-org/GLM-4.6V-Flash-GGUF:Q4_K_M
[audio]  OK:   ggml-org/ultravox-v0_5-llama-3_1-8b-GGUF:Q4_K_M
[audio]  OK:   ggml-org/Qwen2.5-Omni-7B-GGUF:Q4_K_M

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: no

mtmd: refactor mtmd_decode_use_mrope

6f2b00a

ngxson requested a review from a team April 20, 2026 11:08

ngxson requested a review from a team as a code owner April 20, 2026 11:08

ngxson mentioned this pull request Apr 20, 2026

feat: Support sarashina2.2-vision-3b model #22103

Merged

CISC approved these changes Apr 20, 2026

pwilkin approved these changes Apr 20, 2026

ngxson merged commit a678916 into ggml-org:master Apr 20, 2026
50 of 51 checks passed

github-actions Bot added the examples label Apr 20, 2026

This was referenced Apr 20, 2026

mtmd: correct get_n_pos / get_decoder_pos #22175

Merged

Add EXAONE 4.5 implementations #21733

Open

ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request Apr 21, 2026

mtmd: refactor mtmd_decode_use_mrope (ggml-org#22161)

9d456a2

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Apr 23, 2026

mtmd: refactor mtmd_decode_use_mrope (ggml-org#22161)

658baef

rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026

mtmd: refactor mtmd_decode_use_mrope (ggml-org#22161)

f976d03

jimbothigpen pushed a commit to jimbothigpen/frankenturbo2 that referenced this pull request May 2, 2026

mtmd: refactor mtmd_decode_use_mrope (ggml-org#22161)

8632d80

ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026

mtmd: refactor mtmd_decode_use_mrope (ggml-org#22161)

2a1c3e3

ngxson merged 1 commit into ggml-org:master from ngxson:xsn/mtmd_refactor_mrope

3 participants
