server : auto-insert media marker in embedding / multimodal prompts by TheOneWhoWill · Pull Request #25093 · ggml-org/llama.cpp

TheOneWhoWill · 2026-06-28T06:55:05Z

The /embedding (and /embeddings, /v1/embeddings) endpoints failed with "number of media markers in text (0) does not match number of bitmaps (1)" when passing multimodal data via the "content" object format.

The server initializes the mtmd context with a randomized media marker (via get_media_marker()), but process_mtmd_prompt() passed the raw prompt string to mtmd_tokenize() without ensuring it contained the required markers. The CLI (mtmd-cli.cpp) already handles this by auto-prepending markers, but the server did not.

Fix: query the actual marker from the mtmd context via mtmd_get_marker() and auto-insert one per file if the prompt lacks them.

Overview

Fixes #25088

Essentially calls to the /embedding endpoint were failing because the process_mtmd_prompt function in tools/server/server-common.cpp passes the raw text from a user's prompt without including the placeholder marker from mtmd_default_marker() and one is required for each attatched image. I added a simple check for existence and inserted 1 per image.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

Copilot

Pull request overview

This PR fixes multimodal /embedding requests in the server by ensuring the mtmd media marker is present in the text prompt before tokenization, aligning server behavior with the multimodal CLI and preventing marker/bitmap count mismatches.

Changes:

Query the active marker from the mtmd context (mtmd_get_marker()).
Auto-prepend media markers to the prompt before calling mtmd_tokenize() when markers are missing.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.

The /embedding (and /embeddings, /v1/embeddings) endpoints failed with "number of media markers in text (0) does not match number of bitmaps (1)" when passing multimodal data via the "content" object format. The server initializes the mtmd context with a randomized media marker (via get_media_marker()), but process_mtmd_prompt() passed the raw prompt string to mtmd_tokenize() without ensuring it contained the required markers. The CLI (mtmd-cli.cpp) already handles this by auto-prepending markers, but the server did not. Fix: query the actual marker from the mtmd context via mtmd_get_marker() and auto-insert one per file if the prompt lacks them. server: auto-insert missing media markers in process_mtmd_prompt Fixes the /embedding endpoint when multimodal data is provided without corresponding media markers in the prompt string. Counts existing markers and prepends only the missing number so the count matches files.size(). Assisted-by: GitHub Copilot Potential fix for pull request finding This just makes the wording more accurate Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> server: auto-insert missing media markers in process_mtmd_prompt Fixes the /embedding endpoint when multimodal data is provided without corresponding media markers in the prompt string. Counts existing markers and prepends only the missing number so the count matches files.size(). Assisted-by: GitHub Copilot

Forgot to remove merge conflict headers Co-authored-by: AGawas <94751172+aln730@users.noreply.github.com>

aln730

LGTM. squash you commits.

Copilot AI review requested due to automatic review settings June 28, 2026 06:55

TheOneWhoWill requested a review from a team as a code owner June 28, 2026 06:55

Copilot started reviewing on behalf of TheOneWhoWill June 28, 2026 06:55 View session

github-actions Bot added the server label Jun 28, 2026

Copilot AI reviewed Jun 28, 2026

View reviewed changes

Comment thread tools/server/server-common.cpp Outdated

TheOneWhoWill closed this Jun 28, 2026

TheOneWhoWill reopened this Jun 28, 2026

TheOneWhoWill closed this Jun 28, 2026

TheOneWhoWill reopened this Jun 28, 2026

TheOneWhoWill requested a review from Copilot June 28, 2026 07:26

Copilot started reviewing on behalf of TheOneWhoWill June 28, 2026 07:27 View session

Copilot AI reviewed Jun 28, 2026

View reviewed changes

Comment thread tools/server/server-common.cpp

TheOneWhoWill force-pushed the master branch 2 times, most recently from 33ac645 to e50014f Compare June 28, 2026 07:53

TheOneWhoWill force-pushed the master branch from e50014f to 46cb6ea Compare June 28, 2026 07:57

aln730 reviewed Jun 28, 2026

View reviewed changes

Comment thread tools/server/server-common.cpp Outdated

Update tools/server/server-common.cpp

024c114

Forgot to remove merge conflict headers Co-authored-by: AGawas <94751172+aln730@users.noreply.github.com>

aln730 approved these changes Jun 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server : auto-insert media marker in embedding / multimodal prompts#25093

server : auto-insert media marker in embedding / multimodal prompts#25093
TheOneWhoWill wants to merge 2 commits into
ggml-org:masterfrom
TheOneWhoWill:master

TheOneWhoWill commented Jun 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

aln730 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

TheOneWhoWill commented Jun 28, 2026

Overview

Requirements

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

aln730 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants