Feat/ollama model #1322
Conversation
Hey, could someone review this PR? It just adds the Ollama interface to models/ to run local evals with text and images. I tested evals such as gsm8k with few_shot, vqa_val_lite, and mme. If there is anything I missed, just let me know!
kcz358
left a comment
Hi, thank you for the contribution. It seems like this model just inherits oai with some changes to the host or base URL, where everything can also be configured using the original openai class itself. If this is just a self-hosted oai server, I think we can just use the original oai chat models instead of creating a new ollama model.
Yes, that actually makes a lot of sense. The only reason for the push I see now is that with the ollama model available, more people would be aware that they can use it, because I don't think everyone knows they can use the openai model with an Ollama self-host.
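For reference, the reviewer's point can be sketched concretely: Ollama serves an OpenAI-compatible API under `/v1` (default port 11434), so an openai-style chat model only needs its base URL pointed at the local server. A minimal sketch of the payload that would go over the wire — `build_chat_request` is a hypothetical helper, and the model name is taken from the validation runs in this PR:

```python
import json

# Ollama's OpenAI-compatible endpoint lives under /v1; 11434 is Ollama's
# default port. With an openai-style chat model, pointing base_url here
# (plus a dummy api_key) is essentially the only change needed.
OLLAMA_BASE_URL = "http://localhost:11434/v1"
MODEL = "smollm2:135m"  # model name taken from the validation runs below


def build_chat_request(prompt: str, model: str = MODEL) -> dict:
    # Hypothetical helper: the chat-completions payload an oai-style
    # class would POST to f"{OLLAMA_BASE_URL}/chat/completions".
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


payload = build_chat_request("What is 2 + 2?")
print(json.dumps(payload, indent=2))
```

With the official `openai` Python client, the equivalent is constructing the client with `base_url="http://localhost:11434/v1"` and any placeholder `api_key`; Ollama ignores the key, but the client typically requires a non-empty one.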
Summary
- `ollama` chat backend for local inference through Ollama's OpenAI-compatible `/v1` API.
- `generate_until` evals with Ollama models.
- `loglikelihood` non-support.

In scope
- `lmms_eval/models/chat/ollama.py`.
- `"ollama": "Ollama"` in `AVAILABLE_CHAT_TEMPLATE_MODELS`.
- `test/models/test_ollama.py` for backend registration and constructor behavior.
- `generate_until` through Ollama's OpenAI-compatible chat completions endpoint.

Out of scope
- `loglikelihood` support; Ollama returns generated-token logprobs, not the prompt/continuation likelihoods required by lmms-eval.

Validation
| Command | Sample size | Key metrics | Result |
| --- | --- | --- | --- |
| `uv --cache-dir .\.uv-cache run --with pytest python -m pytest test/models/test_ollama.py -v` | N=7 tests | 7 passed | pass |
| `uv --cache-dir .\.uv-cache run python -m lmms_eval --model ollama --model_args model_version=smollm2:135m --include_path C:\tmp\lmms_tasks --tasks gsm8k_ollama_1shot --limit 8 --batch_size 2` | N=8 | flexible-extract exact_match=0.125, strict-match exact_match=0.000 | pass |
| `uv --cache-dir .\.uv-cache run python -m lmms_eval --model ollama --model_args model_version=<vision-model> --tasks ok_vqa_val2014_lite --limit 1 --batch_size 1` | N=1 | vision generate_until completed and produced metrics table | pass |

Risk / Compatibility
Type of Change