Add support for Ouro (ByteDance/Ouro-1.4B) by openvino-dev-samples · Pull Request #1783 · huggingface/optimum-intel

openvino-dev-samples · 2026-06-11T02:11:58Z

What does this PR do?

This PR adds OpenVINO export and inference support for the Ouro model family (OuroForCausalLM).

Ouro is a Universal Transformer decoder: the same num_hidden_layers decoder layers are looped total_ut_steps times, and every iteration stores its own key/value entry. The only thing required for a correct export is a normalized config that reports num_layers = num_hidden_layers * total_ut_steps, so the exported model exposes the right number of past-key-value pairs. No custom model patching or new operators are needed — the stock OVDecoderModelPatcher is reused.

Export the model:

optimum-cli export openvino --model ByteDance/Ouro-1.4B --trust-remote-code ouro_ov

Run inference:

from transformers import AutoTokenizer
from optimum.intel import OVModelForCausalLM

model_id = "ByteDance/Ouro-1.4B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = OVModelForCausalLM.from_pretrained(model_id, export=True, trust_remote_code=True)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0])

Notes:

Ouro relies on remote modeling code that is incompatible with transformers v5, so tests are gated to >=4.53.0, <5 (validated against transformers==4.57.1).
Ouro's custom UniversalTransformerCache makes cached generation diverge from uncached under left-padding + beam search (this happens in transformers too), so in tests Ouro joins the other remote-code custom-cache models that compare against transformers with use_cache=False.
A tiny test model is added at optimum-intel-internal-testing/tiny-random-ouro (full vocab + real tokenizer, total_ut_steps=4).

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Copilot

Pull request overview

Adds OpenVINO export/inference support for the ByteDance Ouro Universal Transformer decoder family by introducing a normalized config that reports an effective layer count (to expose the correct number of past key/value pairs), plus wiring the architecture into the OpenVINO test matrix and documentation.

Changes:

Register ouro in the OpenVINO TasksManager with a custom NormalizedOuroConfig that expands num_layers = num_hidden_layers * total_ut_steps.
Add ouro to OpenVINO decoder/export/CLI/quantization test parametrizations (gated to transformers>=4.53.0,<5) and tiny model mappings.
Document Ouro as a supported OpenVINO architecture.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/openvino/utils_tests.py	Adds tiny Ouro test model mapping, expected node counts, and remote-code gating for `<5`.
tests/openvino/test_quantization.py	Adds Ouro to auto-compression test coverage under the `<5` transformers gate.
tests/openvino/test_exporters_cli.py	Adds Ouro to CLI export test matrix and expected tokenizer model count.
tests/openvino/test_export.py	Adds Ouro to exporter integration tests under the `<5` transformers gate.
tests/openvino/test_decoder.py	Adds Ouro to decoder integration tests and aligns comparison behavior with other custom-cache remote-code models.
optimum/exporters/openvino/model_configs.py	Registers `ouro` OpenVINO config and introduces `NormalizedOuroConfig` to expand effective layer count.
docs/source/openvino/models.mdx	Lists Ouro among supported OpenVINO architectures.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+class OuroOpenVINOConfig(TextDecoderWithPositionIdsOpenVINOConfig):
+    DUMMY_INPUT_GENERATOR_CLASSES = (DummyTextInputGenerator, MistralDummyPastKeyValuesGenerator)
+    DUMMY_PKV_GENERATOR_CLASS = MistralDummyPastKeyValuesGenerator
+    NORMALIZED_CONFIG_CLASS = NormalizedOuroConfig
+    MIN_TRANSFORMERS_VERSION = "4.53.0"
+    _MODEL_PATCHER = OVDecoderModelPatcher


rkazants · 2026-06-11T09:59:49Z

 - OLMo 2
 - OPT
 - Orion
+- Ouro


document this model in the bottom and share links. Check other trust-remote-code models like Kokoro.
Do we need any additinal library to setup for model loading?

HuggingFaceDocBuilderDev · 2026-06-11T10:00:42Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Ouro is a Universal Transformer decoder: the same num_hidden_layers decoder layers are looped total_ut_steps times, and every iteration stores its own key/value entry. Register an OpenVINO export config with a NormalizedOuroConfig that reports num_layers = num_hidden_layers * total_ut_steps so the exported model exposes the right number of past-key-value pairs. No model patching is required beyond the standard OVDecoderModelPatcher. Add Ouro to the decoder, export, exporters-cli and quantization tests (tiny-random-ouro, full vocab + real tokenizer, total_ut_steps=4). Because Ouro's custom UniversalTransformerCache makes cached generation diverge from uncached under left-padding + beam search (this happens in PyTorch too), Ouro joins the other remote-code custom-cache models that compare against transformers with use_cache=False. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

openvino-agent · 2026-06-12T11:20:31Z

+    DUMMY_INPUT_GENERATOR_CLASSES = (DummyTextInputGenerator, MistralDummyPastKeyValuesGenerator)
+    DUMMY_PKV_GENERATOR_CLASS = MistralDummyPastKeyValuesGenerator
+    NORMALIZED_CONFIG_CLASS = NormalizedOuroConfig
+    MIN_TRANSFORMERS_VERSION = "4.53.0"


since this is trust-remote-code model, what is max version?

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

openvino-dev-samples marked this pull request as draft June 11, 2026 02:21

openvino-dev-samples force-pushed the add-ouro-support branch from 14df50e to b32d639 Compare June 11, 2026 02:41

openvino-dev-samples marked this pull request as ready for review June 11, 2026 02:41

rkazants requested review from Copilot and echarlaix June 11, 2026 09:56

Copilot started reviewing on behalf of rkazants June 11, 2026 09:56 View session

rkazants self-requested a review June 11, 2026 09:57

rkazants approved these changes Jun 11, 2026

View reviewed changes

Copilot AI reviewed Jun 11, 2026

View reviewed changes

rkazants reviewed Jun 11, 2026

View reviewed changes

openvino-dev-samples force-pushed the add-ouro-support branch from b32d639 to f3ad828 Compare June 11, 2026 13:57

rkazants reviewed Jun 12, 2026

View reviewed changes

Comment thread docs/source/openvino/models.mdx Outdated

openvino-agent reviewed Jun 12, 2026

View reviewed changes

Update docs/source/openvino/models.mdx

6b9b8ac

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

rkazants reviewed Jun 17, 2026

View reviewed changes

Comment thread optimum/exporters/openvino/model_configs.py

openvino-dev-samples and others added 2 commits June 17, 2026 23:27

Update optimum/exporters/openvino/model_configs.py

2a6f2c1

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

Merge branch 'main' into add-ouro-support

69c97c8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for Ouro (ByteDance/Ouro-1.4B)#1783

Add support for Ouro (ByteDance/Ouro-1.4B)#1783
openvino-dev-samples wants to merge 4 commits into
huggingface:mainfrom
openvino-dev-samples:add-ouro-support

openvino-dev-samples commented Jun 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

rkazants Jun 11, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Jun 11, 2026

Uh oh!

Uh oh!

openvino-agent Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

openvino-dev-samples commented Jun 11, 2026

What does this PR do?

Before submitting

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

rkazants Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jun 11, 2026

Uh oh!

Uh oh!

openvino-agent Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants