
OpenVINO GenAI tests NPU support and Windows fixes #1660

Open
helena-intel wants to merge 3 commits into huggingface:main from helena-intel:helena/test-ov-genai-npu

Conversation

@helena-intel (Collaborator)

Update OpenVINO GenAI tests

  • Fix issues and access violations caused by TemporaryDirectory on Windows
  • Add initial support for NPU
    • Speech2Text (Whisper), selected LLMs, and selected VLMs are supported for now. More models will be added later.
  • For LLMs, compare tokens instead of detokenized text. This fixes issues on GPU.
  • On GPU, there are a few known failures, mentioned at the top of the file. We are looking into them.
  • Use the chat template in VLM tests, matching what preprocess_inputs does in optimum-intel. If models added in the future do not support a chat template, we can make this an option, but for now all VLM tests pass with it; see the sketch below.
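
For illustration, preparing VLM inputs through a chat template might look like the sketch below. This assumes a recent transformers release where processors expose apply_chat_template; the model id and message content are placeholders, not the PR's actual test inputs.

# Sketch: build the VLM prompt through the processor's chat template, the
# same wrapping optimum-intel's preprocess_inputs applies. Placeholders only.
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("some-org/some-vlm")  # placeholder id
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What is in this image?"},
        ],
    }
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)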

The solution for the temporary directory looks convoluted, but this was trickier than expected because we also want the directory to be deleted when a test fails.
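
As a rough illustration only, failure-safe cleanup can be done with a yield fixture (a minimal sketch, not the PR's exact mechanism; the fixture name is made up, and ignore_cleanup_errors requires Python 3.10+):

import gc
import tempfile

import pytest

@pytest.fixture
def temp_dir():
    # ignore_cleanup_errors tolerates files still locked by open handles,
    # which is the usual failure mode on Windows.
    with tempfile.TemporaryDirectory(ignore_cleanup_errors=True) as tmp:
        yield tmp
        # Teardown runs whether the test passed or failed. Collect garbage
        # first so objects holding file handles (pipelines, mmap'd weights)
        # are released before the directory is removed.
        gc.collect()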

I tested GPU and NPU on LNL 258V with Linux and Windows.
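
To make the NPU item above concrete, device gating in pytest can look roughly like this (a sketch; the class name and model list are hypothetical, and OPENVINO_DEVICE mirrors the environment-driven variable the tests use):

import os

import pytest

OPENVINO_DEVICE = os.environ.get("OPENVINO_DEVICE", "CPU")

# Restrict the tested model set on NPU; names are illustrative.
SUPPORTED_ARCHITECTURES = (
    ["llama"] if OPENVINO_DEVICE == "NPU" else ["llama", "qwen2", "chatglm4"]
)

# Skip whole test classes on NPU until the underlying models are supported.
@pytest.mark.skipif(OPENVINO_DEVICE == "NPU", reason="not yet supported on NPU")
class TestSomethingUnsupportedOnNPU:
    ...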

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copilot AI (Contributor) left a comment

Pull request overview

Updates the OpenVINO GenAI integration tests to improve cross-device stability (notably Windows/GPU) and introduce initial NPU coverage.

Changes:

  • Add a pytest-based temp directory/traceback cleanup mechanism to avoid Windows file-handle issues after failures.
  • Add initial NPU support by restricting the tested model sets and skipping unsupported test classes.
  • Make LLM comparisons more robust on GPU by comparing generated token IDs rather than detokenized text (see the sketch after this list); align VLM generation with chat templates.
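
A minimal sketch of the token-level comparison idea, assuming transformers-style generate() calls that return token-ID tensors (the helper and its arguments are placeholders, not the PR's code):

import torch

def assert_same_tokens(reference_model, tested_model, inputs):
    # Greedy decoding keeps both runs deterministic.
    ref_ids = reference_model.generate(**inputs, max_new_tokens=20, do_sample=False)
    new_ids = tested_model.generate(**inputs, max_new_tokens=20, do_sample=False)
    # Compare raw token IDs: decoded strings can differ in whitespace or
    # special-token handling even when the generated tokens match exactly.
    assert torch.equal(ref_ids, new_ids)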




# NPU does not support f32 inference
TEST_CONFIG = {"CACHE_DIR": ""} if OPENVINO_DEVICE == "NPU" else {**F32_CONFIG, "CACHE_DIR": ""}
Copilot AI Apr 21, 2026

TEST_CONFIG sets CACHE_DIR to an empty string. In Optimum's OpenVINO integration, CACHE_DIR is treated as an actual directory path when present, so passing "" can lead to an invalid cache path (or unexpected caching behavior) when compiling models. Prefer omitting CACHE_DIR entirely, or set it to a real directory under self.temp_dir if you need deterministic caching behavior in these tests.

Suggested change
- TEST_CONFIG = {"CACHE_DIR": ""} if OPENVINO_DEVICE == "NPU" else {**F32_CONFIG, "CACHE_DIR": ""}
+ TEST_CONFIG = {} if OPENVINO_DEVICE == "NPU" else {**F32_CONFIG}

@helena-intel (Collaborator, Author)

CACHE_DIR may be set by default to a particular directory; CACHE_DIR="" prevents that. We do not want model caching to be used during testing, even if the default for a particular device is to enable it.
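
For illustration, this is how the standard OpenVINO CACHE_DIR property behaves (a sketch with a placeholder model path, not the test code itself):

import openvino as ov

core = ov.Core()
model = core.read_model("model.xml")  # placeholder path

# Non-empty CACHE_DIR enables caching: compiled blobs are reused across runs.
# core.compile_model(model, "GPU", {"CACHE_DIR": "./ov_cache"})

# Empty CACHE_DIR keeps caching off, even where a device enables it by default.
compiled = core.compile_model(model, "GPU", {"CACHE_DIR": ""})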

Comment thread: tests/openvino/test_genai.py
@rkazants (Collaborator)

@helena-intel, please resolve the merge conflict. We'll re-run CI after that.

In the meantime, @popovaan, please take a look at this PR.

@helena-intel force-pushed the helena/test-ov-genai-npu branch from eef8826 to 0137597 on May 1, 2026 15:15
- Fix TemporaryDirectory issues on Windows
- Compare model output tokens instead of detokenized outputs for LLMs
- Initial NPU support
- Use chat template for VLM test
@helena-intel force-pushed the helena/test-ov-genai-npu branch from 0137597 to dee6428 on May 1, 2026 15:23
- Change supported versions for deepseek and qwen
- ChatGLM issue is caused by NaN in tiny model outputs, tracked by an
  internal ticket. For now, remove chatglm from genai tests. This only
  affects chatglm, not chatglm4.
@helena-intel force-pushed the helena/test-ov-genai-npu branch from e180fc1 to 57eef28 on May 4, 2026 15:57
@rkazants (Collaborator)

rkazants commented May 5, 2026

@anatyrova, @regisss, please take a look at this PR

@rkazants rkazants requested a review from regisss May 5, 2026 11:58
@regisss (Contributor) left a comment

LGTM
