[wwb] move to dtype auto by sbalandi · Pull Request #3793 · openvinotoolkit/openvino.genai

sbalandi · 2026-05-01T13:59:21Z

Description

CVS-###

Fixes #(issue)

Checklist:

This PR follows GenAI Contributing guidelines.
Tests have been updated or added to cover the new code.
This PR fully addresses the ticket.
I have made corresponding changes to the documentation.

Copilot

Pull request overview

Updates WWB’s default HuggingFace model loading dtype behavior to rely on Transformers’ automatic dtype selection rather than forcing fp32.

Changes:

Switch PYTORCH_MODEL_DTYPE_KWARG from torch.float32 to "auto" for HF model loading in WWB.

Copilot · 2026-05-01T14:02:12Z

+PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": "auto"}




Setting torch_dtype to "auto" for all HuggingFace loads can break CPU execution: many models advertise fp16/bf16 in config, and loading them on CPU frequently leads to runtime errors (e.g., Half ops not implemented) or unexpected slowdowns. Consider keeping torch.float32 when device is CPU / CUDA is unavailable, and only using "auto" on GPU (or make dtype configurable via CLI).

Suggested change

-PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": "auto"}
+def _get_pytorch_model_dtype_kwarg():
+    if torch.cuda.is_available():
+        return {"torch_dtype": "auto"}
+    return {"torch_dtype": torch.float32}
+PYTORCH_MODEL_DTYPE_KWARG = _get_pytorch_model_dtype_kwarg()
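
The other option raised in this comment, making the dtype configurable via the CLI, could look roughly like the sketch below. It is only an illustration under stated assumptions: the `--hf-dtype` flag, the `--device` default, and the helper names `build_parser`, `resolve_dtype_name` and `to_dtype_kwarg` are hypothetical and are not taken from the WWB code. An explicit flag wins; otherwise CPU falls back to float32 and any other device uses "auto".

```python
import argparse

# Hypothetical set of accepted values; "auto" defers to the checkpoint's own dtype.
DTYPE_CHOICES = ("auto", "float32", "float16", "bfloat16")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with illustrative options (names are assumptions)."""
    parser = argparse.ArgumentParser(description="HF dtype selection sketch")
    parser.add_argument("--device", default="CPU")
    parser.add_argument("--hf-dtype", choices=DTYPE_CHOICES, default=None)
    return parser


def resolve_dtype_name(requested, device):
    """Return the dtype name to use: an explicit request always wins."""
    if requested is not None:
        return requested
    # Without an explicit request, stay on float32 for CPU runs.
    return "float32" if device.upper() == "CPU" else "auto"


def to_dtype_kwarg(name):
    """Turn a dtype name into the kwarg dict passed to from_pretrained."""
    if name == "auto":
        return {"torch_dtype": "auto"}
    import torch  # deferred so the "auto" path does not need torch

    return {"torch_dtype": getattr(torch, name)}
```

With these helpers, `resolve_dtype_name(None, "CPU")` yields `"float32"`, while passing `--hf-dtype auto` opts into automatic selection on any device, which would let validation runs on CPU enable "auto" deliberately rather than by default.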

Copilot · 2026-05-01T14:02:12Z

 logger = logging.getLogger(__name__)

-PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": torch.float32}
+PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": "auto"}


PR description still contains placeholders (e.g., CVS-###, Fixes #(issue)) and the checklist is not filled out. Please update the PR description to match the repository template before merging so reviewers can confirm scope, tests, and docs impact.

Copilot · 2026-05-01T14:02:13Z

 logger = logging.getLogger(__name__)

-PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": torch.float32}
+PYTORCH_MODEL_DTYPE_KWARG = {"torch_dtype": "auto"}


This change alters the default dtype selection for HF models, but the WWB test suite doesn’t appear to cover the new behavior (e.g., loading a model whose config defaults to fp16/bf16 on CPU). Please add/update a WWB test to exercise HF loading with a non-fp32 default dtype (ideally using a tiny-random model) so regressions are caught.

rkazants

Before moving to the auto dtype, please make sure that there will be no problems on CPU.
I expect multiple JIRA tickets to be assigned to optimum-intel after this change. However, the problem may be in the CPU plugin.
So I would ask you to do validation runs and identify all existing issues on CPU, so that we know about them in advance.

sbalandi · 2026-05-05T17:12:25Z

Before moving to the auto dtype, please make sure that there will be no problems on CPU. I expect multiple JIRA tickets to be assigned to optimum-intel after this change. However, the problem may be in the CPU plugin. So I would ask you to do validation runs and identify all existing issues on CPU, so that we know about them in advance.

Yes, this PR was created for validation purposes.

[wwb] move to dtype auto

357a72b

Copilot AI review requested due to automatic review settings May 1, 2026 13:59

github-actions Bot added the category: WWB PR changes WWB label May 1, 2026

Copilot started reviewing on behalf of sbalandi May 1, 2026 13:59

Copilot AI reviewed May 1, 2026

rkazants reviewed May 4, 2026

sbalandi wants to merge 1 commit into openvinotoolkit:master from sbalandi:dtype_auto


3 participants

