Fix: use resample-aware bilinear+antialias interpolation for tensor/numpy resize in VaeImageProcessor by GitGlimpse895 · Pull Request #13500 · huggingface/diffusers

GitGlimpse895 · 2026-04-18T11:27:36Z

What does this PR do?

VaeImageProcessor exposes a resample config parameter (defaulting to "lanczos")
and correctly applies it when resizing PIL images via PIL_INTERPOLATION. However,
the two torch.nn.functional.interpolate calls handling torch.Tensor and
np.ndarray inputs passed no mode argument — causing PyTorch to silently default
to "nearest" neighbor interpolation, regardless of the configured resample filter.
No antialias=True was set either, causing aliasing artifacts on downsampling.

This fix:

Adds a TORCH_INTERPOLATION dict in pil_utils.py mapping the same resample-string
keys as PIL_INTERPOLATION to their torch.nn.functional.interpolate equivalents
(with antialias eligibility). "lanczos" maps to bilinear+antialias, the closest
high-quality torch substitute.
Updates both tensor branches of VaeImageProcessor.resize() to use the mapped mode
and antialias flag, making tensor/numpy resize quality consistent with the PIL path.

This silently affected every pipeline that passes torch.Tensor inputs to
VaeImageProcessor (ControlNet conditioning, IP-Adapter, img2img, etc.).

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if
that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc
(important for complex PRs)?
Was this discussed/approved via a GitHub issue or the
forum?
Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@yiyixuxu @sayakpaul @DN6

ParamChordiya · 2026-04-18T18:38:21Z

Code Review: Fix resample-aware interpolation in VaeImageProcessor

Summary

This fixes a legitimate quality bug: VaeImageProcessor.resize() correctly applies the configured resample filter for PIL inputs, but the torch.Tensor and np.ndarray branches called F.interpolate with no mode argument, defaulting to "nearest" neighbor — producing blocky/aliased results. The TORCH_INTERPOLATION mapping dict and (mode, antialias) tuple pattern is clean.

Issues

"lanczos" maps to "bilinear" — should it be "bicubic"? — "bicubic" is a closer approximation to Lanczos in frequency response and sharpness. Since "lanczos" is the default resample value, this mapping affects the vast majority of users. Worth discussing.
No new tests — Existing tests only check output shapes, not interpolation quality or mode. Needs at minimum:
- A test verifying interpolation is not "nearest" (e.g. checkerboard pattern resize)
- A test that each key in TORCH_INTERPOLATION works without error
- A test comparing tensor vs PIL resize output similarity (PSNR/MSE threshold)
Import path inconsistency — PIL_INTERPOLATION is exported via utils/__init__.py, but TORCH_INTERPOLATION is imported directly from utils.pil_utils. Should be consistent.
Silent behavior change — This changes outputs for all existing pipelines passing tensor/numpy inputs. Deserves a changelog entry flagging it as a fix that changes output values.
No KeyError guard on mapping lookup — Invalid resample strings will produce an unhelpful KeyError. A ValueError with supported options listed would be better.
Duplicate code — The TORCH_INTERPOLATION lookup is identical in both the tensor and numpy branches. Could factor it out above with isinstance(image, (torch.Tensor, np.ndarray)).
resize_and_crop_tensor not updated — This static method also calls F.interpolate with hardcoded mode="bilinear" and no antialias. Worth a follow-up.

Verdict

Request changes — The core fix is correct, but needs tests, the lanczos mapping deserves discussion, and the import path should be consistent.

HuggingFaceDocBuilderDev · 2026-04-18T21:35:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu

thanks for the PR!
i left one comment

yiyixuxu · 2026-04-18T22:01:31Z

+    "linear": ("bilinear", True),
+    "bilinear": ("bilinear", True),
+    "bicubic": ("bicubic", True),
+    "lanczos": ("bilinear", True),


ohh, so if this option is not supported in torch, let's not map it to anything
just send a warning that says this resample mode is not supported for tensor/ndarray so it will be ignored (the default nearest is used intead). this way we don't change the default behavior for resize
what do you think?

GitGlimpse895 · 2026-04-19T02:47:25Z

Thanks @yiyixuxu — updated! Revised approach:

TORCH_INTERPOLATION now only maps modes torch natively supports
(bilinear, bicubic, nearest).
For unsupported modes like "lanczos", the code now emits a
logger.warning and falls back to "nearest", preserving existing
default behavior with no silent output change.
Also factored the duplicate lookup out of both tensor and numpy
branches into a single shared isinstance(image, (torch.Tensor, np.ndarray))
branch, eliminating the code duplication @ParamChordiya flagged.

…mode translation

…y resize

…mode translation

…nsor/numpy resize branch

github-actions bot added utils size/S PR with diff < 50 LOC labels Apr 18, 2026

yiyixuxu reviewed Apr 18, 2026

View reviewed changes

github-actions bot added size/S PR with diff < 50 LOC and removed size/S PR with diff < 50 LOC labels Apr 19, 2026

GitGlimpse895 added 4 commits April 19, 2026 08:17

utils/pil_utils: add TORCH_INTERPOLATION map for consistent resample-…

ddd4ac8

…mode translation

image_processor: use bilinear+antialias interpolation for tensor/nump…

d7610a1

…y resize

utils/pil_utils: add TORCH_INTERPOLATION map for consistent resample-…

33720b2

…mode translation

image_processor: warn on unsupported resample mode and deduplicate te…

d3456ea

…nsor/numpy resize branch

GitGlimpse895 force-pushed the fix/tensor-resize-interpolation branch from 799e72c to d3456ea Compare April 19, 2026 02:47

github-actions bot added size/S PR with diff < 50 LOC and removed size/S PR with diff < 50 LOC labels Apr 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: use resample-aware bilinear+antialias interpolation for tensor/numpy resize in VaeImageProcessor#13500

Fix: use resample-aware bilinear+antialias interpolation for tensor/numpy resize in VaeImageProcessor#13500
GitGlimpse895 wants to merge 4 commits intohuggingface:mainfrom
GitGlimpse895:fix/tensor-resize-interpolation

GitGlimpse895 commented Apr 18, 2026 •

edited

Loading

Uh oh!

ParamChordiya commented Apr 18, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 18, 2026

Uh oh!

yiyixuxu left a comment

Uh oh!

yiyixuxu Apr 18, 2026

Uh oh!

GitGlimpse895 commented Apr 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

GitGlimpse895 commented Apr 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

ParamChordiya commented Apr 18, 2026

Code Review: Fix resample-aware interpolation in VaeImageProcessor

Summary

Issues

Verdict

Uh oh!

HuggingFaceDocBuilderDev commented Apr 18, 2026

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

GitGlimpse895 commented Apr 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

GitGlimpse895 commented Apr 18, 2026 •

edited

Loading