
Fix training gradient underflow in quantization tests #13539

Open
jiqing-feng wants to merge 4 commits into huggingface:main from jiqing-feng:torchao-fix-training-underflow

Conversation

@jiqing-feng
Contributor

What does this PR do?

This PR changes the autocast dtype from float16 to bfloat16 in _test_quantization_training. Float16's limited dynamic range (max ~65504, smallest subnormal ~5.96e-8) causes gradients to underflow to zero when passing through quantized tensor-subclass operations; bfloat16 shares float32's exponent range and avoids this.
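
To make the failure mode concrete, here is a minimal sketch (not from the PR) comparing the two dtypes; the printed values are approximately what PyTorch reports:

    import torch

    # float16 has a 5-bit exponent: magnitudes below ~5.96e-8 flush to zero.
    # bfloat16 shares float32's 8-bit exponent, so tiny gradients survive.
    tiny_grad = torch.tensor(1e-10)
    print(tiny_grad.to(torch.float16))   # tensor(0., dtype=torch.float16) -- underflow
    print(tiny_grad.to(torch.bfloat16))  # ~1e-10, still nonzero

    print(torch.finfo(torch.float16).smallest_normal)   # ~6.10e-05
    print(torch.finfo(torch.bfloat16).smallest_normal)  # ~1.18e-38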

@jiqing-feng
Contributor Author

Hi @sayakpaul. Would you please review the PR? Thanks!

inputs = self.get_dummy_inputs()

# Use bfloat16 instead of float16 to avoid gradient underflow with quantized layers
with torch.amp.autocast(torch_device, dtype=torch.bfloat16):
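
For context, a minimal sketch of the kind of training step this line guards (the model, inputs, and gradient check below are illustrative stand-ins, not the actual diffusers test harness):

    import torch

    torch_device = "cuda" if torch.cuda.is_available() else "cpu"
    model = torch.nn.Linear(8, 8).to(torch_device)
    inputs = torch.randn(2, 8, device=torch_device)

    # Forward under bfloat16 autocast; backward runs outside the context,
    # as recommended for torch.amp.
    with torch.amp.autocast(torch_device, dtype=torch.bfloat16):
        loss = model(inputs).pow(2).mean()
    loss.backward()

    # Under float16 autocast, tiny gradients flowing through quantized
    # tensor-subclass ops can underflow to exactly zero and fail a check
    # like this one; bfloat16's wider exponent range avoids that.
    assert all(p.grad is not None and p.grad.abs().sum() > 0 for p in model.parameters())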
Member


Is this quantization backend agnostic?


Labels

size/S (PR with diff < 50 LOC), tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants