fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBlock.forward by RudrenduPaul · Pull Request #45352 · huggingface/transformers

RudrenduPaul · 2026-04-09T21:53:31Z

What does this PR do?

Corrects an incorrect return type annotation on Qwen3MoeSparseMoeBlock.forward.

The method is annotated as returning tuple[torch.Tensor, torch.Tensor] but actually returns a single reshaped torch.Tensor (see the return final_hidden_states.reshape(...) statement). This type mismatch was identified in issue #45208 and confirmed by maintainer @Rocketknight1.

Fixes #45208

Changes

src/transformers/models/qwen3_moe/modular_qwen3_moe.py: Fix return type annotation from tuple[torch.Tensor, torch.Tensor] to torch.Tensor
src/transformers/models/qwen3_moe/modeling_qwen3_moe.py: Same fix in the generated file (kept in sync with the modular file)

Tests

No functional change — annotation-only fix. Existing tests continue to pass.

This is not a duplicate of PR #45211 (which was opened and closed without review before the maintainer commented on the issue).

Note: This PR was developed with AI assistance (Claude Code). I have reviewed every line and understand the change.

…ock.forward

Rocketknight1 · 2026-04-10T13:27:07Z

@RudrenduPaul can you run make fix-repo to clean up any modular/modeling issues and propagate the fix, then ping me for final review? Thank you!

RudrenduPaul · 2026-04-11T06:14:12Z

Hi @Rocketknight1 — ran make fix-repo (using python3 utils/checkers.py directly since the python alias wasn't available). All the key consistency checks passed with no changes needed:

✅ auto_mappings — OK
✅ doc_toc — OK
✅ copies — no # Copied from drift detected
✅ modular_conversion — no generated-file divergence
✅ dummies — OK
✅ pipeline_typing — OK

(3 checks failed due to missing local packages — ruff, gitpython, setuptools — these are env setup issues on my machine, not repo inconsistencies.)

The branch is also up to date with main (the upstream sync from the 'Update branch' button is already included). Ready for your final review whenever you're free — thanks!

…rated vl_moe and omni_moe files Built by Rudrendu Paul, developed with Claude Code

RudrenduPaul · 2026-04-11T06:43:33Z

@Rocketknight1 — follow-up on the check_repository_consistency failure.

The CI diff showed the modular conversion of Qwen3VLMoeTextSparseMoeBlock and Qwen3OmniMoeThinkerTextSparseMoeBlock (both inherit from Qwen3MoeSparseMoeBlock via pass) would generate -> torch.Tensor, while the actual generated modeling files still had -> tuple[torch.Tensor, torch.Tensor].

Just pushed a second commit propagating the fix to:

modeling_qwen3_vl_moe.py — Qwen3VLMoeTextSparseMoeBlock.forward
modeling_qwen3_omni_moe.py — Qwen3OmniMoeThinkerTextSparseMoeBlock.forward

(Left Qwen3OmniMoeTalkerTextSparseMoeBlock untouched — it inherits from Qwen2MoeSparseMoeBlock, not Qwen3.)

Should be ready for final review now!

github-actions · 2026-04-11T06:44:31Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: qwen3_moe, qwen3_omni_moe, qwen3_vl_moe

Rocketknight1

Yep, LGTM now!

HuggingFaceDocBuilderDev · 2026-04-13T14:01:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…ock.forward (huggingface#45352) * fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBlock.forward * fix: propagate Qwen3MoeSparseMoeBlock forward return type fix to generated vl_moe and omni_moe files Built by Rudrendu Paul, developed with Claude Code --------- Co-authored-by: Rudrendu <RudrenduPaul@users.noreply.github.com>

RudrenduPaul and others added 3 commits April 9, 2026 14:53

fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBl…

7ee7bec

…ock.forward

Merge branch 'main' into fix/qwen3moe-return-type-annotation

aab4a36

Merge branch 'main' into fix/qwen3moe-return-type-annotation

910be66

Merge branch 'main' into fix/qwen3moe-return-type-annotation

de0f1ca

fix: propagate Qwen3MoeSparseMoeBlock forward return type fix to gene…

f0cefbe

…rated vl_moe and omni_moe files Built by Rudrendu Paul, developed with Claude Code

Rocketknight1 approved these changes Apr 13, 2026

View reviewed changes

Rocketknight1 enabled auto-merge April 13, 2026 13:50

Rocketknight1 added this pull request to the merge queue Apr 13, 2026

Merged via the queue into huggingface:main with commit 357f414 Apr 13, 2026
21 checks passed

KbKuuhaku mentioned this pull request Apr 19, 2026

[Qwen3MoE] Potentially a bug on Qwen3MoeSparseMoeBlock #45208

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBlock.forward#45352

fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBlock.forward#45352
Rocketknight1 merged 5 commits intohuggingface:mainfrom
RudrenduPaul:fix/qwen3moe-return-type-annotation

RudrenduPaul commented Apr 9, 2026

Uh oh!

Rocketknight1 commented Apr 10, 2026

Uh oh!

RudrenduPaul commented Apr 11, 2026

Uh oh!

RudrenduPaul commented Apr 11, 2026

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

Rocketknight1 left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RudrenduPaul commented Apr 9, 2026

What does this PR do?

Changes

Tests

Uh oh!

Rocketknight1 commented Apr 10, 2026

Uh oh!

RudrenduPaul commented Apr 11, 2026

Uh oh!

RudrenduPaul commented Apr 11, 2026

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

Rocketknight1 left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants