[Qwen3MoE] Fix wrong return type annotation in Qwen3MoeSparseMoeBlock.forward by matdou · Pull Request #45211 · huggingface/transformers

matdou · 2026-04-03T07:44:32Z

What does this PR do?

This PR corrects an incorrect return type in Qwen3MoeSparseMoeBlock.forward.

The method was annotated as returning tuple[torch.Tensor, torch.Tensor], while the implementation returns a torch.Tensor:

return final_hidden_states.reshape(batch_size, sequence_length, hidden_dim)

In particular, downstream usage (e.g. in Qwen3MoeDecoderLayer) expects a tensor:

hidden_states = self.mlp(hidden_states)
hidden_states = residual + hidden_states

This PR updates the annotation to:

-> torch.Tensor

in both:

modeling_qwen3_moe.py
modular_qwen3_moe.py

No functional changes are introduced, this is a typing correction only.

Checklist

I confirm that this is not a pure code agent PR.
This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@ArthurZucker @Cyrilvallez

github-actions · 2026-04-03T08:11:54Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: qwen3_moe, qwen3_omni_moe, qwen3_vl_moe

….forward Fixes huggingface#45208

…e return type fix

Rocketknight1 · 2026-04-08T12:28:49Z

No drive-by code agent PRs on other people's issues please!

matdou added 2 commits April 3, 2026 10:16

[Qwen3MoE] Fix wrong return type annotation on Qwen3MoeSparseMoeBlock…

e8fea51

….forward Fixes huggingface#45208

Regenerate qwen3_omni_moe and qwen3_vl_moe modeling files to propagat…

2c36a74

…e return type fix

matdou force-pushed the fix/qwen3-moe-sparse-moe-block-return-type branch from a0a6199 to 2c36a74 Compare April 3, 2026 08:16

Rocketknight1 closed this Apr 8, 2026

matdou deleted the fix/qwen3-moe-sparse-moe-block-return-type branch April 8, 2026 19:20

RudrenduPaul mentioned this pull request Apr 9, 2026

fix(qwen3_moe): correct return type annotation on Qwen3MoeSparseMoeBlock.forward #45352

Merged

KbKuuhaku mentioned this pull request Apr 19, 2026

[Qwen3MoE] Potentially a bug on Qwen3MoeSparseMoeBlock #45208

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Qwen3MoE] Fix wrong return type annotation in Qwen3MoeSparseMoeBlock.forward#45211

[Qwen3MoE] Fix wrong return type annotation in Qwen3MoeSparseMoeBlock.forward#45211
matdou wants to merge 2 commits intohuggingface:mainfrom
matdou:fix/qwen3-moe-sparse-moe-block-return-type

matdou commented Apr 3, 2026

Uh oh!

github-actions Bot commented Apr 3, 2026

Uh oh!

Rocketknight1 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

matdou commented Apr 3, 2026

What does this PR do?

Checklist

Who can review?

Uh oh!

github-actions Bot commented Apr 3, 2026

Uh oh!

Rocketknight1 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants