[None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo… by nv-guomingz · Pull Request #15547 · NVIDIA/TensorRT-LLM

nv-guomingz · 2026-06-23T13:22:52Z

…r Qwen3.5

On NVL multi-node systems, AllReduce must be given dtype at construction so it can build the MNNVL all-reduce path (its Lamport workspace is sized by the dtype's element size). If dtype is omitted, mnnvl_allreduce is None, so the op falls back to the generic NCCL all-reduce across nodes — functionally correct but lower performance than the NVLink-fabric MNNVL path.

…r Qwen3.5 Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>

nv-guomingz · 2026-06-23T13:24:58Z

/bot run --disable-fail-fast

coderabbitai · 2026-06-23T13:26:58Z

📝 Walkthrough

Walkthrough

In modeling_qwen3_next.py, three AllReduce constructor calls are updated to pass dtype=config.torch_dtype explicitly. The affected constructors are in Qwen3NextSparseMoeBlock, Qwen3NextLinearDecoderLayer, and Qwen3NextFullAttentionDecoderLayer.

Changes

AllReduce dtype propagation in Qwen3Next

Layer / File(s)	Summary
Pass dtype to AllReduce in all three decoder components `tensorrt_llm/_torch/models/modeling_qwen3_next.py`	`AllReduce` construction in `Qwen3NextSparseMoeBlock` (line 128), `Qwen3NextLinearDecoderLayer` (line 350), and `Qwen3NextFullAttentionDecoderLayer` (line 515) now each supply `dtype=config.torch_dtype`; previously `dtype` was omitted from all three calls.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Description check	⚠️ Warning	The pull request description provides sufficient context about the technical rationale for the changes but lacks required sections from the template.	Add missing template sections: provide a clear short description, explicitly list test coverage validation, and complete the PR checklist items required for this repository.
Title check	❓ Inconclusive	The title is truncated and appears incomplete ('fo…' suggests the text was cut off). While it references the main change (passing dtype to AllReduce), the truncation makes it unclear and prevents full assessment of clarity.	Complete the pull request title to fully convey the change. A complete title might be: '[fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce for Qwen3' or similar, ensuring all key information is visible.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands.}

tensorrt-cicd · 2026-06-23T13:31:24Z

PR_Github #55247 [ run ] triggered by Bot. Commit: cc449b3 Link to invocation

None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo…

cc449b3

…r Qwen3.5 Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>

nv-guomingz requested a review from a team as a code owner June 23, 2026 13:22

nv-guomingz requested a review from Wanli-Jiang June 23, 2026 13:22

github-actions Bot assigned nv-guomingz Jun 23, 2026

nv-guomingz changed the title ~~None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo…~~ [None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo… Jun 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo…#15547

[None][fix] Pass dtype to AllReduce ctor to enable MNNVL all-reduce fo…#15547
nv-guomingz wants to merge 1 commit into
NVIDIA:mainfrom
nv-guomingz:user/guomingz/fix-mnnvl-qwen3.5

nv-guomingz commented Jun 23, 2026 •

edited

Loading

Uh oh!

nv-guomingz commented Jun 23, 2026

Uh oh!

coderabbitai Bot commented Jun 23, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

tensorrt-cicd commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nv-guomingz commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nv-guomingz commented Jun 23, 2026

Uh oh!

coderabbitai Bot commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

tensorrt-cicd commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nv-guomingz commented Jun 23, 2026 •

edited

Loading

coderabbitai Bot commented Jun 23, 2026 •

edited

Loading