feat: support Qwen3.5-VL model on npu device[6/N]. by yingxudeng · Pull Request #1212 · jd-opensource/xllm

yingxudeng · 2026-04-07T16:57:02Z

No description provided.

gemini-code-assist

Code Review

This pull request adds support for the Qwen3.5 VL model, including updates for linear attention, multimodal Rotary Positional Embeddings (mRoPE), and deepstack processing. It also refactors model type detection and enhances NPU attention layers. Review feedback highlights a critical compilation error from modifying a constant reference, a memory allocation mismatch in the KV cache, and a style guide violation regarding anonymous namespaces. Additionally, suggestions were made to fix hardcoded logic in RoPE and deepstack retrieval to ensure safety and configuration compliance.

…n llm backend.

wly-115 · 2026-04-10T07:13:47Z

torch::Tensor deepstack_process(torch::Tensor hidden_states, torch::Tensor visual_pos_masks, const torch::Tensor& visual_embeds)
I remember Qwen 3.5 already removed DeepStack.

gemini-code-assist Bot reviewed Apr 7, 2026

View reviewed changes

Comment thread xllm/models/vlm/qwen3_5_vl.h

Comment thread xllm/core/distributed_runtime/vlm_engine.cpp

Comment thread xllm/core/layers/common/rotary_embedding_util.cpp

Comment thread xllm/core/layers/npu_torch/qwen3_next_attention.cpp

Comment thread xllm/models/vlm/qwen3_5_vl.h

yingxudeng changed the title ~~feat: support Qwen3.5-VL model on npu device.~~ feat: support Qwen3.5-VL model on npu device[6/N]. Apr 8, 2026

yingxudeng marked this pull request as ready for review April 8, 2026 08:34

yingxudeng requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners April 8, 2026 08:34

yingxudeng force-pushed the feat/qwen35_video_2_ok_3_ing_2 branch from f137ab2 to 8f60c20 Compare April 9, 2026 17:37

yingxudeng added 2 commits April 10, 2026 14:11

feat: support Qwen3.5-VL model on npu device.

2650e93

bugfix: fix Qwen3.5 multimodal checkpoint loading and runtime flags o…

9ddb0d7

…n llm backend.

yingxudeng force-pushed the feat/qwen35_video_2_ok_3_ing_2 branch from 8f60c20 to 9ddb0d7 Compare April 10, 2026 06:13

wly-115 self-requested a review April 10, 2026 07:11

yingxudeng closed this Apr 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support Qwen3.5-VL model on npu device[6/N].#1212

feat: support Qwen3.5-VL model on npu device[6/N].#1212
yingxudeng wants to merge 2 commits intojd-opensource:mainfrom
yingxudeng:feat/qwen35_video_2_ok_3_ing_2

yingxudeng commented Apr 7, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wly-115 commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yingxudeng commented Apr 7, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wly-115 commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants