feat(sparsity): add VecAttention sparse prefill for VLM by anminliu · Pull Request #320 · Tencent/AngelSlim

anminliu · 2026-05-28T06:21:51Z

Integrate VecAttention into AngelSlim as a sparse attention method for Vision-Language Models (Qwen2.5-VL).

Add vecattention subpackage under compressor/sparsity/
Add vllm-flash-attention as git submodule for sparse_attn_func kernel
Add Triton kernels for MinP threshold selection and query pooling
Add run_vecattention.py tool for image/video inference

Integrate VecAttention into AngelSlim as a sparse attention method for Vision-Language Models (Qwen2.5-VL). - Add vecattention subpackage under compressor/sparsity/ - Add vllm-flash-attention as git submodule for sparse_attn_func kernel - Add Triton kernels for MinP threshold selection and query pooling - Add run_vecattention.py tool for image/video inference

yghstill previously approved these changes May 28, 2026

View reviewed changes

anminliu dismissed yghstill’s stale review via 033eb00 May 28, 2026 13:06

style: add copyright header for VecAttention

3197960

anminliu force-pushed the dev_vecattention branch from 033eb00 to 3197960 Compare May 28, 2026 13:10

anminliu requested a review from yghstill May 28, 2026 14:20

yghstill approved these changes May 29, 2026

View reviewed changes

anminliu merged commit bec53be into Tencent:main May 29, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sparsity): add VecAttention sparse prefill for VLM#320

feat(sparsity): add VecAttention sparse prefill for VLM#320
anminliu merged 2 commits into
Tencent:mainfrom
anminliu:dev_vecattention

anminliu commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

anminliu commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants