[Feat] Update sparse method patches for vllm 0.11.0 #638

Merged
Infinite666 merged 2 commits into ModelEngine-Group:develop from AooooooA-C:dev_patch
Jan 22, 2026


@AooooooA-C
Contributor

Purpose

Update the sparse method patches to work with vLLM 0.11.0.

Modifications

Apply the patches to vLLM and vLLM-Ascend using the git apply command.
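
For reference, the apply step could be scripted roughly as follows. This is a minimal sketch, not part of the PR: the helper names `git_apply_cmd` and `apply_patch` and the `vllm` checkout directory are hypothetical, and the only path taken from this PR is the patch file under review. It runs `git apply --check` first so that a patch that does not fit the checkout fails before any file is modified.

```python
from pathlib import Path
import subprocess


def git_apply_cmd(repo_dir, patch_file, check_only=False):
    """Build a `git apply` command for a checkout (hypothetical helper)."""
    cmd = ["git", "-C", str(repo_dir), "apply"]
    if check_only:
        # --check only verifies that the patch applies; it changes nothing.
        cmd.append("--check")
    # `git -C` changes directory first, so pass the patch as an absolute path.
    cmd.append(str(Path(patch_file).resolve()))
    return cmd


def apply_patch(repo_dir, patch_file):
    """Dry-run the patch, then apply it; raises CalledProcessError on failure."""
    subprocess.run(git_apply_cmd(repo_dir, patch_file, check_only=True), check=True)
    subprocess.run(git_apply_cmd(repo_dir, patch_file), check=True)


if __name__ == "__main__":
    # Assumed layout: a vLLM 0.11.0 checkout in ./vllm next to this repository.
    print(git_apply_cmd(
        "vllm",
        "ucm/integration/vllm/patch/0.11.0/vllm-adapt-sparse.patch",
        check_only=True,
    ))
```

The same call would be repeated for the vLLM-Ascend checkout with its own patch file.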

Test

Tested gsa_on_device in a CUDA environment.

Comment thread examples/offline_inference_kvcomphbm.py
Comment thread ucm/integration/vllm/patch/0.11.0/vllm-adapt-sparse.patch Outdated
@AooooooA-C AooooooA-C force-pushed the dev_patch branch 3 times, most recently from 6b340ab to bf3f81a Compare January 21, 2026 08:39
@Infinite666 Infinite666 changed the title from "[Fix] Update sparse method patches for vllm 0.11.0" to "[Feat] Update sparse method patches for vllm 0.11.0" Jan 22, 2026
@Infinite666 Infinite666 merged commit f5261e9 into ModelEngine-Group:develop Jan 22, 2026
3 checks passed

3 participants