[feat] add monkey patch for gsa on device v0.9.2 #618
Merged
ygwpz merged 3 commits into ModelEngine-Group:develop on Jan 22, 2026
Infinite666
reviewed
Jan 4, 2026
Contributor
You should clearly specify which test you executed.
Contributor, Author
Describe how this patch was tested.
Infinite666
previously approved these changes
Jan 22, 2026
ygwpz
reviewed
Jan 22, 2026
ygwpz
approved these changes
Jan 22, 2026
Purpose
What this PR does / why we need it?
Add a monkey patch for gsa on device so that the module can be used for profile analysis.
Modifications
Does this PR introduce any user-facing change?
Add a monkey patch for gsa on device so that the module can be used for profile analysis.
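For readers unfamiliar with the term, a monkey patch rebinds an attribute of an existing module or object at runtime, without editing its source. The sketch below illustrates only the general technique and is not the code in this PR. The names `target`, `original_forward`, `patched_forward`, and `apply_patch` are hypothetical, and gating the patch on `VLLM_HASH_ATTENTION` is an assumption based on the test commands, not something this description confirms.

```python
import os
import types


def original_forward(x):
    """Hypothetical stand-in for the function a patch would replace."""
    return f"original({x})"


def patched_forward(x):
    """Hypothetical stand-in for the replacement installed by the patch."""
    return f"patched({x})"


# Hypothetical target object; a real patch would import the actual module.
target = types.SimpleNamespace(forward=original_forward)


def apply_patch(module, environ=os.environ):
    """Rebind module.forward if VLLM_HASH_ATTENTION is "1".

    Returns True when the patch was applied, False otherwise.
    The env-var gate is an assumption made for this illustration.
    """
    if environ.get("VLLM_HASH_ATTENTION") != "1":
        return False
    module.forward = patched_forward
    return True
```

Calling `apply_patch(target, {"VLLM_HASH_ATTENTION": "1"})` rebinds `target.forward`, so later callers reach `patched_forward` without any change to the code that calls it.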
Test
How was this patch tested?
export MODEL_PATH="/home/models/DeepSeek-V2-Lite-Chat"

export VLLM_HASH_ATTENTION=1
python examples/offline_inference_gsa_on_device.py
export MODEL_PATH="/home/models/Qwen3-32B"

export VLLM_HASH_ATTENTION=1
python examples/offline_inference_gsa_on_device.py
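
The two runs above differ only in `MODEL_PATH`, so they could be driven from one small script. This is a hedged sketch, not part of the PR: `build_env` and `run_all` are hypothetical helper names, and the sketch assumes it is launched from the repository root so that the relative script path resolves.

```python
import os
import subprocess
import sys

# Model paths and script path are taken from the test commands above.
MODEL_PATHS = [
    "/home/models/DeepSeek-V2-Lite-Chat",
    "/home/models/Qwen3-32B",
]
SCRIPT = "examples/offline_inference_gsa_on_device.py"


def build_env(model_path, base_env=None):
    """Return a copy of the environment with the two test variables set."""
    env = dict(os.environ if base_env is None else base_env)
    env["MODEL_PATH"] = model_path
    env["VLLM_HASH_ATTENTION"] = "1"
    return env


def run_all(dry_run=True):
    """Build one command per model; execute them only when dry_run is False."""
    commands = []
    for model_path in MODEL_PATHS:
        cmd = [sys.executable, SCRIPT]
        commands.append((model_path, cmd))
        if not dry_run:
            # check=True raises CalledProcessError if the example script fails.
            subprocess.run(cmd, env=build_env(model_path), check=True)
    return commands
```

`run_all()` only lists the commands it would run; `run_all(dry_run=False)` runs the example script once per model.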