Skip to content

feat: unify parallel dimension abstraction and support sglang DP Attention#170

Draft
zhjc1124 wants to merge 1 commit into
taco-project:feat/layerwise_rebasefrom
zhjc1124:support_pp
Draft

feat: unify parallel dimension abstraction and support sglang DP Attention#170
zhjc1124 wants to merge 1 commit into
taco-project:feat/layerwise_rebasefrom
zhjc1124:support_pp

Conversation

@zhjc1124
Copy link
Copy Markdown
Contributor

@zhjc1124 zhjc1124 commented May 20, 2026

Summary

This PR refactors the parallel dimension abstraction in FlexKV to be framework-agnostic, enabling clean three-way unification across sglang, vLLM, and TensorRT-LLM and introduces full support for sglang --enable-dp-attention mode

Parallel Dimension Model

After this PR, FlexKV's unified 5-D parallel abstraction is:

(instance, DP, PP, CP, TP)
Plane Dimensions Key
Control plane instance × DP dp_client_id = instance_id × dp_size + dp_rank
Worker index instance × DP × PP WorkerKey(dp_client_id, pp_rank)
Worker data plane CP × TP effective_tp_size = cp_size × tp_size

@zhjc1124 zhjc1124 changed the title feat: support sglang DP Attention and unify parallel dimension abstraction feat: unify parallel dimension abstraction and support sglang DP Attention May 20, 2026
@zhjc1124 zhjc1124 force-pushed the support_pp branch 2 times, most recently from ccaecd5 to 6060581 Compare May 20, 2026 08:14
@zhjc1124 zhjc1124 marked this pull request as draft May 21, 2026 04:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant