feat: add MUSA support for FlexKV by superleo · Pull Request #126 · taco-project/FlexKV

superleo · 2026-03-23T06:45:19Z

Description

This PR adds support for MUSA as an alternative backend to CUDA in FlexKV.

Summary

Backend abstraction: Introduces gpu_backend.py and gpu_runtime.py so Python code uses a single dispatch layer instead of scattered torch.cuda calls. This keeps the CUDA path unchanged and allows MUSA to be added without #ifdef in existing CUDA sources.
MUSA C++ extension: Adds a parallel MUSA implementation under csrc/musa/:
- Transfer kernels (transfer_musa.mu)
- GDS manager and layout transform for MUSA
- Thread groups for transfer and GDS
- Python bindings (bindings_musa.cpp)
Build system: Adds build_config.py and updates setup.py to support conditional MUSA builds via FLEXKV_USE_MUSA=1. CUDA and MUSA extensions can be built independently.
Integration: Updates memory_handle.py, worker.py, allocator.py, and the vLLM/TensorRT-LLM adapters to use the GPU runtime abstraction.
Documentation: Adds docs/musa/musa_support_system_design.md and docs/musa/musa_test_plan.md.
Tests: Adds tests for backend dispatch, GPU runtime, MUSA build, and MUSA transfer.

Design principles

Same API shape as CUDA (musa* types/functions, mcc compiler)
No changes to existing CUDA code paths
Backend abstraction first, then MUSA wiring

Testing

tests/test_gpu_backend_dispatch.py – backend selection

tests/test_gpu_runtime.py – runtime abstraction

tests/test_musa_build.py – MUSA build (when FLEXKV_USE_MUSA=1)

tests/test_transfer_musa.py – MUSA transfer behavior

- Add GPU backend abstraction layer (gpu_backend.py, gpu_runtime.py) for dispatching between CUDA and MUSA - Implement MUSA C++ extension (csrc/musa/): transfer kernels, GDS manager, layout transform, thread groups - Add build_config.py and extend setup.py for conditional MUSA build (FLEXKV_USE_MUSA=1) - Modify memory_handle.py, worker.py, allocator.py to use gpu_runtime for backend-agnostic stream/device/memory operations - Update vLLM and TensorRT-LLM adapters for backend dispatch - Add requirements-musa.txt and MUSA build/test documentation - Add tests: gpu_backend_dispatch, gpu_runtime, musa_build, transfer_musa

YconquestY · 2026-03-24T08:46:39Z

Hi @superleo. We appreciate your contribution :) Please wait for a moment. We are designing official abstraction and API for integrating variaous AI accelerators.

cc @linhu-nv @feiqiangs

YconquestY requested review from YconquestY and linhu-nv March 24, 2026 08:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add MUSA support for FlexKV#126

feat: add MUSA support for FlexKV#126
superleo wants to merge 1 commit into
taco-project:mainfrom
superleo:main

superleo commented Mar 23, 2026

Uh oh!

YconquestY commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

superleo commented Mar 23, 2026

Description

Summary

Design principles

Testing

Uh oh!

YconquestY commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants