Skip to content

Commit bb681dc

Browse files
committed
update
1 parent d6dfc97 commit bb681dc

1 file changed

Lines changed: 3 additions & 1 deletion

File tree

_articles/LightX2VPlatform.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ tags: [Deploy, Multi-Platform Deployment, Non-Nvidia Platform Deployment]
88

99
Video generation inference has long been tightly coupled to the NVIDIA CUDA ecosystem. FlashAttention, cuBLAS, and NCCL are deeply embedded in the hot path of DiT inference. When deploying LightX2V on domestic or alternative AI accelerators—Cambricon MLU, Ascend NPU, Hygon DCU, MetaX, AMD ROCm, and others—the challenge is not just "make PyTorch run," but **aligning every performance-critical operator** (Attention, quantized MatMul, RMSNorm, RoPE, etc.) with the chip vendor's native kernel APIs.
1010

11-
`lightx2v_platform` is a **standalone functional layer** decoupled from the core `lightx2v` inference engine. Its job is to unify inference interfaces across non-NVIDIA chip backends. To support a new accelerator, you only need to implement the corresponding device abstraction and operator kernels inside `lightx2v_platform`—the upper-level model runners, schedulers, and pipeline logic remain unchanged.
11+
[`lightx2v_platform`](https://github.com/ModelTC/LightX2V/tree/main/lightx2v_platform) is a **standalone functional layer** decoupled from the core `lightx2v` inference engine. Its job is to unify inference interfaces across non-NVIDIA chip backends. To support a new accelerator, you only need to implement the corresponding device abstraction and operator kernels inside `lightx2v_platform`—the upper-level model runners, schedulers, and pipeline logic remain unchanged.
1212

1313
**Table of contents:**
1414

@@ -49,6 +49,8 @@ The result: LightX2V's upper layers always call the same interface (`AttnWeightT
4949

5050
![lightx2v_platform architecture overview]({{ site.baseurl }}/assets/LightX2VPlatform/platform_img1.png)
5151

52+
Source code: [`lightx2v_platform`](https://github.com/ModelTC/LightX2V/tree/main/lightx2v_platform)
53+
5254
The module has two main parts:
5355

5456
- **`base/`** — Device abstraction. Each chip backend registers a `*Device` class that handles device initialization, availability checks, device name resolution, and distributed backend setup (e.g. NCCL for CUDA, CNCL for MLU, HCCL for NPU).

0 commit comments

Comments
 (0)