Skip to content

Commit 905b4d6

Browse files
authored
Merge pull request #11 from ModelTC/wan22
update
2 parents 44bc588 + 09b3997 commit 905b4d6

1 file changed

Lines changed: 2 additions & 0 deletions

File tree

_articles/Wan22-NVFP4-Sparse.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,8 @@ date: 2026-06-09
66
tags: [Wan2.2, NVFP4, Sparse Attention, Video Generation]
77
---
88

9+
[![HuggingFace](https://img.shields.io/badge/HuggingFace-Wan2.2--NVFP4--Sparse-yellow)](https://huggingface.co/lightx2v/Wan2.2-NVFP4-Sparse)
10+
911
Video generation has received broad attention in recent years, driven by the impressive visual quality and motion consistency of models such as Wan, Sora, Seedance, and other large-scale diffusion Transformers. These models have made it possible to generate high-resolution, temporally coherent videos from text or image prompts, opening up new workflows for creative production, simulation, advertising, and interactive content.
1012

1113
However, the same capabilities also make modern video generation extremely resource intensive. For 14B video DiT models, inference can easily become impractical on consumer GPUs because both latency and memory usage scale aggressively with model size, video resolution, frame count, and sequence length. In practice, running a large model such as Wan2.2-A14B on a single consumer GPU is challenging without a carefully optimized inference stack.

0 commit comments

Comments
 (0)