|
17 | 17 | [](https://github.com/AI-Hypercomputer/maxdiffusion/actions/workflows/UnitTests.yml) |
18 | 18 |
|
19 | 19 | # What's new? |
| 20 | +- **`2026/04/16`**: Tokamax Ring Attention kernel is now supported. |
20 | 21 | - **`2026/03/31`**: Wan2.2 SenCache inference is now supported for T2V and I2V (up to 1.4x speedup) |
21 | 22 | - **`2026/03/25`**: Wan2.1 and Wan2.2 Magcache inference is now supported |
22 | 23 | - **`2026/03/25`**: LTX-2 Video Inference is now supported |
@@ -623,6 +624,24 @@ To generate images, run the following command: |
623 | 624 | ... |
624 | 625 | ``` |
625 | 626 |
|
| 627 | +### Ring Attention |
| 628 | +Ring attention is now supported for Wan models. The tables below show end-to-end generation time for one `720p`, 81-frame video (with CFG DP); the faster attention type in each pair is in bold: |
| 629 | +| Accelerator | Model | Attention Type | Inference Steps | Sharding | e2e Generation Time | |
| 630 | +| -- | -- | -- | -- | -- | -- | |
| 631 | +| v7x-8 | WAN 2.1 | Tokamax Flash | 50 | dp2-fsdp1-context4-tp1 | 264.2 | |
| 632 | +| v7x-8 | WAN 2.1 | Tokamax Ring | 50 | dp2-fsdp1-context4-tp1 | **252.4** | |
| 633 | +| v7x-8 | WAN 2.2 | Tokamax Flash | 40 | dp2-fsdp1-context4-tp1 | 212.7 | |
| 634 | +| v7x-8 | WAN 2.2 | Tokamax Ring | 40 | dp2-fsdp1-context4-tp1 | **201.7** | |
| 635 | + |
| 636 | +| Accelerator | Model | Attention Type | Inference Steps | Sharding | e2e Generation Time | |
| 637 | +| -- | -- | -- | -- | -- | -- | |
| 638 | +| v7x-16 | WAN 2.1 | Tokamax Flash | 50 | dp2-fsdp1-context8-tp1 | 146.6 | |
| 639 | +| v7x-16 | WAN 2.1 | Tokamax Ring | 50 | dp2-fsdp1-context8-tp1 | **137.2** | |
| 640 | +| v7x-16 | WAN 2.2 | Tokamax Flash | 40 | dp2-fsdp1-context8-tp1 | **117.8** | |
| 641 | +| v7x-16 | WAN 2.2 | Tokamax Ring | 40 | dp2-fsdp1-context8-tp1 | 137.5 | |
| 642 | + |
| 643 | +(* Ring attention has known stability issues on 16 TPUs; use `tokamax_flash` attention instead.) |
| 644 | + |
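The relative gains in the tables above can be checked with a short script. This is a minimal sketch, not part of maxdiffusion: the helper names (`mesh_axes`, `device_count`, `speedup`) are hypothetical, and it assumes the `Sharding` column encodes per-axis mesh sizes whose product equals the number of devices. The timings are copied from the tables; the source does not state their unit, so the script only reports ratios.

```python
import re
from math import prod


def mesh_axes(label: str) -> dict[str, int]:
    """Split a label such as 'dp2-fsdp1-context4-tp1' into {axis: size}."""
    return {name: int(size) for name, size in re.findall(r"([a-z]+)(\d+)", label)}


def device_count(label: str) -> int:
    """Product of all axis sizes (assumed to equal the device count)."""
    return prod(mesh_axes(label).values())


def speedup(flash_time: float, ring_time: float) -> float:
    """Ratio > 1 means ring attention was faster than flash attention."""
    return flash_time / ring_time


# (accelerator, model, sharding, Tokamax Flash time, Tokamax Ring time)
RESULTS = [
    ("v7x-8", "WAN 2.1", "dp2-fsdp1-context4-tp1", 264.2, 252.4),
    ("v7x-8", "WAN 2.2", "dp2-fsdp1-context4-tp1", 212.7, 201.7),
    ("v7x-16", "WAN 2.1", "dp2-fsdp1-context8-tp1", 146.6, 137.2),
    ("v7x-16", "WAN 2.2", "dp2-fsdp1-context8-tp1", 117.8, 137.5),
]

for accel, model, sharding, flash, ring in RESULTS:
    print(
        f"{accel} {model}: {device_count(sharding)} devices, "
        f"ring vs flash speedup {speedup(flash, ring):.3f}x"
    )
```

A ratio below 1 on the last row reflects the table itself: on v7x-16 with WAN 2.2, Tokamax Flash is the faster option.
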
626 | 645 | ## Flux |
627 | 646 |
|
628 | 647 | First make sure you have permissions to access the Flux repos in Huggingface. |
|