[feat] Upstream Attn-QAT Video Diffusion Code by RandNMR73 · Pull Request #1225 · hao-ai-lab/FastVideo

RandNMR73 · 2026-04-09T04:14:40Z

Summary

Add Attn-QAT training kernels
Add flashinfer NVFP4 linear layers (currently hardcoded for Wan-2.1 arch)
Add Modified SageAttention3 kernels in the fastvideo-kernel package
Add Attn-QAT video model training scripts

Remove explicit_package_bases setting from mypy configuration

jzhang38 and others added 30 commits December 20, 2025 11:09

+baseline

da3f43c

+ fp4 linear + fp4 attn, all with 16-bit bwd

85553c2

+ generator sage 3

6a58c3a

update

9161ef6

update

8abfe23

stash

a1ab4a7

update

2a28c08

update

032016f

update

c47be5c

update

fa91001

save

130b466

1005 morning

84ed3ce

update

18aebfa

add real and fake quant precision tests

c137c9f

fake quant done

3a9e4d3

save

b628b5f

checkpoint

ebdb160

nvfp4 utils in progress

9b12552

add inference repo

2fa6fc1

fix DeepGEMM path

fe9e94e

fix DeepGEMM path

6a7f08e

qat attn in progress + refactor nvfp4 utils

b06c038

fix import

d41912c

checkpoint (qat attn in progress)

9bcc411

fix masking + causal and non-causal logic in qat attn

3b70702

5090 testing

7afe836

add SageAttn3 with QAT

8d68a4f

print sageattn file

3641487

adjust sage3 block size to 64x64

460df09

adjus quant kernel block size

52480b2

RandNMR73 added 27 commits April 9, 2026 07:16

fix

e4a7074

fix

1d3ff73

fix

24d3f97

fix

a6a1efd

fix

5230aaa

fix

6e1b285

fix

6ccc603

fix

5fab762

fix

eccf1a9

fix

10d99ea

fix

bf39c83

fix

952b702

fix

2619107

fix

b46ce2c

fix

ee775e8

fix

cb5eb77

fix

a3a4e04

fix

88f9ac1

all tests passing

16d8874

fix

15bda19

fix

b978a92

fix

ca96349

fix

aafa257

fix

12a993e

fix

6b42166

fix

52e3484

fix

42a292e

jzhang38 approved these changes Apr 10, 2026

View reviewed changes

RandNMR73 added 2 commits April 10, 2026 19:40

Update .gitignore

567dd87

Remove explicit_package_bases from mypy settings

3f818d0

Remove explicit_package_bases setting from mypy configuration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feat] Upstream Attn-QAT Video Diffusion Code#1225

[feat] Upstream Attn-QAT Video Diffusion Code#1225
RandNMR73 wants to merge 203 commits intomainfrom
sync-branch

RandNMR73 commented Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

Conversation

RandNMR73 commented Apr 9, 2026

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants