Skip to content

Add NVFP4 per-token quantization recipe#3045

Draft
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-per-token-recipe
Draft

Add NVFP4 per-token quantization recipe#3045
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-per-token-recipe

Commits

Commits on May 26, 2026