Fix weight-only quantization for TEGroupedMLP (MoE models) #971

Merged
jenchen13 merged 7 commits into NVIDIA:main from jQizhang:weight_only_te_fix
Apr 3, 2026

Commits

Commits on Mar 12, 2026

Commits on Mar 18, 2026

Commits on Mar 21, 2026

Commits on Mar 23, 2026

Commits on Mar 25, 2026

Commits on Apr 3, 2026