Skip to content

voxtral_tts: enable CUDA backend with 4w quantization (Ampere + Blackwell pre-exported artifacts)#19093

Merged
seyeong-han merged 10 commits into
pytorch:mainfrom
seyeong-han:voxtral-tts
Apr 27, 2026
Merged

voxtral_tts: enable CUDA backend with 4w quantization (Ampere + Blackwell pre-exported artifacts)#19093
seyeong-han merged 10 commits into
pytorch:mainfrom
seyeong-han:voxtral-tts

Commits

Commits on Apr 16, 2026

Commits on Apr 17, 2026

Commits on Apr 21, 2026

Commits on Apr 23, 2026