
Enhance CUDA flash attention kernel selection for DKQ=512 with low gq… #6

Open
Ooooze wants to merge 1 commit into feature/turboquant-kv-cache from fix/cuda-mma-dkq512-fallback

Enhance CUDA flash attention kernel selection for DKQ=512 with low gq…

2374b99
labeler: succeeded May 8, 2026 in 11s