Enhance CUDA flash attention kernel selection for DKQ=512 with low gq…#6
Open
Ooooze wants to merge 1 commit into
Open
Enhance CUDA flash attention kernel selection for DKQ=512 with low gq…#6Ooooze wants to merge 1 commit into
Ooooze wants to merge 1 commit into
background
wait
wait-all
cancel
parallel
Loading