perf: turbo VEC flash attention — +9% decode on CUDA via autoresearch #53

Open
signalnine wants to merge 153 commits into TheTom:feature/turboquant-kv-cache from signalnine:pr/fattn-vec-turbo-opts

perf: turbo VEC flash attention — +9% decode on CUDA via autoresearch

348fb77
labeler: succeeded Apr 9, 2026 in 15s