Skip to content

sparse V: skip negligible attention weights across all backends#98

Closed
TheTom wants to merge 2 commits intofeature/turboquant-kv-cachefrom
feature/sparse-v-metal
Closed

sparse V: skip negligible attention weights across all backends#98
TheTom wants to merge 2 commits intofeature/turboquant-kv-cachefrom
feature/sparse-v-metal

Commits

Commits on May 1, 2026