Skip to content

Commit 73d8740

Browse files
author
Sven
authored
Add SeerAttention and SlimAttention Paper (#135)
* Add slim-attention: transform KV-cache to K cache only Signed-off-by: sven <svenzhang@live.com> * Add SeerAttention: learnable sparse attention like NSA(deepseek) MoBA Signed-off-by: sven <svenzhang@live.com> --------- Signed-off-by: sven <svenzhang@live.com>
1 parent 8a0ae90 commit 73d8740

1 file changed

Lines changed: 305 additions & 303 deletions

File tree

0 commit comments

Comments
 (0)