Commit 73d8740
Sven
Add SeerAttention and SlimAttention Paper (#135)
* Add slim-attention: transform KV-cache to K cache only
Signed-off-by: sven <svenzhang@live.com>
* Add SeerAttention: learnable sparse attention like NSA(deepseek) MoBA
Signed-off-by: sven <svenzhang@live.com>
---------
Signed-off-by: sven <svenzhang@live.com>1 parent 8a0ae90 commit 73d8740
1 file changed
Lines changed: 305 additions & 303 deletions
0 commit comments