Commit 79a8168
Use unfused SDPA for short sequences (q_len <= 128 or kv_len <= 128)
ATT
Differential Revision: [D96044308](https://our.internmc.facebook.com/intern/diff/D96044308/)
ghstack-source-id: 361224789
Pull Request resolved: #186511

parent c234cd2, commit 79a8168
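The changed lines themselves are not reproduced on this page, so the following is only a minimal sketch of the dispatch condition named in the commit title, not the code from this commit. The names `SHORT_SEQ_THRESHOLD`, `use_unfused_sdpa`, and `unfused_sdpa` are hypothetical. The "unfused" path is shown as the textbook three-step attention (scores, softmax, weighted sum) on plain Python lists.

```python
import math

# Threshold taken from the commit title; the constant name is hypothetical.
SHORT_SEQ_THRESHOLD = 128


def use_unfused_sdpa(q_len: int, kv_len: int,
                     threshold: int = SHORT_SEQ_THRESHOLD) -> bool:
    """Return True when either sequence length is at or below the threshold."""
    return q_len <= threshold or kv_len <= threshold


def unfused_sdpa(q, k, v):
    """Reference unfused attention: softmax(q @ k^T / sqrt(d)) @ v.

    q is [q_len][d], k is [kv_len][d], v is [kv_len][d_v],
    all given as plain lists of floats.
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for q_row in q:
        # Step 1: scaled dot-product scores against every key row.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row))
                  for k_row in k]
        # Step 2: numerically stable softmax over the key axis.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Step 3: weighted sum of the value rows.
        out.append([sum(w * v_row[j] for w, v_row in zip(weights, v))
                    for j in range(len(v[0]))])
    return out
```

Under this reading, a call with `q_len=128, kv_len=4096` would take the unfused path, while `q_len=129, kv_len=129` would stay on the fused path. Whether the real change compares against the same lengths in the same way is an assumption based solely on the title.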
1 file changed: 7 additions, 1 deletion

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 412 | 412 | |
| 413 | 413 | |
| 414 | 414 | |
| 415 | | - |
| | 415 | + |
| | 416 | + |
| | 417 | + |
| | 418 | + |
| | 419 | + |
| | 420 | + |
| | 421 | + |
| 416 | 422 | |
| 417 | 423 | |
| 418 | 424 | |