Reduce allocation overhead in quantized sdpa #4137
| Job | Run time |
|---|---|
| 22m 9s | |
| 21m 54s | |
| 21m 56s | |
| 22m 28s | |
| 21m 35s | |
| 21m 25s | |
| 21m 45s | |
| 19m 27s | |
| 19m 34s | |
| 25m 53s | |
| 18m 1s | |
| 34m 7s | |
| 23m 21s | |
| 36m 12s | |
| 24m 22s | |
| 24m 42s | |
| 36m 43s | |
| 34m 30s | |
| 7m 54s | |
| 33m 51s | |
| 23m 43s | |
| 2s | |
| 0s | |
| 8h 35m 34s |
| Job | Run time |
|---|---|
| 22m 9s | |
| 21m 54s | |
| 21m 56s | |
| 22m 28s | |
| 21m 35s | |
| 21m 25s | |
| 21m 45s | |
| 19m 27s | |
| 19m 34s | |
| 25m 53s | |
| 18m 1s | |
| 34m 7s | |
| 23m 21s | |
| 36m 12s | |
| 24m 22s | |
| 24m 42s | |
| 36m 43s | |
| 34m 30s | |
| 7m 54s | |
| 33m 51s | |
| 23m 43s | |
| 2s | |
| 0s | |
| 8h 35m 34s |