Commit 6c59bc7
authored
perf: exl3 decode kernel optimization experiments (#1655)
* perf: exl3 decode kernel optimization experiments
Signed-off-by: AlpinDale <alpindale@gmail.com>
* fix: remove unsafe EXL3 shape overrides
Signed-off-by: AlpinDale <alpindale@gmail.com>
---------
Signed-off-by: AlpinDale <alpindale@gmail.com>1 parent c005aac commit 6c59bc7
4 files changed
Lines changed: 17 additions & 14 deletions
File tree
- aphrodite
- config
- model_executor/layers/quantization
- csrc/quantization/exl3/exllamav3_ext/quant
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
585 | 585 | | |
586 | 586 | | |
587 | 587 | | |
| 588 | + | |
588 | 589 | | |
589 | 590 | | |
590 | 591 | | |
| |||
667 | 668 | | |
668 | 669 | | |
669 | 670 | | |
| 671 | + | |
| 672 | + | |
| 673 | + | |
| 674 | + | |
| 675 | + | |
| 676 | + | |
| 677 | + | |
| 678 | + | |
| 679 | + | |
670 | 680 | | |
671 | 681 | | |
672 | 682 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
184 | 184 | | |
185 | 185 | | |
186 | 186 | | |
| 187 | + | |
| 188 | + | |
187 | 189 | | |
188 | 190 | | |
189 | 191 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
223 | 223 | | |
224 | 224 | | |
225 | 225 | | |
226 | | - | |
| 226 | + | |
Lines changed: 4 additions & 13 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
159 | 159 | | |
160 | 160 | | |
161 | 161 | | |
162 | | - | |
163 | | - | |
164 | | - | |
165 | | - | |
166 | | - | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
175 | 166 | | |
176 | 167 | | |
177 | 168 | | |
| |||
0 commit comments