Commit ec562eb
opencl: add q5_0 and q5_1 MoE for Adreno (#22985)
* opencl: add q5_0 moe support
* opencl: add q5_1 moe support
* opencl: avoid potential leak
* opencl: suppress unused var warning when building for non-Adreno
---------
Co-authored-by: Li He <lih@qti.qualcomm.com>1 parent 95d469a commit ec562eb
7 files changed
Lines changed: 1914 additions & 67 deletions
File tree
- ggml/src/ggml-opencl
- kernels
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
109 | 113 | | |
110 | 114 | | |
111 | 115 | | |
| |||
0 commit comments