Commit 1ec7ba0
opencl: add q4_1 MoE for Adreno (ggml-org#22856)
* Q4_1 MoE CLC pass sanity check
* remove unnecessary code
* opencl: remove unnecessary asserts and reformat
* opencl: fix supports_op for q4_1 moe
* q4_1 moe is supported by Adreno with certain shapes
---------
Co-authored-by: Li He <lih@qti.qualcomm.com>1 parent 8e1f9d0 commit 1ec7ba0
5 files changed
Lines changed: 798 additions & 33 deletions
File tree
- ggml/src/ggml-opencl
- kernels
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
104 | 104 | | |
105 | 105 | | |
106 | 106 | | |
| 107 | + | |
| 108 | + | |
107 | 109 | | |
108 | 110 | | |
109 | 111 | | |
| |||
0 commit comments