Commit 1e1aca0
authored
ggml-webgpu: Improve prefill speeds for k-quants + refactor matmul for Q4/Q5/Q8 and k-quants (ggml-org#24225)
* ggml-webgpu: Improve prefill speeds + refactor matmul for quants
* Fixes for editroconfig checker1 parent 7d2b45b commit 1e1aca0
1 file changed
Lines changed: 267 additions & 543 deletions
0 commit comments