Skip to content

Commit 1e1aca0

Browse files
authored
ggml-webgpu: Improve prefill speeds for k-quants + refactor matmul for Q4/Q5/Q8 and k-quants (ggml-org#24225)
* ggml-webgpu: Improve prefill speeds + refactor matmul for quants * Fixes for editroconfig checker
1 parent 7d2b45b commit 1e1aca0

1 file changed

Lines changed: 267 additions & 543 deletions

File tree

0 commit comments

Comments
 (0)