Commit 1e796eb
ggml-cpu: add 128-bit RVV implementation for Quantization Vector Dot (ggml-org#20633)
* ggml-cpu: add 128-bit impls for i-quants, ternary quants
* ggml-cpu: add 128-bit impls for iq2_xs, iq3_s, iq3_xxs, tq2_0
Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>
* ggml-cpu: refactor; add rvv checks
---------
Co-authored-by: taimur-10x <taimur.ahmad@10xengineers.ai>
Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>
1 parent 5637536 commit 1e796eb
1 file changed, 902 additions and 70 deletions