Add CUDA kernel support for 4-bit quantization with blocksize=32 #1885
