Guard blocksize=64 quantize instantiations for warp size compatibility
On AMD CDNA GPUs (warp size 64), blocksize=64 would leave the quantize
kernels with only 1 thread per warp, which is incompatible. Wrap these
instantiations in #if BNB_WARP_SIZE == 32 so they compile only when the
warp size is 32, as on NVIDIA.
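A minimal sketch of the guard pattern, not the actual diff. It assumes
the build defines BNB_WARP_SIZE; quantize_sketch and kHasBlocksize64
are hypothetical stand-ins for the real kernel templates, and the
function body is a placeholder rather than real quantization.

```cpp
// Assumption: the build system defines BNB_WARP_SIZE (32 or 64).
// Default to 32 here so the sketch compiles on its own.
#ifndef BNB_WARP_SIZE
#define BNB_WARP_SIZE 32
#endif

// Hypothetical stand-in for a quantize routine templated on block size.
// Placeholder body: it only records which block each element falls in.
template <typename T, int BLOCK_SIZE>
void quantize_sketch(const T* in, unsigned char* out, int n) {
    (void)in;  // unused in this placeholder
    for (int i = 0; i < n; ++i) {
        out[i] = static_cast<unsigned char>(i / BLOCK_SIZE);
    }
}

// Block sizes that are emitted for every warp size.
template void quantize_sketch<float, 128>(const float*, unsigned char*, int);

// blocksize=64 is emitted only for 32-wide warps.
#if BNB_WARP_SIZE == 32
template void quantize_sketch<float, 64>(const float*, unsigned char*, int);
constexpr bool kHasBlocksize64 = true;
#else
constexpr bool kHasBlocksize64 = false;
#endif
```

Because the guard sits on the explicit instantiation, a 64-wide-warp
build never emits the blocksize=64 symbol, so any caller that still
requests it must be guarded the same way or it will fail at link time.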
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>