Commit 53fec13
fix: Correct dequant inverse for CUTLASS GEMM convention
The CUTLASS fused quantize GEMM computes A @ R (no transpose on the
rotation matrix R). The dequant inverse must therefore apply R^T, not R.
This was masked with the plain Hadamard (which is symmetric, H = H^T)
but broke with the randomized Hadamard (R ≠ R^T).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 56eac41 commit 53fec13
2 files changed
+8
-7
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
915 | 915 | | |
916 | 916 | | |
917 | 917 | | |
918 | | - | |
919 | | - | |
920 | | - | |
| 918 | + | |
| 919 | + | |
| 920 | + | |
| 921 | + | |
921 | 922 | | |
922 | 923 | | |
923 | 924 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1237 | 1237 | | |
1238 | 1238 | | |
1239 | 1239 | | |
1240 | | - | |
1241 | | - | |
| 1240 | + | |
| 1241 | + | |
1242 | 1242 | | |
1243 | 1243 | | |
1244 | | - | |
1245 | | - | |
| 1244 | + | |
| 1245 | + | |
1246 | 1246 | | |
1247 | 1247 | | |
1248 | 1248 | | |
| |||
0 commit comments