Commit 24cf7d1
feat: Update vq_linear dispatch to use MMA kernel for M=5-16
- Route M<=4 to scalar GEMV, M=5-16 to vq_gemm_prod, M>16 to
dequant+cuBLAS (matching kbit_linear dispatch pattern)
- Update vq_linear_workspace to include C_workspace and tile_counters
- Un-skip MMA test stubs, replace with actual vq_gemm_prod tests
- All 100 VQ tests pass (50 scalar GEMV + 50 dispatch/MMA)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 79ac4dc commit 24cf7d1
2 files changed
+53
-6
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1560 | 1560 | | |
1561 | 1561 | | |
1562 | 1562 | | |
1563 | | - | |
| 1563 | + | |
| 1564 | + | |
1564 | 1565 | | |
1565 | 1566 | | |
1566 | 1567 | | |
| |||
1574 | 1575 | | |
1575 | 1576 | | |
1576 | 1577 | | |
| 1578 | + | |
| 1579 | + | |
1577 | 1580 | | |
1578 | 1581 | | |
1579 | 1582 | | |
| |||
1590 | 1593 | | |
1591 | 1594 | | |
1592 | 1595 | | |
1593 | | - | |
| 1596 | + | |
| 1597 | + | |
| 1598 | + | |
| 1599 | + | |
| 1600 | + | |
| 1601 | + | |
| 1602 | + | |
| 1603 | + | |
| 1604 | + | |
| 1605 | + | |
| 1606 | + | |
| 1607 | + | |
1594 | 1608 | | |
1595 | 1609 | | |
1596 | 1610 | | |
| |||
1619 | 1633 | | |
1620 | 1634 | | |
1621 | 1635 | | |
1622 | | - | |
| 1636 | + | |
1623 | 1637 | | |
| 1638 | + | |
| 1639 | + | |
| 1640 | + | |
1624 | 1641 | | |
1625 | 1642 | | |
1626 | 1643 | | |
1627 | 1644 | | |
| 1645 | + | |
| 1646 | + | |
1628 | 1647 | | |
1629 | 1648 | | |
1630 | 1649 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1203 | 1203 | | |
1204 | 1204 | | |
1205 | 1205 | | |
1206 | | - | |
1207 | 1206 | | |
1208 | | - | |
1209 | | - | |
| 1207 | + | |
| 1208 | + | |
| 1209 | + | |
| 1210 | + | |
| 1211 | + | |
| 1212 | + | |
| 1213 | + | |
| 1214 | + | |
| 1215 | + | |
| 1216 | + | |
| 1217 | + | |
| 1218 | + | |
| 1219 | + | |
| 1220 | + | |
| 1221 | + | |
| 1222 | + | |
| 1223 | + | |
| 1224 | + | |
| 1225 | + | |
| 1226 | + | |
| 1227 | + | |
| 1228 | + | |
| 1229 | + | |
| 1230 | + | |
| 1231 | + | |
| 1232 | + | |
| 1233 | + | |
| 1234 | + | |
| 1235 | + | |
| 1236 | + | |
| 1237 | + | |
0 commit comments