Commit 5ea3c89
Remove redundant quant_storage parametrization from test_gemv_4bit
quant_storage is a dtype view over byte-identical packed 4-bit data,
introduced for FSDP sharding compatibility. The gemv kernel reads raw
bytes via void pointer and never branches on B.dtype, so testing 4
quant_storage values exercises the same code path 4 times.
Fix test_gemv_4bit to use only uint8 (the default), reducing from
1,536 to 384 test cases (~18 min saved). Add a dedicated
test_quant_storage_shard_roundtrip that properly tests what
quant_storage is for: verifying bytes survive FSDP-style
flatten/chunk/reassemble with all 4 storage dtypes.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 2d22247 commit 5ea3c89
2 files changed
+38
-6
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1281 | 1281 | | |
1282 | 1282 | | |
1283 | 1283 | | |
1284 | | - | |
1285 | | - | |
1286 | | - | |
1287 | | - | |
1288 | | - | |
1289 | 1284 | | |
1290 | | - | |
| 1285 | + | |
| 1286 | + | |
1291 | 1287 | | |
1292 | 1288 | | |
1293 | 1289 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
248 | 248 | | |
249 | 249 | | |
250 | 250 | | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
251 | 287 | | |
252 | 288 | | |
253 | 289 | | |
| |||
0 commit comments