Commit deb0e3a
Address review: gate fix on is_input_quantized; expand tests
@meenchen: the unconditional input_quantizer.enable() in the
uncalibrated branch wrongly turned on input quantization for weight-only
AWQ configs (e.g. INT4_AWQ_CFG, where the user's config sets
*input_quantizer enable=False and setup() therefore never disabled it).
Gate the entire postprocess block — per-channel-amax collapse and
enable() — behind module.awq_lite.is_input_quantized so weight-only
configs are untouched.
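
The gating described above can be illustrated with a small toy model. This is only a sketch: `ToyQuantizer`, `ToyAWQState`, `ToyLinear`, and `postprocess_uncalibrated` are hypothetical stand-ins invented for illustration, not the library's classes or the actual diff. Only the attribute path `module.awq_lite.is_input_quantized` and the two gated steps (per-channel-amax collapse, then `enable()`) come from the commit message.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ToyQuantizer:
    """Hypothetical stand-in for an input quantizer (not the library class)."""

    enabled: bool = False
    amax: Optional[List[float]] = None  # per-channel values, or one value once collapsed
    axis: Optional[int] = 0  # None means per-tensor (scalar) amax

    def enable(self) -> None:
        self.enabled = True


@dataclass
class ToyAWQState:
    """Hypothetical holder for the flag named in the commit message."""

    is_input_quantized: bool


@dataclass
class ToyLinear:
    awq_lite: ToyAWQState
    input_quantizer: ToyQuantizer = field(default_factory=ToyQuantizer)


def postprocess_uncalibrated(module: ToyLinear) -> None:
    """Postprocess a linear layer that saw no calibration data (assumed flow)."""
    # Gate the whole block: weight-only configs must be left untouched.
    if not module.awq_lite.is_input_quantized:
        return
    quantizer = module.input_quantizer
    if quantizer.amax is not None:
        # Collapse per-channel amax to a single scalar-like value.
        quantizer.amax = [max(quantizer.amax)]
        quantizer.axis = None
    quantizer.enable()


# Weight-only case: the quantizer stays disabled and amax is unchanged.
weight_only = ToyLinear(ToyAWQState(False), ToyQuantizer(amax=[1.0, 3.0]))
postprocess_uncalibrated(weight_only)

# Input-quantized case: amax is collapsed and the quantizer is enabled.
act_quant = ToyLinear(ToyAWQState(True), ToyQuantizer(amax=[1.0, 3.0, 2.0]))
postprocess_uncalibrated(act_quant)
```

Putting the early return before both steps, rather than guarding only `enable()`, mirrors the commit's choice to gate the entire block.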
@coderabbitai: strengthen the existing regression test to also assert
the export-critical scalar amax invariant (axis=None, numel==1) when
amax exists, and add a companion test on INT4_AWQ_CFG asserting the
uncalibrated linear's input_quantizer stays disabled.
The NVFP4 test now requires CUDA (dynamic block quantization is
CUDA-only), guarded with pytest.mark.skipif.
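
A CUDA guard of the kind mentioned above might look roughly like the following. It is a sketch under assumptions: the helper `cuda_available`, the marker name `requires_cuda`, and the test name are hypothetical and not taken from the diff; `pytest.mark.skipif` and `torch.cuda.is_available()` are the standard pytest and PyTorch calls.

```python
import importlib.util

import pytest


def cuda_available() -> bool:
    """Return True only if torch is importable and reports a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch

    return torch.cuda.is_available()


# Reusable marker: the decorated test is skipped on machines without CUDA.
requires_cuda = pytest.mark.skipif(
    not cuda_available(),
    reason="NVFP4 dynamic block quantization is CUDA-only",
)


@requires_cuda
def test_nvfp4_uncalibrated_linear():  # hypothetical test name
    # The real test body lives in the diff; this placeholder only shows
    # where the guarded assertions would go.
    pass
```

Evaluating the condition once at import time keeps the skip reason visible in the pytest report instead of failing inside the test body.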
Signed-off-by: Chenjie Luo <chenjiel@nvidia.com>
1 parent af1bfd6 commit deb0e3a
2 files changed
Lines changed: 68 additions & 29 deletions