Skip to content

Commit 5658381

Browse files
committed
Drop default layerwise_checkpoint_dir from max PTQ recipe
Max calibration is fast enough that checkpointing each layer adds unnecessary I/O and disk usage. Comment explains why it is omitted. Signed-off-by: realAsma <akuriparambi@nvidia.com>
1 parent e0bdc73 commit 5658381

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

modelopt_recipes/general/ptq/nvfp4_experts_only-fp8_kv.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,8 @@ metadata:
1919
quantize:
2020
algorithm:
2121
method: max
22+
# Max calibration is fast and does not typically need checkpointing.
2223
layerwise: true
23-
layerwise_checkpoint_dir: output/layerwise_ckpts/
2424
quant_cfg:
2525
- quantizer_name: '*'
2626
enable: false

0 commit comments

Comments
 (0)