Commit a4530c1

Update examples/llm_ptq/hf_ptq.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Signed-off-by: Adi Renduchintala <adithya.r@gmail.com>
1 parent 56a9f2b commit a4530c1

1 file changed

Lines changed: 3 additions & 3 deletions

examples/llm_ptq/hf_ptq.py

@@ -1088,9 +1088,9 @@ def parse_args() -> argparse.Namespace:
     parser.add_argument(
         "--skip_generate",
         help=(
-            "Skip the pre/post-quantization generation preview calls. "
-            "Use this for very large models that cannot run forward passes "
-            "(e.g. models split across GPU and CPU via device_map)."
+            "Skip pre/post-quantization preview calls that invoke model.generate(). "
+            "Note: this does not skip calibration or batch-size probing. "
+            "For very large models, pair with --batch_size 1 to avoid max-batch probing."
         ),
         default=False,
         action="store_true",
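
To illustrate what the revised help text says, here is a minimal standalone sketch of how the two flags might interact. It is not the code in hf_ptq.py: `build_parser`, `planned_steps`, and the step names are hypothetical, and treating `--batch_size 0` as "probe for the largest batch that fits" is an assumption made only for this example.

```python
import argparse
from typing import List


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser with only the two flags discussed in the help text."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--skip_generate", default=False, action="store_true")
    # Assumption for this sketch: 0 means "probe for the largest batch that fits".
    parser.add_argument("--batch_size", type=int, default=0)
    return parser


def planned_steps(args: argparse.Namespace) -> List[str]:
    """Return the (hypothetical) steps that would still run for these flags."""
    steps = []
    if args.batch_size == 0:
        # Probing runs forward passes regardless of --skip_generate.
        steps.append("probe_max_batch_size")
    if not args.skip_generate:
        steps.append("preview_generate_before")
    # Calibration always runs; --skip_generate does not affect it.
    steps.append("calibrate")
    if not args.skip_generate:
        steps.append("preview_generate_after")
    return steps


if __name__ == "__main__":
    args = build_parser().parse_args(["--skip_generate", "--batch_size", "1"])
    print(planned_steps(args))
```

Under these assumptions, passing `--skip_generate` alone still leaves the probing step in place, which is the gap the new help text warns about; adding `--batch_size 1` removes it.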
