Skip to content

Commit 5c2e857

Browse files
nit pick
Signed-off-by: vipandya <vipandya@nvidia.com>
1 parent 6a86052 commit 5c2e857

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

examples/windows/onnx_ptq/genai_llm/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -66,8 +66,8 @@ The table below lists key command-line arguments of the ONNX PTQ example script.
6666
| `--awqclip_alpha_min` | 0.5 (default) | Minimum AWQ weight-clipping threshold, user-defined |
6767
| `--awqclip_bsz_col` | 1024 (default) | Chunk size in columns during weight clipping, user-defined |
6868
| `--calibration_eps` | dml, cuda, cpu, NvTensorRtRtx (default: [cuda,cpu]) | List of execution-providers to use for session run during calibration |
69-
| `--add_position_ids` | Default: position_ids input disabled | Use this option to enable position_ids input in calibration data|
70-
| `--enable_mixed_quant` | Default: disabled mixed quant | Use this option to enable mixed precision quantization|
69+
| `--add_position_ids` | Default: position_ids input is disabled | Use this option to enable position_ids input in calibration data|
70+
| `--enable_mixed_quant` | Default: mixed-quant is disabled | Use this option to enable mixed precision quantization|
7171
| `--layers_8bit` | Default: None | Use this option to override default mixed-quant strategy|
7272
| `--gather_quantize_axis` | Default: None | Use this option to enable INT4 quantization of Gather nodes - choose 0 or 1|
7373
| `--gather_block_size` | Default: 32 | Block-size for Gather node's INT4 quantization (when its enabled using gather_quantize_axis option)|

0 commit comments

Comments
 (0)