Skip to content

Commit 93c4138

Browse files
committed
Clarify active MoE AutoQuant ratio
Signed-off-by: weimingc <17592131+meenchen@users.noreply.github.com>
1 parent e490b85 commit 93c4138

1 file changed

Lines changed: 3 additions & 1 deletion

File tree

examples/llm_ptq/hf_ptq.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1418,7 +1418,9 @@ def parse_args() -> argparse.Namespace:
14181418
help=(
14191419
"Routed MoE expert active ratio for --auto_quantize_cost_model active_moe. "
14201420
"For top-k MoE this is top_k / num_experts. If omitted, common model config "
1421-
"fields such as num_experts_per_tok and num_experts are used when available."
1421+
"fields such as num_experts_per_tok and num_experts are used when available. "
1422+
"This only affects AutoQuant cost accounting and does not change calibration "
1423+
"routing; use --moe_calib_experts_ratio to control calibration expert coverage."
14221424
),
14231425
)
14241426
parser.add_argument(

0 commit comments

Comments
 (0)