Commit f48a600

Metal backend: use HQQ in Parakeet quantization (pytorch#17258)
Bump ao pin to get pytorch/ao#3829, and use HQQ in Parakeet quantization
1 parent d05fe5e commit f48a600

2 files changed

Lines changed: 2 additions & 1 deletion

examples/models/parakeet/quantize.py

Lines changed: 1 addition & 0 deletions
@@ -37,6 +37,7 @@ def quantize_model_(  # noqa: C901
     config = UIntxWeightOnlyConfig(
         group_size=qlinear_group_size,
         bitwidth=4,
+        uintx_choose_qparams_algorithm="hqq",
     )
 
     def linear_filter(m, fqn):
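
For readers unfamiliar with the `"hqq"` option, the sketch below illustrates the general idea behind HQQ-style parameter selection for one weight group: start from min/max affine quantization, then iteratively refine the zero point to reduce an l_p (p < 1) reconstruction error. This is a pure-Python illustration, not the torchao implementation. The function names, the default values (`p=0.7`, `beta=10.0`, `kappa=1.01`, `iters=20`), and the dequantization convention `w ≈ (q - zero) * scale` are all assumptions made for this example and may differ from what `UIntxWeightOnlyConfig` does internally.

```python
import math


def shrink_lp(x, beta, p):
    """Generalized soft-thresholding step for an l_p penalty with p < 1."""
    if x == 0.0:
        return 0.0
    mag = abs(x) - (abs(x) ** (p - 1)) / beta
    return math.copysign(max(mag, 0.0), x)


def quantize_group_hqq(weights, bitwidth=4, p=0.7, beta=10.0, kappa=1.01, iters=20):
    """Quantize one group of floats to unsigned codes (hypothetical helper).

    Returns (codes, scale, zero) with the assumed convention
    w ~= (q - zero) * scale. With iters=0 this is plain min/max quantization.
    """
    qmax = 2 ** bitwidth - 1
    w_min, w_max = min(weights), max(weights)
    span = w_max - w_min
    inv_scale = qmax / span if span > 0 else 1.0
    zero = -w_min * inv_scale  # min/max starting point

    def encode(z):
        # Round to the nearest code and clamp to the representable range.
        return [min(max(round(w * inv_scale + z), 0), qmax) for w in weights]

    def lp_error(z):
        codes = encode(z)
        return sum(abs(w - (q - z) / inv_scale) ** p for w, q in zip(weights, codes))

    best_zero, best_err = zero, lp_error(zero)
    for _ in range(iters):
        codes = encode(zero)
        recon = [(q - zero) / inv_scale for q in codes]
        # Sparse "outlier" residual obtained by shrinking the reconstruction error.
        resid = [shrink_lp(w - r, beta, p) for w, r in zip(weights, recon)]
        # Closed-form zero-point update given the current codes and residual.
        zero = sum(
            q - (w - e) * inv_scale for q, w, e in zip(codes, weights, resid)
        ) / len(weights)
        beta *= kappa
        err = lp_error(zero)
        if err >= best_err:
            break  # stop as soon as the error no longer improves
        best_zero, best_err = zero, err

    return encode(best_zero), 1.0 / inv_scale, best_zero


if __name__ == "__main__":
    group = [-0.9, -0.2, 0.0, 0.1, 0.15, 0.3, 0.31, 2.5]
    codes, scale, zero = quantize_group_hqq(group)
    print(codes, scale, zero)
```

Because the loop keeps the best zero point seen so far, including the min/max starting point, the l_p error of the result is never worse than that of the plain min/max baseline on the same group.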
