Qualcomm AI Engine Direct - Support 2-bits quantization 16a2w (pytorch#19632)
Summary:
1. Add a 16a2w quantizer (16-bit activations, 2-bit weights) based on standard symmetric quantization
2. Support per-channel quantization and linear layers
3. Currently supported SoC model: SM8850
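
For readers unfamiliar with the scheme, a symmetric per-channel 2-bit weight quantizer might look roughly like the sketch below. This is not the code added by this commit: the function names are hypothetical, and the scale formula (max absolute value divided by half the integer range) is an assumption modeled on how PyTorch's symmetric min-max observers compute it. With 2 bits, signed weights can only take the values -2, -1, 0, and 1.

```python
def quantize_per_channel_symmetric(weights, bits=2):
    """Quantize each row (one output channel) with its own scale.

    Hypothetical helper for illustration; not part of the QNN backend.
    """
    # Signed integer range for the given bit width: 2 bits -> [-2, 1].
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    quantized, scales = [], []
    for row in weights:
        max_abs = max(abs(v) for v in row)
        # Assumed scale rule: symmetric around zero, zero-point fixed at 0.
        scale = max_abs / ((qmax - qmin) / 2) if max_abs > 0 else 1.0
        # Round to the nearest integer, then clamp into [qmin, qmax].
        q_row = [min(qmax, max(qmin, round(v / scale))) for v in row]
        quantized.append(q_row)
        scales.append(scale)
    return quantized, scales


def dequantize(quantized, scales):
    """Map integer weights back to floats using the per-channel scales."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


if __name__ == "__main__":
    w = [[0.3, -0.3, 0.0, 0.15], [3.0, -1.5, 0.0, 1.5]]
    q, s = quantize_per_channel_symmetric(w)
    print(q, s)
    print(dequantize(q, s))
```

Each channel gets its own scale, so a channel with small weights is not forced onto the coarse grid of a channel with large weights, which matters a great deal when only four integer levels are available.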
Test plan:
python backends/qualcomm/tests/test_qnn_delegate.py \
  TestQNNQuantizedOperator.test_qnn_backend_16a2w_conv2d \
  -b build-android -H ${HOST} -s ${SN} -m SM8850

python backends/qualcomm/tests/test_qnn_delegate.py \
  TestQNNQuantizedOperator.test_qnn_backend_16a2w_linear \
  -b build-android -H ${HOST} -s ${SN} -m SM8850
cc @cccclai @cbilgin @abhinaykukkadapu