Add cortex_m MVE/Helium int16 quantize/dequantize support (#19218)#19218
Add cortex_m MVE/Helium int16 quantize/dequantize support (#19218)#19218meta-codesync[bot] merged 1 commit intomainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19218
Note: Links to docs will display an error until the docs builds have been completed. ⏳ 1 Pending, 2 Unrelated FailuresAs of commit f3fe5c6 with merge base e84a418 ( BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@3l1 has exported this pull request. If you are a Meta employee, you can view the originating Diff in D103129855. |
This PR needs a
|
Summary: Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, plus the AoT partitioner update so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback (no MVE on Cortex-M55, since the portable path is gated on __aarch64__ || __ARM_NEON__ which Cortex-M55 has neither of). Differential Revision: D103129855
6b9ce08 to
a27f25f
Compare
Summary: Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, plus the AoT partitioner update so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback. Differential Revision: D103129855
a27f25f to
89e6708
Compare
Summary: Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback. Differential Revision: D103129855
89e6708 to
5cea79a
Compare
Summary: Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback. Differential Revision: D103129855
5cea79a to
f3fe5c6
Compare
Summary:
Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback.
Differential Revision: D103129855
cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell