Revert "Revert "Arm backend: Run adaptive_avg_pool2d before quantization"" (#17831)
Reverts #17595, essentially relanding #17494. Original summary from
@gggekov:
To run mobilenet_v2 with good performance on Ethos-U55, we need to
export the model in channels_last. If we export in channels_first
(the default behaviour), we pay a hefty performance penalty because the
Ethos-U55 hardware is not efficient at doing Transpose
(see details in #17157). The adaptive_avg_pool2d operator, part of mv2,
is traced differently by ExecuTorch depending on whether the model was
exported in channels_first (operator not decomposed) or in
channels_last (operator decomposed by ExecuTorch in to_edge). To work
around that, we add adaptive_avg_pool2d to the transform_for_annotation
pipeline in order to decompose the operator before quantization.
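
For readers unfamiliar with what "decomposing" adaptive_avg_pool2d means, here is a minimal pure-Python sketch. It is not the pass added by this change; the function names (`adaptive_windows`, `decompose_to_avg_pool2d`, and so on) are hypothetical and exist only for illustration. It assumes the common case where the input size is divisible by the output size, in which adaptive average pooling is equivalent to a plain avg_pool2d whose kernel and stride are both `in_size // out_size`.

```python
def adaptive_windows(in_size, out_size):
    """(start, end) index pairs that adaptive average pooling uses on one axis."""
    # start = floor(i * in / out), end = ceil((i + 1) * in / out), in integer math
    return [
        ((i * in_size) // out_size, -((-(i + 1) * in_size) // out_size))
        for i in range(out_size)
    ]


def adaptive_avg_pool2d(x, out_h, out_w):
    """Reference adaptive average pooling over a 2D list of numbers."""
    rows = adaptive_windows(len(x), out_h)
    cols = adaptive_windows(len(x[0]), out_w)
    return [
        [
            sum(x[r][c] for r in range(r0, r1) for c in range(c0, c1))
            / ((r1 - r0) * (c1 - c0))
            for (c0, c1) in cols
        ]
        for (r0, r1) in rows
    ]


def avg_pool2d(x, kernel, stride):
    """Plain average pooling with a fixed kernel and stride, no padding."""
    kh, kw = kernel
    sh, sw = stride
    out_h = (len(x) - kh) // sh + 1
    out_w = (len(x[0]) - kw) // sw + 1
    return [
        [
            sum(x[i * sh + r][j * sw + c] for r in range(kh) for c in range(kw))
            / (kh * kw)
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def decompose_to_avg_pool2d(x, out_h, out_w):
    """Hypothetical decomposition: rewrite the adaptive op as a fixed avg_pool2d."""
    in_h, in_w = len(x), len(x[0])
    if in_h % out_h or in_w % out_w:
        # Assumption of this sketch: only the evenly divisible case is handled.
        raise ValueError("input size must be divisible by output size")
    k = (in_h // out_h, in_w // out_w)
    return avg_pool2d(x, kernel=k, stride=k)


# 4x4 input holding the values 0..15
x = [[4 * r + c for c in range(4)] for r in range(4)]
pooled = decompose_to_avg_pool2d(x, 2, 2)
global_mean = decompose_to_avg_pool2d(x, 1, 1)
```

With an output size of 1x1 the rewritten op is a global mean over the spatial dimensions. Doing the rewrite before quantization means the quantizer annotates the same operators regardless of which memory format the model was exported in.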