You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Arm backend: Run adaptive_avg_pool2d before quantization
To run mobilenet_v2 with good performance on Ethos-U55, we
need to export the model in channels_last. If we export in
channels_first (default behaviour), we pay a hefty
performance penalty because the Ethos-U55 hardware is not
efficient at doing Transpose
(see details in #17157).
The adaptive_avg_pool2d operator, part of mv2,
is traced differently by ExecuTorch
depending on whether it was exported in
channels-first(operator not decomposed) or
exported in channels-last
(operator is decomposed by ExecuTorch in to_edge).
To work around that, we add adaptive_avg_pool2d
to the transform_for_annotation
pipeline in order to decompose the
operator before quantization.
Signed-off-by: George Gekov <george.gekov@arm.com>
Change-Id: I3e98a2d52f6d0e7c79f82188a5e6c4eb6a63448b
0 commit comments