Implement PackDepthwiseConvMatrix in NEON + deprecate aarch64 compat layers (#5779) by Nicoshev · Pull Request #5779 · pytorch/FBGEMM

Nicoshev · 2026-05-23T14:41:38Z

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/2709

Add a NEON-based aarch64 implementation of the PackedDepthWiseConvMatrix constructor in PackDepthwiseConvMatrix.cc, alongside the existing AVX2 x86 implementation. The constructor packs depthwise convolution weight matrices into a SIMD-friendly interleaved layout.
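As a rough illustration of what an interleaved, SIMD-friendly weight layout can look like, here is a minimal scalar sketch. It is not the layout or code from this PR: the function name `pack_depthwise_sketch`, the block width `kLanes`, and the source layout (`channels x kernel_prod`, row-major) are all assumptions made for the example. The idea is that the weights of 16 consecutive channels at one kernel position end up contiguous, so a single 128-bit load can fetch them.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical illustration only: this is NOT the layout FBGEMM uses.
// Assumed block width: 16 int8 lanes fit in one 128-bit NEON register.
constexpr std::size_t kLanes = 16;

// Packs int8 weights stored as src[channel * kernel_prod + k] into blocks of
// kLanes channels. Within a block, the kLanes channel values for a given
// kernel position k are stored contiguously. Channels past the end of the
// last block are zero-padded.
std::vector<std::int8_t> pack_depthwise_sketch(
    const std::vector<std::int8_t>& src,
    std::size_t channels,
    std::size_t kernel_prod) {
  const std::size_t blocks = (channels + kLanes - 1) / kLanes;
  std::vector<std::int8_t> packed(blocks * kernel_prod * kLanes, 0);
  for (std::size_t c = 0; c < channels; ++c) {
    const std::size_t block = c / kLanes;  // which group of 16 channels
    const std::size_t lane = c % kLanes;   // position inside the group
    for (std::size_t k = 0; k < kernel_prod; ++k) {
      packed[(block * kernel_prod + k) * kLanes + lane] =
          src[c * kernel_prod + k];
    }
  }
  return packed;
}
```

A vectorized version would replace the inner scalar stores with register transposes, but the index mapping above is the part that defines the layout in this sketch.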

Rename the depthwise-convolution-related files, since NEON and AVX2 implementations now co-exist.

Remove compilation of the AVX2 source files for aarch64 targets and remove usage of the aarch64 compat layers.

Reviewed By: q10, YifanYuan3

Differential Revision: D106137964

meta-codesync · 2026-05-23T14:42:04Z

@Nicoshev has exported this pull request. If you are a Meta employee, you can view the originating Diff in D106137964.

meta-codesync · 2026-05-30T03:42:29Z

This pull request has been merged in 07767a8.

meta-codesync Bot added fb-exported meta-exported labels May 23, 2026

meta-cla Bot added the cla signed label May 23, 2026

meta-codesync Bot changed the title ~~Implement PackDepthwiseConvMatrix in NEON + deprecate aarch64 compat layers~~ Implement PackDepthwiseConvMatrix in NEON + deprecate aarch64 compat layers (#5779) May 25, 2026

Nicoshev force-pushed the export-D106137964 branch from 3cf57eb to 32e1b26 Compare May 25, 2026 13:57

Nicoshev force-pushed the export-D106137964 branch 2 times, most recently from bcc69a5 to 513f790 Compare May 29, 2026 18:54

Nicoshev force-pushed the export-D106137964 branch from 513f790 to bad96b5 Compare May 29, 2026 19:00

Nicoshev force-pushed the export-D106137964 branch from bad96b5 to 0a88cea Compare May 29, 2026 19:05

meta-codesync Bot closed this in 07767a8 May 30, 2026

facebook-github-tools Bot added the Merged label May 30, 2026

Nicoshev wants to merge 1 commit into pytorch:main from Nicoshev:export-D106137964