You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Optimized BGEMV for NEOVERSEN2, NEOVERSEV1 and NEOVERSEV2 targets
- Adds bgemv T based off of sbgemv T kernel
- Adds bgemv N which is slightly alterated to not use Y as an
accumulator due to the output being bf16 which results in loss of
precision
- Extends NEOVERSEV2 from NEOVERSEN2 to match existing BF16 kernels with
SVE enabled
- Re-uses the existing kernels for BGEMM and BGEMV in NEOVERSEN2 and
NEOVERSEV2 targets
- Enables BGEMM_GEMV_FORWARD to proxy BGEMM to BGEMV with new kernels
0 commit comments