
feat: add runtime batch_bool mask overloads for load_masked/store_masked #1332

Open
DiamonDinoia wants to merge 1 commit into xtensor-stack:master from DiamonDinoia:feat/dynamic-masks


@DiamonDinoia
Contributor

Add runtime-mask overloads of xsimd::load_masked and xsimd::store_masked across AVX2, AVX-512, SSE, SVE, RVV, and NEON. The generic common-path fallback is collapsed to a whole-vector select, and the unaligned page-cross fast path is dropped since the underlying intrinsics suppress faults on masked-off lanes regardless of alignment.
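For readers unfamiliar with runtime masks, here is a minimal scalar sketch of the semantics a masked load and store typically have: only lanes whose mask bit is set touch memory. It is not xsimd code. The names `batch_model`, `mask_model`, `load_masked_model`, `store_masked_model`, `first_n_mask`, and `tail_example` are hypothetical stand-ins, and zero-filling inactive lanes on load is an assumption of this sketch rather than something the PR description states.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical scalar stand-ins for a SIMD batch and its boolean mask.
template <class T, std::size_t N>
using batch_model = std::array<T, N>;

template <std::size_t N>
using mask_model = std::array<bool, N>;

// Masked load: memory is read only for active lanes.
// Assumption of this sketch: inactive lanes are returned as zero.
template <class T, std::size_t N>
batch_model<T, N> load_masked_model(const T* mem, const mask_model<N>& mask)
{
    batch_model<T, N> out{}; // value-initialized, so every lane starts at zero
    for (std::size_t i = 0; i < N; ++i)
    {
        if (mask[i])
        {
            out[i] = mem[i];
        }
    }
    return out;
}

// Masked store: memory is written only for active lanes; other elements are untouched.
template <class T, std::size_t N>
void store_masked_model(T* mem, const batch_model<T, N>& src, const mask_model<N>& mask)
{
    for (std::size_t i = 0; i < N; ++i)
    {
        if (mask[i])
        {
            mem[i] = src[i];
        }
    }
}

// Build a mask at run time that enables the first `count` lanes.
template <std::size_t N>
mask_model<N> first_n_mask(std::size_t count)
{
    mask_model<N> m{};
    for (std::size_t i = 0; i < N; ++i)
    {
        m[i] = i < count;
    }
    return m;
}

// Tail handling: copy 3 elements through a 4-lane model without touching a 4th element.
inline bool tail_example()
{
    int src[3] = { 1, 2, 3 };
    int dst[3] = { 0, 0, 0 };
    const auto mask = first_n_mask<4>(3);
    const auto v = load_masked_model<int, 4>(src, mask);
    assert(v[3] == 0); // the inactive lane was never read from memory
    store_masked_model<int, 4>(dst, v, mask);
    return dst[0] == 1 && dst[1] == 2 && dst[2] == 3;
}
```

The typical use case is a loop tail: the element count is known only at run time, so the mask cannot be a compile-time constant, and the last partial vector must not read or write past the end of the buffer.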

