Skip to content

Improve masked load/store for sse2 when only the first element is sel…#1216

Merged
serge-sans-paille merged 1 commit intomasterfrom
feature/improve-load-store-masked
Nov 28, 2025
Merged

Improve masked load/store for sse2 when only the first element is sel…#1216
serge-sans-paille merged 1 commit intomasterfrom
feature/improve-load-store-masked

Conversation

@serge-sans-paille
Copy link
Copy Markdown
Contributor

…ected

@serge-sans-paille
Copy link
Copy Markdown
Contributor Author

cc @DiamonDinoia . It mostly helps GCC, clang is generally capable of finding the right pattern on its own.

@DiamonDinoia
Copy link
Copy Markdown
Contributor

Nice! I will review more carefully next week but I like the changes.

@serge-sans-paille serge-sans-paille force-pushed the feature/improve-load-store-masked branch from c130389 to 0093e62 Compare November 20, 2025 21:12
@serge-sans-paille serge-sans-paille force-pushed the feature/improve-load-store-masked branch from 0093e62 to d0ecc44 Compare November 27, 2025 22:26
@serge-sans-paille
Copy link
Copy Markdown
Contributor Author

@DiamonDinoia I've rebased on top of latest changes, we're getting close to a release, I only have to implement wasm support and we're good to go (and maybe ppc too)...

@serge-sans-paille
Copy link
Copy Markdown
Contributor Author

@DiamonDinoia gentle ping :-)

Copy link
Copy Markdown
Contributor

@DiamonDinoia DiamonDinoia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

I was thinking of doing something like: mask.countr_one() == 1 && mask.popcount() == 1 instead of mask.mask() == 0x1 bit I think the latter is better in this case.

@serge-sans-paille serge-sans-paille merged commit 6280185 into master Nov 28, 2025
120 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants