Skip to content

Faster reduce_add on avx#1112

Merged
serge-sans-paille merged 1 commit intomasterfrom
feature/reduce-add-float
Apr 27, 2025
Merged

Faster reduce_add on avx#1112
serge-sans-paille merged 1 commit intomasterfrom
feature/reduce-add-float

Conversation

@serge-sans-paille
Copy link
Copy Markdown
Contributor

Just forward to sse after a split and a sum.

Just forward to sse after a split and a sum. Improve the generic
reduce_add as a side effect.
@serge-sans-paille serge-sans-paille force-pushed the feature/reduce-add-float branch from 222c647 to ad31228 Compare April 27, 2025 14:35
@serge-sans-paille
Copy link
Copy Markdown
Contributor Author

serge-sans-paille commented Apr 27, 2025

the optimized generated code looks similar but at least we've improved the generic case and we have the same code path.

@serge-sans-paille serge-sans-paille merged commit fb25021 into master Apr 27, 2025
120 checks passed
@JohanMabille JohanMabille deleted the feature/reduce-add-float branch May 6, 2025 08:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant