-
Notifications
You must be signed in to change notification settings - Fork 2k
perf : Optimize count distinct #21456
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
8367a75
f03c0d0
e8fe907
8ce93d2
c09aaa0
eb0632f
e5de29b
c8e208c
e7c793a
7e67e2e
a13bcaa
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -81,7 +81,7 @@ name = "approx_distinct" | |
| harness = false | ||
|
|
||
| [[bench]] | ||
| name = "first_last" | ||
| name = "count_distinct" | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. can you please add this benchmark as a separate PR (so we can use our standard benchmark runner to confirm the results)?
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Thats sounds like a good idea . Let me put up a PR to add benchmarks to
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Raised : #21521 |
||
| harness = false | ||
|
|
||
| [[bench]] | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think you can probably use a
BooleanBufferfrom Arrow to make this signifcantly faster (I think[booluses a byte for each booelan) 🤔There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great idea ! Let me try and experiment with that