Fix `bitwise_unary_op` by connortsui20 · Pull Request #6940 · vortex-data/vortex

connortsui20 · 2026-03-13T15:20:22Z

Summary

Closes: #6895 (which might seem completely unrelated but is relevant because of an incorrect mask)

Fixes a bug in bitwise_unary_op where the it used Arrow's UnalignedBitChunk iterator, which for buffers larger than 16 bytes (128 bits) uses align_to::<u64>() and introduces lead padding zeros when the byte pointer isn't u64-aligned. After applying the operation (e.g. NOT), those padding bits were written into a fresh, aligned output buffer at real data positions, corrupting the first padding bits of the result

This change just delegates to the correct mut implementation which is likely also faster.

Testing

Adds a simple regression test.

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>

robert3005 · 2026-03-13T15:22:21Z

Ah, that's why you can't just change this. I made this change in #6880 which means that binary op needs adjustment as well. I don't see how the mut op would be faster though

codspeed-hq · 2026-03-13T15:24:22Z

Merging this PR will degrade performance by 12.27%

⚡ 1 improved benchmark
❌ 1 regressed benchmark
✅ 1007 untouched benchmarks
⏩ 1515 skipped benchmarks¹

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

	Mode	Benchmark	`BASE`	`HEAD`	Efficiency
⚡	Simulation	`binary_search_std`	582.8 ns	524.4 ns	+11.12%
❌	Simulation	`true_count_vortex_buffer[128]`	1 µs	1.2 µs	-12.27%

_{Comparing ct/fix-bit-unary (e0daf70) with develop (18bef2b)}

1515 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩

robert3005 · 2026-03-13T15:24:42Z

align_to is not the problem here though? It's that the padding is wrong so offset and length are wrong

connortsui20 · 2026-03-13T15:39:56Z

I thought that align to caused the padding to be wrong in this case.

Anyways I don't see much reason to not just delegate to the _mut version, the regression on codspeed is likely because it was just incorrect before.

robert3005 · 2026-03-13T15:43:15Z

align_to just splits the buffer (it's a const function). But the padding is clearly not preserved for any buffer in 17-32 range potentially. Anyway I agree with your fix. We should make sure to fix the binary op as well since we're here

connortsui20 · 2026-03-13T15:49:49Z

what do you mean by fix the binary op?

robert3005

I can follow up with the fix to binary op

robert3005 · 2026-03-13T17:00:05Z

Ok, binary op is not a problem since it's only ever used for aligned arrays (i.e. 8 byte aligned) but we really only need 1 byte alignment

gatesn · 2026-03-16T13:43:13Z

-    let result = Buffer::<u64>::from_trusted_len_iter(iter).into_byte_buffer();
-
-    BitBuffer::new_with_offset(result, buffer.len(), buffer.offset())
+    let mut buf = buffer.clone().into_mut();


@robert3005 we should kill the into_mut() functions. If we cannot into_mut, then we do a memcopy for no reason since we're about to overwrite the data anyway.

We should just have try_into_mut and the caller can either do in-place, or allocate a new buffer and compute directly into that.

fix bitwise_unary_op

e0daf70

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>

connortsui20 requested review from gatesn and robert3005 March 13, 2026 15:20

connortsui20 added the changelog/fix A bug fix label Mar 13, 2026

robert3005 approved these changes Mar 13, 2026

View reviewed changes

connortsui20 merged commit 4b7207e into develop Mar 13, 2026
54 of 56 checks passed

connortsui20 deleted the ct/fix-bit-unary branch March 13, 2026 15:58

connortsui20 mentioned this pull request Mar 13, 2026

Fuzzing Crash: AssertionFailed in file_io #6896

Closed

gatesn reviewed Mar 16, 2026

View reviewed changes

connortsui20 mentioned this pull request Mar 18, 2026

Reader error on 0.63.0 due to bug in bitwise_unary_op #6975

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `bitwise_unary_op`#6940

Fix `bitwise_unary_op`#6940
connortsui20 merged 1 commit into
developfrom
ct/fix-bit-unary

connortsui20 commented Mar 13, 2026

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

codspeed-hq Bot commented Mar 13, 2026

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

connortsui20 commented Mar 13, 2026

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

connortsui20 commented Mar 13, 2026

Uh oh!

robert3005 left a comment

Uh oh!

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

gatesn Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

connortsui20 commented Mar 13, 2026

Summary

Testing

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

codspeed-hq Bot commented Mar 13, 2026

Merging this PR will degrade performance by 12.27%

Performance Changes

Footnotes

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

connortsui20 commented Mar 13, 2026

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

connortsui20 commented Mar 13, 2026

Uh oh!

robert3005 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

robert3005 commented Mar 13, 2026

Uh oh!

gatesn Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants