
Add bit-packed cast benchmark #8058

Merged
joseph-isaacs merged 3 commits into develop from codex/cast-bitpacked-benchmark
May 22, 2026

@joseph-isaacs

Contributor

Summary

  • Add a cast_bitpacked divan benchmark for widening bit-packed u16 arrays to u32.
  • Compare the public array.cast(u32).execute() path against an explicit canonicalize-then-cast baseline.
  • Cover single and chunked inputs with and without patches.
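To make the two compared paths concrete, here is a minimal std-only sketch of the idea. It is not the Vortex or FastLanes implementation: the `Packed` struct, the naive contiguous bit layout, and the function names `encode`, `canonicalize_then_cast`, and `cast_direct` are all hypothetical and exist only for illustration. Values that do not fit in the chosen bit width are stored as (index, value) patches, roughly mirroring the "with patches" cases.

```rust
/// Hypothetical bit-packed u16 array: packed words plus out-of-range patches.
struct Packed {
    words: Vec<u64>,
    bit_width: usize,
    len: usize,
    patches: Vec<(usize, u16)>,
}

/// Pack values into `bit_width` bits each; values that do not fit become patches.
fn encode(values: &[u16], bit_width: usize) -> Packed {
    assert!((1..=16).contains(&bit_width));
    let limit = 1u32 << bit_width;
    let mut words = vec![0u64; (values.len() * bit_width + 63) / 64];
    let mut patches = Vec::new();
    for (i, &v) in values.iter().enumerate() {
        // Out-of-range values are recorded as patches and packed as zero.
        let packed = if (v as u32) < limit { v as u64 } else { patches.push((i, v)); 0 };
        let (word, shift) = (i * bit_width / 64, i * bit_width % 64);
        words[word] |= packed << shift;
        if shift + bit_width > 64 {
            // The value straddles two words; spill the high bits.
            words[word + 1] |= packed >> (64 - shift);
        }
    }
    Packed { words, bit_width, len: values.len(), patches }
}

/// Read the i-th packed value (before patches are applied).
fn get(p: &Packed, i: usize) -> u16 {
    let (word, shift) = (i * p.bit_width / 64, i * p.bit_width % 64);
    let mut v = p.words[word] >> shift;
    if shift + p.bit_width > 64 {
        v |= p.words[word + 1] << (64 - shift);
    }
    (v & ((1u64 << p.bit_width) - 1)) as u16
}

/// Baseline: materialize a full u16 buffer, then widen it in a second pass.
fn canonicalize_then_cast(p: &Packed) -> Vec<u32> {
    let mut canonical: Vec<u16> = (0..p.len).map(|i| get(p, i)).collect();
    for &(i, v) in &p.patches {
        canonical[i] = v;
    }
    canonical.into_iter().map(u32::from).collect()
}

/// Alternative: unpack straight into u32, skipping the intermediate u16 buffer.
fn cast_direct(p: &Packed) -> Vec<u32> {
    let mut out: Vec<u32> = (0..p.len).map(|i| u32::from(get(p, i))).collect();
    for &(i, v) in &p.patches {
        out[i] = u32::from(v);
    }
    out
}
```

Both functions must return the same values; the benchmark question is only whether one path avoids work (here, the intermediate `Vec<u16>`) that the other performs.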

Checks

  • cargo +nightly fmt --all
  • cargo bench -p vortex-fastlanes --features _test-harness --bench cast_bitpacked --no-run

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
@joseph-isaacs joseph-isaacs added the changelog/skip Do not list PR in the changelog label May 22, 2026
@joseph-isaacs joseph-isaacs marked this pull request as ready for review May 22, 2026 10:00
u
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
@codspeed-hq

codspeed-hq Bot commented May 22, 2026


Merging this PR will not alter performance

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚡ 2 improved benchmarks
❌ 1 regressed benchmark
✅ 1234 untouched benchmarks
🆕 6 new benchmarks

Warning

Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

| Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|
| Simulation | baseline_eq[16, 65536] | 287.6 µs | 259.6 µs | +10.75% |
| Simulation | baseline_lt[16, 65536] | 302.7 µs | 274.8 µs | +10.16% |
| Simulation | 🆕 cast_execute[(262144, 1, 0.0)] | N/A | 1.9 ms | N/A |
| Simulation | 🆕 cast_execute[(262144, 1, 0.01)] | N/A | 1.9 ms | N/A |
| Simulation | 🆕 cast_execute[(65536, 1, 0.0)] | N/A | 517.6 µs | N/A |
| Simulation | 🆕 cast_execute[(65536, 1, 0.01)] | N/A | 525.8 µs | N/A |
| Simulation | 🆕 cast_execute[(65536, 4, 0.0)] | N/A | 2.8 ms | N/A |
| Simulation | 🆕 cast_execute[(65536, 4, 0.01)] | N/A | 2.8 ms | N/A |
| Simulation | fast_lt_out_of_range[4, 65536] | 204.3 µs | 262.3 µs | -22.12% |
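For reading the Efficiency column: the reported percentages appear consistent with BASE / HEAD - 1, so a positive value means HEAD is faster. This is an inference from the numbers above, not a formula stated in the report, and the helper name `efficiency_pct` is hypothetical.

```rust
/// Relative change in percent, assuming efficiency = BASE / HEAD - 1.
/// Positive means HEAD took less time than BASE.
fn efficiency_pct(base: f64, head: f64) -> f64 {
    (base / head - 1.0) * 100.0
}
```

Applied to the displayed timings, this reproduces the three reported percentages to within a few hundredths of a point. The small residual is what you would expect if the report computes from unrounded timings.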

Tip

Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.


Comparing codex/cast-bitpacked-benchmark (05a73b7) with develop (012d0ec)


u
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
@joseph-isaacs joseph-isaacs enabled auto-merge (squash) May 22, 2026 10:19
@joseph-isaacs joseph-isaacs merged commit dba7935 into develop May 22, 2026
61 checks passed
@joseph-isaacs joseph-isaacs deleted the codex/cast-bitpacked-benchmark branch May 22, 2026 10:28
