ci: Fix bencher.dev thresholds #459

Merged: samuelburnham merged 1 commit into main from ci-benchmarking on Jun 30, 2026

Conversation

@samuelburnham

Member

Fixes erroneous alerts for the bencher.dev benchmarks on main, which were caused by a brittle threshold reset mechanism between Lean toolchain bumps. This PR instead adds a manual threshold reset action, triggered either by a !bencher-thresholds-reset <ix-compile|aiur|all> PR comment or by a manual workflow_dispatch run after merge.
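As an illustration of the comment trigger described above, here is a minimal Python sketch of how a `!bencher-thresholds-reset <ix-compile|aiur|all>` comment could be mapped to a list of workloads. This is not the workflow code from this PR: the function name `parse_reset_command`, the regular expression, and the rule that the command must sit on its own line are all assumptions made for the example.

```python
import re

# Workload names taken from the PR description; "all" expands to every workload.
WORKLOADS = ("ix-compile", "aiur")

# Assumed syntax: the command starts a line and is followed by exactly one target.
_COMMAND = re.compile(
    r"^!bencher-thresholds-reset[ \t]+(\S+)[ \t]*\r?$", re.MULTILINE
)


def parse_reset_command(comment: str) -> list[str]:
    """Return the workloads a PR comment asks to reset, or [] if there are none."""
    match = _COMMAND.search(comment)
    if match is None:
        return []
    target = match.group(1)
    if target == "all":
        return list(WORKLOADS)
    if target in WORKLOADS:
        return [target]
    # Unknown target: ignore the command rather than guess what was meant.
    return []
```

Returning an empty list for unknown targets keeps a typo from resetting anything, which seems the safer default for an action that discards a baseline.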

Use one of these triggers whenever a PR is expected to set a new baseline, whether from a performance improvement (e.g. lowering Aiur FFTs) or a regression (e.g. more constants added to Mathlib on a toolchain bump). The baselines are split by workload, so changes to ix-compile need not affect the baseline for aiur. Currently, all thresholds for a given workload are reset together, so new metrics from the first few runs on a new baseline should be reviewed carefully for performance changes.
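The per-workload split can be pictured with a toy model in Python. It is only a sketch of the behaviour described above and does not use the Bencher API: the `Baselines` class, its methods, and the metric names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Baselines:
    # workload -> metric -> samples recorded since the last reset (toy model)
    history: dict[str, dict[str, list[float]]] = field(default_factory=dict)

    def record(self, workload: str, metric: str, value: float) -> None:
        """Append one benchmark sample to a workload's metric history."""
        self.history.setdefault(workload, {}).setdefault(metric, []).append(value)

    def reset(self, workload: str) -> None:
        """Drop every metric for one workload; other workloads are untouched."""
        self.history[workload] = {}


baselines = Baselines()
baselines.record("ix-compile", "wall_time_s", 120.0)  # hypothetical metric name
baselines.record("ix-compile", "peak_rss_mb", 900.0)  # hypothetical metric name
baselines.record("aiur", "wall_time_s", 3.5)

baselines.reset("ix-compile")
```

After the reset, both ix-compile metrics start from an empty history while the aiur sample is still present. That is why the first few runs after a reset deserve a manual look: there is no earlier data left for that workload to compare against.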

Future work: Add Zisk and SP1 benchmarks to bencher.dev and integrate with the new threshold/alert system.

Note

Before merge, we'll need to update the testbed for Ix compilation from warp-ubuntu-x64-32x to ix-compile-x64-32x via the bencher API/web console to ensure the history is preserved.

@samuelburnham samuelburnham merged commit 4b0af6c into main Jun 30, 2026
16 checks passed
@samuelburnham samuelburnham deleted the ci-benchmarking branch June 30, 2026 18:42