ci: add benchmark workflow by wakonig · Pull Request #1127 · bec-project/bec_widgets

wakonig · 2026-04-17T08:08:57Z

Description

This PR adds a new ci workflow for checking performance metrics, either from a bash script (e.g. for benchmarking import and startup times) or from a pytest-benchmark run (e.g. for runtime benchmarks such as adding components to a dockarea).

To not get completely overshadowed with the CI fluctuations, a matrix job is used and results are averaged across 3 runs.

Once merged, the results are pushed to gh-pages and kept as json files. This only happens on main though; PR will simply run their checks locally and compare against the reference.

Let's test it for some time in BW. If we are happy with it, I'd suggest to create a bespoke action out of it such that we can also reuse it for other repos.

github-actions · 2026-04-17T08:15:06Z

Benchmark comparison

Threshold: 10% (lower is better).
Result: 2 regression(s), 2 improvement(s) beyond threshold.

2 benchmark(s) regressed beyond the configured threshold.

Benchmark	Baseline	Current	Change
BEC IPython client with companion app	2.40488 s	7.74267 s	+221.96%
tests/unit_tests/benchmarks/test_dock_area_benchmark.py::test_add_waveform_to_dock_area	0.0100101 s	0.18571 s	+1755.22%

2 benchmark(s) improved beyond the configured threshold.

Benchmark	Baseline	Current	Change
BEC IPython client without companion app	7.75936 s	2.19367 s	-71.73%
Import bec_widgets	1.28091 s	0.0135032 s	-98.95%

All benchmark results

Benchmark	Baseline	Current	Change	Status
BEC IPython client with companion app	2.40488 s	7.74267 s	+221.96%	🔴 regressed
BEC IPython client without companion app	7.75936 s	2.19367 s	-71.73%	🟢 improved
Import bec_widgets	1.28091 s	0.0135032 s	-98.95%	🟢 improved
tests/unit_tests/benchmarks/test_dock_area_benchmark.py::test_add_waveform_to_dock_area	0.0100101 s	0.18571 s	+1755.22%	🔴 regressed

codecov · 2026-04-17T08:31:44Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Copilot

Pull request overview

Adds a CI benchmarking pipeline to track performance metrics over time (hyperfine startup/import benchmarks + pytest-benchmark runtime benchmarks), compare results to a stored baseline, and (intended) publish historical benchmark JSON to gh-pages.

Changes:

Introduces a reusable benchmark.yml workflow plus helper scripts to run/aggregate/compare benchmarks and comment results on PRs.
Adds initial benchmark definitions (hyperfine shell benchmarks + a pytest-benchmark dock area benchmark).
Updates existing pytest workflows to exclude the new benchmark tests from normal unit-test runs.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
`tests/unit_tests/benchmarks/test_dock_area_benchmark.py`	Adds a pytest-benchmark test for adding a widget to a `BECDockArea`.
`tests/benchmarks/hyperfine/utils/exit_bec_startup.py`	Adds an IPython helper to exit BEC startup for launch-time benchmarking.
`tests/benchmarks/hyperfine/benchmark_launch_bec_without_companion.sh`	Adds a hyperfine benchmark script for launching BEC (metadata currently inconsistent).
`tests/benchmarks/hyperfine/benchmark_launch_bec_with_companion.sh`	Adds a hyperfine benchmark script for launching BEC (metadata currently inconsistent).
`tests/benchmarks/hyperfine/benchmark_import_bec_widgets.sh`	Adds a hyperfine benchmark script for import-time measurement.
`.github/workflows/pytest.yml`	Ignores benchmark tests during coverage/unit-test workflow runs.
`.github/workflows/pytest-matrix.yml`	Ignores benchmark tests during python-version-matrix test runs.
`.github/workflows/ci.yml`	Hooks the benchmark workflow into Full CI and adds `contents: write` permission.
`.github/workflows/benchmark.yml`	New reusable workflow to run benchmarks (3 attempts), aggregate, compare, PR-comment, and publish to `gh-pages`.
`.github/scripts/run_with_bec_servers.py`	New helper to start Redis + BEC services and run the benchmark command under that environment.
`.github/scripts/run_benchmarks.sh`	New runner that executes hyperfine + pytest-benchmark suites and aggregates results.
`.github/scripts/compare_benchmarks.py`	New comparator to generate a Markdown summary and fail on threshold regressions.
`.github/scripts/aggregate_benchmarks.py`	New aggregator to normalize and median-aggregate benchmark results.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

cappel89

LGTM. Just out of curiosity, where is the benchmark fixture coming from?

wakonig · 2026-04-21T09:48:13Z

LGTM. Just out of curiosity, where is the benchmark fixture coming from?

that's pytest-benchmark https://pytest-benchmark.readthedocs.io/en/latest/

wakonig · 2026-04-21T09:49:22Z

but I obviously forgot to add it to pyproject... let me fix that quickly

wakonig force-pushed the ci/benchmark_workflow branch 2 times, most recently from 51bacc4 to 501a793 Compare April 17, 2026 08:22

wakonig force-pushed the ci/benchmark_workflow branch 3 times, most recently from 861678d to 64c435f Compare April 17, 2026 12:58

wakonig marked this pull request as ready for review April 17, 2026 13:46

Copilot AI review requested due to automatic review settings April 17, 2026 13:46

Copilot started reviewing on behalf of wakonig April 17, 2026 13:47 View session

wakonig force-pushed the ci/benchmark_workflow branch from 8d70b23 to 29f7dc3 Compare April 17, 2026 13:47

Copilot AI reviewed Apr 17, 2026

View reviewed changes

wakonig force-pushed the ci/benchmark_workflow branch 2 times, most recently from 12cad30 to 4243909 Compare April 17, 2026 14:06

wakonig requested a review from a team April 17, 2026 14:23

wakonig force-pushed the ci/benchmark_workflow branch from 858fd86 to 8a7dd9c Compare April 21, 2026 07:49

cappel89 previously approved these changes Apr 21, 2026

View reviewed changes

wakonig dismissed cappel89’s stale review via 6553f5a April 21, 2026 09:51

wakonig force-pushed the ci/benchmark_workflow branch from 6553f5a to f18f93c Compare April 21, 2026 09:52

wakonig requested a review from cappel89 April 21, 2026 09:52

wakonig enabled auto-merge (rebase) April 21, 2026 09:58

wakonig added 2 commits April 21, 2026 12:43

ci: add benchmark workflow

3a140d4

build: add pytest-benchmark dependency

c9db094

wakonig force-pushed the ci/benchmark_workflow branch from f18f93c to c9db094 Compare April 21, 2026 10:43

cappel89 approved these changes Apr 21, 2026

View reviewed changes

wakonig merged commit 551d38d into main Apr 21, 2026
42 of 43 checks passed

wakonig deleted the ci/benchmark_workflow branch April 21, 2026 11:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ci: add benchmark workflow#1127

ci: add benchmark workflow#1127
wakonig merged 2 commits into
mainfrom
ci/benchmark_workflow

wakonig commented Apr 17, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 17, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Apr 17, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cappel89 left a comment

Uh oh!

wakonig commented Apr 21, 2026

Uh oh!

wakonig commented Apr 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

wakonig commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

github-actions Bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark comparison

Uh oh!

codecov Bot commented Apr 17, 2026

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cappel89 left a comment

Choose a reason for hiding this comment

Uh oh!

wakonig commented Apr 21, 2026

Uh oh!

wakonig commented Apr 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wakonig commented Apr 17, 2026 •

edited

Loading

github-actions Bot commented Apr 17, 2026 •

edited

Loading