Skip to content

Build: Bench reporter surfaces wins#180

Merged
jlukic merged 7 commits intomainfrom
bench/surface-wins
May 5, 2026
Merged

Build: Bench reporter surfaces wins#180
jlukic merged 7 commits intomainfrom
bench/surface-wins

Conversation

@jlukic
Copy link
Copy Markdown
Member

@jlukic jlukic commented May 5, 2026

The bench bot surfaces regression-from-peak signals but does not show WIN and TIED-PEAK, only in the JSON in the test runner.

For perf work that iterates, the affirmative signal matters as much as the regression signal. "This iteration is the new peak" is what tells an agent the change worked.

This PR mirrors the existing Regressions from peak section with New peaks for the affirmative case.

Changes

  • New peaks surface as a headline N new peaks count and a New peaks table mirroring the regressions section.
  • Drift flags render symmetrically on WIN rows when peak and current iterations had different baselines.
  • Regressions table units unified to %.
  • Harness-only commits no longer appear as bisect or credit candidates.

Risk

0/10 — internal CI infrastructure. No public code touched.

@vercel
Copy link
Copy Markdown

vercel Bot commented May 5, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
semantic-next Ready Ready Preview, Comment May 5, 2026 1:00pm
1 Skipped Deployment
Project Deployment Actions Updated (UTC)
mcp Ignored Ignored Preview May 5, 2026 1:00pm

Request Review

@jlukic jlukic changed the title Build: Bench reporter — surface WINs, unify units, filter harness commits Bug: Bench reporter surfaces wins May 5, 2026
@jlukic jlukic changed the title Bug: Bench reporter surfaces wins Build: Bench reporter surfaces wins May 5, 2026
jlukic added 2 commits May 5, 2026 08:32
The verb 'improved' already encodes direction; 'improved -25%' read as
contradicting itself. Use Math.abs so the cell renders 'improved 25%' to
mirror the regression side's 'regressed +25%' framing.
The pp initialism reads as jargon (no engineer says "pp" aloud); standardise
on % everywhere in comments. Em-dashes in PR-added prose comments swap to
period or colon depending on whether the clause continues a thought or
introduces a list.
@jlukic jlukic merged commit 8f33fab into main May 5, 2026
7 of 8 checks passed
@jlukic jlukic deleted the bench/surface-wins branch May 5, 2026 12:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant