perf(worker): Optimize flake processing by batching testruns#874

Open
sentry[bot] wants to merge 1 commit into main from seer/perf/batch-flake-processing-GSZgm7

Conversation


sentry[bot] commented Apr 20, 2026

Fixes WORKER-Y97. The issue: the process_single_upload function executes an N+1 query for testruns, causing excessive database load and task timeouts.

  • Modified get_testruns to accept a list of upload IDs, enabling batch retrieval of testruns.
  • Removed the process_single_upload function, integrating its logic into process_flakes_for_commit.
  • Refactored process_flakes_for_commit to fetch all relevant testruns across multiple uploads in a single query.
  • Consolidated flake processing logic within process_flakes_for_commit, eliminating the per-upload iteration.
  • Improved efficiency by performing a single bulk_update for all processed testruns.
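
The batched fetch described above can be sketched in plain Python. The in-memory TESTRUNS list and this get_testruns signature are hypothetical stand-ins for the real Django query (which would use a single upload_id__in filter instead of one query per upload):

```python
# Hypothetical in-memory stand-in for the testruns table; the real code
# issues one Django query filtered by upload_id__in=upload_ids.
TESTRUNS = [
    {"id": 1, "upload_id": 10, "timestamp": 1, "outcome": "pass"},
    {"id": 2, "upload_id": 11, "timestamp": 2, "outcome": "failure"},
    {"id": 3, "upload_id": 10, "timestamp": 3, "outcome": "failure"},
]

def get_testruns(upload_ids):
    """One batched fetch for every upload, replacing one query per upload (N+1)."""
    wanted = set(upload_ids)
    return sorted(
        (t for t in TESTRUNS if t["upload_id"] in wanted),
        key=lambda t: t["timestamp"],
    )

runs = get_testruns([10, 11])
print([t["id"] for t in runs])  # → [1, 2, 3] — one "query" regardless of upload count
```

With N uploads the old code issued N queries; the batched version always issues one, which is what removes the timeout pressure.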

This fix was generated by Seer in Sentry, triggered automatically. 👁️ Run ID: 13573144

Not quite right? Click here to continue debugging with Seer.

Legal Boilerplate

Look, I get it. The entity doing business as "Sentry" was incorporated in the State of Delaware in 2015 as Functional Software, Inc. In 2022 this entity acquired Codecov and as result Sentry is going to need some rights from me in order to utilize my contributions in this PR. So here's the deal: I retain all rights, title and interest in and to my contributions, and by keeping this boilerplate intact I confirm that Sentry can use, modify, copy, and redistribute my contributions, under Sentry's choice of terms.


Note

Medium Risk
Refactors flake processing to batch-fetch and bulk-update Testruns across uploads. This could subtly change processing order or coverage if query semantics differ (e.g., empty upload sets), but it is otherwise a contained performance change.

Overview
Optimizes commit-level flake detection by removing per-upload testrun fetching and instead retrieving all recent Testruns for the commit’s relevant uploads in a single upload_id__in query.

Consolidates the per-upload processing loop into process_flakes_for_commit, logging upload IDs directly and performing one bulk_update of testrun outcomes after processing.
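
The consolidated shape of process_flakes_for_commit might look like the sketch below. All names and the stand-in stubs are illustrative (the real code uses Django's QuerySet.filter(upload_id__in=...) and bulk_update); the flake-state logic itself is elided:

```python
def get_testruns_batch(upload_ids):
    # Stand-in for Testrun.objects.filter(upload_id__in=upload_ids)
    data = [
        {"id": 1, "upload_id": 10, "outcome": "pass"},
        {"id": 2, "upload_id": 11, "outcome": "failure"},
    ]
    return [t for t in data if t["upload_id"] in set(upload_ids)]

def bulk_update(testruns):
    # Stand-in for Testrun.objects.bulk_update(testruns, ["outcome"])
    pass

def process_flakes_for_commit(upload_ids):
    """Fetch once, process all testruns, write once."""
    runs = get_testruns_batch(upload_ids)  # single upload_id__in query
    updated = []
    for run in runs:                       # per-upload loop is gone
        if run["outcome"] == "failure":
            run["outcome"] = "flaky_fail"  # real flake-state logic elided
            updated.append(run)
    bulk_update(updated)                   # one write for all testruns
    return updated
```

The key design point is that both the read and the write happen exactly once per commit, instead of once per upload.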

Reviewed by Cursor Bugbot for commit ef6cfa4. Bugbot is set up for automated code reviews on this repo. Configure here.

@@ -79,38 +78,18 @@ def handle_failure(
testrun.outcome = "flaky_fail"



Bug: Processing testruns globally by timestamp, instead of per-upload, can alter flake state calculations when testrun timestamps from different uploads overlap, leading to incorrect flake counts.
Severity: MEDIUM

Suggested Fix

To preserve the original processing logic while retaining the performance benefit of a single query, first fetch all testruns ordered by timestamp. Then, group the testruns by upload_id in memory. Finally, iterate through the uploads in a deterministic order (e.g., by upload_id) and process the testruns for each upload, ensuring the processing order remains consistent with the previous behavior.
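
That suggestion — one batched fetch, then in-memory grouping so uploads are still processed one at a time in a deterministic order — could be sketched like this (function and key names are hypothetical):

```python
from collections import defaultdict

def order_runs_per_upload(testruns):
    """Group a single batched fetch by upload_id, then emit uploads in a
    deterministic order so processing matches the old per-upload behaviour."""
    by_upload = defaultdict(list)
    for run in testruns:                 # input already ordered by timestamp
        by_upload[run["upload_id"]].append(run)
    ordered = []
    for upload_id in sorted(by_upload):  # deterministic upload order
        ordered.extend(by_upload[upload_id])
    return ordered
```

This keeps the single-query performance win while ensuring a pass from a later upload can no longer be interleaved ahead of a failure from an earlier one.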

Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI
agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's
not valid.

Location: apps/worker/services/test_analytics/ta_process_flakes.py#L80

Potential issue: The logic was changed to process testruns from all associated uploads
in a single batch, ordered globally by timestamp. Previously, testruns were processed
sequentially for each upload. This change in ordering can lead to incorrect flake
statistics. If a 'pass' testrun from a later upload has an earlier timestamp than a
'failure' testrun from an earlier upload, the pass may be processed first. This can
cause a flake to be prematurely marked as resolved (e.g., by reaching 30 passes) and
deleted, only for the subsequent failure to create a new, separate flake record with
reset counts, leading to inaccurate analytics.

Did we get this right? 👍 / 👎 to inform future reviews.


sentry Bot commented Apr 20, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.25%. Comparing base (0ad8a0c) to head (ef6cfa4).
✅ All tests successful. No failed tests found.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #874      +/-   ##
==========================================
- Coverage   92.25%   92.25%   -0.01%     
==========================================
  Files        1307     1307              
  Lines       48017    48011       -6     
  Branches     1636     1636              
==========================================
- Hits        44299    44293       -6     
  Misses       3407     3407              
  Partials      311      311              
Flag               Coverage           Δ
workerintegration  58.55% <6.66%>     +<0.01% ⬆️
workerunit         90.38% <100.00%>   -0.01% ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.


codecov-notifications Bot commented Apr 20, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

📢 Thoughts on this report? Let us know!

