feat: two-phase parallel candidate evaluator and batch_refine API by KRRT7 · Pull Request #2125 · codeflash-ai/codeflash

KRRT7 · 2026-05-07T01:43:23Z

This PR adds the core evaluation algorithm:

ParallelCandidateEvaluator with a two-phase design: Phase 1 runs behavioral correctness tests concurrently (one worktree slot per candidate), Phase 2 runs benchmarks sequentially (so timings are not skewed by CPU contention)
batch_refine endpoint on AiServiceClient

depends on #2124 (infrastructure).

this is PR 2/4 in a stack. review and merge in order:

feat: parallel evaluation infrastructure (worktree pool, async subprocess, shared types) #2124 — infrastructure
feat: two-phase parallel candidate evaluator and batch_refine API #2125 (this) — evaluator algorithm
feat: integrate parallel evaluator into FunctionOptimizer #2126 — optimizer integration
fix: race conditions, re-staging bug, and parallel evaluator test suite #2127 — bug fixes + tests

Phase 1 (concurrent): behavioral correctness tests run in parallel. Failed candidates release their worktree slot immediately.

Phase 2 (sequential): only passing candidates get benchmarked, one at a time, for accurate timing without CPU contention.

EvalFailure carries test diffs for repair context.
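The two phases described above can be sketched roughly as follows. This is a minimal illustration, not the PR's ParallelCandidateEvaluator: `evaluate_two_phase`, `run_behavior_tests`, `run_benchmark`, `EvalSuccess`, and the field names on `EvalFailure` are all hypothetical, and the worktree pool is modelled as a plain semaphore that every candidate releases at the end of Phase 1 (the real pool may hold slots differently).

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, List, Optional, Tuple


@dataclass
class EvalFailure:  # hypothetical fields
    candidate_id: str
    test_diff: str  # diff of expected vs. actual results, kept as repair context


@dataclass
class EvalSuccess:  # hypothetical type
    candidate_id: str
    runtime_ns: int


async def evaluate_two_phase(
    candidate_ids: List[str],
    run_behavior_tests: Callable[[str], Awaitable[Optional[str]]],  # returns a diff, or None on pass
    run_benchmark: Callable[[str], Awaitable[int]],  # returns a runtime in ns
    num_slots: int,
) -> Tuple[List[EvalSuccess], List[EvalFailure]]:
    slots = asyncio.Semaphore(num_slots)  # stand-in for the worktree pool

    async def phase1(cid: str) -> Tuple[str, Optional[str]]:
        # The slot is released as soon as the behavioral tests finish.
        async with slots:
            return cid, await run_behavior_tests(cid)

    # Phase 1: correctness tests run concurrently, bounded by the slot count.
    results = await asyncio.gather(*(phase1(cid) for cid in candidate_ids))
    failures = [EvalFailure(cid, diff) for cid, diff in results if diff is not None]
    passing = [cid for cid, diff in results if diff is None]

    # Phase 2: benchmarks run one at a time so they do not compete for CPU.
    successes = []
    for cid in passing:
        successes.append(EvalSuccess(cid, await run_benchmark(cid)))
    return successes, failures
```

Because `asyncio.gather` returns results in input order, the passing candidates are benchmarked in the order they were submitted, regardless of which tests finished first.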

Adds the API method for submitting multiple candidates for refinement in a single request — used by the parallel evaluator to dispatch refinement/repair after evaluation completes.
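As a rough illustration of the "multiple candidates in a single request" idea, the sketch below collects candidates into one JSON body and makes one call. The payload shape, the `RefineCandidate` fields, and the `post` callable are assumptions made for this example; the actual `batch_refine` signature and endpoint on AiServiceClient are not shown in this description.

```python
import json
from dataclasses import asdict, dataclass
from typing import Callable, List


@dataclass
class RefineCandidate:  # hypothetical payload item
    candidate_id: str
    source_code: str
    test_diff: str  # empty string when the candidate passed its tests


def batch_refine(candidates: List[RefineCandidate], post: Callable[[str], str]) -> str:
    """Send every candidate in one request body.

    `post` stands in for the HTTP call; it takes the JSON body and returns
    the raw response text.
    """
    body = json.dumps({"candidates": [asdict(c) for c in candidates]})
    return post(body)
```

Passing the transport in as a callable keeps the sketch free of any real URL or HTTP library, and makes the single-request behavior easy to check with a stub.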

slots=True was removed from the dataclasses to keep Python 3.9 compatibility: dataclass(slots=True) requires Python 3.10+.
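For context, the `slots` parameter of `dataclasses.dataclass` was added in Python 3.10, so passing it on 3.9 raises a `TypeError` at class definition time. This PR simply drops the argument. An alternative that keeps slots on newer interpreters is sketched below; `EvalResult` and `_DC_KWARGS` are hypothetical names used only for this example.

```python
import sys
from dataclasses import dataclass

# Only pass slots=True where the dataclass decorator accepts it (3.10+).
_DC_KWARGS = {"slots": True} if sys.version_info >= (3, 10) else {}


@dataclass(**_DC_KWARGS)
class EvalResult:  # hypothetical type
    candidate_id: str
    passed: bool
```

Removing the argument outright, as the commit does, is the simpler choice when the memory savings from slots are not essential.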

KRRT7 requested review from aseembits93 and misrasaurabh1 as code owners May 7, 2026 01:43

KRRT7 added 3 commits May 6, 2026 20:52

feat: add batch_refine endpoint to AiServiceClient

0d0716b

fix: remove slots=True for Python 3.9 compatibility

22736d3

KRRT7 force-pushed the parallel-eval/02-evaluator branch from ec123fd to 22736d3 Compare May 7, 2026 01:53

KRRT7 closed this May 7, 2026

KRRT7 wants to merge 3 commits into parallel-eval/01-infrastructure from parallel-eval/02-evaluator
