Parent: #1411
Goal
Add a benchmark/test gate proving compression reduces context size without reducing pass rate on representative PDD tasks.
Prototype links
Acceptance criteria
- Benchmark compares full tests, AST tests, AST+contracts, full few-shot, and compressed few-shot.
- Reports pass rate, token counts, wall-clock time, output churn, and missing-contract failures.
- Fails CI or benchmark gate if compression loses required contract symbols or regresses pass rate on frozen fixtures.
- Includes at least one fixture that previously failed without contract-source preservation.
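
A minimal sketch of how the gate in the criteria above might be checked, assuming each benchmark variant produces a small result record. The names `VariantResult`, `gate_failures`, and `token_reduction`, and all numbers in the usage lines, are hypothetical illustrations, not part of the PDD codebase.

```python
from dataclasses import dataclass, field


@dataclass
class VariantResult:
    """Hypothetical per-variant benchmark record (not an existing PDD type)."""
    name: str            # e.g. "full few-shot" or "compressed few-shot"
    passed: int          # frozen fixtures that passed
    total: int           # frozen fixtures that were run
    context_tokens: int  # tokens of context sent to the model
    missing_contract_symbols: set = field(default_factory=set)


def token_reduction(baseline: VariantResult, candidate: VariantResult) -> float:
    """Fraction of context tokens saved relative to the baseline."""
    if baseline.context_tokens == 0:
        return 0.0
    return 1.0 - candidate.context_tokens / baseline.context_tokens


def gate_failures(baseline: VariantResult, candidate: VariantResult) -> list:
    """Return the reasons the gate should fail; an empty list means it passes."""
    failures = []
    # Condition 1: compression dropped a required contract symbol.
    if candidate.missing_contract_symbols:
        symbols = ", ".join(sorted(candidate.missing_contract_symbols))
        failures.append(f"{candidate.name}: missing contract symbols: {symbols}")
    # Condition 2: pass rate regressed on the frozen fixtures.
    # Cross-multiplying compares the two rates without float rounding.
    if candidate.passed * baseline.total < baseline.passed * candidate.total:
        failures.append(
            f"{candidate.name}: pass rate {candidate.passed}/{candidate.total} "
            f"is below baseline {baseline.passed}/{baseline.total}"
        )
    return failures


# Illustrative values only; these are not measured results.
full = VariantResult("full few-shot", passed=10, total=10, context_tokens=12000)
compressed = VariantResult("compressed few-shot", passed=10, total=10, context_tokens=4000)
print(gate_failures(full, compressed))  # → []
```

A CI job could exit non-zero whenever `gate_failures` returns a non-empty list, and report `token_reduction` alongside the other metrics. Wall-clock time and output churn are left out of this sketch.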
Parent epic: #873.
Migrated from gltanaka/pdd#1418 (originally filed 2026-05-08 by gltanaka). gltanaka/pdd is deprecated.