awesome-agentic-patterns/patterns/spec-as-test-feedback-loop.md at main · Fr-e-d/awesome-agentic-patterns

title

Spec-As-Test Feedback Loop

status

emerging

authors

Nikola Balic (@nibzard)

based_on

Jory Pestorious

Problem

Even in spec-first projects, implementations can drift as code evolves and the spec changes (or vice-versa). Silent divergence erodes trust.

Generate executable assertions directly from the spec (e.g., unit or integration tests) and let the agent:

This creates a continuous feedback loop ensuring specification and implementation remain synchronized.

Four-phase architecture:

Evidence Grade: medium
Most Valuable Findings:
- Production use at Anthropic (Constitutional AI), OpenAI (Evals), and LangChain
- Academic foundations in QuickCheck (property-based testing) and Design by Contract
- Effective when combined with Feature List as Immutable Contract
Unverified: Long-term impact on agent quality scores; most implementations are recent (2022-2024)

Pros:
- Catches drift early; prevents silent spec-implementation divergence
- Immune to "pass by deletion" when combined with immutable feature lists
- Provides measurable progress metrics (X/Y features passing)
- Survives session boundaries; test state persists across context loss
Cons:
- Heavy CI usage; false positives if spec wording is ambiguous
- Upfront spec investment required; overhead exceeds benefit for small/one-off tasks
- Test explosion risk without intelligent selection; spec churn creates test churn