audit: harden 10 agents against 9 RCA patterns (generic guardrails)#15
audit: harden 10 agents against 9 RCA patterns (generic guardrails)#15singabhi wants to merge 2 commits into
Conversation
5-Whys analysis of real-world failure patterns observed in evidence from a separate project. All 9 patterns confirmed with file-path or commit evidence; all insertions are generic — no project-specific nouns appear in agent files. Patterns addressed: P1 UI control shipped without backing implementation P2 Status entry marked complete without traceable artifact P3 User preference written but never read P4 New entity introduced; downstream surfaces not updated P5 UI contrast not verified across all visual contexts P6 Feature ships on one platform; other platforms not recorded P7 Deployment prerequisite enforced by warn-only CI gate P8 Mock-only unit suite passes; cross-surface path broken P9 Code review approved without checking P1–P8 Agent files edited (localized insertions only, marker: audit/naamjap-rca): agents/quality-testing/code-reviewer.md (P1 P3 P4 P5 P9) agents/development/full-stack-developer.md (P1 P3 P4) agents/development/mobile-developer.md (P1 P6) agents/business/product-manager.md (P1 P2) agents/specialization/documentation-expert.md (P2) agents/quality-testing/architect-review.md (P4) agents/development/ui-designer.md (P5) agents/quality-testing/qa-expert.md (P6 P8) agents/infrastructure/deployment-engineer.md (P7) agents/quality-testing/test-automator.md (P7 P8) New files: audits/2026-05-naamjap-rca.md RCA evidence + 5-Whys per pattern audits/stub-declaration-contract.md UI-logic pairing / stub contract audits/PR_BODY.md Draft PR body Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Review Summary by QodoHarden agent prompts against 9 recurring failure patterns with generic guardrails
WalkthroughsDescription• Hardened 10 agent prompts with 9 RCA-derived guardrails against recurring failure patterns • Added explicit checks for UI-implementation pairing, preference read sites, and cross-surface propagation • Introduced stub declaration contract and status accuracy rules to prevent silent defects • Created comprehensive RCA evidence document with 5-Whys analysis grounded in real commits Diagramflowchart LR
RCA["9 Failure Patterns<br/>P1-P9 Identified"]
RCA -->|"P1: UI without impl"| CodeReviewer["code-reviewer.md"]
RCA -->|"P2: Stale status"| DocExpert["documentation-expert.md"]
RCA -->|"P3: Pref no read"| FullStack["full-stack-developer.md"]
RCA -->|"P4: Isolated feature"| ArchReview["architect-review.md"]
RCA -->|"P5: No contrast check"| UIDesigner["ui-designer.md"]
RCA -->|"P6: Platform gap"| Mobile["mobile-developer.md"]
RCA -->|"P7: Warn-only gate"| Deploy["deployment-engineer.md"]
RCA -->|"P8: Mock-only tests"| TestAuto["test-automator.md"]
RCA -->|"P9: Review gap"| QAExpert["qa-expert.md"]
CodeReviewer --> Contract["stub-declaration-contract.md"]
DocExpert --> Contract
FullStack --> Contract
ArchReview --> Contract
UIDesigner --> Contract
Mobile --> Contract
Deploy --> Contract
TestAuto --> Contract
QAExpert --> Contract
Contract --> Evidence["2026-05-naamjap-rca.md"]
File Changes1. agents/business/product-manager.md
|
Code Review by Qodo
1.
|
There was a problem hiding this comment.
Code Review
This pull request introduces a comprehensive set of guardrails and technical standards across various agent prompt files to address nine recurring failure patterns identified in a recent audit. These patterns include UI-only stubs, stale status markers, and incomplete cross-surface data propagation. The feedback identifies a numbering conflict in the deployment engineer's guiding principles and suggests including a missing reference to artifact-backed completion (P2) in the code reviewer's integration surface sign-off checklist.
- deployment-engineer.md: renumber guiding principles (1-8) after P7 insertion caused duplicate number 2 - code-reviewer.md: add P2 (artifact-backed completion) to the P9 integration surface sign-off checklist; re-letter items (a)-(h) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
5-Whys analysis of real-world failure patterns observed in evidence from a separate project. All 9 patterns confirmed with file-path or commit evidence; all insertions are generic — no project-specific nouns appear in agent files.
Patterns addressed:
P1 UI control shipped without backing implementation
P2 Status entry marked complete without traceable artifact
P3 User preference written but never read
P4 New entity introduced; downstream surfaces not updated
P5 UI contrast not verified across all visual contexts
P6 Feature ships on one platform; other platforms not recorded
P7 Deployment prerequisite enforced by warn-only CI gate
P8 Mock-only unit suite passes; cross-surface path broken
P9 Code review approved without checking P1–P8
Agent files edited (localized insertions only, marker: audit/naamjap-rca):
agents/quality-testing/code-reviewer.md (P1 P3 P4 P5 P9)
agents/development/full-stack-developer.md (P1 P3 P4)
agents/development/mobile-developer.md (P1 P6)
agents/business/product-manager.md (P1 P2)
agents/specialization/documentation-expert.md (P2)
agents/quality-testing/architect-review.md (P4)
agents/development/ui-designer.md (P5)
agents/quality-testing/qa-expert.md (P6 P8)
agents/infrastructure/deployment-engineer.md (P7)
agents/quality-testing/test-automator.md (P7 P8)
New files:
audits/2026-05-naamjap-rca.md RCA evidence + 5-Whys per pattern
audits/stub-declaration-contract.md UI-logic pairing / stub contract
audits/PR_BODY.md Draft PR body