Commit acf0df4

Round 88: 00-core Human Review preamble guards
Add ## Human Review preambles routing to reflective-risk on six risk-bearing 00-core prompts; extend test_core_prompts_eval_harness with shared prompt_eval_helpers guards. GLOSSARY playbook step 20.
1 parent 4a6f8e9 commit acf0df4

15 files changed

Lines changed: 98 additions & 11 deletions

README.md

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ Full library docs: [reflective-prompt-library/README.md](reflective-prompt-libra
 ## Governance

 - **Contributing:** [CONTRIBUTING.md](CONTRIBUTING.md) — quality gates, routing maintenance (R8–R12), `make all`
-- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–87)
+- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–88)
 - **Operator playbook:** [GLOSSARY.md](reflective-prompt-library/GLOSSARY.md) — Governance Maintenance Playbook

 The repository contains:

reflective-prompt-library/00-core/core-full.md

Lines changed: 5 additions & 0 deletions
@@ -18,6 +18,11 @@ Canonical full English protocol for reflective engineering hosts. Primary workfl

 Every recommendation names what observation would prove it wrong.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ## Identity

 You are a Reflective Engineering Agent.

reflective-prompt-library/00-core/core-minimal.md

Lines changed: 5 additions & 0 deletions
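@@ -20,6 +20,11 @@ Shortest general-purpose reflective engineering opener. Primary workflow surface

 State what evidence would overturn the current framing before deeper work.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ```markdown
 Handle this task as a "Reflective Engineering Agent". First determine the real goal, then define assumptions, scope, inputs/outputs, failure conditions, acceptance criteria, and falsifiable tests. If the ambiguity is safe, state the assumptions explicitly and proceed; if the task involves architecture, security, privacy, data loss, money, or irreversible decisions, request Human Review. In the response, prioritize clean deliverables and do not output raw, unorganized reasoning. Structured reasoning sections (Goal/Assumptions/Socratic audit, etc.) are the required output format and are not hidden chain-of-thought.
 ```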

reflective-prompt-library/00-core/core-short.md

Lines changed: 5 additions & 0 deletions
@@ -20,6 +20,11 @@ Global short instruction surface for host custom instructions. Primary workflow

 Name one observation that would prove the recommended plan wrong before execution.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ```markdown
 You are a Reflective Engineering Agent. The core principle is: Doing the right thing > doing things right.

reflective-prompt-library/00-core/custom-instruction-en.md

Lines changed: 5 additions & 0 deletions
@@ -20,6 +20,11 @@ Length-limited English custom instruction distillate. Primary workflow surface:

 Name one check that would prove the answer wrong after delivery.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ```markdown
 Act as a Reflective Engineering Agent: Doing the right thing > doing things right. For non-trivial tasks, define goal, assumptions, scope, inputs/outputs, failure conditions, acceptance criteria, falsifiability, plan, implementation, validation, and self-check. Prefer tests, schemas, types, examples, and artifacts over vague prompt rules. If ambiguity is safe, state assumptions and proceed. If it affects architecture, security, privacy, data loss, cost, or irreversible decisions, request Human Review. Do not dump raw, unfiltered reasoning tokens. Structured reasoning sections (Goal/Assumptions/Socratic audit/etc.) are the required output format and are not hidden chain-of-thought. Provide clean deliverables and concise reasoning summaries.
 ```

reflective-prompt-library/00-core/custom-instruction-zh.md

Lines changed: 5 additions & 0 deletions
@@ -20,6 +20,11 @@ Length-limited Traditional Chinese custom instruction distillate. Primary workfl

 Name one check that would prove the answer wrong after delivery.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ```markdown
 Act as a "Reflective Engineering Agent": doing the right thing matters more than doing things right.

reflective-prompt-library/00-core/important-task-full.md

Lines changed: 5 additions & 0 deletions
@@ -21,6 +21,11 @@ High-rigor reflection for important decisions. Primary workflow surfaces: `refle

 State the observation or experiment that would overturn the recommended plan.

+## Human Review
+
+Escalate to `reflective-risk` with an explicit Human Review gate when the work implies irreversible or high-blast-radius action.
+
+
 ```markdown
 Handle this in Reflective Engineering Agent + Socratic Reviewer + Critical Thinking Auditor mode.

reflective-prompt-library/GLOSSARY.md

Lines changed: 2 additions & 1 deletion
@@ -337,7 +337,7 @@ Curated top-of-cheatsheet summary of high-confusion routing traps (ROUTE-002 hol

 ## Governance Maintenance Playbook / 治理維護手冊

-Ongoing upkeep after panel close (Rounds 1–87). Not agent instructions — operator checklist.
+Ongoing upkeep after panel close (Rounds 1–88). Not agent instructions — operator checklist.

 **Operational test:** Before router tuning, add fresh ROUTE-002/003 holdout phrases; run `make all`; record decisions in `PROJECT_KNOWLEDGE.md` Decision Index when governance surface changes.

@@ -360,3 +360,4 @@ Ongoing upkeep after panel close (Rounds 1–87). Not agent instructions — ope
 17. When editing composable prompts (`00-core`–`06-repo`), keep `Primary workflow surface(s)` / Supporting-lens preamble lines and run `test_*_prompts_eval_harness.py` primary-surface guards.
 18. When adding or editing composable prompts (`02-engineering`–`06-repo`) with `## Human Review`, keep preamble escalation routed to `reflective-risk` and run Human Review guards in `test_*_prompts_eval_harness.py` (exact heading match via `prompt_eval_helpers.py`).
 19. When editing Human Review guards, use `prompt_eval_helpers.assert_human_review_preamble` in all `test_*_prompts_eval_harness.py` files (thinking lenses + composable categories).
+20. When adding or editing risk-bearing `00-core/` prompts with `## Human Review`, keep preamble escalation routed to `reflective-risk` and run `test_core_prompts_eval_harness.py` Human Review guards via `prompt_eval_helpers.py`.
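
Playbook steps 18 to 20 describe a guard that checks for an exact `## Human Review` heading and for escalation routed to `reflective-risk`. This commit page does not show the body or signature of `prompt_eval_helpers.assert_human_review_preamble`, so the following is only a minimal sketch of what such a guard might look like. The function name `check_human_review_preamble`, the constants, and the rule that the preamble ends at the first code fence are all assumptions made for illustration, not the repository's implementation.

```python
HEADING = "## Human Review"          # exact heading the guard looks for
ROUTE_TARGET = "`reflective-risk`"   # skill the preamble must escalate to


def check_human_review_preamble(text: str) -> None:
    """Hypothetical guard: raise AssertionError unless the prompt preamble
    has an exact '## Human Review' heading whose section names
    `reflective-risk`."""
    lines = text.splitlines()

    # Assumption: the preamble is everything before the first code fence.
    fence = next(
        (i for i, ln in enumerate(lines) if ln.startswith("```")),
        len(lines),
    )
    preamble = lines[:fence]

    # Exact match: "### Human Review" or "## Human review" do not count.
    assert HEADING in preamble, f"missing exact {HEADING!r} heading in preamble"
    start = preamble.index(HEADING)

    # The section body runs until the next heading or the end of the preamble.
    body = []
    for ln in preamble[start + 1:]:
        if ln.startswith("#"):
            break
        body.append(ln)

    assert ROUTE_TARGET in " ".join(body), (
        f"Human Review preamble must route to {ROUTE_TARGET}"
    )
```

A test harness could call this once per risk-bearing prompt file, passing the file's text. A prompt whose heading is misspelled, sits inside the fenced prompt body, or omits `reflective-risk` from its section would fail with an `AssertionError`.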

reflective-prompt-library/PROJECT_KNOWLEDGE.md

Lines changed: 1 addition & 0 deletions
@@ -73,6 +73,7 @@ deferred promotions are recurrence-gated — see [panel backlog](plans/multi-age
 ## Decision Index

 - 2026-06-25 Round 85 panel — composable prompt Primary workflow surface preamble guards (`test_*_prompts_eval_harness.py`) + Supporting-lens exemption → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
+- 2026-06-25 Round 88 panel — `00-core` Human Review preamble guards on risk-bearing prompts + `test_core_prompts_eval_harness.py` → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 87 panel — Human Review helper DRY + GLOSSARY playbook step repair → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 86 panel — composable Human Review preamble guards + `reflective-risk` routing alignment → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 84 panel — `00-core` Primary workflow surface parity + primary-line trim → [record](plans/multi-agent-panel-consensus-2026-06-25.md)

reflective-prompt-library/README.md

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ Pick **Strictness L1–L6** first (`skills/reflective-dispatch/SKILL.md`, [GLOSS

 ## Governance Panel Record

-Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–87, options A–FO) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.
+Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–88, options A–FS) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.

 ## Directory Map
