Skip to content

Commit e30d358

Browse files
committed
Round 82: strict Primary workflow surfaces graph parity
Six-lens panel (EQ–ET) agreed to align thinking-lens Purpose preambles exactly with SKILL_THINKING_SOURCES consumers, add pytest guard, and move adjacent workflow notes to Scope. Governance synced to Round 82.
1 parent 0f09da3 commit e30d358

12 files changed

Lines changed: 85 additions & 14 deletions

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ Full library docs: [reflective-prompt-library/README.md](reflective-prompt-libra
2121
## Governance
2222

2323
- **Contributing:** [CONTRIBUTING.md](CONTRIBUTING.md) — quality gates, routing maintenance (R8–R12), `make all`
24-
- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–81)
24+
- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–82)
2525
- **Operator playbook:** [GLOSSARY.md](reflective-prompt-library/GLOSSARY.md) — Governance Maintenance Playbook
2626

2727
The repository contains:

reflective-prompt-library/01-thinking/counterargument.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ Use this to prevent excessive optimism, overengineering, or AI flattery.
44

55
## Purpose
66

7-
Stress-test optimism, overengineering, and AI flattery before committing resources. Primary workflow surfaces: `reflective-implement` for disputed implementation choices; `reflective-review` and `reflective-minimality` for critique and anti-bloat; escalate to `reflective-risk` when trust-boundary or blast-radius signals appear.
7+
Stress-test optimism, overengineering, and AI flattery before committing resources. Primary workflow surfaces: `reflective-implement` for disputed implementation choices; `reflective-review` and `reflective-minimality` for critique and anti-bloat.
88

99
## Scope
1010

reflective-prompt-library/01-thinking/socratic-reviewer.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,12 +4,13 @@ Suitable for requirements interviews, life decisions, product direction, busines
44

55
## Purpose
66

7-
Clarify the real question before choosing a direction. Primary workflow surfaces: `reflective-brief` for goal and assumption clarification; `reflective-dispatch` for routing; `reflective-research` for multi-voice synthesis; `reflective-handoff-retro` for session transfer; escalate to `reflective-spec-plan` when scope is clear enough to plan.
7+
Clarify the real question before choosing a direction. Primary workflow surfaces: `reflective-dispatch` for routing; `reflective-research` for multi-voice synthesis; `reflective-handoff-retro` for session transfer.
88

99
## Scope
1010

1111
- In scope: requirements interviews, product direction, technical selection, learning strategy, research question definition.
12-
- Out of scope: code implementation (`reflective-implement`), production risk gating (`reflective-risk`), source-backed external research (`reflective-research`).
12+
- Out of scope: code implementation (`reflective-implement`), production risk gating (`reflective-risk`), source-backed external research (`reflective-research`), ticket/spec drafting (`reflective-spec-plan` — use after dispatch when scope is clear).
13+
- Adjacent: pair with `reflective-brief` when assumptions are still open; dispatch selects strictness before spec or implement work.
1314

1415
## Acceptance Criteria
1516

reflective-prompt-library/01-thinking/why-what-how-done.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,12 +4,13 @@ Use this as the core gate prompt before committing to a direction.
44

55
## Purpose
66

7-
Gate a task through Why / What / How / Done before choosing strictness or workflow depth. Primary workflow surfaces: `reflective-brief` then `reflective-dispatch` for orchestration level selection; `reflective-spec-plan` when framing becomes ticket or spec work.
7+
Gate a task through Why / What / How / Done before choosing strictness or workflow depth. Primary workflow surfaces: `reflective-brief` for pre-commitment gating; `reflective-spec-plan` when framing becomes ticket or spec work.
88

99
## Scope
1010

1111
- In scope: pre-commitment checks on goal, scope, method, and completion evidence for a single task or feature.
1212
- Out of scope: post-hoc code review (`reflective-review`), handoff retros (`reflective-handoff-retro`), or detailed test implementation.
13+
- Adjacent: after brief framing, `reflective-dispatch` selects orchestration level before deeper spec or implement work.
1314

1415
## Acceptance Criteria
1516

reflective-prompt-library/GLOSSARY.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -337,7 +337,7 @@ Curated top-of-cheatsheet summary of high-confusion routing traps (ROUTE-002 hol
337337

338338
## Governance Maintenance Playbook / 治理維護手冊
339339

340-
Ongoing upkeep after panel close (Rounds 1–81). Not agent instructions — operator checklist.
340+
Ongoing upkeep after panel close (Rounds 1–82). Not agent instructions — operator checklist.
341341

342342
**Operational test:** Before router tuning, add fresh ROUTE-002/003 holdout phrases; run `make all`; record decisions in `PROJECT_KNOWLEDGE.md` Decision Index when governance surface changes.
343343

@@ -354,3 +354,4 @@ Ongoing upkeep after panel close (Rounds 1–81). Not agent instructions — ope
354354
11. When changing Module Contract subsections on workflow skills, keep `Escalation:` present and run `test_skill_module_contract.py`.
355355
12. When adding or editing `01-thinking/` lenses, keep `## Human Review` in the preamble (routes to `reflective-risk`) and run `test_thinking_prompts_eval_harness.py`.
356356
13. When editing workflow skill Escalation bullets, cite only frozen `reflective-*` skills; run `test_skill_module_contract.py` escalation route guard.
357+
14. When editing `01-thinking/` Purpose preambles, keep `Primary workflow surfaces` aligned exactly with `SKILL_THINKING_SOURCES` via `test_thinking_lens_primary_surfaces_match_consumer_graph`; put escalations and adjacent workflow notes in Scope or Human Review, not on the primary line.

reflective-prompt-library/PROJECT_KNOWLEDGE.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -75,6 +75,7 @@ deferred promotions are recurrence-gated — see [panel backlog](plans/multi-age
7575
> Pointers to the causal trail — plans, reflections, tests, commits. Detail is
7676
> not duplicated here; this is a map, not an archive.
7777
78+
- 2026-06-25 Round 82 panel — strict Primary workflow surfaces ↔ skill graph parity + preamble trim → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
7879
- 2026-06-25 Round 81 panel — thinking-lens Human Review preamble guards + Escalation route-target anti-drift → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
7980
- 2026-06-25 Round 80 panel — Module Contract Escalation anti-drift + thinking-lens preamble consumer guards → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
8081
- 2026-06-25 Round 79 panel — bidirectional thinking-lens ↔ workflow skill preamble cross-links + reciprocal pytest → [record](plans/multi-agent-panel-consensus-2026-06-25.md)

reflective-prompt-library/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ Pick **Strictness L1–L6** first (`skills/reflective-dispatch/SKILL.md`, [GLOSS
3030

3131
## Governance Panel Record
3232

33-
Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–81, options A–EP) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.
33+
Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–82, options A–ET) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.
3434

3535
## Directory Map
3636

reflective-prompt-library/plans/QUALITY_GATES_SUMMARY.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -314,7 +314,7 @@ ROUTE-002 measures unseen phrasing separately from ROUTE-001. Round 7 (2026-06-2
314314
2. **ROUTE-001/002/003 in CI** — 128 + 102 + 53 paraphrases at 100% consistency (seeded fixtures); `validate_route_fixture.py` gates minimum coverage
315315
3. **Governance validators** — links, lint, governance metadata, PROJECT_KNOWLEDGE, benchmark fixture, skill examples
316316
4. **Harness policy docs** — CONTRIBUTING, AGENTS, SKILL_INSTALLATION, maintenance playbook
317-
5. **Doc anti-drift**`test_routing_contract.py`, cheatsheet parity tests, `test_readme_governance.py`, `test_thinking_prompts_eval_harness.py`, `test_engineering_prompts_eval_harness.py`, `test_prompt_cross_links.py`, `test_core_prompts_eval_harness.py`, `test_agent_prompts_eval_harness.py`, `test_context_prompts_eval_harness.py`, `test_domain_prompts_eval_harness.py`, `test_repo_prompts_eval_harness.py`, `test_validate_governance.py`, `test_validate_links.py`, `test_lint_skills.py`, `test_skill_module_contract.py` (Escalation subsection + Trigger/Methods/Output/Never; 450+ pytest anti-drift suite in CI); reciprocal thinking-lens ↔ skill checks in `test_prompt_cross_links.py`; Human Review + Escalation route-target guards in thinking/skill contract tests
317+
5. **Doc anti-drift**`test_routing_contract.py`, cheatsheet parity tests, `test_readme_governance.py`, `test_thinking_prompts_eval_harness.py`, `test_engineering_prompts_eval_harness.py`, `test_prompt_cross_links.py`, `test_core_prompts_eval_harness.py`, `test_agent_prompts_eval_harness.py`, `test_context_prompts_eval_harness.py`, `test_domain_prompts_eval_harness.py`, `test_repo_prompts_eval_harness.py`, `test_validate_governance.py`, `test_validate_links.py`, `test_lint_skills.py`, `test_skill_module_contract.py` (Escalation subsection + Trigger/Methods/Output/Never; 450+ pytest anti-drift suite in CI); reciprocal thinking-lens ↔ skill checks in `test_prompt_cross_links.py` (including strict Primary workflow surfaces parity via `test_thinking_lens_primary_surfaces_match_consumer_graph`); Human Review + Escalation route-target guards in thinking/skill contract tests
318318

319319
### Ongoing maintenance (not blockers)
320320

@@ -384,4 +384,4 @@ Phase 1 quality-gate tooling and documentation are **complete**. Routing consist
384384
- ✅ Benchmark fixture gate plus optional manual benchmark runs
385385
- ✅ Research-backed design decisions
386386

387-
The project is positioned to grow sustainably with quality discipline built in from the start. **No open implementation blockers** remain from panel Rounds 1–81; work is recurrence-gated maintenance per playbook. The next measurable quality target is **holdout expansion before router tuning** and optional manual baseline-vs-skill benchmark runs — not shipping new core skills without promotion evidence.
387+
The project is positioned to grow sustainably with quality discipline built in from the start. **No open implementation blockers** remain from panel Rounds 1–82; work is recurrence-gated maintenance per playbook. The next measurable quality target is **holdout expansion before router tuning** and optional manual baseline-vs-skill benchmark runs — not shipping new core skills without promotion evidence.

reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md

Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2196,3 +2196,46 @@ User directive (repeat): review prompts, plans, skills, and Socratic/critical-th
21962196
**Resealed 2026-06-25** after **Round 81** (options EM–EP). Thinking-lens Human Review preambles complete; Escalation route-target anti-drift closed. Holdout expansion remains recurrence-gated maintenance.
21972197

21982198

2199+
---
2200+
2201+
## Round 82 — Strict Primary workflow surfaces graph parity (2026-06-25)
2202+
2203+
**Options EQ–ET** | Six-lens panel (Opus, Codex, Gemini, Composer, Sakana, GLM)
2204+
2205+
### Round 82 options
2206+
2207+
| ID | Proposal | Verdict |
2208+
| --- | --- | --- |
2209+
| EQ | Strict `Primary workflow surfaces``SKILL_THINKING_SOURCES` parity + preamble trim + pytest | **Agree** |
2210+
| ER | Expand skill Prompt Sources to match narrative overlisting | **Reject** |
2211+
| ES | ROUTE holdout expansion | **Defer** |
2212+
| ET | Router / tenth skill / benchmark CI | **Reject** |
2213+
2214+
### Round 82 verdict table
2215+
2216+
| ID | Option | Verdict | Action |
2217+
| --- | --- | --- | --- |
2218+
| EQ | Primary surfaces exact graph | **Agree** | trim preambles + `test_thinking_lens_primary_surfaces_match_consumer_graph` |
2219+
| ER | Expand graph to match prose | **Reject** | `SKILL_THINKING_SOURCES` stays authoritative from skill Prompt Sources |
2220+
| ES | Holdout expansion | **Defer** | maintenance |
2221+
| ET | Router/tenth skill/benchmark CI | **Reject** | no change |
2222+
2223+
**All roles agree.**
2224+
2225+
## Implemented Changes (Round 82)
2226+
2227+
- `01-thinking/counterargument.md`, `socratic-reviewer.md`, `why-what-how-done.md`: Primary workflow surfaces trimmed to graph consumers; adjacent workflow notes moved to Scope
2228+
- `plans/tests/test_prompt_cross_links.py`: `_primary_workflow_surfaces_skills` + `test_thinking_lens_primary_surfaces_match_consumer_graph`
2229+
- `GLOSSARY.md`: playbook Rounds 1–82; step 14 for strict primary-surfaces parity
2230+
- `QUALITY_GATES_SUMMARY.md`: primary-surfaces parity note; panel Rounds 1–82
2231+
- `PROJECT_KNOWLEDGE.md`: Decision Index Round 82 entry
2232+
- `README.md`, `reflective-prompt-library/README.md`, `test_readme_governance.py`: panel round 82 sync
2233+
2234+
## Verification (Round 82)
2235+
2236+
- `make all`: pytest + ROUTE-001/002/003 100%
2237+
2238+
## Panel status (updated)
2239+
2240+
**Resealed 2026-06-25** after **Round 82** (options EQ–ET). Thinking-lens Primary workflow surfaces now match the inverted skill graph exactly. Holdout expansion remains recurrence-gated maintenance.
2241+

reflective-prompt-library/plans/tests/test_glossary_structure.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -30,10 +30,10 @@ def test_round_boundary_terms_present(glossary_text: str):
3030
assert heading in glossary_text, f"missing glossary section: {heading}"
3131

3232

33-
def test_maintenance_playbook_references_round_81(glossary_text: str):
33+
def test_maintenance_playbook_references_round_82(glossary_text: str):
3434
playbook = glossary_text.split("## Governance Maintenance Playbook", 1)[1]
35-
assert "Rounds 1–81" in playbook or "Rounds 1-80" in playbook
36-
assert "Rounds 1–80" not in playbook and "Rounds 1-79" not in playbook
35+
assert "Rounds 1–82" in playbook or "Rounds 1-81" in playbook
36+
assert "Rounds 1–81" not in playbook and "Rounds 1-80" not in playbook
3737

3838

3939

0 commit comments

Comments
 (0)