Round 95: cross-category workflow skill coverage library registry

johnteee · johnteee · commit 2485b20b7bd3 · 2026-06-25T17:14:09.000+08:00
Six-lens panel (GW–HA): DRY assert_category_workflow_skill_coverage,
frozen *_COVER_WORKFLOW_SKILLS per harness, and
test_workflow_skill_coverage_library_registry.py with 01-thinking exempt.
diff --git a/README.md b/README.md
@@ -21,7 +21,7 @@ Full library docs: [reflective-prompt-library/README.md](reflective-prompt-libra
 ## Governance
 
 - **Contributing:** [CONTRIBUTING.md](CONTRIBUTING.md) — quality gates, routing maintenance (R8–R12), `make all`
-- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–94)
+- **Panel record:** [multi-agent-panel-consensus](reflective-prompt-library/plans/multi-agent-panel-consensus-2026-06-25.md) — six-lens Socratic consensus (Rounds 1–95)
 - **Operator playbook:** [GLOSSARY.md](reflective-prompt-library/GLOSSARY.md) — Governance Maintenance Playbook
 
 The repository contains:
diff --git a/reflective-prompt-library/GLOSSARY.md b/reflective-prompt-library/GLOSSARY.md
@@ -337,7 +337,7 @@ Curated top-of-cheatsheet summary of high-confusion routing traps (ROUTE-002 hol
 
 ## Governance Maintenance Playbook / 治理維護手冊
 
-Ongoing upkeep after panel close (Rounds 1–94). Not agent instructions — operator checklist.
+Ongoing upkeep after panel close (Rounds 1–95). Not agent instructions — operator checklist.
 
 **Operational test:** Before router tuning, add fresh ROUTE-002/003 holdout phrases; run `make all`; record decisions in `PROJECT_KNOWLEDGE.md` Decision Index when governance surface changes.
 
@@ -367,3 +367,4 @@ Ongoing upkeep after panel close (Rounds 1–94). Not agent instructions — ope
 24. When adding composable prompts or editing `*_SKILL_LINKS` / `*_THINKING_LINKS`, keep per-category dict keys aligned with prompt globs and run `test_prompt_skill_links_library_registry.py` plus `test_all_*_prompts_have_skill_link` in `test_prompt_cross_links.py`.
 25. When adding composable prompts or editing eval_harness contract preambles, keep `PROMPT_CONTRACT_HEADINGS` / `PROMPT_EVAL_MIN_SCORE` in `prompt_eval_helpers.py` and run `test_prompt_contract_library_registry.py` plus per-category `test_*_prompts_eval_harness.py` guards.
 26. When editing composable prompt Purpose preambles, keep `Primary workflow surface(s)` / Supporting-lens lines via `assert_primary_workflow_surface_preamble` in `prompt_eval_helpers.py`; update `SUPPORTING_LENS_PRIMARY_SURFACE_BY_CATEGORY` for exemptions; run `test_prompt_primary_workflow_surface_library_registry.py` plus per-category `test_*_prompts_eval_harness.py` guards.
+27. When editing category workflow skill coverage tuples, keep frozen `*_COVER_WORKFLOW_SKILLS` in `test_*_prompts_eval_harness.py` aligned with `assert_category_workflow_skill_coverage`; `01-thinking` stays exempt (consumer graph); run `test_workflow_skill_coverage_library_registry.py`.
diff --git a/reflective-prompt-library/PROJECT_KNOWLEDGE.md b/reflective-prompt-library/PROJECT_KNOWLEDGE.md
@@ -74,6 +74,7 @@ deferred promotions are recurrence-gated — see [panel backlog](plans/multi-age
 
 - 2026-06-25 Round 85 panel — composable prompt Primary workflow surface preamble guards (`test_*_prompts_eval_harness.py`) + Supporting-lens exemption → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 94 panel — cross-category Primary workflow surface preamble library registry (`test_prompt_primary_workflow_surface_library_registry.py`, DRY `assert_primary_workflow_surface_preamble`) → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
+- 2026-06-25 Round 95 panel — cross-category workflow skill coverage library registry (`test_workflow_skill_coverage_library_registry.py`, DRY `assert_category_workflow_skill_coverage`) → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 93 panel — cross-category eval_harness contract heading library registry (`test_prompt_contract_library_registry.py`, DRY `PROMPT_CONTRACT_HEADINGS`) → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 92 panel — cross-category skill/thinking cross-link library registry (`test_prompt_skill_links_library_registry.py`) + missing `test_all_*_prompts_have_skill_link` guards → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
 - 2026-06-25 Round 91 panel — cross-category Human Review library registry (`test_human_review_library_registry.py`, `PROMPT_LIBRARY_CATEGORIES`) → [record](plans/multi-agent-panel-consensus-2026-06-25.md)
diff --git a/reflective-prompt-library/README.md b/reflective-prompt-library/README.md
@@ -30,7 +30,7 @@ Pick **Strictness L1–L6** first (`skills/reflective-dispatch/SKILL.md`, [GLOSS
 
 ## Governance Panel Record
 
-Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–94, options A–GV) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.
+Multi-agent Socratic consensus on project goals and the nine skills (Rounds 1–95, options A–HA) is recorded in [plans/multi-agent-panel-consensus-2026-06-25.md](plans/multi-agent-panel-consensus-2026-06-25.md). Run `make all` before claiming routing or governance changes are verified.
 
 ## Directory Map
 
diff --git a/reflective-prompt-library/plans/QUALITY_GATES_SUMMARY.md b/reflective-prompt-library/plans/QUALITY_GATES_SUMMARY.md
@@ -314,7 +314,7 @@ ROUTE-002 measures unseen phrasing separately from ROUTE-001. Round 7 (2026-06-2
 2. **ROUTE-001/002/003 in CI** — 128 + 102 + 53 paraphrases at 100% consistency (seeded fixtures); `validate_route_fixture.py` gates minimum coverage
 3. **Governance validators** — links, lint, governance metadata, PROJECT_KNOWLEDGE, benchmark fixture, skill examples
 4. **Harness policy docs** — CONTRIBUTING, AGENTS, SKILL_INSTALLATION, maintenance playbook
-5. **Doc anti-drift** — `test_routing_contract.py`, cheatsheet parity tests, `test_readme_governance.py`, `test_thinking_prompts_eval_harness.py`, `test_engineering_prompts_eval_harness.py`, `test_prompt_cross_links.py`, `test_core_prompts_eval_harness.py`, `test_human_review_library_registry.py`, `test_prompt_skill_links_library_registry.py`, `test_prompt_contract_library_registry.py`, `test_prompt_primary_workflow_surface_library_registry.py`, `test_agent_prompts_eval_harness.py`, `test_context_prompts_eval_harness.py`, `test_domain_prompts_eval_harness.py`, `test_repo_prompts_eval_harness.py`, `test_validate_governance.py`, `test_validate_links.py`, `test_lint_skills.py`, `test_skill_module_contract.py` (Escalation subsection + Trigger/Methods/Output/Never; 630+ pytest anti-drift suite in CI); reciprocal thinking-lens ↔ skill checks and `00-core` + composable `Primary workflow surface(s)` ↔ `*_SKILL_LINKS` parity in `test_prompt_cross_links.py` (including strict Primary workflow surfaces parity via `test_thinking_lens_primary_surfaces_match_consumer_graph`); Human Review + Escalation route-target guards in thinking/skill contract tests; composable `Primary workflow surface(s)` / Supporting-lens preamble guards and composable `## Human Review` preamble guards (route to `reflective-risk`) via `prompt_eval_helpers.assert_human_review_preamble` in `test_*_prompts_eval_harness.py`; frozen `*_HUMAN_REVIEW_REQUIRED` / `*_HUMAN_REVIEW_EXEMPT` set parity across all prompt categories (Round 90); library-wide contract heading registry (`PROMPT_CONTRACT_HEADINGS`, Round 93)
+5. **Doc anti-drift** — `test_routing_contract.py`, cheatsheet parity tests, `test_readme_governance.py`, `test_thinking_prompts_eval_harness.py`, `test_engineering_prompts_eval_harness.py`, `test_prompt_cross_links.py`, `test_core_prompts_eval_harness.py`, `test_human_review_library_registry.py`, `test_prompt_skill_links_library_registry.py`, `test_prompt_contract_library_registry.py`, `test_prompt_primary_workflow_surface_library_registry.py`, `test_workflow_skill_coverage_library_registry.py`, `test_agent_prompts_eval_harness.py`, `test_context_prompts_eval_harness.py`, `test_domain_prompts_eval_harness.py`, `test_repo_prompts_eval_harness.py`, `test_validate_governance.py`, `test_validate_links.py`, `test_lint_skills.py`, `test_skill_module_contract.py` (Escalation subsection + Trigger/Methods/Output/Never; 640+ pytest anti-drift suite in CI); reciprocal thinking-lens ↔ skill checks and `00-core` + composable `Primary workflow surface(s)` ↔ `*_SKILL_LINKS` parity in `test_prompt_cross_links.py` (including strict Primary workflow surfaces parity via `test_thinking_lens_primary_surfaces_match_consumer_graph`); Human Review + Escalation route-target guards in thinking/skill contract tests; composable `Primary workflow surface(s)` / Supporting-lens preamble guards and composable `## Human Review` preamble guards (route to `reflective-risk`) via `prompt_eval_helpers.assert_human_review_preamble` in `test_*_prompts_eval_harness.py`; frozen `*_HUMAN_REVIEW_REQUIRED` / `*_HUMAN_REVIEW_EXEMPT` set parity across all prompt categories (Round 90); library-wide contract heading registry (`PROMPT_CONTRACT_HEADINGS`, Round 93); workflow skill coverage registry (`*_COVER_WORKFLOW_SKILLS`, Round 95)
 
 ### Ongoing maintenance (not blockers)
 
@@ -384,4 +384,4 @@ Phase 1 quality-gate tooling and documentation are **complete**. Routing consist
 - ✅ Benchmark fixture gate plus optional manual benchmark runs
 - ✅ Research-backed design decisions
 
-The project is positioned to grow sustainably with quality discipline built in from the start. **No open implementation blockers** remain from panel Rounds 1–94; work is recurrence-gated maintenance per playbook. The next measurable quality target is **holdout expansion before router tuning** and optional manual baseline-vs-skill benchmark runs — not shipping new core skills without promotion evidence.
+The project is positioned to grow sustainably with quality discipline built in from the start. **No open implementation blockers** remain from panel Rounds 1–95; work is recurrence-gated maintenance per playbook. The next measurable quality target is **holdout expansion before router tuning** and optional manual baseline-vs-skill benchmark runs — not shipping new core skills without promotion evidence.
diff --git a/reflective-prompt-library/plans/tests/prompt_eval_helpers.py b/reflective-prompt-library/plans/tests/prompt_eval_helpers.py
@@ -118,3 +118,13 @@ def assert_primary_workflow_surface_preamble(
             f"{prompt_path.name} Purpose should list Primary workflow surface(s)"
         )
 
+def assert_category_workflow_skill_coverage(
+    prompts: tuple[Path, ...],
+    required_skills: tuple[str, ...],
+    category_label: str,
+) -> None:
+    """Category corpus must mention each required workflow skill at least once."""
+    text = "\n".join(p.read_text(encoding="utf-8") for p in prompts)
+    for skill in required_skills:
+        assert skill in text, f"{category_label} should reference {skill}"
+
diff --git a/reflective-prompt-library/plans/tests/test_agent_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_agent_prompts_eval_harness.py
@@ -9,7 +9,7 @@
 sys.path.insert(0, str(Path(__file__).parent))
 
 from eval_harness import EvalHarness  # noqa: E402
-from prompt_eval_helpers import assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
+from prompt_eval_helpers import assert_category_workflow_skill_coverage, assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
 
 REQUIRED_HEADINGS = PROMPT_CONTRACT_HEADINGS
 MIN_SCORE = PROMPT_EVAL_MIN_SCORE
@@ -18,6 +18,13 @@
 REPO_ROOT = str(Path(__file__).parent.parent.parent.parent)
 
 AGENT_PROMPTS = tuple(sorted(AGENT_DIR.glob("*.md")))
+AGENT_COVER_WORKFLOW_SKILLS = (
+    "reflective-dispatch",
+    "reflective-spec-plan",
+    "reflective-review",
+    "reflective-handoff-retro",
+    "reflective-research",
+)
 AGENT_PROMPTS_WITH_HUMAN_REVIEW = prompts_with_human_review(AGENT_PROMPTS)
 AGENT_HUMAN_REVIEW_REQUIRED = frozenset({
     "agent-scaffold-provenance.md",
@@ -62,15 +69,9 @@ def test_agent_prompts_reference_workflow_skills():
 
 
 def test_agent_prompts_cover_agent_workflow_surfaces():
-    text = "\n".join(p.read_text(encoding="utf-8") for p in AGENT_PROMPTS)
-    for skill in (
-        "reflective-dispatch",
-        "reflective-spec-plan",
-        "reflective-review",
-        "reflective-handoff-retro",
-        "reflective-research",
-    ):
-        assert skill in text, f"04-agent should reference {skill}"
+    assert_category_workflow_skill_coverage(
+        AGENT_PROMPTS, AGENT_COVER_WORKFLOW_SKILLS, "04-agent"
+    )
 
 
 def test_agent_prompts_have_workflow_surface_preamble_line():
diff --git a/reflective-prompt-library/plans/tests/test_context_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_context_prompts_eval_harness.py
@@ -9,7 +9,7 @@
 sys.path.insert(0, str(Path(__file__).parent))
 
 from eval_harness import EvalHarness  # noqa: E402
-from prompt_eval_helpers import assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
+from prompt_eval_helpers import assert_category_workflow_skill_coverage, assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
 
 REQUIRED_HEADINGS = PROMPT_CONTRACT_HEADINGS
 MIN_SCORE = PROMPT_EVAL_MIN_SCORE
@@ -18,6 +18,12 @@
 REPO_ROOT = str(Path(__file__).parent.parent.parent.parent)
 
 CONTEXT_PROMPTS = tuple(sorted(CONTEXT_DIR.glob("*.md")))
+CONTEXT_COVER_WORKFLOW_SKILLS = (
+    "reflective-dispatch",
+    "reflective-brief",
+    "reflective-handoff-retro",
+    "reflective-research",
+)
 CONTEXT_PROMPTS_WITH_HUMAN_REVIEW = prompts_with_human_review(CONTEXT_PROMPTS)
 CONTEXT_HUMAN_REVIEW_REQUIRED = frozenset({
     "context-handoff.md",
@@ -60,14 +66,9 @@ def test_context_prompts_reference_workflow_skills():
 
 
 def test_context_prompts_cover_context_workflow_surfaces():
-    text = "\n".join(p.read_text(encoding="utf-8") for p in CONTEXT_PROMPTS)
-    for skill in (
-        "reflective-dispatch",
-        "reflective-brief",
-        "reflective-handoff-retro",
-        "reflective-research",
-    ):
-        assert skill in text, f"03-context should reference {skill}"
+    assert_category_workflow_skill_coverage(
+        CONTEXT_PROMPTS, CONTEXT_COVER_WORKFLOW_SKILLS, "03-context"
+    )
 
 
 def test_context_prompts_have_primary_workflow_surfaces_line():
diff --git a/reflective-prompt-library/plans/tests/test_core_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_core_prompts_eval_harness.py
@@ -13,7 +13,7 @@
     PROMPT_CONTRACT_HEADINGS,
     PROMPT_EVAL_MIN_SCORE,
     assert_primary_workflow_surface_preamble,
-    assert_prompt_contract_headings,  # noqa: E402
+    assert_category_workflow_skill_coverage, assert_prompt_contract_headings,  # noqa: E402
     assert_human_review_exempt_have_no_preamble_section,
     assert_human_review_preamble,
     assert_human_review_required_matches_detection,
@@ -28,6 +28,10 @@
 REPO_ROOT = str(Path(__file__).parent.parent.parent.parent)
 
 CORE_PROMPTS = tuple(sorted(CORE_DIR.glob("*.md")))
+CORE_COVER_WORKFLOW_SKILLS = (
+    "reflective-brief",
+    "reflective-dispatch",
+)
 CORE_HUMAN_REVIEW_REQUIRED = frozenset({
     "core-full.md",
     "core-minimal.md",
@@ -72,9 +76,9 @@ def test_core_prompts_reference_workflow_skills():
 
 
 def test_core_prompts_cover_brief_and_dispatch():
-    text = "\n".join(p.read_text(encoding="utf-8") for p in CORE_PROMPTS)
-    assert "reflective-brief" in text
-    assert "reflective-dispatch" in text
+    assert_category_workflow_skill_coverage(
+        CORE_PROMPTS, CORE_COVER_WORKFLOW_SKILLS, "00-core"
+    )
 
 
 def test_core_prompts_have_primary_workflow_surfaces_line():
diff --git a/reflective-prompt-library/plans/tests/test_domain_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_domain_prompts_eval_harness.py
@@ -9,7 +9,7 @@
 sys.path.insert(0, str(Path(__file__).parent))
 
 from eval_harness import EvalHarness  # noqa: E402
-from prompt_eval_helpers import assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
+from prompt_eval_helpers import assert_category_workflow_skill_coverage, assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
 
 REQUIRED_HEADINGS = PROMPT_CONTRACT_HEADINGS
 MIN_SCORE = PROMPT_EVAL_MIN_SCORE
@@ -18,6 +18,13 @@
 REPO_ROOT = str(Path(__file__).parent.parent.parent.parent)
 
 DOMAIN_PROMPTS = tuple(sorted(DOMAIN_DIR.glob("*.md")))
+DOMAIN_COVER_WORKFLOW_SKILLS = (
+    "reflective-risk",
+    "reflective-research",
+    "reflective-brief",
+    "reflective-spec-plan",
+    "reflective-review",
+)
 DOMAIN_PROMPTS_WITH_HUMAN_REVIEW = prompts_with_human_review(DOMAIN_PROMPTS)
 DOMAIN_HUMAN_REVIEW_REQUIRED = frozenset({
     "creative-template.md",
@@ -60,15 +67,9 @@ def test_domain_prompts_reference_workflow_skills():
 
 
 def test_domain_prompts_cover_domain_workflow_surfaces():
-    text = "\n".join(p.read_text(encoding="utf-8") for p in DOMAIN_PROMPTS)
-    for skill in (
-        "reflective-risk",
-        "reflective-research",
-        "reflective-brief",
-        "reflective-spec-plan",
-        "reflective-review",
-    ):
-        assert skill in text, f"05-domain should reference {skill}"
+    assert_category_workflow_skill_coverage(
+        DOMAIN_PROMPTS, DOMAIN_COVER_WORKFLOW_SKILLS, "05-domain"
+    )
 
 
 def test_domain_prompts_have_primary_workflow_surfaces_line():
diff --git a/reflective-prompt-library/plans/tests/test_engineering_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_engineering_prompts_eval_harness.py
@@ -9,7 +9,7 @@
 sys.path.insert(0, str(Path(__file__).parent))
 
 from eval_harness import EvalHarness  # noqa: E402
-from prompt_eval_helpers import assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
+from prompt_eval_helpers import assert_category_workflow_skill_coverage, assert_human_review_preamble, assert_primary_workflow_surface_preamble, prompts_with_human_review, assert_human_review_required_matches_detection, assert_human_review_exempt_have_no_preamble_section, assert_human_review_sets_partition, PROMPT_CONTRACT_HEADINGS, PROMPT_EVAL_MIN_SCORE, assert_prompt_contract_headings  # noqa: E402
 
 REQUIRED_HEADINGS = PROMPT_CONTRACT_HEADINGS
 MIN_SCORE = PROMPT_EVAL_MIN_SCORE
@@ -18,6 +18,12 @@
 REPO_ROOT = str(Path(__file__).parent.parent.parent.parent)
 
 ENGINEERING_PROMPTS = tuple(sorted(ENGINEERING_DIR.glob("*.md")))
+ENGINEERING_COVER_WORKFLOW_SKILLS = (
+    "reflective-brief",
+    "reflective-spec-plan",
+    "reflective-implement",
+    "reflective-review",
+)
 ENGINEERING_PROMPTS_WITH_HUMAN_REVIEW = prompts_with_human_review(ENGINEERING_PROMPTS)
 ENGINEERING_HUMAN_REVIEW_REQUIRED = frozenset({
     "code-reviewer.md",
@@ -62,14 +68,9 @@ def test_engineering_prompts_reference_workflow_skills():
 
 def test_engineering_prompts_cover_core_workflows():
     """At least one prompt per implement/review/spec-plan/brief surface."""
-    text = "\n".join(p.read_text(encoding="utf-8") for p in ENGINEERING_PROMPTS)
-    for skill in (
-        "reflective-brief",
-        "reflective-spec-plan",
-        "reflective-implement",
-        "reflective-review",
-    ):
-        assert skill in text, f"02-engineering should reference {skill}"
+    assert_category_workflow_skill_coverage(
+        ENGINEERING_PROMPTS, ENGINEERING_COVER_WORKFLOW_SKILLS, "02-engineering"
+    )
 
 
 def test_engineering_prompts_have_primary_workflow_surfaces_line():
diff --git a/reflective-prompt-library/plans/tests/test_glossary_structure.py b/reflective-prompt-library/plans/tests/test_glossary_structure.py
diff --git a/reflective-prompt-library/plans/tests/test_readme_governance.py b/reflective-prompt-library/plans/tests/test_readme_governance.py
diff --git a/reflective-prompt-library/plans/tests/test_repo_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_repo_prompts_eval_harness.py
diff --git a/reflective-prompt-library/plans/tests/test_thinking_prompts_eval_harness.py b/reflective-prompt-library/plans/tests/test_thinking_prompts_eval_harness.py
diff --git a/reflective-prompt-library/plans/tests/test_workflow_skill_coverage_library_registry.py b/reflective-prompt-library/plans/tests/test_workflow_skill_coverage_library_registry.py