feat: wire H4 policy and RBAC in shadow mode (Sprint 2)

johnteee · cursoragent · johnteee · commit 9f39baf3e47d · 2026-06-10T10:39:02.000+08:00
Add governance h4_integration for approval-path policy shadow logging
and subagent launch RBAC (shadow default, enforce via env). Defer
consensus_validation per ADR 0029; update receipts and wiring watch-list.

Constraint: policy shadow never blocks; RBAC enforce opt-in only
Tested: test_h4_shadow_wiring.py; test_validate_wiring.py; run_test_tier.py --tier smoke (159 passed)
Confidence: high
Co-authored-by: Cursor &lt;cursoragent@cursor.com&gt;
diff --git a/docs/adr/0029-consensus-validation-deferred.md b/docs/adr/0029-consensus-validation-deferred.md
@@ -0,0 +1,41 @@
+# ADR 0029: Consensus Validation Deferred Behind Approval Queue
+
+## Status
+
+Accepted — 2026-06-10
+
+**Expiry review:** 2026-12-10 (re-score whether `consensus_validation` should gate destructive actions)
+
+## Context
+
+Horizon H4 delivered `teaagent/consensus_validation.py` (~600 lines) with passing
+tests but no production import path (ENG-R1). Sprint 2 wired policy engine and
+RBAC in shadow/enforce mode. Consensus validation overlaps with:
+
+- Existing centralized subagent approval queue (ADR 0022)
+- Federated swarm consensus in `teaagent/consensus.py` (ADR 0019)
+
+Shipping a third consensus surface without wiring invites doc⇄reality drift.
+
+## Decision
+
+**Defer** wiring `consensus_validation` into the destructive-action path until
+2026-12-10. Until then:
+
+1. `consensus_validation` remains labeled `experimental — unwired`.
+2. Destructive actions continue to flow through the existing approval queue and
+   JIT approval coordinator — no duplicate consensus gate.
+3. WDA-006 acceptance is met by this ADR plus the wiring validator watch-list.
+
+## Consequences
+
+- Positive: avoids parallel consensus systems; Sprint 2 scope stays bounded.
+- Negative: multi-agent consensus claims must not cite `consensus_validation` as live.
+- Follow-up: on expiry, choose **wire behind approval queue** or **delete/quarantine**
+  with import-graph evidence.
+
+## References
+
+- [Work Direction Decomposition (WDA-006)](../plans/work-direction-decomposition-2026-06-10.md)
+- [Engineering Critique Refresh (ENG-R1)](../analysis/engineering-critique-refresh-2026-06-10.md)
+- ADR 0019, ADR 0022
diff --git a/docs/adr/README.md b/docs/adr/README.md
@@ -31,6 +31,7 @@ This directory contains all Architecture Decision Records (ADRs) for the TeaAgen
 | 0023 | Strict Plan-Before-Write Enforcement | Accepted and Implemented | 2026-05-29 | - |
 | 0024 | Automated Memory Invalidation | Accepted and Implemented | 2026-05-29 | - |
 | 0025 | Shared ChatSessionController for Chat Surfaces | Accepted and Implemented | 2026-06-01 | 2026-06-04 13:18:00 +0800 |
+| 0029 | Consensus Validation Deferred Behind Approval Queue | Accepted | 2026-06-10 | 2026-12-10 (expiry review) |
 
 ## ADR Categories
 
diff --git a/docs/generated/docs-inventory.md b/docs/generated/docs-inventory.md
@@ -6,7 +6,7 @@
 Generated by `python3 scripts/generate_docs_inventory.py`.
 Do not edit this file manually — regenerate instead.
 
-**Markdown files:** 551
+**Markdown files:** 552
 
 | Path | Bytes | SHA256 (12) |
 | --- | ---: | --- |
@@ -39,6 +39,7 @@ Do not edit this file manually — regenerate instead.
 | `adr/0026-cli-execution-abstraction-layer.md` | 560 | `2498fb2f04a4` |
 | `adr/0027-context-bus-architecture.md` | 543 | `6fa1d2ced665` |
 | `adr/0028-tournament-swarm-architecture.md` | 594 | `ee8dec0fdb60` |
+| `adr/0029-consensus-validation-deferred.md` | 1587 | `8a2da40abc07` |
 | `adr/README.md` | 6552 | `83d807309b2d` |
 | `agent-mode-operator-guide.md` | 2778 | `25b258ab7bfe` |
 | `analysis/active-findings-status-ledger-2026-06-06.md` | 4724 | `34c514f544b8` |
@@ -448,7 +449,7 @@ Do not edit this file manually — regenerate instead.
 | `plans/ticket-plans/WDG-002-plan.md` | 1712 | `16cb2bb47cbc` |
 | `plans/ux-improvement-roadmap-2026-05-31.md` | 15201 | `368416e593d4` |
 | `plans/work-direction-decomposition-2026-06-10.md` | 10371 | `cba4dd33a15d` |
-| `plans/work-direction-execution-index-2026-06-10.md` | 4993 | `6400e46356aa` |
+| `plans/work-direction-execution-index-2026-06-10.md` | 5011 | `94a33014dd66` |
 | `plugin-skill-catalog.md` | 4118 | `8d42b8f0c492` |
 | `processes/breaking-changes.md` | 820 | `2a43f4d37b6c` |
 | `processes/community-presence.md` | 5009 | `f33f69b2e8ff` |
@@ -487,7 +488,7 @@ Do not edit this file manually — regenerate instead.
 | `reviews/project-state-critical-questioning-2026-06-04.md` | 7340 | `78b9b54c3a9c` |
 | `reviews/security-risk-assessment-2026-06-02.md` | 24112 | `4c9e2e00d001` |
 | `reviews/seven-control-loops-critical-questioning-2026-06-05.md` | 7531 | `ae1e34b8369d` |
-| `roadmap-status.md` | 19951 | `52f9b5a28fdf` |
+| `roadmap-status.md` | 19926 | `3c7c01ed0772` |
 | `run-evidence-and-audit-guide.md` | 1980 | `97b527c850b1` |
 | `security-whitepaper.md` | 9691 | `d65a19a755cb` |
 | `security/approval-abuse-cases-2026-06-02.md` | 1281 | `4c43296d1c66` |
diff --git a/docs/plans/work-direction-execution-index-2026-06-10.md b/docs/plans/work-direction-execution-index-2026-06-10.md
@@ -69,7 +69,7 @@ any time after S1 WDA-001 lands (needs honest module labels for concept audit).
 
 | Sprint | IDs | Notes |
 | --- | --- | --- |
-| S2 | WDA-002, WDA-003, WDA-006 | Shadow policy + RBAC; consensus ADR |
+| S2 | WDA-002, WDA-003, WDA-006 | **Closed** — shadow policy + RBAC enforce; ADR 0029 |
 | S3 | WDA-004, WDA-005, WDD-001, WDD-002 | Release gate CI; single-platform update proof |
 | S4 | WDC-002, WDC-003, WDC-004 | Three-concept onboarding; terminology freeze |
 | S5 | WDE-001, WDE-002, WDE-003, WDF-001, WDF-002 | Remote backend; root-module freeze |
diff --git a/docs/roadmap-status.md b/docs/roadmap-status.md
@@ -29,7 +29,7 @@ Provide a single source of truth for roadmap item status, ownership, confidence,
 | H1 | Daily operator loop | Setup, daily cockpit, plan, execute, approve, verify, recover, and remember are one coherent journey | governance | Complete | High | H2 | Journey acceptance tests pass across CLI/TUI baseline; acceptance tier 628 passed at `85109e4` (2026-06-10) |
 | H2 | Multi-surface continuity | CLI, TUI, IDE, dashboard, background, cloud, and gateway share one run-state contract | TBD | Partially fixed — M2 foundation wired | Medium | WDA-002 | M2 acceptance complete; full surface parity (IDE/dashboard/cloud) still open |
 | H3 | Ecosystem trust | MCP, plugins, skills, hooks, subagents, and automations are explainable, revocable, and testable | TBD | Partially fixed — M3 tests pass | Medium | WDC-002 | M3 acceptance complete; general-user trust onboarding simplification still open |
-| H4 | Durable team operations | Long-running and team workflows have durable execution, control-plane views, policy, audit, and cost attribution | TBD | Partially fixed — unwired | Low | WDA-002 | RBAC/policy/consensus modules exist but unwired; cockpit data source only wired H4 surface (ENG-R1) |
+| H4 | Durable team operations | Long-running and team workflows have durable execution, control-plane views, policy, audit, and cost attribution | TBD | Partially fixed — shadow wired | Low | WDA-004 | Policy/RBAC shadow-wired (WDA-002/003); consensus deferred (ADR 0029) |
 | H5 | Quality and eval loop | Prompt/runtime/model changes cannot silently degrade daily outcomes | TBD | Partially fixed — unwired | Low | WDA-004 | `context_health` wired via TUI; eval suite/release gate clusters unwired (ENG-R1) |
 | H6 | Packaging and adoption | Desktop/client-server and external-facing release channels have supply-chain, update, and support plans | TBD | Partially fixed — unwired | Low | WDA-005 | `update/*` package implemented but unwired; no single-platform proof yet |
 
diff --git a/scripts/run_test_tier.py b/scripts/run_test_tier.py
@@ -30,6 +30,7 @@
     str(_TESTS / 'test_phase5_context_bus.py'),
     str(_TESTS / 'test_governance_hardening.py'),
     str(_TESTS / 'test_validate_wiring.py'),
+    str(_TESTS / 'test_h4_shadow_wiring.py'),
     str(_TESTS / 'regression'),
 )
 
diff --git a/scripts/validate_wiring.py b/scripts/validate_wiring.py
@@ -28,8 +28,6 @@
 
 # H4/H5/H6 clusters from ENG-R1 — must be production-wired or explicitly labeled.
 WATCH_MODULES: tuple[str, ...] = (
-    'teaagent.rbac',
-    'teaagent.policy_engine',
     'teaagent.policy_routing',
     'teaagent.consensus_validation',
     'teaagent.release_gate',
diff --git a/teaagent/governance/__init__.py b/teaagent/governance/__init__.py
@@ -4,6 +4,13 @@
     AuditCompletenessReport,
     check_audit_completeness,
 )
+from teaagent.governance.h4_integration import (
+    H4GovernanceMode,
+    check_subagent_launch_rbac,
+    evaluate_approval_policy_shadow,
+    policy_governance_mode,
+    rbac_governance_mode,
+)
 from teaagent.governance.plan_gate import (
     WRITE_TOOLS,
     ReviewGate,
@@ -18,7 +25,12 @@
     'ToolLintIssue',
     'WRITE_TOOLS',
     'assert_write_allowed',
+    'H4GovernanceMode',
     'check_audit_completeness',
+    'check_subagent_launch_rbac',
+    'evaluate_approval_policy_shadow',
     'lint_registry',
+    'policy_governance_mode',
+    'rbac_governance_mode',
     'require_review_gate',
 ]
diff --git a/teaagent/governance/h4_integration.py b/teaagent/governance/h4_integration.py
@@ -0,0 +1,161 @@
+"""H4 governance shadow wiring (WDA-002 / WDA-003).
+
+Connects the policy engine and RBAC modules to production entry paths in
+shadow mode by default. RBAC may be switched to enforce via
+``TEAAGENT_H4_RBAC_MODE=enforce``.
+"""
+
+from __future__ import annotations
+
+import os
+from enum import Enum
+from pathlib import Path
+from typing import Any, Optional
+
+from teaagent.policy_engine import PolicyEffect, PolicyEngine, PolicyStore, PolicyType
+
+
+class H4GovernanceMode(str, Enum):
+    SHADOW = 'shadow'
+    ENFORCE = 'enforce'
+
+
+def _mode_from_env(var_name: str, *, default: H4GovernanceMode) -> H4GovernanceMode:
+    raw = os.environ.get(var_name, default.value).strip().lower()
+    if raw in {m.value for m in H4GovernanceMode}:
+        return H4GovernanceMode(raw)
+    return default
+
+
+def policy_governance_mode() -> H4GovernanceMode:
+    return _mode_from_env('TEAAGENT_H4_POLICY_MODE', default=H4GovernanceMode.SHADOW)
+
+
+def rbac_governance_mode() -> H4GovernanceMode:
+    return _mode_from_env('TEAAGENT_H4_RBAC_MODE', default=H4GovernanceMode.SHADOW)
+
+
+def _policy_engine_for_root(root: str | Path) -> PolicyEngine:
+    return PolicyEngine(PolicyStore(Path(root).resolve()))
+
+
+def record_h4_shadow_event(
+    audit: Any,
+    run_id: str,
+    *,
+    surface: str,
+    mode: H4GovernanceMode,
+    allowed: bool,
+    reason: str,
+    context: dict[str, Any],
+    enforced: bool,
+    details: Optional[list[dict[str, Any]]] = None,
+) -> None:
+    audit.record(
+        'h4_governance_shadow',
+        run_id,
+        surface=surface,
+        mode=mode.value,
+        allowed=allowed,
+        enforced=enforced,
+        reason=reason,
+        context=context,
+        details=details or [],
+    )
+
+
+def evaluate_approval_policy_shadow(
+    *,
+    workspace_root: str | Path | None,
+    audit: Any,
+    run_id: str,
+    tool_name: str,
+    arguments: dict[str, Any],
+    destructive: bool,
+    call_id: str,
+) -> bool:
+    """Evaluate approval policies and record a shadow receipt. Never blocks."""
+    if workspace_root is None:
+        return True
+
+    mode = policy_governance_mode()
+    context = {
+        'action': 'approve_tool',
+        'tool_name': tool_name,
+        'call_id': call_id,
+        'destructive': destructive,
+        'arguments': arguments,
+    }
+    engine = _policy_engine_for_root(workspace_root)
+    effect, details = engine.evaluate_with_explanation(
+        context,
+        policy_type=PolicyType.APPROVAL,
+    )
+    allowed = effect == PolicyEffect.ALLOW
+    denying = next(
+        (d for d in details if d.get('applies') and d.get('effect') == 'deny'),
+        None,
+    )
+    reason = (
+        f'Policy {denying["policy_id"]} would deny'
+        if denying
+        else 'Policy evaluation would allow'
+    )
+    record_h4_shadow_event(
+        audit,
+        run_id,
+        surface='approval',
+        mode=mode,
+        allowed=allowed,
+        reason=reason,
+        context=context,
+        enforced=False,
+        details=details,
+    )
+    return True
+
+
+def check_subagent_launch_rbac(
+    *,
+    workspace_root: str | Path,
+    audit: Any | None,
+    parent_run_id: str,
+    assignee: str,
+    def_name: str,
+    depth: int,
+) -> tuple[bool, str]:
+    """RBAC gate for subagent launch. Shadow by default; enforce when configured."""
+    from teaagent.rbac import Permission, RBACSystem
+
+    mode = rbac_governance_mode()
+    context = {
+        'action': 'launch_subagent',
+        'subagent': def_name,
+        'depth': depth,
+        'parent_run_id': parent_run_id,
+    }
+    rbac = RBACSystem(workspace_root)
+    allowed, reason = rbac.check_action_permission(
+        assignee,
+        'start_workflow',
+        context,
+    )
+    if audit is not None:
+        record_h4_shadow_event(
+            audit,
+            parent_run_id,
+            surface='subagent_launch',
+            mode=mode,
+            allowed=allowed,
+            reason=reason,
+            context={
+                **context,
+                'assignee': assignee,
+                'permission': Permission.START_WORKFLOW.value,
+            },
+            enforced=mode == H4GovernanceMode.ENFORCE and not allowed,
+            details=[],
+        )
+    if mode == H4GovernanceMode.ENFORCE and not allowed:
+        return False, reason
+    return True, reason
diff --git a/teaagent/policy_engine.py b/teaagent/policy_engine.py
@@ -1,6 +1,6 @@
 """Policy engine for collaboration rules and team operations.
 
-experimental — unwired
+Wired in shadow mode via ``teaagent.governance.h4_integration`` (WDA-002).
 
 This module provides the foundation for defining, storing, and evaluating
 policies for collaborative agent workflows, including role-based access
diff --git a/teaagent/rbac.py b/teaagent/rbac.py
@@ -1,6 +1,6 @@
 """Role-Based Access Control (RBAC) system.
 
-experimental — unwired
+Wired in shadow/enforce mode via ``teaagent.governance.h4_integration`` (WDA-003).
 
 This module provides role definitions, role assignment, and permission
 checking for collaborative agent workflows.
diff --git a/teaagent/run_receipt.py b/teaagent/run_receipt.py
@@ -185,6 +185,22 @@ def format_run_receipt(  # noqa: C901
             suffix = f' ({scope})' if scope else ''
             lines.append(f'  - {tool}: {decision}{suffix}')
 
+    if events:
+        shadow_events = [
+            event
+            for event in events
+            if event.get('event_type') == 'h4_governance_shadow'
+        ]
+        if shadow_events:
+            lines.append('H4 governance (shadow):')
+            for event in shadow_events[:10]:
+                payload = _safe_payload(event)
+                surface = payload.get('surface', '?')
+                allowed = payload.get('allowed', '?')
+                mode = payload.get('mode', 'shadow')
+                reason = payload.get('reason', '')
+                lines.append(f'  - {surface}: allowed={allowed} mode={mode} ({reason})')
+
     if bundle and bundle.routes:
         route = bundle.routes[-1]
         lines.append(
diff --git a/teaagent/runner/_approval_manager.py b/teaagent/runner/_approval_manager.py
@@ -2,6 +2,7 @@
 
 from __future__ import annotations
 
+from pathlib import Path
 from typing import Any, Optional
 
 from teaagent.audit import AuditLogger
@@ -26,10 +27,12 @@ def __init__(
         approval_policy: ApprovalPolicy,
         approval_handler: Optional[ApprovalHandler] = None,
         jit_state: Optional[JITApprovalState] = None,
+        workspace_root: Optional[Path] = None,
     ) -> None:
         self.approval_policy = approval_policy
         self.approval_handler = approval_handler
         self.jit_state = jit_state or JITApprovalState()
+        self.workspace_root = workspace_root
 
     def can_request_approval(self, destructive: bool) -> bool:
         """Check if approval can be requested for a tool call."""
@@ -77,6 +80,18 @@ def handle_approval_request(
 
         Returns True if approved, False if denied.
         """
+        from teaagent.governance.h4_integration import evaluate_approval_policy_shadow
+
+        evaluate_approval_policy_shadow(
+            workspace_root=self.workspace_root,
+            audit=audit,
+            run_id=run_id,
+            tool_name=approval_request.tool_name,
+            arguments=approval_request.arguments,
+            destructive=bool(approval_request.annotations.get('destructive')),
+            call_id=approval_request.call_id,
+        )
+
         pending_payload = approval_request.to_dict()
         pending_payload.pop('run_id', None)
         if reason_code is not None:
diff --git a/teaagent/runner/_core.py b/teaagent/runner/_core.py
@@ -152,6 +152,7 @@ def __init__(
             approval_policy=self.approval_policy,
             approval_handler=approval_handler,
             jit_state=jit_state,
+            workspace_root=workspace_root,
         )
         self._budget_prompt_handler = budget_prompt_handler
         self._budget_monitor = budget_monitor or BudgetMonitor(budget=self.budget)
diff --git a/teaagent/subagents/_manager.py b/teaagent/subagents/_manager.py
diff --git a/tests/test_h4_shadow_wiring.py b/tests/test_h4_shadow_wiring.py
diff --git a/tests/test_validate_wiring.py b/tests/test_validate_wiring.py

Original file line number	Diff line number	Diff line change
`@@ -30,6 +30,7 @@`
`30`	`30`	`str(_TESTS / 'test_phase5_context_bus.py'),`
`31`	`31`	`str(_TESTS / 'test_governance_hardening.py'),`
`32`	`32`	`str(_TESTS / 'test_validate_wiring.py'),`
	`33`	`+ str(_TESTS / 'test_h4_shadow_wiring.py'),`
`33`	`34`	`str(_TESTS / 'regression'),`
`34`	`35`	`)`
`35`	`36`