TeaEntityLab
diff --git a/.pre-commit-config.yaml b/.pre-commit-config.yaml
Lines changed: 9 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 5 additions & 4 deletions
diff --git a/docs/INDEX.md b/docs/INDEX.md
Lines changed: 1 addition & 0 deletions
diff --git a/docs/adr/0032-run-event-taxonomy.md b/docs/adr/0032-run-event-taxonomy.md
Lines changed: 186 additions & 0 deletions
diff --git a/docs/adr/README.md b/docs/adr/README.md
Lines changed: 3 additions & 1 deletion
@@ -48,3 +48,12 @@ repos:
         entry: bash -c 'if [ "${TEAAGENT_PRECOMMIT_FULL:-0}" = 1 ]; then env UV_CACHE_DIR=.uv-cache uv run pytest -q; else env UV_CACHE_DIR=.uv-cache uv run pytest tests/test_p0_harness.py tests/test_surface_auth_hardening.py tests/test_policy.py tests/test_phase5_context_bus.py tests/test_governance_hardening.py tests/regression/ tests/acceptance/test_subagent_lineage_flow.py -q; fi'
         language: system
         pass_filenames: false
+      - id: check-docs-inventory
+        name: check-docs-inventory
+        # Fail the commit if docs/generated/docs-inventory.md is stale. --check is
+        # read-only: it never writes, so a failure forces the committer to run the
+        # generator and stage the result. (A regenerate-then-check entry would pass
+        # trivially while leaving the regenerated file unstaged.)
+        entry: python3 scripts/generate_docs_inventory.py --check
+        language: system
+        pass_filenames: false
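The read-only `--check` behaviour described in the hook comment can be sketched as follows. The internals of `scripts/generate_docs_inventory.py` are not shown in this diff, so `render_inventory` and `check_inventory` are hypothetical names and the bullet-list output format is an assumption made only for illustration.

```python
import tempfile
from pathlib import Path


def render_inventory(doc_paths: list[str]) -> str:
    """Hypothetical generator: one sorted bullet per document path."""
    return "".join(f"- {p}\n" for p in sorted(doc_paths))


def check_inventory(inventory_file: Path, doc_paths: list[str]) -> int:
    """Return 0 if the committed file matches fresh output, 1 if it is stale.

    Read-only: the file is compared against fresh output, never rewritten.
    """
    expected = render_inventory(doc_paths)
    actual = inventory_file.read_text() if inventory_file.exists() else ""
    return 0 if actual == expected else 1


with tempfile.TemporaryDirectory() as tmp:
    inventory = Path(tmp) / "docs-inventory.md"
    inventory.write_text("- docs/INDEX.md\n")

    # Inventory matches the document set: exit code 0.
    fresh = check_inventory(inventory, ["docs/INDEX.md"])
    # A new document appeared but the inventory was not regenerated: exit code 1.
    stale = check_inventory(
        inventory, ["docs/INDEX.md", "docs/adr/0032-run-event-taxonomy.md"]
    )
    # The check never touched the file.
    content_after = inventory.read_text()
```

Because the check only compares, a stale inventory fails the hook until the committer regenerates the file and stages it, which is the behaviour the comment asks for.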
@@ -1,11 +1,12 @@
 # TeaAgent
 
-> **Last reviewed:** 2026-06-06
+> **Last reviewed:** 2026-06-13
 > **Review trigger:** README feature claims, golden path, or provider count changes.
+> **Direction record:** [Harness-First Direction](docs/strategy/harness-first-direction-2026-06-13.md) (owner-operator persona, aspirational adoption)
 
-Governance-first agent harness for autonomous coding tasks. Thin orchestration layer with tool governance, state boundaries, audit logging, and destructive-tool approval.
+A personal, local-first governance harness for autonomous coding tasks — built by and for the owner-operator who maintains, uses, and audits his own runs. Thin orchestration layer with tool governance, state boundaries, audit logging, and destructive-tool approval.
 
-**TeaAgent is not** a generic IDE agent clone or hosted cloud delegate. It is a local-first harness you operate — with explicit permission modes, hash-chained audit logs, and verification commands a security reviewer can run. See [When Not to Use TeaAgent](docs/guides/when-not-to-use-teaagent.md) for honest non-fit cases.
+**TeaAgent is not** a generic IDE agent clone, enterprise multi-user platform, or hosted cloud delegate. It is a local-first harness you operate — with explicit permission modes, hash-chained audit logs, and verification commands a security reviewer can run. See [When Not to Use TeaAgent](docs/guides/when-not-to-use-teaagent.md) for honest non-fit cases.
 
 ## Governance-first harness
 
@@ -19,7 +20,7 @@ Governance-first agent harness for autonomous coding tasks. Thin orchestration l
 
 Trust model: [Trust and Audit Whitepaper](docs/governance/trust-and-audit-whitepaper.md). Enterprise NIST mapping: [Security Whitepaper](docs/security-whitepaper.md).
 
-**Start by persona:** [Solo CLI](docs/guides/getting-started-solo-cli.md) · [Team operator](docs/guides/getting-started-team-operator.md) · [Tool/plugin author](docs/guides/getting-started-tool-plugin-author.md) · [Security reviewer](docs/guides/getting-started-security-reviewer.md)
+**Getting started:** [Owner-operator quickstart](docs/guides/getting-started-solo-cli.md) · [Tool/plugin author](docs/guides/getting-started-tool-plugin-author.md) · [Security reviewer](docs/guides/getting-started-security-reviewer.md)
 
 ## What makes it different
 
 
@@ -58,6 +58,7 @@ governance ledgers should prefer the canonical set above.
 
 | Topic | Evidence package |
 | --- | --- |
+| **June 12 reflective intent critical review** | [Intent Critical Review and Worklist 2026-06-12](analysis/intent-critical-review-and-worklist-2026-06-12.md) |
 | **June 10 system critical review package (current)** | [System Critical Review Package 2026-06-10](analysis/system-critical-review-2026-06-10-INDEX.md) |
 | June 10 engineering critique refresh | [Engineering Architecture Critique Refresh](analysis/engineering-critique-refresh-2026-06-10.md) |
 | June 10 remote multi-agent readiness refresh | [Remote Multi-Agent Readiness Refresh](analysis/remote-multi-agent-readiness-refresh-2026-06-10.md) |
 
@@ -0,0 +1,186 @@
+# ADR 0032: Run Event Taxonomy and Event Spine
+
+## Status
+
+Accepted — owner-approved 2026-06-13 (unblocks M1: AuditLogger as consumer)
+
+## Date
+
+2026-06-13
+
+## Context
+
+Three parallel half-systems currently handle run-lifecycle events, making it difficult to reason about governance, audit, and receipts as a unified concern:
+
+1. **Audit strings** (`audit.record('run_started', ...)` etc.) — scattered call sites, implicit taxonomy of event names, consumed by receipts and evidence.
+2. **HookRegistry** (teaagent/hooks.py) — Claude-Code-compatible hook events (SessionStart, UserPromptSubmit, PreToolUse, PostToolUse, PreCompact, Stop, SubagentStop, SessionEnd), wired only at the tool boundary, carries veto semantics via HookError.
+3. **ContextBus** (teaagent/context_bus.py) — separate event mechanism for deltas.
+
+Meanwhile, every governance gate (approval, budget, plan, tool policy) is inlined in AgentRunner (runner/_core.py). This concentrates governance logic in one class and makes it hard to test governance independently. The control-loop ownership map (docs/architecture/control-loop-ownership-map-2026-06-11.md) identifies this as a core architectural pain point.
+
+## Decision
+
+Introduce a typed **run-lifecycle event spine** with explicit event taxonomy and two subscriber classes:
+
+### 1. RunEvent Type System
+
+Define a `RunEventType(str, Enum)` whose members are seeded from:
+- The union of existing audit event names (run_started, iteration_started, tool_call_completed, tool_call_failed, context_compacted, validation_started)
+- The run-lifecycle taxonomy from harness-first-direction §6.3 (plan_resolved, decision_received, tool_call_requested, budget_checkpoint, context_compacted, iteration_completed, final_validation, run_completed, run_failed, run_pending_approval, run_cancelled, receipt_emitted, session_start, session_end, etc.)
+
+Minimal M0 set for this spike:
+- `RUN_STARTED` — run begins
+- `ITERATION_STARTED` — iteration loop begins
+- `TOOL_CALL_REQUESTED` — tool call requested (before gates)
+- `TOOL_CALL_COMPLETED` — tool call succeeded
+- `TOOL_CALL_FAILED` — tool call errored
+- `RUN_COMPLETED` — run ends successfully
+- `RUN_FAILED` — run ends in failure
+
+(Extensible; the full taxonomy is listed at the end of this ADR and documented in code comments.)
+
+### 2. Event Spine Architecture
+
+**RunEvent dataclass** (frozen, immutable):
+```
+type: RunEventType
+run_id: str
+payload: Mapping[str, Any]  # typed payload; structure per event type
+seq: int  # monotonic sequence number per spine instance
+```
+
+**EventSpine class** (sync-first, in-process, deterministic):
+```
+register_interceptor(fn, *, name: str) -> None
+  # Callable[[RunEvent], None]; may raise to veto
+  # Interceptors run in registration order before consumers
+  # Exceptions propagate (veto semantics)
+
+register_consumer(fn, *, name: str) -> None
+  # Callable[[RunEvent], None]; never veto
+  # Consumers run after interceptors
+  # Exceptions are caught, logged, and isolated (never affect run)
+
+emit(event: RunEvent) -> None
+  # Fire an event: run interceptors in order, then consumers
+  # If any interceptor raises, propagate immediately (no further subscribers run)
+  # If any consumer raises, log and continue
+  # Return normally on success or after isolated consumer failure
+```
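A minimal Python sketch of the two signatures above, under stated assumptions: the ADR does not say who assigns `seq`, so here the spine stamps it at emit time on a copy of the frozen event; the enum string values and the logger name are also illustrative. It is not the contents of `teaagent/runner/_events.py`.

```python
from __future__ import annotations

import logging
from dataclasses import dataclass, field, replace
from enum import Enum
from typing import Any, Callable, Mapping

logger = logging.getLogger("event_spine_sketch")  # hypothetical logger name


class RunEventType(str, Enum):
    RUN_STARTED = "run_started"
    TOOL_CALL_REQUESTED = "tool_call_requested"
    RUN_COMPLETED = "run_completed"


@dataclass(frozen=True)
class RunEvent:
    type: RunEventType
    run_id: str
    payload: Mapping[str, Any] = field(default_factory=dict)
    seq: int = 0  # assumption: overwritten by the spine at emit time


Subscriber = Callable[[RunEvent], None]


class EventSpine:
    def __init__(self) -> None:
        self._interceptors: list[tuple[str, Subscriber]] = []
        self._consumers: list[tuple[str, Subscriber]] = []
        self._next_seq = 0

    def register_interceptor(self, fn: Subscriber, *, name: str) -> None:
        self._interceptors.append((name, fn))

    def register_consumer(self, fn: Subscriber, *, name: str) -> None:
        self._consumers.append((name, fn))

    def emit(self, event: RunEvent) -> None:
        self._next_seq += 1
        stamped = replace(event, seq=self._next_seq)
        # Interceptors run first, in registration order; an exception
        # propagates to the caller and no further subscriber runs (veto).
        for _name, fn in self._interceptors:
            fn(stamped)
        # Consumers are isolated: a failure is logged and the loop continues.
        for name, fn in self._consumers:
            try:
                fn(stamped)
            except Exception:
                logger.exception("consumer %r failed on %s", name, stamped.type.value)


def deny_shell(event: RunEvent) -> None:
    """Illustrative policy gate: veto any request for a tool named 'shell'."""
    if event.type is RunEventType.TOOL_CALL_REQUESTED and event.payload.get("tool") == "shell":
        raise PermissionError("shell denied")


def broken_consumer(event: RunEvent) -> None:
    raise RuntimeError("sink offline")


seen: list[tuple[int, str]] = []
spine = EventSpine()
spine.register_interceptor(deny_shell, name="policy")
spine.register_consumer(broken_consumer, name="broken")
spine.register_consumer(lambda e: seen.append((e.seq, e.type.value)), name="recorder")

spine.emit(RunEvent(RunEventType.RUN_STARTED, "run-1"))
try:
    spine.emit(RunEvent(RunEventType.TOOL_CALL_REQUESTED, "run-1", {"tool": "shell"}))
    vetoed = False
except PermissionError:
    vetoed = True
```

In this sketch the failing consumer does not stop the recorder from seeing `RUN_STARTED`, while the vetoed `TOOL_CALL_REQUESTED` never reaches any consumer. One consequence of stamping before the interceptors run is that a vetoed event still uses up a sequence number; whether that is desirable is a design choice the ADR leaves open.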
+
+### 3. Subscriber Semantics
+
+**Interceptors:**
+- Represent governance gates (plan validation, approval, budget, policy)
+- Run in declared order before any consumer sees the event
+- May raise any exception; a ToolPermissionError or similar is converted to a DenialReasonCode
+- Exception from interceptor halts the spine (veto)
+- Used to enforce hard constraints
+
+**Consumers:**
+- Represent audit, receipt building, evidence, ContextBus, webhook sinks
+- Run after all interceptors complete
+- Each wrapped in try/except (exception logged via logging module, never propagates)
+- Never affect the run (crash-safe)
+- Used for side effects and derived state
+
+### 4. HookRegistry Alignment
+
+Existing Claude-Code hook names (SessionStart, PreToolUse, PostToolUse, etc.) are preserved as **aliases** to RunEventType members where semantically equivalent (e.g., PRE_TOOL_USE ← PreToolUse). The public hook API (teaagent/hooks.py) will be re-homed onto the spine in a later migration step (M5).
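One way to picture the aliasing is a lookup table from hook names to enum members. This is a sketch only: `HOOK_ALIASES` and `resolve_hook_name` are hypothetical names, the enum string values are assumed, and only the five hooks that the taxonomy at the end of this ADR marks with an alias are mapped.

```python
from enum import Enum


class RunEventType(str, Enum):
    # String values are assumptions; the ADR only fixes the member names.
    SESSION_START = "session_start"
    PRE_TOOL_USE = "pre_tool_use"
    POST_TOOL_USE = "post_tool_use"
    PRE_COMPACT = "pre_compact"
    SESSION_END = "session_end"


# Hypothetical alias table: Claude-Code hook name -> spine event type.
HOOK_ALIASES: dict[str, RunEventType] = {
    "SessionStart": RunEventType.SESSION_START,
    "PreToolUse": RunEventType.PRE_TOOL_USE,
    "PostToolUse": RunEventType.POST_TOOL_USE,
    "PreCompact": RunEventType.PRE_COMPACT,
    "SessionEnd": RunEventType.SESSION_END,
}


def resolve_hook_name(name: str) -> RunEventType:
    """Map a hook name to its event type, failing loudly when none exists."""
    try:
        return HOOK_ALIASES[name]
    except KeyError:
        raise ValueError(f"no RunEventType alias for hook {name!r}") from None


resolved = resolve_hook_name("PreToolUse")
```

Hook names with no listed equivalent (UserPromptSubmit, Stop, SubagentStop) raise in this sketch instead of being guessed at; how M5 treats them is not specified here.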
+
+### 5. Compliance with ADR 0030
+
+New code lives inside the existing `teaagent/runner/` package (teaagent/runner/_events.py) — no new root module. The module freeze is respected.
+
+## Rationale
+
+- **Single contract**: One typed enum replaces three implicit taxonomies; claim-testable, refactorable, and extensible.
+- **Determinism**: Sync-first, in-process, no threads — deterministic for tests, safe for receipts.
+- **Veto clarity**: Interceptor ordering and exception semantics are explicit, enabling governance gates to be extracted without rewriting the runner.
+- **Gradual migration**: Dual-write (M0) allows the old audit.record() paths to coexist with new events, so the migration is strangler-safe.
+- **Test leverage**: Lifecycle tests can assert event sequences instead of implementation internals, decoupling tests from runner refactors.
+
+## Implementation
+
+### Phase M0 (this ADR, this spike)
+
+1. Define `RunEventType(str, Enum)` and `RunEvent` dataclass in teaagent/runner/_events.py.
+2. Define `EventSpine` class with register_interceptor, register_consumer, emit semantics.
+3. Add optional `event_spine: EventSpine | None` parameter to AgentRunner (default: fresh spine, no subscribers).
+4. At existing audit.record call sites, **dual-write**: emit corresponding RunEvent (audit calls unchanged).
+5. Lifecycle tests assert the event sequence for the five-minute-proof scenario.
+6. Acceptance tier stays green.
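Steps 4 and 5 can be sketched with stand-ins for the audit logger and the spine. `FakeAudit`, `RecordingSpine`, and `record_and_emit` are hypothetical helpers, and the sketch assumes each audit event name equals the enum value, which holds only for names the Decision section lists as existing audit events.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any, Mapping


class RunEventType(str, Enum):
    RUN_STARTED = "run_started"
    ITERATION_STARTED = "iteration_started"
    TOOL_CALL_COMPLETED = "tool_call_completed"


@dataclass(frozen=True)
class RunEvent:
    type: RunEventType
    run_id: str
    payload: Mapping[str, Any]
    seq: int = 0


class FakeAudit:
    """Stand-in for the existing audit logger; remembers (name, fields)."""

    def __init__(self) -> None:
        self.records: list[tuple[str, dict[str, Any]]] = []

    def record(self, name: str, **fields: Any) -> None:
        self.records.append((name, fields))


class RecordingSpine:
    """Stand-in for EventSpine that only remembers what was emitted."""

    def __init__(self) -> None:
        self.emitted: list[RunEvent] = []

    def emit(self, event: RunEvent) -> None:
        self.emitted.append(event)


def record_and_emit(audit: FakeAudit, spine: RecordingSpine,
                    event_type: RunEventType, run_id: str, **fields: Any) -> None:
    """Dual-write: keep the audit call unchanged, then emit the matching event."""
    audit.record(event_type.value, run_id=run_id, **fields)  # existing path
    spine.emit(RunEvent(event_type, run_id, dict(fields)))    # new path


# Simulated call sites for a tiny run.
audit, spine = FakeAudit(), RecordingSpine()
record_and_emit(audit, spine, RunEventType.RUN_STARTED, "run-1", task="demo")
record_and_emit(audit, spine, RunEventType.ITERATION_STARTED, "run-1", iteration=1)
record_and_emit(audit, spine, RunEventType.TOOL_CALL_COMPLETED, "run-1", tool="read_file")

# A lifecycle test asserts the sequence, not the runner's internals.
emitted_types = [event.type for event in spine.emitted]
audit_names = [name for name, _fields in audit.records]
```

A test written this way keeps passing when the runner is refactored, as long as the emitted sequence is unchanged.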
+
+### Future Phases (M1–M6)
+
+| Step | Change | Invariant |
+| --- | --- | --- |
+| M1 | AuditLogger becomes a consumer (serializes RunEvents to JSONL) | Byte-equivalent audit on proof scenario |
+| M2 | Receipts/evidence fold over event stream | Receipt completeness guaranteed structurally |
+| M3 | Plan gate moves to interceptor | Same denials, same reason codes |
+| M4 | Approval and budget gates to interceptors | Same semantics, extracted from runner |
+| M5 | HookRegistry re-homed onto spine; public hook API documented | Existing hook tests pass via aliases |
+| M6 | ContextBus + webhook sinks consume spine; inline emission paths deleted | No orphaned eventing modules |
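As a rough illustration of M1, a consumer that serializes events to JSONL might look like the sketch below. The record layout, key order, and the `make_jsonl_consumer` name are assumptions; the real audit format, including the hash chaining mentioned in the README, is not shown in this diff and is omitted here.

```python
import io
import json
from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable, Mapping, TextIO


class RunEventType(str, Enum):
    RUN_STARTED = "run_started"
    RUN_COMPLETED = "run_completed"


@dataclass(frozen=True)
class RunEvent:
    type: RunEventType
    run_id: str
    payload: Mapping[str, Any]
    seq: int


def make_jsonl_consumer(stream: TextIO) -> Callable[[RunEvent], None]:
    """Build a consumer that appends one JSON object per event to `stream`."""

    def consume(event: RunEvent) -> None:
        record = {
            "seq": event.seq,
            "type": event.type.value,
            "run_id": event.run_id,
            "payload": dict(event.payload),
        }
        # Sorted keys and fixed separators keep the bytes stable across runs,
        # a precondition for any byte-equivalence check.
        stream.write(json.dumps(record, sort_keys=True, separators=(",", ":")) + "\n")

    return consume


buffer = io.StringIO()
consume = make_jsonl_consumer(buffer)
consume(RunEvent(RunEventType.RUN_STARTED, "run-1", {"task": "demo"}, seq=1))
consume(RunEvent(RunEventType.RUN_COMPLETED, "run-1", {}, seq=2))
lines = buffer.getvalue().splitlines()
```

Registered through `register_consumer`, such a function would inherit the consumer guarantees from the spine: a write failure is logged and never affects the run.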
+
+## Consequences
+
+**Positive:**
+- Unified event contract enables incremental gate extraction without rewriting AgentRunner.
+- Governance gates become testable independently via lifecycle assertions.
+- Receipts/audit can be derived from a single immutable event stream (M2+), eliminating synthetic-vs-real gaps.
+- Hook ordering and error semantics are explicit and stable for the public API.
+
+**Negative:**
+- M0 dual-write adds ~5 lines per call site (acceptable; temporary until M1).
+- EventSpine is new infrastructure; must be proven correct before gates migrate to interceptors.
+- Full governance-gate extraction (M3–M4) is multi-phase and requires consecutive landing without behavioral changes (per stop-rule in strategy doc §6.4).
+
+## Alternatives Considered
+
+1. **Extend HookRegistry instead of creating EventSpine**: HookRegistry is Claude-Code-specific and tool-boundary-scoped; the spine covers the full run lifecycle and cannot be scoped to tools. Separate design avoids conflating concerns.
+
+2. **Async event sink**: Async sinks (queue-based consumers) would enable webhook delivery and distributed audit. Rejected at M0 for determinism: tests must not depend on timing. Async can be added at M2+ if friction evidence justifies it.
+
+3. **Fold events into context/observations**: Events would become observation slots instead of a separate spine. Rejected: observations are model-visible; governance events must be opaque to the model and ordered by the harness.
+
+## References
+
+- [Harness-First Direction §6](../strategy/harness-first-direction-2026-06-13.md#6-core-architecture-one-event-spine-gates-as-interceptors)
+- [Control-Loop Ownership Map §6.1](../architecture/control-loop-ownership-map-2026-06-11.md)
+- [ADR 0030: Root-Module Freeze](0030-root-module-freeze.md)
+- [ADR 0009: 5-Loop Governance System](0009-five-loop-governance.md)
+
+## Full Event Taxonomy (M0 + Planned)
+
+```
+RUN_STARTED              # Run begins; payload: run_id, task, model, etc.
+SESSION_START            # Session begins (alias: SessionStart)
+PLAN_RESOLVED            # Plan loaded/validated
+ITERATION_STARTED        # Iteration loop begins
+DECISION_RECEIVED        # Model returns a decision (tool call or final answer)
+TOOL_CALL_REQUESTED      # Tool call identified (before gates)
+TOOL_CALL_APPROVED       # Approval gate approved
+TOOL_CALL_DENIED         # Approval gate denied
+TOOL_CALL_COMPLETED      # Tool call succeeded
+TOOL_CALL_FAILED         # Tool call errored
+CONTEXT_COMPACTED        # Context compaction occurred
+BUDGET_CHECKPOINT        # Budget check (not veto; informational)
+ITERATION_COMPLETED      # Iteration loop ends
+FINAL_VALIDATION         # Final answer validation
+RUN_COMPLETED            # Run ends successfully
+RUN_FAILED               # Run ends in failure
+RUN_PENDING_APPROVAL     # Run paused for approval
+RUN_CANCELLED            # Run cancelled by user
+RECEIPT_EMITTED          # Receipt finalized
+SESSION_END              # Session ends (alias: SessionEnd)
+SKILL_LOAD               # Skill loaded
+MODEL_ROUTE              # Model routed (provider selection)
+GIT_SANDBOX_STARTED      # Sandbox workspace initialized
+GIT_SANDBOX_RESOLVED     # Sandbox resolved/cleaned
+UNDO_PERFORMED           # Undo action executed
+PRE_TOOL_USE             # Hook: before tool execution (alias: PreToolUse)
+POST_TOOL_USE            # Hook: after tool execution (alias: PostToolUse)
+PRE_COMPACT              # Hook: before context compaction (alias: PreCompact)
+```
+
+The M0 spike covers RUN_STARTED, ITERATION_STARTED, TOOL_CALL_REQUESTED, TOOL_CALL_COMPLETED, TOOL_CALL_FAILED, RUN_COMPLETED, RUN_FAILED. Extended events are added in later phases as gates migrate.
@@ -33,6 +33,7 @@ This directory contains all Architecture Decision Records (ADRs) for the TeaAgen
 | 0025 | Shared ChatSessionController for Chat Surfaces | Accepted and Implemented | 2026-06-01 | 2026-06-04 13:18:00 +0800 |
 | 0029 | Consensus Validation Deferred Behind Approval Queue | Accepted | 2026-06-10 | 2026-12-10 (expiry review) |
 | 0031 | Shadow Mode Exit Criteria | Proposed | 2026-06-12 | 2026-09-12 (expiry review) |
+| 0032 | Run Event Taxonomy and Event Spine | Accepted | 2026-06-13 | - |
 
 ## ADR Categories
 
@@ -48,13 +49,14 @@ This directory contains all Architecture Decision Records (ADRs) for the TeaAgen
 - **0007**: ANP Adapter Boundary - External federation boundary
 - **0008**: P4 Strategic Posture - Storage, TLS, P2P auth posture
 
-### Governance Hardening (0009, 0022-0024, 0029, 0031)
+### Governance Hardening (0009, 0022-0024, 0029, 0031-0032)
 - **0009**: 5-Loop Governance System - Comprehensive governance loops
 - **0022**: Centralized Approval Queue for Subagents - Batch approval management
 - **0023**: Strict Plan-Before-Write Enforcement - Plan validation
 - **0024**: Automated Memory Invalidation - Memory hygiene
 - **0029**: Consensus Validation Deferred Behind Approval Queue - Consensus gate deferral
 - **0031**: Shadow Mode Exit Criteria - Policy/RBAC shadow→enforce promotion path
+- **0032**: Run Event Taxonomy and Event Spine - Unified run-lifecycle event contract
 
 ### Multi-Agent & Swarm (0019)
 - **0019**: Phase 4 - Federated Swarm Consensus & Peer Attestations - Swarm coordination