TeaEntityLab
diff --git a/docs/adr/0032-run-event-taxonomy.md b/docs/adr/0032-run-event-taxonomy.md
Lines changed: 17 additions & 11 deletions
diff --git a/docs/generated/docs-inventory.md b/docs/generated/docs-inventory.md
Lines changed: 2 additions & 2 deletions
diff --git a/docs/plans/adr-0032-m1-m6-work-plan-2026-06-13.md b/docs/plans/adr-0032-m1-m6-work-plan-2026-06-13.md
Lines changed: 13 additions & 11 deletions
@@ -149,10 +149,10 @@ recorded here as the durable contract (see the per-phase work-logs under
 - **M5** — **hook OBSERVABILITY** is typed onto the spine; **hook EXECUTION stays
   in the tool-dispatch layer** (`teaagent/tools.py`) because PreToolUse/PostToolUse
   mutate in-flight args/results, and the session-lifecycle hooks are unwired.
-  *Scope note (review F1): the 5 hook audit events are typed and reader-visible
-  but NOT folded into evidence — no extractor reads them and `RunEvidenceBundle`
-  has no hooks field. Surfacing hook activity in receipts is backlog (needs a
-  bundle field + extractor), not delivered by M5.*
+  *Scope note (review F1, RESOLVED 2026-06-14): the 5 hook audit events are typed
+  and now folded into evidence via `RunEvidenceBundle.hook_activity` +
+  `extract_hook_activity` — hook veto/mutation appears in the bundle/receipt and
+  folds through the typed stream.*
 - **M6** — **evidence/receipts fold over the typed stream** (`build_evidence_from_events`,
   now the production path inside `build_run_evidence_bundle`). Fixed a real
   lossiness gap (typed `RunEvent` now carries `created_at`).
@@ -179,13 +179,19 @@ EventSpine.emit ──(register_audit_consumer, M1)──▶ AuditLogger.record
   they are not lifecycle buses. The guard's allowlist names every sanctioned
   event-delivery surface. The taxonomy-closure check proves no `RunEventType` is
   orphaned from the audit record.
-- *Guard scope (review F3): the orphan-bus check is a **heuristic tripwire**, not
-  a proof. It keys on specific high-signal method names (`register_consumer`,
-  `register_interceptor`, `add_sink`, `on_event`, `publish_delta`,
-  `subscribe_deltas`) and deliberately excludes generic `publish`/`emit` to avoid
-  noise — so a bus shaped like `RunEventStream` (`subscribe`+`emit`) is not
-  detected. It catches the common shapes and forces a conscious allowlist
-  decision for them; it does not guarantee detection of every conceivable bus.*
+- A third check (review F2) AST-discovers the audit `event_type` literals the
+  evidence extractors read (`run_evidence.py`, `proof_of_use.py`) and asserts
+  each is in `RunEventType` — so the M6 FOLD-T002 cutover (which drops unmapped
+  types) can never silently lose evidence as extractors evolve.
+- *Guard scope (review F3, narrowed 2026-06-14): the orphan-bus check keys on
+  high-signal method names (`register_consumer`, `register_interceptor`,
+  `add_sink`, `on_event`, `publish_delta`, `subscribe_deltas`) **plus the
+  `subscribe`+`emit` pub/sub pair** (so the `RunEventStream` shape is now caught).
+  It remains a heuristic — a bus using entirely novel naming could still evade
+  it — but it catches every shape that occurs in-tree and forces a conscious
+  allowlist decision. The F2 discovery similarly resolves `==`/`in` against
+  string literals and module-level `frozenset`/`set` constants; exotic dynamic
+  lookups are out of scope.*
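
The orphan-bus heuristic in the guard-scope note can be illustrated with a minimal sketch. This is not the code in `scripts/validate_event_spine_wiring.py`: the function names, the per-class granularity, and the class-name allowlist are assumptions made for illustration. Only the method names are taken from the note.

```python
import ast

# High-signal method names listed in the guard-scope note.
HIGH_SIGNAL = frozenset({
    "register_consumer", "register_interceptor", "add_sink",
    "on_event", "publish_delta", "subscribe_deltas",
})
# Generic names that only count as a bus signal when they appear together.
PUBSUB_PAIR = frozenset({"subscribe", "emit"})


def find_bus_like_classes(source: str) -> dict[str, set[str]]:
    """Map each class that looks like an event bus to the names that flagged it."""
    found: dict[str, set[str]] = {}
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.ClassDef):
            continue
        methods = {
            item.name
            for item in node.body
            if isinstance(item, (ast.FunctionDef, ast.AsyncFunctionDef))
        }
        signals = methods & HIGH_SIGNAL
        if PUBSUB_PAIR <= methods:  # both halves of the pair must be present
            signals |= PUBSUB_PAIR
        if signals:
            found[node.name] = signals
    return found


def unsanctioned(source: str, allowlist: set[str]) -> list[str]:
    """Return bus-like classes that are missing from the (hypothetical) allowlist."""
    return sorted(name for name in find_bus_like_classes(source) if name not in allowlist)


# Hypothetical input: only RunEventStream is a name taken from the note.
SAMPLE = '''
class RunEventStream:
    def subscribe(self, callback): ...
    def emit(self, event): ...

class PlainLogger:
    def emit(self, record): ...

class NovelBus:
    def add_sink(self, sink): ...
'''

print(unsanctioned(SAMPLE, allowlist={"RunEventStream"}))
```

In this sketch a class with `emit` alone is ignored, which mirrors the note's point that generic names are only treated as a signal when `subscribe` and `emit` occur as a pair. A bus with entirely novel method names would pass undetected, which is the residual gap the note acknowledges.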
 
 **Lesson:** the spine's realized value is the **typed read side** (evidence →
 receipts) and a single typed lifecycle path — not wholesale relocation of
 
@@ -42,7 +42,7 @@ Do not edit this file manually — regenerate instead.
 | `adr/0029-consensus-validation-deferred.md` | working | 1587 | `8a2da40abc07` |
 | `adr/0030-root-module-freeze.md` | working | 1297 | `bee25422e85f` |
 | `adr/0031-shadow-mode-exit-criteria.md` | working | 3598 | `46a9a0d5eaac` |
-| `adr/0032-run-event-taxonomy.md` | working | 14751 | `259c84511705` |
+| `adr/0032-run-event-taxonomy.md` | working | 15143 | `dbeac8e89ac2` |
 | `adr/README.md` | working | 7109 | `713a782f5411` |
 | `agent-contribution-contract.md` | constitution | 5204 | `9c2dad1195d2` |
 | `agent-mode-operator-guide.md` | working | 2778 | `25b258ab7bfe` |
@@ -414,7 +414,7 @@ Do not edit this file manually — regenerate instead.
 | `ops/security-hardening.md` | working | 11733 | `0a385c7dab82` |
 | `ops/troubleshooting.md` | working | 9127 | `4921b6d50f5c` |
 | `permission-and-approval-playbook.md` | working | 6560 | `813bc74bb156` |
-| `plans/adr-0032-m1-m6-work-plan-2026-06-13.md` | archive | 61230 | `54a436ad04b7` |
+| `plans/adr-0032-m1-m6-work-plan-2026-06-13.md` | archive | 61722 | `c6144278d07d` |
 | `plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md` | archive | 29099 | `7c4a4972cfeb` |
 | `plans/community-pain-points-response-plan-2026-06-05.md` | archive | 7276 | `571d010133ad` |
 | `plans/competitive-positioning-plan-2026-05-31.md` | archive | 8726 | `d16dfd2bdd99` |
 
@@ -177,9 +177,9 @@ consumers by M6.
 | ADR-0032-M2 (REDEFINED, taxonomy-only §16) | Every audit event the evidence bundle reads is typed in `RunEventType` and mapped both directions, so the M2-T001 reader surfaces it **from the audit JSONL** (mapper is sufficient; emit-site migration is NOT in M2 — it is deferred to the component milestones, §16). Covers routes, git-sandbox, skills, tests, undo, provenance, approval/tool-call decision events, cancelled/pending lifecycle. Pure additive; zero behavior change. (Old M2 "evidence/receipt fold" moved to M6 — §14.) |
 | ADR-0032-M3 | Plan gate is an interceptor using `PlanValidator`, landed parity-first (§13.3): a shadow-parity test asserting interceptor==inline per reason code went green before the inline branch was deleted in a separate commit. Denials and reason codes match current behavior; adversarial and first-hour tests remain green. |
 | ADR-0032-M4 (CLOSED — owner decisions B + B-analog, 2026-06-13) | **No gate moves to an interceptor; approval AND budget enforcement both STAY INLINE.** Both proved runtime-stateful on assessment, a poor fit for the pure-interceptor model. **Approval** (decision B): live JIT/session state, tool handler, auto-mode-swappable policy — every coupling gap was invisible to a unit parity test (`docs/work-log/m4-approval-sliceB-blocked-2026-06-13.md`). **Budget** (decision B-analog): it is three mechanisms — only the global cost cap (`_assert_cost_budget`) is stateless; the phase budget (live `phase_tracker`) and the warning ladder (`_budget_warning_levels_emitted` + `BudgetMonitor._emitted_levels`/`_prompted` dedup sets + an interactive `on_prompt` side-effect handler — the same `assert_allowed` shadow-coexistence trap that blocked approval) are stateful, and even the cost cap is enforced at two evolving-cost points per iteration that do not map 1:1 to events (`docs/work-log/m4-budget-stays-inline-2026-06-13.md`). Both gates' observability is already provided by M2 (their audit events — `tool_call_*`, `approval_*`, `budget_warning`, `budget_prompt`, `phase_budget_warning` — are typed + reader-surfaced); the M6 fold reads them without owning enforcement. Approval/budget behavior unchanged. **Net: plan gate (M3) is the sole governance gate moved to an interceptor.** |
-| ADR-0032-M5 (REVISED — observability-only, 2026-06-13) | **Hook OBSERVABILITY folds onto the spine; hook EXECUTION stays in the tool-dispatch layer.** Assessment found the planned "HookRegistry on spine" unsuitable for the same runtime-coupling reason as approval/budget: PreToolUse/PostToolUse run in `teaagent/tools.py::execute` and **mutate in-flight `arguments`/`result`** (the spine has no channel to ferry mutated payloads back to the dispatch site), and the 6 session-lifecycle hooks (SessionStart/End, UserPromptSubmit, PreCompact, Stop, SubagentStop) have **no production caller** — nothing to strangle; wiring them is feature work. Done: the 5 dispatch-layer hook audit events (`tool_hook_pre_mutation`, `tool_hook_pre_mutation_blocked`, `tool_hook_vetoed`, `tool_hook_post_mutation`, `tool_hook_post_failed`) are typed in `RunEventType` + mapped both directions, so the M2-T001 reader can surface them as typed RunEvents. **Correction (post-migration review F1):** this is typing + reader-visibility ONLY — it is NOT yet folded into evidence/receipts. No evidence extractor reads `tool_hook_*` and `RunEvidenceBundle` has no hooks field, so hook veto/mutation activity does not currently appear in any bundle/receipt. Surfacing it would need a new `RunEvidenceBundle` hooks field + extractor (backlog). Mapping/reader only; audit bytes unchanged; hook execution + mutation semantics unchanged. See `docs/work-log/m5-hooks-observability-only-2026-06-13.md`. |
+| ADR-0032-M5 (REVISED — observability-only, 2026-06-13) | **Hook OBSERVABILITY folds onto the spine; hook EXECUTION stays in the tool-dispatch layer.** Assessment found the planned "HookRegistry on spine" unsuitable for the same runtime-coupling reason as approval/budget: PreToolUse/PostToolUse run in `teaagent/tools.py::execute` and **mutate in-flight `arguments`/`result`** (the spine has no channel to ferry mutated payloads back to the dispatch site), and the 6 session-lifecycle hooks (SessionStart/End, UserPromptSubmit, PreCompact, Stop, SubagentStop) have **no production caller** — nothing to strangle; wiring them is feature work. Done: the 5 dispatch-layer hook audit events (`tool_hook_pre_mutation`, `tool_hook_pre_mutation_blocked`, `tool_hook_vetoed`, `tool_hook_post_mutation`, `tool_hook_post_failed`) are typed in `RunEventType` + mapped both directions, so the M2-T001 reader can surface them as typed RunEvents. **Update (review F1 RESOLVED, 2026-06-14):** initially this was typing + reader-visibility only; the "fold" claim was hollow because no extractor read `tool_hook_*`. Now fixed end-to-end: added `HookActivityRecord` + `RunEvidenceBundle.hook_activity` + `extract_hook_activity()`, wired into `_assemble_evidence_bundle`, so hook veto/mutation activity now appears in the bundle (and folds through the typed stream — the M5 typing was the prerequisite). Audit bytes unchanged; hook execution + mutation semantics unchanged. See `docs/work-log/m5-hooks-observability-only-2026-06-13.md`. |
 | ADR-0032-M6 (was M2 fold; corrected scope A) — **COMPLETE (FOLD-T001 + T002)** | Evidence and receipts are folded from the typed event stream and equal the legacy builder on success/failure/pending fixtures (cancelled once emitted in M2); the fold reads the full stream (no fallback flag, per Q1). **FOLD-T001**: `build_evidence_from_events()` parallel builder sharing `_assemble_evidence_bundle` with the legacy path (cannot drift; only the event *source* differs), parity-asserted (`tests/test_run_evidence.py::test_m6_fold_*`). Fixed a structural gap: the typed `RunEvent` was lossy — dropped top-level `created_at` (threaded into command/test/approval timestamps); added optional `RunEvent.created_at`, reader populates it. **FOLD-T002 (cutover DONE)**: `build_run_evidence_bundle` now routes production evidence THROUGH the typed reader + fold — the typed stream is the production path; the raw-dict assembly survives only as the shared helper (so the two cannot diverge). Suite-wide green (evidence/receipt/summary/5-min-proof/first-hour/adversarial + all bundle consumers, ~218 tests). **Finding: no synthetic receipt-only fixtures existed to retire** — the receipt/evidence path was already event-backed (`test_run_receipt.py` writes real RunStore events; `test_real_run_receipt_completeness_from_plan` validates a real run); direct `RunEvidenceBundle(...)` constructions are legitimate downstream-consumer/checker unit tests, not masking fixtures. The plan anticipated a gap that does not exist. Parity test re-anchored against `_assemble_evidence_bundle` (the raw-dict path) so it stays meaningful post-cutover. |
-| ADR-0032-M7 (was M6) — **COMPLETE as guard + document, 2026-06-13** | Original goal ("ContextBus + webhook consume the spine; delete inline eventing") **NOT done — it is a regression or vacuous.** Webhook is an `audit.add_sink` already fed transitively by the M1 spine→audit consumer; a *direct* spine consumer would see only the spine-emitted subset (coverage regression). ContextBus + integration `RunEventStream` are **unwired in production** (no callers) — nothing to migrate. The inline `audit.record` calls are the **complete event record** (read by evidence/receipts/webhook), not redundant eventing to delete. **Done instead (owner: guard + document):** `scripts/validate_event_spine_wiring.py` + `tests/test_event_spine_wiring.py` enforce the realized invariant — one typed lifecycle path (EventSpine→audit consumer), an allowlist of sanctioned event-delivery surfaces so a NEW competing lifecycle bus fails the gate, and taxonomy closure (no RunEventType orphaned from the audit record). Added as a pre-commit hook. ADR 0032 "Realized architecture (M1–M7)" section documents the outcome. **MIGRATION COMPLETE.** |
+| ADR-0032-M7 (was M6) — **COMPLETE as guard + document, 2026-06-13** | Original goal ("ContextBus + webhook consume the spine; delete inline eventing") **NOT done — it is a regression or vacuous.** Webhook is an `audit.add_sink` already fed transitively by the M1 spine→audit consumer; a *direct* spine consumer would see only the spine-emitted subset (coverage regression). ContextBus + integration `RunEventStream` are **unwired in production** (no callers) — nothing to migrate. The inline `audit.record` calls are the **complete event record** (read by evidence/receipts/webhook), not redundant eventing to delete. **Done instead (owner: guard + document):** `scripts/validate_event_spine_wiring.py` + `tests/test_event_spine_wiring.py` enforce the realized invariant with three checks — (A) taxonomy closure (no RunEventType orphaned from the audit record); (B) no orphaned event bus (allowlist of sanctioned surfaces; high-signal methods **plus** the subscribe+emit pub/sub pair so the RunEventStream shape is caught — review F3); (C) evidence-extractor type coverage (AST-discovers the event_type literals run_evidence/proof_of_use read and asserts each is typed, so the M6 cutover can't silently drop evidence — review F2). Added as a pre-commit hook. ADR 0032 "Realized architecture (M1–M7)" section documents the outcome. **MIGRATION COMPLETE.** |
 
 ## 8. Task Plan
 
@@ -725,15 +725,17 @@ commit once Slice A is green.
 
 ### ADR32-M6-T003: Orphaned Eventing Validator [DONE]
 
-> **DONE (2026-06-13).** `scripts/validate_event_spine_wiring.py` +
-> `tests/test_event_spine_wiring.py`. Two checks: (A) taxonomy closure — every
-> `RunEventType` maps losslessly to the audit record (no orphaned typed event);
-> (B) no orphaned event bus — an AST scan for high-signal lifecycle-event methods
-> (`register_consumer`/`register_interceptor`/`add_sink`/`on_event`/
-> `publish_delta`/`subscribe_deltas`; generic `publish`/`emit` excluded to avoid
-> noise) must match a curated allowlist of sanctioned surfaces, so a new
-> competing bus fails. Seeded-bad-fixture tests included. Added as the
-> `check-event-spine-wiring` pre-commit hook.
+> **DONE (2026-06-13; checks extended 2026-06-14 per review F2/F3).**
+> `scripts/validate_event_spine_wiring.py` + `tests/test_event_spine_wiring.py`.
+> Three checks: (A) taxonomy closure — every `RunEventType` maps losslessly to
+> the audit record; (B) no orphaned event bus — AST scan for high-signal
+> lifecycle-event methods (`register_consumer`/`register_interceptor`/`add_sink`/
+> `on_event`/`publish_delta`/`subscribe_deltas`) **plus the `subscribe`+`emit`
+> pub/sub pair** (F3) must match a curated allowlist; (C) evidence-extractor type
+> coverage — AST-discovers the `event_type` literals `run_evidence`/`proof_of_use`
+> read (incl. annotated module-level frozensets) and asserts each is typed, so
+> the M6 cutover can't silently drop evidence (F2). Seeded-bad-fixture tests
+> included. Added as the `check-event-spine-wiring` pre-commit hook.
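
Check (C) above could look roughly like the following sketch. It is an illustration under stated assumptions, not the validator's real implementation: the helper names, the way `event_type` references are recognized, and the sample extractor source are all hypothetical. It resolves `==`/`in` comparisons against string literals and module-level `frozenset`/`set` constants (including annotated ones), as the note describes.

```python
import ast


def _strings_in(node: ast.AST, constants: dict[str, set[str]]) -> set[str]:
    """Resolve an expression to the string literals it denotes (empty if unknown)."""
    if isinstance(node, ast.Constant) and isinstance(node.value, str):
        return {node.value}
    if isinstance(node, (ast.Set, ast.Tuple, ast.List)):
        return {e.value for e in node.elts
                if isinstance(e, ast.Constant) and isinstance(e.value, str)}
    if (isinstance(node, ast.Call) and isinstance(node.func, ast.Name)
            and node.func.id in {"frozenset", "set"} and node.args):
        return _strings_in(node.args[0], constants)
    if isinstance(node, ast.Name):
        return constants.get(node.id, set())
    return set()


def _mentions_event_type(node: ast.AST) -> bool:
    """True for `event_type`, `x.event_type`, `x["event_type"]`, `x.get("event_type")`."""
    for sub in ast.walk(node):
        if isinstance(sub, ast.Name) and sub.id == "event_type":
            return True
        if isinstance(sub, ast.Attribute) and sub.attr == "event_type":
            return True
        if isinstance(sub, ast.Constant) and sub.value == "event_type":
            return True
    return False


def discover_event_types(source: str) -> set[str]:
    """Collect every literal that the source compares an event_type against."""
    tree = ast.parse(source)
    constants: dict[str, set[str]] = {}
    for stmt in tree.body:  # module-level constants only
        if (isinstance(stmt, ast.Assign) and len(stmt.targets) == 1
                and isinstance(stmt.targets[0], ast.Name)):
            constants[stmt.targets[0].id] = _strings_in(stmt.value, constants)
        elif (isinstance(stmt, ast.AnnAssign) and isinstance(stmt.target, ast.Name)
                and stmt.value is not None):
            constants[stmt.target.id] = _strings_in(stmt.value, constants)
    found: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Compare) and _mentions_event_type(node.left):
            for op, comparator in zip(node.ops, node.comparators):
                if isinstance(op, (ast.Eq, ast.In)):
                    found |= _strings_in(comparator, constants)
    return found


# Hypothetical extractor source; "made_up_event" stands in for an untyped literal.
SAMPLE = '''
_HOOK_TYPES: frozenset[str] = frozenset({"tool_hook_vetoed", "tool_hook_post_failed"})

def extract(events):
    for event in events:
        if event.get("event_type") == "budget_warning":
            yield event
        elif event["event_type"] in _HOOK_TYPES:
            yield event
        elif event["event_type"] == "made_up_event":
            yield event
'''

# Stand-in for the set of RunEventType values (assumed, abbreviated).
TYPED = {"tool_hook_vetoed", "tool_hook_post_failed", "budget_warning"}
untyped = sorted(discover_event_types(SAMPLE) - TYPED)
print(untyped)
```

A gate built this way would fail whenever `untyped` is non-empty, so an extractor that starts reading a new audit event type forces that type to be added to the taxonomy first. Dynamic lookups (computed names, values built at runtime) are not resolved, consistent with the stated scope.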
 
 - Goal: prove there are no competing lifecycle event systems left after M6.
 - Scope: static validation over audit strings, HookRegistry emissions,