Phase 0 Priority Work Items

2026-06-04

Purpose

This work log turns the cross-review and critical questioning documents into concrete tasks. It intentionally favors trust repair over feature expansion.

Related implementation/reflection record: parallel-phase-0-implementation-report-2026-06-04.md.

Work Items

ID	Priority	Task	Evidence	Acceptance criteria
P0-TR-001	P0	Gate `allow_all_destructive` behind explicit full-access semantics	Historical behavior allowed `allow_all_destructive` in prompt mode	Prompt mode with `allow_all_destructive=True` fails; legitimate bypass callers must use an explicit broad permission mode and tests cover the contract
P0-TR-002	P0	Rename or consolidate runner-local `ApprovalManager`	Historical duplicate name; current code has `RunnerApprovalCoordinator`	Only one canonical approval authority name remains; runner helper name reflects workflow role
P0-TR-003	P0	Break policy/approval lazy reverse import	Historical reverse import risk; current import-order tests pass	Shared normalization helper extracted or no reverse import remains; import-order smoke test added
P0-TR-004	P0	Make memory canonical source structural	Historical split; current `memory_legacy.py` re-exports `memory.catalog`	One runtime implementation remains, or duplicate is quarantined with tests proving import target
P0-TR-005	P0	Add coverage omit ledger	16 omit patterns in `pyproject.toml`; ledger now has smoke candidates	Each omit has owner, reason, risk, expected return milestone, smoke-test candidate, and docs validator coverage
P0-TR-006	P0	Add optional-extra dependency audit policy	`google-adk` optional tree can pull vulnerable `fastapi` / `starlette` transitive deps	Security docs and workflow distinguish base audit, dev/lockfile audit, and optional-extra audit cadence
P0-TR-007	P0	Assign or close proposed ADRs	Source check found no current Proposed ADRs; stale docs overstated six proposed ADRs	ADR index and ADR 0025 reflect closed/proper current states
P1-TR-008	P1	Add at least one smoke test per coverage-omitted package	TUI, tournament, validation, WASM and other paths are omitted	Smoke test exists or explicit non-testable rationale exists
P1-TR-009	P1	Build generated docs front door	435 Markdown files make discovery hard	`docs/INDEX.md` or equivalent links current status, risk, roadmap, tickets, ADRs, and historical evidence
P1-TR-010	P1	Calibrate security severity levels	Review says Critical, module docs say High for similar bypasses	Shared severity rubric exists and high-risk docs are updated
P1-TR-011	P1	Separate "behavior preservation" tests from "safety intent" tests	Current tests can preserve risky bypass behavior	Security tests assert desired contract, not only legacy behavior
P1-TR-012	P1	Refresh dependency audit report after security workflow change	Dependency report now has supersession note and scope-refresh successor	Report states base/dev/optional audit surfaces separately

Recommended Execution Order

P0-TR-001: approval bypass semantics.
P0-TR-002: approval authority naming.
P0-TR-003: policy/approval import boundary.
P0-TR-004: memory catalog canonicalization.
P0-TR-005 and P0-TR-006: governance ledgers.
P0-TR-007: ADR ownership cleanup.
P1 items only after at least the first four P0 items have tests.

Human Review Gates

Human review should be required before:

Removing or redefining danger-full-access.
Changing default permission mode behavior.
Deleting memory/catalog.py or memory_legacy.py.
Removing coverage omit entries without replacement tests.
Adding a broad dependency override for security scan convenience.

Done Means

Phase 0 trust repair is done only when the following are all true:

A destructive operation cannot bypass approval unless the user explicitly selected and acknowledged the relevant full-access semantics.
Approval code has one obvious authority path.
Memory code has one obvious authority path.
Docs expose the current truth without requiring search through dated layers.
Security CI distinguishes base package safety from optional runtime safety.
Tests fail when the trust contract is weakened.

Status Log

2026-06-04 — P0-TR-001 DONE. allow_all_destructive is now inert in non-full-access modes (notably prompt) even when full_access_acknowledged=True is set. The acknowledgement flag records ceremony metadata; it does not grant authority by itself. The bypass at PermissionModeEnforcer.check now fails safe and raises ToolPermissionError with reason code DenialReasonCode.FULL_ACCESS_NOT_ACKNOWLEDGED. The two legitimate callers promote explicitly: auto mode (runner/_auto_mode_manager.py, opt-in) returns a danger-full-access policy scoped by AutoModeGuard, and chat (chat_agent.py) maps the explicit --allow-destructive user flag to danger-full-access. No CLI/UX change for existing --allow-destructive users. Covered by tests/test_full_access_gate.py (enforcer, policy/manager, and auto-mode layers) plus a regression guard in tests/regression/test_contract_approval.py::test_allow_all_destructive_without_ack_blocks. Current verification: 3273 passed, 141 skipped, 18 subtests passed, 0 failed on Python 3.12.8.
2026-06-04 — P0-TR-002 DONE. The runner-local approval workflow helper is now RunnerApprovalCoordinator, while the canonical authority remains teaagent.approval_manager.ApprovalManager. rg 'class ApprovalManager' teaagent tests returns only the canonical runtime class. Regression guard: tests/test_circular_imports.py::test_runner_approval_helper_is_not_named_approval_manager.
2026-06-04 — P0-TR-003 VERIFY/CLOSE. teaagent.approval_manager no longer imports teaagent.policy; import-order smoke tests cover policy-first and approval-manager-first load order. Remaining work is to keep future normalization helpers out of either side of the policy boundary.
2026-06-04 — P0-TR-004 DONE. teaagent.memory.catalog is the canonical implementation. teaagent.memory and teaagent.memory_legacy both re-export the same classes for compatibility. Regression guard: tests/test_circular_imports.py::test_memory_catalog_canonical_export_path.
2026-06-04 — P0-TR-005 DONE. docs/governance/coverage-omit-ledger.md lists all 16 [tool.coverage.run].omit patterns with owner, reason, risk, return milestone, and smoke-test candidate. validate_docs_consistency.py now fails when the ledger and pyproject.toml drift.
2026-06-04 — P0-TR-006 DONE. The dependency audit policy and security workflow now separate base, dev/lockfile, and optional-extra audit lanes. The base PR gate uses uv export --no-dev --no-emit-project instead of unscoped environment auditing. Optional extras run in a non-blocking scheduled/manual matrix and remain a release-review gate.
2026-06-04 — P0-TR-007 DONE. ADR status review found no current Proposed ADRs in docs/adr/README.md; ADR 0010/0012/0014/0015/0017/0018 are closed or accepted, and ADR 0025 now reflects implemented REPL/TUI controller unification.
2026-06-04 — P1-TR-012 DONE. The dependency audit report now has a supersession note and points to docs/security/dependency-audit-scope-refresh-2026-06-04.md, which separates base, dev/lockfile, and optional-extra findings.
2026-06-04 — P1-TR-008 DONE. All 16 [tool.coverage.run].omit patterns in pyproject.toml have a matching row in docs/governance/coverage-omit-ledger.md with at least one existing smoke-test candidate; scripts/validate_docs_consistency.py reports "Docs consistency check passed." Representative subset green on Python 3.12.8 (tests/test_workspace_tools.py tests/test_git_tools.py tests/test_validation.py tests/test_wasm_runtime.py tests/test_tsb_format.py → 92 passed).
2026-06-04 — P1-TR-009 DONE. docs/INDEX.md is the documentation front door and links current status, risk register, roadmap, tickets, ADR index (adr/README.md), and historical evidence/review. All in-doc links resolve; the ADR index link was the one missing required category and is now present.
2026-06-04 — P1-TR-010 DONE. docs/security/severity-calibration-rubric.md defines the shared Critical/High/Medium/Low scale and the calibrated mapping (DANGER_FULL_ACCESS, allow_all_destructive bypass, and audit-chain forgeability are Critical). The approval-bypass class is now labeled consistently across high-risk docs; docs/modules/approval_manager/risks.md (APR-R-003) and docs/modules/governance/risks.md (GOV-R-001) now carry an explicit Severity: Critical label that points back to the rubric. No remaining Critical-vs-High contradiction for the bypass class.
2026-06-04 — P1-TR-011 DONE. Safety-intent tests are separated from behavior-preservation tests. The safe contract is asserted directly: tests/regression/test_contract_approval.py::test_allow_all_destructive_without_ack_blocks and tests/test_full_access_gate.py::TestPolicyGate::test_allow_all_without_ack_raises_with_reason_code require reason code FULL_ACCESS_NOT_ACKNOWLEDGED, and the behavior-preservation bypass test now requires explicit full_access_acknowledged=True. Green on Python 3.12.8 (tests/test_full_access_gate.py tests/regression/test_contract_approval.py → 28 passed).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Phase 0 Priority Work Items

2026-06-04

Purpose

Work Items

Recommended Execution Order

Human Review Gates

Done Means

Status Log

Uh oh!

FilesExpand file tree

phase-0-priority-work-items-2026-06-04.md

Latest commit

History

phase-0-priority-work-items-2026-06-04.md

File metadata and controls

Phase 0 Priority Work Items

2026-06-04

Purpose

Work Items

Recommended Execution Order

Human Review Gates

Done Means

Status Log