TeaAgent Threat Model

Last updated: 2026-05-29 (3rd edition — post comprehensive audit)

This document maps threats to mitigations and verification. It complements tool-authoring.md and architecture.md.

Threat	Impact	Mitigation (current)	Verification	Gap / notes
Prompt injection causing destructive tools	High	Permission modes; `ApprovalPolicy`; policy-as-code deny rules	`test_policy_as_code_flow.py`, `tests/policy/test_permission_matrix.py`	Model may still request bad tools; harness blocks execution
Mislabelled tool annotations (`read_only` on write tool)	High	`teaagent tool lint`; runtime policy on destructive flag	`tests/test_tranche_b_governance.py`, `tests/test_governance_fuzz.py`	Plugin handlers not sandboxed by default — trust boundary
Path traversal / symlink escape	High	Workspace path resolution; protected paths	`test_contract_policy.py`, `test_protected_paths_flow.py`	Fuzz coverage ongoing
Shell mutation in workspace-write mode	High	`ApprovalPolicy` blocks non-file destructive tools in workspace-write	`tests/policy/test_permission_matrix.py`	Dangerous flag lists in shell classifier — maintain with tests
Shell command obfuscation bypass	High	Multi-pass `_normalize_shell_arg` (quotes, escapes, backticks, `$()`, shlex); list-argument normalization	`tests/test_policy.py` (MultiSigQuorumTests)	Catches `rm -r"f" /prod`, backtick injection, list-based bypass
Secret leakage in audit logs	Medium	Audit redaction keys and truncation	`test_audit_chain_integrity_flow.py`	Over-redaction reduces debuggability — export tiers future work
AuditLevel.L3 plaintext storage	Medium	L3 docstring states no encryption at rest; use L2/default for shared disks	`tests/test_audit_levels.py` (when added), operator docs	Optional `audit-encryption` extra planned for compliance deployments
Bearer token file storage	Medium	Tokens hashed at load; operational guidance: chmod 600, store outside repo	`http-surface-auth.md`, operator docs	Encryption at rest not implemented; treat token files as sensitive
Code Mode fork backend on untrusted input	Medium	`ChildProcessCodeModeBackend(trusted_only=False)` raises; Docker backend for untrusted code	`tests/test_code_mode_trusted_only.py`	Fork shares FDs with parent — document trust boundary
Audit log tampering	Medium	Hash chain (`audit verify`); fsync	`test_audit_chain_integrity_flow.py`	Local-only unless signed export
Plugin supply-chain execution	High	Plugin verify/install gates; entry-point audit	`test_plugin_install_security_flow.py`	Capability manifest formalization in progress
MCP server tool explosion / exfiltration	High	MCP tool filter hook; HTTP auth for remote MCP	`test_remote_mcp_consumption_flow.py`	MCP trust CLI implemented — per-server defaults
Memory poisoning / failure-card bias	Medium	Failure cards; warning injection; automated invalidation rules	`test_memory_auto_curation_flow.py`, `tests/test_governance_fuzz.py`	TTL/confidence schema enforced with conservative defaults
Subagent privilege escalation	High	Per-agent JIT approval (`_agent_approved_tools`); subagent defs; lineage; isolation modes; centralized approval queue	`test_subagent_lineage_flow.py`, `test_approval_is_per_agent_not_global`, worktree/container isolation flows, `tests/test_governance_fuzz.py`	Global mutation fixed — approval now scoped per-agent
Parallel branch contamination	Medium	Git sandbox branches; worktree isolation	`test_subagent_worktree_isolation_flow.py`	Main-branch write blocked in tournament — verify per release
Unplanned destructive writes	High	Strict plan-before-write enforcement in workspace-write mode	`tests/test_governance_fuzz.py`, `tests/test_tranche_b_governance.py`	`--skip-plan-check` override available for power users
Provider response schema drift	Low	JSON schema for model decisions	`test_live_provider_conformance_flow.py`	Provider-specific quirks remain
Unbounded run cost	Medium	`RunBudget`; iteration/tool/cost caps	`test_p0_harness.py`, `test_p0_slo_flow.py`	User must configure caps
JIT approval server unresponsive during wait	High	Async `_wait_for_approval` using `asyncio.Event` + `asyncio.wait_for`; SSE server remains responsive	`tests/test_phase6_jit_server.py`	Fixed synchronous `time.sleep` spin-lock blocking event loop
Context Bus SQLite lock contention / transaction leaks	Medium	Per-thread SQLite connections (`threading.local`); `timeout=5.0` on connect; WAL pragmas on each new connection; `_execute_with_retry` with exponential backoff (5 retries) + generation-based reconnect (per-thread only); explicit `conn.rollback()` on write failure; `cleanup_old_deltas` scoped to `workflow_id`	`tests/test_phase5_context_bus.py` (incl. parallel publish + workflow-scoped cleanup), `tests/test_remediation_p1_p2.py`	—
Federated sync state corruption on crash	Medium	`atomic_write_text` + file lock on `federated_sync_state.json`; lock on pending changes	`tests/test_federated_sync.py`	File-based multi-sig quorum still experimental
JIT approval server race on approve/reject	Medium	`threading.Lock` on `_requests` / `_pending_events`	`tests/test_phase5_jit_approval_server.py`	Approve from thread without running event loop still drops SSE broadcast
Asyncio event loop starvation from synchronous P2P approval polling	High	`collect_approval_signatures` is `async def` with `asyncio.sleep`; blocking I/O uses `run_in_executor`; HTTP WAN path uses `SignatureRelayClient` + bearer auth	`tests/test_federated_sync.py`, `tests/test_signature_relay.py`	—
Shell normalization bypass via brace expansion / process substitution	High	Multi-pass `_normalize_shell_arg` now handles `{a,b}` expansion, `<()` process substitution, and non-string/non-list fallback	`tests/test_policy.py`	Catches `/pr{od,oduction}`, `<(echo /prod)`, dict-type command args
Protected directory bypass via alternate write tools	High	`workspace_write_` tool pattern + `.git` argument pattern covers all write tools and subdirectory contents	`tests/test_policy.py`, `tests/test_file_policy.py`	Previously only `workspace_write_file` was covered
Swarm hang / undetected thread deadlock	High	`ThreadPoolExecutor.as_completed(timeout=...)` with partial result collection; `Subagent` tracks `is_running`/`last_heartbeat`; `_heartbeat_monitor_loop` uses thread-ref liveness instead of defunct PID-based `is_process_alive`; heartbeat hangs merged into swarm `results`	`tests/test_swarm.py`, `tests/test_remediation_p1_p2.py`	Previously heartbeat monitor checked parent PID (always alive) — now detects actual thread hangs
Git stash stack corruption in parallel sandboxes	Critical	`stash_save` returns actual stash reflog selector; `stash_pop` accepts specific ref	`tests/test_sandbox.py`	Previously hardcoded `stash@{0}` caused cross-agent stash confusion
Workflow self-healing infinite recursion	High	`_execute_step` accepts `current_attempt` parameter preserved across recursive re-execution; abort guard checks attempts against max before proceeding	`tests/test_phase5_workflow_engine.py`	Previously `self_healing_attempts` reset to 0 on new `StepExecution` — now passed through recursion chain
Workflow strict validation rollback never executed	High	`execute_workflow` integrates `UndoJournal` + `AuditLogger`; checks `result.requires_rollback` and calls `journal.restore()` on strict validation failure	`tests/test_remediation_p1_p2.py` (`WorkflowRollbackTests`), `tests/test_phase5_workflow_engine.py`	Previously `requires_rollback` flag set but never consumed — now triggers full workspace undo
NFS / multi-writer JSONL corruption	High	Single-writer per workspace; `fcntl` + `atomic_write_text` on supported local FS; SQLite for Context Bus / OAuth	ADR 0008, `teaagent selftest`	Do not share `.teaagent/runs/*.jsonl` across concurrent hosts on NFS without DB migration

Trust Boundaries

Trusted:   TeaAgent harness (Runner, Policy, Audit, built-in workspace tools)
Reviewed:  Project plugins, MCP servers, skills (manifest + human enable)
Untrusted: Model output, external MCP payloads, arbitrary plugin handlers

Operator Checklist (High-Risk Repos)

Start with --permission-mode read-only.
Run teaagent tool lint after adding plugins.
Use --require-plan + --from-plan for writes (now enforced by default in workspace-write mode).
Use --skip-plan-check only when you understand the security implications.
Run teaagent memory failures auto-invalidate periodically to clean stale failure cards.
Never use danger-full-access outside an isolated sandbox.
Review teaagent runs export <run_id> after each autonomous edit session.
Check CI governance gate results before merging changes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TeaAgent Threat Model

Trust Boundaries

Operator Checklist (High-Risk Repos)

Uh oh!

FilesExpand file tree

threat-model.md

Latest commit

History

threat-model.md

File metadata and controls

TeaAgent Threat Model

Trust Boundaries

Operator Checklist (High-Risk Repos)