From c4f916f52ce567d65883b479d483190a0fceeaff Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Alexander=20K=C3=B6lnberger?= <159939812+ProfRandom92@users.noreply.github.com> Date: Wed, 20 May 2026 08:28:02 -0700 Subject: [PATCH] Align README with MCP trace replay artifacts --- README.md | 30 +++++++++++++++++++++++++++--- 1 file changed, 27 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 0c5095d..d8f0f17 100644 --- a/README.md +++ b/README.md @@ -5,7 +5,7 @@

CompText V7

- Deterministic replay-survivability validation for compressed operational state in long-horizon AI agents. + Deterministic replay-integrity validation for compressed MCP-style operational traces.

@@ -27,7 +27,7 @@ ยท Failure Taxonomy

-CompText V7 does not ask whether a compressed summary sounds good. It asks whether the compressed state can still replay the operational facts required to continue the work. +CompText V7 validates whether compressed operational commitments survive deterministic replay reconstruction in MCP-style agent workflows. --- @@ -78,9 +78,26 @@ flowchart LR --- +## Operational commitments + +CompText V7 validates whether deterministic replay reconstruction preserves: + +- evidence +- constraints +- blockers +- dependencies +- recovery paths +- tool order +- capability boundaries +- governance/policy gates + +The `mcp_trace_replay` fixture family validates deterministic replay safety for tool order, validation-before-action, dependency chains, recovery paths, and capability boundaries. Registered contracts: `tool_call_order_preserved`, `validation_before_unsafe_action`, `dependency_chain_preserved`, `recovery_path_available`, `capability_boundary_respected`. + +--- + ## Current fixture-bound signal -- Three manifest-registered operational fixture families. +- Four manifest-registered operational fixture families. - Standard levels: `baseline`, `mild`, `moderate`, `severe`. - Deterministic evaluation mode. - Exact rational scoring. @@ -157,6 +174,8 @@ flowchart LR | `artifacts/layered_admissibility_results.json` | Layered admissibility outputs. | | `artifacts/multi_family_admissibility_results.json` | Multi-family deterministic aggregates. | | `artifacts/multi_family_admissibility_curves.svg` | Deterministic degradation curve rendering. | +| `artifacts/mcp_trace_replay_results.json` | Deterministic MCP trace replay contract outcomes. | +| `artifacts/replay_semantic_integrity_results.json` | Deterministic replay semantic integrity outcomes. | | `docs/benchmarks/multi_family_admissibility_benchmark.md` | Benchmark method and interpretation boundaries. | | `docs/failure_taxonomy.md` | Failure label documentation. | @@ -181,6 +200,7 @@ pytest tests/test_paper_replay_bench.py tests/test_agent_trace_replay.py -q - `coding_workflow_pr_review` - `incident_response_page_triage` - `cross_domain_operational_dependency_workflow` +- `mcp_trace_replay` ```mermaid flowchart LR @@ -196,6 +216,10 @@ flowchart LR C --> L2 C --> L3 C --> L4 + D["mcp_trace_replay"] --> L1 + D --> L2 + D --> L3 + D --> L4 L1 --> M["manifest registration"] L2 --> M L3 --> M