Skip to content

Commit dac2df2

Browse files
ProfRandom92ProfRandom92
andauthored
Align project docs with replay validation scope (#216)
* Align project docs with replay validation scope * Address PR review comments on project source docs --------- Co-authored-by: ProfRandom92 <akoelnberger92@gmail.com>
1 parent 9f785b3 commit dac2df2

3 files changed

Lines changed: 36 additions & 22 deletions

File tree

docs/project_source.md

Lines changed: 25 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,22 @@
11
# Project Source: CompText V7
22

33
## Canonical Positioning
4-
CompTextv7 is the deterministic replay-integrity layer for compressed operational agent traces.”
4+
CompText V7 is a deterministic replay-validation prototype for compact operational agent/MCP traces, with a KVTC-V7 technical-log compression prototype.”
55

66
## Strategic Category
7-
CompTextv7 is a deterministic replay-integrity validation research prototype.
7+
CompText V7 is a deterministic replay-validation research prototype.
88

99
## Core Thesis
1010
The core project question is:
1111

12-
“Can a compressed agent state still reproduce a safe operational trajectory?”
12+
“Can compact operational trace state still reproduce a safe replay trajectory?”
1313

1414
The governing research direction is:
1515

16-
“Deterministic Replay Integrity Validation for Compressed Operational Agent Traces
16+
“Deterministic replay validation for compact operational agent/MCP traces
1717

1818
## Research Alignment Principle
19-
Every accepted change must strengthen deterministic, offline, reproducible replay-integrity validation for compressed operational traces, or be strictly maintenance-only.
19+
Every accepted change must strengthen deterministic, offline, reproducible replay validation for compact operational agent/MCP traces, or be strictly maintenance-only.
2020

2121
## Current Strategic State
2222
The project remains:
@@ -29,6 +29,15 @@ The project remains:
2929
- audit-oriented
3030

3131
## Current Implementation State
32+
Current implemented surfaces:
33+
- curated agent trace fixtures in `tests/fixtures/agent_traces/`
34+
- deterministic replay runner in `tests/utils/agent_trace_replay_runner.py`
35+
- MCP replay payload layer in `src/comptext_v7/mcp/`
36+
- evidence survival helpers in `src/validation/evidence.py`
37+
- replay failure labels in `src/validation/replay_failure_classifier.py`
38+
- committed artifacts such as `artifacts/agent_trace_replay_results.json`
39+
- KVTC-V7 technical-log compression prototype in `src/core/kvtc_v7.py`
40+
3241
The `mcp_trace_replay` fixture family is hardened via explicit capability-boundary structures, deterministic degraded variants, and `failure_label_on_violation` handling.
3342

3443
A deterministic artifact evidence index maps committed evidence artifacts to their generators, evidence categories, fixture-family coverage, manifest alignment, deterministic evaluation status, LLM-free status, and external-API-free status.
@@ -37,11 +46,11 @@ A deterministic artifact evidence index maps committed evidence artifacts to the
3746
See `NON_GOALS.md` in the repository root for the full list of prohibited expansion paths and the hard scope test.
3847

3948
## Hard Scope Rule
40-
A proposal is out of scope unless it directly improves deterministic replay-integrity validation for compressed operational traces without introducing probabilistic or platform-expansion behavior.
49+
A proposal is out of scope unless it directly improves deterministic replay validation for compact operational agent/MCP traces without introducing probabilistic or platform-expansion behavior.
4150

4251
## Scope Evolution & RFC Process
4352
Scope-expanding proposals require an RFC before implementation. RFCs must:
44-
1. define the exact replay-integrity gain,
53+
1. define the exact replay-validation gain,
4554
2. prove deterministic and offline reproducibility,
4655
3. show no hidden orchestration/platform expansion,
4756
4. describe artifact and failure-taxonomy implications,
@@ -50,21 +59,21 @@ Scope-expanding proposals require an RFC before implementation. RFCs must:
5059
Default decision for scope-expanding proposals is rejection until the RFC is accepted.
5160

5261
## Correct Abstraction Level
53-
CompTextv7 focuses on replay-integrity contracts, trace-derived artifacts, fixture families, deterministic validators, and auditable evidence outputs. It does not absorb adjacent runtime or product-platform responsibilities.
62+
CompText V7 focuses on replay contracts, trace-derived artifacts, fixture families, deterministic validators, and auditable evidence outputs. It does not absorb adjacent runtime or product-platform responsibilities.
5463

5564
## Preferred Terminology
5665
Prefer:
57-
- deterministic replay-integrity validation
58-
- compressed operational traces
66+
- deterministic replay validation
67+
- compact operational agent/MCP traces
5968
- artifact evidence
6069
- failure taxonomy and labels
6170
- capability boundary
6271
- offline reproducibility
6372

64-
Avoid vague or marketing abstractions that hide deterministic constraints.
73+
Avoid vague or marketing abstractions that hide deterministic constraints. Do not frame the repository as industrial, enterprise-ready, production-ready, certification-grade, showcase-first, or a cognitive fabric.
6574

6675
## Roadmap From Here
67-
1. Expand deterministic replay-integrity fixture coverage.
76+
1. Expand deterministic replay-validation fixture coverage.
6877
2. Strengthen artifact-evidence traceability and reproducibility checks.
6978
3. Improve contract-level validation clarity and failure classification precision.
7079
4. Add narrowly scoped capabilities only via approved RFC.
@@ -73,11 +82,11 @@ Avoid vague or marketing abstractions that hide deterministic constraints.
7382
- Documentation-only governance changes should stay small and focused.
7483
- Core logic changes must include targeted tests.
7584
- No merge-ready state until CI is green and review threads are resolved/outdated.
76-
- Keep docs, showcase, and core refactors separated.
85+
- Keep unrelated docs, demos, and core refactors separated.
7786

7887
## PR Template Additions
7988
PR descriptions should explicitly confirm:
80-
- deterministic replay-integrity alignment,
89+
- deterministic replay-validation alignment,
8190
- non-goals preserved,
8291
- artifact regeneration expectations,
8392
- evidence-index impact (updated or unchanged),
@@ -93,10 +102,10 @@ PR descriptions should explicitly confirm:
93102
Governance is conservative, boundary-enforcing, and evidence-led. CompTextv7 accepts incremental improvements over broad platform ambitions.
94103

95104
## Current Priority Stack
96-
1. Deterministic replay-integrity correctness.
105+
1. Deterministic replay-validation correctness.
97106
2. Artifact evidence traceability.
98107
3. Failure-label clarity and consistency.
99108
4. Scope-bound documentation and review discipline.
100109

101110
## One-Sentence Rule
102-
If a change does not directly improve deterministic replay-integrity validation for compressed operational traces, it should not merge into core.
111+
If a change does not directly improve deterministic replay validation for compact operational agent/MCP traces, it should not merge into core.

docs/validation.md

Lines changed: 10 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -21,10 +21,6 @@ convenience. It does not define workspaces, dependencies, or a root Node app.
2121
The dashboard remains the only Node application in this repository, with its
2222
dependency management in `dashboard/app`.
2323

24-
The embedded showcase was removed from Comptextv7 and is maintained separately
25-
in `ProfRandom92/comptext-v7-monaco-showcase`. Historical Comptextv7 showcase
26-
files are preserved on the `archive/showcase-before-extraction` branch.
27-
2824
Root npm scripts use `npm --prefix` to delegate to the dashboard directory and
2925
use `pytest` for Python validation. No root `node_modules` directory or root npm
3026
dependencies are required for the wrapper itself.
@@ -67,7 +63,7 @@ npm run smoke:release-health
6763
Use these commands for dashboard TypeScript changes, release-health UI changes,
6864
and `dashboard/app/src/core/foundation/` modules.
6965

70-
## Python validation from the repository root
66+
## Python replay validation from the repository root
7167

7268
Run Python tests from the repository root:
7369

@@ -80,6 +76,12 @@ pytest tests/test_paper_replay_bench.py tests/test_agent_trace_replay.py tests/t
8076
The focused replay command validates the deterministic paper replay, agent trace
8177
replay, and replay continuity surfaces without changing benchmark logic.
8278

79+
Agent trace replay is fixture-bound: curated traces live in
80+
`tests/fixtures/agent_traces/`, the deterministic runner is
81+
`tests/utils/agent_trace_replay_runner.py`, and committed replay output is stored
82+
in `artifacts/agent_trace_replay_results.json`. Validation is local and does not
83+
use embeddings, vector databases, LLM judges, or external APIs.
84+
8385
Install the Python test dependency set:
8486

8587
```bash
@@ -93,3 +95,6 @@ python tests/utils/paper_replay_runner.py
9395
python tests/utils/agent_trace_replay_runner.py
9496
python benchmarks/run_replay_continuity.py --iterations 250 --output-dir reports/replay_continuity
9597
```
98+
99+
The KVTC-V7 technical-log compressor in `src/core/kvtc_v7.py` is a deterministic
100+
prototype, not a production-readiness or certification claim.

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
[project]
22
name = "comptextv7"
33
version = "0.1.0"
4-
description = "CompText V7 Cognitive Fabric prototype"
4+
description = "Deterministic replay-validation prototype for compact operational agent/MCP traces"
55
requires-python = ">=3.11"
66
license = "Apache-2.0"
77
license-files = ["LICENSE"]

0 commit comments

Comments
 (0)