ProfRandom92
diff --git a/‎PROJEKT.md‎
Lines changed: 3 additions & 3 deletions b/‎PROJEKT.md‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎docs/VALIDATE_BENCHMARK.md‎
Lines changed: 50 additions & 0 deletions b/‎docs/VALIDATE_BENCHMARK.md‎
Lines changed: 50 additions & 0 deletions
diff --git a/‎reports/phase_9_status.md‎
Lines changed: 38 additions & 0 deletions b/‎reports/phase_9_status.md‎
Lines changed: 38 additions & 0 deletions
@@ -21,8 +21,8 @@ CompText CLI is an experimental terminal context client for building determinist
 ```text
 CURRENT_PHASE: 9
 CURRENT_TASK: Validate and Benchmark
-LAST_GREEN_PHASE: 8
-STATUS: active
+LAST_GREEN_PHASE: 9
+STATUS: complete
 ```
 
 ### Autonomy Contract
@@ -83,7 +83,7 @@ git push
 | **Phase 6** | Apply Gate | Implement `ctxt apply` to confirm/apply changes and run verification | **COMPLETE** |
 | **Phase 7** | Provider Config Layer | Support dynamic provider profile switching and configurations | **COMPLETE** |
 | **Phase 8** | OpenAI-Compatible Adapter | Implement OpenAI adapter skeleton | **COMPLETE** |
-| **Phase 9** | Validate and Benchmark | Local validation, dry-runs, and deterministic benchmark flows | **ACTIVE** |
+| **Phase 9** | Validate and Benchmark | Local validation, dry-runs, and deterministic benchmark flows | **COMPLETE** |
 
 ---
 
 
@@ -0,0 +1,50 @@
+# Validation and Benchmarking
+
+This document details the usage and specifications for the local validation and benchmarking features of `comptext` CLI (`ctxt`).
+
+## 1. Local Validation Command
+
+The `ctxt validate` command prints the standard local validation commands used to ensure codebase integrity and safety compliance.
+
+### Usage
+```bash
+ctxt validate
+```
+
+### Output
+```text
+Standard local validation commands:
+cargo fmt --all --check
+cargo check
+cargo test
+cargo clippy -- -D warnings
+```
+
+---
+
+## 2. Deterministic Benchmark Command
+
+The `ctxt benchmark` command evaluates context packaging and model request generation deterministically under an offline sandbox.
+
+### Usage
+```bash
+ctxt benchmark --provider dummy "How should I test this repo?"
+```
+
+- **`--provider`**: Optional argument. Currently, only `"dummy"` is supported to prevent unauthorized live network calls (fails closed if another provider is specified). Defaults to `"dummy"`.
+- **task description**: The target prompt to run the benchmark against.
+
+### Artifact Outputs
+
+Each benchmark run builds a schema-checked Context Pack and runs the offline model query. It writes a deterministic JSON artifact to `.comptext/benchmark.latest.json` containing:
+
+- `schema_version`: Version of the benchmark format.
+- `task`: The prompt task.
+- `provider`: The provider used.
+- `context_pack_path`: Filepath to the generated Context Pack.
+- `request_artifact_path`: Filepath to the generated Model Request.
+- `response_artifact_path`: Filepath to the generated Model Response.
+- `validation_commands`: List of local validation commands.
+- `network`: Network state declaration (always `"offline-only"` in this phase).
+- `secrets`: Secrets handling status (always `"redacted"`).
+- `status`: Benchmark run completion status (always `"success"` if successful).
@@ -0,0 +1,38 @@
+# Phase 9 Status Report: Validate and Benchmark
+
+## Status Summary
+- **Phase**: Phase 9: Validate and Benchmark
+- **Status**: success
+- **Date**: 2026-06-04
+
+---
+
+## Metadata details
+- **PHASE**: Phase 9: Validate and Benchmark
+- **STATUS**: success
+- **FILES_CHANGED**:
+  - `src/cli.rs`
+  - `tests/cli_smoke.rs`
+  - `docs/VALIDATE_BENCHMARK.md`
+  - `reports/phase_9_status.md`
+  - `PROJEKT.md`
+- **COMMANDS_RUN**:
+  - `cargo fmt --all --check`
+  - `cargo check`
+  - `cargo test`
+  - `cargo clippy -- -D warnings`
+  - `cargo run --bin ctxt -- validate`
+  - `cargo run --bin ctxt -- benchmark --provider dummy "How should I test this repo?"`
+- **VALIDATION**:
+  - Code formatting checked and green.
+  - Compilation successful without warnings.
+  - All 35 tests (27 unit tests, 8 integration smoke tests) passed successfully.
+  - Manual execution of `ctxt validate` and `ctxt benchmark` successfully verified.
+- **ARTIFACTS**:
+  - `.comptext/benchmark.latest.json` (generated during benchmark run, ignored by git)
+- **GIT**: Pending commit and push
+- **NETWORK**: offline-only (no network requests executed)
+- **SECRETS**: Redacted from all outputs and metadata.
+- **POLICY_DECISIONS**: Benchmark execution fails closed if any non-dummy/network provider is specified.
+- **RISKS**: None. Clean offline mock execution maintains sandbox boundaries.
+- **NEXT**: Validate and finalize