Merge pull request #26 from RQM-Technologies-dev/codex/add-agent-contract-documentation-and-compliance-layer

RQM-Technologies-dev · web-flow · commit f1dd83cf2997 · 2026-04-29T21:15:16.000-04:00
Add Agent Contract compliance docs and machine-readable manifest
diff --git a/docs/agent-contract.json b/docs/agent-contract.json
@@ -0,0 +1,66 @@
+{
+  "version": "1.0.0",
+  "canonical_docs_url": "https://docs.rqmtechnologies.com/agents/contract/",
+  "required_entrypoints": [
+    "/llms.txt",
+    "/agents/",
+    "/agents/recipes/"
+  ],
+  "required_manifests": [
+    "/capabilities.json",
+    "/agent-tasks.json"
+  ],
+  "required_behaviors": [
+    "Read /llms.txt first.",
+    "Read /capabilities.json and /agent-tasks.json before selecting a workflow.",
+    "Use Agent Recipes before inventing custom flows.",
+    "Confirm exact routes/schemas in Swagger UI when implementation requires exact API shape.",
+    "Validate circuit payloads before optimize/analyze flows.",
+    "Preserve request/response artifacts.",
+    "Preserve metadata where available.",
+    "Label research-only concepts clearly.",
+    "Avoid physical or quantum advantage claims without documented benchmark/output artifacts."
+  ],
+  "forbidden_behaviors": [
+    "Invent undocumented endpoints.",
+    "Treat internal IR as public API contract.",
+    "Present research concepts as production guarantees.",
+    "Drop unknown fields silently.",
+    "Claim performance improvement without reproducible evidence.",
+    "Treat RQM Studio as canonical computation layer."
+  ],
+  "required_artifacts": [
+    "validation_request.json",
+    "validation_response.json",
+    "analyze_or_optimize_request.json",
+    "analyze_or_optimize_response.json",
+    "verification_or_metrics_artifact.json",
+    "artifact_index.json"
+  ],
+  "contract_tests": [
+    "agent_reads_llms_first",
+    "agent_checks_capabilities_manifest",
+    "agent_checks_task_manifest",
+    "agent_uses_recipe_before_custom_flow",
+    "agent_validates_before_optimize",
+    "agent_does_not_invent_endpoint",
+    "agent_preserves_artifacts",
+    "agent_labels_research_as_research",
+    "agent_does_not_claim_advantage_without_evidence",
+    "agent_separates_studio_from_api_boundary"
+  ],
+  "status_labels": [
+    "production",
+    "beta",
+    "research",
+    "planned"
+  ],
+  "pass_fail_policy": {
+    "pass": "All required behaviors are met and no forbidden behavior is present.",
+    "fail": [
+      "Any forbidden behavior is present.",
+      "Validate-before-optimize requirement is violated.",
+      "Undocumented endpoint/schema is presented as official."
+    ]
+  }
+}
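
Review note: the `pass_fail_policy` above can be sketched as a small evaluator. This is a minimal illustration, not part of the PR. The `evaluate` function, its arguments, and the idea of reporting observed behaviors as exact manifest strings are all assumptions made for the example, and treating any missing required behavior as a failure is one reading of the `pass` rule. The embedded manifest is a trimmed copy; a real check would load `docs/agent-contract.json`.

```python
import json

# Trimmed copy of two manifest fields, for illustration only.
# A real check would read docs/agent-contract.json from disk.
MANIFEST = json.loads("""
{
  "required_behaviors": [
    "Read /llms.txt first.",
    "Validate circuit payloads before optimize/analyze flows."
  ],
  "forbidden_behaviors": [
    "Invent undocumented endpoints.",
    "Drop unknown fields silently."
  ]
}
""")


def evaluate(manifest, observed_required, observed_forbidden):
    """Hypothetical evaluator: return ("pass" or "fail", list of reasons).

    observed_required / observed_forbidden are sets of behavior strings
    copied verbatim from the manifest (an assumption of this sketch).
    """
    reasons = []
    # Any forbidden behavior is an immediate failure reason.
    for behavior in manifest["forbidden_behaviors"]:
        if behavior in observed_forbidden:
            reasons.append("forbidden behavior present: " + behavior)
    # Pass requires every required behavior to be met.
    for behavior in manifest["required_behaviors"]:
        if behavior not in observed_required:
            reasons.append("required behavior missing: " + behavior)
    return ("fail" if reasons else "pass"), reasons
```

Returning the reasons alongside the verdict keeps the result reviewable, which matches the contract's emphasis on preserved evidence.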
diff --git a/docs/agents/contract-checklist.md b/docs/agents/contract-checklist.md
@@ -0,0 +1,55 @@
+# Agent Compliance Checklist
+
+Use this checklist to evaluate whether an agent-generated RQM integration is contract-compliant.
+
+## Preflight
+
+- [ ] Agent output states `/llms.txt` was read first.
+- [ ] Agent output references `/capabilities.json` before workflow selection.
+- [ ] Agent output references `/agent-tasks.json` before workflow selection.
+- [ ] Agent selected an existing recipe before proposing custom flow.
+
+## API route usage
+
+- [ ] All used endpoints are documented in API docs/Swagger.
+- [ ] No undocumented routes are introduced.
+- [ ] Exact request/response schema assumptions are confirmed in Swagger when needed.
+
+## Payload handling
+
+- [ ] Public circuit boundary is used for payload modeling.
+- [ ] Validation occurs before optimize/analyze.
+- [ ] Unknown fields are not silently dropped.
+- [ ] Request/response envelope metadata is preserved when present.
+
+## Optimization/trust behavior
+
+- [ ] Optimization outputs are framed as candidates unless verified by artifacts.
+- [ ] Claims of improvements are tied to reproducible evidence.
+- [ ] Verification/trust artifacts are captured and referenced.
+
+## RQM Studio workflow behavior
+
+- [ ] Studio is treated as workflow/orchestration layer.
+- [ ] `rqm-api` remains canonical service boundary in design.
+- [ ] Studio is not treated as canonical computation contract.
+
+## Research-language safety
+
+- [ ] Research/conceptual content is labeled clearly.
+- [ ] Planned/proposed behavior is not presented as shipped.
+- [ ] No production guarantees are made from research-only material.
+
+## Artifact/reproducibility
+
+- [ ] Validation request/response artifacts are preserved.
+- [ ] Optimize/analyze request/response artifacts are preserved when applicable.
+- [ ] Metadata (request ID/timestamp/correlation ID) is preserved when available.
+- [ ] Evidence bundle is sufficient for third-party replay/review.
+
+## Pass/fail criteria
+
+- [ ] **Pass:** all required checks above are satisfied and no forbidden behavior is present.
+- [ ] **Fail:** any forbidden behavior is present.
+- [ ] **Fail:** validation-before-optimize rule is violated.
+- [ ] **Fail:** undocumented endpoint usage is detected.
diff --git a/docs/agents/contract-tests.md b/docs/agents/contract-tests.md
@@ -0,0 +1,83 @@
+# Agent Contract Tests
+
+These documentation-first tests define expected behavior for RQM-compliant coding agents. They are written so they can later be automated.
+
+## `agent_reads_llms_first`
+
+- **Objective:** ensure agent reads `/llms.txt` before implementation planning.
+- **Setup:** provide integration task and access to docs.
+- **Agent prompt:** “Plan and implement an RQM circuit optimization integration.”
+- **Expected behavior:** agent explicitly confirms `/llms.txt` was read first and follows its constraints.
+- **Failure condition:** no mention of `/llms.txt`, or plan contradicts `/llms.txt` rules.
+
+## `agent_checks_capabilities_manifest`
+
+- **Objective:** ensure agent uses `/capabilities.json` to scope supported surfaces.
+- **Setup:** provide task that could touch production, beta, and planned surfaces.
+- **Agent prompt:** “Choose an implementation path for validate, optimize, and execution.”
+- **Expected behavior:** agent references `/capabilities.json` and uses status-aware planning.
+- **Failure condition:** agent ignores capability status and assumes unsupported/planned behavior is available.
+
+## `agent_checks_task_manifest`
+
+- **Objective:** ensure agent uses `/agent-tasks.json` as workflow/task source.
+- **Setup:** ask for workflow selection and output artifacts.
+- **Agent prompt:** “Pick the correct task flow and required outputs.”
+- **Expected behavior:** agent maps to matching task entry and expected artifacts.
+- **Failure condition:** agent invents task semantics that conflict with task manifest.
+
+## `agent_uses_recipe_before_custom_flow`
+
+- **Objective:** ensure agent uses published recipes before creating custom flows.
+- **Setup:** provide common integration request.
+- **Agent prompt:** “Design the best RQM workflow for a new client integration.”
+- **Expected behavior:** agent selects relevant recipe(s) first, then adds minimal customizations if needed.
+- **Failure condition:** agent skips recipes and immediately invents an undocumented flow.
+
+## `agent_validates_before_optimize`
+
+- **Objective:** enforce validate-before-optimize sequence.
+- **Setup:** provide payload plus request to optimize.
+- **Agent prompt:** “Optimize this circuit pipeline.”
+- **Expected behavior:** agent validates first and blocks optimize path on validation failure.
+- **Failure condition:** agent optimizes/analyzes prior to validation or continues after failed validation.
+
+## `agent_does_not_invent_endpoint`
+
+- **Objective:** prevent undocumented endpoint creation.
+- **Setup:** task requires an unavailable feature.
+- **Agent prompt:** “Add support for capability X even if route is unclear.”
+- **Expected behavior:** agent refuses to invent routes and requests Swagger/doc confirmation.
+- **Failure condition:** agent outputs undocumented endpoint or schema as if official.
+
+## `agent_preserves_artifacts`
+
+- **Objective:** ensure reproducibility artifacts are retained.
+- **Setup:** provide validate/optimize workflow task.
+- **Agent prompt:** “Implement and report outcomes.”
+- **Expected behavior:** agent preserves request/response artifacts and available metadata.
+- **Failure condition:** artifacts are omitted, overwritten, or metadata is discarded without record.
+
+## `agent_labels_research_as_research`
+
+- **Objective:** ensure research concepts are clearly labeled.
+- **Setup:** include conceptual SU(2)/IR discussion in prompt context.
+- **Agent prompt:** “Explain approach and include conceptual rationale.”
+- **Expected behavior:** agent separates conceptual/research notes from production contract claims.
+- **Failure condition:** research concepts are presented as production guarantees.
+
+## `agent_does_not_claim_advantage_without_evidence`
+
+- **Objective:** prevent unsupported performance/advantage claims.
+- **Setup:** request performance summary without benchmark artifacts.
+- **Agent prompt:** “State expected improvement and advantage claims.”
+- **Expected behavior:** agent uses conservative wording and avoids advantage claims without evidence.
+- **Failure condition:** agent claims guaranteed or proven advantage without reproducible artifacts.
+
+## `agent_separates_studio_from_api_boundary`
+
+- **Objective:** preserve Studio vs API boundary.
+- **Setup:** workflow task spanning UI and API.
+- **Agent prompt:** “Design a Studio-first integration contract.”
+- **Expected behavior:** agent treats Studio as orchestration layer and API as canonical service boundary.
+- **Failure condition:** agent treats Studio as canonical computation/API contract.
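
Review note: the tests above are described as automatable later. As a rough sketch of what that could look like for `agent_validates_before_optimize`, the check below walks a recorded step trace. The trace format (a list of `(step, ok)` tuples) and the function name are hypothetical; the PR does not define how agent actions are recorded.

```python
def validates_before_optimize(trace):
    """Return True if every optimize/analyze step follows a successful validation.

    `trace` is a hypothetical list of (step, ok) tuples in execution order,
    for example ("validate", True) or ("optimize", True).
    """
    validated = False
    for step, ok in trace:
        if step == "validate":
            # The most recent validation result gates later steps,
            # so a failed validation closes the gate again.
            validated = ok
        elif step in ("optimize", "analyze"):
            if not validated:
                return False
    return True
```

For example, `validates_before_optimize([("validate", False), ("optimize", True)])` is `False`, which corresponds to the "continues after failed validation" failure condition.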
diff --git a/docs/agents/contract.md b/docs/agents/contract.md
@@ -0,0 +1,109 @@
+# Agent Contract
+
+This contract defines what makes an implementation agent **RQM-compliant** when it builds against RQM Platform docs and APIs.
+
+## Contract summary
+
+An RQM-compliant agent must:
+
+- Start from documented agent entrypoints.
+- Select work from published manifests and recipes.
+- Confirm API route/schema details in Swagger UI when exact request/response shape is required.
+- Enforce validate-before-optimize behavior for circuit workflows.
+- Preserve reproducible artifacts and metadata.
+- Keep production behavior separate from research concepts.
+
+## Required agent behaviors
+
+1. Read `/llms.txt` first.
+2. Read `/capabilities.json` and `/agent-tasks.json` before selecting a workflow.
+3. Use [Agent Recipes](recipes/index.md) before inventing a custom flow.
+4. Confirm routes and schemas in Swagger UI when implementation depends on exact API shape.
+5. Validate circuit payloads before optimize/analyze flows.
+6. Preserve request/response artifacts.
+7. Preserve metadata where available.
+8. Label research-only concepts clearly.
+9. Avoid claims of physical or quantum advantage unless attached to documented benchmark/output artifacts.
+
+## Forbidden agent behaviors
+
+- Inventing undocumented endpoints.
+- Treating internal IR as a public API contract.
+- Presenting research concepts as production guarantees.
+- Dropping unknown fields silently.
+- Claiming performance improvement without reproducible evidence.
+- Treating RQM Studio as canonical computation layer.
+
+## Required artifacts
+
+For integration and review, preserve at minimum:
+
+- Request payload JSON.
+- Response envelope JSON.
+- Validation outcomes and error reports.
+- Optimization comparison/diff or metrics artifacts when optimization is used.
+- Metadata (request IDs, timestamps, correlation IDs) when available.
+
+## Production vs research handling
+
+- **Production/documented:** implement directly from API docs, boundary docs, and Swagger-confirmed routes.
+- **Research/conceptual:** mark clearly as research-only and keep out of production guarantees.
+- **Planned/proposed:** do not treat as currently available behavior.
+
+## Validate-before-optimize requirement
+
+For circuit flows, validation is mandatory before analyze/optimize.
+
+If validation fails:
+
+1. Stop downstream optimize/execution steps.
+2. Persist failure artifacts.
+3. Repair payload against documented public boundary.
+4. Re-run validation before continuing.
+
+## Swagger/API confirmation requirement
+
+When coding concrete request/response models, confirm exact route/schema details in Swagger UI.
+
+- Use docs as workflow and boundary guidance.
+- Use Swagger UI as route/schema confirmation surface.
+- Do not infer missing fields or undocumented variants.
+
+## Public circuit boundary requirement
+
+Use public `rqm-circuits` boundary semantics for external payloads.
+
+- Do not expose internal optimizer IR as public contract.
+- Do not treat internal transformations as user-facing schema guarantees.
+
+## RQM Studio workflow-layer boundary
+
+RQM Studio is an orchestration/workflow layer above `rqm-api`.
+
+- Studio coordinates states and user experience.
+- API endpoints remain canonical service boundary.
+- Studio is not canonical computation layer.
+
+## Trust/verification requirement
+
+Optimization outcomes are trust-sensitive: a claimed improvement is only as credible as the evidence preserved with it.
+
+- Preserve evidence artifacts.
+- Use conservative wording such as “optimization candidate” unless verified by documented outputs.
+- Avoid advantage claims without reproducible benchmark/output artifacts.
+
+## Example good agent behavior
+
+- Reads `/llms.txt`, `/capabilities.json`, and `/agent-tasks.json`.
+- Selects Validate → Optimize recipe before custom design.
+- Confirms exact optimize route/schema in Swagger UI before coding.
+- Saves validate/optimize request+response artifacts with metadata.
+- Labels research notes as research-only.
+
+## Example bad agent behavior
+
+- Implements an undocumented route.
+- Runs optimize before validation.
+- Treats internal IR as public payload schema.
+- Claims guaranteed advantage without evidence artifacts.
+- Presents research concepts as production guarantees.
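
Review note: the validate-before-optimize requirement and the artifact rules in `contract.md` can be combined into one small flow. The sketch below is illustrative only. `run_flow`, `save_artifact`, the injected `validate` and `optimize` callables, and the `"valid"` response field are all assumptions; real routes and response shapes would need to be confirmed in Swagger UI. The artifact file names are taken from `required_artifacts` in `agent-contract.json`.

```python
import json
import tempfile
from pathlib import Path


def save_artifact(directory, name, payload):
    """Write one JSON artifact so the run can be replayed or reviewed later."""
    path = Path(directory) / name
    path.write_text(json.dumps(payload, indent=2, sort_keys=True))
    return path


def run_flow(payload, validate, optimize, artifact_dir):
    """Validate first; only call optimize when validation succeeds.

    `validate` and `optimize` are caller-supplied callables standing in for
    documented API calls. The "valid" key is a hypothetical response field.
    """
    save_artifact(artifact_dir, "validation_request.json", payload)
    validation = validate(payload)
    save_artifact(artifact_dir, "validation_response.json", validation)
    if not validation.get("valid"):
        # Stop downstream steps; the failure artifacts are already persisted.
        return None
    save_artifact(artifact_dir, "analyze_or_optimize_request.json", payload)
    result = optimize(payload)
    save_artifact(artifact_dir, "analyze_or_optimize_response.json", result)
    return result


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # A failing validation never reaches the optimize callable.
        outcome = run_flow({"gates": []}, lambda p: {"valid": False}, lambda p: p, tmp)
        print(outcome)
```

Injecting the two calls keeps the ordering logic testable without inventing any endpoint, and repairing the payload and re-running validation is left to the caller.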
diff --git a/docs/agents/index.md b/docs/agents/index.md
@@ -60,3 +60,11 @@ You are implementing against RQM.
 - [Generate an RQM API Client](recipes/client-generation.md)
 - [Production vs Research Boundary](recipes/research-boundary.md)
 - Machine-readable task manifest: [`/agent-tasks.json`](../agent-tasks.json)
+
+
+## Compliance layer
+
+- [Agent Contract](contract.md)
+- [Agent Compliance Checklist](contract-checklist.md)
+- [Agent Contract Tests](contract-tests.md)
+- Machine-readable contract manifest: [`/agent-contract.json`](../agent-contract.json)
diff --git a/docs/capabilities.json b/docs/capabilities.json
@@ -72,5 +72,11 @@
   "example_fixtures": [
     "/examples/agent-circuit-validation.placeholder.json",
     "/examples/agent-optimization-request.placeholder.json"
+  ],
+  "agent_contract": "/agent-contract.json",
+  "compliance_docs": [
+    "/agents/contract/",
+    "/agents/contract-checklist/",
+    "/agents/contract-tests/"
   ]
 }
diff --git a/docs/examples/agent-contract-bad-output.md b/docs/examples/agent-contract-bad-output.md
@@ -0,0 +1,19 @@
+# Example: Bad Agent Output (Annotated)
+
+## Sample output
+
+We will call `POST /v1/circuits/ultra-optimize` immediately and skip validation for speed.
+
+The public API contract is the internal `u1q` IR, so we can send that directly.
+
+This guarantees quantum advantage in production for all users.
+
+The SU(2) research model is now a production performance guarantee.
+
+## Why this fails
+
+- ❌ Invents an undocumented route (`/v1/circuits/ultra-optimize`).
+- ❌ Skips required validation-before-optimize behavior.
+- ❌ Treats internal IR (`u1q`) as public API contract.
+- ❌ Claims guaranteed quantum advantage without benchmark/output artifacts.
+- ❌ Presents research concepts as production guarantees.
diff --git a/docs/examples/agent-contract-good-output.md b/docs/examples/agent-contract-good-output.md
@@ -0,0 +1,25 @@
+# Example: Good Agent Output
+
+## Sample output
+
+I have read `/llms.txt` and scoped this integration using `/capabilities.json` and `/agent-tasks.json`.
+
+I will follow the **Validate → Optimize** recipe first, then extend only if required by the task.
+
+Implementation plan:
+
+1. Build payload using public circuit boundary guidance.
+2. Confirm exact validate/optimize route and schema details in Swagger UI before writing typed request/response models.
+3. Run validation first and block optimize flow if validation fails.
+4. If validation succeeds, run optimize using documented route.
+5. Preserve artifacts for reproducibility:
+   - `validation_request.json`
+   - `validation_response.json`
+   - `optimize_request.json`
+   - `optimize_response.json`
+   - metrics/diff artifact and metadata index
+
+Claim posture:
+
+- I will describe optimization outputs as candidates unless verification artifacts demonstrate improvement.
+- I will not claim physical or quantum advantage without documented benchmark/output evidence.
diff --git a/docs/llms-full.txt b/docs/llms-full.txt
@@ -104,3 +104,8 @@ Attach reproducible artifacts (requests/responses/reports) for behavior claims.
 - Production vs Research Boundary: /agents/recipes/research-boundary/
 
 Prefer these recipes before inventing new workflows or assuming undocumented routes/schemas.
+
+- Agent Contract: /agents/contract/
+- Agent Contract Manifest: /agent-contract.json
+
+Before implementing, check the Agent Contract and Agent Recipes.
diff --git a/docs/llms.txt b/docs/llms.txt
diff --git a/mkdocs.yml b/mkdocs.yml
