Cross-runtime conformance

Every language green box must produce byte-identical /api/v1/* responses for a shared set of scenarios. This document describes the harness that enforces that guarantee, how to run it locally, and how to update it when the contract changes.

Why

We have one HTTP contract and (soon) multiple language-native implementations. If any runtime drifts — different status codes, different response shapes, different error envelopes — operators swapping BACKEND_URL between runtimes get silently inconsistent behavior.

Fixture-based diffs catch drift the moment it shows up, regardless of which runtime introduced it.

What lives where

conformance/
├── scenarios.json              ← machine-readable scenarios
├── scenarios.md                ← narrative counterpart
├── fixtures/                   ← expected normalized responses
│   ├── workspace-crud-basic.json
│   ├── workspace-kind-is-immutable.json
│   ├── workspace-credentials-must-be-secret-ref.json
│   ├── workspace-test-connection-mock.json
│   ├── workspace-api-key-lifecycle.json
│   ├── knowledge-base-crud-basic.json
│   ├── kb-document-crud-basic.json
│   ├── kb-search-empty.json
│   ├── chunking-service-crud-lifecycle.json
│   ├── embedding-service-crud-lifecycle.json
│   ├── knowledge-filter-crud-lifecycle.json
│   ├── agent-crud-basic.json
│   └── agent-error-envelopes.json
├── mock-astra/
│   └── server.ts               ← stand-in Astra endpoint (Node)
├── normalize.mjs               ← shape-agnostic placeholder scrubber
├── runner.mjs                  ← generic scenario runner
└── README.md                   ← overview for contributors

And inside the TypeScript runtime:

runtimes/typescript/
├── scripts/
│   └── conformance-regenerate.ts   ← runs the canonical TS runtime against itself,
│                                     writes fixtures
└── tests/conformance/
    └── drift.test.ts               ← drift guard — fails if the TS runtime
                                      changes shape without updating fixtures

Scenarios

conformance/scenarios.json is a JSON array. Each entry:

{
  "slug": "workspace-crud-basic",
  "description": "Minimum viable workspace lifecycle.",
  "steps": [
    { "method": "POST", "path": "/api/v1/workspaces", "body": {...} },
    { "method": "GET",  "path": "/api/v1/workspaces" },
    { "method": "GET",  "path": "/api/v1/workspaces/$1.workspaceId" },
    ...
  ]
}

$N.field references the field of step N's raw response body (1-indexed). Supports dot-paths: $1.workspaceId, $2.workspace.workspaceId, etc.

Current scenarios (kept in sync with conformance/scenarios.json — regenerate fixtures via npm run conformance:regenerate whenever this list changes):

Slug	Covers
`workspace-crud-basic`	Workspace POST / GET / PATCH / DELETE lifecycle
`workspace-kind-is-immutable`	Workspace `kind` cannot be changed after creation
`workspace-credentials-must-be-secret-ref`	Raw credential values are rejected before reaching the SecretResolver
`workspace-test-connection-mock`	Mock workspace connection probe response shape
`workspace-api-key-lifecycle`	API-key issue, list, revoke, list lifecycle
`knowledge-base-crud-basic`	Knowledge-base POST / GET / PATCH / DELETE with bound services
`kb-document-crud-basic`	Sync document upsert + retrieval shape
`kb-search-empty`	Search response envelope when the KB has no records
`chunking-service-crud-lifecycle`	Chunking-service POST / GET / PATCH / DELETE shape
`embedding-service-crud-lifecycle`	Embedding-service POST / GET / PATCH / DELETE shape
`knowledge-filter-crud-lifecycle`	KB-scoped knowledge-filter POST / GET / PATCH / DELETE shape
`agent-crud-basic`	Agent POST / GET / PATCH / DELETE lifecycle
`agent-error-envelopes`	Stable error envelopes on the agent surface (404 / 422 / etc.)

The KB / agent fixtures pin the wire shape; routes that stay runtime-only by design (timing- or driver-method-dependent) remain covered by the per-runtime test suite.

Routes that stay runtime-only by design (timing- or driver-method-dependent):

GET /knowledge-bases/{kb}/documents/{d}/chunks — driver-side listRecords filtered by documentId
DELETE /knowledge-bases/{kb}/documents/{d} — chunk-cascade via driver deleteRecords

These move into conformance once a second runtime starts implementing them and the fixture format proves stable across drivers.

More land as KB scenarios are reauthored and as chat / MCP routes ship.

Fixtures

One JSON file per scenario in conformance/fixtures/:

{
  "slug": "workspace-crud-basic",
  "description": "...",
  "captures": [
    {
      "step": 1,
      "request":  { "method": "POST", "path": "/api/v1/workspaces", "body": {...} },
      "response": { "status": 201, "body": {...} }
    },
    ...
  ]
}

All volatile values (UUIDs, timestamps, request IDs) are replaced with stable placeholders — see Normalization.

Fixtures are the source of truth for cross-runtime behavior. Any change to them must be accompanied by matching updates in every language runtime, all in one PR.

Normalization

normalize.mjs walks any JSON tree and substitutes:

Value shape	Replacement	Strategy
RFC 4122 UUID	`{{UUID_N}}`	1-indexed, by first appearance
ISO-8601 timestamp	`{{TS}}`	Collapsed to a single placeholder (millisecond collisions between records made indexed placeholders non-deterministic)
32-char hex request ID	`{{REQID_N}}`	1-indexed, by first appearance

Object keys are also sorted alphabetically for deterministic output.

Port this file verbatim into any language whose conformance harness needs to compare against these fixtures — the ordering rules and placeholder names are the contract.

Running the suite

TypeScript runtime (default)

The drift test is part of the main Vitest suite:

npm test

See runtimes/typescript/tests/conformance/drift.test.ts — one test per scenario plus one "every scenario has a fixture" sanity check.

Regenerating fixtures

Only when you've intentionally changed the contract:

npm run conformance:regenerate

This spins up a fresh memory-backed TS runtime in-process, replays every scenario via app.request(...), normalizes the captures, and writes the fixture files. Commit the output alongside the runtime change in the same PR.

Python runtime

Point at a running mock Astra (needed so the runtime has a deterministic backend):

# Repo root, terminal 1:
npm run conformance:mock          # listens on :4010

# From runtimes/python/, terminal 2:
python -m venv .venv && source .venv/bin/activate
pip install -e '.[dev]'
pytest

Until the Python runtime's FastAPI routes are implemented, xfail(strict=True) markers guard the scenarios. Each marker comes off as its scenario goes green.

The mock Astra server

Every green box's conformance suite points its Astra config at conformance/mock-astra/. It's a tiny Node HTTP server that:

Accepts any request and returns a stub success envelope.
Captures every inbound request for optional inspection via GET /_captured.
Resets its capture log via POST /_reset.

It exists because the fixtures describe the runtime's outbound responses to its clients, but a runtime still needs some Astra endpoint to talk to during tests. The mock gives every language runtime the same deterministic backend without needing Astra credentials in CI.

The mock's request log is not a conformance assertion target — it's a debugging tool. If your runtime's responses differ from the fixtures, you can inspect GET /_captured to see what it sent upstream.

When to update fixtures

✅ You intentionally changed a response body, status code, or error envelope shape.
✅ You added or removed a route exercised by a scenario.
✅ You added a new scenario.
❌ Never to "fix a flake." Fixtures are deterministic by construction; a flake means normalization is wrong or a runtime is non-deterministic. Fix the root cause.
❌ Never in isolation. A fixture update with no code change means the committed fixture is wrong. Revert and investigate.

Adding a scenario

Append to conformance/scenarios.json.
Add narrative description to conformance/scenarios.md.
Run npm run conformance:regenerate to materialize the fixture.
Run every language runtime's tests — any drift surfaces the runtimes that need updates. Update them in the same PR.
Commit fixtures + scenario + every runtime update together.

Real-Astra smoke (separate harness)

The conformance harness above runs against the deterministic mock-astra stand-in so it can stay in CI on every PR. A second harness lives at runtimes/typescript/scripts/smoke-astra.ts that boots the runtime in-process against a real Astra Data API and exercises the full workspace → services → knowledge-base → sync ingest → async ingest → search → cleanup pipeline. Run locally with:

ASTRA_DB_API_ENDPOINT=https://<db-id>-<region>.apps.astra.datastax.com \
ASTRA_DB_APPLICATION_TOKEN=AstraCS:... \
  npm run smoke:astra

The script exits 0 with a "skipping" message when the env vars aren't set, so:

Forks and external-contributor PRs see a green "skipped" run.
The dedicated smoke-astra.yml workflow is wired to org secrets; it runs daily on main, on every push that touches the astra driver / astra-store / smoke script, and on manual dispatch.

The smoke is intentionally not part of the per-PR verify job — Astra round-trips are too slow and burn API quota. It's a "did our changes break the actual product" guard, layered on top of the per-PR conformance contract guard.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cross-runtime conformance

Why

What lives where

Scenarios

Fixtures

Normalization

Running the suite

TypeScript runtime (default)

Regenerating fixtures

Python runtime

The mock Astra server

When to update fixtures

Adding a scenario

Real-Astra smoke (separate harness)

FilesExpand file tree

conformance.md

Latest commit

History

conformance.md

File metadata and controls

Cross-runtime conformance

Why

What lives where

Scenarios

Fixtures

Normalization

Running the suite

TypeScript runtime (default)

Regenerating fixtures

Python runtime

The mock Astra server

When to update fixtures

Adding a scenario

Real-Astra smoke (separate harness)