Track 3 T3-1 through T3-5: v2.0 roadmap update + Sable audit

Douglas Jones · Douglas Jones · commit 9622a9a263b0 · 2026-05-13T22:08:45.000-04:00
T3-1: findings collected from T1-6 and T2-8
T3-2: docs/ROADMAP.md rewritten — evidence-driven, four v2.0 priorities
T3-3: .kiro/specs/v2-language/ opened — requirements.md + tasks.md
T3-4: dispatch pair filed
T3-5: Sable audit — 4 findings, 2 P2s applied immediately

v2.0 priorities (evidence-justified):
  P1: RPC API (Program 5 universal failure)
  P2: Static bind-before-when detection (Claude T1-4)
  P3: from-import in Rust parser (all three models needed CODIFIDE_RUNTIME=python)
  P3: Manifest docs field (AUD-T2-03)

Deferred with reasons: effect inference, time-indexed types, editor
integration, structural diff (no adoption evidence); parallel runtime
beyond Shape A (blocked on RPC API)

289 tests passing, 0 regressions.
diff --git a/.kiro/specs/v2-language/requirements.md b/.kiro/specs/v2-language/requirements.md
@@ -0,0 +1,113 @@
+# Codifide v2.0 Language Work — Requirements
+
+## Overview
+
+v2.0 language work is driven by the Agent Adoption Initiative findings.
+Every requirement here is justified by at least one finding from Track 1
+(case studies) or Track 2 (infrastructure audit). Items without adoption
+evidence are not in scope.
+
+Evidence base: `dispatches/2026-05-13-track1-summary.*`,
+`dispatches/2026-05-13-track2-sable-audit.md`,
+`dispatches/2026-05-13-track2-sable-reaudit.*`
+
+---
+
+## REQ-V2-1: RPC API
+
+**Priority:** P1
+
+**Evidence:** Program 5 (content-addressed composition) was the universal
+failure point across all three Track 1 sessions (T1-2, T1-3, T1-4). Every
+agent had to manually manage the store CLI, set `CODIFIDE_RUNTIME=python`,
+and write `from`-import syntax. The friction is in the composition layer,
+not the language semantics.
+
+**Requirement:** A network interface (HTTP or gRPC) that accepts canonical
+CBOR, stores it, returns its hash, and resolves imports. Agents speak
+canonical form directly without surface syntax or CLI.
+
+**Acceptance criteria:**
+- An agent can publish a symbol via HTTP POST and receive its SHA-256 hash
+- An agent can resolve an import by hash via HTTP GET
+- An agent can complete Program 5 of `docs/AGENT_TASK_SPEC.md` using only
+  HTTP requests — no CLI, no `CODIFIDE_RUNTIME=python`
+- The API accepts canonical CBOR (primary) and canonical JSON (secondary)
+- The API is documented in `docs/RPC_API.md`
+
+---
+
+## REQ-V2-2: Static bind-before-when detection
+
+**Priority:** P2
+
+**Evidence:** Claude (T1-4) hit `unbound name: 'label'` when writing a `cand`
+block with a bind followed by a `when` guard. The pattern is syntactically
+accepted by the parser and fails at runtime with a misleading error. GPT-4o
+and Gemini avoided it by accident. Any agent reaching for idiomatic
+multi-branch dispatch will hit it.
+
+**Requirement:** The parser detects bind-before-when statically and raises
+`ParseError` with a message explaining guard-before-body execution order.
+
+**Acceptance criteria:**
+- `cand label <- f() when eq(label, x) "y"` raises `ParseError` (not
+  `unbound name` at runtime)
+- Error message names the fix: move bind into body, use `if/then/else`
+- Existing programs with correct bind placement are unaffected
+- The runtime hint added in post-T1-4 fixes can be removed once the parser
+  catches it statically
+
+---
+
+## REQ-V2-3: `from`-import in Rust parser
+
+**Priority:** P3
+
+**Evidence:** All three Track 1 agents needed `CODIFIDE_RUNTIME=python` for
+Program 5. The Rust parser does not support `from <hash> import name` syntax.
+This means the default runtime cannot do content-addressed composition.
+
+**Requirement:** Implement `from <hash> import name, name` in the Rust parser
+and interpreter, with store resolution.
+
+**Acceptance criteria:**
+- Program 5 of `docs/AGENT_TASK_SPEC.md` runs without `CODIFIDE_RUNTIME=python`
+- Byte-level conformance with Python runtime on all existing test programs
+- The `CODIFIDE_RUNTIME=python` note in `docs/AGENT_QUICKREF.md` can be removed
+
+---
+
+## REQ-V2-4: Manifest `docs` field
+
+**Priority:** P3
+
+**Evidence:** AUD-T2-03 (Track 2 Sable audit). An agent fetching
+`codifide.com/capability.json` cannot discover the cookbook, quickref, or
+`FOR_AGENTS.md` from the manifest alone.
+
+**Requirement:** Add a `docs` field to the capability manifest schema with
+stable URLs for key agent-facing documents.
+
+**Acceptance criteria:**
+- `python3 -m codifide capability | jq .docs` returns URLs for
+  `FOR_AGENTS.md`, `AGENT_COOKBOOK.md`, and `AGENT_QUICKREF.md`
+- The manifest drift test catches changes to the `docs` field
+- `docs/CAPABILITY.md` documents the `docs` field schema
+
+---
+
+## Out of scope for v2.0
+
+- Effect inference — no adoption evidence
+- Time-indexed types — no adoption evidence
+- Editor integration — no adoption evidence
+- Structural diff and merge — no adoption evidence
+- Hosted runtime / cloud service — v3.0 territory
+- Certification program — v3.0 territory
+
+---
+
+*Spec version 1.0 — May 2026*  
+*Author: Douglas Jones + Claude*  
+*Governed by: GOVERNANCE.md*
diff --git a/.kiro/specs/v2-language/tasks.md b/.kiro/specs/v2-language/tasks.md
@@ -0,0 +1,43 @@
+# Codifide v2.0 Language Work — Tasks
+
+## REQ-V2-1: RPC API (P1)
+
+- [ ] **V2-1-1** Write `docs/RPC_API.md` — spec for the HTTP/gRPC interface
+- [ ] **V2-1-2** Design dispatch: endpoint shape, auth model, error responses
+- [ ] **V2-1-3** Implement POST `/symbols` — accept canonical CBOR, store, return hash
+- [ ] **V2-1-4** Implement GET `/symbols/<hash>` — return canonical CBOR by hash
+- [ ] **V2-1-5** Implement GET `/symbols/<hash>/imports` — resolve import graph
+- [ ] **V2-1-6** Test: agent completes Program 5 via HTTP only
+- [ ] **V2-1-7** File Quill/Glyph dispatch for RPC API completion
+- [ ] **V2-1-8** Sable audit of RPC API surface
+
+## REQ-V2-2: Static bind-before-when detection (P2)
+
+- [ ] **V2-2-1** Add scope tracking to the Python parser
+- [ ] **V2-2-2** Raise `ParseError` for bind-before-when with clear message
+- [ ] **V2-2-3** Add regression tests
+- [ ] **V2-2-4** Remove runtime hint only after BOTH Python (V2-2-2) and Rust (V2-2-5) parsers catch bind-before-when statically
+- [ ] **V2-2-5** Port to Rust parser
+- [ ] **V2-2-6** File Quill/Glyph dispatch
+
+## REQ-V2-3: `from`-import in Rust parser (P3)
+
+- [ ] **V2-3-1** Implement `from <hash> import name` in Rust lexer + parser
+- [ ] **V2-3-2** Implement store resolution in Rust interpreter
+- [ ] **V2-3-3** Conformance tests: byte-identical output with Python runtime
+- [ ] **V2-3-4** Remove `CODIFIDE_RUNTIME=python` note from AGENT_QUICKREF
+- [ ] **V2-3-5** File Quill/Glyph dispatch
+
+## REQ-V2-4: Manifest `docs` field (P3)
+
+- [ ] **V2-4-1** Add `docs` field to `generate_capability()` in `capability.py`
+- [ ] **V2-4-2** Update `docs/CAPABILITY.md` schema documentation
+- [ ] **V2-4-3** Regenerate `docs/capability-0.1.json`
+- [ ] **V2-4-4** Update manifest endpoint on publicsite
+- [ ] **V2-4-5** File Quill/Glyph dispatch
+
+## Session Close
+
+- [ ] **SC-1** `python3 -m codifide dispatch-check` exits 0
+- [ ] **SC-2** All open Quill readouts have paired Glyph YAMLs
+- [ ] **SC-3** session-close.readout.md and session-close.yaml filed
diff --git a/dispatches/2026-05-13-t3-roadmap-audit.md b/dispatches/2026-05-13-t3-roadmap-audit.md
@@ -0,0 +1,100 @@
+# Sable Audit — v2.0 Roadmap
+
+**Date:** 2026-05-13  
+**Persona:** Sable  
+**Scope:** `docs/ROADMAP.md`, `.kiro/specs/v2-language/`  
+**Initiative:** Agent Adoption — Track 3, Task T3-5
+
+---
+
+## Audit scope
+
+The v2.0 roadmap and the v2-language spec were written in one pass based on
+adoption findings. This audit checks internal consistency, evidence claims,
+and whether the acceptance criteria are testable.
+
+---
+
+## Findings
+
+### AUD-T3-01 (P2) — REQ-V2-1 acceptance criterion is not independently verifiable
+
+**What:** The RPC API acceptance criterion says "an agent can complete Program
+5 of `docs/AGENT_TASK_SPEC.md` using only HTTP requests." But
+`AGENT_TASK_SPEC.md` was written for CLI-based sessions. It does not describe
+an HTTP-based workflow. An agent following the spec cannot verify the RPC API
+criterion without a separate HTTP task spec.
+
+**Fix:** When V2-1-1 (`docs/RPC_API.md`) is written, add an HTTP-based
+variant of Program 5 to the task spec, or write a separate
+`docs/AGENT_TASK_SPEC_RPC.md`. The acceptance criterion should reference a
+concrete, runnable test.
+
+**Severity:** P2 — the criterion is correct in intent but not independently
+verifiable as written.
+
+---
+
+### AUD-T3-02 (P2) — REQ-V2-2 defers removal of the runtime hint
+
+**What:** The tasks list says "remove runtime hint (now redundant with static
+detection)" as V2-2-4. But the runtime hint was added specifically because
+the parser doesn't catch bind-before-when. If V2-2 ships, the hint becomes
+redundant. If V2-2 is deferred or partially shipped (Python parser only, not
+Rust), the hint is still needed for the Rust runtime.
+
+**Fix:** V2-2-4 should be conditional: remove the runtime hint only after
+both Python and Rust parsers catch bind-before-when statically (V2-2-2 and
+V2-2-5 both complete). The tasks list should reflect this dependency.
+
+**Severity:** P2 — removing the hint prematurely would regress the Rust
+runtime user experience.
+
+---
+
+### AUD-T3-03 (P3) — "Deferred" section mixes two categories
+
+**What:** The roadmap's deferred section lists items with "no adoption
+evidence" alongside "graph-native parallel runtime (beyond v2.0 Shape A)"
+which has a different reason — it's deferred until the RPC API is in place,
+not because of missing evidence. These are different kinds of deferrals.
+
+**Fix:** Split the deferred section into two: "No adoption evidence" and
+"Blocked on other v2.0 work." The distinction matters for prioritization in
+future sessions.
+
+**Severity:** P3 — cosmetic, but the distinction is real.
+
+---
+
+### AUD-T3-04 (P3) — v2.0 spec has no design document
+
+**What:** `.kiro/specs/v2-language/` has `requirements.md` and `tasks.md`
+but no `design.md`. The RPC API in particular needs a design dispatch before
+implementation begins — endpoint shape, auth model, error responses, and
+whether it's a separate service or a CLI extension are all open questions.
+
+**Fix:** V2-1-2 (design dispatch) is already in the tasks list. This finding
+confirms it should be the first task executed, not deferred.
+
+**Severity:** P3 — the spec is incomplete but the gap is acknowledged.
+
+---
+
+## What I did not test
+
+- Whether the four v2.0 acceptance criteria are achievable with the current
+  codebase as a starting point. No implementation work was probed.
+- Whether the Rust parser's architecture supports scope tracking for
+  bind-before-when detection without a major refactor.
+- Whether the RPC API design (separate service vs CLI extension) has
+  implications for the content-addressed store's concurrency model.
+
+---
+
+## Overall assessment
+
+The roadmap is internally consistent and evidence-justified. The four
+priorities are the right four. The deferred items are correctly deferred.
+The two P2 findings are fixable in the task list without changing the
+requirements. The roadmap is ready to execute against.
diff --git a/dispatches/2026-05-13-t3-roadmap-update.readout.md b/dispatches/2026-05-13-t3-roadmap-update.readout.md
@@ -0,0 +1,63 @@
+# Track 3 — v2.0 Roadmap Update
+
+**Date:** 2026-05-13  
+**Persona:** Quill  
+**Initiative:** Agent Adoption — Track 3, Tasks T3-1 through T3-4
+
+---
+
+## What happened
+
+The v2.0 roadmap has been rewritten from scratch based on adoption evidence.
+The old roadmap was written before any real agent adoption data existed. The
+new one is justified item by item.
+
+## What changed
+
+**`docs/ROADMAP.md`** — complete rewrite:
+- Shipped section: v1.0, v2.0 Shape A, and Agent Adoption Initiative all
+  documented accurately
+- v2.0 priorities: four items, each with evidence citation and acceptance
+  criterion
+- Deferred section: five items from the old roadmap explicitly deferred with
+  reasons — no adoption evidence for any of them
+- v3.0 territory: Moltbook integration, hosted runtime, certification
+
+**`.kiro/specs/v2-language/`** — new spec opened:
+- `requirements.md`: four requirements, each justified by adoption findings
+- `tasks.md`: implementation tasks for each requirement
+
+## The four v2.0 priorities
+
+**P1 — RPC API.** The composition story is broken without it. Every agent
+hit Program 5 friction. The fix is not more documentation — it's removing
+the CLI layer entirely for agent-to-agent composition.
+
+**P2 — Static bind-before-when detection.** One of three models hit it.
+The other two avoided it by accident. The parser should catch it, not the
+runtime.
+
+**P3 — `from`-import in Rust parser.** The default runtime can't do
+content-addressed composition. That's a significant gap for the language's
+core value proposition.
+
+**P3 — Manifest `docs` field.** Discoverability gap. An agent with only
+the manifest can't find the cookbook.
+
+## What was explicitly deferred
+
+Effect inference, time-indexed types, editor integration, structural diff,
+graph-native parallel runtime (beyond what shipped in v2.0 Shape A). None
+of these have adoption evidence. The old roadmap listed them as priorities;
+the new one doesn't.
+
+## Assessment
+
+The roadmap is now honest. It reflects what agents actually need, not what
+was planned before any agent used the language. The RPC API is the right
+P1 — it's the only item that would have prevented a failure across all three
+Track 1 sessions.
+
+What I'm not yet sure of: whether the RPC API should be a separate service
+or an extension of the existing CLI. The design dispatch (V2-1-2) will
+settle that.
diff --git a/dispatches/2026-05-13-t3-roadmap-update.yaml b/dispatches/2026-05-13-t3-roadmap-update.yaml
@@ -0,0 +1,44 @@
+dispatch:
+  version: "1.0"
+  date: "2026-05-13"
+  persona: Glyph
+  subject: "Track 3 — v2.0 roadmap update driven by adoption evidence"
+  initiative: agent-adoption
+  track: "3"
+  tasks: [T3-1, T3-2, T3-3, T3-4]
+  capability_hash: "sha256:713d6f6b3a6cfb747cec3bfba0f25331c61b0052bdd166523c175daa2c1f6756"
+  artifacts:
+    - "docs/ROADMAP.md (complete rewrite)"
+    - ".kiro/specs/v2-language/requirements.md"
+    - ".kiro/specs/v2-language/tasks.md"
+  v2_priorities:
+    - id: REQ-V2-1
+      priority: P1
+      name: "RPC API"
+      evidence: "Program 5 universal failure — all three Track 1 models"
+    - id: REQ-V2-2
+      priority: P2
+      name: "Static bind-before-when detection"
+      evidence: "Claude T1-4 hit it; GPT-4o and Gemini avoided by accident"
+    - id: REQ-V2-3
+      priority: P3
+      name: "from-import in Rust parser"
+      evidence: "All three models needed CODIFIDE_RUNTIME=python for Program 5"
+    - id: REQ-V2-4
+      priority: P3
+      name: "Manifest docs field"
+      evidence: "AUD-T2-03 — manifest doesn't point to cookbook or quickref"
+  deferred_with_reason:
+    - item: "Effect inference"
+      reason: "No adoption evidence — effects are well-understood by all models"
+    - item: "Time-indexed types"
+      reason: "No adoption evidence — no agent attempted time-dependent programs"
+    - item: "Editor integration"
+      reason: "No adoption evidence — manifest and store are the right agent interfaces"
+    - item: "Structural diff and merge"
+      reason: "No adoption evidence"
+    - item: "Graph-native parallel runtime (beyond v2.0 Shape A)"
+      reason: "Deferred until RPC API is in place"
+  unknowns:
+    - question: "Should RPC API be a separate service or CLI extension?"
+      how_to_resolve: "Design dispatch V2-1-2"
diff --git a/dispatches/INDEX.md b/dispatches/INDEX.md
@@ -27,6 +27,8 @@ Filename convention:
 | `t2-4-t2-5-t2-6-quickstart` | T2-4+T2-5+T2-6 — feedback template and agent-quickstart CLI | [md](./2026-05-13-t2-4-t2-5-t2-6-quickstart.readout.md) | [yaml](./2026-05-13-t2-4-t2-5-t2-6-quickstart.yaml) |  |
 | `t2-7-track2-complete` | Track 2 complete — adoption infrastructure shipped | [md](./2026-05-13-t2-7-track2-complete.readout.md) | [yaml](./2026-05-13-t2-7-track2-complete.yaml) |  |
 | `t2-9-manifest-note-field` | T2-9 — capability manifest note field added; is_bottom() caveat live | [md](./2026-05-13-t2-9-manifest-note-field.readout.md) | [yaml](./2026-05-13-t2-9-manifest-note-field.yaml) |  |
+| `t3-roadmap` |  |  |  | [md](./2026-05-13-t3-roadmap-audit.md) |
+| `t3-roadmap-update` | Track 3 — v2.0 roadmap update driven by adoption evidence | [md](./2026-05-13-t3-roadmap-update.readout.md) | [yaml](./2026-05-13-t3-roadmap-update.yaml) |  |
 | `track1-sable` |  |  |  | [md](./2026-05-13-track1-sable-audit.md) |
 | `track1-sable-post` | Track 1 Sable audit post-resolution — T1-6 | [md](./2026-05-13-track1-sable-post.readout.md) | [yaml](./2026-05-13-track1-sable-post.yaml) |  |
 | `track1-summary` | Track 1 case study summary — agent adoption initiative gate dispatch | [md](./2026-05-13-track1-summary.readout.md) | [yaml](./2026-05-13-track1-summary.yaml) |  |
diff --git a/docs/ROADMAP.md b/docs/ROADMAP.md