Reclassify post-M7 backlog as up-next critical scope

rand · rand · commit 72788bc39a92 · 2026-02-19T18:08:16.000-07:00
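
A minimal sketch of one way to confirm the reclassification after applying this commit: scan the JSONL issue records and list the open priority-0 issues labelled `up-next`. The field names (`id`, `status`, `priority`, `labels`) are taken from the records in this diff. The helper name `up_next_issues` and the inline sample data are hypothetical and are not part of the repository or of any issue-tracker tooling.

```python
import json


def up_next_issues(jsonl_text):
    """Return ids of open, priority-0 issues that carry the 'up-next' label."""
    ids = []
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        issue = json.loads(line)
        if (
            issue.get("status") == "open"
            and issue.get("priority") == 0
            and "up-next" in issue.get("labels", [])
        ):
            ids.append(issue["id"])
    return ids


# Hypothetical, trimmed-down records mirroring the shape used in .beads/issues.jsonl.
sample = "\n".join(
    [
        json.dumps(
            {
                "id": "loop-azq",
                "status": "open",
                "priority": 0,
                "labels": ["critical-scope", "execution-plan", "spec-gap", "up-next"],
            }
        ),
        json.dumps(
            {
                "id": "loop-bih",
                "status": "closed",
                "priority": 1,
                "labels": ["execution-plan", "m7"],
            }
        ),
    ]
)

print(up_next_issues(sample))
```

To run the same check against the real file, read `.beads/issues.jsonl` as text and pass it to the function. This assumes each line of that file is one complete JSON object.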
diff --git a/.beads/issues.jsonl b/.beads/issues.jsonl
@@ -19,7 +19,7 @@
 {"id":"loop-9mn","title":"Spec Agent Implementation","description":"Implement specialized agent for specification creation and refinement.","status":"closed","priority":1,"issue_type":"feature","created_at":"2026-01-16T10:32:52Z","updated_at":"2026-01-16T12:14:27Z","closed_at":"2026-01-16T12:14:27Z"}
 {"id":"loop-9t4","title":"Phase 4: Python Bindings (PyO3)","description":"Create Python bindings for rlm-core using PyO3.\n\n## Deliverables\n- PyO3 build configuration\n- Core bindings: Orchestrator, MemoryStore, TrajectoryEmitter\n- Async support via asyncio\n- Type stubs for IDE support\n- PyPI-publishable package\n\n## Technical Notes\n- Use maturin for build\n- Support Python 3.11+\n- Full type annotations","acceptance_criteria":"- [ ] PyO3 bindings compile\n- [ ] All core traits exposed\n- [ ] Async operations work\n- [ ] Package installable via pip","status":"closed","priority":1,"issue_type":"feature","created_at":"2026-01-15T16:08:07Z","updated_at":"2026-01-15T17:29:58Z","closed_at":"2026-01-15T17:29:58Z","close_reason":"Complete - PyO3 bindings, maturin build, type stubs, all tests pass"}
 {"id":"loop-avh","title":"E2E Validation: Exercise rlm-core capabilities end-to-end","description":"Exercise rlm-core capabilities end-to-end corresponding to actual usage by rlm-claude-code and recurse. This is a precondition for migrations.\n\n## Test Categories\n\n### 1. Complexity Classification \u0026 Activation\n- Simple queries → bypass RLM (fast-path)\n- Multi-file references → activate\n- Cross-context reasoning → activate\n- Debugging tasks → activate\n- User intent signals (thorough/fast)\n\n### 2. Memory System (Hypergraph)\n- Node CRUD (Entity, Fact, Experience, Decision, Snippet)\n- Tier evolution (Task → Session → LongTerm → Archive)\n- HyperEdge relationships\n- Content search (BM25/semantic)\n- Memory gate (epistemic filtering)\n\n### 3. REPL Environment\n- Context externalization (files, messages, tool_outputs)\n- Helper functions (peek, search, summarize)\n- Deferred operations (LLM calls)\n- Sandbox restrictions\n\n### 4. Trajectory Streaming\n- All 20+ event types\n- Verbosity levels\n- Cost tracking events\n- JSON export\n\n### 5. Recursive Orchestration\n- Depth ladder (0→1→2→3)\n- Model tier selection per depth\n- Cost aggregation across depths\n\n### 6. Epistemic Verification (Strawberry)\n- Claim extraction\n- KL divergence computation\n- Evidence scrubbing\n- Memory gate filtering\n\n### 7. LLM Client \u0026 Routing\n- Multi-provider support\n- Smart routing based on query type\n- Prompt caching\n\n### 8. Formal Verification (Lean)\n- Lean REPL interaction\n- Proof automation tiers\n- Dual-track sync (Topos ↔ Lean)","notes":"## E2E Validation Results (2026-01-17)\n\n### ✅ PASSED\n1. **Rust build** - rlm-core compiles with 14 warnings (unused code)\n2. **Python bindings** - PyO3 builds and installs successfully\n3. **Complexity classification** - PatternClassifier works, correctly detects signals\n4. **Memory system** - CRUD, search, tier evolution, statistics all work\n5. **Trajectory events** - All 21 event types, factory methods, log_line serialization\n6. **Go bindings** - All 21 tests pass after fixes\n7. **Python REPL** - Helpers (peek, search), sandbox (blocks dangerous ops), protocol\n\n### 🔧 FIXED (committed)\n- Go bindings: Added missing header includes to memory.go, trajectory.go\n- Go bindings: Removed unused json import from rlmcore.go  \n- Go bindings: Fixed example function name in example_test.go\n\n### ⚠️ GAPS DISCOVERED\n1. **loop-ocz**: Epistemic verification not exposed in Python bindings (blocks loop-cyl migration)\n2. **loop-eby**: Lean REPL requires Lean 4 installation for full testing (environment gap)\n\n### RECOMMENDATION\nThe core rlm-core capabilities are validated and ready for migrations. The epistemic module gap (loop-ocz) should be addressed before rlm-claude-code migration since that plugin uses hallucination detection.","status":"closed","priority":1,"issue_type":"task","assignee":"claude","created_at":"2026-01-16T17:13:10Z","updated_at":"2026-01-16T17:53:23Z","closed_at":"2026-01-16T17:53:23Z","close_reason":"E2E validation complete. Core capabilities validated. 2 gaps discovered and tracked. Go binding fixes committed."}
-{"id":"loop-azq","title":"Post-M7 deferred SPEC refinements backlog","description":"Track deferred implementation items left after M7 closure: SPEC-20 composition validation hardening, SPEC-21 custom switch strategy support, SPEC-22 lean diagnostic-feedback loop integration, SPEC-23 CLI/advanced HTML visualization controls, SPEC-24 metric trait/object-safety refinements, SPEC-25 explicit SizeConfig/auto_chunk APIs, and SPEC-26 provider-aware rate-limit/backoff policy. This issue owns decomposition into executable tranche tasks.","status":"open","priority":2,"issue_type":"task","owner":"rand.arete@gmail.com","created_at":"2026-02-20T00:57:15Z","created_by":"Rand Arete","updated_at":"2026-02-20T00:57:15Z","labels":["execution-plan","spec-gap"]}
+{"id":"loop-azq","title":"Up-next critical SPEC refinements backlog","description":"Track up-next critical implementation scope following M7 closure: SPEC-20 composition validation hardening, SPEC-21 custom switch strategy support, SPEC-22 lean diagnostic-feedback loop integration, SPEC-23 CLI/advanced HTML visualization controls, SPEC-24 metric trait/object-safety refinements, SPEC-25 explicit SizeConfig/auto_chunk APIs, and SPEC-26 provider-aware rate-limit/backoff policy. This issue owns immediate decomposition into executable tranche tasks.","status":"open","priority":0,"issue_type":"task","owner":"rand.arete@gmail.com","created_at":"2026-02-20T00:57:15Z","created_by":"Rand Arete","updated_at":"2026-02-20T01:04:29Z","labels":["critical-scope","execution-plan","spec-gap","up-next"]}
 {"id":"loop-b7b","title":"Add REPL FFI bindings to rlm-core","description":"The rlm-core Rust crate has a REPL module (src/repl.rs) with:\n- ReplConfig for configuration\n- ReplHandle for process management  \n- execute(), get_variable(), set_variable(), resolve_operation() methods\n\nBut there's no FFI binding exposed yet. This blocks Phase 4 of the recurse-rlmcore migration.\n\nRequired work:\n1. Add FFI functions in src/ffi/repl.rs\n2. Add C header declarations in include/rlm_core.h\n3. Add Go bindings in go/rlmcore/repl.go\n4. Test cross-language REPL operations","status":"closed","priority":2,"issue_type":"task","assignee":"claude","owner":"rand.arete@gmail.com","created_at":"2026-01-20T10:03:28Z","created_by":"Rand Arete","updated_at":"2026-01-20T10:29:45Z","closed_at":"2026-01-20T10:29:45Z","close_reason":"Completed REPL FFI bindings: Rust FFI (src/ffi/repl.rs), C header (include/rlm_core.h), Go bindings (go/rlmcore/repl.go). All builds compile successfully. Integration testing requires Python rlm-repl package."}
 {"id":"loop-bih","title":"M7 spec completion and integration hardening","notes":"M7 execution plan and gate matrix authored in docs/execution-plan; tasks loop-bih.1..loop-bih.10 ready for implementation sequencing.","status":"closed","priority":1,"issue_type":"epic","owner":"rand.arete@gmail.com","created_at":"2026-02-19T23:11:57Z","created_by":"Rand Arete","updated_at":"2026-02-20T01:01:47Z","closed_at":"2026-02-20T01:01:47Z","close_reason":"M7 execution tranche complete: loop-bih.1..loop-bih.11 closed with evidence-backed gate coverage; residual deferred spec refinements moved to loop-azq.","labels":["execution-plan","m7"]}
 {"id":"loop-bih.1","title":"M7-T01 SPEC-26 LLM_BATCH end-to-end runtime closure","notes":"Recovered crash-session REPL/SUBMIT + llm_batch helper/test bundle integrated on codex/recovery-crash-integration with M7-T01 gate artifacts. Remaining SPEC-26 host-orchestration closure still pending.","status":"closed","priority":1,"issue_type":"task","owner":"rand.arete@gmail.com","created_at":"2026-02-19T23:11:57Z","created_by":"Rand Arete","updated_at":"2026-02-19T23:55:38Z","closed_at":"2026-02-19T23:55:38Z","close_reason":"Completed M7-T01: implemented Rust host llm_batch resolution path (pending_operations RPC + ReplHandle::resolve_pending_llm_batches), refreshed VG-LOOP-BATCH-001/VG-LOOP-REPL-001/VG-EFFICACY-001 artifacts, and added host roundtrip batch integration evidence.","labels":["execution-plan","m7"],"dependencies":[{"issue_id":"loop-bih.1","depends_on_id":"loop-bih","type":"parent-child","created_at":"2026-02-19T16:11:57Z","created_by":"Rand Arete","metadata":"{}"}]}
diff --git a/docs/execution-plan/STATUS.md b/docs/execution-plan/STATUS.md
@@ -38,7 +38,7 @@ Last updated: 2026-02-20
 | F10 | No executable performance gate harness for REPL startup/batch throughput | Resolved by M5-T01 (`run_m5_perf_harness.sh` + VG-PERF artifacts) | M5 |
 | F11 | Efficacy scenario suite lacked explicit mixed batch and fallback-non-submit coverage | Resolved by M5-T02 scenario matrix + targeted tests (`45 passed`) | M5 |
 | F12 | No baseline-vs-candidate performance/efficacy rollup report | Resolved by M5-T03 comparative analysis report with regression check | M5 |
-| F13 | Residual implementation gaps remained across SPEC-20..27 after M0-M6 closure | M7 task cards (`M7-T01`..`M7-T10`) are complete; deferred post-M7 refinements are explicitly tracked in `loop-azq` with reconciled spec metadata | M7/post-M7 |
+| F13 | Residual implementation gaps remained across SPEC-20..27 after M0-M6 closure | M7 task cards (`M7-T01`..`M7-T10`) are complete; post-M7 up-next critical refinements are explicitly tracked in `loop-azq` with reconciled spec metadata | M7/post-M7 |
 
 ## Active Blockers
 
@@ -98,13 +98,13 @@ Last updated: 2026-02-20
 | R45 | Closed M7-T07 by implementing optimizer reasoning-capture summaries and persistence helpers (`OptimizedModule::save/load`), then validating optimizer/efficacy/perf guardrails | `evidence/2026-02-20/milestone-M7/M7-T07-validation-summary.md` |
 | R46 | Closed M7-T08 by enforcing SPEC-25 root prompt submit semantics, aligning helper-surface guidance with runtime helpers, and passing context/REPL/doc gates | `evidence/2026-02-20/milestone-M7/M7-T08-validation-summary.md` |
 | R47 | Closed M7-T09 by delivering `io_rflx_interop.v0` fixture/calibration artifacts, executable RFLX fixture gate coverage, and refreshed RFLX/contract/perf evidence | `evidence/2026-02-20/milestone-M7/M7-T09-validation-summary.md` |
-| R48 | Closed M7-T10 by reconciling SPEC-20..27 status/governance metadata, refreshing consumer claim evidence, and assigning deferred post-M7 gaps to `loop-azq` | `evidence/2026-02-20/milestone-M7/M7-T10-validation-summary.md` |
+| R48 | Closed M7-T10 by reconciling SPEC-20..27 status/governance metadata, refreshing consumer claim evidence, and assigning post-M7 up-next critical gaps to `loop-azq` | `evidence/2026-02-20/milestone-M7/M7-T10-validation-summary.md` |
 
 ## Top Priority Queue (Next 9 Tasks)
 
 | Priority | Task ID | Description |
 |---|---|---|
-| P0 | loop-azq | Decompose and execute deferred post-M7 spec refinements backlog |
+| P0 | loop-azq | Decompose and execute up-next critical post-M7 spec refinements backlog |
 
 ## Consumer Readiness Snapshot
 
diff --git a/docs/execution-plan/TASK-REGISTRY.md b/docs/execution-plan/TASK-REGISTRY.md
@@ -30,7 +30,7 @@ Single source of truth for execution tasks and dependencies.
 | 9 | M7-T09 (`loop-bih.9`) | done | Delivered loop-owned io-rflx fixture corpus + calibration gate and captured RFLX/contract/perf evidence (`M7-T09-validation-summary.md`) |
 | 10 | M7-T10 (`loop-bih.10`) | done | Reconciled SPEC/governance metadata and refreshed consumer support claims (`M7-T10-validation-summary.md`) |
 | 11 | Ops-Weekly | in_progress | Continue steady-state compatibility cadence post-M7 completion |
-| 12 | Post-M7 deferred SPEC refinements (`loop-azq`) | todo | Decompose and execute deferred spec/runtime refinements carried forward from SPEC-20..26 |
+| 12 | Up-next critical SPEC refinements (`loop-azq`) | todo | Decompose and execute up-next critical spec/runtime refinements carried forward from SPEC-20..26 |
 
 ## M0 Tasks (Foundation and Contracts)
 
diff --git a/docs/execution-plan/WORKBOARD.md b/docs/execution-plan/WORKBOARD.md
@@ -17,7 +17,7 @@ Owner: Orchestrator thread
 | Orchestrator | M7 tranche orchestration + safe-mode enforcement | in_progress | M7 plan published; execute task cards sequentially with evidence-first closure |
 | Lane A | M7 core runtime closure (`M7-T01`..`M7-T08`) | complete | Runtime closure complete with evidence under `evidence/2026-02-20/milestone-M7/` |
 | Lane B | M7 docs/governance reconciliation (`M7-T10`) | complete | SPEC/governance reconciliation complete; consumer claims refreshed |
-| Lane C | Ops-Weekly cadence + post-M7 deferred backlog (`loop-azq`) | in_progress | Keep D-017 clean-clone policy active; execute cadence and decompose deferred refinements |
+| Lane C | Ops-Weekly cadence + post-M7 up-next critical backlog (`loop-azq`) | in_progress | Keep D-017 clean-clone policy active; execute cadence and decompose up-next critical refinements |
 
 ## Next Queue by Lane
 
@@ -28,7 +28,7 @@ Owner: Orchestrator thread
 ## Lane Activation Rules
 
 - Lane A and Lane B are complete for M7 and should remain read-only unless regressions are discovered.
-- Lane C is the primary active lane for heavy compatibility/cadence and deferred refinement intake.
+- Lane C is the primary active lane for heavy compatibility/cadence and up-next critical refinement intake.
 - Never run heavy commands concurrently across lanes.
 
 ## Handoff Intake Checklist (Orchestrator)
diff --git a/docs/execution-plan/evidence/2026-02-20/milestone-M7/M7-T10-VG-DOC-SPEC-002.md b/docs/execution-plan/evidence/2026-02-20/milestone-M7/M7-T10-VG-DOC-SPEC-002.md
@@ -3,22 +3,22 @@ Date: 2026-02-20
 Task: M7-T10 spec/governance reconciliation and promotion
 
 ## Scope
-Reconcile SPEC-20 through SPEC-27 status metadata, implementation snapshots, and deferred-gap traceability after M7 runtime closure.
+Reconcile SPEC-20 through SPEC-27 status metadata, implementation snapshots, and up-next critical gap traceability after M7 runtime closure.
 
 ## Checklist
 - [x] SPEC-20..SPEC-27 status headers reviewed and aligned with current runtime truth.
 - [x] Implementation snapshot timestamps reconciled to current review date where needed.
-- [x] Deferred items are explicitly tied to a tracked backlog issue (`loop-azq`).
+- [x] Remaining critical items are explicitly tied to a tracked up-next backlog issue (`loop-azq`).
 - [x] M7 closure sequence in execution-plan trackers reflects `M7-T01`..`M7-T10` ordering and completion state.
 - [x] No remaining spec/runtime drift items are left untracked.
 
-## Deferred Gap Tracking
-Residual deferred items from partially implemented specs are explicitly tracked in:
-- `loop-azq` — Post-M7 deferred SPEC refinements backlog
+## Up-Next Gap Tracking
+Residual critical items from partially implemented specs are explicitly tracked in:
+- `loop-azq` — Up-next critical SPEC refinements backlog
 
 ## Result
 - Pass
-- SPEC status/governance docs now reflect post-M7 runtime state with explicit deferred-gap ownership.
+- SPEC status/governance docs now reflect post-M7 runtime state with explicit up-next critical gap ownership.
 
 ## References
 - `/Users/rand/src/loop/docs/spec/SPEC-20-typed-signatures.md`
diff --git a/docs/execution-plan/evidence/2026-02-20/milestone-M7/M7-T10-validation-summary.md b/docs/execution-plan/evidence/2026-02-20/milestone-M7/M7-T10-validation-summary.md
@@ -11,7 +11,7 @@ Task: M7-T10 spec/governance reconciliation and promotion
 
 ## Key Results
 - Reconciled SPEC-20..SPEC-27 status metadata and implementation snapshots against current runtime state.
-- Linked all residual deferred spec items to tracked backlog issue `loop-azq`.
+- Linked all residual up-next critical spec items to tracked backlog issue `loop-azq`.
 - Updated compatibility and contract policy docs to reference latest M7 consumer evidence and active io-rflx fixture gate model.
 - Refreshed consumer-gate evidence for RCC, loop-agent seam, and io-rflx compile baseline.
 
diff --git a/docs/execution-plan/milestones/M7.md b/docs/execution-plan/milestones/M7.md
@@ -14,7 +14,7 @@ This milestone is a `dp-codex` execution tranche derived from a constrained deco
 
 ## Exit Criteria
 
-- SPEC-20..SPEC-27 items below are implemented or explicitly deferred with accepted decision records.
+- SPEC-20..SPEC-27 items below are implemented or explicitly moved to up-next critical scope with accepted decision records.
 - Remaining runtime gaps are closed with deterministic test evidence.
 - `io-rflx` adapter fixtures and benchmark calibration are implemented and validated.
 - M7 minimum gate set passes with evidence artifacts under `docs/execution-plan/evidence/<date>/milestone-M7/`.
@@ -141,7 +141,7 @@ This milestone is a `dp-codex` execution tranche derived from a constrained deco
 ### M7-T09 io-rflx Adapter Fixture + Calibration Delivery
 
 - Tracking issue: `loop-bih.9`
-- Goal: implement previously deferred interop fixtures and benchmark calibration.
+- Goal: implement previously queued interop fixtures and benchmark calibration.
 - Scope:
 - `docs/execution-plan/contracts/IO-RFLX-INTEROP-CONTRACT.md`
 - adapter fixture assets and calibration scripts/artifacts for `io-rflx`
diff --git a/docs/spec/SPEC-20-typed-signatures.md b/docs/spec/SPEC-20-typed-signatures.md
@@ -2,7 +2,7 @@
 
 > DSPy-inspired typed signatures for rlm-core
 
-**Status**: Partially implemented (typed runtime protocol and validation parity implemented through M7-T03; remaining composition/runtime-governance refinements are tracked in `loop-azq`)
+**Status**: Partially implemented (typed runtime protocol and validation parity implemented through M7-T03; remaining composition/runtime-governance refinements are up-next critical scope tracked in `loop-azq`)
 **Created**: 2026-01-20
 **Epic**: loop-zcx (DSPy-Inspired RLM Improvements)
 **Tasks**: loop-d75, loop-jqo, loop-9l6, loop-bzz
diff --git a/docs/spec/SPEC-21-dual-model-optimization.md b/docs/spec/SPEC-21-dual-model-optimization.md
@@ -2,7 +2,7 @@
 
 > Cost-optimized model selection for RLM orchestration
 
-**Status**: Partially implemented (router and orchestrator-boundary dual-model routing are implemented; remaining dual-model strategy refinements are tracked in `loop-azq`)
+**Status**: Partially implemented (router and orchestrator-boundary dual-model routing are implemented; remaining dual-model strategy refinements are up-next critical scope tracked in `loop-azq`)
 **Created**: 2026-01-20
 **Epic**: loop-zcx (DSPy-Inspired RLM Improvements)
 **Task**: loop-z6x
@@ -77,7 +77,7 @@ pub enum SwitchStrategy {
 **Acceptance Criteria**:
 - [ ] DualModelConfig serializable to/from JSON
 - [ ] SwitchStrategy covers common use cases
-- [ ] Custom strategy allows user flexibility (currently deferred)
+- [ ] Custom strategy allows user flexibility (up-next critical scope)
 
 ### SPEC-21.02: SmartRouter Integration
 
diff --git a/docs/spec/SPEC-22-proof-protocol.md b/docs/spec/SPEC-22-proof-protocol.md
@@ -2,7 +2,7 @@
 
 > Numina-inspired focused proof strategy for Lean REPL
 
-**Status**: Partially implemented (session/protocol enforcement and proof-engine execution/persistence are implemented; remaining Lean diagnostic-feedback integration is tracked in `loop-azq`)
+**Status**: Partially implemented (session/protocol enforcement and proof-engine execution/persistence are implemented; remaining Lean diagnostic-feedback integration is up-next critical scope tracked in `loop-azq`)
 **Created**: 2026-01-20
 **Epic**: loop-zcx (DSPy-Inspired RLM Improvements)
 **Task**: loop-dzv
diff --git a/docs/spec/SPEC-23-graph-visualization.md b/docs/spec/SPEC-23-graph-visualization.md
@@ -2,7 +2,7 @@
 
 > Interactive debugging visualization for reasoning traces
 
-**Status**: Partially implemented (core exports + TUI/MCP integration endpoints are implemented; deferred CLI/advanced HTML controls are tracked in `loop-azq`)
+**Status**: Partially implemented (core exports + TUI/MCP integration endpoints are implemented; remaining CLI/advanced HTML controls are up-next critical scope tracked in `loop-azq`)
 **Created**: 2026-01-20
 **Epic**: loop-zcx (DSPy-Inspired RLM Improvements)
 **Task**: loop-wve
@@ -20,7 +20,7 @@ Add interactive graph visualization for ReasoningTrace to enable debugging of co
 | SPEC-23.01 Graph export formats | Implemented (NetworkX JSON, DOT, HTML, enhanced Mermaid) | `rlm-core/src/reasoning/visualize.rs` |
 | SPEC-23.02 NetworkX schema | Implemented (runtime node-link schema) | `NetworkXGraph` types and export tests in `rlm-core/src/reasoning/visualize.rs` |
 | SPEC-23.03 HTML visualization | Partially implemented | `ReasoningTrace::to_html` + `test_html_export` in `rlm-core/src/reasoning/visualize.rs` |
-| SPEC-23.04 Integration points | Partially implemented (TUI + MCP, CLI deferred) | `TUIAdapter::render_trace_panel` and `trace_visualize` in `rlm-core/src/adapters/` |
+| SPEC-23.04 Integration points | Partially implemented (TUI + MCP, CLI queued as up-next critical) | `TUIAdapter::render_trace_panel` and `trace_visualize` in `rlm-core/src/adapters/` |
 
 ## Requirements
 
diff --git a/docs/spec/SPEC-24-bootstrap-optimizer.md b/docs/spec/SPEC-24-bootstrap-optimizer.md
diff --git a/docs/spec/SPEC-25-context-externalization.md b/docs/spec/SPEC-25-context-externalization.md
diff --git a/docs/spec/SPEC-26-batched-queries.md b/docs/spec/SPEC-26-batched-queries.md