RandomCoder-lab
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 44 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 1 addition & 0 deletions
diff --git a/ROADMAP.md b/ROADMAP.md
Lines changed: 2 additions & 2 deletions
diff --git a/experiments/prometheus_parity/results_torch_substrate_q.json b/experiments/prometheus_parity/results_torch_substrate_q.json
Lines changed: 46 additions & 0 deletions
diff --git a/experiments/prometheus_parity/torch_substrate_q.py b/experiments/prometheus_parity/torch_substrate_q.py
Lines changed: 254 additions & 0 deletions
@@ -13,6 +13,7 @@ Read top-to-bottom for the arc; jump to any chapter for the detail.
 
 | Tag | Date | One-line |
 |---|---|---|
+| [v0.6-fibtier-memory](#v06-fibtier-memory--2026-05-17) | 2026-05-17 | Fibtier-bounded eviction for memory: cap the index at Fibonacci-tier capacity (default 232); evicted entries are still recoverable by hash. Memory is now safe for arbitrarily long agent sessions. |
 | [v0.5-substrate-memory](#v05-substrate-memory--2026-05-17) | 2026-05-17 | Substrate-keyed conversation memory: `omc_memory_store` / `recall` / `list` / `stats` MCP tools + filesystem-backed persistence. **Hits the 10× target** — measured 10.61× LLM context-budget reduction on a 20-turn agent task. |
 | [v0.4-substrate-context](#v04-substrate-context--2026-05-17) | 2026-05-17 | Symbolic compression end-to-end: `omc_compress_context` / `omc_decompress` tools + `format=codec` thumbnails + directory ingest. Measured 1.85×–2.81× LLM context budget reduction. |
 | [v0.3.1-symbolic-compression](#v031-symbolic-compression--2026-05-17) | 2026-05-17 | `omc_predict` gains `format=hash`/`signature`/`full` (default = compressed hash form, 3.8× smaller context cost) + `omc_fetch_by_hash` companion for on-demand recovery |
@@ -28,6 +29,49 @@ Read top-to-bottom for the arc; jump to any chapter for the detail.
 
 ---
 
+## [v0.6-fibtier-memory] - 2026-05-17
+
+**Fibtier-bounded eviction for `MemoryStore`: memory growth is now safe for arbitrarily long agent sessions, and evicted entries remain recoverable by hash.**
+
+v0.5 shipped substrate-keyed memory with an honest limit ("memory grows unbounded"). v0.6 closes that gap by mirroring the existing `fibtier.omc` Fibonacci-tier semantics in the Rust `MemoryStore`.
+
+### What changed
+
+- `MemoryStore::max_entries_per_namespace: Option<usize>` — when set, the index is bounded after each store
+- `FIBTIER_DEFAULT_SIZES = [1,2,3,5,8,13,21,34,55,89,144,233,377,610,987,1597]` mirrors fibtier.omc
+- `FIBTIER_DEFAULT_MAX_ENTRIES = 232` = sum of first 10 tier sizes
+- `OMC_MEMORY_MAX_ENTRIES` env var to override (0 = unbounded)
+- `MemoryStore::with_max_entries(n)` builder for explicit caps
+- `MemoryStore::evict_to_cap(namespace, keep)` — manual prune helper, returns count dropped
+- **Eviction is index-only**: body files stay on disk so `recall(hash)` still works for entries that fell out of the chronological list (matches fibtier's "bounded active capacity, unbounded historical recall" semantics)
+
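The index-only eviction described in the list above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the Rust implementation: `ToyMemoryStore`, `list_keys`, and `resolve_cap` are hypothetical names, an in-memory dict stands in for the body files on disk, and the environment-variable parsing only follows the "0 = unbounded" rule stated above. One thing the sketch makes visible: the first ten sizes as listed sum to 231, one less than the stated default of 232, so the default is hardcoded here rather than derived from the tier list.

```python
import os

# Tier sizes and default cap exactly as stated in the changelog entry.
FIBTIER_DEFAULT_SIZES = [1, 2, 3, 5, 8, 13, 21, 34, 55, 89,
                         144, 233, 377, 610, 987, 1597]
FIBTIER_DEFAULT_MAX_ENTRIES = 232


def resolve_cap(env=None):
    """Return the index cap, or None for unbounded (assumed parsing rules)."""
    env = os.environ if env is None else env
    raw = env.get("OMC_MEMORY_MAX_ENTRIES")
    if raw is None:
        return FIBTIER_DEFAULT_MAX_ENTRIES
    n = int(raw)
    return None if n == 0 else n


class ToyMemoryStore:
    """Hypothetical stand-in for MemoryStore; bodies live in a dict, not on disk."""

    def __init__(self, max_entries=FIBTIER_DEFAULT_MAX_ENTRIES):
        self.max_entries = max_entries
        self.index = {}   # namespace -> list of hashes, oldest first
        self.bodies = {}  # hash -> body (stands in for the body files)

    def store(self, namespace, key, body):
        self.bodies[key] = body
        self.index.setdefault(namespace, []).append(key)
        if self.max_entries is not None:
            # Auto-eviction: bound the index after each store.
            self.evict_to_cap(namespace, self.max_entries)

    def evict_to_cap(self, namespace, keep):
        """Drop the oldest index entries beyond `keep`; return the count dropped."""
        entries = self.index.get(namespace, [])
        dropped = max(0, len(entries) - keep)
        if dropped:
            self.index[namespace] = entries[dropped:]
        return dropped  # bodies are deliberately left untouched

    def recall(self, key):
        return self.bodies.get(key)

    def list_keys(self, namespace):
        return list(self.index.get(namespace, []))


store = ToyMemoryStore(max_entries=3)
for i in range(5):
    store.store("session", f"h{i}", f"turn {i}")

print(store.list_keys("session"))       # ['h2', 'h3', 'h4']
print(store.recall("h0"))               # turn 0 (evicted from index, still recoverable)
print(sum(FIBTIER_DEFAULT_SIZES[:10]))  # 231
```

Because `evict_to_cap` only trims the index, `bodies` keeps growing with every store, which is the same trade-off the "Honest framing" section describes for the files on disk.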
+### New MCP tool
+
+- `omc_memory_evict(namespace, keep)` → `{namespace, dropped, kept}`. Manual control for session boundaries or aggressive pruning.
+- `omc_memory_stats` now includes `fibtier_cap` so an agent can see its budget.
+
+### Tests
+
+32/32 MCP integration tests pass (27 existing + 5 new): auto-eviction at cap, manual evict tool, evicted entries recoverable by hash, stats includes cap, `tools/list` now shows `omc_memory_evict`.
+
+15/15 memory module unit tests pass (10 existing + 5 new): eviction bounds the index, evicted entries still recoverable, `evict_to_cap` returns drop count, unbounded mode keeps everything, default cap matches first-10-tier sum.
+
+### Why it matters
+
+An agent running for hours or days will eventually hit memory bounds. v0.6 makes that case safe by default: the agent's most recent 232 turns stay in the chronological list (easy to browse via `omc_memory_list`), while older turns remain recoverable by hash without bloating the index. Combined with v0.5's compression, a 100-turn agent session uses bounded memory rather than the 10MB+ it would otherwise accumulate.
+
+### Honest framing
+
+This is index-only eviction, not full deletion — body files on disk grow with every store. A long-running agent would still benefit from an external cleanup pass for the files (cron / GC tool). A future v0.6.1 candidate: physical eviction with optional cold-storage archival.
+
+### Files
+
+- `omnimcode-core/src/memory.rs` — `FIBTIER_DEFAULT_*` constants, `max_entries_per_namespace`, `evict_to_cap`, auto-eviction in `store`
+- `omnimcode-mcp/src/main.rs` — `omc_memory_evict` tool, `fibtier_cap` in stats
+- `omnimcode-mcp/tests/integration.rs` — 5 new tests
+
+---
+
 ## [v0.5-substrate-memory] - 2026-05-17
 
 **Substrate-keyed conversation memory: an LLM agent's prior turns stay in cheap-reference form (canonical hash), recovered only when reasoning needs them. Measured 10.61× LLM context-budget reduction on a 20-turn agent task — hitting the original target.**
 
@@ -268,6 +268,7 @@ If you're trying to understand how OMC got here, **read the [GitHub Releases](ht
 | [v0.3.1-symbolic-compression](https://github.com/RandomCoder-lab/OMC/releases/tag/v0.3.1-symbolic-compression) | `omc_predict` learns to compress: `format=hash` default is 3.8× smaller, with `omc_fetch_by_hash` for on-demand body recovery |
 | [v0.4-substrate-context](https://github.com/RandomCoder-lab/OMC/releases/tag/v0.4-substrate-context) | Symbolic compression end-to-end: `omc_compress_context` / `omc_decompress` + directory ingest + measured 2-3× LLM context-budget reduction |
 | [v0.5-substrate-memory](https://github.com/RandomCoder-lab/OMC/releases/tag/v0.5-substrate-memory) | Substrate-keyed conversation memory: `omc_memory_store` / `recall` / `list` / `stats` + filesystem persistence. **10.61× LLM context-budget reduction** on a 20-turn agent task. |
+| [v0.6-fibtier-memory](https://github.com/RandomCoder-lab/OMC/releases/tag/v0.6-fibtier-memory) | Fibtier-bounded eviction for memory: cap the index at Fibonacci-tier capacity (default 232); evicted entries still recoverable by hash. Memory now safe for arbitrarily long agent sessions. |
 
 ---
 
 
@@ -1,7 +1,7 @@
 # OMC Roadmap
 
-Current chapter: **v0.5-substrate-memory** (shipped 2026-05-17).
-Next chapter: open — candidates listed below. The five-chapter symbolic-context arc (v0.3 → v0.3.1 → v0.4 → v0.5) has landed with the 10× target hit (10.61× measured).
+Current chapter: **v0.6-fibtier-memory** (shipped 2026-05-17).
+Next chapter: GPU Prometheus scaffold (in flight). The six-chapter symbolic-context arc (v0.3 → v0.6) has landed.
 
 See [CHANGELOG.md](CHANGELOG.md) and [GitHub Releases](https://github.com/RandomCoder-lab/OMC/releases) for the chapter-by-chapter history of how OMC got here. This file describes what's on the path going forward.
 
 
@@ -0,0 +1,46 @@
+{
+  "results": {
+    "Q0": {
+      "vals": [
+        2.96439205010732,
+        3.2229747931162516,
+        2.8303927103678386
+      ],
+      "mean": 3.005919851197137,
+      "std": 0.19955849172933476
+    },
+    "Q1": {
+      "vals": [
+        3.4802677392959596,
+        3.147650456428528,
+        2.8683457454045613
+      ],
+      "mean": 3.165421313709683,
+      "std": 0.30634781569057495
+    },
+    "Q2": {
+      "vals": [
+        2.897973410288493,
+        3.229221320152283,
+        3.236746565500895
+      ],
+      "mean": 3.1213137653138907,
+      "std": 0.19345501535639265
+    }
+  },
+  "config": {
+    "seeds": "42,7,123",
+    "steps": 1500,
+    "lr": 0.005,
+    "seq_len": 32,
+    "d_model": 32,
+    "n_heads": 4,
+    "ff_dim": 64,
+    "n_blocks": 4,
+    "alpha": 1.0,
+    "gamma": 0.2,
+    "variants": "Q0,Q1,Q2",
+    "out": "results_torch_substrate_q.json"
+  },
+  "best": "Q0"
+}
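As a quick sanity check on the results file above, the per-variant means, sample standard deviations, and the "vs Q0" percentages can be recomputed from the raw `vals`. The sketch below uses only the standard library and the numbers copied from the JSON. It suggests that Q1 and Q2 land roughly 5.3% and 3.8% above the Q0 mean validation loss, and that both gaps are smaller than the seed-to-seed standard deviation, so with three seeds the ordering should probably be read as tentative.

```python
import statistics

# Validation losses per variant, copied from results_torch_substrate_q.json.
vals = {
    "Q0": [2.96439205010732, 3.2229747931162516, 2.8303927103678386],
    "Q1": [3.4802677392959596, 3.147650456428528, 2.8683457454045613],
    "Q2": [2.897973410288493, 3.229221320152283, 3.236746565500895],
}

# Mean and sample standard deviation (n - 1 denominator), as in the script.
summary = {
    name: (statistics.mean(v), statistics.stdev(v)) for name, v in vals.items()
}

base_mean, base_std = summary["Q0"]
rel_change = {}
for name, (mean, std) in summary.items():
    rel_change[name] = (mean - base_mean) / base_mean * 100
    gap = mean - base_mean
    print(f"{name}: mean={mean:.4f} std={std:.4f} "
          f"vs Q0={rel_change[name]:+.2f}% gap={gap:+.4f}")

# Lower validation loss is better, so the best variant minimizes the mean.
best = min(summary, key=lambda name: summary[name][0])
print("best:", best)
```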
@@ -0,0 +1,254 @@
+"""Does substrate-Q resample stack on top of the v0.1 K + S-MOD + V win?
+
+The v0.1 chapter shipped three stacked substrate-attention components:
+  - K = CRT-Fibonacci substrate (no learnable W_K)
+  - softmax → S-MOD α=1.0 (off-attractor weights dampened)
+  - V = substrate_resample(x @ W_v) post-projection (off-attractor V mags dampened)
+
+Q is the last unmodified component. The V finding's mechanism was
+"modulation > replacement" — keep the learned W, apply substrate as
+post-projection dampening. The natural Q recipe mirrors it:
+
+  Q1 (resample): q = substrate_resample(x @ W_q)
+
+If the same modulation pattern generalizes to Q, that's a 4th
+stacked substrate-component — every attention primitive now substrate-
+aware. If it doesn't, we learn whether the V recipe was specific to
+the value path or whether it's a general "post-projection modulation"
+principle.
+
+Three Q variants tested:
+  Q0 (baseline): q = x @ W_q                          (current production)
+  Q1 (resample): q = substrate_resample(x @ W_q)      (post-projection snap)
+  Q2 (modulate): q = (x @ W_q) * (1 + γ·near_attractor_signal(x))
+                                                      (input-conditional)
+
+3 seeds on TinyShakespeare with S-MOD α=1.0, substrate-V (V1) already
+active. Q is the only thing varying.
+"""
+
+from __future__ import annotations
+
+import argparse
+import json
+import random
+import statistics
+from pathlib import Path
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+from torch_4way import lcg, make_matrix, crt_pe, build_vocab
+from torch_substrate_softmax import (
+    attractor_distance, softmax_smod,
+)
+from torch_substrate_v import substrate_resample, near_attractor_signal
+
+
+class AttentionL1QV(nn.Module):
+    """L1 multi-head + S-MOD softmax + substrate-V (V1) + pluggable Q variant.
+
+    This is the v0.1 production stack with one variable: how Q is built.
+    """
+    def __init__(self, d_model, n_heads, seq_len, seed,
+                 q_variant="Q0", alpha=1.0, gamma=0.2):
+        super().__init__()
+        assert d_model % n_heads == 0
+        self.d_model, self.n_heads = d_model, n_heads
+        self.d_head = d_model // n_heads
+        self.q_variant = q_variant
+        self.alpha = alpha
+        self.gamma = gamma
+        s = seed + 11
+        W_q, s = make_matrix(d_model, d_model, 0.3, s)
+        W_v, s = make_matrix(d_model, d_model, 0.3, s)
+        W_o, s = make_matrix(d_model, d_model, 0.3, s)
+        self.W_q = nn.Parameter(W_q)
+        self.W_v = nn.Parameter(W_v)
+        self.W_o = nn.Parameter(W_o)
+        pe_full = crt_pe(seq_len, d_model)
+        pe_per_head = pe_full.view(seq_len, n_heads,
+                                    self.d_head).transpose(0, 1)
+        self.register_buffer("K_const_mh", pe_per_head)
+        self.rng_state = s
+
+    def forward(self, x):
+        T, D = x.shape
+        H, dh = self.n_heads, self.d_head
+        # Q variants — this is the experimental axis.
+        q_proj = x @ self.W_q
+        if self.q_variant == "Q0":
+            q_full = q_proj
+        elif self.q_variant == "Q1":
+            q_full = substrate_resample(q_proj)
+        elif self.q_variant == "Q2":
+            gate = near_attractor_signal(x)
+            q_full = q_proj * (1.0 + self.gamma * gate)
+        else:
+            raise ValueError(self.q_variant)
+        # V always uses substrate_resample (V1, production default from v0.1).
+        v_full = substrate_resample(x @ self.W_v)
+        q = q_full.view(T, H, dh).transpose(0, 1)
+        v = v_full.view(T, H, dh).transpose(0, 1)
+        k = self.K_const_mh
+        scores = (q @ k.transpose(-2, -1)) / (dh ** 0.5)
+        attn = softmax_smod(scores, dim=-1, alpha=self.alpha)
+        out = attn @ v
+        out = out.transpose(0, 1).contiguous().view(T, D)
+        return out @ self.W_o
+
+
+class BlockQ(nn.Module):
+    def __init__(self, d_model, n_heads, ff_dim, seq_len, seed,
+                 q_variant, alpha, gamma):
+        super().__init__()
+        self.attn = AttentionL1QV(d_model, n_heads, seq_len, seed,
+                                   q_variant, alpha, gamma)
+        s = self.attn.rng_state
+        self.ln1_g = nn.Parameter(torch.ones(d_model))
+        self.ln1_b = nn.Parameter(torch.zeros(d_model))
+        W_up, s = make_matrix(d_model, ff_dim, 0.3, s + 13)
+        W_down, s = make_matrix(ff_dim, d_model, 0.3, s)
+        self.ff_up = nn.Parameter(W_up)
+        self.ff_up_b = nn.Parameter(torch.zeros(ff_dim))
+        self.ff_down = nn.Parameter(W_down)
+        self.ff_down_b = nn.Parameter(torch.zeros(d_model))
+        self.ln2_g = nn.Parameter(torch.ones(d_model))
+        self.ln2_b = nn.Parameter(torch.zeros(d_model))
+        self.rng_state = s
+
+    def forward(self, x):
+        attn_out = self.attn(x)
+        x_post_attn = x + attn_out
+        normed1 = F.layer_norm(x_post_attn, (x.size(-1),),
+                               weight=self.ln1_g, bias=self.ln1_b)
+        up = normed1 @ self.ff_up + self.ff_up_b
+        activated = F.relu(up)
+        down = activated @ self.ff_down + self.ff_down_b
+        x_post_ff = x_post_attn + down
+        return F.layer_norm(x_post_ff, (x.size(-1),),
+                            weight=self.ln2_g, bias=self.ln2_b)
+
+
+class ModelQ(nn.Module):
+    def __init__(self, vocab, d_model, n_heads, ff_dim, seq_len, n_blocks,
+                 seed, q_variant, alpha, gamma):
+        super().__init__()
+        s = seed
+        E, s = make_matrix(vocab, d_model, 0.3, s)
+        self.embedding = nn.Parameter(E)
+        self.register_buffer("pe_table", crt_pe(seq_len, d_model))
+        self.blocks = nn.ModuleList()
+        for i in range(n_blocks):
+            b = BlockQ(d_model, n_heads, ff_dim, seq_len,
+                       s + 100 * (i + 1), q_variant, alpha, gamma)
+            self.blocks.append(b)
+            s = b.rng_state
+        W_head, _ = make_matrix(d_model, vocab, 0.3, s + 17)
+        self.head = nn.Parameter(W_head)
+        self.head_b = nn.Parameter(torch.zeros(vocab))
+
+    def forward(self, token_ids):
+        x = self.embedding[token_ids] + self.pe_table[:token_ids.size(0)]
+        for b in self.blocks:
+            x = b(x)
+        return x @ self.head + self.head_b
+
+
+def train_one(q_variant, train_ids, val_ids, vocab_size, args, seed):
+    torch.manual_seed(seed)
+    random.seed(seed)
+    model = ModelQ(vocab_size, args.d_model, args.n_heads, args.ff_dim,
+                   args.seq_len, args.n_blocks, seed, q_variant,
+                   args.alpha, args.gamma)
+    opt = torch.optim.AdamW(model.parameters(), lr=args.lr,
+                             betas=(0.9, 0.999), eps=1e-8)
+    n_train, n_val = len(train_ids), len(val_ids)
+    train_t = torch.tensor(train_ids, dtype=torch.long)
+    val_t = torch.tensor(val_ids, dtype=torch.long)
+    for step in range(args.steps):
+        start = random.randint(0, n_train - args.seq_len - 2)
+        w = train_t[start:start + args.seq_len]
+        t = train_t[start + 1:start + 1 + args.seq_len]
+        loss = F.cross_entropy(model(w), t)
+        opt.zero_grad()
+        loss.backward()
+        opt.step()
+    model.eval()
+    vls = []
+    with torch.no_grad():
+        for _ in range(30):
+            vs = random.randint(0, n_val - args.seq_len - 2)
+            vw = val_t[vs:vs + args.seq_len]
+            vt = val_t[vs + 1:vs + 1 + args.seq_len]
+            vls.append(F.cross_entropy(model(vw), vt).item())
+    return sum(vls) / len(vls)
+
+
+def main():
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--seeds", type=str, default="42,7,123")
+    parser.add_argument("--steps", type=int, default=1500)
+    parser.add_argument("--lr", type=float, default=0.005)
+    parser.add_argument("--seq-len", type=int, default=32)
+    parser.add_argument("--d-model", type=int, default=32)
+    parser.add_argument("--n-heads", type=int, default=4)
+    parser.add_argument("--ff-dim", type=int, default=64)
+    parser.add_argument("--n-blocks", type=int, default=4)
+    parser.add_argument("--alpha", type=float, default=1.0)
+    parser.add_argument("--gamma", type=float, default=0.2)
+    parser.add_argument("--variants", type=str, default="Q0,Q1,Q2")
+    parser.add_argument("--out", type=str,
+                         default="results_torch_substrate_q.json")
+    args = parser.parse_args()
+
+    corpus = (Path(__file__).parent.parent / "transformerless_lm"
+              / "tinyshakespeare.txt").read_text()
+    chars, lookup = build_vocab(corpus)
+    vocab_size = len(chars)
+    ids = [lookup[c] for c in corpus]
+    split = int(len(ids) * 0.9)
+    train_ids, val_ids = ids[:split], ids[split:]
+    seeds = [int(s) for s in args.seeds.split(",")]
+    variants = args.variants.split(",")
+
+    print("=== Substrate-Q on L1-MH + S-MOD + V1 (TinyShakespeare) ===")
+    print(f"variants={variants} seeds={seeds} steps={args.steps} "
+          f"α={args.alpha} γ={args.gamma}\n", flush=True)
+
+    results = {}
+    for v in variants:
+        vals = []
+        for seed in seeds:
+            vm = train_one(v, train_ids, val_ids, vocab_size, args, seed)
+            vals.append(vm)
+            print(f"  {v}  seed={seed}  val={vm:.4f}", flush=True)
+        results[v] = {
+            "vals": vals,
+            "mean": sum(vals) / len(vals),
+            "std": statistics.stdev(vals) if len(vals) > 1 else 0.0,
+        }
+        print(f"[{v}] mean val={results[v]['mean']:.4f}  "
+              f"std={results[v]['std']:.4f}\n", flush=True)
+
+    print("=== Summary ===")
+    base = results[variants[0]]["mean"]
+    print(f"{'variant':>8}  {'mean val':>10}  {'std':>7}  {'vs Q0':>8}")
+    for v in variants:
+        m = results[v]["mean"]
+        rel = (m - base) / base * 100
+        marker = "—" if v == variants[0] else f"{rel:+.2f}%"
+        print(f"{v:>8}  {m:>10.4f}  {results[v]['std']:>7.4f}  {marker:>8}")
+    best = min(variants, key=lambda v: results[v]["mean"])
+    print(f"\nBest: {best}  ({results[best]['mean']:.4f})")
+
+    out_path = Path(__file__).parent / args.out
+    with open(out_path, "w") as f:
+        json.dump({"results": results, "config": vars(args),
+                    "best": best}, f, indent=2, default=float)
+    print(f"Wrote {out_path}")
+
+
+if __name__ == "__main__":
+    main()
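The training and validation loops in `train_one` both draw a random window and pair it with the same window shifted by one token. The sketch below isolates that sampling step in plain Python so the bounds are easy to check. `sample_window` is a hypothetical helper, not a function in the script, and the toy `ids` list stands in for the encoded corpus.

```python
import random


def sample_window(ids, seq_len, rng):
    """Return (inputs, targets) for next-token prediction on one random window."""
    # randint is inclusive on both ends; the "- 2" margin mirrors train_one and
    # guarantees the shifted target slice never runs past the end of ids.
    start = rng.randint(0, len(ids) - seq_len - 2)
    inputs = ids[start:start + seq_len]
    targets = ids[start + 1:start + 1 + seq_len]
    return inputs, targets


ids = list(range(100))   # toy token ids: each token is simply its own position
rng = random.Random(42)  # local generator so the example does not touch global state
inputs, targets = sample_window(ids, 32, rng)

print(len(inputs), len(targets))
print(targets[0] - inputs[0])  # 1: each target is the token following its input
```

With the largest allowed `start`, the target slice ends at index `len(ids) - 1`, so one token of slack remains at the end of the sequence.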