Skip to content

v0.0.5 — Substrate codec, kernel, OMC-PROTOCOL v1

Choose a tag to compare

@RandomCoder-lab RandomCoder-lab released this 17 May 16:48
· 332 commits to master since this release

WHAT CHANGED

  • Substrate codec (omc_codec_encode / omc_codec_decode_lookup):
    canonicalize source, tokenize, sample every Nth ID, return
    compressed payload + content hash. Library-lookup decode.
  • omc-kernel: content-addressed filesystem store at
    ~/.omc/kernel/store/<hex_hash>.omc. Alpha-rename invariant — two
    processes converging on the same canonical form produce the same
    address. CLI: ingest, fetch, stat, ls, sign, verify.
  • omc-grep: code archaeology via canonical hash. Found 31.7%
    redundancy in OMC's own examples tree.
  • OMC-PROTOCOL v1: formalized substrate-signed wire format for
    inter-agent messaging. No PKI; integrity via canonical-hash
    recompute.
  • MCP server (omnimcode-mcp): exposes OMC as a runtime to LLM clients.
  • Substrate-aware tokenizer: 285+ builtins, 113 phrase-level dict
    entries, CRT-packed (kind, vocab_id, position_class) IDs,
    token_distance metric, attractor folding.

WHY IT MATTERS
The substrate gains an identity layer (canonical hash) and a wire
format. Two agents talking over OMC-PROTOCOL can verify each other's
claims by recomputing hashes — no shared keys needed. The tokenizer
turns OMC source into a substrate-typed symbol stream — the
foundation for the substrate-indexed completion engine that comes next.

NOW POSSIBLE

  • Compress code by 10-50× via library-lookup codec.
  • Persist Values content-addressed and dedupe across processes.
  • Inter-agent messaging with cryptographic-style integrity but no
    key infrastructure.
  • LLM clients can drive OMC over MCP.

See CHANGELOG.md#v0.0.5-codec-kernel-protocol for the chapter index.