Skip to content

Latest commit

 

History

History
54 lines (44 loc) · 3.53 KB

File metadata and controls

54 lines (44 loc) · 3.53 KB

Architecture Decision Records

This directory contains Architecture Decision Records (ADRs) for the DLM-proposer + AR-verifier project. Each ADR captures a single architectural decision, the context that led to it, the alternatives considered, and the consequences of choosing one path over another.

Why ADRs

Code shows what we built. ADRs show why — and just as importantly, what we deliberately rejected and on what grounds. Without ADRs, every new contributor (human or agent) re-derives the same decision tree from scratch and either burns time or re-opens settled debates.

We follow a lightweight variant of the Michael Nygard format: Context → Decision → Consequences, plus an explicit Alternatives considered section because most of the value comes from showing the reader what was not chosen.

Conventions

  • File name: NNNN-kebab-case-title.md where NNNN is a four-digit zero-padded monotonically increasing number.
  • Status: Proposed / Accepted / Superseded by NNNN / Deprecated.
  • Once Accepted, an ADR is immutable except for the Status field. Disagreements are resolved by writing a new ADR that supersedes it.
  • Length: aim for ≤ 5 pages of rendered markdown. If longer, split.

Index

# Title Status
0001 Proposer sizing, alignment, and verifier decoupling Accepted
0002 Verifier selection, quantization, and the open-vs-closed-weight constraint Accepted
0003 Verifier ↔ slab pool integration: deferred refactor + intermediate step Accepted
0004 Alignment training data preparation policy (Nemotron-informed) Accepted
0005 Personal layer / personal data store Planned
0006 Project positioning as local agent infrastructure Accepted
0007 Cross-request KV cache reuse for long sessions Superseded by 0008
0008 Session-bound runtime + gRPC protocol Accepted
0009 Multi-host milestone: AR-verifier / dLM-proposer on mlx.distributed + agent capability exchange Accepted
0012 Proposer/verifier value proposition: bounded-memory + recall, platform-forked throughput Accepted
0013 Distributed inference topology: what AR sequentiality allows Accepted
0014 Agent-connection capacity & cross-host proposer/verifier topology: test plan & results Accepted
0015 Kakeya Inference Engine: a product-grade vLLM replacement, Kakeya Attention native Accepted

Note: ADR numbering is monotonically increasing; in-flight or planned numbers (0005) appear in the index so readers can see the planned shape of the decision tree even before those ADRs are written. When an ADR moves from "Planned" to "In flight" it gets a PR link; when it merges, the row updates to "Accepted" with a file link.