A small, opinionated control plane for a crew of AI coding agents working in parallel. One tmux session, a typed task DAG, isolated VCS workspaces per agent, an audit log — and a dashboard for seeing what the crew is doing right now.
mu (no args) — read-only dashboard: agents, tracks, ready /
in-progress / blocked tasks, log tail, workspaces, doctor.
mu workstream init auth-refactor
mu task add --title "Design auth" --impact 80 --effort-days 2
mu task add --title "Build auth" --impact 80 --effort-days 5 --blocked-by design_auth
mu task add --title "Review auth" --impact 60 --effort-days 1 --blocked-by build_auth
mu agent spawn worker-1 --workspace
mu agent spawn reviewer-1 --workspace --role read-only
mu agent send worker-1 "Pick up the next ready task and design the auth module."
tmux a -t mu-auth-refactor # watch the whole crew live
mu # human home base: TUI with every workstream loaded
mu state -w auth-refactor --json # agent/script API: typed static state for jq
mu log --tail # subscribe to every state changeCrew, graph, and workspaces are all real things you can poke at:
tmux panes (tmux attach), a SQLite DAG with parallel-tracks +
diamond-merge, and jj workspace / sl share / git worktree on disk.
mu persists state and coordinates handoffs; the model still
decides what to do. Workers reach for the bundled
SKILL.md when they need in-pane ground truth.
Bare mu on a TTY launches the read-only multi-workstream TUI;
piped/scripted calls print help, so use typed verbs + --json (or
MU_NO_TUI=1) for agent/API flows.
- Parallelism that doesn't trip over itself. Per-agent VCS
workspaces (jj/sl/git auto-detected;
cp -afor non-VCS); a real task DAG with deterministic parallel-track detection + diamond-merge so two agents are never assigned tasks that share a prerequisite. - Stay out of the model's way. Mu coordinates handoffs; it does
not make choices on behalf of the model.
--cli <key>uppercases to$MU_<KEY>_COMMAND, so your shell rc owns model/provider selection and thinking-effort flags. The orchestrator and workers drive; mu records, scopes, and yanks the next command. - A local typed state surface. One CLI, one SQLite registry,
typed verbs,
--jsoneverywhere useful, and a read-only dashboard for humans.
- Not a build tool. mu doesn't compile, test, or deploy anything.
- Not a chat protocol. Agents communicate via the work graph and the activity log, never agent-to-agent messaging.
- Not a verifier.
task close --evidence "tests pass"records the claim; mu doesn't run the tests. - Not a replacement for pi-subagents. Different problem (persistent crew vs one-shot focused delegation). Install both.
- Not a hosted service. Local-first SQLite.
- DB-undoable, not tmux-undoable. Every destructive verb
auto-captures a whole-DB snapshot first;
mu undo --yesrestores the DB. Killed panes and freed workspace dirs are NOT replayed.
Use mu for — multi-phase investigations; tasks worth gating with review; parallel read-only/audit work alongside a heavier task; implementation + reviewer splits with isolated workspaces; anything where "what was decided and why" needs to outlive a single agent's scrollback.
Don't use mu for — tiny direct edits; quick local inspection; single-context work where no durable coordination is needed. The orchestrator's first decision is whether to reach for mu at all.
# 1. The CLI.
npm install -g @martintrojer/mu
mu --version
# 2. The skill (teaches your coding agent how to drive mu).
npx skills add martintrojer/mu # auto-detects pi / claude-code / codex / etc.
# Add -g to install globally (~/.<agent>/skills/), -y to skip prompts.Requirements:
- Node 20, 22, or 23 (LTS recommended; see
.nvmrc). Node 24+ is currently blocked by abetter-sqlite3native-build incompatibility. - tmux ≥ 3.0 (
mu doctorchecks) - pi (the agent CLI mu orchestrates)
- For
--workspace: jj, sl, or git on PATH (or--backend none)
Update: npm install -g @martintrojer/mu@latest for the CLI;
npx skills update mu for the skill.
Install from source (hacking on mu itself):
git clone https://github.com/martintrojer/mu
cd mu
npm install -g . # `prepare` script auto-builds; `mu` lands on $PATH
npx skills add ./skills/mu # local-path source formatMore install patterns (alias-to-dist for fastest dev iteration) in
docs/USAGE_GUIDE.md § 1 Setup. After
installing, run bare mu (or mu state --tui) for the dashboard tour.
Bare mu in a TTY launches the flagship read-only dashboard across
all workstreams; mu state --tui -w <workstream> is the explicit
single/multi-workstream form. Non-TTY callers and scripts keep the
static/help path, and mu state --json is the API.
The dashboard has ten cards: Commits, Agents, Tracks, Ready, Activity
log, Workspaces, In-progress, Blocked, Recent, and Doctor. Drill into
any numbered card fullscreen with Shift+0-Shift+9; g opens the
full DAG and t opens the all-tasks list. ? shows the complete
keymap. Keyboard and mouse both work: navigate with keys, double-click
cards or rows to drill, and scroll popup bodies with the mouse wheel.
The TUI is read-only by design. y yanks the canonical mu command
for the focused row to your clipboard; you run it in a shell, so every
mutation still goes through a short-lived typed CLI invocation. The
one exception is user-driven: t inside a commit/show drill suspends
mu's alt-screen and hands off to tuicr -r <sha>, then restores the
dashboard when tuicr exits.
# Make sure you're inside tmux.
tmux
# Initialize the workstream (creates tmux session mu-auth-refactor)
mu workstream init auth-refactor
# Plan the work as a DAG. IDs auto-derive from titles.
mu task add --title "Design auth module" --impact 80 --effort-days 2
mu task add --title "Build auth" --impact 80 --effort-days 5 --blocked-by design_auth_module
mu task add --title "Review auth" --impact 60 --effort-days 1 --blocked-by build_auth
# Spawn a crew with isolated workspaces.
mu agent spawn worker-1 --workspace
mu agent spawn reviewer-1 --workspace --role read-only
# Human home base: interactive read-only TUI across every workstream
# (same dashboard is explicit with: mu state --tui -w auth-refactor).
mu
# Agent/script API: static state stays explicit and JSON-friendly.
mu state -w auth-refactor --json
# Inside an agent's pane, the agent claims and closes tasks
# without ever knowing its own name (mu reads $TMUX_PANE).
mu task claim design_auth_module
mu task note design_auth_module "DECISION: JWT, 24h expiry, refresh via cookie"
mu task close design_auth_module --evidence "design doc reviewed by reviewer-1"
# Subscribe to events instead of polling.
mu log --tail
# Cleanup (auto-snapshots + auto-exports the conversation first).
mu workstream destroy --yesFull tour: docs/USAGE_GUIDE.md, including the expanded dashboard/state guide.
State lives in one SQLite DB, so it travels. mu db export <file>
writes a consistent whole-DB copy plus a manifest sidecar; mu db import <file> ships it back, with per-workstream drift detection
(dry-run by default; --apply commits) and a sharp --force-source
that parks the loser to a sidecar before clobbering. Hard rule: don't
edit the same workstream on two machines concurrently.
For humans / git / docs, mu workstream export and mu archive export render a workstream (or an archive bucket) as Markdown:
per-task .md files plus an INDEX.md, suitable for committing,
reviewing, or pasting. Bucket exports are read-only artifacts; the
lossless un-archive path back into a live workstream is mu archive restore <label> --as <new-ws>.
pi-subagents |
mu |
|
|---|---|---|
| Best for | "Send this focused task to a specialist, return a result" | "Stand up a crew of agents I can keep talking to and watch live" |
| Lifetime | one-shot per task | long-lived by default |
| Substrate | pi subprocess + result files |
tmux panes running pi sessions |
| Built-in task graph | no | yes: parallel-tracks union-find with diamond-merge |
| Drivable from outside pi | no (extension-only) | yes (mu is a real CLI) |
The two play well together. A pi session can install both.
- docs/USAGE_GUIDE.md — practical tour of every verb. Start here.
- skills/mu/SKILL.md — what an LLM running inside an agent pane sees: the in-pane working loop, subscribe-vs-poll pattern.
- docs/ARCHITECTURE.md — module map, reconciliation algorithm, schema seam (surrogate INTEGER PKs + the SDK boundary discipline).
- docs/VOCABULARY.md — canonical terms; source of truth for every word in code, docs, error messages.
- docs/VISION.md — the load-bearing pillars
- the prior-runtime retrospective.
- docs/ROADMAP.md — what's next + the anti-feature pledges + explicitly-rejected ideas.
- CHANGELOG.md — release notes.
MIT.
