VisionClaw is the distributed coordination platform that emerges when VisionClaw, Agentbox, the Nostr+Solid protocol layer, and the DreamLab forum mesh together. No single repository is VisionClaw — it is the coordination layer binding five independent systems into a platform for solving hard problems with distributed human and AI intelligence.
Hard problems — climate modelling, drug discovery, organisational transformation, creative production at scale — share a common shape: they require diverse intelligence (human intuition, domain expertise, machine pattern recognition, formal reasoning) to collaborate across trust boundaries, at scales that defeat centralised coordination.
Existing approaches fail in predictable ways:
- Centralised AI platforms scale tokens but not trust. One vendor, one failure mode, one billing relationship.
- Agent frameworks scale tasks but not governance. Fast and broken is still broken.
- Knowledge management tools scale information but not reasoning. More data doesn't mean better decisions.
- Collaboration platforms scale communication but not coordination. More Slack channels don't solve alignment.
VisionClaw solves all four simultaneously because the substrate handles identity, provenance, access control, and semantic reasoning as first-class architectural primitives — not bolted-on features.
flowchart TB
subgraph VF["VisionClaw Coordination Layer"]
direction LR
MESH["Nostr Relay Mesh\n(NIP-42 AUTH, bidirectional)"]
DID["DID:Nostr Identity Spine\n(secp256k1 everywhere)"]
WAC["WAC Access Control\n(Solid Protocol)"]
PROV["Immutable Provenance\n(content-addressed beads)"]
end
subgraph VC["VisionClaw — Knowledge Engineering"]
OWL["OWL 2 EL Reasoning\n(Whelk-rs)"]
GPU["92 CUDA Kernels\n(semantic physics)"]
XR["Immersive XR\n(multi-user)"]
MCP_VC["7 MCP Ontology Tools"]
end
subgraph AB["Agentbox — Harness Engineering"]
AGENTS["88 Agent Skills\n(manifest-driven)"]
NIX["Reproducible Runtime\n(Nix flakes)"]
TOOLS["180+ MCP Tools\n(browser, 3D, media, data)"]
PRIV["Privacy Filter\n(PII redaction)"]
end
subgraph PROTO["Protocol Infrastructure"]
SPR["solid-pod-rs\n(JSS Rust port, ~98% parity)"]
NRF["nostr-rust-forum\n(11 nostr-bbs-* crates)"]
FC["forum-config\n(operator overlay)"]
end
subgraph EDGE["DreamLab Edge Deployment"]
CF["Cloudflare Workers\n(5 Rust workers)"]
FORUM["Leptos Forum\n(WASM, 19 routes)"]
SITE["React Marketing\n(Vite SPA)"]
end
VC <--> VF
AB <--> VF
PROTO <--> VF
EDGE <--> VF
VC <-.->|"Broker Bridge\ngraph-state ingress\ngit enrichment"| AB
PROTO -->|"kit crates"| EDGE
SPR -->|"dep"| NRF
SPR -->|"dep"| AB
SPR -->|"dep"| VC
style VF fill:#1a1a2e,stroke:#e94560,color:#fff
style VC fill:#0a1a2a,stroke:#00d4ff,color:#fff
style AB fill:#1a0a2a,stroke:#8b5cf6,color:#fff
style PROTO fill:#0a2a1a,stroke:#10b981,color:#fff
style EDGE fill:#2a1a0a,stroke:#f59e0b,color:#fff
Every actor — human, agent, server, worker — is identified by a single secp256k1 keypair expressed as did:nostr:<hex-pubkey>. This identity is:
- Verified at the relay via NIP-42 AUTH challenge-response
- Verified at every HTTP request via NIP-98 Schnorr signatures (URL + method + body hash binding)
- Evaluated against WAC ACLs on every Solid pod read/write
- Embedded in every provenance bead as the event author
- Resolvable as a DID Document at
/.well-known/did.json
No shared session store. No token exchange between tiers. The cryptographic primitive is the coordination primitive.
The shared semantic substrate where humans and agents reason together. VisionClaw ingests knowledge, applies OWL 2 EL formal reasoning, renders it as a navigable 3D space, and exposes it to agents through MCP tools. Its unique contribution to the coordination layer:
- Shared ontology — agents and humans work against the same formal vocabulary. "Deliverable" means the same thing to a Creative Production agent and a Governance agent.
- W3C SHACL validation — every write path passes through shape constraints before reaching the store. Dual-mode gate: enforcing on mutations, advisory on reads.
- GPU-accelerated sense-making — semantic forces make related concepts cluster visually. Humans see patterns; agents traverse them algorithmically.
- Judgment Broker — the human-in-the-loop governance surface. Agents publish control panels (kind 31400); humans approve/reject via signed Nostr events (kind 31403).
- W3C PROV-O provenance — every reasoning chain reified as queryable RDF in an append-only graph. Agents can SPARQL-query "who asserted this, when, with what evidence."
The reproducible, hardened runtime for sovereign AI agents. Agentbox provides the container, the tools, the identity, and the privacy guarantees. Its unique contribution:
- Manifest-driven reproducibility — one
agentbox.toml, one Nix flake, byte-for-byte identical containers. Nonpm installat runtime. - Sovereign data stack — every agent action is stamped with a
did:nostridentity and stored in an embedded Solid pod. The agent owns its data cryptographically. - Five-slot adapter architecture — swap between standalone (SQLite, local JSONL) and federated (PostgreSQL pgvector, VisionClaw beads) by editing TOML. No recompilation.
- Privacy filter — every persistent write passes through a 1.5B-parameter MoE model that redacts PII before it reaches storage.
- 180+ tools — browser automation, 3D modelling, geospatial analysis, media processing, data science — all delivered as Nix packages, gated by manifest flags.
The Nostr+Solid layer provides the binding protocols that make coordination possible across trust boundaries:
- solid-pod-rs — the cryptographic foundation. Rust port of JSS (~98% parity). Provides LDP containers, WAC access control, DID:Nostr resolution, NIP-98 auth, HTTP 402 Web Ledger micropayments. Compiles to both native (Tokio) and WASM (CF Workers) targets via the
corefeature flag. - nostr-rust-forum — the forum kit. 11
nostr-bbs-*crates implementing passkey-first auth (WebAuthn PRF → HKDF → secp256k1), zone-based access control, governance event routing, and the Agent Control Surface Protocol (kinds 31400-31405). - forum-config — the operator overlay. Identity roster, trust thresholds, feature flags, payment configuration. The single source of truth for who can do what in a given deployment.
The user-facing surface: React marketing site, Leptos WASM forum, and five Cloudflare Workers. Consumes the protocol infrastructure as library crates. Demonstrates the VisionClaw architecture at production scale on dreamlab-ai.com.
flowchart LR
subgraph SITE["dreamlab-ai.com (Cloudflare Edge)"]
RW["Relay Worker\n(NIP-42 WebSocket)"]
AW["Auth Worker\n(WebAuthn + NIP-98)"]
PW["Pod Worker\n(R2 Solid Pods)"]
end
subgraph VC_HOST["VisionClaw (GPU Host)"]
BA["BrokerActor\n(case triage)"]
SNA["ServerNostrActor\n(signs 31400/31402)"]
PHYSICS["ForceComputeActor\n(60Hz CUDA)"]
end
subgraph AB_HOST["Agentbox (Agent Container)"]
NB["Nostr Bridge\n(relay subscriber)"]
SP["Solid Pod Server\n(:8484, alpha.15)"]
MA["Management API\n(:9090, 88 skills)"]
end
subgraph TUNNEL["Cloudflare Tunnel"]
CFT["cloudflared-pod\n→ agentbox:8484"]
end
RW <-->|"NIP-42 AUTH\ngovernance kinds\n31400-31405"| SNA
RW <-->|"NIP-42 AUTH\nagent kinds\n38000-38201"| NB
BA -->|"kind 31400\nPanelDefinition"| SNA
AW -->|"PSK provision\n/_admin/provision"| SP
SP <-->|"CF Tunnel"| CFT
MA <-->|"Broker Bridge\ngit enrichment"| BA
style SITE fill:#f59e0b15,stroke:#f59e0b
style VC_HOST fill:#00d4ff15,stroke:#00d4ff
style AB_HOST fill:#8b5cf615,stroke:#8b5cf6
style TUNNEL fill:#10b98115,stroke:#10b981
| Kind Range | Owner | Purpose |
|---|---|---|
| 1, 4, 42 | Forum users | Text notes, DMs, channel messages |
| 27235 | Any authenticated actor | NIP-98 HTTP auth tokens |
| 30001 | VisionClaw | Provenance beads (content-addressed) |
| 30910-30916 | Forum admins | Moderation (ban, mute, warning, report) |
| 31400-31405 | Registered agents | Agent Control Surface (panel, state, action, response) |
| 38000-38201 | Agentbox agents | Agent intent, job estimates, settlements |
| Tier | Host | Storage | Git | Provisioning | Identity |
|---|---|---|---|---|---|
| CF Workers | Cloudflare edge | R2 + KV | No (runtime limitation) | Auto at registration | Same did:nostr |
| Native (Agentbox) | Docker + CF Tunnel | Host filesystem | Yes (/_git/<pk>/) |
Admin PSK via auth-worker | Same did:nostr |
Both tiers verify NIP-98 independently. Same passkey-derived keypair authenticates to both. No shared session store, no token exchange. The identity is portable because the cryptographic primitive is portable.
VisionClaw scales along three axes simultaneously:
One agentbox in standalone mode. Local SQLite beads, local Solid pod, local JSONL events. 88 skills available, 180+ tools, privacy filter active. A single operator solving tasks with AI agents — token-efficient because the agent has sovereign tools, persistent memory, and doesn't need to rediscover context each session.
Cost profile: One API key. One container. Minutes to deploy.
VisionClaw + forum + agentbox on a shared relay mesh. The ontology provides shared vocabulary. The Judgment Broker provides human oversight. Agents publish control panels; humans approve actions. Every mutation passes through a GitHub PR. Every agent decision traces back through provenance beads.
Cost profile: One GPU host (VisionClaw), one agent container (agentbox), Cloudflare Workers (forum). 50-person team validated at DreamLab.
Multiple agentbox instances federated via the Nostr relay mesh. Governance event kinds (31400-31405) propagate across substrates. Each node is independently hardened (Nix reproducible, read-only filesystem, capability-dropped, seccomp-profiled). Nodes trust each other via did:nostr identity verification, not network topology.
Cost profile: Horizontal. Add nodes. Each node is sovereign — it owns its data, controls its agents, sets its trust thresholds. The mesh coordinates; no node controls.
flowchart TB
subgraph SINGLE["Single Operator"]
AB1["Agentbox\n(standalone)"]
end
subgraph TEAM["Team (50 people)"]
VC1["VisionClaw\n(shared ontology)"]
AB2["Agentbox\n(agent pool)"]
FORUM1["Forum\n(governance UI)"]
VC1 <--> AB2
VC1 <--> FORUM1
end
subgraph ENTERPRISE["Federated Enterprise"]
subgraph ORG_A["Organisation A"]
VC_A["VisionClaw"]
AB_A["Agentbox ×N"]
end
subgraph ORG_B["Organisation B"]
VC_B["VisionClaw"]
AB_B["Agentbox ×N"]
end
subgraph ORG_C["Organisation C"]
AB_C["Agentbox ×N"]
end
ORG_A <-->|"relay mesh\ndid:nostr trust"| ORG_B
ORG_B <-->|"relay mesh\ndid:nostr trust"| ORG_C
ORG_A <-->|"relay mesh"| ORG_C
end
SINGLE -.->|"add VisionClaw\n+ forum"| TEAM
TEAM -.->|"federate\nrelay mesh"| ENTERPRISE
style SINGLE fill:#0a2a1a15,stroke:#10b981
style TEAM fill:#0a1a2a15,stroke:#00d4ff
style ENTERPRISE fill:#1a0a2a15,stroke:#8b5cf6
solid-pod-rs compiles to both native Rust (Tokio) and WASM (Cloudflare Workers). The same crate powers edge pods on R2 and native pods on host filesystems. Agents run in Nix containers on any Linux host. The forum runs on CF Workers. VisionClaw runs on any CUDA-capable machine. No vendor lock-in at any layer.
Every actor owns a Solid pod. WAC ACLs — evaluated against did:nostr identities — control access. The pod is the canonical data store; the relay mesh is the coordination transport. If an organisation leaves the mesh, it takes its data with it. No migration. No export. The data was always theirs.
Every write is signed. Every event is content-addressed. Every governance decision is an immutable Nostr event with a prior_decision_id provenance chain. Auditors traverse the chain from any point back to first principles. This isn't an audit log — it's a structural guarantee.
The Judgment Broker isn't a bottleneck — it's a filter that lets everything else move faster. When agents know their authority boundary and can surface exceptions cleanly, the 90% of decisions that don't need human judgment flow without friction. The 10% that do get clean, contextualised escalation with full provenance.
| Repository | Role | Key Technology |
|---|---|---|
| VisionClaw | Knowledge engineering substrate | OWL 2 EL, 92 CUDA kernels, multi-user XR, 7 MCP tools |
| Agentbox | Harness engineering runtime | Nix flakes, 88 skills, 180+ tools, sovereign Solid pods |
| solid-pod-rs | Cryptographic foundation | JSS Rust port, DID:Nostr, WAC, Web Ledgers, HTTP 402 |
| nostr-rust-forum | Forum kit | 11 crates, passkey auth, governance event routing |
| dreamlab-ai-website | Branded deployment | React SPA, WASM forum, operator overlay |
The following Wardley map positions VisionClaw's components along the value chain (visibility to the user) and evolution axis (genesis → custom → product → commodity). It illustrates why certain layers are custom-built (semantic physics, ontology reasoning, governance protocol) while others leverage commodity infrastructure (Cloudflare, CUDA, WebSocket).
Key observations:
- Identity and provenance sit in the custom-built zone —
did:nostrand content-addressed beads are novel protocol primitives, not yet commoditised. - Agent runtime (Agentbox) occupies the product space — reproducible containers and tool orchestration are maturing but not yet commodity.
- GPU compute and relay infrastructure are approaching commodity — CUDA kernels and WebSocket relays are well-understood engineering, differentiated only by domain-specific application.
- The coordination layer (VisionClaw itself) is genesis — no existing platform combines formal reasoning, sovereign identity, and agent governance at this architectural level.
VisionClaw's protocol layer is built on Melvin Carvalho's JavaScriptSolidServer (JSS) and DID:Nostr. JSS is the AGPL-3.0 reference implementation of the Solid Protocol and the canonical source for the feature set, protocol extensions, and Web Ledger micropayment system. solid-pod-rs is a Rust port; protocol-level decisions defer to the upstream JSS repository. See MAINTAINERS.md.
