Mesh Transport Specification

Overview

Mesh Transport is a peer‑to‑peer networking layer that enables agents to discover each other and exchange messages in an ad‑hoc, possibly partitioned, network.

Requirements

Functional

Discovery
- Automatic discovery of peers on the same local network (via mDNS or UDP broadcast).
- Support for static configuration of known peer addresses.
- Ability to filter peers by agent ID or role.
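
The role filter could be expressed as a small helper over discovered peers. This is only a sketch: `PeerRecord` is a simplified stand-in for the `PeerInfo` type defined under Data Structures, and storing the role under a `"role"` metadata key is an assumption, not something this spec fixes.

```rust
use std::collections::HashMap;

/// Simplified stand-in for the spec's `PeerInfo` (addresses omitted).
#[derive(Debug, Clone, PartialEq)]
pub struct PeerRecord {
    pub id: [u8; 32],
    pub metadata: HashMap<String, String>,
}

/// Returns the peers whose metadata carries the requested role.
/// Assumption: roles live under the hypothetical metadata key "role".
pub fn filter_by_role<'a>(peers: &'a [PeerRecord], role: &str) -> Vec<&'a PeerRecord> {
    peers
        .iter()
        .filter(|p| p.metadata.get("role").map(String::as_str) == Some(role))
        .collect()
}
```

Filtering by agent ID would follow the same shape, comparing `id` instead of a metadata value.
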
Connection Management
- Establish bidirectional connections over TCP, WebRTC, or QUIC, falling back to another transport when the preferred one fails.
- Handle connection loss and automatic reconnection with exponential backoff.
- Keep‑alive heartbeats to detect dead peers.
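
One way to compute the reconnection delay is a doubling schedule with a cap. The function name and parameters below are hypothetical, and jitter is left out for brevity even though a real implementation would probably add it so that peers do not all retry at the same instant.

```rust
use std::time::Duration;

/// Delay before reconnection attempt `attempt` (0-based):
/// `base * 2^attempt`, capped at `max`.
pub fn backoff_delay(attempt: u32, base: Duration, max: Duration) -> Duration {
    // checked_pow / checked_mul guard against overflow for large attempt counts.
    let factor = 2u32.checked_pow(attempt).unwrap_or(u32::MAX);
    base.checked_mul(factor).unwrap_or(max).min(max)
}
```

With a 100 ms base and a 5 s cap, the delays run 100 ms, 200 ms, 400 ms, and so on until they reach 5 s.
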
Message Routing
- Unreliable broadcast (flooding) for small‑swarm scenarios.
- Reliable unicast for point‑to‑point commands.
- Optional multicast groups for topic‑based messaging.
Quality of Service
- Priority queues for critical messages (e.g., emergency stop).
- Configurable retransmission for reliable delivery.
- Bandwidth throttling per peer.
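
The priority-queue requirement might be met with a binary heap keyed on priority, using an insertion counter to keep first-in, first-out order within each priority level. `Priority` and `OutboundQueue` are illustrative names, and the three levels are an assumption.

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

/// Variants are ordered from lowest to highest priority.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum Priority {
    Low,
    Normal,
    Critical, // e.g. emergency stop
}

#[derive(Default)]
pub struct OutboundQueue {
    // Max-heap: highest priority first, then lowest sequence number first.
    heap: BinaryHeap<(Priority, Reverse<u64>, Vec<u8>)>,
    next_seq: u64,
}

impl OutboundQueue {
    pub fn push(&mut self, priority: Priority, payload: Vec<u8>) {
        self.heap.push((priority, Reverse(self.next_seq), payload));
        self.next_seq += 1;
    }

    /// Pops the most urgent message, oldest first within a priority level.
    pub fn pop(&mut self) -> Option<Vec<u8>> {
        self.heap.pop().map(|(_, _, payload)| payload)
    }
}
```
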
Security
- TLS‑like encryption (Noise protocol) for all links.
- Peer authentication via pre‑shared keys or certificate authority.
- Message integrity and replay protection.
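
Replay protection is often implemented as a sliding window over per-peer message counters. The sketch below assumes each message carries a monotonically increasing 64-bit counter, which this spec does not yet define. On an ordered stream, the nonce counter that Noise already maintains may be enough; a window like this matters mainly when messages can arrive out of order.

```rust
/// Sliding-window replay filter over per-peer message counters.
pub struct ReplayWindow {
    highest: u64,  // highest counter accepted so far
    bitmap: u64,   // bit i set => counter (highest - i) was already seen
    started: bool, // false until the first message arrives
}

impl ReplayWindow {
    pub const WIDTH: u64 = 64;

    pub fn new() -> Self {
        Self { highest: 0, bitmap: 0, started: false }
    }

    /// Returns true and records `counter` if it is fresh;
    /// returns false for replays and for counters older than the window.
    pub fn accept(&mut self, counter: u64) -> bool {
        if !self.started {
            self.started = true;
            self.highest = counter;
            self.bitmap = 1;
            return true;
        }
        if counter > self.highest {
            // Slide the window forward so bit 0 represents the new highest.
            let shift = counter - self.highest;
            self.bitmap = if shift >= Self::WIDTH { 0 } else { self.bitmap << shift };
            self.bitmap |= 1;
            self.highest = counter;
            return true;
        }
        let offset = self.highest - counter;
        if offset >= Self::WIDTH {
            return false; // too old to track
        }
        let mask = 1u64 << offset;
        if self.bitmap & mask != 0 {
            return false; // already seen: replay
        }
        self.bitmap |= mask;
        true
    }
}
```
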

Non‑Functional

Latency: < 100 ms for local‑network messages.
Throughput: Support at least 1000 messages/second per agent.
Scalability: Up to 50 agents in a single mesh.
Resource usage: < 5 MB RAM and < 1% CPU when idle.

Design

Architecture

┌─────────────────────────────────────────┐
│            Mesh Transport               │
├─────────────┬─────────────┬─────────────┤
│  Discovery  │ Connection  │   Routing   │
│    Module   │   Manager   │   Module    │
├─────────────┼─────────────┼─────────────┤
│           Security Layer                │
├─────────────────────────────────────────┤
│           Transport Backend             │
│           (libp2p, smol‑net)            │
└─────────────────────────────────────────┘

Components

1. Discovery Module

Implements Discovery trait.
Provides discover_peers() -> Vec<PeerInfo>.
Emits PeerDiscovered and PeerLost events.
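
As a rough illustration, the trait and its events could look like the following. The statically configured implementation and the `diff_snapshots` helper are hypothetical. `PeerInfo` is trimmed to the fields needed here, and `PeerId`'s field is made public for the example only.

```rust
use std::net::SocketAddr;

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct PeerId(pub [u8; 32]);

#[derive(Debug, Clone, PartialEq)]
pub struct PeerInfo {
    pub id: PeerId,
    pub addresses: Vec<SocketAddr>,
}

#[derive(Debug, PartialEq)]
pub enum DiscoveryEvent {
    PeerDiscovered(PeerInfo),
    PeerLost(PeerId),
}

pub trait Discovery {
    fn discover_peers(&self) -> Vec<PeerInfo>;
}

/// Discovery backed by a fixed, manually configured peer list.
pub struct StaticDiscovery {
    pub peers: Vec<PeerInfo>,
}

impl Discovery for StaticDiscovery {
    fn discover_peers(&self) -> Vec<PeerInfo> {
        self.peers.clone()
    }
}

/// Compares two discovery snapshots and reports what changed:
/// newly seen peers first, then peers that disappeared.
pub fn diff_snapshots(previous: &[PeerInfo], current: &[PeerInfo]) -> Vec<DiscoveryEvent> {
    let mut events = Vec::new();
    for peer in current {
        if !previous.iter().any(|p| p.id == peer.id) {
            events.push(DiscoveryEvent::PeerDiscovered(peer.clone()));
        }
    }
    for peer in previous {
        if !current.iter().any(|p| p.id == peer.id) {
            events.push(DiscoveryEvent::PeerLost(peer.id));
        }
    }
    events
}
```

Deriving events from successive snapshots keeps each discovery mechanism (mDNS, broadcast, static list) free of event bookkeeping, at the cost of a linear scan that should be acceptable for meshes of the size targeted here.
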

2. Connection Manager

Maintains a HashMap<PeerId, Connection>.
Creates outgoing connections and accepts incoming ones.
Monitors health and triggers reconnection.
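
A minimal sketch of the bookkeeping side of the manager follows. It models only heartbeat timestamps: `Connection` is a placeholder rather than a real socket, and the method names are assumptions. Time is passed in explicitly so the logic can be tested without sleeping.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct PeerId(pub [u8; 32]);

/// Placeholder for a live link; only heartbeat bookkeeping is modelled.
pub struct Connection {
    last_heartbeat: Instant,
}

#[derive(Default)]
pub struct ConnectionManager {
    connections: HashMap<PeerId, Connection>,
}

impl ConnectionManager {
    /// Registers a new incoming or outgoing connection.
    pub fn register(&mut self, peer: PeerId, now: Instant) {
        self.connections.insert(peer, Connection { last_heartbeat: now });
    }

    /// Records a heartbeat; returns false if the peer is not connected.
    pub fn record_heartbeat(&mut self, peer: PeerId, now: Instant) -> bool {
        match self.connections.get_mut(&peer) {
            Some(conn) => {
                conn.last_heartbeat = now;
                true
            }
            None => false,
        }
    }

    /// Drops connections whose last heartbeat is older than `timeout`
    /// and returns their ids so the caller can schedule reconnection.
    pub fn reap_dead(&mut self, now: Instant, timeout: Duration) -> Vec<PeerId> {
        let dead: Vec<PeerId> = self
            .connections
            .iter()
            .filter(|(_, c)| now.saturating_duration_since(c.last_heartbeat) > timeout)
            .map(|(id, _)| *id)
            .collect();
        for id in &dead {
            self.connections.remove(id);
        }
        dead
    }

    pub fn is_connected(&self, peer: &PeerId) -> bool {
        self.connections.contains_key(peer)
    }
}
```
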

3. Routing Module

Implements RouteMessage trait.
broadcast(data: Vec<u8>) floods to all connected peers.
send_to(peer: PeerId, data: Vec<u8>) delivers to a specific peer.
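
To make the two operations concrete, here is a sketch of the trait with an in-memory implementation that queues payloads per peer instead of writing to a network. `InMemoryRouter`, `RouteError`, and the return types are assumptions. The methods take `&mut self` for simplicity, whereas the `MeshTransport` trait under Data Structures uses `&self`.

```rust
use std::collections::HashMap;

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct PeerId(pub [u8; 32]);

#[derive(Debug, PartialEq)]
pub enum RouteError {
    NotConnected(PeerId),
}

pub trait RouteMessage {
    /// Floods `data` to every connected peer; returns how many peers it was queued for.
    fn broadcast(&mut self, data: Vec<u8>) -> usize;
    /// Queues `data` for one peer, failing if that peer is not connected.
    fn send_to(&mut self, peer: PeerId, data: Vec<u8>) -> Result<(), RouteError>;
}

/// Test double: one outbox per connected peer.
#[derive(Default)]
pub struct InMemoryRouter {
    outboxes: HashMap<PeerId, Vec<Vec<u8>>>,
}

impl InMemoryRouter {
    pub fn connect(&mut self, peer: PeerId) {
        self.outboxes.entry(peer).or_default();
    }

    pub fn queued_for(&self, peer: &PeerId) -> &[Vec<u8>] {
        self.outboxes.get(peer).map(Vec::as_slice).unwrap_or(&[])
    }
}

impl RouteMessage for InMemoryRouter {
    fn broadcast(&mut self, data: Vec<u8>) -> usize {
        for outbox in self.outboxes.values_mut() {
            outbox.push(data.clone());
        }
        self.outboxes.len()
    }

    fn send_to(&mut self, peer: PeerId, data: Vec<u8>) -> Result<(), RouteError> {
        match self.outboxes.get_mut(&peer) {
            Some(outbox) => {
                outbox.push(data);
                Ok(())
            }
            None => Err(RouteError::NotConnected(peer)),
        }
    }
}
```

This only covers single-hop flooding to directly connected peers. Forwarding across multiple hops would additionally need duplicate suppression, which is not shown.
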

4. Security Layer

Wraps each connection with an authenticated encrypted channel.
Uses the Noise Protocol Framework (XX handshake pattern).

5. Transport Backend

Abstract trait TransportBackend allowing pluggable implementations.
Default backend: Libp2pBackend (using libp2p crate).
Alternative backend: SmolNetBackend (custom lightweight TCP/UDP).
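
The pluggable-backend idea can be sketched with a trait object. The method set below (`name`, `send`, `try_recv`) is a guess at a minimal surface, not the planned API. `LoopbackBackend` and `Mesh` are hypothetical, and the real trait would most likely be async.

```rust
use std::collections::VecDeque;
use std::io;
use std::net::SocketAddr;

/// Hypothetical minimal surface for a pluggable backend.
pub trait TransportBackend {
    fn name(&self) -> &'static str;
    fn send(&mut self, to: SocketAddr, frame: &[u8]) -> io::Result<()>;
    fn try_recv(&mut self) -> Option<(SocketAddr, Vec<u8>)>;
}

/// Test backend that delivers every sent frame to its own inbox.
#[derive(Default)]
pub struct LoopbackBackend {
    inbox: VecDeque<(SocketAddr, Vec<u8>)>,
}

impl TransportBackend for LoopbackBackend {
    fn name(&self) -> &'static str {
        "loopback"
    }

    fn send(&mut self, to: SocketAddr, frame: &[u8]) -> io::Result<()> {
        self.inbox.push_back((to, frame.to_vec()));
        Ok(())
    }

    fn try_recv(&mut self) -> Option<(SocketAddr, Vec<u8>)> {
        self.inbox.pop_front()
    }
}

/// Upper layers hold the backend behind a trait object,
/// so swapping implementations needs no code changes here.
pub struct Mesh {
    backend: Box<dyn TransportBackend>,
}

impl Mesh {
    pub fn new(backend: Box<dyn TransportBackend>) -> Self {
        Self { backend }
    }

    pub fn backend_name(&self) -> &'static str {
        self.backend.name()
    }

    pub fn ping(&mut self, to: SocketAddr) -> io::Result<()> {
        self.backend.send(to, b"ping")
    }

    pub fn try_recv(&mut self) -> Option<(SocketAddr, Vec<u8>)> {
        self.backend.try_recv()
    }
}
```
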

Data Structures

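use std::collections::HashMap;
use std::net::SocketAddr;
use std::time::Instant;

use futures::Stream; // assumption: `Stream` comes from the futures crate

// `Result<T>` below is assumed to be a crate-level alias over a transport error type.

// Hash and Eq let PeerId serve as the key of the Connection Manager's HashMap.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct PeerId([u8; 32]); // Cryptographic hash of public key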

pub struct PeerInfo {
    pub id: PeerId,
    pub addresses: Vec<SocketAddr>,
    pub metadata: HashMap<String, String>,
}

pub enum TransportEvent {
    PeerDiscovered(PeerInfo),
    PeerLost(PeerId),
    MessageReceived {
        from: PeerId,
        payload: Vec<u8>,
        timestamp: Instant,
    },
    ConnectionEstablished(PeerId),
    ConnectionClosed(PeerId),
}

pub trait MeshTransport {
    fn broadcast(&self, payload: Vec<u8>) -> Result<()>;
    fn send_to(&self, peer: PeerId, payload: Vec<u8>) -> Result<()>;
    fn events(&self) -> Box<dyn Stream<Item = TransportEvent> + Send>;
    fn peers(&self) -> Vec<PeerInfo>;
}

Implementation Plan

Phase 1 – Minimal Viable Transport

Create crates/mesh-transport with libp2p as a dependency.
Implement a simple discovery via mDNS (libp2p‑mdns).
Establish TCP connections between discovered peers.
Send plain‑text “ping‑pong” messages.
Unit test with two in‑process nodes.

Phase 2 – Reliability & Security

Add Noise protocol encryption.
Implement reliable message delivery with sequence numbers and ACKs.
Add connection heartbeat and reconnection logic.
Benchmark latency and throughput.
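
The sender side of sequence-numbered delivery with ACKs might look like the sketch below. `ReliableSender`, its method names, and the fixed retransmission timeout are assumptions. A real implementation would probably adapt the timeout to measured round-trip time, and the receiver would need to drop duplicates by sequence number.

```rust
use std::collections::BTreeMap;
use std::time::{Duration, Instant};

struct Pending {
    payload: Vec<u8>,
    last_sent: Instant,
    attempts: u32,
}

/// Tracks unacknowledged messages for one peer.
#[derive(Default)]
pub struct ReliableSender {
    next_seq: u64,
    pending: BTreeMap<u64, Pending>,
}

impl ReliableSender {
    /// Assigns the next sequence number and tracks the payload until it is ACKed.
    pub fn send(&mut self, payload: Vec<u8>, now: Instant) -> u64 {
        let seq = self.next_seq;
        self.next_seq += 1;
        self.pending.insert(seq, Pending { payload, last_sent: now, attempts: 1 });
        seq
    }

    /// Returns true if the ACK matched an in-flight message.
    pub fn on_ack(&mut self, seq: u64) -> bool {
        self.pending.remove(&seq).is_some()
    }

    /// Returns the messages whose ACK is overdue so they can be resent,
    /// and gives up on messages that already used `max_attempts`.
    pub fn retransmit_due(
        &mut self,
        now: Instant,
        timeout: Duration,
        max_attempts: u32,
    ) -> Vec<(u64, Vec<u8>)> {
        let mut due = Vec::new();
        let mut expired = Vec::new();
        for (&seq, p) in self.pending.iter_mut() {
            if now.saturating_duration_since(p.last_sent) < timeout {
                continue; // still waiting for the ACK
            }
            if p.attempts >= max_attempts {
                expired.push(seq);
                continue;
            }
            p.attempts += 1;
            p.last_sent = now;
            due.push((seq, p.payload.clone()));
        }
        for seq in expired {
            self.pending.remove(&seq);
        }
        due
    }

    pub fn in_flight(&self) -> usize {
        self.pending.len()
    }
}
```
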

Phase 3 – Advanced Features

Support QUIC (which runs over UDP) for lower latency.
Multicast groups for topic‑based subscriptions.
Integration with resource monitor for adaptive QoS.

Dependencies

libp2p (with features tcp, mdns, noise, yamux)
tokio for async runtime
serde for configuration serialization
tracing for structured logging

Testing Strategy

Unit tests: Mock network interfaces with libp2p‑swarm‑test.
Integration tests: Spawn multiple OS‑level processes that communicate via loopback.
Simulation tests: Use tokio‑test and virtual time to simulate network partitions.

Open Questions

Should we support WebRTC for browser‑based agents?
Is mDNS sufficient for discovery, or do we need a custom beacon protocol?
How to handle NAT traversal (STUN/TURN)?
