Level 1: Windows SHM Transport Contract

Purpose

This document defines the shared memory region layout, synchronization protocol, and lifecycle rules for the Windows SHM transport. All implementations (C native, C MSYS, Rust native, Go native) must use this exact layout and synchronization protocol to interoperate on the same shared region.

Kernel object names

The shared memory region and synchronization objects are identified by kernel object names in the Local\ namespace:

Local\netipc-{hash:016llx}-{service}-p{profile}-s{session_id:016llx}-mapping
Local\netipc-{hash:016llx}-{service}-p{profile}-s{session_id:016llx}-req_event
Local\netipc-{hash:016llx}-{service}-p{profile}-s{session_id:016llx}-resp_event

Where:

hash = FNV-1a 64-bit hash of run_dir + "\n" + service_name + "\n" + auth_token
service = service_name (must contain only alphanumeric, dash, underscore, and dot; reject otherwise)
profile = selected profile number
session_id = server-assigned session identifier from the hello-ack payload, formatted as a zero-padded 16-character lowercase hex string

Each session gets its own set of kernel objects. The session_id ensures that multiple concurrent clients each have independent SHM regions and event objects.

The auth token is included in the hash so that only peers sharing the same token derive the same object names. This provides a basic authorization barrier for SHM access.

FNV-1a 64-bit hash

Same algorithm as the Named Pipe transport:

Offset basis: 0xcbf29ce484222325
Prime: 0x00000100000001B3
Input is the concatenation: run_dir + "\n" + service_name + "\n" + auth_token_as_decimal_string

Profile bits

Bit	Value	Name	Synchronization
1	`0x02`	SHM_HYBRID	Spin + kernel events
2	`0x04`	SHM_BUSYWAIT	Pure busy-spin, no kernel events
3	`0x08`	SHM_WAITADDR	Reserved (WaitOnAddress, not currently used cross-process)

SHM_HYBRID is the general-purpose profile. SHM_BUSYWAIT is for single-client low-latency scenarios where CPU usage is acceptable.

Region layout

The shared memory region is created via CreateFileMappingW and mapped via MapViewOfFile. The region has three sections:

[Header: 128 bytes, offset 0]
[Request area: request_capacity bytes, 64-byte aligned]
[Response area: response_capacity bytes, 64-byte aligned]

All data offsets are aligned to 64-byte cache-line boundaries.

The request and response capacities encoded in the header are fixed for the lifetime of that SHM session. Level 1 does not resize a mapped region in place. If higher layers later reconnect with larger learned limits, the new session gets a new session_id, a new mapping/event set, and capacities derived from that new handshake.

Region header (128 bytes)

Offset	Size	Type	Field	Volatile	Description
0	4	u32	magic	no	Must be `0x4e535748` ("NSWH")
4	4	u32	version	no	Must be `3`
8	4	u32	header_len	no	Must be `128`
12	4	u32	profile	no	Selected profile (2, 4, or 8)
16	4	u32	request_offset	no	Byte offset to request area
20	4	u32	request_capacity	no	Request area size in bytes
24	4	u32	response_offset	no	Byte offset to response area
28	4	u32	response_capacity	no	Response area size in bytes
32	4	u32	spin_tries	no	Spin iterations before kernel wait
36	4	LONG	req_len	yes	Current request message length
40	4	LONG	resp_len	yes	Current response message length
44	4	LONG	req_client_closed	yes	Client-side close flag
48	4	LONG	req_server_waiting	yes	Server waiting for request flag
52	4	LONG	resp_server_closed	yes	Server-side close flag
56	4	LONG	resp_client_waiting	yes	Client waiting for response flag
60	4	-	padding	no	Reserved
64	8	LONG64	req_seq	yes	Request sequence number
72	8	LONG64	resp_seq	yes	Response sequence number
80	48	-	reserved	no	Reserved for future use

Total: 128 bytes. Enforced by compile-time assertions including exact offset checks for spin_tries (32), req_len (36), resp_len (40), req_client_closed (44), req_server_waiting (48), resp_server_closed (52), resp_client_waiting (56), req_seq (64), and resp_seq (72).

Constants

Name	Value
REGION_MAGIC	`0x4e535748`
REGION_VERSION	`3`
CACHELINE_SIZE	`64` bytes
SHM_HYBRID_DEFAULT_SPIN_TRIES	`1024`
BUSYWAIT_DEADLINE_POLL_MASK	`1023`

The Windows default spin count (1024) is higher than POSIX (128) because Windows kernel synchronization primitives have higher overhead, and SHM performance on Windows requires more spinning to avoid falling into kernel wait paths.

Publication protocol

SHM_HYBRID (spin + kernel events)

Uses two named kernel events (req_event, resp_event) created via CreateEventW, plus spin loops on sequence numbers.

Client sends a request

Write the complete message into the request area.
Store the message length in req_len (interlocked exchange).
Increment req_seq (interlocked increment) to publish.
If req_server_waiting is set, signal req_event via SetEvent.

Server receives a request

Spin up to spin_tries iterations checking req_seq (interlocked compare).
If not advanced: set req_server_waiting = 1, check req_seq once more (avoid race), then WaitForSingleObject(req_event, timeout).
Clear req_server_waiting after waking.
Read req_len. If req_len is 0, report a protocol error — send rejects zero-length messages, so this indicates SHM corruption.
Validate req_len against request_capacity. If req_len exceeds the capacity, discard the message and report an error. This prevents out-of-bounds reads from a malicious or buggy peer.
Read the message bytes from the request area.

Server sends a response

Write the complete response into the response area.
Store the message length in resp_len (interlocked exchange).
Increment resp_seq (interlocked increment) to publish.
If resp_client_waiting is set, signal resp_event via SetEvent.

Client receives a response

Spin up to spin_tries iterations checking resp_seq.
If not advanced: set resp_client_waiting = 1, check resp_seq once more, then WaitForSingleObject(resp_event, timeout).
Clear resp_client_waiting after waking.
Read resp_len. If resp_len is 0, report a protocol error (SHM corruption).
Validate resp_len against response_capacity. If resp_len exceeds the capacity, discard the message and report an error.
Read the message bytes from the response area.

SHM_BUSYWAIT (pure spin, no kernel events)

No kernel event objects are created. Both sides spin continuously on sequence numbers.

The publication protocol is the same as SHM_HYBRID except:
- No SetEvent calls.
- No WaitForSingleObject calls.
- The waiter spins indefinitely (with deadline polling every BUSYWAIT_DEADLINE_POLL_MASK + 1 iterations to check for timeout expiry).

This profile burns full CPU on both client and server but achieves the lowest possible latency.

Close protocol

Client closes

Set req_client_closed = 1 (interlocked exchange).
If SHM_HYBRID: signal req_event to wake a waiting server.
Unmap the region and close handles.

Server closes

Set resp_server_closed = 1 (interlocked exchange).
If SHM_HYBRID: signal resp_event to wake a waiting client.
Unmap the region, close the file mapping handle.
Close event handles if SHM_HYBRID.

Detecting peer close

The server checks req_client_closed after waking from a wait or spin timeout. If set, the server treats the session as closed.
The client checks resp_server_closed similarly.
On peer close detection, the detecting side must report the session as disconnected (graceful close, equivalent to io.EOF).

Memory ordering

Sequence number and length stores use interlocked (atomic) operations with release semantics.
Sequence number and length loads use interlocked (atomic) operations with acquire semantics.
The sequence increment acts as the publication fence, same as POSIX.
The volatile qualifier on flag fields ensures compiler ordering but is not sufficient for cross-processor ordering — interlocked operations provide that.

Single in-flight constraint

Same as POSIX SHM: one in-flight message per direction. The client must wait for the response before sending the next request. Batch-level pipelining (multiple items in one message) is the mechanism for higher throughput.

Region lifecycle

Server creates a per-session region

The server creates one SHM region per accepted session, after the handshake negotiates an SHM profile. The kernel object names are derived from the session_id assigned during the handshake.

Derive kernel object names using session_id (see naming section).
Create the file mapping via CreateFileMappingW using the derived mapping name.
Map the view via MapViewOfFile.
Write the header: magic, version, header_len, profile, offsets, capacities, spin_tries. Initialize all volatile fields to zero.
If SHM_HYBRID: create req_event and resp_event as auto-reset kernel events (CreateEventW with bManualReset = FALSE). Auto-reset is required: SetEvent wakes exactly one waiter and resets automatically, which matches the one-writer/one-reader-per- direction model.
The region is now ready for the client.

The server must track all active per-session SHM regions so they can be cleaned up on session close and server shutdown.

If a later reconnect negotiates larger capacities, the server creates a new mapping/event set for the new session instead of resizing the old objects in place.

Client attaches to the region

Derive kernel object names using the session_id from the hello-ack.
Open the file mapping via OpenFileMappingW using the derived mapping name.
Map the view via MapViewOfFile.
Validate the header: magic, version, header_len, profile.
Read offsets, capacities, and spin_tries from the header.
If SHM_HYBRID: open req_event and resp_event kernel events.
The client is now ready to publish requests and consume responses.

Server destroys a per-session region

When a session closes (graceful or broken):

Unmap the view.
Close the file mapping handle.
Close event handles if SHM_HYBRID.
Kernel object cleanup is automatic when all handles are closed.

Client detaches

Unmap the view.
Close the file mapping handle.
Close event handles if SHM_HYBRID.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Level 1: Windows SHM Transport Contract

Purpose

Kernel object names

FNV-1a 64-bit hash

Profile bits

Region layout

Region header (128 bytes)

Constants

Publication protocol

SHM_HYBRID (spin + kernel events)

Client sends a request

Server receives a request

Server sends a response

Client receives a response

SHM_BUSYWAIT (pure spin, no kernel events)

Close protocol

Client closes

Server closes

Detecting peer close

Memory ordering

Single in-flight constraint

Region lifecycle

Server creates a per-session region

Client attaches to the region

Server destroys a per-session region

Client detaches

FilesExpand file tree

level1-windows-shm.md

Latest commit

History

level1-windows-shm.md

File metadata and controls

Level 1: Windows SHM Transport Contract

Purpose

Kernel object names

FNV-1a 64-bit hash

Profile bits

Region layout

Region header (128 bytes)

Constants

Publication protocol

SHM_HYBRID (spin + kernel events)

Client sends a request

Server receives a request

Server sends a response

Client receives a response

SHM_BUSYWAIT (pure spin, no kernel events)

Close protocol

Client closes

Server closes

Detecting peer close

Memory ordering

Single in-flight constraint

Region lifecycle

Server creates a per-session region

Client attaches to the region

Server destroys a per-session region

Client detaches