AlphaBitCore
diff --git a/‎.claude/skills/build-agent/skill.md‎
Lines changed: 24 additions & 0 deletions b/‎.claude/skills/build-agent/skill.md‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎.env.example‎
Lines changed: 127 additions & 0 deletions b/‎.env.example‎
Lines changed: 127 additions & 0 deletions
diff --git a/‎.githooks/pre-commit‎
Lines changed: 7 additions & 0 deletions b/‎.githooks/pre-commit‎
Lines changed: 7 additions & 0 deletions
diff --git a/‎.github/workflows/ci.yml‎
Lines changed: 3 additions & 0 deletions b/‎.github/workflows/ci.yml‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎.gitignore‎
Lines changed: 7 additions & 8 deletions b/‎.gitignore‎
Lines changed: 7 additions & 8 deletions
@@ -213,6 +213,30 @@ bash packages/agent/platform/darwin/Scripts/build-prod.sh
 
 Output: `dist/macos/NexusAgent-<VERSION>.pkg`
 
+### Optional: Vectorscan rule-pack engine (cgo, statically linked)
+
+Off by default — the agent ships the pure-Go RE2 content-scan engine. To build
+the agent with the Vectorscan accelerator instead, set `NEXUS_AGENT_VECTORSCAN=1`
+and point `LIBHS_DIR` at a tree containing the Vectorscan headers and a static
+archive per target arch:
+
+```
+$LIBHS_DIR/include/hs/hs.h
+$LIBHS_DIR/lib/arm64/libhs.a
+$LIBHS_DIR/lib/amd64/libhs.a   # required for the universal cross-compile
+```
+
+`build.sh` then adds `-tags "vectorscan vsstatic"` and links `libhs.a` (plus
+`-lc++`, since Vectorscan is C++) statically, so the universal Mach-O has no
+`libhs.dylib` runtime dependency and stays notarizable — the same model already
+used for go-sqlcipher. Verified on arm64 (self-contained binary, no dylib dep);
+the amd64 archive is the remaining vendoring step for a universal build.
+
+Do NOT enable this for a SHIPPED enforcement build until the save-time rule
+linter (rule-pack program task #12) lands: a user rule using `.` could silently
+under-match multibyte abuse content under Vectorscan's byte-mode. Tag-on builds
+for link verification are fine.
+
 ### What the script does (in order)
 
 1. **Go binary** — universal `nexus-agent` (arm64+amd64, CGO for keychain).
 
@@ -146,6 +146,57 @@ COMPLIANCE_PROXY_API_TOKEN=CHANGE_ME_COMPLIANCE_PROXY_API_TOKEN
 # transport. There is no separate ai-gateway runtime token (the previous
 # AI_GATEWAY_API_TOKEN was an unvalidated 4th token; removed in F-0243).
 
+# Performance flags — all OPTIONAL and non-secret. Every one ships with its
+# optimal value as the CODE default, so a stock deployment needs none of them;
+# set a variable only to diverge from the default.
+#   NEXUS_LAZY_CANONICAL  — compute the request canonical only when a synchronous
+#     consumer needs it (smart routing / response cache); otherwise leave it nil
+#     and let the async audit writer derive it off the latency path. On the clean
+#     path (hooks-off, simple routing, cache-off) this skips the eager request-body
+#     Normalize entirely (~29% of request-path CPU on a 50 KB body). DEFAULT ON;
+#     NEXUS_LAZY_CANONICAL=0 forces always-compute. See normalization-architecture.md §5.2.
+#   NEXUS_CGO_SCAN_LIMIT  — cap concurrent hook content-scan cgo crossings.
+#     DEFAULT "auto" (≈ CPUs − 2): tames the M-oversubscription tail under high
+#     hooks-on concurrency. "0" disables the cap; a positive integer pins it.
+#   AI_GATEWAY_AUDIT_CODEC  — inline-body compression codec on the audit side-path.
+#     DEFAULT "s2" (faster; the larger frame is covered by the spool quota).
+#     "zstd" trades CPU for a smaller frame.
+#   NEXUS_AUDIT_WIRE  — gw→hub audit wire. DEFAULT "binary" (the Hub dual-reads).
+#     "json" reverts to the legacy text wire.
+#   AI_GATEWAY_AUDIT_LOSS_MODE  — audit overflow policy. DEFAULT "spill"
+#     (non-blocking spill-defer: no loss until the spill channel + disk are
+#     saturated; drops past that are counted on dropped_total). "block" = strict
+#     back-pressure (never drops, slows the request path); "drop" = bounded loss.
+#   NEXUS_QUOTA_WRITE_BEHIND / NEXUS_CREDSTATS_WRITE_BEHIND  — defer quota and
+#     credential-stats Redis writes off the request hot path (flush on an interval,
+#     final drain on graceful shutdown). DEFAULT ON (soft quota). Overshoot per
+#     instance ≤ read-cache TTL + flush interval (~1.25s); across an N-instance
+#     fleet the blind-spend window is that × N (each instance is unaware of peers'
+#     un-flushed spend), and a hard kill loses the un-flushed increments. Set to 0
+#     for strict synchronous per-request accounting.
+#   NEXUS_EVENTS_MAX_BYTES  — NEXUS_EVENTS audit-stream cap. DEFAULT "auto"
+#     (15% of total RAM; logs a WARN at startup with the chosen value). Pin a fixed
+#     size to override, e.g. NEXUS_EVENTS_MAX_BYTES=32GB. Alias: NEXUS_STREAM_MAX_BYTES.
+#   NEXUS_EVENTS_STORAGE  — NEXUS_EVENTS storage tier. DEFAULT "memory" (the audit
+#     stream is a delay-tolerant burst buffer; keeping it in RAM frees the data disk
+#     for the durable Postgres writes, the single largest single-box throughput
+#     lever). Trade-off: a NATS broker restart/crash drops published-but-undrained
+#     events (those already reclaimed from the producer spill); the overflow→disk
+#     no-loss path only covers the stream-full case, not a broker bounce. Set
+#     NEXUS_EVENTS_STORAGE=file for a durable file-backed stream that survives a
+#     broker restart at the cost of the steady-state disk writes.
+#   GOMEMLIMIT  — Go runtime soft memory limit (read by the Go runtime, not our
+#     code). When UNSET, each service auto-sets it at boot from the cgroup memory
+#     limit (~70% of the cgroup max) when one is present, and logs a WARN with the
+#     chosen value and how to override; if no cgroup limit is detectable it is left
+#     unset (no soft cap). Without a soft cap a burst of large request/response
+#     bodies can grow the heap until the kernel OOM-kills the service (observed under
+#     high-concurrency SSE). To pin it explicitly, set ~70% of the box/cgroup memory,
+#     e.g. GOMEMLIMIT=22GiB on a 32 GiB box. The AMI/systemd deployment also stamps
+#     it; the auto-set covers hand-rolled and container deployments that don't.
+#   NEXUS_PPROF_ADDR=:6060  — bind a net/http/pprof server for profiling. Unset
+#     in production unless actively profiling.
+
 # ─────────────────────────────────────────────────────────────────────────────
 # Infrastructure URLs (REQUIRED; vary per environment)
 # ─────────────────────────────────────────────────────────────────────────────
@@ -229,6 +280,49 @@ NATS_URL=nats://localhost:4222
 # default.
 # AI_GATEWAY_AUDIT_SPOOL_DIR=/var/lib/nexus/audit-spool
 
+# In-heap audit record-buffer cap (overflow → durable spill above). Each queued
+# record pins its pooled ~50 KB body until marshaled, so this bound is the primary
+# control over the audit side-path's gw heap: 10000 (default) holds the body pool
+# near ~1 GB under a slow-publish burst vs ~5 GB at the former 50000, same spill
+# rate. Raise on a memory-rich box, lower on a constrained one. 0/unset → 10000.
+# AI_GATEWAY_AUDIT_MAX_QUEUED_RECORDS=10000
+
+# Audit overflow policy. Durable audit is a product promise + compliance
+# requirement. DEFAULT "spill" is spill-defer: the request path never
+# back-pressures — overflow goes to the durable on-disk spool and the
+# spill-recovery sweeper replays it to Postgres. No loss UNTIL the in-process
+# spill channel + disk are saturated; under sustained overload past that point
+# records are dropped and counted on dropped_total (never silently). This lifts
+# clean-path RPS past the block-mode ceiling. Alternatives: "block" = hard
+# synchronous back-pressure (never drops — slows the request path until the audit
+# pipeline drains; strictest compliance posture); "drop" = counted bounded drop
+# (lossy, non-compliance only). Empty/unknown → "block" (never silently lossy
+# from a typo).
+# AI_GATEWAY_AUDIT_LOSS_MODE=spill
+
+# End-to-end zstd compression of large captured audit bodies. The producer
+# compresses off the request path (async marshal worker), the body rides the
+# NATS wire compressed, the Hub persists the compressed bytes verbatim (no
+# decompress on ingest), and only the Control-Plane view layer decompresses.
+# Captured bodies are JSON/text (~3-10x), and the audit pipeline is disk-I/O-
+# bound at the NATS broker, so this is the direct lever on publish throughput.
+# Default true; set 0/false to disable.
+# AI_GATEWAY_AUDIT_COMPRESS=true
+# Smallest captured body worth compressing (zstd frame + base64 overhead can
+# exceed savings below this). 0/unset → 1024.
+# AI_GATEWAY_AUDIT_COMPRESS_MIN_BYTES=1024
+# zstd encoder level (1=fastest, 3=default, higher=better ratio/slower).
+# 0/unset → library default.
+# AI_GATEWAY_AUDIT_COMPRESS_LEVEL=3
+# Spill-recovery sweeper: replays sealed on-disk spool files back into the MQ
+# queue so a record that overflowed to disk still reaches the queryable store
+# (the drain half of spill-defer). ON by default whenever a spool dir is set —
+# a durable spool that never reaches Postgres is a silent data gap. Interval =
+# sweep period; pace = throttle between files (yields the box to the request
+# path). 0/unset → 2000 ms / 50 ms. Set INTERVAL_MS negative to DISABLE.
+# AI_GATEWAY_AUDIT_SPILL_RECOVERY_INTERVAL_MS=2000
+# AI_GATEWAY_AUDIT_SPILL_RECOVERY_PACE_MS=50
+
 # Service discovery — co-located services on localhost; on prod each is a
 # domain or LB. Every URL below is bare-named (no service prefix) because
 # it identifies a shared environment-level entity, not a service-private
@@ -283,6 +377,36 @@ AUTH_SERVER_ISSUER=http://127.0.0.1:3001
 # Accept localhost WebSocket origins (dev only). MUST stay false/unset in prod.
 # NEXUS_HUB_DEV_MODE=true
 
+# NEXUS_EVENTS JetStream stream cap (audit side-path burst buffer). Accepts
+# "8GB" / "512MB" / a bare byte count, or "auto"/unset. DEFAULT "auto" = 15% of
+# total RAM (a WARN at startup logs the chosen value); pin a fixed size to override.
+# The producer publishes full-speed and the Hub drains lazily, so this absorbs a
+# long burst. The stream uses DiscardNew: at the cap, NEW audit publishes fail and
+# the gateway spills them durably to disk — it does NOT discard old un-acked rows.
+# Alias NEXUS_STREAM_MAX_BYTES (perf-rig name) is honoured when this is unset.
+# NEXUS_EVENTS_MAX_BYTES=auto
+#
+# NEXUS_EVENTS storage tier. DEFAULT "memory" (in-RAM stream) — keeps the
+# delay-tolerant burst buffer off the data disk so the disk serves the durable
+# Postgres writes, the single largest single-box throughput lever. NOTE: the cap
+# above is committed to RAM, so on a 256 GiB box "auto" (15%) commits ~38 GiB to
+# the stream — size GOMEMLIMIT/cgroup accordingly. Trade-off: a NATS broker
+# restart/crash drops published-but-undrained events; the overflow→disk no-loss
+# path covers only the stream-full case, not a broker bounce. Set "file" for a
+# durable file-backed stream that survives a restart at the cost of steady-state
+# disk writes.
+# NEXUS_EVENTS_STORAGE=memory
+
+# Traffic-event drain duty cycle — how the audit drain yields CPU to a co-located
+# AI-gateway core path (yaml: consumers.trafficDrainDutyCycle, this env overrides).
+# Default 0.3 = FIXED throttle: reliably yields the single-box's memory bandwidth /
+# loopback / Postgres to the gateway core path (measured: gateway 200-VU non-SSE
+# RPS ~5150 -> ~6300, beating Bifrost 5284, no loss). 0 = ADAPTIVE CPU-pressure
+# probe (best on a small/CPU-bound box; cannot see memory-bandwidth contention on
+# a core-rich box). >=1 = OFF (dedicated Hub box). NATS file store absorbs the
+# backlog while idle; audit is delay-tolerant, no-loss preserved by retention.
+# NEXUS_HUB_AUDIT_DRAIN_DUTY_CYCLE=0.3
+
 # Control Plane knobs.
 # CONTROL_PLANE_PORT=3001
 # CONTROL_PLANE_HOST=127.0.0.1
@@ -320,6 +444,9 @@ NEXUS_ASSISTANT_SYSTEM_VK=
 # AI Gateway knobs.
 # AI_GATEWAY_PORT=3050
 # AI_GATEWAY_HOST=127.0.0.1
+# Pre-grow KiB for the request-body read scratch (server.requestReadBufKb).
+# 64 default; raise to ~128 for fleets that routinely carry ~128K-token contexts.
+# AI_GATEWAY_REQUEST_READ_BUF_KB=64
 # AI_GATEWAY_CACHE_ENABLED=true
 # AI_GATEWAY_CACHE_TTL=5m
 # AI_GATEWAY_CACHE_PREFIX=ai-gw:
 
@@ -183,6 +183,13 @@ if echo "$staged" | grep -qE '\.go$'; then
   run_hard "no prod-code TODOs" node scripts/check-no-prod-todos.mjs --staged --strict
 fi
 
+# 13b-refs. No development-program tracking references in committed comments
+# (binding; CLAUDE.md → "No archaeology in code comments"). Scoped to staged
+# Go/TS files.
+if echo "$staged" | grep -qE '\.(go|ts|tsx)$'; then
+  run_hard "no program refs in comments" node scripts/check-comment-program-refs.mjs --staged --strict
+fi
+
 # 13b. gofmt on staged Go files (binding; gofmt gate — CI enforces repo-wide in
 # the Workspace integrity job). gofmt -l lists files differing from canonical
 # formatting; it exits 0 even when it finds some, so check the output directly.
 
@@ -73,6 +73,9 @@ jobs:
       - name: No placeholder markers in production Go (strict)
         run: node scripts/check-no-prod-todos.mjs --strict
 
+      - name: No program refs in comments (strict)
+        run: node scripts/check-comment-program-refs.mjs --strict
+
       - name: No inline secrets in yaml (strict)
         run: node scripts/check-no-yaml-secrets.mjs --strict
 
 
@@ -17,6 +17,7 @@ nexus-ami/build.log*
 nexus-ami/packer_cache/
 nexus-ami/manifest.json
 nexus-ami/crash.log
+Untracked files:
 nexus-ami/.current-stage-ts
 nexus-ami/stage.log.*
 
@@ -101,6 +102,12 @@ packages/agent/agent
 packages/agent/agent.exe
 packages/agent/nexus-agent
 packages/agent/nexus-agent.rollback
+# Agent local dev state dir (agent.dev.yaml writes here): device cert +
+# private key + bearer token + SQLCipher audit.db. Must never be committed.
+packages/agent/.nexus-agent/
+# NexusWFP kernel driver build outputs (msbuild bin/obj).
+packages/agent/platform/windows/nexus-wfp-driver/bin/
+packages/agent/platform/windows/nexus-wfp-driver/obj/
 packages/ai-gateway/explicit-proxy
 packages/compliance-proxy/forward-proxy
 
@@ -170,14 +177,6 @@ packages/control-plane/.nexus/
 tests/.env.local
 tests/.env.dev
 tests/.env.prod
-benchmark/v2/.env.local
-# Benchmark runtime artifacts — regenerated on container/run start
-benchmark/v2/gateways/bifrost-data/*.db
-benchmark/v2/gateways/bifrost-data/*.db-wal
-benchmark/v2/gateways/bifrost-data/*.db-shm
-benchmark/v2/gateways/bifrost-data/logs/
-benchmark/v2/demo/pii_demo_evidence.json
-benchmark/**/.DS_Store
 /tmp/nexus-test/
 
 # Test program runtime artifacts