You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix(ai-plane): make a fresh irm|iex install bring up MiOS AI operational on GPU
Live-verified on a fresh dev VM: the :8640 front door now runs the full
orchestration pipeline end-to-end and returns a clean answer from granite-4.1-8b
on the RTX 4090 (181 tok/s). Root-caused + fixed the whole inert-AI chain:
- system-sync-env.sh: generate_env ended on a false `[[ -n "$SECRET" ]] && echo`
-> non-zero under set -e -> the install.env write was aborted BEFORE the mv,
so the env bridge silently produced nothing on every secret-less host. Force
`return 0`. Also emit resolved MIOS_PORT_* as their own numeric vars (systemd
EnvironmentFile + Python don't expand ${...} from sibling lines).
- mios-hermes-firstboot: VRAM probe used `command -v nvidia-smi`, but WSL2 ships
it at /usr/lib/wsl/lib/nvidia-smi which is NOT on systemd's PATH -> a 24GB RTX
4090 read as 0GB -> small tier. Probe explicit candidate paths.
- agent-pipe server.py: (1) _toml_section + the [agents.*] registry now expand
${MIOS_PORT_*} endpoint templates (systemd/Python don't); (2) _pick_agent is
degrade-open -- a health_gate primary the liveness cache can't confirm has its
endpoint blanked (-> BACKEND) and model reset; (3) at the proxy chokepoint,
when dispatch resolves to the BACKEND light lane the model is pinned to
BACKEND_MODEL (else llama-swap "no router for requested model").
- mios.toml: the :8643 heavy hermes-worker is health_gate=true so the
orchestrator drops it when the heavy lane is gated off (degrade-open) instead
of 502-ing the front door; it auto-rejoins when the lane is enabled.
(firstboot EnvironmentFile=/etc/mios/install.env + userenv.sh-deploy gaps landed
in a7cca48 / the overlay.) install-robustness 2026-06-21.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
0 commit comments