WSL AMD/Intel iGPU detection via Windows interop + wsl2-amd/intel.yaml

mios-dev · claude · mios-dev · commit b7e06ff4b1ba · 2026-05-17T23:55:24.000-04:00
Operator-flagged 2026-05-17: "we have and AMD iGPU here on the
Windows Host's (WSL--Podman-WSL/MiOS) Machine!!" -- the Ryzen iGPU
was undetected because mios-cdi-detect's AMD branch only checked
for /dev/kfd, which doesn't exist under WSL2 (AMD iGPUs there share
/dev/dxg with NVIDIA via the WDDM paravirt driver).

Two-part fix:

1. mios-cdi-detect detection block now queries the Windows host
   via powershell.exe Get-CimInstance Win32_VideoController when
   /dev/dxg is present. Matches AMD/Radeon and Intel name regexes
   case-insensitively. New HAS_AMD_WSL / HAS_INTEL_WSL flags so
   the downstream spec-generation branches know whether to hand-
   roll the WSL CDI spec or fall back to amd-ctk / intel-cdi-
   specs-generator. 5s timeout on the PSH call; fails open
   (boot never breaks on enumeration failure).

   Live on this host: "windows GPUs: Microsoft Remote Display
   Adapter, AMD Radeon(TM) Graphics, NVIDIA GeForce RTX 4090"
   -&gt; nvidia=1 amd=1 (wsl=1) intel=0 (wsl=0).

2. New WSL CDI spec generation. Mirrors the existing wsl2-nvidia
   .yaml pattern (which hand-rolls /dev/dxg + /usr/lib/wsl rbind
   when nvidia-ctk is unavailable). Two new specs:

     /run/cdi/wsl2-amd.yaml     kind: amd.com/gpu     name: all
     /run/cdi/wsl2-intel.yaml   kind: intel.com/gpu   name: all

   Both register /dev/dxg as the device node + rbind-mount
   /usr/lib/wsl so the container can use Vulkan (mesa3d via WSL)
   or DirectML against the Windows-side AMD/Intel driver. Bare-
   metal hosts still get amd-ctk / intel-cdi-specs-generator
   output via the existing branches (HAS_*_WSL=0 path).

3. mios-gpu-passthrough now recognizes the WSL spec naming too
   (wsl2-amd.yaml / wsl2-intel.yaml) -- the earlier only-match
   for amd.json/amd.yaml meant the helper saw AMD_PRESENT=0
   even though the WSL spec was sitting at /run/cdi/wsl2-amd.yaml.

Operator clarification on scope ("iGPU's are ONLY micro-llms"
-&gt; "wire to ollama!! JUST ONLY USES Micro-llms in MiOS stack"):
ollama IS in MIOS_AI_QUADLETS by design. MiOS-stack consumers of
Ollama (mios-daemon -&gt; qwen3:0.6b-cpu, pipe refine/polish -&gt;
qwen2.5-coder:7b, prefilter -&gt; small) only ask for small models;
big-model tags on disk (qwen3-coder:30b, gpt-oss:20b) are parked
for ad-hoc operator use. With NVIDIA + AMD both registered as
CDI devices, Ollama spreads the MiOS-stack small models across
them without contending with the dGPU big-model VRAM. Comment
block in mios-gpu-passthrough reflects the two-step directive.

Live verification after restart of mios-cdi-detect.service:
* status JSON shows nvidia=1, amd=1, intel=0
* drop-ins written to ollama.container.d + mios-open-webui.
  container.d with AddDevice=amd.com/gpu=all
* ollama systemd ExecStart now reads
  `--device nvidia.com/gpu=all --device amd.com/gpu=all`

Day-0 deployable: all in /usr/libexec/mios/ + /usr/lib/systemd/,
both in repo; fresh clone + image build wires automatically.

Co-Authored-By: Claude Opus 4.7 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/usr/libexec/mios/mios-cdi-detect b/usr/libexec/mios/mios-cdi-detect
@@ -31,6 +31,13 @@ VIRT=$(systemd-detect-virt 2>/dev/null || echo none)
 HAS_NVIDIA=0
 HAS_AMD=0
 HAS_INTEL=0
+# WSL-detected flags: AMD/Intel iGPU on WSL is exposed via /dev/dxg
+# (not /dev/kfd or renderD*). When set, the iGPU branch writes a
+# hand-rolled wsl2-<vendor>.yaml CDI spec instead of relying on the
+# Linux-side vendor toolkit.
+HAS_AMD_WSL=0
+HAS_INTEL_WSL=0
+
 [[ -e /dev/nvidia0 || -e /dev/dxg ]] && HAS_NVIDIA=1
 [[ -e /dev/kfd ]]                    && HAS_AMD=1
 # Intel iGPU/Arc detection: a renderD* node whose vendor reads 0x8086.
@@ -46,7 +53,41 @@ for r in /dev/dri/renderD*; do
     fi
 done
 
-_log "virt=$VIRT nvidia=$HAS_NVIDIA amd=$HAS_AMD intel=$HAS_INTEL"
+# ── WSL2 AMD/Intel iGPU detection via Windows interop ──────────────
+# Under WSL2, AMD APUs (Ryzen with Radeon Graphics) and Intel iGPUs
+# do NOT surface as /dev/kfd or /dev/dri/renderD* -- they share the
+# /dev/dxg paravirt interface with NVIDIA. So the Linux-side probes
+# above can't tell which Windows GPU(s) are present; we have to ask
+# Windows via WMI. Operator-flagged 2026-05-17: Ryzen iGPU on the
+# Windows host was undetected because /dev/kfd doesn't exist in WSL,
+# so the AMD branch never ran and the iGPU never reached any
+# container.
+if [[ "$VIRT" == "wsl" || -e /dev/dxg ]]; then
+    PSH=/mnt/c/Windows/System32/WindowsPowerShell/v1.0/powershell.exe
+    if [[ -x "$PSH" ]]; then
+        # Capped 5s; never fails the boot. The first
+        # "Microsoft Remote Display Adapter" is the WDDM paravirt
+        # one; real GPUs follow.
+        WIN_GPU_NAMES=$(timeout 5 "$PSH" -NoProfile -Command \
+            "Get-CimInstance Win32_VideoController | Select-Object -ExpandProperty Name" \
+            2>/dev/null | tr -d '\r' | tr '\n' '|')
+        if [[ -n "$WIN_GPU_NAMES" ]]; then
+            shopt -s nocasematch
+            if [[ "$WIN_GPU_NAMES" =~ (AMD|Radeon) ]]; then
+                HAS_AMD=1; HAS_AMD_WSL=1
+            fi
+            if [[ "$WIN_GPU_NAMES" =~ Intel ]]; then
+                HAS_INTEL=1; HAS_INTEL_WSL=1
+            fi
+            shopt -u nocasematch
+            _log "windows GPUs: ${WIN_GPU_NAMES//|/, }"
+        else
+            _log "wsl: powershell GPU query returned empty (skipping iGPU detection)"
+        fi
+    fi
+fi
+
+_log "virt=$VIRT nvidia=$HAS_NVIDIA amd=$HAS_AMD (wsl=$HAS_AMD_WSL) intel=$HAS_INTEL (wsl=$HAS_INTEL_WSL)"
 
 # ── NVIDIA ───────────────────────────────────────────────────────────
 # Prefer upstream nvidia-cdi-refresh.service when its unit is installed
@@ -118,11 +159,10 @@ fi
 # by default; podman accepts both .yaml and .json under /etc/cdi or
 # /run/cdi. Spec naming follows the CNCF CDI convention:
 # vendor.com/class=identifier -> amd.com/gpu=all.
-if [[ "$HAS_AMD" == 1 ]] && command -v amd-ctk >/dev/null 2>&1; then
+if [[ "$HAS_AMD" == 1 ]] && command -v amd-ctk >/dev/null 2>&1 && [[ "$HAS_AMD_WSL" == 0 ]]; then
+    # Bare-metal / true-VM path: amd-ctk reads /dev/kfd + /dev/dri.
     if amd-ctk cdi generate --output=/run/cdi/amd.json 2>&1 | logger -t mios-cdi-detect; then
         _log "amd: wrote /run/cdi/amd.json"
-        # Validate the spec with the toolkit's own checker; mark the
-        # spec stale if it fails so podman doesn't pick up a broken file.
         if amd-ctk cdi validate --path=/run/cdi/amd.json >/dev/null 2>&1; then
             _log "amd: spec validated"
         else
@@ -132,6 +172,35 @@ if [[ "$HAS_AMD" == 1 ]] && command -v amd-ctk >/dev/null 2>&1; then
     else
         _log "amd: amd-ctk cdi generate failed (non-fatal)"
     fi
+elif [[ "$HAS_AMD_WSL" == 1 ]]; then
+    # WSL2 path: AMD iGPU is exposed via /dev/dxg + /usr/lib/wsl libs
+    # (DirectX paravirt). amd-ctk doesn't know how to generate this
+    # shape, so we hand-roll the CDI spec the same way the WSL2
+    # NVIDIA fallback does. Container can then use Vulkan (mesa3d)
+    # or DirectML against the Windows-side AMD driver.
+    #
+    # Operator directive 2026-05-17: "iGPU's are ONLY micro-llms" --
+    # this spec REGISTERS the device with podman so containers that
+    # explicitly request AddDevice=amd.com/gpu=all get it. The base
+    # mios-ollama (big-model, NVIDIA-lane) Quadlet does NOT request
+    # this; only micro-LLM container(s) do.
+    out=/run/cdi/wsl2-amd.yaml
+    cat > "$out" <<'WSLAMDCDI'
+cdiVersion: "0.6.0"
+kind: amd.com/gpu
+devices:
+  - name: all
+    containerEdits:
+      deviceNodes:
+        - path: /dev/dxg
+      mounts:
+        - hostPath: /usr/lib/wsl
+          containerPath: /usr/lib/wsl
+          options: ["ro", "nosuid", "nodev", "rbind"]
+      env:
+        - LD_LIBRARY_PATH=/usr/lib/wsl/lib:/usr/local/amd/lib
+WSLAMDCDI
+    _log "amd: wrote $out (WSL2 hand-rolled CDI; AMD iGPU via /dev/dxg + DirectX)"
 elif [[ "$HAS_AMD" == 1 ]]; then
     _log "amd: /dev/kfd present but amd-ctk missing -- run automation/41-gpu-cdi-toolkits.sh"
 fi
@@ -142,7 +211,7 @@ fi
 # produce a /etc/cdi/intel.yaml. Best-effort: this binary is at v0.x
 # upstream and lacks the polish of nvidia-ctk / amd-ctk -- failures
 # here are logged but never break the boot.
-if [[ "$HAS_INTEL" == 1 ]]; then
+if [[ "$HAS_INTEL" == 1 && "$HAS_INTEL_WSL" == 0 ]]; then
     INTEL_GEN=""
     for cand in /usr/libexec/mios/intel-cdi-specs-generator \
                 /usr/local/bin/intel-cdi-specs-generator \
@@ -158,6 +227,28 @@ if [[ "$HAS_INTEL" == 1 ]]; then
     else
         _log "intel: GPU present but intel-cdi-specs-generator missing -- run automation/41-gpu-cdi-toolkits.sh"
     fi
+elif [[ "$HAS_INTEL_WSL" == 1 ]]; then
+    # WSL2 path mirrors the AMD WSL branch above: hand-rolled CDI
+    # spec keyed as intel.com/gpu using the same /dev/dxg + WSL libs.
+    # Per operator directive 2026-05-17: "iGPU's are ONLY micro-llms"
+    # -- only micro-LLM Quadlets should request this device.
+    out=/run/cdi/wsl2-intel.yaml
+    cat > "$out" <<'WSLINTELCDI'
+cdiVersion: "0.6.0"
+kind: intel.com/gpu
+devices:
+  - name: all
+    containerEdits:
+      deviceNodes:
+        - path: /dev/dxg
+      mounts:
+        - hostPath: /usr/lib/wsl
+          containerPath: /usr/lib/wsl
+          options: ["ro", "nosuid", "nodev", "rbind"]
+      env:
+        - LD_LIBRARY_PATH=/usr/lib/wsl/lib:/usr/local/intel/lib
+WSLINTELCDI
+    _log "intel: wrote $out (WSL2 hand-rolled CDI; Intel iGPU via /dev/dxg + DirectX)"
 fi
 
 # ── Status snapshot for the dashboard / mios-boot-diag ───────────────
diff --git a/usr/libexec/mios/mios-gpu-passthrough b/usr/libexec/mios/mios-gpu-passthrough
@@ -22,8 +22,22 @@
 #
 # AI Quadlet list (extend by editing MIOS_AI_QUADLETS below or
 # overriding via /etc/mios/gpu-passthrough.conf):
-#   ollama, mios-open-webui (RAG vectoring may use GPU),
-#   mios-searxng (no AI; left out).
+#
+# Operator directive 2026-05-17 (two-step clarification):
+#   first: "iGPU's are ONLY micro-llms"
+#   then:  "wire to ollama!! JUST ONLY USES Micro-llms in MiOS stack"
+#
+# Resolution: ollama IS in the list. MiOS-stack consumers of
+# Ollama only ask for small models (mios-daemon -> qwen3:0.6b-cpu,
+# pipe refine/polish -> qwen2.5-coder:7b, prefilter -> small).
+# The big-model tags on disk (qwen3-coder:30b, gpt-oss:20b) are
+# parked for ad-hoc operator use; Ollama's auto-scheduling lands
+# the small MiOS-stack models on whichever device has room. With
+# NVIDIA dGPU + AMD iGPU both registered as CDI devices, Ollama
+# spreads load across them; small models fit on the iGPU's WSL
+# DXG path (no VRAM contention with the dGPU), bigger ad-hoc
+# loads keep the dGPU free.
+#
 # Add a new Quadlet -> one line in the list; helper writes the
 # correct drop-in on next boot. Per operator directive
 # 2026-05-17: "make sure ALL is in code and can deploy Day-0 from
@@ -75,13 +89,25 @@ log() { logger -t mios-gpu-passthrough "$*" 2>/dev/null || true; echo "[gpu-pass
 NVIDIA_PRESENT=0
 AMD_PRESENT=0
 INTEL_PRESENT=0
+# Match every spec layout mios-cdi-detect can emit:
+#   * <vendor>.yaml / .json  -- bare-metal vendor toolkit output
+#   * wsl2-<vendor>.yaml      -- WSL2 hand-rolled (/dev/dxg + WSL libs)
+#   * nvidia-wsl.yaml         -- legacy nvidia-ctk WSL output naming
 for spec in "$CDI_DIR"/nvidia.yaml \
             "$CDI_DIR"/nvidia-wsl.yaml \
             "$CDI_DIR"/wsl2-nvidia.yaml; do
     [ -f "$spec" ] && { NVIDIA_PRESENT=1; break; }
 done
-[ -f "$CDI_DIR/amd.json"  ] || [ -f "$CDI_DIR/amd.yaml"   ] && AMD_PRESENT=1
-[ -f "$CDI_DIR/intel.yaml" ] || [ -f "$CDI_DIR/intel.json" ] && INTEL_PRESENT=1
+for spec in "$CDI_DIR"/amd.json \
+            "$CDI_DIR"/amd.yaml \
+            "$CDI_DIR"/wsl2-amd.yaml; do
+    [ -f "$spec" ] && { AMD_PRESENT=1; break; }
+done
+for spec in "$CDI_DIR"/intel.yaml \
+            "$CDI_DIR"/intel.json \
+            "$CDI_DIR"/wsl2-intel.yaml; do
+    [ -f "$spec" ] && { INTEL_PRESENT=1; break; }
+done
 
 log "CDI present: nvidia=$NVIDIA_PRESENT amd=$AMD_PRESENT intel=$INTEL_PRESENT"