feat: add model-training skill and Training category

solderzzc · solderzzc · commit 1c48af4a5783 · 2026-03-14T23:22:06.000-07:00
- New skill: skills/training/model-training/ with SKILL.md manifest
  documenting the Aegis Training Agent pipeline:
  annotated dataset → YOLO fine-tune → auto-export → deploy
- Add 'training' category to skills.json
- Add model-training entry to skills.json registry
- Update README skill catalog with Training row
- Skill count: 18 → 19 skills, 9 → 10 categories
diff --git a/README.md b/README.md
@@ -60,7 +60,7 @@
 - [x] **AI/LLM-assisted skill installation** — community-contributed skills installed and configured via AI agent
 - [x] **GPU / NPU / CPU (AIPC) aware installation** — auto-detect hardware, install matching frameworks, convert models to optimal format
 - [x] **Hardware environment layer** — shared [`env_config.py`](skills/lib/env_config.py) for auto-detection + model optimization across NVIDIA, AMD, Apple Silicon, Intel, and CPU
-- [ ] **Skill development** — 18 skills across 9 categories, actively expanding with community contributions
+- [ ] **Skill development** — 19 skills across 10 categories, actively expanding with community contributions
 
 ## 🧩 Skill Catalog
 
@@ -73,6 +73,7 @@ Each skill is a self-contained module with its own model, parameters, and [commu
 | **Privacy** | [`depth-estimation`](skills/transformation/depth-estimation/) | [Real-time depth-map privacy transform](#-privacy--depth-map-anonymization) — anonymize camera feeds while preserving activity | ✅ |
 | **Annotation** | [`sam2-segmentation`](skills/annotation/sam2-segmentation/) | Click-to-segment with pixel-perfect masks | 📐 |
 | | [`dataset-annotation`](skills/annotation/dataset-annotation/) | AI-assisted labeling → COCO export | 📐 |
+| **Training** | [`model-training`](skills/training/model-training/) | Agent-driven YOLO fine-tuning — annotate, train, export, deploy | 📐 |
 | **Camera Providers** | [`eufy`](skills/camera-providers/eufy/) · [`reolink`](skills/camera-providers/reolink/) · [`tapo`](skills/camera-providers/tapo/) | Direct camera integrations via RTSP | 📐 |
 | **Streaming** | [`go2rtc-cameras`](skills/streaming/go2rtc-cameras/) | RTSP → WebRTC live view | 📐 |
 | **Channels** | [`matrix`](skills/channels/matrix/) · [`line`](skills/channels/line/) · [`signal`](skills/channels/signal/) | Messaging channels for Clawdbot agent | 📐 |
diff --git a/skills.json b/skills.json
@@ -9,6 +9,7 @@
     "transformation": "Depth estimation, style transfer, video effects",
     "privacy": "Privacy transforms — depth maps, blur, anonymization for blind mode",
     "annotation": "Dataset labeling, COCO export, training data",
+    "training": "Model fine-tuning, hardware-optimized export, deployment",
     "camera-providers": "Camera brand integrations — clip feed, live stream",
     "streaming": "RTSP/WebRTC live view via go2rtc",
     "channels": "Messaging platform channels for Clawdbot agent",
@@ -165,6 +166,37 @@
         "privacy_overlay",
         "blind_mode"
       ]
+    },
+    {
+      "id": "model-training",
+      "name": "Model Training",
+      "description": "Agent-driven YOLO fine-tuning — annotate, train, auto-export to TensorRT/CoreML/OpenVINO, deploy as detection skill.",
+      "version": "1.0.0",
+      "category": "training",
+      "path": "skills/training/model-training",
+      "tags": [
+        "training",
+        "fine-tuning",
+        "yolo",
+        "custom-model",
+        "export"
+      ],
+      "platforms": [
+        "linux-x64",
+        "linux-arm64",
+        "darwin-arm64",
+        "darwin-x64",
+        "win-x64"
+      ],
+      "requirements": {
+        "python": ">=3.9",
+        "ram_gb": 4
+      },
+      "capabilities": [
+        "fine_tuning",
+        "model_export",
+        "deployment"
+      ]
     }
   ]
 }
diff --git a/skills/training/model-training/SKILL.md b/skills/training/model-training/SKILL.md
@@ -0,0 +1,105 @@
+---
+name: model-training
+description: "Agent-driven YOLO fine-tuning — annotate, train, export, deploy"
+version: 1.0.0
+
+parameters:
+  - name: base_model
+    label: "Base Model"
+    type: select
+    options: ["yolo26n", "yolo26s", "yolo26m", "yolo26l"]
+    default: "yolo26n"
+    description: "Pre-trained model to fine-tune"
+    group: Training
+
+  - name: dataset_dir
+    label: "Dataset Directory"
+    type: string
+    default: "~/datasets"
+    description: "Path to COCO-format dataset (from dataset-annotation skill)"
+    group: Training
+
+  - name: epochs
+    label: "Training Epochs"
+    type: number
+    default: 50
+    group: Training
+
+  - name: batch_size
+    label: "Batch Size"
+    type: number
+    default: 16
+    description: "Adjust based on GPU VRAM"
+    group: Training
+
+  - name: auto_export
+    label: "Auto-Export to Optimal Format"
+    type: boolean
+    default: true
+    description: "Automatically convert to TensorRT/CoreML/OpenVINO after training"
+    group: Deployment
+
+  - name: deploy_as_skill
+    label: "Deploy as Detection Skill"
+    type: boolean
+    default: false
+    description: "Replace the active YOLO detection model with the fine-tuned version"
+    group: Deployment
+
+capabilities:
+  training:
+    script: scripts/train.py
+    description: "Fine-tune YOLO models on custom annotated datasets"
+---
+
+# Model Training
+
+Agent-driven custom model training powered by Aegis's Training Agent. Closes the annotation-to-deployment loop: take a COCO dataset from `dataset-annotation`, fine-tune a YOLO model, auto-export to the optimal format for your hardware, and optionally deploy it as your active detection skill.
+
+## What You Get
+
+- **Fine-tune YOLO26** — start from nano/small/medium/large pre-trained weights
+- **COCO dataset input** — uses standard format from `dataset-annotation` skill
+- **Hardware-aware training** — auto-detects CUDA, MPS, ROCm, or CPU
+- **Auto-export** — converts trained model to TensorRT / CoreML / OpenVINO / ONNX via `env_config.py`
+- **One-click deploy** — replace the active detection model with your fine-tuned version
+- **Training telemetry** — real-time loss, mAP, and epoch progress streamed to Aegis UI
+
+## Training Loop (Aegis Training Agent)
+
+```
+dataset-annotation          model-training              yolo-detection-2026
+┌─────────────┐        ┌──────────────────┐        ┌──────────────────┐
+│ Annotate    │───────▶│ Fine-tune YOLO   │───────▶│ Deploy custom    │
+│ Review      │  COCO  │ Auto-export      │ .pt    │ model as active  │
+│ Export      │  JSON  │ Validate mAP     │ .engine│ detection skill  │
+└─────────────┘        └──────────────────┘        └──────────────────┘
+       ▲                                                    │
+       └────────────────────────────────────────────────────┘
+                    Feedback loop: better detection → better annotation
+```
+
+## Protocol
+
+### Aegis → Skill (stdin)
+```jsonl
+{"event": "train", "dataset_path": "~/datasets/front_door_people/", "base_model": "yolo26n", "epochs": 50, "batch_size": 16}
+{"event": "export", "model_path": "runs/train/best.pt", "formats": ["coreml", "tensorrt"]}
+{"event": "validate", "model_path": "runs/train/best.pt", "dataset_path": "~/datasets/front_door_people/"}
+```
+
+### Skill → Aegis (stdout)
+```jsonl
+{"event": "ready", "gpu": "mps", "base_models": ["yolo26n", "yolo26s", "yolo26m", "yolo26l"]}
+{"event": "progress", "epoch": 12, "total_epochs": 50, "loss": 0.043, "mAP50": 0.87, "mAP50_95": 0.72}
+{"event": "training_complete", "model_path": "runs/train/best.pt", "metrics": {"mAP50": 0.91, "mAP50_95": 0.78, "params": "2.6M"}}
+{"event": "export_complete", "format": "coreml", "path": "runs/train/best.mlpackage", "speedup": "2.1x vs PyTorch"}
+{"event": "validation", "mAP50": 0.91, "per_class": [{"class": "person", "ap": 0.95}, {"class": "car", "ap": 0.88}]}
+```
+
+## Setup
+
+```bash
+python3 -m venv .venv && source .venv/bin/activate
+pip install -r requirements.txt
+```
diff --git a/skills/training/model-training/requirements.txt b/skills/training/model-training/requirements.txt
@@ -0,0 +1,5 @@
+ultralytics>=8.3.0
+torch>=2.0.0
+coremltools>=7.0; sys_platform == 'darwin'
+onnx>=1.14.0
+onnxruntime>=1.16.0