diff --git a/README.md b/README.md
Lines changed: 102 additions & 9 deletions
diff --git a/package.json b/package.json
Lines changed: 21 additions & 2 deletions
diff --git a/website/docs/docs/_meta.json b/website/docs/docs/_meta.json
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/docs/agent-setup.md b/website/docs/docs/agent-setup.md
Lines changed: 145 additions & 0 deletions
@@ -16,23 +16,33 @@ Device automation CLI for AI agents. Mobile, TV, and desktop apps.
 
 `agent-device` lets coding agents run real apps, inspect UI state, interact with visible elements, and collect debugging evidence through one CLI.
 
-It is built around token-efficient accessibility snapshots, not pixel-first screenshots. Agents read compact UI trees, locate elements through refs like `@e3`, perform touch and text actions, and capture screenshots, video, logs, network, perf, and React profiles only when evidence is needed.
+It is built around token-efficient accessibility snapshots, not pixel-first screenshots. Agents read compact UI trees, locate elements through refs like `@e3`, perform touch and text actions, and capture screenshots, video, logs, network, CPU/memory/perf, crash-related logs, and React profiles only when evidence is needed.
+
+Agents can ingest the current docs from [llms-full.txt](https://incubator.callstack.com/agent-device/llms-full.txt). The installed CLI help remains authoritative for exact command syntax.
 
 ## Agentic QA And Development
 
-- **Quality Assurance**: dogfood flows, validate PR builds, check accessibility coverage, capture evidence, and turn stable explorations into `.ad` e2e tests.
-- **Development**: build from specs, reproduce crashes and support issues, inspect logs/network/perf data, and iterate until the UI matches the work.
+- **Quality Assurance**: dogfood flows, validate PR builds, check accessibility coverage, and turn stable explorations into `.ad` e2e tests.
+- **Development**: build from specs, inspect real runtime behavior, and iterate until the UI matches the work.
+
+`agent-device` closes the agentic development loop: agents can write code, run the real app, verify the UI end-to-end, collect screenshots/videos/logs/perf evidence, and feed bugs, crashes, or performance findings back into the next fix iteration before a human reviews the PR.
+
+![Sketch showing agent-device as the live app verification layer in the agentic development loop](./website/docs/public/agentic-development-loop.svg)
 
 If you know Vercel's [agent-browser](https://github.com/vercel-labs/agent-browser), this is the same idea for apps and devices.
 
-![agent-device demo showing an agent inspecting and interacting with a contacts app](./website/docs/public/agent-device-contacts.gif)
+Use it for AI mobile testing, AI QA for React Native and Expo apps, iOS Simulator automation, Android Emulator automation, tvOS/Android TV checks, and desktop app verification from coding agents. Humans install and configure `agent-device`; agents run the workflows.
+
+![agent-device demo showing Codex using agent-device to create a new contact in the iOS Contacts app from a simple prompt](./website/docs/public/agent-device-contacts.gif)
+
+Demo: Codex uses `agent-device` to inspect iOS Contacts through accessibility snapshots, interact with visible UI, and create a contact from a simple prompt.
 
 ## Quick Start
 
 Install the CLI first:
 
 ```bash
-npm install -g agent-device
+npm install -g agent-device@latest
 agent-device --version
 agent-device help workflow
 ```
@@ -41,9 +51,22 @@ The CLI help is the source of truth for agents and is shipped with the installed
 
 If you install skills separately, keep the CLI on `agent-device >= 0.14.0`. Older CLIs do not include the workflow help topics that the router skills expect.
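
A setup script could enforce that version floor before an agent starts. The following is a minimal sketch, not part of `agent-device`: `extract_semver` and `version_ge` are hypothetical helper names, the output of `agent-device --version` is assumed to contain an `x.y.z` string, and `sort -V` is assumed to be available (GNU coreutils and recent BSD `sort` provide it).

```shell
# extract_semver: print the first x.y.z-looking token read from stdin.
# Assumption: the version output contains such a token somewhere.
extract_semver() {
  grep -Eo '[0-9]+\.[0-9]+\.[0-9]+' | head -n 1
}

# version_ge A B: succeed when version A is >= version B.
# sort -V compares each dotted component numerically, so if B sorts
# first (or ties with A), then A >= B.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Only probe the real CLI when it is on PATH.
if command -v agent-device >/dev/null 2>&1; then
  installed=$(agent-device --version 2>/dev/null | extract_semver)
  if version_ge "${installed:-0.0.0}" 0.14.0; then
    echo "agent-device ${installed} meets the 0.14.0 floor"
  else
    echo "agent-device ${installed:-unknown} is older than 0.14.0" >&2
  fi
fi
```

Comparing with `sort -V` rather than plain string comparison matters here because a hypothetical `0.9.x` release would otherwise sort after `0.14.0`.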
 
+### AI Agent Entry Points
+
+- **Agent + terminal**: in Cursor, Codex, Claude Code, Windsurf, and similar clients, run `agent-device` in the integrated terminal. Start planning with `agent-device help workflow`; CLI help is authoritative.
+- **Skills or rules**: install the skill with `npx skills add callstackincubator/agent-device`, use the bundled [agent-device skill](skills/agent-device/SKILL.md), or mirror it as a thin project rule, so the agent checks the installed version and reads `agent-device help workflow` before acting.
+- **MCP router**: use `agent-device mcp` when an MCP-aware client needs install, status, and version-matched help discovery. MCP is intentionally a thin router; device automation still runs through CLI commands.
+
+For client-specific setup, see [AI Agent Setup](https://incubator.callstack.com/agent-device/docs/agent-setup). For agent-readable docs, use [llms-full.txt](https://incubator.callstack.com/agent-device/llms-full.txt).
+
 ### MCP Router
 
-`agent-device` also ships an official stdio MCP router for discovery-oriented clients. It exposes only `status`, `install`, and `help` tools plus workflow prompts/resources; device automation still runs through the CLI commands returned by version-matched help.
+`agent-device` ships an official stdio MCP router for discovery-oriented clients. It exposes only `status`, `install`, and `help` tools plus workflow prompts/resources; it does not expose device automation or generic shell execution over MCP.
+
+Paste one of these into clients that accept `mcpServers`, such as Cursor project `.cursor/mcp.json` or user-level MCP settings.
+
+<details>
+<summary>Global install MCP config</summary>
 
 ```json
 {
@@ -56,6 +79,24 @@ If you install skills separately, keep the CLI on `agent-device >= 0.14.0`. Olde
 }
 ```
 
+</details>
+
+<details>
+<summary>No global install MCP config</summary>
+
+```json
+{
+  "mcpServers": {
+    "agent-device": {
+      "command": "npx",
+      "args": ["-y", "agent-device@latest", "mcp"]
+    }
+  }
+}
+```
+
+</details>
+
 Registry metadata uses MCP name `io.github.callstackincubator/agent-device`, npm package `agent-device`, stdio transport, `mcpName` package verification, `server.json`, and `smithery.yaml`.
 
 ```bash
@@ -91,20 +132,69 @@ agent-device close
 
 Snapshots assign refs like `@e1`, `@e2`, and `@e3` to current-screen elements. Refs from the default snapshot are immediately actionable; for hidden content, scroll and re-snapshot.
 
+### First 5 Minutes: Expo Test App
+
+Use the bundled Expo fixture when you want a concrete first agent run with setup checks, screenshots, replay, and performance evidence. This path requires a repo checkout because `examples/test-app` and the `pnpm test-app:*` scripts are not included in the published npm package.
+
+```bash
+git clone https://github.com/callstackincubator/agent-device.git
+cd agent-device
+```
+
+First terminal:
+
+```bash
+pnpm test-app:install
+cd examples/test-app
+npx expo-doctor@latest
+cd ../..
+pnpm test-app:ios
+# or: pnpm test-app:android
+```
+
+Then give your agent this prompt:
+
+```text
+Use agent-device to dogfood the bundled Expo app and produce an evidence-backed report.
+
+Setup:
+- Read `agent-device help workflow`, `agent-device help dogfood`, `agent-device help debugging`, and `agent-device help react-devtools` before planning commands.
+- Confirm the test app setup commands were run: `pnpm test-app:install`, `cd examples/test-app && npx expo-doctor@latest`, then `pnpm test-app:ios` or `pnpm test-app:android`.
+- If Metro prints an Expo URL, prefer opening the shell with that URL. On iOS use `agent-device open "Expo Go" <url> --platform ios`; on Android use the visible Expo/dev-client target or URL. Confirm the app UI with `snapshot -i`.
+
+Run:
+- Create `./dogfood-output/screenshots`, `./dogfood-output/videos`, `./dogfood-output/traces`, `./dogfood-output/perf`, and `./dogfood-output/replays`.
+- Open a named session `expo-qa` and save a replay script to `./dogfood-output/replays/expo-test.ad`.
+- Use command shapes like `agent-device --session expo-qa open "Expo Go" <url> --platform ios --save-script ./dogfood-output/replays/expo-test.ad`, `agent-device --session expo-qa screenshot ./dogfood-output/screenshots/home.png`, `agent-device --session expo-qa perf --json > ./dogfood-output/perf/baseline.json`, and `agent-device --session expo-qa record start ./dogfood-output/videos/checkout.mp4`.
+- Capture a baseline `snapshot -i`, screenshot, and `perf --json` sample.
+- Exercise Home, Catalog, product detail, Checkout, and Settings. Re-snapshot after each mutation and use refs/selectors from fresh snapshots.
+- Capture at least one overlay-ref screenshot, one normal screenshot, one short video recording for a meaningful flow, logs marks around any issue, and trace output if a runtime symptom needs diagnostics.
+- Run focused performance checks: compare `perf --json` before and after a navigation or form flow; if React DevTools connects, capture profile slow/rerender output. If it cannot connect, include the status and continue.
+- Close the session so the `.ad` replay is written.
+
+Report:
+- Write `./dogfood-output/report.md`.
+- Link every screenshot, video, trace, log path, replay file, and performance artifact you used.
+- Include setup results, platform/device, Expo doctor outcome, coverage, severity counts, findings with repro commands, and a short performance section summarizing startup/CPU/memory/frame-health or React profile findings.
+- If no issues are found, report covered flows and residual risk instead of claiming the app is bug-free.
+```
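
The output layout requested in the prompt can also be prepared up front, so the agent does not spend steps on it. A minimal sketch: `make_dogfood_dirs` is a hypothetical helper name, and only the directory names are taken from the prompt above.

```shell
# make_dogfood_dirs [ROOT]: create the artifact folders named in the
# prompt under ROOT (defaults to ./dogfood-output).
make_dogfood_dirs() {
  root="${1:-./dogfood-output}"
  for sub in screenshots videos traces perf replays; do
    # mkdir -p is idempotent, so re-running the helper is safe.
    mkdir -p "$root/$sub" || return 1
  done
}
```

Calling `make_dogfood_dirs` with no argument from the repo root creates `./dogfood-output/screenshots`, `videos`, `traces`, `perf`, and `replays`.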
+
 ## Where To Run agent-device
 
 | Path | Best for | Start with |
 | --- | --- | --- |
 | Local | Exploration, debugging, and development loops on simulators, emulators, physical devices, macOS apps, and Linux desktop targets. | Follow the Quick Start. |
 | CI/CD | Automated PR and merge validation with replay scripts and captured artifacts. | Start with the [EAS workflow template](https://github.com/callstackincubator/eas-agent-device/blob/main/.eas/workflows/agent-qa-mobile.yml). GitHub Actions template coming soon. |
-| Cloud | Linux runners, managed devices, and remote execution. | Use [Agent Device Cloud](https://agent-device.dev/cloud) or [contact Callstack](mailto:hello@callstack.com) for team-scale QA. |
+| Cloud / remote execution | Linux runners, managed devices, and remote execution. | Use [Agent Device Cloud](https://agent-device.dev/cloud), see [Commands](https://incubator.callstack.com/agent-device/docs/commands) for remote profiles, or [contact Callstack](mailto:hello@callstack.com) for team-scale QA. |
 
 ## Capabilities
 
 - **Platforms**: iOS, Android, tvOS, Android TV, macOS, and Linux. Real devices and simulators are supported.
-- **Capture**: screenshots, video, logs, network traffic, performance data, accessibility snapshots, and React render profiles.
+- **Agent-native UI model**: token-efficient accessibility snapshots, current-screen refs for exploration, selectors for durable replay, and skill-tested workflow guidance.
+- **Capture and debug**: screenshots, video, logs, network traffic, CPU/memory/performance data, crash-related logs, accessibility snapshots, and React render profiles.
 - **Produce**: replayable `.ad` scripts (recorded replay files that run locally or in CI), e2e test runs, snapshot and screenshot diffs, and debugging artifacts.
 - **React Native and Expo**: component tree inspection, props/state/hooks, and render profiling.
+- **MCP boundary**: discovery and help over MCP; app/device control through the CLI for explicit, auditable commands.
 - **License**: MIT. Free to use.
 
 ## How It Works
@@ -120,10 +210,13 @@ Used by teams and developers at Callstack, Expensify, Shopify, Kindred, Total Wi
 ## Documentation
 
 - [Installation](https://incubator.callstack.com/agent-device/docs/installation)
+- [AI Agent Setup](https://incubator.callstack.com/agent-device/docs/agent-setup)
 - [Typed Client](https://incubator.callstack.com/agent-device/docs/client-api)
 - [Commands](https://incubator.callstack.com/agent-device/docs/commands)
 - [Replay & E2E](https://incubator.callstack.com/agent-device/docs/replay-e2e)
+- [Security & Trust](https://incubator.callstack.com/agent-device/docs/security-trust)
 - [Known limitations](https://incubator.callstack.com/agent-device/docs/known-limitations)
+- [llms-full.txt](https://incubator.callstack.com/agent-device/llms-full.txt)
 
 Agent integration:
 
@@ -139,4 +232,4 @@ See [CONTRIBUTING.md](CONTRIBUTING.md).
 
 ## Made at Callstack
 
-agent-device is open source and MIT licensed. Try the [EAS workflow template](https://github.com/callstackincubator/eas-agent-device/blob/main/.eas/workflows/agent-qa-mobile.yml), use [Agent Device Cloud](https://agent-device.dev/cloud), or contact us at hello@callstack.com.
+agent-device is open source and MIT licensed. Visit [agent-device.dev](https://agent-device.dev/), try the [EAS workflow template](https://github.com/callstackincubator/eas-agent-device/blob/main/.eas/workflows/agent-qa-mobile.yml), read the [incubator docs](https://incubator.callstack.com/agent-device/), or contact us at hello@callstack.com.
@@ -1,7 +1,7 @@
 {
   "name": "agent-device",
   "version": "0.14.7",
-  "description": "Agent-driven CLI for mobile UI automation, network inspection, and performance diagnostics across iOS, Android, tvOS, and macOS.",
+  "description": "Agent-native CLI for AI mobile testing and app automation across iOS, Android, tvOS, Android TV, macOS, and Linux.",
   "mcpName": "io.github.callstackincubator/agent-device",
   "license": "MIT",
   "author": "Callstack",
@@ -160,7 +160,26 @@
     "performance",
     "mcp",
     "model-context-protocol",
-    "mcp-server"
+    "mcp-server",
+    "ai-agent",
+    "mobile-automation",
+    "ios-simulator",
+    "android-emulator",
+    "xcuitest",
+    "e2e-testing",
+    "cursor",
+    "claude-code",
+    "expo",
+    "mobile-testing",
+    "qa-automation",
+    "ai-testing",
+    "ios-automation",
+    "android-automation",
+    "simulator",
+    "emulator",
+    "appium",
+    "maestro",
+    "detox"
   ],
   "dependencies": {
     "fast-xml-parser": "^5.7.2",
 
@@ -9,6 +9,11 @@
     "type": "file",
     "label": "Installation"
   },
+  {
+    "name": "agent-setup",
+    "type": "file",
+    "label": "AI Agent Setup"
+  },
   {
     "name": "quick-start",
     "type": "file",
@@ -34,6 +39,11 @@
     "type": "file",
     "label": "Configuration"
   },
+  {
+    "name": "security-trust",
+    "type": "file",
+    "label": "Security & Trust"
+  },
   {
     "name": "batching",
     "type": "file",
 
@@ -0,0 +1,145 @@
+---
+title: AI Agent Setup
+description: Configure Cursor, Codex, Claude Code, Windsurf, Cline, Goose, skills, and MCP for agent-device mobile, TV, and desktop app verification.
+---
+
+# AI Agent Setup
+
+`agent-device` is built for AI agents, but humans usually install it, grant device permissions, and decide which agent client should use it.
+
+Use this page to wire Cursor, Codex, Claude Code, Windsurf, Cline, Goose, or another coding agent into mobile, TV, and desktop app verification. It covers skills, project rules, and MCP setup for React Native QA, Expo app verification, iOS Simulator automation, Android Emulator automation, tvOS checks, Android TV checks, debugging, profiling, and exploratory QA.
+
+The short version: install the CLI, make the agent read version-matched help, and let the agent run CLI commands in a terminal. MCP is available for discovery and help, not broad device control.
+
+## Prerequisite: install the CLI
+
+```bash
+npm install -g agent-device@latest
+agent-device --version
+agent-device help workflow
+```
+
+For one-off use without a global install:
+
+```bash
+npx -y agent-device@latest --version
+npx -y agent-device@latest help workflow
+```
+
+Global install is better for normal agent workflows because repeated commands, skills, and terminal sessions resolve to one stable version.
+
+For Node, Xcode, Android SDK, macOS, and iOS device prerequisites, see [Installation](/docs/installation).
+
+## Install the skill
+
+Install the skill when your agent runtime supports skills:
+
+```bash
+npx skills add callstackincubator/agent-device
+```
+
+The bundled [agent-device skill](https://github.com/callstackincubator/agent-device/blob/main/skills/agent-device/SKILL.md) is the canonical router for skill-aware clients. It intentionally points agents back to installed CLI help instead of duplicating the command manual.
+
+## Recommended agent rule
+
+Add this as a project rule, custom instruction, or skill equivalent when your agent client supports it:
+
+```text
+Use agent-device only for app/device automation tasks. Before planning commands, run `agent-device --version` and read `agent-device help workflow`. For exploratory QA, read `agent-device help dogfood`. For logs, network, traces, or runtime failures, read `agent-device help debugging`. For React Native component trees, props/state/hooks, slow renders, or rerenders, read `agent-device help react-devtools`.
+
+Use the CLI in the integrated terminal. MCP is only a discovery/help router and does not expose device automation tools. Prefer `open -> snapshot -i -> act -> re-snapshot -> verify -> close`. Use current refs such as `@e3` for exploration and selectors for durable replay. Run mutating commands against any one session serially, not in parallel. Capture screenshots, logs, network, perf, traces, recordings, and `.ad` replay scripts only when they add evidence.
+```
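
The `open -> snapshot -i -> ... -> close` order in the rule can be wrapped so that a session is always closed, even when the snapshot step fails. This is a rough sketch under stated assumptions: `smoke_loop` and the `AD` override variable are hypothetical, the `act` and `verify` steps are left out because they depend on the app under test, and only the `open ... --platform`, `snapshot -i`, and `close` command shapes shown in these docs are used. Check `agent-device help workflow` for the exact syntax on your installed version.

```shell
# AD is the command to run; override it (for example AD=echo) for a dry run.
AD="${AD:-agent-device}"

# smoke_loop TARGET [PLATFORM]: open the target, take an interactive
# snapshot, then close regardless of whether the snapshot succeeded.
smoke_loop() {
  target="$1"
  platform="${2:-ios}"
  # If open fails there is nothing to snapshot or close.
  $AD open "$target" --platform "$platform" || return 1
  status=0
  $AD snapshot -i || status=$?
  # Always close so the session is not left dangling.
  $AD close
  return "$status"
}
```

Running `AD=echo smoke_loop "Expo Go" ios` prints the three commands in order without touching a device, which is a cheap way to review the sequence before letting an agent execute it.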
+
+## MCP router
+
+`agent-device mcp` starts the official stdio MCP router for discovery-oriented clients. It exposes only `status`, `install`, and `help` tools plus workflow prompts/resources. Device automation still runs through the CLI commands returned by version-matched help.
+
+Global install configuration:
+
+```json
+{
+  "mcpServers": {
+    "agent-device": {
+      "command": "agent-device",
+      "args": ["mcp"]
+    }
+  }
+}
+```
+
+Variant without a global install:
+
+```json
+{
+  "mcpServers": {
+    "agent-device": {
+      "command": "npx",
+      "args": ["-y", "agent-device@latest", "mcp"]
+    }
+  }
+}
+```
+
+Registry metadata uses MCP name `io.github.callstackincubator/agent-device`, npm package `agent-device`, stdio transport, `mcpName` package verification, `server.json`, and `smithery.yaml`.
+
+## Cursor
+
+Use Agent mode with the integrated terminal. Add the recommended rule above as a project rule, then run:
+
+```bash
+agent-device help workflow
+agent-device apps --platform ios
+agent-device open <app-or-url> --platform ios
+agent-device snapshot -i
+```
+
+Optional: paste the [MCP router](#mcp-router) configuration into `.cursor/mcp.json`.
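
For teams that script project setup, the paste step could be automated. A minimal sketch, assuming the global-install configuration from the MCP router section and a project that has no `.cursor/mcp.json` yet; `write_cursor_mcp_config` is a hypothetical helper, and it deliberately refuses to overwrite an existing file rather than attempting to merge JSON.

```shell
# write_cursor_mcp_config DIR: create DIR/.cursor/mcp.json containing the
# global-install router entry. Fails if the file already exists.
write_cursor_mcp_config() {
  target="$1/.cursor/mcp.json"
  if [ -e "$target" ]; then
    echo "refusing to overwrite $target" >&2
    return 1
  fi
  mkdir -p "$1/.cursor" || return 1
  # Quoted heredoc delimiter: the JSON is written verbatim, no expansion.
  cat > "$target" <<'EOF'
{
  "mcpServers": {
    "agent-device": {
      "command": "agent-device",
      "args": ["mcp"]
    }
  }
}
EOF
}
```

If the project already has other MCP servers configured, add the `agent-device` entry to the existing `mcpServers` object by hand instead of using this helper.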
+
+## Codex
+
+Put the recommended rule in `AGENTS.md` or the project instructions. Let Codex run `agent-device` in the terminal:
+
+```bash
+agent-device help workflow
+agent-device boot --platform ios
+agent-device open <app-or-url> --platform ios
+agent-device snapshot -i
+```
+
+For reviews or planning-only tasks, tell the agent not to run devices unless explicitly requested.
+
+## Claude Code
+
+Use the bundled skill when your Claude setup supports skills. Otherwise put the recommended rule in `CLAUDE.md`.
+
+```bash
+agent-device --version
+agent-device help workflow
+agent-device help dogfood
+```
+
+If you configure MCP, keep using CLI commands for automation. The MCP router gives Claude install/status/help context only.
+
+## Windsurf, Cline, Goose, and other MCP clients
+
+Use the [MCP router](#mcp-router) configuration when the client supports `mcpServers`, then tell the agent to run device commands through the terminal.
+
+If the client has project rules or custom instructions, add the recommended agent rule above. If it does not, start the conversation by asking the agent to run `agent-device help workflow` before planning.
+
+## Why this setup works
+
+Each piece has one job: the CLI stays the auditable automation surface; installed help stays version-matched with the commands; skills and rules route agents toward the right help topics; and MCP gives discovery-oriented clients a small install/status/help entry point.
+
+For the broader positioning, supported targets, observability features, and how `agent-device` differs from scripted test frameworks, see [Introduction](/docs/introduction). For exact command groups and platform behavior, see [Commands](/docs/commands).
+
+For the local execution model, permissions, artifacts, and sensitive data guidance, see [Security & Trust](/docs/security-trust).
+
+## Agent-readable docs
+
+Use [llms-full.txt](https://incubator.callstack.com/agent-device/llms-full.txt) when an agent needs a single text bundle of the current docs. The installed CLI remains authoritative for exact command syntax:
+
+```bash
+agent-device help
+agent-device help workflow
+agent-device help dogfood
+```