docs: add agent batching guides across readme website and skills

thymikee · thymikee · commit 6106db5c0084 · 2026-02-18T11:08:31.000+01:00
diff --git a/README.md b/README.md
@@ -45,6 +45,61 @@ agent-device click @e3
 agent-device close
 ```
 
+## Fast batching for agents (JSON steps)
+
+Use `batch` to execute multiple commands in a single daemon request.
+
+CLI examples:
+
+```bash
+agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-file /tmp/batch-steps.json \
+  --json
+```
+
+```bash
+cat /tmp/batch-steps.json | agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-stdin \
+  --json
+```
+
+Small inline payloads are also supported:
+
+```bash
+agent-device batch --steps '[{"command":"open","positionals":["settings"]},{"command":"wait","positionals":["100"]}]'
+```
+
+Batch payload format:
+
+```json
+[
+  { "command": "open", "positionals": ["settings"], "flags": {} },
+  { "command": "wait", "positionals": ["label=\"Privacy & Security\"", "3000"], "flags": {} },
+  { "command": "click", "positionals": ["label=\"Privacy & Security\""], "flags": {} },
+  { "command": "get", "positionals": ["text", "label=\"Tracking\""], "flags": {} }
+]
+```
+
+Batch response includes:
+
+- `total`, `executed`, `totalDurationMs`
+- per-step `results[]` with `durationMs`
+- failure context with failing `step` and `partialResults`
+
+Agent usage guidelines:
+
+- Keep each batch to one screen-local workflow.
+- Add sync guards (`wait`, `is exists`) after mutating steps (`open`, `click`, `fill`, `swipe`).
+- Treat refs/snapshot assumptions as stale after UI mutations.
+- Prefer `--steps-file` or `--steps-stdin` over inline JSON for reliability.
+- Keep batches moderate (about 5-20 steps) and stop on first error.
+
 ## CLI Usage
 
 ```bash
@@ -84,6 +139,7 @@ agent-device swipe 540 1500 540 500 120 --count 8 --pause-ms 30 --pattern ping-p
 
 ## Command Index
 - `boot`, `open`, `close`, `reinstall`, `home`, `back`, `app-switcher`
+- `batch`
 - `snapshot`, `find`, `get`
 - `click`, `focus`, `type`, `fill`, `press`, `long-press`, `swipe`, `scroll`, `scrollintoview`, `pinch`, `is`
 - `alert`, `wait`, `screenshot`
@@ -114,6 +170,11 @@ Flags:
 - `--pattern one-way|ping-pong` repeat pattern for `swipe`
 - `--verbose` for daemon and runner logs
 - `--json` for structured output
+- `--steps <json>` batch: JSON array of steps
+- `--steps-file <path>` batch: read step JSON from file
+- `--steps-stdin` batch: read step JSON from stdin
+- `--on-error stop` batch: stop when a step fails
+- `--max-steps <n>` batch: max allowed steps per request
 
 Pinch:
 - `pinch` is supported on iOS simulators.
diff --git a/skills/agent-device/SKILL.md b/skills/agent-device/SKILL.md
@@ -155,6 +155,59 @@ agent-device replay -u ./session.ad   # Update selector drift and rewrite .ad sc
 `--save-script` path is a file path; parent directories are created automatically.
 For ambiguous bare values, use `--save-script=workflow.ad` or `./workflow.ad`.
 
+### Fast batching for agents (JSON steps)
+
+Use `batch` when an agent already has a known short sequence and wants fewer orchestration round trips.
+
+```bash
+agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-file /tmp/batch-steps.json \
+  --json
+```
+
+```bash
+cat /tmp/batch-steps.json | agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-stdin \
+  --json
+```
+
+Inline JSON works for small payloads:
+
+```bash
+agent-device batch --steps '[{"command":"open","positionals":["settings"]},{"command":"wait","positionals":["100"]}]'
+```
+
+Step format:
+
+```json
+[
+  { "command": "open", "positionals": ["settings"], "flags": {} },
+  { "command": "wait", "positionals": ["label=\"Privacy & Security\"", "3000"], "flags": {} },
+  { "command": "click", "positionals": ["label=\"Privacy & Security\""], "flags": {} },
+  { "command": "get", "positionals": ["text", "label=\"Tracking\""], "flags": {} }
+]
+```
+
+Batch best practices for agents:
+
+- Batch one screen-local flow at a time.
+- Add sync guards (`wait`, `is exists`) after mutating steps (`open`, `click`, `fill`, `swipe`).
+- Treat prior refs/snapshot assumptions as stale after UI mutations.
+- Prefer `--steps-file` or `--steps-stdin` over inline JSON.
+- Keep batches moderate (about 5-20 steps).
+- Use failure context (`step`, `partialResults`) to replan from the failed step.
+
+Stale accessibility tree note:
+
+- Rapid mutations can outrun accessibility tree updates.
+- Mitigate with explicit waits and phase splitting (navigate, verify/extract, cleanup).
+
 ### Trace logs (XCTest)
 
 ```bash
@@ -208,3 +261,4 @@ agent-device apps --platform android --user-installed
 - [references/permissions.md](references/permissions.md)
 - [references/video-recording.md](references/video-recording.md)
 - [references/coordinate-system.md](references/coordinate-system.md)
+- [references/batching-for-agents.md](references/batching-for-agents.md)
diff --git a/skills/agent-device/references/batching-for-agents.md b/skills/agent-device/references/batching-for-agents.md
@@ -0,0 +1,85 @@
+# Batching for agents
+
+## When to use batch
+
+- The agent already knows a short sequence of commands.
+- Steps belong to one logical screen flow.
+- You want one result object with per-step timing and failure context.
+
+## When not to use batch
+
+- Flows are unrelated and should be retried independently.
+- The workflow is highly dynamic and requires replanning after each step.
+- You need human approvals between steps.
+
+## CLI patterns
+
+From file:
+
+```bash
+agent-device batch --session sim --platform ios --steps-file /tmp/batch-steps.json --json
+```
+
+From stdin:
+
+```bash
+cat /tmp/batch-steps.json | agent-device batch --session sim --platform ios --steps-stdin --json
+```
+
+Inline (small payloads only):
+
+```bash
+agent-device batch --steps '[{"command":"open","positionals":["settings"]}]'
+```
+
+## Step payload contract
+
+```json
+[
+  { "command": "open", "positionals": ["settings"], "flags": {} },
+  { "command": "wait", "positionals": ["label=\"Privacy & Security\"", "3000"], "flags": {} },
+  { "command": "click", "positionals": ["label=\"Privacy & Security\""], "flags": {} },
+  { "command": "get", "positionals": ["text", "label=\"Tracking\""], "flags": {} }
+]
+```
+
+Rules:
+
+- `positionals` optional, defaults to `[]`.
+- `flags` optional, defaults to `{}`.
+- nested `batch` and `replay` are rejected.
+- stop-on-first-error is the supported mode (`--on-error stop`).
+
+## Response handling
+
+Success includes:
+
+- `total`, `executed`, `totalDurationMs`
+- `results[]` entries with `step`, `command`, `durationMs`, and optional `data`
+
+Failure includes:
+
+- `details.step`
+- `details.command`
+- `details.executed`
+- `details.partialResults`
+
+Use these fields to replan from the first failing step.
+
+## Common error categories and agent actions
+
+- `INVALID_ARGS`: payload/step shape issue; fix payload and retry.
+- `SESSION_NOT_FOUND`: open or select the correct session, then retry.
+- `UNSUPPORTED_OPERATION`: switch command/target to supported operation.
+- `AMBIGUOUS_MATCH`: refine selector/locator, then retry failed step.
+- `COMMAND_FAILED`: add sync guard (`wait`, `is exists`) and retry from failed step.
+
+## Reliability guardrails
+
+- Add sync guards after mutating steps.
+- Assume snapshot/ref drift after navigation.
+- Keep batch size moderate (about 5-20 steps).
+- Split long workflows into phases:
+  1. navigate
+  2. verify/extract
+  3. cleanup
diff --git a/website/docs/docs/_meta.json b/website/docs/docs/_meta.json
@@ -19,6 +19,11 @@
     "type": "file",
     "label": "Commands"
   },
+  {
+    "name": "batching",
+    "type": "file",
+    "label": "Batching for Agents"
+  },
   {
     "name": "selectors",
     "type": "file",
diff --git a/website/docs/docs/batching.md b/website/docs/docs/batching.md
@@ -0,0 +1,120 @@
+---
+title: Batching for Agents
+---
+
+# Batching for Agents
+
+Use `batch` to run multiple commands in a single daemon request.
+
+This is useful for agent workflows that already know the next sequence of actions and want to reduce orchestration overhead.
+
+## CLI examples
+
+From a file:
+
+```bash
+agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-file /tmp/batch-steps.json \
+  --json
+```
+
+From stdin:
+
+```bash
+cat /tmp/batch-steps.json | agent-device batch \
+  --session sim \
+  --platform ios \
+  --udid 00008150-001849640CF8401C \
+  --steps-stdin \
+  --json
+```
+
+Inline for small payloads:
+
+```bash
+agent-device batch --steps '[{"command":"open","positionals":["settings"]},{"command":"wait","positionals":["100"]}]'
+```
+
+## Step payload format
+
+`batch` accepts a JSON array of steps:
+
+```json
+[
+  { "command": "open", "positionals": ["settings"], "flags": {} },
+  { "command": "wait", "positionals": ["label=\"Privacy & Security\"", "3000"], "flags": {} },
+  { "command": "click", "positionals": ["label=\"Privacy & Security\""], "flags": {} },
+  { "command": "get", "positionals": ["text", "label=\"Tracking\""], "flags": {} }
+]
+```
+
+Notes:
+
+- `positionals` is optional (defaults to `[]`).
+- `flags` is optional (defaults to `{}`).
+- nested `batch` and `replay` steps are rejected.
+- `--on-error stop` is the supported behavior.
+
+## Response shape
+
+Success:
+
+```json
+{
+  "success": true,
+  "data": {
+    "total": 4,
+    "executed": 4,
+    "totalDurationMs": 1810,
+    "results": [
+      { "step": 1, "command": "open", "ok": true, "durationMs": 1020 },
+      { "step": 2, "command": "wait", "ok": true, "durationMs": 320 },
+      { "step": 3, "command": "click", "ok": true, "durationMs": 260 },
+      { "step": 4, "command": "get", "ok": true, "durationMs": 210, "data": { "text": "..." } }
+    ]
+  }
+}
+```
+
+Failure:
+
+```json
+{
+  "success": false,
+  "error": {
+    "code": "COMMAND_FAILED",
+    "message": "Batch failed at step 3 (click): ...",
+    "details": {
+      "step": 3,
+      "command": "click",
+      "positionals": ["label=\"Privacy & Security\""],
+      "executed": 2,
+      "total": 4,
+      "partialResults": [
+        { "step": 1, "command": "open", "ok": true },
+        { "step": 2, "command": "wait", "ok": true }
+      ]
+    }
+  }
+}
+```
+
+## Agent best practices
+
+- Batch only one related screen flow at a time.
+- After mutating steps (`open`, `click`, `fill`, `swipe`), add a sync guard (`wait`, `is exists`) before critical reads.
+- Treat prior refs/snapshots as stale after UI changes.
+- Prefer `--steps-file` or `--steps-stdin` over inline JSON.
+- Keep batches moderate (about 5-20 steps).
+- Replan from the failing step using `details.step` and `details.partialResults`.
+
+## Stale accessibility tree risk
+
+Rapid UI changes can outpace accessibility tree updates. Mitigate by inserting explicit waits and splitting long workflows into phases:
+
+1. navigate
+2. verify/extract
+3. cleanup
diff --git a/website/docs/docs/commands.md b/website/docs/docs/commands.md
@@ -88,6 +88,21 @@ agent-device replay -u ./session.ad   # Update selector drift and rewrite .ad sc
 
 See [Replay & E2E (Experimental)](/docs/replay-e2e) for recording and CI workflow details.
 
+## Batch
+
+```bash
+agent-device batch --steps-file /tmp/batch-steps.json --json
+agent-device batch --steps-stdin --json
+agent-device batch --steps '[{"command":"open","positionals":["settings"]}]'
+```
+
+- `batch` runs a JSON array of steps in a single daemon request.
+- each step has `command`, optional `positionals`, and optional `flags`.
+- stop-on-first-error is the supported behavior (`--on-error stop`).
+- use `--max-steps <n>` to tighten per-request safety limits.
+
+See [Batching for Agents](/docs/batching) for payload format, response shape, and agent guidelines.
+
 ## App reinstall (fresh state)
 
 ```bash
diff --git a/website/docs/docs/quick-start.md b/website/docs/docs/quick-start.md