Skip to content

Commit 79f4dfd

Browse files
committed
docs: improve skill mental model wording
1 parent dbe2199 commit 79f4dfd

2 files changed

Lines changed: 12 additions & 6 deletions

File tree

skills/agent-device/SKILL.md

Lines changed: 11 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -12,17 +12,23 @@ Use this skill as a router.
1212
- Open-ended bug hunt with reporting: use [../dogfood/SKILL.md](../dogfood/SKILL.md).
1313
- Pass/fail QA from acceptance criteria: stay in this skill, start with [references/bootstrap-install.md](references/bootstrap-install.md), then use the QA loop in [references/exploration.md](references/exploration.md).
1414

15-
Default route inside this skill: bootstrap -> exploration -> optional debugging or verification. Open the macOS and remote-tenancy references only for those exceptions.
15+
Default route inside this skill: bootstrap -> exploration -> optional debugging or verification. Open the macOS reference only for host Mac desktop work. Open the remote-tenancy reference only for remote daemon HTTP or lease flows.
16+
17+
## Mental model
18+
19+
- First choose the correct target and open the app or session you want to work on.
20+
- Then inspect the current UI with `snapshot -i` and pick targets from the actual UI state.
21+
- Act with `press`, `fill`, `get`, `is`, `wait`, or `find`.
22+
- Re-snapshot after meaningful UI changes instead of reusing stale refs.
23+
- End by capturing proof if needed, then `close`.
1624

1725
## Decision rules
1826

19-
- Use `app` sessions to act.
20-
- Use `frontmost-app`, `desktop`, and `menubar` mainly to inspect until helper interaction parity exists.
2127
- Use plain `snapshot` when you need to verify whether text is visible.
2228
- Use `snapshot -i` mainly for interactive exploration and choosing refs.
2329
- Use `fill` to replace text.
2430
- Use `type` to append text.
25-
- Prefer selector or `@ref` targeting over raw coordinates.
31+
- Prefer `@ref` or selector targeting over raw coordinates.
2632
- Keep the default loop short: `open` -> explore/act -> optional debug or verify -> `close`.
2733

2834
## Choose a reference
@@ -32,4 +38,4 @@ Default route inside this skill: bootstrap -> exploration -> optional debugging
3238
- Need logs, network, alerts, permissions, or failure triage: [references/debugging.md](references/debugging.md)
3339
- Need screenshots, diff, recording, replay maintenance, or perf data: [references/verification.md](references/verification.md)
3440
- Need host macOS desktop behavior or surface differences: [references/macos-desktop.md](references/macos-desktop.md)
35-
- Need remote daemon transport or lease admission: [references/remote-tenancy.md](references/remote-tenancy.md)
41+
- Need to connect to a remote `agent-device` daemon over HTTP or use tenant leases: [references/remote-tenancy.md](references/remote-tenancy.md)

skills/agent-device/references/macos-desktop.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ Open this file only when `--platform macos` is involved or the task needs `front
1515

1616
## Most common mistake to avoid
1717

18-
Do not treat desktop surfaces like normal action surfaces. Use `app` sessions to act, and use `frontmost-app`, `desktop`, or `menubar` mainly to inspect until helper interaction parity exists.
18+
Do not treat every macOS surface the same. Use the normal `app` surface when you want to act inside one app. Use `frontmost-app`, `desktop`, or `menubar` mainly to inspect what is visible before switching back to `app` for most interactions.
1919

2020
## Canonical loop
2121

0 commit comments

Comments
 (0)