Skip to content

Commit 7cc3d45

Browse files
committed
fix: correct step order and broken E2B Desktop repo link
1 parent f98f78d commit 7cc3d45

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

docs/use-cases/computer-use.mdx

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -12,16 +12,16 @@ For a complete working implementation, see [E2B Surf](https://github.com/e2b-dev
1212

1313
The computer use agent loop follows this pattern:
1414

15-
1. **E2B creates a desktop sandbox**an Ubuntu 22.04 environment with [XFCE](https://xfce.org/) desktop and pre-installed applications
16-
2. **User sends a command**e.g., "Open Firefox and search for AI news"
15+
1. **User sends a command**e.g., "Open Firefox and search for AI news"
16+
2. **Agent creates a desktop sandbox**an Ubuntu 22.04 environment with [XFCE](https://xfce.org/) desktop and pre-installed applications
1717
3. **Agent takes a screenshot** — captures the current desktop state via E2B Desktop SDK
1818
4. **LLM analyzes the screenshot** — a vision model (e.g., [OpenAI Computer Use API](https://platform.openai.com/docs/guides/computer-use)) decides what action to take
1919
5. **Action is executed** — click, type, scroll, or keypress via E2B Desktop SDK
2020
6. **Repeat** — new screenshot is taken and sent back to the LLM until the task is complete
2121

2222
## Install the E2B Desktop SDK
2323

24-
The [E2B Desktop](https://github.com/e2b-dev/e2b-desktop) SDK gives your agent a full Linux desktop with mouse, keyboard, and screen capture APIs.
24+
The [E2B Desktop](https://github.com/e2b-dev/desktop) SDK gives your agent a full Linux desktop with mouse, keyboard, and screen capture APIs.
2525

2626
<CodeGroup>
2727
```bash JavaScript & TypeScript

0 commit comments

Comments
 (0)