Skip to content

Commit 6484bb8

Browse files
committed
Update v3.mdx
1 parent c8cbe70 commit 6484bb8

File tree

1 file changed

+19
-0
lines changed

1 file changed

+19
-0
lines changed

content/docs/v3.mdx

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ description: Major release focused on extensibility, expanded provider support,
1313
| **Gemini RAG** | Manage Gemini File Search Stores and manage document uploads for RAG workflows |
1414
| **Tool Support** | First-class Python function calling for LLM interactions with your local environment |
1515
| **MCP Support** | Connect to Model Context Protocol servers for extended tool capabilities |
16+
| **Computer Use** | Desktop automation - control mouse, keyboard, and take screenshots like a human |
1617
| **KaTeX Math Typesetting** | Support for beautiful rendering of LaTeX math expressions |
1718
| **Calculator UI** | Beautiful UX Friendly UI to evaluate python math expressions |
1819
| **Run Code UI** | Execute Python, JS, TypeScript and C# code scripts in a CodeMirror editor |
@@ -34,6 +35,7 @@ description: Major release focused on extensibility, expanded provider support,
3435
- [Tool Support](#tool-support)
3536
- [MCP Support](#mcp-support)
3637
- [Core Tools](#core-tools)
38+
- [Computer Use](#computer-use)
3739
- [Calculator UI](#calculator-ui)
3840
- [Run Code UI](#run-code-ui)
3941
- [KaTeX Math Typesetting](#katex-math-typesetting)
@@ -555,6 +557,23 @@ LLMS includes a suite of tools for executing code in various languages within a
555557
'Run C#': '/img/tools/tool-csharp.webp',
556558
}} />
557559

560+
## Computer Use
561+
562+
The built-in [computer_use](https://github.com/ServiceStack/llms/tree/main/llms/extensions/computer_use) extension transforms AI agents into autonomous computer operators. Based on [Anthropic's computer use tools](https://github.com/anthropics/anthropic-quickstarts/tree/main/computer-use-demo), it enables agents to see your screen, control the mouse and keyboard, execute shell commands, and edit files - just like a human sitting at the computer.
563+
564+
This unlocks powerful capabilities that traditional API-based tools cannot achieve:
565+
566+
- **Visual Verification**: Confirm that code actually renders correctly in a browser
567+
- **Desktop Automation**: Control any GUI application - web browsers, IDEs, terminals
568+
- **End-to-End Workflows**: Chain together multiple applications in a single task
569+
- **Legacy Applications**: Automate software that lacks APIs
570+
571+
For example, an agent can write a web application, open a browser, and capture a screenshot to prove it works:
572+
573+
<Screenshot src="/img/tools/computer_use_tetris.webp" />
574+
575+
See the [Computer Use](/docs/extensions/computer_use) docs for complete usage details.
576+
558577
## Calculator UI
559578

560579
As some core tools are particularly useful on their own, dedicated UIs has been added for the `calc` tool with support for evaluating mathematical python expressions, including arithmetic, comparison, boolean operators, `math.*` functions & constants and python list comprehensions

0 commit comments

Comments
 (0)