You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/10_extracting_data.md
+135-1Lines changed: 135 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ The `get()` method allows you to extract information from the screen. You can us
7
7
- Get text or data from the screen
8
8
- Check the state of UI elements
9
9
- Make decisions based on screen content
10
-
- Analyze static images and documents (see [11_file_support.md](11_file_support.md))
10
+
- Analyze static images and documents
11
11
12
12
## Basic Usage
13
13
@@ -32,6 +32,140 @@ button_count = agent.get("How many buttons are visible on this page?")
32
32
33
33
Instead of taking a screenshot, you can also analyze specific images or documents. Please refer to [11_file_support.md](11_file_support.md) for detailed instructions.
34
34
35
+
## File Support
36
+
37
+
### Overview
38
+
The AskUI Python SDK supports the use of various file formats.
39
+
40
+
**Supported File Formats**
41
+
- PDF Files (.pdf)
42
+
- Excel Files (.xlsx, .xls)
43
+
- Word Files (.docx, .doc)
44
+
- CSV Files (.csv)
45
+
46
+
**Model Compatibility Matrix**
47
+
48
+
| File Format | AskUI Gemini | Anthropic Claude | Google Gemini
0 commit comments