Skip to content

Commit ed5e9b5

Browse files
abrichrclaude
andcommitted
docs: add screenshots back to README
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 9e3e904 commit ed5e9b5

1 file changed

Lines changed: 17 additions & 0 deletions

File tree

README.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,23 @@ Evaluation infrastructure for GUI agent benchmarks, built for [OpenAdapt](https:
1111

1212
OpenAdapt Evals is a unified framework for evaluating GUI automation agents against standardized benchmarks such as [Windows Agent Arena (WAA)](https://microsoft.github.io/WindowsAgentArena/). It provides benchmark adapters, agent interfaces, Azure VM infrastructure for parallel evaluation, and result visualization -- everything needed to go from "I have a GUI agent" to "here are its benchmark scores."
1313

14+
## Benchmark Viewer
15+
16+
![Benchmark Viewer Animation](animations/benchmark-viewer.gif)
17+
18+
<details>
19+
<summary>More screenshots</summary>
20+
21+
**Task Detail View** -- step-by-step replay with screenshots, actions, and execution logs:
22+
23+
![Task Detail View](docs/screenshots/desktop_task_detail.png)
24+
25+
**Cost Tracking Dashboard** -- real-time Azure VM cost monitoring with tiered sizing and spot instances:
26+
27+
![Cost Dashboard](screenshots/cost_dashboard_preview.png)
28+
29+
</details>
30+
1431
## Key Features
1532

1633
- **Benchmark adapters** for WAA (live, mock, and local modes), with an extensible base for OSWorld, WebArena, and others

0 commit comments

Comments
 (0)