Updated: January 17, 2026 | Version: 1.1.0
Goal: Record -> Train -> Evaluate GUI automation workflows.
| Package | Python | Notes |
|---|---|---|
openadapt (meta) |
>=3.10 | |
openadapt-capture |
>=3.10 | |
openadapt-ml |
>=3.10 | PR #5 fixes Python version |
openadapt-evals |
>=3.10 | |
openadapt-viewer |
>=3.10 | |
openadapt-grounding |
>=3.10 | 53 tests passing |
openadapt-retrieval |
>=3.10 | 28 tests passing |
openadapt-privacy |
>=3.10 |
- PR: #969, branch
fix/ruff-format-config - Action: Merge PR, verify CI passes (Python 3.10-3.12, macOS/Ubuntu)
- Location:
legacy/deploy/deploy/models/omniparser/Dockerfile - Actions:
- Test
docker build - Verify CUDA/GPU support
- Test model download (huggingface-cli)
- Test
- Test
pip install openadapt[core]andopenadapt[all]on Python 3.10+ - Verify:
python -c "from openadapt.cli import main"
Test the core workflow:
openadapt capture start --name my-task
openadapt train start --capture my-task
openadapt eval run --checkpoint model.ptBlockers: capture stop TODO (uses Ctrl+C), macOS needs Accessibility + Screen Recording permissions
- Package:
openadapt-ml - Test: Claude, GPT-4V, Gemini, Qwen3-VL adapters
- Add rate limit / API failure handling
- Key result: 46.7% -> 100% first-action accuracy with demo conditioning (on shared-entry-point benchmark)
- Integrate
openadapt-retrievalwithopenadapt-mladapters - Add
--demoflag toopenadapt eval run
- Blocked on: Azure VM with nested virtualization
- Goal: Validate demo-conditioning claims on Windows Agent Arena
- Pre-action validation, dangerous action detection
- Optional human-in-the-loop confirmation
- Integrate with
openadapt-mlaction replay - Providers: OmniGrounder (CUDA), GeminiGrounder (API), SoMGrounder (CUDA)
- Add video playback, action timeline, side-by-side comparison
- Filter by action type
- Create
openadapt-telemetrypackage - GlitchTip/Sentry integration with privacy filtering
- Design:
docs/design/telemetry-design.md
- WebArena, OSWorld, MiniWoB++
- Deploy MkDocs to docs.openadapt.ai
| Phase | Criteria |
|---|---|
| P0 | CI green, Docker builds, openadapt[core] installs, basic workflow works |
| P1 | API agents with demo conditioning, WAA baseline established |
| P2 | Safety gates, grounding integration, viewer with video |
| P3 | Telemetry published, docs live, additional benchmarks |
capture stopuses Ctrl+C instead of signal/file- Legacy code in
/legacy/(not blocking v1.0)
| Resource | Status |
|---|---|
| Azure credits (WAA) | Available |
| API keys (Claude, GPT-4V, Gemini) | In openadapt-ml/.env |
| DNS (docs.openadapt.ai) | Configured |