Skip to content

Commit e061daa

Browse files
abrichrclaude
andcommitted
fix: review fixes — accuracy, claims, and add builders section
- Use 46.7% consistently (not 33-47% range) - Change "core goal" to "planned" in 2x2 matrix - Drop "superhuman" for Agent S3 (barely above human baseline) - Fix possessive "our" to "OpenAdapt's" in competitor table - Add "Built for Builders" section for non-technical users - Renumber subsequent sections Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 667a9bb commit e061daa

File tree

2 files changed

+31
-7
lines changed

2 files changed

+31
-7
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -170,8 +170,8 @@ Zero-shot VLMs fail on GUI tasks not due to lack of capability, but due to **amb
170170

171171
| | No Retrieval | With Retrieval |
172172
|---|---|---|
173-
| **No Fine-tuning** | 33–47% (zero-shot baseline) | **100%** (validated, n=45) |
174-
| **Fine-tuning** | Standard SFT baseline | **Demo-conditioned FT** (core goal) |
173+
| **No Fine-tuning** | 46.7% (zero-shot baseline) | **100%** (validated, n=45) |
174+
| **Fine-tuning** | Standard SFT (baseline) | **Demo-conditioned FT** (planned) |
175175

176176
The bottom-right cell is OpenAdapt's unique value: training models to **use** demonstrations they haven't seen before, combining retrieval with fine-tuning for maximum accuracy. Phase 2 (retrieval-only prompting) is validated; Phase 3 (demo-conditioned fine-tuning) is in progress.
177177

docs/design/landing-page-strategy.md

Lines changed: 29 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -269,8 +269,8 @@ Why: Clear 3-step process, action-oriented
269269
| Competitor | Strengths | Weaknesses | Our Advantage |
270270
|------------|-----------|------------|---------------|
271271
| **Anthropic Computer Use** | 72.5% OSWorld (near-human), simple API | Proprietary, cloud-only, no customization, per-action cost | Open source, model-agnostic, trainable, runs locally |
272-
| **Agent S3 (Simular)** | 72.6% OSWorld (superhuman), open source | Zero-shot only, no demo conditioning, no fine-tuning pipeline | Demo-conditioned agents, capture-to-train pipeline |
273-
| **OpenCUA (XLANG Lab)** | NeurIPS Spotlight, 45% OSWorld, open models (7B-72B) | Zero-shot at inference — demos used only for training, not runtime | Runtime demo conditioning (unique); OpenCUA reused our macOS a11y code |
272+
| **Agent S3 (Simular)** | 72.6% OSWorld, open source | Zero-shot only, no demo conditioning, no fine-tuning pipeline | Demo-conditioned agents, capture-to-train pipeline |
273+
| **OpenCUA (XLANG Lab)** | NeurIPS Spotlight, 45% OSWorld, open models (7B-72B) | Zero-shot at inference — demos used only for training, not runtime | Runtime demo conditioning (unique); OpenCUA reused OpenAdapt's macOS a11y code |
274274
| **Browser Use** | 50k+ GitHub stars, 89% WebVoyager | Browser-only, no desktop, no training pipeline | Full desktop support, fine-tuning, demo library |
275275
| **UI-TARS (ByteDance)** | Local models (2B-72B), Apache 2.0 | No demo conditioning, no capture pipeline | End-to-end record→train→deploy, demo retrieval |
276276
| **CUA / Bytebot** | Container infra, YC-backed | Infrastructure-only, no ML training pipeline | Full pipeline: capture + train + eval + deploy |
@@ -378,7 +378,31 @@ Your choice of AI: Claude, GPT-4V, Gemini, Qwen-VL, or fine-tune your own.
378378
MIT licensed. Run locally, in the cloud, or hybrid.
379379
```
380380

381-
### 5.5 For Developers Section
381+
### 5.5 For Builders Section
382+
383+
````
384+
## Built for Builders
385+
386+
### Show it once. Done.
387+
Record yourself doing a task. OpenAdapt handles the rest.
388+
No code, no prompts, no configuration.
389+
390+
### Three commands
391+
```bash
392+
pip install openadapt
393+
openadapt capture start --name my-task # Record
394+
openadapt run --capture my-task # Replay with AI
395+
```
396+
397+
### Works with the AI you already use
398+
Claude, GPT-4V, Gemini, Qwen — pick your model.
399+
Or let OpenAdapt train a small one that runs on your laptop.
400+
401+
### Your data stays yours
402+
Everything runs locally. Nothing leaves your machine unless you want it to.
403+
````
404+
405+
### 5.6 For Developers Section
382406

383407
````
384408
## Built for Developers
@@ -415,7 +439,7 @@ Compare your models against published baselines.
415439
[View Documentation] [GitHub Repository]
416440
````
417441

418-
### 5.6 For Enterprise Section
442+
### 5.7 For Enterprise Section
419443

420444
```
421445
## Enterprise-Ready Automation
@@ -438,7 +462,7 @@ Custom development, training, and support packages available.
438462
[Contact Sales: sales@openadapt.ai]
439463
```
440464

441-
### 5.7 Use Cases Section (Refined)
465+
### 5.8 Use Cases Section (Refined)
442466

443467
**Current**: Generic industry grid
444468

0 commit comments

Comments
 (0)