update readme

filip-michalsky · filip-michalsky · commit d665e99649e0 · 2025-06-15T11:20:10.000-04:00
diff --git a/README.md b/README.md
@@ -36,12 +36,26 @@
 </p>
 
 <div class="note" style="background-color: #808096; border-left: 5px solid #ffeb3b; padding: 15px; margin: 10px 0; color: white;">
-  <strong>NOTE:</strong> This is a Python SDK for Stagehand. The original implementation is in TypeScript and is available <a href="https://github.com/browserbase/stagehand" style="color: blue;">here</a>.
+  This is a Python SDK for Stagehand. We also have a TypeScript SDK available <a href="https://github.com/browserbase/stagehand" style="color: blue;">here</a>.
 </div>
 
----
+> Stagehand Python SDK is currently available as an early release, and we're actively seeking feedback from the community. Please join our [Slack community](https://stagehand.dev/slack) to stay updated on the latest developments and provide feedback.  
 
-Stagehand is the easiest way to build browser automations with AI-powered interactions.
+## Why Stagehand?
+
+*Stagehand is the easiest way to build browser automations with AI-powered interactions.*
+
+Most existing browser automation tools either require you to write low-level code in a framework like Selenium, Playwright, or Puppeteer, or use high-level agents that can be unpredictable in production. By letting developers choose what to write in code vs. natural language, Stagehand is the natural choice for browser automations in production.
+
+1. **Choose when to write code vs. natural language**: use AI when you want to navigate unfamiliar pages, and use code ([Playwright](https://playwright.dev/)) when you know exactly what you want to do.
+
+2. **Preview and cache actions**: Stagehand lets you preview AI actions before running them, and also helps you easily cache repeatable actions to save time and tokens.
+
+3. **Computer use models with one line of code**: Stagehand lets you integrate SOTA computer use models from OpenAI and Anthropic into the browser with one line of code.
+
+-----
+
+### TL;DR Automate the web *reliably* with natural language:
 
 - **act** — Instruct the AI to perform actions (e.g. click a button or scroll).
 ```python
@@ -60,6 +74,12 @@ await stagehand.page.observe("find the search bar")
 await stagehand.agent.execute("book a reservation for 2 people for a trip to the Maldives")
 ```
 
+
+## Installation:
+
+`pip install stagehand`
+
+
 ## Quickstart
 
 ```python
@@ -145,174 +165,9 @@ if __name__ == "__main__":
     asyncio.run(main())
 ```
 
-## Installation
-
-### Creating a Virtual Environment (Recommended)
-
-First, create and activate a virtual environment to keep your project dependencies isolated:
-
-```bash
-# Create a virtual environment
-python -m venv stagehand-env
-
-# Activate the environment
-# On macOS/Linux:
-source stagehand-env/bin/activate
-# On Windows:
-stagehand-env\Scripts\activate
-```
-
-### Install Stagehand
-
-**Normal Installation:**
-```bash
-pip install stagehand
-```
+## Documentation
 
-**Local Development Installation:**
-If you're contributing to Stagehand or want to modify the source code:
-
-```bash
-# Clone the repository
-git clone https://github.com/browserbase/stagehand-python.git
-cd stagehand-python
-
-# Install in editable mode with development dependencies
-pip install -e ".[dev]"
-```
-
-## Requirements
-
-- Python 3.9+
-- All dependencies are automatically handled when installing via `pip`
-
-The main dependencies include:
-- httpx (for async HTTP client)
-- requests (for sync HTTP client)
-- pydantic (for data validation)
-- playwright (for browser automation)
-- python-dotenv (for environment variable support)
-- browserbase (for Browserbase integration)
-
-### Development Dependencies
-
-The development dependencies are automatically installed when using `pip install -e ".[dev]"` and include:
-- pytest, pytest-asyncio, pytest-mock, pytest-cov (testing)
-- black, isort, ruff (code formatting and linting)
-- mypy (type checking)
-- rich (enhanced terminal output)
-
-## Environment Variables
-
-Before running your script, copy `.env.example` to `.env.` set the following environment variables:
-
-```bash
-export BROWSERBASE_API_KEY="your-api-key" # if running remotely
-export BROWSERBASE_PROJECT_ID="your-project-id" # if running remotely
-export MODEL_API_KEY="your-openai-api-key"  # or your preferred model's API key
-export STAGEHAND_API_URL="url-of-stagehand-server" # if running remotely
-export STAGEHAND_ENV="BROWSERBASE" # or "LOCAL" to run Stagehand locally
-```
-
-You can also make a copy of `.env.example` and add these to your `.env` file.
-
-## Agent Example
-
-```python
-import os
-from stagehand.sync import Stagehand
-from stagehand import StagehandConfig
-from stagehand.schemas import AgentConfig, AgentExecuteOptions, AgentProvider
-from dotenv import load_dotenv
-
-load_dotenv()
-
-def main():
-    # Configure Stagehand
-    config = StagehandConfig(
-        env="BROWSERBASE",
-        api_key=os.getenv("BROWSERBASE_API_KEY"),
-        project_id=os.getenv("BROWSERBASE_PROJECT_ID"),
-        model_name="gpt-4o",
-        model_client_options={"apiKey": os.getenv("MODEL_API_KEY")}
-    )
-
-    # Initialize Stagehand
-    stagehand = Stagehand(config=config, api_url=os.getenv("STAGEHAND_API_URL"))
-    stagehand.init()
-    print(f"Session created: {stagehand.session_id}")
-    
-    # Navigate to Google
-    stagehand.page.goto("https://google.com/")
-    
-    # Configure the agent
-    agent_config = AgentConfig(
-        provider=AgentProvider.OPENAI,
-        model="computer-use-preview",
-        instructions="You are a helpful web navigation assistant. You are currently on google.com."
-        options={"apiKey": os.getenv("MODEL_API_KEY")}
-    )
-    
-    # Define execution options
-    execute_options = AgentExecuteOptions(
-        instruction="Search for 'latest AI news' and extract the titles of the first 3 results",
-        max_steps=10,
-        auto_screenshot=True
-    )
-    
-    # Execute the agent task
-    agent_result = stagehand.agent.execute(agent_config, execute_options)
-    
-    print(f"Agent execution result: {agent_result}")
-    
-    # Close the session
-    stagehand.close()
-
-if __name__ == "__main__":
-    main()
-```
-
-## Pydantic Schemas
-
-- **ActOptions**  
-
-  The `ActOptions` model takes an `action` field that tells the AI what to do on the page, plus optional fields such as `useVision` and `variables`:
-  ```python
-  from stagehand.schemas import ActOptions
-  
-  # Example:
-  await page.act(ActOptions(action="click on the 'Quickstart' button"))
-  ```
-
-- **ObserveOptions**  
-
-  The `ObserveOptions` model lets you find elements on the page using natural language. The `onlyVisible` option helps limit the results:
-  ```python
-  from stagehand.schemas import ObserveOptions
-  
-  # Example:
-  await page.observe(ObserveOptions(instruction="find the button labeled 'News'", onlyVisible=True))
-  ```
-
-- **ExtractOptions**  
-
-  The `ExtractOptions` model extracts structured data from the page. Pass your instructions and a schema defining your expected data format. **Note:** If you are using a Pydantic model for the schema, call its `.model_json_schema()` method to ensure JSON serializability.
-  ```python
-  from stagehand.schemas import ExtractOptions
-  from pydantic import BaseModel
-  
-  class DescriptionSchema(BaseModel):
-      description: str
-  
-  # Example:
-  data = await page.extract(
-      ExtractOptions(
-          instruction="extract the description of the page",
-          schemaDefinition=DescriptionSchema.model_json_schema()
-      )
-  )
-  description = data.get("description") if isinstance(data, dict) else data.description
-  ```
+See our full documentation [here](https://docs.stagehand.dev/).
 
 ## Actions caching
 
@@ -338,78 +193,6 @@ action_preview = await page.observe("Click the quickstart link")
 await page.act(action_preview[0])
 ```
 
-### Simple caching
-
-Here's an example of implementing a simple file-based cache:
-
-```python
-import json
-from pathlib import Path
-from typing import Optional, Dict, Any
-
-# Get the cached value (None if it doesn't exist)
-async def get_cache(key: str) -> Optional[Dict[str, Any]]:
-    try:
-        cache_path = Path("cache.json")
-        if not cache_path.exists():
-            return None
-        with open(cache_path) as f:
-            cache = json.load(f)
-            return cache.get(key)
-    except Exception:
-        return None
-
-# Set the cache value
-async def set_cache(key: str, value: Dict[str, Any]) -> None:
-    cache_path = Path("cache.json")
-    cache = {}
-    if cache_path.exists():
-        with open(cache_path) as f:
-            cache = json.load(f)
-    cache[key] = value
-    with open(cache_path, "w") as f:
-        json.dump(cache, f)
-```
-
-### Act with cache
-
-Here's a function that checks the cache, gets the action, and runs it:
-
-```python
-async def act_with_cache(page, key: str, prompt: str):
-    # Check if we have a cached action
-    cached_action = await get_cache(key)
-    
-    if cached_action:
-        # Use the cached action
-        action = cached_action
-    else:
-        # Get the observe result (the action)
-        action = await page.observe(prompt)
-        # Cache the action
-        await set_cache(key, action[0])
-    
-    # Run the action (no LLM inference)
-    await page.act(action[0])
-```
-
-You can now use `act_with_cache` to run an action with caching:
-
-```python
-prompt = "Click the quickstart link"
-key = prompt  # Simple cache key
-await act_with_cache(page, key, prompt)
-```
-
-
-## Why?
-**Stagehand adds determinism to otherwise unpredictable agents.**
-
-While there's no limit to what you could instruct Stagehand to do, our primitives allow you to control how much you want to leave to an AI. It works best when your code is a sequence of atomic actions. Instead of writing a single script for a single website, Stagehand allows you to write durable, self-healing, and repeatable web automation workflows that actually work.
-
-> [!NOTE] 
-> `Stagehand` is currently available as an early release, and we're actively seeking feedback from the community. Please join our [Slack community](https://join.slack.com/t/stagehand-dev/shared_invite/zt-2tdncfgkk-fF8y5U0uJzR2y2_M9c9OJA) to stay updated on the latest developments and provide feedback.
-
 
 ## Configuration
 
@@ -449,6 +232,33 @@ config = StagehandConfig(
 )
 ```
 
+## Contributing
+
+First, create and activate a virtual environment to keep your project dependencies isolated:
+
+```bash
+# Create a virtual environment
+python -m venv stagehand-env
+
+# Activate the environment
+# On macOS/Linux:
+source stagehand-env/bin/activate
+# On Windows:
+stagehand-env\Scripts\activate
+```
+
+**Local Development Installation:**
+
+
+```bash
+# Clone the repository
+git clone https://github.com/browserbase/stagehand-python.git
+cd stagehand-python
+
+# Install in editable mode with development dependencies
+pip install -e ".[dev]"
+```
+
 
 ## License