Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .changeset/tricky-bats-pay.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
---
"@browserbasehq/stagehand": patch
---

Update docs / logging to reflect gpt 5.4 and gemini 3.1 family compatibility with agent hybrid mode
5 changes: 3 additions & 2 deletions claude.md
Original file line number Diff line number Diff line change
Expand Up @@ -238,8 +238,9 @@ Hybrid mode uses both DOM-based and coordinate-based tools (act, click, type, dr

**Recommended models for hybrid mode:**

- `google/gemini-3-flash-preview`
- `anthropic/claude-sonnet-4-20250514`, `anthropic/claude-sonnet-4-5-20250929`, `anthropic/claude-haiku-4-5-20251001`
- `google/gemini-3-flash-preview`, `google/gemini-3.1-flash-lite-preview`, `google/gemini-3.1-pro-preview`
- `openai/gpt-5.4`, `openai/gpt-5.4-mini`
- Any `anthropic/claude-*` model

```typescript
const stagehand = new Stagehand({
Expand Down
2 changes: 2 additions & 0 deletions packages/core/lib/v3/handlers/v3AgentHandler.ts
Original file line number Diff line number Diff line change
Expand Up @@ -166,6 +166,8 @@ export class V3AgentHandler {
if (
this.mode === "hybrid" &&
!baseModel.modelId.includes("gemini-3-flash") &&
!baseModel.modelId.includes("gemini-3.1") &&
!baseModel.modelId.includes("gpt-5.4") &&
Comment thread
tkattkat marked this conversation as resolved.
!baseModel.modelId.includes("claude")
) {
this.logger({
Expand Down
5 changes: 3 additions & 2 deletions packages/docs/v3/basics/agent.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -137,8 +137,9 @@ Both DOM and CUA modes have their strengths and weaknesses. Hybrid mode combines

<Warning>
**Model Requirements:** Hybrid mode requires models that can reliably perform coordinate-based actions from screenshots. The following models are recommended:
- **Google:** `google/gemini-3-flash-preview`
- **Anthropic:** `anthropic/claude-sonnet-4-20250514`, `anthropic/claude-sonnet-4-5-20250929`, `anthropic/claude-haiku-4-5-20251001`
- **Google:** `google/gemini-3-flash-preview`, `google/gemini-3.1-flash-lite-preview`, `google/gemini-3.1-pro-preview`
- **OpenAI:** `openai/gpt-5.4`, `openai/gpt-5.4-mini`
- **Anthropic:** Any `anthropic/claude-*` model

Other models may not reliably produce accurate coordinates for clicking and typing.

Expand Down
5 changes: 3 additions & 2 deletions packages/docs/v3/references/agent.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -131,8 +131,9 @@ interface AgentInstance {

<Warning>
**Hybrid Mode Model Requirements:** Only use hybrid mode with models that can reliably perform coordinate-based actions:
- **Google:** `google/gemini-3-flash-preview`
- **Anthropic:** `anthropic/claude-sonnet-4-20250514`, `anthropic/claude-sonnet-4-5-20250929`, `anthropic/claude-haiku-4-5-20251001`
- **Google:** `google/gemini-3-flash-preview`, `google/gemini-3.1-flash-lite-preview`, `google/gemini-3.1-pro-preview`
- **OpenAI:** `openai/gpt-5.4`, `openai/gpt-5.4-mini`
- **Anthropic:** Any `anthropic/claude-*` model

Requires `experimental: true` in Stagehand constructor.
</Warning>
Expand Down