Choose the right prompt optimization provider for your workflow. Whisper transcription always uses OpenAI regardless of this choice.
flowchart TD
Start[Need prompt optimization?] -->|No| TranscribeOnly[Disable optimization<br/>Use Transcribe mode only]
Start -->|Yes| Privacy{Privacy priority?}
Privacy -->|Local / offline| Local[Ollama or OpenCode]
Privacy -->|Cloud OK| Existing{Already have API keys?}
Existing -->|OpenAI| OpenAI[OpenAI GPT-4o]
Existing -->|Anthropic| Anthropic[Anthropic Claude]
Existing -->|Google| Google[Google Gemini]
Existing -->|Cursor account| Cursor[Cursor SDK]
Existing -->|Many models / one key| OpenRouter[OpenRouter]
Existing -->|Enterprise Azure| Azure[Azure OpenAI]
Local --> OllamaChoice[Ollama: simple local models]
Local --> OpenCodeChoice[OpenCode: multi-provider proxy]
| Provider | Cost/Transform* | Speed | Privacy | Quality | Best For |
|---|---|---|---|---|---|
| OpenAI GPT-4o | ~$0.01 | Fast | Cloud | High | General use; reuse Whisper key |
| Anthropic Claude | ~$0.01–0.02 | Fast | Cloud | Very High | Complex reasoning |
| Google Gemini | ~$0.001 | Very Fast | Cloud | Good | Cost-sensitive usage |
| Azure OpenAI | Varies | Fast | Private Cloud | High | Enterprise deployments |
| Ollama | Free | Medium | Local | Good | Privacy-first, offline |
| OpenCode | Free | Medium | Local | High | Reuse OpenCode multi-provider setup |
| OpenRouter | Varies | Fast | Cloud | High | 200+ models with one API key |
| Cursor | ~$0.01 | Fast | Cloud | High | Cursor Composer and frontier models |
*Plus Whisper transcription (~$0.006/min, always OpenAI)
OpenAI — Same API key as Whisper, gpt-4o default, fast and reliable.
Anthropic Claude 3.5 Sonnet — Strong reasoning and structured output.
Google Gemini Flash — Very fast, low per-request cost.
Ollama — Run llama3.1:8b or similar locally. Whisper still sends audio to OpenAI.
OpenCode — Route through your existing opencode-llm-proxy with provider/model identifiers.
OpenRouter — Access OpenAI, Anthropic, Google, and more through a single gateway.
Cursor SDK — Use Cursor API key and models (composer-2.5, etc.) from any editor.
Azure OpenAI — Private cloud deployment with your own Azure resource.
- API keys are stored per provider (
promptimize.apiKey.{provider}) - Switching providers does not delete saved keys
- Change provider in the configuration webview or via Configure Prompt Optimization Provider
- Run Test Configuration after switching
See also: Configuration Guide · Configuration Webview Guide