-
Notifications
You must be signed in to change notification settings - Fork 10
docs(pricing-may-2026): customer-supplied inference (BYOK + CIE + BYOLLM) #115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
4fcbfed
86818df
d3c5b54
5bf93db
7e7d0d0
504efc4
4bff48e
97590c9
14ab6e2
7ee9219
3501639
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,29 +1,46 @@ | ||
| --- | ||
| title: Bring Your Own API Key | ||
| title: Bring your own API key | ||
| description: >- | ||
| Warp's paid plans include the ability to bring your own API keys (BYOK) for | ||
| OpenAI, Anthropic, and Google AI models. | ||
| Use your own OpenAI, Anthropic, or Google API keys. Never consumes AI | ||
| credits — on Business and Enterprise, platform credits may apply for | ||
| local agent runs. | ||
| --- | ||
|
|
||
| Warp supports **Bring Your Own Key (BYOK)** for users who want to connect Warp’s agent to their own Anthropic, OpenAI, or Google API accounts. | ||
| Warp supports **Bring your own API key (BYOK)** for users who want to connect Warp's agents to their own Anthropic, OpenAI, or Google API accounts. | ||
|
|
||
| This lets you use your own API keys to access models directly, giving you full control over model selection, billing, and data routing. See [Model Choice](/agent-platform/capabilities/model-choice/) for a list of supported models. | ||
| BYOK gives you full control over model selection, billing, and data routing. See [Model Choice](/agent-platform/capabilities/model-choice/) for the full list of supported models. When you route a request through your own key, Warp **never consumes your** [credits](/support-and-community/plans-and-billing/credits/) for that request. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
|
||
|
|
||
| BYOK provides greater flexibility in model access and ensures Warp **never consumes your** [credits](/support-and-community/plans-and-billing/credits/) for requests routed through your own keys. | ||
| :::note | ||
| On the Business and Enterprise plans, local agent runs that use BYOK still consume platform credits for Warp's platform infrastructure (run lifecycle, integrations, observability). See [platform credits](/support-and-community/plans-and-billing/platform-credits/) for what's covered. | ||
| ::: | ||
|
|
||
| :::note | ||
| BYOK is currently only available on Warp's paid plans, starting with Build. Learn more about plans and pricing [warp.dev/pricing](https://www.warp.dev/pricing). | ||
| BYOK is available on the Free plan and on all eligible paid plans. See [warp.dev/pricing](https://www.warp.dev/pricing) for the current list of eligible plans. | ||
| ::: | ||
|
|
||
| :::caution | ||
| BYOK and customer-supplied inference (BYOLLM via Amazon Bedrock or Google Vertex, plus custom endpoints) are available to individual users and organizations with 10 or fewer employees or users on any plan. Organizations with more than 10 employees or users must be on a Warp Business or Enterprise plan to use BYOK or customer-supplied inference. See Warp's [Terms of Service](https://www.warp.dev/terms-of-service) for details. | ||
| :::note | ||
| BYOK and custom inference endpoint support are available for individual users and organizations with 10 or fewer employees, subject to Warp's [Terms of Service](https://www.warp.dev/terms-of-service). Companies or organizations with more than 10 employees require a Warp Business or Enterprise plan to use BYOK or customer-supplied inference. | ||
| ::: | ||
|
|
||
| ## How does BYOK work? | ||
| ## How BYOK differs from Custom inference endpoint and BYOLLM | ||
|
|
||
| Warp offers three ways to bring your own AI infrastructure. Use this table to pick the right one, and follow the links for full details. | ||
|
|
||
| | Name | Meaning | Plans | | ||
| | --- | --- | --- | | ||
| | **Bring your own API key** (BYOK) | Use your own API key for OpenAI, Anthropic, or Google models. Keys are stored locally on your device. | Free and all eligible paid plans | | ||
| | **[Custom inference endpoint](/support-and-community/plans-and-billing/custom-inference-endpoint/)** (CIE) | Connect Warp to an OpenAI-compatible endpoint such as OpenRouter, LiteLLM, z.ai, or an internal gateway. | Free and all eligible paid plans | | ||
| | **[Bring your own LLM](/enterprise/enterprise-features/bring-your-own-llm/)** (BYOLLM) | Enterprise-managed inference through your cloud provider (AWS Bedrock, Azure Foundry, Google Vertex) or approved internal infrastructure, with Warp handling routing, orchestration, governance, and observability. | Enterprise only | | ||
|
|
||
| See [warp.dev/pricing](https://www.warp.dev/pricing) for current plan availability. | ||
|
|
||
| Platform credits may apply for local agent runs on Business and Enterprise when using BYOK, CIE, or BYOLLM. See [platform credits](/support-and-community/plans-and-billing/platform-credits/). | ||
|
|
||
| ## How BYOK works | ||
|
|
||
| When you add your own model API keys in Warp, those keys are stored **locally on your device** and are **never synced to the cloud**. | ||
|
|
||
| Warp uses these API keys to directly route your agent requests to the model provider you've configured. | ||
| Warp uses these API keys to route your agent requests directly to the model provider you've configured. | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Is this correct to say? Requests still technically go through our server right? |
||
|
|
||
| :::caution | ||
| BYOK does not apply to [Cloud Agents](/agent-platform/cloud-agents/overview/). Because your API keys are stored locally on your device, they are not available to cloud-hosted agent runs. Cloud agent runs always consume [Warp credits](/support-and-community/plans-and-billing/credits/). | ||
|
|
@@ -57,9 +74,9 @@ When you explicitly select a model with a key icon, Warp routes requests through | |
|
|
||
| ### Auto Model | ||
|
|
||
| Warp's **Auto** models dynamically route requests across different models based on context and performance. Because this routing logic depends on Warp’s infrastructure, **Auto always consumes Warp's credits**, even if you’ve configured your own API keys. | ||
| Warp's **Auto** models dynamically route requests across different models based on context and performance. Because this routing logic depends on Warp's infrastructure, **Auto always consumes Warp's credits**, even if you've configured your own API keys. | ||
|
|
||
| To use your own key, select a specific provider model (for example, Claude Sonnet 4.5, GPT-5, or Gemini 2.5 Pro) directly from the model picker with a key icon. | ||
| To use your own key, select a specific provider model (for example, Claude Opus 4.7, Claude Sonnet 4.6, GPT-5.5, or Gemini 3.1 Pro) directly from the model picker with a key icon. | ||
|
|
||
| ### Credit usage | ||
|
|
||
|
|
@@ -97,7 +114,7 @@ If your key: | |
|
|
||
| **Failover and fallback:** | ||
|
|
||
| By default, Warp does not fall back to your credits when a BYOK (Bring Your Own Key) request fails. | ||
| By default, Warp does not fall back to your credits when a BYOK request fails. | ||
|
|
||
| You can choose to enable **Warp credit fallback**. When enabled, if an agent request fails with your BYOK model (for example, due to an API error or quota limit), Warp will automatically route the request to one of Warp’s provided models. Warp always prioritizes your API keys first and only uses Warp credits when necessary. | ||
|
|
||
|
|
@@ -117,12 +134,19 @@ Warp itself never stores your LLM API keys. | |
|
|
||
| ### BYOK on Enterprise and Business plans | ||
|
|
||
| Organizations with more than 10 employees or users must be on a Warp Business or Enterprise plan to use BYOK or customer-supplied inference. See Warp's [Terms of Service](https://www.warp.dev/terms-of-service) for the full eligibility rule. | ||
| BYOK is available to individual users and to organizations with 10 or fewer employees, subject to Warp's [Terms of Service](https://www.warp.dev/terms-of-service). Companies or organizations with more than 10 employees need a Warp Business or Enterprise plan to use BYOK or customer-supplied inference. | ||
|
|
||
| Today, BYOK is configured at the **user level** on every plan, including Enterprise and Business: | ||
|
|
||
| * Each team member can add and manage their own API keys locally on their device. | ||
| * Centrally configured, admin-managed BYOK is not yet available — admins cannot enforce or share API keys across team members from a single place. | ||
| * There is no organization-level Admin Panel for BYOK management today. | ||
|
|
||
| Currently, BYOK is configured at the **user level**, not the team or admin level: | ||
| If your organization needs centrally managed model routing now, see [Bring your own LLM](/enterprise/enterprise-features/bring-your-own-llm/) for the Enterprise-managed option. To discuss a fit, contact us at [warp.dev/contact-sales](https://www.warp.dev/contact-sales). | ||
|
|
||
| * Each team member can add and manage their own API keys locally. | ||
| * Team admins cannot yet enforce or share API keys across members. | ||
| * There is currently no organization-level Admin Panel for BYOK management. | ||
| ## Related resources | ||
|
|
||
| If your organization has specific needs for managed keys or enterprise-level control, please contact us at [warp.dev/contact-sales](https://www.warp.dev/contact-sales). | ||
| * [Custom inference endpoint](/support-and-community/plans-and-billing/custom-inference-endpoint/) — Route Warp through any OpenAI-compatible endpoint, such as OpenRouter, LiteLLM, z.ai, or an internal gateway. | ||
| * [Bring your own LLM](/enterprise/enterprise-features/bring-your-own-llm/) — Enterprise-managed inference through your cloud provider or approved infrastructure. | ||
| * [Model Choice](/agent-platform/capabilities/model-choice/) — Full list of supported models and `model_id` values. | ||
| * [Credits](/support-and-community/plans-and-billing/credits/) — How Warp credits work and when they're consumed. | ||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.