Commit 14ab6e2

hongyi-chen and oz-agent committed
docs(pricing-may-2026): reframe custom inference endpoint intro to lead with powering Warp's agents
Mirror the BYOK page's intro pattern so it's explicit upfront that a custom inference endpoint is used to power Warp's agents.

New opening:

Warp supports custom inference endpoints for users who want to power Warp's agents with any OpenAI-compatible inference endpoint — a model router, hosted gateway, or internal infrastructure they already run.

This lets you route AI requests through your preferred provider, run inference behind your own gateway, or use a router like OpenRouter or LiteLLM, while keeping the agent experience inside Warp.

No other changes.

Co-Authored-By: Oz <oz-agent@warp.dev>
1 parent 97590c9 commit 14ab6e2

1 file changed

Lines changed: 2 additions & 2 deletions

src/content/docs/support-and-community/plans-and-billing/custom-inference-endpoint.mdx

@@ -5,9 +5,9 @@ description: >-
 OpenRouter, LiteLLM, z.ai, or an internal gateway you already run.
 ---

-A **custom inference endpoint** lets you connect Warp's agents to any OpenAI-compatible inference endpoint, so you can route AI requests through your preferred model router, hosted gateway, or internal infrastructure — without giving up the agent experience inside Warp.
+Warp supports **custom inference endpoints** for users who want to power Warp's agents with any OpenAI-compatible inference endpoint — a model router, hosted gateway, or internal infrastructure they already run.

-This is the right fit when you want to choose your provider, run inference behind your own gateway, or use a router like OpenRouter or LiteLLM.
+This lets you route AI requests through your preferred provider, run inference behind your own gateway, or use a router like OpenRouter or LiteLLM, while keeping the agent experience inside Warp.

 :::note
 Custom inference endpoints are available on Free and all eligible paid plans for individual users and organizations with 10 or fewer employees, subject to Warp's [Terms of Service](https://www.warp.dev/terms-of-service). Larger organizations need a Business or Enterprise plan. See [warp.dev/pricing](https://www.warp.dev/pricing) for current availability.
