docs(pricing-may-2026): reframe custom inference endpoint intro to lead with powering Warp's agents

hongyi-chen · oz-agent · hongyi-chen · commit 14ab6e2d9498 · 2026-05-20T17:31:23.000-07:00
Mirror the BYOK page's intro pattern so it's explicit upfront that a
custom inference endpoint is used to power Warp's agents. New opening:

  Warp supports custom inference endpoints for users who want to power
  Warp's agents with any OpenAI-compatible inference endpoint \u2014 a
  model router, hosted gateway, or internal infrastructure they
  already run.

  This lets you route AI requests through your preferred provider, run
  inference behind your own gateway, or use a router like OpenRouter
  or LiteLLM, while keeping the agent experience inside Warp.

No other changes.

Co-Authored-By: Oz &lt;oz-agent@warp.dev&gt;
diff --git a/src/content/docs/support-and-community/plans-and-billing/custom-inference-endpoint.mdx b/src/content/docs/support-and-community/plans-and-billing/custom-inference-endpoint.mdx
@@ -5,9 +5,9 @@ description: >-
   OpenRouter, LiteLLM, z.ai, or an internal gateway you already run.
 ---
 
-A **custom inference endpoint** lets you connect Warp's agents to any OpenAI-compatible inference endpoint, so you can route AI requests through your preferred model router, hosted gateway, or internal infrastructure — without giving up the agent experience inside Warp.
+Warp supports **custom inference endpoints** for users who want to power Warp's agents with any OpenAI-compatible inference endpoint — a model router, hosted gateway, or internal infrastructure they already run.
 
-This is the right fit when you want to choose your provider, run inference behind your own gateway, or use a router like OpenRouter or LiteLLM.
+This lets you route AI requests through your preferred provider, run inference behind your own gateway, or use a router like OpenRouter or LiteLLM, while keeping the agent experience inside Warp.
 
 :::note
 Custom inference endpoints are available on Free and all eligible paid plans for individual users and organizations with 10 or fewer employees, subject to Warp's [Terms of Service](https://www.warp.dev/terms-of-service). Larger organizations need a Business or Enterprise plan. See [warp.dev/pricing](https://www.warp.dev/pricing) for current availability.