Merge branch 'DOC-1867-Document-feature-AI-Gateway-help-cloud-team-polish-clean-up' into adp-pkg1

Paulo Borges · Paulo Borges · commit 73be85dfad56 · 2026-02-09T09:31:18.000-08:00
# Conflicts:
#	modules/ai-agents/pages/mcp/overview.adoc
diff --git a/modules/ai-agents/pages/ai-gateway/admin/setup-guide.adoc b/modules/ai-agents/pages/ai-gateway/admin/setup-guide.adoc
@@ -1,5 +1,5 @@
 = AI Gateway Setup Guide
-:description: Complete setup guide for administrators to enable providers, configure models, create gateways, and set up routing policies.
+:description: Set up AI Gateway for your organization. Enable providers, configure failover for high availability, set budget controls, and create gateways with team-level isolation.
 :page-topic-type: how-to
 :personas: platform_admin
 :learning-objective-1: Enable LLM providers and models in the catalog
diff --git a/modules/ai-agents/pages/ai-gateway/gateway-architecture.adoc b/modules/ai-agents/pages/ai-gateway/gateway-architecture.adoc
@@ -1,5 +1,5 @@
 = AI Gateway Architecture
-:description: Technical architecture of Redpanda AI Gateway, including request lifecycle, supported providers, deployment models, and implementation details.
+:description: Technical architecture of Redpanda AI Gateway, including how the control plane, data plane, and observability plane deliver high availability, cost governance, and multi-tenant isolation.
 :page-topic-type: concept
 :personas: app_developer, platform_admin
 :learning-objective-1: Describe the three architectural planes of AI Gateway
diff --git a/modules/ai-agents/pages/ai-gateway/gateway-quickstart.adoc b/modules/ai-agents/pages/ai-gateway/gateway-quickstart.adoc
@@ -1,5 +1,5 @@
 = AI Gateway Quickstart
-:description: Get started with AI Gateway by configuring providers, creating your first gateway, and routing requests through unified LLM endpoints.
+:description: Get started with AI Gateway. Configure providers, create your first gateway with failover and budget controls, and route your first request.
 :page-topic-type: quickstart
 :personas: app_developer, platform_admin
 :learning-objective-1: Enable an LLM provider and create your first gateway
@@ -8,7 +8,7 @@
 
 include::ai-agents:partial$ai-gateway-byoc-note.adoc[]
 
-Redpanda AI Gateway provides unified access to multiple large language model (LLM) providers and glossterm:MCP[,Model Context Protocol (MCP)] servers through a single endpoint. This quickstart walks you through configuring your first gateway and routing requests through it.
+Redpanda AI Gateway keeps your AI-powered applications running and your costs under control by routing all LLM and MCP traffic through a single managed layer with automatic failover and budget enforcement. This quickstart walks you through configuring your first gateway and routing requests through it.
 
 == Prerequisites
 
diff --git a/modules/ai-agents/pages/ai-gateway/index.adoc b/modules/ai-agents/pages/ai-gateway/index.adoc
@@ -1,5 +1,5 @@
 = AI Gateway
-:description: Learn about the unified access layer for LLM providers and AI tools with centralized routing, policy enforcement, cost management, and observability.
+:description: Keep AI-powered apps running with automatic provider failover, prevent runaway spend with centralized budget controls, and govern access across teams, apps, and service accounts.
 :page-layout: index
 :personas: platform_admin, app_developer, evaluator
 
diff --git a/modules/ai-agents/pages/ai-gateway/what-is-ai-gateway.adoc b/modules/ai-agents/pages/ai-gateway/what-is-ai-gateway.adoc
@@ -1,30 +1,54 @@
 = What is an AI Gateway?
-:description: Understand what an AI Gateway is, the problems it solves, and how it benefits your AI infrastructure.
+:description: Understand how AI Gateway keeps AI-powered apps highly available across providers and prevents runaway AI spend with centralized cost governance.
 :page-topic-type: concept
 :personas: app_developer, platform_admin
-:learning-objective-1: Describe how AI Gateway centralizes LLM provider management and reduces operational complexity
-:learning-objective-2: Identify key features that address common LLM integration challenges
-:learning-objective-3: Determine whether AI Gateway fits your use case based on traffic volume and provider diversity
+:learning-objective-1: Explain how AI Gateway keeps AI-powered apps highly available through governed provider failover
+:learning-objective-2: Describe how AI Gateway prevents runaway AI spend with centralized budget controls and tenancy-based governance
+:learning-objective-3: Identify when AI Gateway fits your use case based on availability requirements, cost governance needs, and multi-provider or MCP tool usage
 
 include::ai-agents:partial$ai-gateway-byoc-note.adoc[]
 
-Redpanda AI Gateway is a unified access layer for LLM providers and AI tools that sits between your applications and the AI services they use. It provides centralized routing, policy enforcement, cost management, and observability for all your AI traffic.
+Redpanda AI Gateway keeps your AI-powered applications highly available and your AI spend under control. It sits between your applications and the LLM providers and AI tools they depend on. Automatic provider failover keeps your apps up when a provider goes down, and centralized budget controls keep spend within the limits you set. For platform teams, it adds governance at the model-fallback level, tenancy modeling for teams, individuals, apps, and service accounts, and a single proxy layer for both LLMs and MCP tool servers.
 
 == The problem
 
-Modern AI applications face four critical challenges that increase costs, reduce reliability, and slow down development.
+Modern AI applications face two business-critical challenges: staying up and staying on budget.
 
-First, applications typically hardcode provider-specific SDKs. An application using OpenAI's SDK cannot easily switch to Anthropic or Google without code changes and redeployment. This tight coupling makes testing across providers time-consuming and error-prone, and means provider outages directly impact your application availability.
+First, applications typically hardcode provider-specific SDKs. An application using OpenAI's SDK cannot easily switch to Anthropic or Google without code changes and redeployment. When a provider hits rate limits, suffers an outage, or degrades in performance, your application goes down with it. Your end users don't care which provider you use; they care that the app works.
 
-Second, costs can spiral without visibility into usage patterns. Without a centralized view of token consumption across teams and applications, it's difficult to attribute costs to specific customers, features, or environments. Testing and debugging can generate unexpected bills, and there's no way to enforce budgets or rate limits per team or customer.
+Second, costs can spiral without centralized controls. Without a single view of token consumption across teams and applications, it's difficult to attribute costs to specific customers, features, or environments. Testing and debugging can generate unexpected bills, and there's no way to enforce budgets or rate limits per team, application, or service account. The result: runaway spend that finance discovers only after the fact.
 
-Third, glossterm:AI agent[,AI agents] that use glossterm:MCP[,Model Context Protocol (MCP)] servers face tool coordination challenges. Managing tool discovery and execution is repetitive across projects, and agents typically load all available tools upfront, which creates high token costs. There's also no centralized governance over which tools agents can access.
-
-Finally, observability is fragmented across provider dashboards. You cannot reconstruct user sessions that span multiple models, compare latency and costs across providers in a unified view, or efficiently debug issues. Troubleshooting "the AI gave the wrong answer" requires manual log diving across different systems.
+These two challenges are compounded by fragmented observability across provider dashboards, which makes it harder to detect availability issues or cost anomalies in time to act. And as organizations adopt glossterm:AI agent[,AI agents] that call glossterm:MCP tool[,MCP tools], the lack of centralized tool governance adds another dimension of uncontrolled cost and risk.
 
 == What AI Gateway solves
 
-Redpanda AI Gateway addresses these challenges through the following core capabilities:
+Redpanda AI Gateway delivers two core business outcomes, high availability and cost governance, backed by platform-level controls that set it apart from simple proxy layers:
+
+=== High availability through governed failover
+
+AI Gateway lets you configure provider pools with automatic failover. When your primary provider hits rate limits, times out, or returns errors, the gateway routes requests to a fallback provider, with no code changes and no downtime for your users.
+
+Unlike simple retry logic, AI Gateway provides governance at the failover level: you define which providers fail over to which, under what conditions, and with what priority. This controlled failover can significantly improve uptime even during extended provider outages.
+
+=== Cost governance and budget controls
+
+AI Gateway gives you centralized fiscal control over AI spend. Set monthly budget caps per gateway, enforce them automatically, and set rate limits per team, environment, or application. No more runaway costs discovered after the fact.
+
+You can route requests to different models based on user attributes. For example, to direct premium users to a more capable model while routing free tier users to a cost-effective option, use a Common Expression Language (CEL) expression:
+
+[source,cel]
+----
+// Route premium users to best model, free users to cost-effective model
+request.headers["x-user-tier"] == "premium"
+  ? "anthropic/claude-opus-4.6"
+  : "anthropic/claude-sonnet-4.5"
+----
+
+You can also set different rate limits and spend limits per environment to prevent staging or development traffic from consuming production budgets.
+
+=== Tenancy and access governance
+
+AI Gateway provides multi-tenant isolation by design. Create separate gateways for teams, individual developers, applications, or service accounts, each with their own budgets, rate limits, routing policies, and observability scope. This tenancy model lets platform teams govern who uses what, how much they spend, and which models and tools they can access, without building custom authorization layers.
 
 === Unified LLM access (single endpoint for all providers)
 
@@ -85,27 +109,9 @@ response = client.chat.completions.create(
 
 To switch providers, you change only the `model` parameter from `openai/gpt-5.2` to `anthropic/claude-sonnet-4.5`. No code changes or redeployment needed.
 
-=== Policy-based routing and cost control
-
-AI Gateway lets you define routing rules, rate limits, and budgets once, then enforces them automatically for all requests.
-
-You can route requests to different models based on user attributes. For example, to direct premium users to a more capable model while routing free tier users to a cost-effective option, use a CEL expression:
-
-[source,cel]
-----
-// Route premium users to best model, free users to cost-effective model
-request.headers["x-user-tier"] == "premium"
-  ? "anthropic/claude-opus-4.6"
-  : "anthropic/claude-sonnet-4.5"
-----
-
-You can also set different rate limits and spend limits per environment to prevent staging or development traffic from consuming production budgets.
-
-For reliability, you can configure provider pools with automatic failover. If you configure OpenAI GPT-4 as your primary model and Anthropic Claude Opus as the fallback, the gateway automatically routes requests to the fallback when it detects rate limits or timeouts from the primary provider. This configuration can significantly improve uptime (potentially up to 99.9% in some configurations) even during provider outages.
-
-=== MCP aggregation and orchestration
+=== Proxy for LLMs and MCP tool servers
 
-AI Gateway aggregates multiple glossterm:MCP server[,MCP servers] and provides deferred tool loading, which dramatically reduces token costs for AI agents.
+AI Gateway acts as a single proxy layer for both LLM requests and MCP tool servers. For LLM traffic, it provides the unified endpoint described above. For AI agents that use MCP tools, it aggregates multiple MCP servers and provides deferred tool loading, which reduces token costs.
 
 Without AI Gateway, agents typically load all available glossterm:MCP tool[,tools] from multiple MCP servers at startup. This approach sends 50+ tool definitions with every request, creating high token costs (thousands of tokens per request), slow agent startup times, and no centralized governance over which tools agents can access.
 
diff --git a/modules/ai-agents/pages/index.adoc b/modules/ai-agents/pages/index.adoc
@@ -1,4 +1,4 @@
 = Agentic AI
-:description: Learn about the Redpanda Agentic Data Plane, including the AI Gateway, AI agents, and MCP servers.
+:description: Learn about the Redpanda Agentic Data Plane. Keep AI-powered apps highly available, control costs across providers, and govern access for teams, apps, and service accounts.
 :page-layout: index
 :page-aliases: develop:agents/about.adoc, develop:ai-agents/about.adoc
diff --git a/modules/ai-agents/pages/mcp/index.adoc b/modules/ai-agents/pages/mcp/index.adoc
@@ -1,8 +1,8 @@
 = Model Context Protocol (MCP)
-:description: Learn about the Model Context Protocol (MCP) in Redpanda Cloud.
+:description: Give AI agents direct access to your databases, queues, CRMs, and other business systems without writing custom glue code.
 :page-layout: index
 
-The Model Context Protocol (MCP) provides a standardized way for AI agents to connect with external data sources and tools in Redpanda Cloud.
+AI agents need context from your business systems. The Model Context Protocol (MCP) translates agent intent into real connections to databases, queues, CRMs, HRIS, and other systems of record, without you writing custom integration code. Redpanda's MCP servers are built on the same proven connectors that power the world's largest e-commerce, electric vehicle, energy, and AI companies.
 
 Redpanda Cloud offers two complementary MCP options:
 
diff --git a/modules/ai-agents/pages/mcp/local/index.adoc b/modules/ai-agents/pages/mcp/local/index.adoc
@@ -1,4 +1,4 @@
 = Redpanda Cloud Management MCP Server
 :page-beta: true
-:description: Find links to information about the Redpanda Cloud Management MCP Server and its features for building and managing AI agents that can interact with your Redpanda Cloud account and clusters.
+:description: Manage your Redpanda Cloud clusters, topics, and users through AI agents using natural language commands.
 :page-layout: index
diff --git a/modules/ai-agents/pages/mcp/local/overview.adoc b/modules/ai-agents/pages/mcp/local/overview.adoc
@@ -1,6 +1,6 @@
 = Redpanda Cloud Management MCP Server
 :page-beta: true
-:description: Learn about the Redpanda Cloud Management MCP Server, which lets AI agents securely access and operate your Redpanda Cloud account and clusters.
+:description: Let AI agents securely operate your Redpanda Cloud clusters, topics, and users through natural language commands.
 :page-topic-type: overview
 :personas: evaluator, agent_developer, platform_admin
 // Reader journey: "I'm new"
diff --git a/modules/ai-agents/pages/mcp/overview.adoc b/modules/ai-agents/pages/mcp/overview.adoc
@@ -1,7 +1,7 @@
 = MCP Servers for Redpanda Cloud Overview
-:description: Learn about Model Context Protocol (MCP) in Redpanda Cloud, including the two complementary options: the Redpanda Cloud Management MCP Server and Remote MCP.
+:description: Connect AI agents to your databases, queues, CRMs, and other business systems without writing glue code, using Redpanda's proven connectors.
 :page-topic-type: overview
-:personas: evaluator, agent_developer
+:personas: evaluator, ai_agent_developer
 // Reader journey: "I'm new" - understanding the landscape
 // Learning objectives - what readers should understand after reading this page:
 :learning-objective-1: Describe what MCP enables for AI agents
@@ -18,13 +18,9 @@ After reading this page, you will be able to:
 
 == What is MCP?
 
-The Model Context Protocol (MCP) provides a standardized way for AI agents to connect with external data sources and tools in Redpanda Cloud.
+MCP (Model Context Protocol) is an open standard that translates AI agent intent into real connections to databases, queues, CRMs, HRIS, accounting software, and other business systems. Instead of writing custom glue code for every integration, you define your tools once using MCP, and any MCP-compatible AI client can discover and use them.
 
-Each MCP server hosts a set of tools that AI clients can discover and invoke. Tools are custom integrations that expose data, APIs, or workflows to AI agents.
-
-Think of MCP like a universal adapter: instead of building custom integrations for every AI system, you define your tools once using MCP, and any MCP-compatible AI client can discover and use them.
-
-Without MCP, connecting AI to your business systems requires custom API code, authentication handling, and response formatting for each AI platform. With MCP, you describe what a tool does and what inputs it needs, and the protocol handles the rest.
+Without MCP, connecting AI to your business systems requires custom API code, authentication handling, and response formatting for each AI platform. With MCP, you describe what a tool does and what inputs it needs, and the protocol handles the rest. Redpanda's MCP servers are built on the same proven connectors that power the world's largest e-commerce, electric vehicle, energy, and AI companies today.
 
 == MCP options in Redpanda Cloud
 
@@ -89,9 +85,9 @@ You can use both options together. For example, use the Redpanda Cloud Managemen
 
 == Get started
 
-* xref:ai-agents:mcp/local/quickstart.adoc[]
-* xref:ai-agents:mcp/remote/quickstart.adoc[]
+* xref:ai-agents:mcp/local/quickstart.adoc[]: Connect Claude to your Redpanda Cloud account
+* xref:ai-agents:mcp/remote/quickstart.adoc[]: Build and deploy custom MCP tools
 
 == Suggested reading
 
-* xref:home:ROOT:mcp-setup.adoc[]
+* xref:home:ROOT:mcp-setup.adoc[]: Access Redpanda documentation through AI agents (read-only, no Cloud access required)
diff --git a/modules/ai-agents/pages/mcp/remote/concepts.adoc b/modules/ai-agents/pages/mcp/remote/concepts.adoc
@@ -1,5 +1,5 @@
 = MCP Tool Execution and Components
-:description: Understand the MCP execution model, choose the right component type, and use traces for observability.
+:description: Understand how MCP tools execute requests, choose the right Redpanda Connect component type, and use traces for observability.
 :page-aliases: ai-agents:mcp/remote/understanding-mcp-tools.adoc
 :page-topic-type: concepts
 :personas: agent_developer, streaming_developer
diff --git a/modules/ai-agents/pages/mcp/remote/index.adoc b/modules/ai-agents/pages/mcp/remote/index.adoc
@@ -1,3 +1,3 @@
 = Remote MCP Servers for Redpanda Cloud
-:description: Enable AI agents to directly interact with your Redpanda Cloud clusters and streaming data.
+:description: Build MCP tools that connect AI agents to databases, queues, CRMs, and other business systems using Redpanda's proven connectors.
 :page-layout: index
diff --git a/modules/ai-agents/pages/mcp/remote/overview.adoc b/modules/ai-agents/pages/mcp/remote/overview.adoc
@@ -1,5 +1,5 @@
 = Remote MCP Server Overview
-:description: Discover how AI agents can interact with your streaming data and how to connect them to Redpanda Cloud.
+:description: Build and host MCP tools that connect AI agents to your business systems without writing glue code, using Redpanda's proven connectors.
 :page-topic-type: overview
 :personas: evaluator, agent_developer
 // Reader journey: "I'm evaluating this"
@@ -8,7 +8,7 @@
 :learning-objective-2: Identify use cases where Remote MCP provides business value
 :learning-objective-3: Describe how MCP tools expose Redpanda Connect components to AI
 
-This page introduces Remote MCP servers and helps you decide if they're right for your use case.
+Remote MCP lets you give AI agents access to your databases, queues, CRMs, and other systems of record without writing custom integration code. This page introduces Remote MCP servers and helps you decide if they're right for your use case.
 
 After reading this page, you will be able to:
 
diff --git a/modules/ai-agents/pages/mcp/remote/quickstart.adoc b/modules/ai-agents/pages/mcp/remote/quickstart.adoc
@@ -1,5 +1,5 @@
 = Remote MCP Server Quickstart
-:description: Learn how to extend AI agents with custom tools that interact with your Redpanda data using the Model Context Protocol (MCP).
+:description: Build and deploy your first MCP tools to connect AI agents to your Redpanda data without writing custom integration code.
 :page-topic-type: tutorial
 :personas: agent_developer, streaming_developer, evaluator
 // Reader journey: "I want to try it now"