modules/ai-agents/pages/ai-gateway/admin/setup-guide.adoc
2 additions & 2 deletions
@@ -234,9 +234,9 @@ If a provider pool contains multiple providers, you can distribute traffic to ba

 == Configure MCP tools (optional)

-If your users will build AI agents that need access to tools via MCP (Model Context Protocol), configure MCP tool aggregation.
+If your users will build glossterm:AI agent[,AI agents] that need access to glossterm:MCP tool[,tools] via glossterm:MCP[,Model Context Protocol (MCP)], configure MCP tool aggregation.

-On the gateway details page, select the *MCP* tab to configure tool discovery and execution. The MCP proxy aggregates multiple MCP servers, allowing agents to find and call tools through a single endpoint.
+On the gateway details page, select the *MCP* tab to configure tool discovery and execution. The MCP proxy aggregates multiple glossterm:MCP server[,MCP servers], allowing agents to find and call tools through a single endpoint.

-This guide shows you how to connect your AI agent or application to a Redpanda AI Gateway. This is also called "Bring Your Own Agent" (BYOA). You'll configure your client SDK, make your first request, and validate the integration.
+This guide shows you how to connect your glossterm:AI agent[] or application to a Redpanda AI Gateway. This is also called "Bring Your Own Agent" (BYOA). You'll configure your client SDK, make your first request, and validate the integration.
modules/ai-agents/pages/ai-gateway/builders/discover-gateways.adoc
1 addition & 1 deletion
@@ -159,7 +159,7 @@ These limits help control costs and ensure fair resource allocation across teams

 === MCP Tools

-If MCP aggregation is enabled for your gateway, you can access tools from multiple MCP servers through a single endpoint.
+If glossterm:MCP[,Model Context Protocol (MCP)] aggregation is enabled for your gateway, you can access glossterm:MCP tool[,tools] from multiple glossterm:MCP server[,MCP servers] through a single endpoint.
modules/ai-agents/pages/ai-gateway/gateway-architecture.adoc
1 addition & 1 deletion
@@ -12,7 +12,7 @@ This page provides technical details about AI Gateway's architecture, request pr

 == Architecture overview

-AI Gateway consists of three planes: a control plane for configuration and management, a data plane for request processing and routing, and an observability plane for monitoring and analytics.
+AI Gateway consists of three planes: a glossterm:control plane[] for configuration and management, a glossterm:data plane[] for request processing and routing, and an observability plane for monitoring and analytics.

-Redpanda AI Gateway provides unified access to multiple Large Language Model (LLM) providers and Model Context Protocol (MCP) servers through a single endpoint. This quickstart walks you through configuring your first gateway and routing requests through it.
+Redpanda AI Gateway provides unified access to multiple large language model (LLM) providers and glossterm:MCP[,Model Context Protocol (MCP)] servers through a single endpoint. This quickstart walks you through configuring your first gateway and routing requests through it.

 == Prerequisites

@@ -275,9 +275,9 @@ Monitor the fallback rate in observability to detect primary provider issues ear

 == Configure MCP tools (optional)

-If you're using AI agents, configure MCP (Model Context Protocol) tool aggregation.
+If you're using glossterm:AI agent[,AI agents], configure glossterm:MCP[,Model Context Protocol (MCP)] tool aggregation.

-On the Gateways page, select the *MCP* tab to configure tool discovery and execution. The MCP proxy aggregates multiple MCP servers behind a single endpoint, allowing agents to discover and call tools through the gateway.
+On the Gateways page, select the *MCP* tab to configure tool discovery and execution. The MCP proxy aggregates multiple glossterm:MCP server[,MCP servers] behind a single endpoint, allowing agents to discover and call glossterm:MCP tool[,tools] through the gateway.

-AI Gateway provides MCP (Model Context Protocol) aggregation, allowing AI agents to access tools from multiple MCP servers through a single unified endpoint. This eliminates the need for agents to manage multiple MCP connections and significantly reduces token costs through deferred tool loading.
+AI Gateway provides glossterm:MCP[,Model Context Protocol (MCP)] aggregation, allowing glossterm:AI agent[,AI agents] to access glossterm:MCP tool[,tools] from multiple glossterm:MCP server[,MCP servers] through a single unified endpoint. This eliminates the need for agents to manage multiple MCP connections and significantly reduces token costs through deferred tool loading.

 MCP aggregation benefits:

@@ -20,7 +20,7 @@ MCP aggregation benefits:

 == What is MCP?

-Model Context Protocol (MCP) is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
+glossterm:MCP[,Model Context Protocol (MCP)] is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
modules/ai-agents/pages/ai-gateway/what-is-ai-gateway.adoc
3 additions & 3 deletions
@@ -18,7 +18,7 @@ First, applications typically hardcode provider-specific SDKs. An application us

 Second, costs can spiral without visibility into usage patterns. Without a centralized view of token consumption across teams and applications, it's difficult to attribute costs to specific customers, features, or environments. Testing and debugging can generate unexpected bills, and there's no way to enforce budgets or rate limits per team or customer.

-Third, AI agents that use MCP (Model Context Protocol) servers face tool coordination challenges. Managing tool discovery and execution is repetitive across projects, and agents typically load all available tools upfront, which creates high token costs. There's also no centralized governance over which tools agents can access.
+Third, glossterm:AI agent[,AI agents] that use glossterm:MCP[,Model Context Protocol (MCP)] servers face tool coordination challenges. Managing tool discovery and execution is repetitive across projects, and agents typically load all available tools upfront, which creates high token costs. There's also no centralized governance over which tools agents can access.

 Finally, observability is fragmented across provider dashboards. You cannot reconstruct user sessions that span multiple models, compare latency and costs across providers in a unified view, or efficiently debug issues. Troubleshooting "the AI gave the wrong answer" requires manual log diving across different systems.

@@ -100,9 +100,9 @@ For reliability, you can configure provider pools with automatic failover. If yo

 === MCP aggregation and orchestration

-AI Gateway aggregates multiple MCP (Model Context Protocol) servers and provides deferred tool loading, which dramatically reduces token costs for AI agents.
+AI Gateway aggregates multiple glossterm:MCP server[,MCP servers] and provides deferred tool loading, which dramatically reduces token costs for AI agents.

-Without AI Gateway, agents typically load all available tools from multiple MCP servers at startup. This approach sends 50+ tool definitions with every request, creating high token costs (thousands of tokens per request), slow agent startup times, and no centralized governance over which tools agents can access.
+Without AI Gateway, agents typically load all available glossterm:MCP tool[,tools] from multiple MCP servers at startup. This approach sends 50+ tool definitions with every request, creating high token costs (thousands of tokens per request), slow agent startup times, and no centralized governance over which tools agents can access.

 With AI Gateway, you configure approved MCP servers once, and the gateway loads only search and orchestrator tools initially. Agents query for specific tools only when needed, which often reduces token usage by 80-90% depending on your configuration and the number of tools aggregated. You also gain centralized approval and governance over which MCP servers your agents can access.
modules/ai-agents/pages/mcp/remote/concepts.adoc
2 additions & 1 deletion
@@ -64,7 +64,8 @@ The `redpanda.otel_traces` topic has a predefined retention policy. Configuratio

 The topic persists in your cluster even after all MCP servers are deleted, allowing you to retain historical trace data for analysis.

-Trace data may contain sensitive information from your tool inputs and outputs. Consider implementing appropriate glossterm:ACL[,access control lists (ACLs)] for the `redpanda.otel_traces` topic, and review the data in traces before sharing or exporting to external systems.
+Trace data may contain sensitive information from your
+tool inputs and outputs. Consider implementing appropriate glossterm:ACL[,access control lists (ACLs)] for the `redpanda.otel_traces` topic, and review the data in traces before sharing or exporting to external systems.