You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: modules/ai-agents/pages/adp-overview.adoc
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@
7
7
:learning-objective-2: Describe how each component addresses enterprise governance and reliability requirements
8
8
:learning-objective-3: Determine whether Redpanda ADP fits your organization's requirements for AI agent deployment
9
9
10
-
glossterm:AI agent[,AI agents] are moving from demos to production. Enterprises need governance, reliability, and cost control to deploy them safely. Redpanda Agentic Data Plane (ADP) combine a streaming-native immutable log, 300+ proven data connectors, and declarative glossterm:AI agent[,AI agents] into a unified platform with built-in compliance-grade audit trails.
10
+
glossterm:AI agent[,AI agents] are moving from demos to production. Enterprises need governance, reliability, and cost control to deploy them safely. Redpanda Agentic Data Plane (ADP) combines a streaming-native immutable log, 300+ proven data connectors, and declarative AI agents into a unified platform with built-in compliance-grade audit trails.
Copy file name to clipboardExpand all lines: modules/ai-agents/pages/agents/overview.adoc
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,11 +18,11 @@ After reading this page, you will be able to:
18
18
19
19
== What is an AI agent?
20
20
21
-
An AI agent is a system built around a glossterm:large language model (LLM)[] that interprets user intent, selects the right tools, and chains multiple steps into a workflow. In Redpanda Cloud, agents are declarative: you configure what the agent should do (its role, constraints, and available tools) rather than writing imperative agent code. This is possible because Redpanda Connect provides 300+ connectors and robust data processing capabilities that the framework orchestrates for you.
21
+
An AI agent is a system built around a glossterm:large language model (LLM)[] that interprets user intent, selects the right tools, and chains multiple steps into a workflow. In Redpanda Cloud, agents are declarative: you configure what the agent should do (its role, constraints, and available tools) rather than writing imperative agent code. This is possible because Redpanda Connect provides the connectors and robust data processing capabilities that the framework orchestrates for you.
22
22
23
23
== How agents work
24
24
25
-
When you create an agent, you configure these four components through the Redpanda Cloud Console rather than writing code:
25
+
When you create an agent, you configure the components through the Redpanda Cloud Console rather than writing code:
26
26
27
27
* *System prompt*: Defines the agent's role, responsibilities, and constraints
28
28
* *LLM*: Interprets user intent and decides which tools to invoke
This page provides technical details about AI Gateway's architecture, request processing, and capabilities. For an introduction to AI Gateway and the problems it solves, see xref:ai-agents:ai-gateway/what-is-ai-gateway.adoc[]
11
+
This page provides technical details about AI Gateway's architecture, request processing, and capabilities. For an overview of AI Gateway, see xref:ai-agents:ai-gateway/what-is-ai-gateway.adoc[]
12
12
13
13
== Architecture overview
14
14
15
-
AI Gateway consists of three planes: a glossterm:control plane[] for configuration and management, a glossterm:data plane[] for request processing and routing, and an observability plane for monitoring and analytics.
15
+
AI Gateway consists of a glossterm:control plane[] for configuration and management, a glossterm:data plane[] for request processing and routing, and an observability plane for monitoring and analytics.
16
16
17
17
// PLACEHOLDER: Add architecture diagram showing:
18
18
// 1. Control Plane:
@@ -38,7 +38,7 @@ AI Gateway consists of three planes: a glossterm:control plane[] for configurati
38
38
The control plane manages gateway configuration and policy definition:
39
39
40
40
* **Workspace management**: Multi-tenant isolation with separate namespaces for different teams or environments
Copy file name to clipboardExpand all lines: modules/ai-agents/pages/ai-gateway/gateway-quickstart.adoc
+5-9Lines changed: 5 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
= AI Gateway Quickstart
2
2
:description: Get started with AI Gateway. Configure providers, create your first gateway with failover and budget controls, and route your first request.
:learning-objective-1: Enable an LLM provider and create your first gateway
6
6
:learning-objective-2: Route your first request through AI Gateway and verify it works
7
7
:learning-objective-3: Verify request routing and token usage in the gateway overview
@@ -61,7 +61,7 @@ ifdef::ai-hub-available[]
61
61
====
62
62
When creating a gateway, you choose between two modes:
63
63
64
-
* *AI Hub Mode*: Zero-configuration with pre-configured routing and backend pools. Just add provider credentials and start routing requests. Ideal for quick starts and standard use cases.
64
+
* *AI Hub Mode*: Zero-configuration with pre-configured routing and backend pools. Just add provider credentials and start routing requests. Ideal for quickstarts and standard use cases.
65
65
* *Custom Mode*: Full control over all routing rules, backend pools, and policies. Requires manual configuration. Ideal for custom routing logic and specialized requirements.
66
66
67
67
See xref:ai-gateway/gateway-modes.adoc[] to understand which mode fits your needs. This quickstart focuses on Custom mode configuration.
@@ -273,9 +273,9 @@ The gateway provides these built-in MCP tools:
273
273
* *Data catalog API*: Query your data catalog
274
274
* *Memory store*: Persistent storage for agent state
275
275
* *Vector search*: Semantic search over embeddings
276
-
* *MCP orchestrator*: Built-in tool for programmatic multi-tool workflows
276
+
* *MCP Orchestrator*: Built-in tool for programmatic multi-tool workflows
277
277
278
-
The *MCP orchestrator* enables agents to generate JavaScript code that calls multiple tools in a single orchestrated step, reducing round trips. For example, a workflow requiring 47 file reads can be reduced from 49 round trips to just 1.
278
+
The *MCP Orchestrator* enables agents to generate JavaScript code that calls multiple tools in a single orchestrated step, reducing round trips. For example, a workflow requiring 47 file reads can be reduced from 49 round trips to just 1.
279
279
280
280
To add external tools (for example, Slack, GitHub), add their MCP server endpoints to your gateway configuration.
281
281
@@ -284,14 +284,10 @@ To add external tools (for example, Slack, GitHub), add their MCP server endpoin
284
284
When many tools are aggregated, listing all tools upfront can consume significant tokens. With deferred tool loading, the MCP gateway initially returns only:
285
285
286
286
* A tool search capability
287
-
* The MCP orchestrator
287
+
* The MCP Orchestrator
288
288
289
289
Agents then search for specific tools they need, retrieving only that subset. This can reduce token usage by 80-90% when you have many tools configured.
290
290
291
-
// REVIEWERS: When/how exactly do you use the orchestrator? Also what happens after they create a gateway? Please provide an example of how to validate end-to-end routing against the gateway endpoint!
292
-
293
-
// REVIEWERS: How do users connect to the ADP catalog + MCP servers exposed through RPCN?
294
-
295
291
== Configure CEL routing rule (optional)
296
292
297
293
Use CEL (Common Expression Language) expressions to route requests dynamically based on headers, content, or other request properties.
glossterm:MCP[,Model Context Protocol (MCP)] is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
23
+
Model Context Protocol (MCP) is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
24
24
25
25
* Database queries
26
26
* File system operations
@@ -58,14 +58,14 @@ glossterm:MCP[,Model Context Protocol (MCP)] is a standard for exposing tools (f
58
58
│ (Claude, GPT) │
59
59
└────────┬────────┘
60
60
│
61
-
│ 1. Discover tools via /mcp endpoint
61
+
│ 1. Discover tools with /mcp endpoint
62
62
│ 2. Invoke specific tool
63
63
│
64
64
┌────────▼────────────────────────────────┐
65
65
│ AI Gateway (MCP Aggregator) │
66
66
│ │
67
67
│ ┌─────────────────────────────────┐ │
68
-
│ │ Deferred Tool Loading │ │
68
+
│ │ Deferred tool loading │ │
69
69
│ │ (Send search + orchestrator │ │
70
70
│ │ initially, defer others) │ │
71
71
│ └─────────────────────────────────┘ │
@@ -249,8 +249,8 @@ Agent receives result and can continue reasoning.
249
249
Traditional MCP (No deferred loading):
250
250
251
251
1. Agent connects to MCP endpoint
252
-
2. Gateway sends ALL tools from ALL MCP servers (50+ tools)
253
-
3. Agent includes ALL tool definitions in EVERY LLM request
252
+
2. Gateway sends all tools from all MCP servers (50+ tools)
253
+
3. Agent includes all tool definitions in every LLM request
254
254
4. High token cost: ~5,000-10,000 tokens per request
0 commit comments