Skip to content

Commit b297689

Browse files
committed
add google AI
1 parent 30709dc commit b297689

7 files changed

Lines changed: 68 additions & 75 deletions

File tree

modules/ai-agents/pages/ai-gateway/admin/setup-guide.adoc

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -19,16 +19,16 @@ After completing this guide, you will be able to:
1919
== Prerequisites
2020

2121
* Access to the Redpanda Cloud Console with administrator privileges
22-
* API keys for at least one LLM provider (OpenAI or Anthropic)
22+
* API keys for at least one LLM provider (OpenAI, Anthropic, Google AI)
2323
* (Optional) MCP server endpoints if you plan to use tool aggregation
2424

2525
== Enable a provider
2626

27-
Providers represent upstream services (Anthropic, OpenAI) and associated credentials. Providers are disabled by default and must be enabled explicitly by an administrator.
27+
Providers represent upstream services (Anthropic, OpenAI, Google AI) and associated credentials. Providers are disabled by default and must be enabled explicitly by an administrator.
2828

2929
. In the Redpanda Cloud Console, navigate to *AI Gateway* → *Providers*.
30-
. Select a provider (for example, Anthropic or OpenAI).
31-
. On the *Configuration* tab for the provider, click *Add configuration*.
30+
. Select a provider (for example, Anthropic).
31+
. On the Configuration tab for the provider, click *Add configuration*.
3232
. Enter your API Key for the provider.
3333
+
3434
TIP: Store provider API keys securely. Each provider configuration can have multiple API keys for rotation and redundancy.

modules/ai-agents/pages/ai-gateway/builders/connect-your-agent.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88

99
include::ai-agents:partial$ai-gateway-byoc-note.adoc[]
1010

11-
This guide shows you how to connect your AI agent or application to a Redpanda AI Gateway. You'll configure your client SDK, make your first request, and validate the integration.
11+
This guide shows you how to connect your AI agent or application to a Redpanda AI Gateway. This is also called "Bring Your Own Agent" (BYOA). You'll configure your client SDK, make your first request, and validate the integration.
1212

1313
After completing this guide, you will be able to:
1414

modules/ai-agents/pages/ai-gateway/builders/discover-gateways.adoc

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -226,29 +226,29 @@ If you have access to multiple gateways, consider which one to use based on your
226226

227227
Organizations often create separate gateways for different environments:
228228

229-
* *Production gateway*: Higher rate limits, access to all models, monitoring enabled
230-
* *Staging gateway*: Lower rate limits, restricted models, aggressive cost controls
231-
* *Development gateway*: Minimal limits, all models for experimentation
229+
* Production gateway: Higher rate limits, access to all models, monitoring enabled
230+
* Staging gateway: Lower rate limits, restricted models, aggressive cost controls
231+
* Development gateway: Minimal limits, all models for experimentation
232232

233233
Choose the gateway that matches your deployment environment.
234234

235235
=== By team or project
236236

237237
Gateways may be organized by team or project for cost tracking and isolation:
238238

239-
* *team-ml-gateway*: For machine learning team
240-
* *team-product-gateway*: For product team
241-
* *customer-facing-gateway*: For production customer workloads
239+
* team-ml-gateway: For machine learning team
240+
* team-product-gateway: For product team
241+
* customer-facing-gateway: For production customer workloads
242242

243243
Use the gateway designated for your team to ensure proper cost attribution.
244244

245245
=== By capability
246246

247247
Different gateways may have different features enabled:
248248

249-
* *Gateway with MCP tools*: Use if your agent needs to call tools
250-
* *Gateway without MCP*: Use for simple LLM completions
251-
* *Gateway with specific models*: Use if you need access to particular models
249+
* Gateway with MCP tools: Use if your agent needs to call tools
250+
* Gateway without MCP: Use for simple LLM completions
251+
* Gateway with specific models: Use if you need access to particular models
252252

253253
== Example: Complete discovery workflow
254254

modules/ai-agents/pages/ai-gateway/gateway-architecture.adoc

Lines changed: 36 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -94,54 +94,54 @@ For MCP tool requests, the lifecycle differs slightly to support deferred tool l
9494

9595
The gateway only loads and exposes specific tools when requested, which dramatically reduces the token overhead compared to loading all tools upfront.
9696

97-
== Supported features
97+
// == Supported features
9898

99-
=== LLM providers
99+
// === LLM providers
100100

101-
* OpenAI
102-
* Anthropic
103-
* // PLACEHOLDER: Google, AWS Bedrock, Azure OpenAI, others?
101+
// * OpenAI
102+
// * Anthropic
103+
// * // PLACEHOLDER: Google, AWS Bedrock, Azure OpenAI, others?
104104

105-
=== API compatibility
105+
// === API compatibility
106106

107-
* OpenAI-compatible `/v1/chat/completions` endpoint
108-
* // PLACEHOLDER: Streaming support?
109-
* // PLACEHOLDER: Embeddings support?
110-
* // PLACEHOLDER: Other endpoints?
107+
// * OpenAI-compatible `/v1/chat/completions` endpoint
108+
// * // PLACEHOLDER: Streaming support?
109+
// * // PLACEHOLDER: Embeddings support?
110+
// * // PLACEHOLDER: Other endpoints?
111111

112-
=== Policy features
112+
// === Policy features
113113

114-
* CEL-based routing expressions
115-
* Rate limiting (// PLACEHOLDER: per-gateway, per-header, per-tenant?)
116-
* Monthly spend limits (// PLACEHOLDER: per-gateway, per-workspace?)
117-
* Provider pools with automatic failover
118-
* // PLACEHOLDER: Caching support?
114+
// * CEL-based routing expressions
115+
// * Rate limiting (// PLACEHOLDER: per-gateway, per-header, per-tenant?)
116+
// * Monthly spend limits (// PLACEHOLDER: per-gateway, per-workspace?)
117+
// * Provider pools with automatic failover
118+
// * // PLACEHOLDER: Caching support?
119119

120-
=== MCP support
120+
// === MCP support
121121

122-
* MCP server aggregation
123-
* Deferred tool loading (often 80-90% token reduction depending on configuration)
124-
* JavaScript orchestrator for multi-step workflows
125-
* // PLACEHOLDER: Tool execution sandboxing?
122+
// * MCP server aggregation
123+
// * Deferred tool loading (often 80-90% token reduction depending on configuration)
124+
// * JavaScript orchestrator for multi-step workflows
125+
// * PLACEHOLDER: Tool execution sandboxing?
126126

127-
=== Observability
127+
// === Observability
128128

129-
* Request logs with full prompt/response history
130-
* Token usage tracking
131-
* Estimated cost per request
132-
* Latency metrics
133-
* // PLACEHOLDER: Metrics export? OpenTelemetry support?
129+
// * Request logs with full prompt/response history
130+
// * Token usage tracking
131+
// * Estimated cost per request
132+
// * Latency metrics
133+
// * PLACEHOLDER: Metrics export? OpenTelemetry support?
134134

135-
== Current limitations
135+
// == Current limitations
136136

137-
* // PLACEHOLDER: List current limitations, for example:
138-
** // - Custom model deployments (Azure OpenAI BYOK, AWS Bedrock custom models)
139-
** // - Response caching
140-
** // - Prompt templates/versioning
141-
** // - Guardrails (PII detection, content moderation)
142-
** // - Multi-region active-active deployment
143-
** // - Metrics export to external systems
144-
** // - Budget alerts/notifications
137+
// * // PLACEHOLDER: List current limitations, for example:
138+
// ** // - Custom model deployments (Azure OpenAI BYOK, AWS Bedrock custom models)
139+
// ** // - Response caching
140+
// ** // - Prompt templates/versioning
141+
// ** // - Guardrails (PII detection, content moderation)
142+
// ** // - Multi-region active-active deployment
143+
// ** // - Metrics export to external systems
144+
// ** // - Budget alerts/notifications
145145

146146
== Next steps
147147

modules/ai-agents/pages/ai-gateway/gateway-quickstart.adoc

Lines changed: 15 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -16,23 +16,18 @@ Before starting, ensure you have:
1616

1717
* Access to the AI Gateway UI (provided by your administrator)
1818
* Admin permissions to configure providers and gateways
19-
* API key for at least one LLM provider (OpenAI or Anthropic)
19+
* API key for at least one LLM provider (OpenAI, Anthropic, or Google AI)
2020
* Python 3.8+, Node.js 18+, or cURL (for testing)
2121

2222
== Configure a provider
2323

2424
Providers represent upstream LLM services (OpenAI, Anthropic) and their associated credentials. Providers are disabled by default and must be enabled explicitly.
2525

2626
. In AI Gateways, navigate to *Providers*.
27-
. Select a provider (for example, OpenAI or Anthropic).
28-
. On the *Configuration* tab, click *Add configuration* and enter your API Key.
27+
. Select a provider (for example, OpenAI, Anthropic, Google AI).
28+
. On the Configuration tab, click *Add configuration* and enter your API Key.
2929
. Verify the provider status shows "Active".
3030

31-
AI Gateway currently supports:
32-
33-
* OpenAI
34-
* Anthropic
35-
3631
== Enable models
3732

3833
After enabling a provider, enable the specific models you want to make available through your gateways.
@@ -60,9 +55,9 @@ A gateway is a logical configuration boundary that defines routing policies, rat
6055
. Click *Create Gateway*.
6156
. Configure the gateway:
6257
+
63-
* *Name*: Choose a descriptive name (for example, `my-first-gateway`)
64-
* *Workspace*: Select a workspace (conceptually similar to a resource group)
65-
* *Description*: Optional metadata for documentation
58+
* Name: Choose a descriptive name (for example, `my-first-gateway`)
59+
* Workspace: Select a workspace (conceptually similar to a resource group)
60+
* Description: Optional metadata for documentation
6661

6762
. After creation, copy the *Gateway Endpoint* and *Gateway ID* from the gateway detail page. You'll need these for sending requests.
6863

@@ -72,11 +67,11 @@ Gateway Endpoint: https://gw.ai.panda.com
7267
Gateway ID: gw_abc123...
7368
----
7469

75-
Common gateway patterns:
70+
Common gateway patterns include the following:
7671

77-
* *Environment separation*: Create separate gateways for staging and production
78-
* *Team isolation*: One gateway per team for budget tracking
79-
* *Customer multi-tenancy*: One gateway per customer for isolated policies
72+
* Environment separation: Create separate gateways for staging and production
73+
* Team isolation: One gateway per team for budget tracking
74+
* Customer multi-tenancy: One gateway per customer for isolated policies
8075

8176
== Send your first request
8277

@@ -193,10 +188,10 @@ Expected response:
193188

194189
If your request fails, check these common issues:
195190

196-
* *401 Unauthorized*: Verify your API key is valid
197-
* *404 Not Found*: Confirm the base URL matches your gateway endpoint
198-
* *Model not found*: Ensure the model is enabled in Step 2
199-
* *Missing rp-aigw-id*: Add the gateway ID header to your request
191+
* 401 Unauthorized: Verify your API key is valid
192+
* 404 Not Found: Confirm the base URL matches your gateway endpoint
193+
* Model not found: Ensure the model is enabled in Step 2
194+
* Missing rp-aigw-id: Add the gateway ID header to your request
200195

201196
== Verify in observability dashboard
202197

@@ -290,7 +285,7 @@ To add external tools (for example, Slack, GitHub), add their MCP server endpoin
290285

291286
=== Deferred tool loading
292287

293-
When many tools are aggregated, listing all tools upfront can consume significant tokens. With *deferred tool loading*, the MCP gateway initially returns only:
288+
When many tools are aggregated, listing all tools upfront can consume significant tokens. With deferred tool loading, the MCP gateway initially returns only:
294289

295290
* A tool search capability
296291
* The MCP orchestrator
Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,6 @@
11
= AI Gateway
2-
:description: Unified access layer for LLM providers and AI tools with centralized routing, policy enforcement, cost management, and observability.
2+
:description: Learn about the unified access layer for LLM providers and AI tools with centralized routing, policy enforcement, cost management, and observability.
33
:page-layout: index
44
:personas: platform_admin, app_developer, evaluator
55

6-
include::ai-agents:partial$ai-gateway-byoc-note.adoc[]
7-
8-
Redpanda AI Gateway provides a unified access layer for LLM providers and AI tools that sits between your applications and the AI services they use. It delivers centralized routing, policy enforcement, cost management, and observability for all your AI traffic.
6+
include::ai-agents:partial$ai-gateway-byoc-note.adoc[]

modules/ai-agents/pages/ai-gateway/mcp-aggregation-guide.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ MCP aggregation benefits:
2020
2121
== What is MCP?
2222

23-
*Model Context Protocol (MCP)* is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
23+
Model Context Protocol (MCP) is a standard for exposing tools (functions) that AI agents can discover and invoke. MCP servers provide tools like:
2424

2525
* Database queries
2626
* File system operations

0 commit comments

Comments
 (0)