You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
:learning-objective-1: Explain how traces and spans capture execution flow
6
-
:learning-objective-2: Interpret trace structure for debugging and monitoring
7
-
:learning-objective-3: Distinguish between observability traces and audit logs
5
+
:learning-objective-1: Explain how transcripts and spans capture execution flow
6
+
:learning-objective-2: Interpret transcript structure for debugging and monitoring
7
+
:learning-objective-3: Distinguish between transcripts and audit logs
8
8
9
-
Redpanda automatically captures execution traces for both AI agents and MCP servers, providing complete observability into how your agentic systems operate.
9
+
Redpanda automatically captures execution transcripts for both AI agents and MCP servers, providing complete observability into how your agentic systems operate.
10
10
11
11
After reading this page, you will be able to:
12
12
@@ -27,7 +27,7 @@ Transcripts capture:
27
27
* Error conditions
28
28
* Performance metrics
29
29
30
-
With 100% sampling, every operation is captured, creating complete transcripts that you can use for debugging, monitoring, and performance analysis.
30
+
With 100% sampling, every operation is captured, enabling comprehensive debugging, monitoring, and performance analysis.
31
31
32
32
== Traces and spans
33
33
@@ -37,13 +37,13 @@ OpenTelemetry traces provide a complete picture of how a request flows through y
37
37
* A _span_ represents a single unit of work within that trace (such as a data processing operation or an external API call).
38
38
* A trace contains one or more spans organized hierarchically, showing how operations relate to each other.
39
39
40
-
== Agent trace hierarchy
40
+
== Agent transcript hierarchy
41
41
42
42
Agent executions create a hierarchy of spans that reflect how agents process requests. Understanding this hierarchy helps you interpret agent behavior and identify where issues occur.
43
43
44
44
=== Agent span types
45
45
46
-
Agent traces contain these span types:
46
+
Agent transcripts contain these span types:
47
47
48
48
[cols="2,3,3", options="header"]
49
49
|===
@@ -90,13 +90,13 @@ This shows:
90
90
91
91
Examine span durations to identify where time is spent and optimize accordingly.
92
92
93
-
== MCP server trace hierarchy
93
+
== MCP server transcript hierarchy
94
94
95
-
MCP server executions create a different hierarchy that reflects tool invocations and internal processing. Understanding this hierarchy helps you debug tool execution and identify performance bottlenecks.
95
+
MCP server tool invocations produce a different span hierarchy focused on tool execution and internal processing. This structure reveals performance bottlenecks and helps debug tool-specific issues.
96
96
97
97
=== MCP server span types
98
98
99
-
MCP server traces contain these span types:
99
+
MCP server transcripts contain these span types:
100
100
101
101
[cols="2,3,3", options="header"]
102
102
|===
@@ -143,13 +143,13 @@ This shows:
143
143
144
144
The majority of time (4+ seconds) is spent in tool execution, while internal processing (mapping) takes only microseconds. This indicates the tool itself (likely making external API calls or database queries) is the bottleneck, not Redpanda Connect's internal processing.
145
145
146
-
== Trace layers and scope
146
+
== Transcript layers and scope
147
147
148
-
Traces contain multiple layers of instrumentation, from HTTP transport through application logic to external service calls. The `scope.name` field in each span identifies which layer of instrumentation created that span.
148
+
Transcripts contain multiple layers of instrumentation, from HTTP transport through application logic to external service calls. The `scope.name` field in each span identifies which instrumentation layer created that span.
149
149
150
150
=== Instrumentation layers
151
151
152
-
A complete agent trace includes these layers:
152
+
A complete agent transcript includes these layers:
153
153
154
154
[cols="2,2,4", options="header"]
155
155
|===
@@ -178,7 +178,7 @@ A complete agent trace includes these layers:
178
178
179
179
=== How layers connect
180
180
181
-
Layers connect through parent-child relationships in a single trace:
181
+
Layers connect through parent-child relationships in a single transcript:
182
182
183
183
----
184
184
ai-agent-http-server (HTTP Server layer)
@@ -191,7 +191,7 @@ ai-agent-http-server (HTTP Server layer)
191
191
└── chat gpt-5-nano (AI SDK layer, LLM call 2)
192
192
----
193
193
194
-
This shows:
194
+
The request flow demonstrates:
195
195
196
196
1. HTTP request arrives at agent
197
197
2. Agent invokes sub-agent
@@ -201,9 +201,9 @@ This shows:
201
201
6. Agent makes second LLM call with tool results
202
202
7. Response returns through HTTP layer
203
203
204
-
=== Cross-service traces
204
+
=== Cross-service transcripts
205
205
206
-
When agents call MCP tools, the trace spans multiple services. Each service has a different `service.name` in the resource attributes:
206
+
When agents call MCP tools, the transcript spans multiple services. Each service has a different `service.name` in the resource attributes:
207
207
208
208
* Agent spans: `"service.name": "ai-agent"`
209
209
* MCP server spans: `"service.name": "mcp-{server-id}"`
@@ -240,9 +240,9 @@ Redpanda Connect layer:
240
240
241
241
- Component-specific attributes from your tool configuration
242
242
243
-
Use `scope.name` to filter spans by layer when analyzing traces.
243
+
Use `scope.name` to filter spans by layer when analyzing transcripts.
244
244
245
-
== Understand the trace structure
245
+
== Understand the transcript structure
246
246
247
247
Each span captures a unit of work. Here's what a typical MCP tool invocation looks like:
248
248
@@ -273,7 +273,7 @@ Key elements to understand:
273
273
274
274
=== Parent-child relationships
275
275
276
-
Traces show how operations relate. A tool invocation (parent) may trigger internal operations (children):
276
+
Transcripts show how operations relate. A tool invocation (parent) may trigger internal operations (children):
277
277
278
278
[,json]
279
279
----
@@ -289,9 +289,9 @@ Traces show how operations relate. A tool invocation (parent) may trigger intern
289
289
290
290
The `parentSpanId` links this child span to the parent tool invocation. Both share the same `traceId` so you can reconstruct the complete operation.
291
291
292
-
== Error events in traces
292
+
== Error events in transcripts
293
293
294
-
When something goes wrong, traces capture error details:
294
+
When something goes wrong, transcripts capture error details:
The `events` array captures what happened and when. Use `timeUnixNano` to see exactly when the error occurred within the operation.
315
315
316
316
[[opentelemetry-traces-topic]]
317
-
== How Redpanda stores traces
317
+
== How Redpanda stores trace data
318
318
319
-
The `redpanda.otel_traces` topic stores OpenTelemetry spans in JSON format, following the https://opentelemetry.io/docs/specs/otel/protocol/[OpenTelemetry Protocol (OTLP)^] specification. A Protobuf schema named `redpanda.otel_traces-value` is also automatically registered with the topic, enabling clients to deserialize trace data correctly.
319
+
The `redpanda.otel_traces` topic stores OpenTelemetry spans using Redpanda's Schema Registry wire format with a custom Protobuf schema that closely follows the https://opentelemetry.io/docs/specs/otel/protocol/[OpenTelemetry Protocol (OTLP)^] specification. A schema named `redpanda.otel_traces-value` is automatically registered in the Schema Registry with the topic, enabling clients to deserialize trace data correctly.
320
320
321
-
The `redpanda.otel_traces` topic and its schema are managed automatically by Redpanda. If you delete either the topic or the schema, they are recreated automatically. However, deleting the topic permanently deletes all trace data, and the topic comes back empty. Do not produce your own data to this topic. It is reserved for OpenTelemetry traces.
321
+
Redpanda manages both the topic and its schema automatically. If you delete either the topic or the schema, they are recreated automatically. However, deleting the topic permanently deletes all trace data, and the topic comes back empty. Do not produce your own data to this topic. It is reserved for OpenTelemetry traces.
322
322
323
323
=== Topic configuration and lifecycle
324
324
325
325
The `redpanda.otel_traces` topic has a predefined retention policy. Configuration changes to this topic are not supported. If you modify settings, Redpanda reverts them to the default values.
326
326
327
327
The topic persists in your cluster even after all agents and MCP servers are deleted, allowing you to retain historical trace data for analysis.
328
328
329
-
Trace data may contain sensitive information from your tool inputs and outputs. Consider implementing appropriate glossterm:ACL[access control lists (ACLs)] for the `redpanda.otel_traces` topic, and review the data in traces before sharing or exporting to external systems.
329
+
Transcripts may contain sensitive information from your tool inputs and outputs. Consider implementing appropriate glossterm:ACL[access control lists (ACLs)] for the `redpanda.otel_traces` topic, and review the data in transcripts before sharing or exporting to external systems.
330
330
331
-
== Traces compared to audit logs
331
+
== Transcripts compared to audit logs
332
332
333
-
OpenTelemetry traces are designed for observability and debugging, not audit logging or compliance.
333
+
Transcripts are designed for observability and debugging, not audit logging or compliance.
334
334
335
-
Traces provide:
335
+
Transcripts provide:
336
336
337
337
* Hierarchical view of request flow through your system (parent-child span relationships)
338
338
* Detailed timing information for performance analysis
339
339
* Ability to reconstruct execution paths and identify bottlenecks
340
340
* Insights into how operations flow through distributed systems
341
341
342
-
Traces are not:
342
+
Transcripts are not:
343
343
344
344
* Immutable audit records for compliance purposes
345
345
* Designed for "who did what" accountability tracking
0 commit comments