<p>Ship agents reliably. Parse OTLP streams and Jaeger JSON traces, then evaluate against golden eval sets using ADK's evaluation framework.</p>
+
<p>Benchmark your agents before they hit production. AgentEvals scores performance and inference quality from OpenTelemetry traces — no re-runs, no guesswork.</p>