[WIP] feat(deepagents): add instrumentation plugin by 123liuziming · Pull Request #190 · alibaba/loongsuite-python

123liuziming · 2026-05-21T01:45:12Z

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Test A

Does This PR Require a Core Repo Change?

Yes. - Link to PR:
No.

Checklist:

See contributing.md for styleguide, changelog guidelines, and more.

Followed the style guidelines of this project
Changelogs have been updated
Unit tests have been added
Documentation has been updated

CLAassistant · 2026-05-21T01:45:20Z

All committers have signed the CLA.

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a new loongsuite-instrumentation-deepagents package to instrument the deepagents framework with ENTRY span creation, span enrichment, and GenAI metrics emission.

Changes:

Introduces a DeepAgentsInstrumentor that wraps deepagents.graph.create_deep_agent, installs a LangChain callback enricher, and registers a metrics SpanProcessor.
Adds internal helpers for extracting metadata/messages and for generating ENTRY spans + metrics.
Adds a dedicated test suite plus packaging/docs (pyproject, README) and registers the package in the instrumentation index.

Reviewed changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated 7 comments.


| File | Description |
| --- | --- |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/__init__.py | New instrumentor wiring ENTRY patch, enricher, and metrics processor |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/_entry_patch.py | Wraps `create_deep_agent` and graph methods to create/manage ENTRY spans |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/_enricher.py | LangChain callback handler + callback-manager patch to enrich active spans |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/_metrics_processor.py | SpanProcessor that emits GenAI metrics from finished spans |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/_utils.py | Shared helpers for metadata/version detection and message conversion |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/_attributes.py | Centralized constants for attributes, span kinds, and metric names |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/package.py | Declares supported deepagents versions and metrics support |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/version.py | Declares package version for distribution |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/src/opentelemetry/instrumentation/deepagents/internal/__init__.py | Internal package init |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/tests/conftest.py | Test fixtures for tracing/metrics providers and env setup |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/tests/test_entry_patch.py | Tests ENTRY patch wrapping behavior and span attributes |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/tests/test_enricher.py | Tests span enrichment from callback handler |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/tests/test_metrics.py | Tests metrics processor behavior and warnings |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/pyproject.toml | New distributable project config + dependencies |
| instrumentation-loongsuite/loongsuite-instrumentation-deepagents/README.md | Package documentation and local install instructions |
| instrumentation-loongsuite/README.md | Adds deepagents instrumentation to the index table |


+    async def on_chain_start_async(
+        self,
+        serialized: dict[str, Any],
+        inputs: dict[str, Any],
+        *,
+        run_id: Any,
+        parent_run_id: Any | None = None,
+        tags: list[str] | None = None,
+        metadata: dict[str, Any] | None = None,
+        **kwargs: Any,
+    ) -> Any:
+        del serialized, inputs, run_id, parent_run_id, tags, kwargs
+        self._enrich_agent_or_chain(metadata or {})


+    async def on_tool_start_async(
+        self,
+        serialized: dict[str, Any],
+        input_str: str,
+        *,
+        run_id: Any,
+        parent_run_id: Any | None = None,
+        tags: list[str] | None = None,
+        metadata: dict[str, Any] | None = None,
+        **kwargs: Any,
+    ) -> Any:
+        del input_str, run_id, parent_run_id, tags, metadata
+        self._enrich_tool(serialized, kwargs)


+        labels = {
+            "spanKind": str(span_kind),
+            "modelName": _model_name(attributes),
+        }


+            usage_labels = {
+                "spanKind": SPAN_KIND_LLM,
+                "modelName": labels["modelName"],
+                "usageType": usage_type,
+            }


+)
+
+_logger = logging.getLogger(__name__)
+_processors_by_provider_id: dict[int, "DeepAgentMetricsSpanProcessor"] = {}


+    provider_id = id(tracer_provider)
+    if provider_id in _processors_by_provider_id:
+        return
+    processor = DeepAgentMetricsSpanProcessor(meter_provider=meter_provider)
+    tracer_provider.add_span_processor(processor)
+    _processors_by_provider_id[provider_id] = processor
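
One caveat with keying the registry on `id(tracer_provider)`: in CPython an id can be reused once the original object has been garbage-collected, and the dict shown above does not keep the provider alive. The sketch below shows one possible alternative, a weak-keyed registry guarded by a lock. `FakeProvider` and `install_once` are hypothetical names used only for illustration; this is not the plugin's code, and it assumes the real provider object supports weak references.

```python
import threading
import weakref


class FakeProvider:
    """Hypothetical stand-in for a tracer provider (illustration only)."""

    def __init__(self):
        self.processors = []

    def add_span_processor(self, processor):
        self.processors.append(processor)


_lock = threading.Lock()
# Entries disappear automatically when the provider is garbage-collected,
# so a recycled id() can never match a stale entry.
_processors = weakref.WeakKeyDictionary()


def install_once(provider, factory):
    """Attach one processor per live provider; return True if newly added."""
    with _lock:
        if provider in _processors:
            return False
        processor = factory()
        provider.add_span_processor(processor)
        _processors[provider] = processor
        return True


provider = FakeProvider()
results = [install_once(provider, object), install_once(provider, object)]
```

The second call is a no-op, so repeated `instrument()` calls would attach only one processor per provider.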


+## Local Install
+
+Install the shared GenAI utility from the same source tree first, then install
+the dependent LangChain, LangGraph, and deepagents instrumentations:


Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

ralf0131

Review by github-manager-bot

Summary

New instrumentation for deepagents (built on LangGraph): wraps create_deep_agent and the resulting graph's invoke/ainvoke/stream/astream to produce ENTRY spans, propagates a subagent registry via contextvars, patches the subagent task tool, and adds a langchain-tracer callback + subagent task-context resolution. Substantial and well-tested. Since it touches the shared langchain tracer, please give those changes a careful eye.
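
For readers unfamiliar with the contextvars pattern mentioned in the summary, here is a minimal sketch of scoping a registry to one call. The names `_registry`, `run_with_registry`, and `current_names` are hypothetical and do not come from the PR; the sketch only illustrates the standard set/reset token idiom.

```python
import contextvars

# Hypothetical context variable holding the subagent registry for one run.
_registry = contextvars.ContextVar("subagent_registry", default=None)


def run_with_registry(registry, fn):
    """Make the registry visible to fn, then restore the previous value."""
    token = _registry.set(registry)
    try:
        return fn()
    finally:
        _registry.reset(token)


def current_names():
    """Return the sorted subagent names visible in the current context."""
    registry = _registry.get()
    return sorted(registry) if registry else []


inside = run_with_registry(
    {"researcher": object(), "critic": object()}, current_names
)
outside = current_names()
```

Resetting with the token in `finally` keeps the registry from leaking into unrelated calls made afterwards in the same context.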

Note: PR is marked [WIP]; focusing on design-level feedback that should survive further iteration.

Findings

[Warning] _entry_patch.py — _SubagentRunnableProxy overrides only invoke and ainvoke, while __getattr__ delegates everything else (incl. stream/astream) to the raw runnable. The top-level graph wraps all four methods, so streaming subagent graphs silently miss the ReAct-metadata injection that sync/async subagents get — inconsistent telemetry coverage depending on how the subagent is invoked. Either wrap stream/astream in the proxy too, or drop the proxy in favour of the same _wrap_graph_methods used at the top level.
[Warning] _entry_patch.py — streaming ENTRY span lifecycle. The stream wrappers yield in a try with _finish_entry in except Exception, but no finally. If a consumer abandons a stream early (partial consume, no close()/with), the generator is closed by GeneratorExit, which is a BaseException and is not caught by except Exception → _finish_entry is skipped → the ENTRY span is never stop_entry/fail_entry'd (leak). Add a finally (with a guard to avoid double-finish) so the entry span always ends.
[Info] _entry_patch.py uses module-level mutable flags (_is_entry_patched, _top_level_patched, _is_subagent_task_patched, _handler). Concurrent instrument()/uninstrument() would race; document single-threaded setup at startup or guard with a lock.
[Info] _metrics_processor.py / langchain _tracer.py — _get_deepagents_subagent_task_context iterates list(self._runs.values()) per subagent chain start (O(n) per chain). Bounded and fine for typical traces, but note the cost for very large multi-subagent graphs.
[Info] Truncated (2-line) Apache headers across these files; align with the full header used in e.g. the anthropic plugin for lint consistency.
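
The first finding above can be reproduced in isolation. The sketch below uses hypothetical stand-ins (`FakeRunnable`, `inject`, `PartialProxy`, `FullProxy`), not the actual `_SubagentRunnableProxy` or deepagents API, to show how `__getattr__` delegation lets `stream` bypass the injection that `invoke` receives.

```python
class FakeRunnable:
    """Hypothetical stand-in for a subagent graph; echoes the config it saw."""

    def invoke(self, payload, config=None):
        return {"config": config}

    def stream(self, payload, config=None):
        yield {"config": config}


def inject(config):
    """Hypothetical metadata injection: returns a copy with a marker added."""
    merged = dict(config or {})
    merged["metadata"] = {**merged.get("metadata", {}), "react": True}
    return merged


class PartialProxy:
    """Overrides invoke only; everything else falls through __getattr__."""

    def __init__(self, inner):
        self._inner = inner

    def invoke(self, payload, config=None):
        return self._inner.invoke(payload, inject(config))

    def __getattr__(self, name):
        # Called only for attributes not found on the proxy itself,
        # so stream() reaches the raw runnable without injection.
        return getattr(self._inner, name)


class FullProxy(PartialProxy):
    """Also wraps stream so both call paths inject the same metadata."""

    def stream(self, payload, config=None):
        yield from self._inner.stream(payload, inject(config))


partial_invoke = PartialProxy(FakeRunnable()).invoke({})
partial_stream = list(PartialProxy(FakeRunnable()).stream({}))
full_stream = list(FullProxy(FakeRunnable()).stream({}))
```

With the partial proxy, `invoke` sees the injected metadata while `stream` sees `None`; the full proxy makes the two paths consistent.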

Suggestions

# in the stream generator wrappers
try:
    yield from inner
finally:
    _finish_entry(invocation, token, result=None, exc=current_exc, finished=bool(...))
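
A self-contained version of the same idea, as a sketch only: `finish_entry` and `wrapped_stream` are hypothetical stand-ins for `_finish_entry` and the stream wrappers, and the `state` dict plays the role of the double-finish guard. It shows that `finally` runs when the consumer closes the generator early, whereas `except Exception` alone would not.

```python
events = []


def finish_entry(state, exc=None):
    """Hypothetical stand-in for _finish_entry; records how the span ended."""
    if state["finished"]:
        return  # guard against double-finish
    state["finished"] = True
    events.append("fail" if exc is not None else "stop")


def wrapped_stream(inner, state):
    try:
        yield from inner
    except Exception as exc:
        finish_entry(state, exc)
        raise
    finally:
        # Runs on normal exhaustion, on errors, and on GeneratorExit.
        finish_entry(state)


def failing():
    yield 1
    raise ValueError("boom")


# Case 1: the consumer abandons the stream after one item.
state = {"finished": False}
gen = wrapped_stream(iter([1, 2, 3]), state)
first = next(gen)
gen.close()  # raises GeneratorExit inside the generator

# Case 2: the inner stream fails; the guard prevents a second finish.
state2 = {"finished": False}
try:
    list(wrapped_stream(failing(), state2))
except ValueError:
    pass
```

Whether an abandoned stream should be recorded as a normal stop or as a distinct outcome is a design decision left to the plugin; the sketch simply treats it as a stop.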

Cross-repo Note

No shared API surface with loongsuite-pilot; the langchain-tracer changes are internal to this plugin, no cross-repo change required.

Automated review by github-manager-bot

Copilot AI review requested due to automatic review settings May 21, 2026 01:45

github-actions Bot assigned 123liuziming, Cirilla-zmh and ralf0131 May 21, 2026

github-actions Bot requested review from Cirilla-zmh and ralf0131 May 21, 2026 01:52

Copilot AI reviewed May 21, 2026


Copilot started reviewing on behalf of 123liuziming May 21, 2026 02:18

Copilot AI review requested due to automatic review settings May 21, 2026 03:08

Copilot AI reviewed May 21, 2026

Copilot started reviewing on behalf of 123liuziming May 21, 2026 04:49

observability_dev_agent and others added 4 commits May 27, 2026 14:32

feat(deepagents): add instrumentation plugin

fe2dbbc

fix(deepagents): repair agent semantics

838e3d2

fix(deepagents): repair subagent agent spans

f023588

fix(deepagents): register scoped LoongSuite CI

651c3a3

sipercai force-pushed the feat/deepagents branch from 102008d to 651c3a3 Compare May 27, 2026 07:10

ralf0131 reviewed Jun 17, 2026

123liuziming wants to merge 4 commits into main from feat/deepagents


6 participants
