
feat: OpenAI responses create instrumentation#4474

Open
eternalcuriouslearner wants to merge 22 commits into open-telemetry:main from eternalcuriouslearner:feat/openai-responses-create-instrumentation-first-part

Conversation

@eternalcuriouslearner
Contributor

Description

This PR adds instrumentation around the OpenAI Responses API's create method.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce them. Please also list any relevant details for your test configuration.

  • Created VCR-based tests to verify span creation for the Responses API's create function.

Does This PR Require a Core Repo Change?

  • Yes. - Link to PR:
  • No.

Checklist:

See contributing.md for styleguide, changelog guidelines, and more.

  • Followed the style guidelines of this project
  • Changelogs have been updated
  • Unit tests have been added
  • Documentation has been updated


Copilot AI left a comment


Pull request overview

This PR adds OpenTelemetry instrumentation for the OpenAI Responses API create method (sync + streaming) using the TelemetryHandler inference invocation lifecycle, along with VCR-based tests to validate spans/log behavior and response attribute extraction.

Changes:

  • Add patching for openai.resources.responses.responses.Responses.create to emit inference spans and capture request/response attributes.
  • Extend response extraction to cover request params, finish reasons, tool calls, reasoning parts, and cache token usage.
  • Add a comprehensive test_responses.py suite plus VCR cassettes and supporting test utilities/fixtures.
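
The expanded output/finish-reason parsing can be sketched roughly as follows. This is an illustrative helper, not the PR's actual extractor: the item types ("message", "function_call", "reasoning") follow the Responses API's output item shapes, but the helper name and the exact reason strings are assumptions.

```python
# Hypothetical sketch of finish-reason aggregation over Responses output
# items; not the PR's response_extractors.py implementation.
def aggregate_finish_reasons(output_items):
    """Map each output item type to a finish reason and collect them in order."""
    mapping = {
        "message": "stop",            # plain assistant message
        "function_call": "tool_calls",  # model requested a tool invocation
        "reasoning": None,            # reasoning parts do not terminate generation
    }
    reasons = []
    for item in output_items:
        reason = mapping.get(item.get("type"))
        if reason is not None:
            reasons.append(reason)
    return reasons
```

For example, a response containing a message, a reasoning part, and a function call would aggregate to ["stop", "tool_calls"], with the reasoning part contributing no finish reason.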

Reviewed changes

Copilot reviewed 31 out of 31 changed files in this pull request and generated 1 comment.

Show a summary per file

  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/src/opentelemetry/instrumentation/openai_v2/patch_responses.py: New wrapper for Responses.create that starts/stops/fails inference invocations and wraps streaming responses.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/src/opentelemetry/instrumentation/openai_v2/__init__.py: Registers/unregisters the Responses create wrapper when the latest experimental semconv mode is enabled and the SDK module exists.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/src/opentelemetry/instrumentation/openai_v2/response_extractors.py: Adds request attribute handling, inference creation kwargs, and expanded output/finish-reason parsing (tool calls + reasoning).
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/src/opentelemetry/instrumentation/openai_v2/response_wrappers.py: Updates the stream wrapper lifecycle to use invocation stop()/fail() instead of handler callbacks; adjusts event handling and parse().
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/test_responses.py: New test suite validating spans/logs for Responses create across streaming, errors, tool calls, reasoning tokens, and content capture modes.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/test_response_extractors.py: Updates extractor tests for new tool-call and reasoning output item mappings and finish-reason aggregation.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/test_response_wrappers.py: Updates wrapper tests to reflect invocation stop()/fail() API expectations.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/test_utils.py: Adds a Responses tool definition helper and cache-token attribute assertions.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/conftest.py: Adds an instrument_event_only fixture to exercise event-only content capture behavior.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/cassettes/*.yaml: Adds VCR recordings for the new Responses tests.
  • instrumentation-genai/opentelemetry-instrumentation-openai-v2/CHANGELOG.md: Documents the new Responses create instrumentation feature under Unreleased.
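
The start/stop/fail invocation lifecycle that patch_responses.py and response_wrappers.py are described as using can be illustrated with a minimal, self-contained sketch. The names InferenceInvocation and wrap_create are hypothetical stand-ins, and returning the invocation from the wrapper is done here only for illustration; the real wrapper records telemetry through the TelemetryHandler instead.

```python
# Illustrative sketch only: not the opentelemetry-instrumentation-openai-v2 API.
import functools

class InferenceInvocation:
    """Tracks the start/stop/fail lifecycle of one inference call."""
    def __init__(self):
        self.state = "started"

    def stop(self):
        self.state = "stopped"

    def fail(self, error):
        self.state = f"failed: {error}"

def wrap_create(wrapped):
    """Wrap a create() call so the invocation is always finalized."""
    @functools.wraps(wrapped)
    def wrapper(*args, **kwargs):
        invocation = InferenceInvocation()
        try:
            result = wrapped(*args, **kwargs)
        except Exception as exc:
            invocation.fail(exc)  # finalize as an error, then re-raise
            raise
        invocation.stop()  # finalize as a success
        return result, invocation
    return wrapper
```

For a streaming call, the wrapper would instead hand the still-open invocation to a stream wrapper, which calls stop() or fail() when the stream is exhausted, errors, or is closed by the caller.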


Copilot AI left a comment


Pull request overview

Copilot reviewed 34 out of 34 changed files in this pull request and generated no new comments.


Copilot AI left a comment


Pull request overview

Copilot reviewed 34 out of 34 changed files in this pull request and generated 1 comment.

Comment on lines +474 to +491
@pytest.mark.vcr()
def test_responses_create_streaming_delegates_response_attribute(
    request, openai_client, instrument_no_content
):
    _skip_if_not_latest()

    stream = openai_client.responses.create(
        model=DEFAULT_MODEL,
        instructions=SYSTEM_INSTRUCTIONS,
        input="Say hi.",
        stream=True,
    )

    assert stream.response is not None
    assert stream.response.status_code == 200
    assert stream.response.headers.get("x-request-id") is not None
    stream.close()


Copilot AI Apr 24, 2026


The streaming test that closes the stream immediately (test_responses_create_streaming_delegates_response_attribute) doesn’t assert any telemetry outcome. To satisfy the GenAI instrumentation test requirements (stream closed early by the caller), please assert that exactly one span is finished and that it is finalized without being marked as an error (and/or clearly document the expected behavior for early-close).

Copilot generated this review using guidance from repository custom instructions.
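
The assertion the reviewer requests could be sketched as a small helper along these lines. The helper name and the span shape (an object exposing a status attribute) are assumptions for illustration; in the actual test it would run after stream.close(), against the spans returned by an in-memory exporter fixture.

```python
# Hypothetical sketch of the early-close telemetry check; not code from this
# PR or the OpenTelemetry SDK.
from types import SimpleNamespace

def assert_single_non_error_span(finished_spans):
    """An early-closed stream should finalize exactly one span, not as an error."""
    assert len(finished_spans) == 1, "expected exactly one finished span"
    assert finished_spans[0].status != "ERROR", (
        "early close must not mark the span as an error"
    )

# Example: one finished, non-error span passes the check.
assert_single_non_error_span([SimpleNamespace(status="OK")])
```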


3 participants