
Handle incomplete/failed streaming responses and expose usage in metadata.#122

Open
homin-bt wants to merge 4 commits into main from issue115
Conversation

@homin-bt

@homin-bt homin-bt commented Apr 28, 2026

Previously, the streaming response parser only processed response.completed events, silently dropping terminal events for truncated or failed responses. This meant that when a streaming call hit max_output_tokens or encountered a server error, no span metadata was recorded.

Changes:

  • trace/contrib/openai/responses.go: Extend parseStreamingResponse to handle response.incomplete and response.failed events in addition to response.completed. Add "status" and "usage" to the metadata fields captured in handleResponseCompletedMessage, so token counts and terminal status are always visible in the Braintrust UI.
  • trace/contrib/openai/responses_test.go: Unit tests covering all three terminal event types, verifying status and usage appear in span metadata.
  • examples/openai/main.go: Update to use the Responses API (replacing the Chat Completions API), with explicit handling for completed, incomplete, and failed status.

Resolves #115
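The parsing change described above can be sketched as follows. This is a minimal illustration, not the actual `parseStreamingResponse` implementation: the function name `extractMetadata` is hypothetical, and the `{"response": {"status": ..., "usage": ...}}` payload shape is assumed from the OpenAI Responses API streaming event format.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// terminalEvents lists the streaming event types that end a Responses API
// stream. Previously only response.completed was handled, so incomplete
// and failed streams recorded no span metadata.
var terminalEvents = map[string]bool{
	"response.completed":  true,
	"response.failed":     true,
	"response.incomplete": true,
}

// extractMetadata pulls "status" and "usage" out of a terminal event
// payload so they can be attached to the span. The payload shape
// ({"response": {...}}) is an assumption based on the Responses API docs.
func extractMetadata(msgType string, raw []byte) (map[string]any, bool) {
	if !terminalEvents[msgType] {
		return nil, false
	}
	var payload struct {
		Response map[string]json.RawMessage `json:"response"`
	}
	if err := json.Unmarshal(raw, &payload); err != nil {
		return nil, false
	}
	meta := map[string]any{}
	for _, field := range []string{"status", "usage"} {
		if v, ok := payload.Response[field]; ok {
			var decoded any
			if json.Unmarshal(v, &decoded) == nil {
				meta[field] = decoded
			}
		}
	}
	return meta, true
}

func main() {
	raw := []byte(`{"response":{"status":"incomplete","usage":{"output_tokens":16}}}`)
	meta, ok := extractMetadata("response.incomplete", raw)
	fmt.Println(ok, meta["status"]) // true incomplete
}
```

With a terminal event like `response.incomplete`, the token counts and terminal status become visible even though no `response.completed` event ever arrives.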

@homin-bt homin-bt marked this pull request as ready for review April 28, 2026 14:53
@homin-bt homin-bt changed the title Issue115 Handle incomplete/failed streaming responses and expose usage in metadata. Apr 28, 2026
Comment thread trace/contrib/openai/responses_test.go Outdated
func TestResponsesIncompleteStreaming(t *testing.T) {
rt, exporter := newTestResponsesTracer(t)

sseBody := `event: response.output_text.delta
Contributor


I think rather than having a strict unit test for parsing the SSE stream, we should try to make a real request to the server (using VCR, etc.) that reproduces this case (e.g. with a very low max_output_tokens). Look at the other tests to see the pattern to follow.

@@ -181,6 +180,7 @@ func (rt *responsesTracer) handleResponseCompletedMessage(span trace.Span, rawMs
metadataFields := []string{
Member


Now that we are collecting response.incomplete, I think we should add incomplete_details as a field here as well.

// parse the other messages too?
if msgType == "response.completed" {
switch msgType {
case "response.completed", "response.failed", "response.incomplete":
Member


For the response.failed case, can we capture the error somehow?
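One way the error could be captured (a sketch only: the helper name `extractError` is hypothetical, and the `{"response": {"error": {"code", "message"}}}` shape is an assumption based on the OpenAI Responses API streaming documentation for failed responses):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// extractError pulls the error code and message out of a response.failed
// event payload, if present, so it can be recorded as span metadata.
// The nested "response.error" shape is an assumption, not confirmed
// against the tracer's actual payloads.
func extractError(raw []byte) (string, bool) {
	var payload struct {
		Response struct {
			Error *struct {
				Code    string `json:"code"`
				Message string `json:"message"`
			} `json:"error"`
		} `json:"response"`
	}
	if err := json.Unmarshal(raw, &payload); err != nil || payload.Response.Error == nil {
		return "", false
	}
	return fmt.Sprintf("%s: %s", payload.Response.Error.Code, payload.Response.Error.Message), true
}

func main() {
	raw := []byte(`{"response":{"status":"failed","error":{"code":"server_error","message":"boom"}}}`)
	msg, ok := extractError(raw)
	fmt.Println(ok, msg) // true server_error: boom
}
```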



Development

Successfully merging this pull request may close these issues.

[bot] OpenAI Responses API streaming tracer ignores response.failed and response.incomplete terminal events

3 participants