feat(ai): expose inputMessages and partialText in onAbort callback by Hovo-Dev · Pull Request #14141 · vercel/ai

Hovo-Dev · 2026-04-05T09:37:52Z

Background

When a streamText call is aborted — via AbortSignal, the user stopping generation, or a dropped connection — the onAbort callback fires but receives no usage data. onFinish never fires on abort at all. This makes it impossible to track token consumption for billing or quota purposes when streams don't complete naturally.

Summary

packages/ai/src/generate-text/stream-text.ts

Added usage?: LanguageModelUsage and totalUsage?: LanguageModelUsage to StreamTextOnAbortCallback
Added inputMessages: ModelMessage[], the full message list passed to the model for the aborted step (initial messages + completed prior step tool call/result pairs)
Added partialText?: string — the text actively being streamed at abort time; undefined if no text was generated; resets at each step boundary
Introduced currentStepUsage, completedStepsUsage, currentStepInputMessages, and pullLevelTextContent as outer-scope tracking variables
Wired all fields into the existing abort() function. If the provider had not yet sent usage data before the abort, usage is undefined (no estimation is applied)
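
The tracking described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in stream-text.ts: the Usage and Message types are simplified stand-ins for LanguageModelUsage and ModelMessage, and the AbortTracker class, its method names, and currentStepText are hypothetical names that only loosely mirror the outer-scope variables listed in the summary.

```typescript
// Simplified stand-ins for the SDK types (assumptions for this sketch).
type Usage = { inputTokens: number; outputTokens: number; totalTokens: number };
type Message = { role: string; content: unknown };

type AbortSnapshot = {
  usage: Usage | undefined;
  totalUsage: Usage | undefined;
  inputMessages: Message[];
  partialText: string | undefined;
};

// Adds two usage records; undefined means "nothing reported yet".
function addUsage(a: Usage | undefined, b: Usage | undefined): Usage | undefined {
  if (a === undefined) return b;
  if (b === undefined) return a;
  return {
    inputTokens: a.inputTokens + b.inputTokens,
    outputTokens: a.outputTokens + b.outputTokens,
    totalTokens: a.totalTokens + b.totalTokens,
  };
}

// Hypothetical tracker holding the per-step state that an abort handler reads.
class AbortTracker {
  private completedStepsUsage: Usage | undefined = undefined;
  private currentStepUsage: Usage | undefined = undefined;
  private currentStepInputMessages: Message[] = [];
  private currentStepText = '';

  // Called at each step boundary: text and step usage reset here.
  startStep(inputMessages: Message[]): void {
    this.currentStepInputMessages = inputMessages;
    this.currentStepUsage = undefined;
    this.currentStepText = '';
  }

  onTextDelta(delta: string): void {
    this.currentStepText += delta;
  }

  // Called when the provider reports usage for the current step.
  onUsage(usage: Usage): void {
    this.currentStepUsage = usage;
  }

  // Folds the finished step into the running total.
  finishStep(): void {
    this.completedStepsUsage = addUsage(this.completedStepsUsage, this.currentStepUsage);
    this.currentStepUsage = undefined;
    this.currentStepText = '';
  }

  // What an abort handler would pass to onAbort at this moment.
  snapshot(): AbortSnapshot {
    return {
      usage: this.currentStepUsage,
      totalUsage: addUsage(this.completedStepsUsage, this.currentStepUsage),
      inputMessages: this.currentStepInputMessages,
      partialText: this.currentStepText === '' ? undefined : this.currentStepText,
    };
  }
}
```

In this sketch usage stays undefined until the provider reports it, and totalUsage is the sum of completed steps plus whatever the current step has reported so far.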

packages/openai/src/chat/openai-chat-language-model.ts

OpenAI's finish chunk (which carries real usage) was only emitted in flush(), which never runs on abort
Now emits the finish chunk as soon as usage data arrives in the stream transform(), with a finishSent guard to prevent double-emission in flush()
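
A rough sketch of the early-emission idea, assuming simplified chunk and stream-part shapes (the real provider types carry more fields). The createUsageForwardingTransformer name and the Controller interface are hypothetical; the returned object only mirrors the transform()/flush() pair of a stream transformer so that it can be exercised without a stream runtime.

```typescript
// Simplified shapes (assumptions for this sketch).
type OpenAIChunk = {
  choices: Array<{ delta?: { content?: string } | null; finish_reason?: string | null }>;
  usage?: { prompt_tokens: number; completion_tokens: number; total_tokens: number } | null;
};

type StreamPart =
  | { type: 'text-delta'; delta: string }
  | {
      type: 'finish';
      finishReason: string;
      usage?: { inputTokens: number; outputTokens: number; totalTokens: number };
    };

// Minimal controller: only the enqueue() method is needed here.
interface Controller {
  enqueue(part: StreamPart): void;
}

function createUsageForwardingTransformer() {
  let finishReason = 'unknown';
  let finishSent = false; // guards against a second finish part in flush()

  return {
    transform(chunk: OpenAIChunk, controller: Controller): void {
      const choice = chunk.choices[0];

      if (choice?.finish_reason != null) {
        finishReason = choice.finish_reason;
      }
      if (choice?.delta?.content) {
        controller.enqueue({ type: 'text-delta', delta: choice.delta.content });
      }

      // Emit finish as soon as usage arrives, instead of waiting for flush().
      if (chunk.usage != null && !finishSent) {
        finishSent = true;
        controller.enqueue({
          type: 'finish',
          finishReason,
          usage: {
            inputTokens: chunk.usage.prompt_tokens,
            outputTokens: chunk.usage.completion_tokens,
            totalTokens: chunk.usage.total_tokens,
          },
        });
      }
    },

    // Runs only on natural completion; never on abort.
    flush(controller: Controller): void {
      if (!finishSent) {
        finishSent = true;
        controller.enqueue({ type: 'finish', finishReason });
      }
    },
  };
}
```

If the stream is aborted after the usage chunk but before flush(), the finish part has already been enqueued, which is the property the change relies on.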

content/docs/07-reference/01-ai-sdk-core/02-stream-text.mdx

Documented usage, totalUsage, inputMessages, and partialText fields in the onAbort callback parameters

content/docs/03-ai-sdk-core/50-error-handling.mdx

Updated onAbort section to describe all four new fields

Usage

onAbort({ totalUsage, usage, inputMessages, partialText }) {
  // totalUsage covers every step with reported usage,
  // including the aborted step when usage is defined.
  const reportedTokens = totalUsage?.totalTokens ?? 0;

  // If the provider sent no usage for the aborted step, estimate it with your own tokenizer:
  const estimatedTokens = usage?.totalTokens == null
    ? countTokens(inputMessages) + countTokens(partialText ?? '')
    : 0;

  trackUsage(reportedTokens + estimatedTokens);
}
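
For a version that runs on its own, the snippet above can be fleshed out roughly as follows. Everything here is an assumption for illustration: the Usage and Message types are simplified stand-ins for the SDK types, countTokens is a crude characters-divided-by-four heuristic rather than a real tokenizer, and tokensToTrack is a hypothetical helper name. The sketch also assumes that totalUsage already includes the aborted step whenever usage is defined, as the verification notes describe, so the estimate is added only when usage is missing.

```typescript
// Simplified stand-ins for the SDK types (assumptions for this sketch).
type Usage = { totalTokens?: number };
type Message = { role: string; content: unknown };

type AbortEvent = {
  totalUsage?: Usage;
  usage?: Usage;
  inputMessages: Message[];
  partialText?: string;
};

// Hypothetical tokenizer stand-in: roughly 4 characters per token.
// Swap in a real tokenizer for the model you use.
function countTokens(input: string | Message[]): number {
  const text =
    typeof input === 'string'
      ? input
      : input
          .map(m => (typeof m.content === 'string' ? m.content : JSON.stringify(m.content)))
          .join('\n');
  return Math.ceil(text.length / 4);
}

// Returns the token count to record for an aborted stream.
function tokensToTrack(event: AbortEvent): number {
  // Real usage reported by the provider, summed across steps.
  const reported = event.totalUsage?.totalTokens ?? 0;

  // Estimate the aborted step only when the provider reported nothing for it.
  const estimated =
    event.usage?.totalTokens == null
      ? countTokens(event.inputMessages) + countTokens(event.partialText ?? '')
      : 0;

  return reported + estimated;
}
```

A call such as tokensToTrack({ inputMessages, partialText }) falls back entirely to the estimate, while a call with both usage and totalUsage returns the provider's numbers unchanged.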

Manual Verification

Verified via automated tests that exercise the full stream processing pipeline:

Abort fires before any usage chunk → usage and totalUsage are undefined
Abort fires after the provider sent a usage chunk → usage and totalUsage contain the real token counts
Multi-step stream, abort in step 2 → totalUsage correctly sums step 1 and step 2 usage
Abort before first chunk → inputMessages contains the step's input, partialText is undefined
Abort mid-text → partialText contains the streamed text up to abort
Multi-step abort → inputMessages includes tool call/result pairs from prior steps

All 2200+ existing tests continue to pass.

Checklist

Tests have been added / updated (for bug fixes / features)
Documentation has been added / updated (for bug fixes / features)
A patch changeset for relevant packages has been added (for bug fixes / features - run pnpm changeset in the project root)
I have reviewed this pull request (self-review)

Related Issues

Fixes #7628
Fixes #7805

When a streamText call is aborted, the onAbort callback now receives usage (current step) and totalUsage (all steps combined) containing whatever real token data the provider had reported before the abort. If the provider had not yet sent a usage chunk, the values are undefined — no estimation is used. The OpenAI provider is updated to emit its finish chunk (which carries real usage) as soon as usage data arrives in the stream, rather than waiting for the stream to fully close. This ensures usage is available even when an abort fires before natural completion. Fixes vercel#7628 Fixes vercel#7805

The finish chunk was unreachable — OpenAI sends usage in a chunk with empty choices[], so choice?.delta == null triggered an early return before the emission block. Move the check before the delta guard.
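
The ordering problem can be illustrated with two small functions. This is a simplified sketch, not the provider code: the Chunk type and both function names are hypothetical, and each function just returns the usage it manages to observe for a given chunk.

```typescript
// Simplified chunk shape (assumption for this sketch).
type Chunk = {
  choices: Array<{ delta?: { content?: string } | null }>;
  usage?: { total_tokens: number } | null;
};

// Order before the fix: the delta guard returns early, so a usage-only
// chunk (empty choices[]) never reaches the usage check.
function usageWithDeltaGuardFirst(chunk: Chunk): number | undefined {
  const choice = chunk.choices[0];
  if (choice?.delta == null) {
    return undefined; // early return hides the usage below
  }
  return chunk.usage?.total_tokens;
}

// Order after the fix: usage is checked before the delta guard.
function usageWithUsageCheckFirst(chunk: Chunk): number | undefined {
  if (chunk.usage != null) {
    return chunk.usage.total_tokens;
  }
  const choice = chunk.choices[0];
  if (choice?.delta == null) {
    return undefined;
  }
  return undefined; // a plain text delta carries no usage
}

// A usage-only chunk, as sent at the end of a stream.
const usageOnlyChunk: Chunk = { choices: [], usage: { total_tokens: 42 } };
const seenBeforeFix = usageWithDeltaGuardFirst(usageOnlyChunk);
const seenAfterFix = usageWithUsageCheckFirst(usageOnlyChunk);
```

With the guard first, seenBeforeFix is undefined because choices[] is empty; with the usage check first, seenAfterFix holds the reported total.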

Hovo-Dev · 2026-04-05T10:45:02Z

Hi @gr2m,

Took a look at this and managed to get it working cleanly — happy to share the approach.

The SDK was already holding the token data we needed, it just wasn't being handed off when a stream was aborted. We also noticed that usage from OpenAI arrives mid-stream but was only forwarded at natural close, so we made sure it gets captured as soon as it arrives. That way, whether the stream finishes or gets cut short, the data is there.

Looking forward to your feedback!

Hovo-Dev · 2026-04-11T18:00:48Z

@lgrammel - Could you please take a look?

reactsaas · 2026-04-13T20:40:10Z

Can someone explain: is token usage returned by the model on abort, or is usage or token info available mid-stream?

I'm currently having a problem with AbortSignal: I get no usage at all with a Gemini model.

Hovo-Dev · 2026-04-14T08:54:31Z

@reactsaas Hey, good to see you here! Currently, on abort the callback receives neither usage nor any data (message history or partially generated text) that would let you count tokens manually. That's why you have to accumulate the streamed chunks on the fly with the onChunk callback and then count the tokens yourself. This PR is designed to return the conversation list and the partially generated text, so the client can count tokens directly, without accumulating mid-stream data on the fly and without the boilerplate code.
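
The workaround described above can be sketched like this. The TextChunk shape and the PartialTextBuffer class are hypothetical simplifications; the payload that the SDK's onChunk callback actually delivers has more variants and its field names may differ by version.

```typescript
// Hypothetical, simplified chunk shape (assumption for this sketch).
type TextChunk = { type: 'text-delta'; text: string } | { type: 'other' };

// Accumulates streamed text so it is still available after an abort.
class PartialTextBuffer {
  private text = '';

  // Call this from your chunk handler for every chunk received.
  push(chunk: TextChunk): void {
    if (chunk.type === 'text-delta') {
      this.text += chunk.text;
    }
  }

  // The text streamed so far; pass it to your tokenizer when the stream is aborted.
  get value(): string {
    return this.text;
  }
}

const buffer = new PartialTextBuffer();
buffer.push({ type: 'text-delta', text: 'Hel' });
buffer.push({ type: 'other' });
buffer.push({ type: 'text-delta', text: 'lo' });
```

After these calls buffer.value holds the concatenated text deltas, which is the piece that partialText is meant to supply without this extra state.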

Example usage with the new release:

onAbort({ totalUsage, usage, inputMessages, partialText }) {
  // totalUsage covers every step with reported usage,
  // including the aborted step when usage is defined.
  const reportedTokens = totalUsage?.totalTokens ?? 0;

  // If the provider sent no usage for the aborted step, estimate it with your own tokenizer:
  const estimatedTokens = usage?.totalTokens == null
    ? countTokens(inputMessages) + countTokens(partialText ?? '')
    : 0;

  trackUsage(reportedTokens + estimatedTokens);
}

reactsaas · 2026-04-14T17:08:14Z

Hey, thanks for the answer. I took your approach and am now counting the tokens. 🍪

Hovo-Dev · 2026-04-17T07:06:15Z

@reactsaas Glad to hear that, but I believe you can use it once this PR is merged.

vercel Bot reviewed Apr 5, 2026

Comment thread packages/openai/src/chat/openai-chat-language-model.ts Outdated

Hovo-Dev added 2 commits April 5, 2026 13:56

fix: move early finish emission before delta guard in OpenAI provider

21ede4d

example: update abort example to demonstrate onAbort usage metrics

de7c3a3

Hovo-Dev force-pushed the fix/stream-text-abort-usage branch from 2dfe4e0 to de7c3a3 Compare April 5, 2026 10:13

Hovo-Dev mentioned this pull request Apr 5, 2026

[Feature Request] Token usage unavailable during streaming abort/interruption #7628

Open

feat(ai): expose inputMessages and partialText in onAbort callback

898e2ba

Hovo-Dev changed the title ~~feat(ai): expose usage metrics in onAbort callback~~ feat(ai): expose inputMessages and partialText in onAbort callback Apr 6, 2026

Hovo-Dev and others added 4 commits April 7, 2026 12:47

Merge branch 'main' into fix/stream-text-abort-usage

8a75e7f

Merge branch 'main' into fix/stream-text-abort-usage

45f9f15

Merge branch 'main' into fix/stream-text-abort-usage

c9ed9ad

Merge remote-tracking branch 'upstream/main' into fix/stream-text-abo…

1208802

…rt-usage

Merge branch 'main' into fix/stream-text-abort-usage

e18d09f

Hovo-Dev closed this Apr 18, 2026

Hovo-Dev wants to merge 9 commits into vercel:main from Hovo-Dev:fix/stream-text-abort-usage

2 participants
