You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Remove maxContextWindowTokens and refine token limit docs
Remove maxContextWindowTokens from all SDKs - it is an internal runtime
fallback that should not be exposed as public SDK API.
Refine doc comments for maxOutputTokens and maxPromptTokens to explain
what happens when each limit is hit:
- maxOutputTokens: sent as max_tokens per LLM request; model stops
generating and returns a truncated response when hit.
- maxPromptTokens: used by the runtime to trigger conversation
compaction before sending a request when the prompt exceeds this limit.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
0 commit comments