fix(copilot): lower static Claude context limits and expose them to Claude Code
The Copilot API enforces per-account prompt token limits (128K individual,
168K business) that are lower than the total context window (200K). When
the dynamic /models API fetch fails or returns no capabilities.limits,
the static fallback of 200K exceeds the real enforced limit, causing
intermittent "prompt token count exceeds the limit" errors.
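
The failure mode above can be sketched in Go as follows. This is a minimal illustration, not the actual code: only defaultCopilotContextLength is named in this commit; the modelLimits type, its MaxPromptTokens field, and resolveContextLength are hypothetical names chosen for the example.

```go
package main

import "fmt"

// defaultCopilotContextLength is the conservative fallback named in this
// commit (128K, the individual-account prompt limit).
const defaultCopilotContextLength = 128000

// modelLimits stands in for the capabilities.limits object returned by the
// /models API. The type and field name are assumptions for illustration.
type modelLimits struct {
	MaxPromptTokens int
}

// resolveContextLength prefers the dynamic limit when the API supplied one
// and otherwise falls back to the static default.
func resolveContextLength(limits *modelLimits) int {
	if limits != nil && limits.MaxPromptTokens > 0 {
		return limits.MaxPromptTokens
	}
	return defaultCopilotContextLength
}

func main() {
	// Fetch failed or returned no limits: use the static fallback.
	fmt.Println(resolveContextLength(nil))
	// Business account: the dynamic limit overrides the fallback.
	fmt.Println(resolveContextLength(&modelLimits{MaxPromptTokens: 168000}))
}
```

With a 200000 fallback, the first call would advertise more tokens than either account type is allowed to send, which is the situation that produced the intermittent errors.
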
Two complementary fixes:
1. Lower static Copilot Claude model ContextLength from 200000 to 128000
(the conservative default matching defaultCopilotContextLength). Dynamic
API limits override this when available.
2. Add context_length and max_completion_tokens to Claude-format model
responses so Claude Code CLI can learn the actual Copilot limit instead
of relying on its built-in 1M context configuration.
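
As a rough sketch of fix 2, a Claude-format model entry carrying the two new fields could be serialized like this. Only the JSON keys context_length and max_completion_tokens come from this commit; the claudeModel struct, the other fields, the model ID, and the 16000 completion value are placeholders assumed for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// claudeModel is a hypothetical shape for one entry in a Claude-format
// model list. Only the last two JSON keys are taken from this commit.
type claudeModel struct {
	ID                  string `json:"id"`
	Type                string `json:"type"`
	ContextLength       int    `json:"context_length,omitempty"`
	MaxCompletionTokens int    `json:"max_completion_tokens,omitempty"`
}

// encodeClaudeModel serializes a model entry to JSON. Zero-valued limits
// are omitted so clients only see limits that are actually known.
func encodeClaudeModel(m claudeModel) (string, error) {
	b, err := json.Marshal(m)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	out, err := encodeClaudeModel(claudeModel{
		ID:                  "example-claude-model", // placeholder ID
		Type:                "model",
		ContextLength:       128000, // static Copilot fallback from fix 1
		MaxCompletionTokens: 16000,  // illustrative value only
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(out)
}
```

A client that reads these fields can size its prompts to the advertised limit rather than to a larger built-in default.
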