feat(config): allow context length for OpenAI-compatible models by EricLi404 · Pull Request #3794 · router-for-me/CLIProxyAPI

EricLi404 · 2026-06-10T07:18:01Z

Summary

Add an optional context-length field to each entry in openai-compatibility[].models[], allowing proxy operators to override the advertised context window for OpenAI-compatible models.

Motivation

Some large-context models (e.g. DeepSeek V4 Pro, MiniMax-M3 1M, Mimo V2.5 Pro) support up to 1,000,000 tokens of context, but their upstream /v1/models endpoint may not advertise this. A proxy operator may also want to set the value explicitly, regardless of upstream metadata.

Without this field, Codex and other clients that rely on /v1/models to determine model capabilities see an incorrect or default context_window, which can cause:

Truncation warnings at thresholds far below the model's actual capacity
Unnecessary context compaction when the model could handle more
Misleading model selection where users avoid models that appear to have smaller windows

Changes

internal/config/config.go: add ContextLength int field to OpenAICompatibilityModel with yaml tag context-length
sdk/cliproxy/service.go: propagate configured context length into registered model metadata; clamp negative values to 0
internal/watcher/diff/model_hash.go: include context length in model hash so hot reloads detect metadata-only changes
sdk/cliproxy/openai_compat_models_test.go: unit tests for context length propagation and negative value clamping
internal/watcher/diff/model_hash_test.go: unit test for hash sensitivity to context length changes
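The hash change can be illustrated with a minimal sketch. It assumes the hash is a SHA-256 digest over a JSON encoding of the model list; the actual implementation in model_hash.go may differ, and compatModel and hashModels are hypothetical names used only for illustration.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"encoding/json"
	"fmt"
)

// compatModel is a hypothetical stand-in for the configured model entry.
type compatModel struct {
	Name          string `json:"name"`
	ContextLength int    `json:"context-length,omitempty"`
}

// hashModels returns a hex SHA-256 digest over the JSON encoding of models.
// Because ContextLength is part of the encoded payload, a change to that
// field alone produces a different digest.
func hashModels(models []compatModel) string {
	data, err := json.Marshal(models)
	if err != nil {
		return ""
	}
	sum := sha256.Sum256(data)
	return hex.EncodeToString(sum[:])
}

func main() {
	before := []compatModel{{Name: "deepseek-v4-pro"}}
	after := []compatModel{{Name: "deepseek-v4-pro", ContextLength: 1000000}}
	// A metadata-only edit should be visible to the reload logic.
	fmt.Println("hash changed:", hashModels(before) != hashModels(after))
}
```

If the field were left out of the hashed payload, editing only context-length in the config would yield the same digest and a hot reload would likely treat the model list as unchanged.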

Configuration Example

openai-compatibility:
  - name: "deepseek"
    base-url: "https://api.deepseek.com/v1"
    api-key-entries:
      - api-key: "sk-..."
    models:
      - name: "deepseek-v4-pro"
        context-length: 1000000  # advertise 1M context window
      - name: "deepseek-chat"
        # omit context-length to use upstream default

Backward Compatibility

context-length is optional and defaults to 0, in which case it is omitted from JSON output. Existing configurations without this field behave exactly as before.
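A small sketch of the "omitted when zero" behavior, assuming the field carries an omitempty JSON tag (modelEntry and encode are hypothetical names; the real struct is OpenAICompatibilityModel and has more fields):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// modelEntry is a hypothetical, reduced version of the config struct.
type modelEntry struct {
	Name          string `json:"name"`
	ContextLength int    `json:"context-length,omitempty"`
}

// encode marshals one entry; with omitempty, a zero ContextLength
// does not appear in the output at all.
func encode(m modelEntry) string {
	out, err := json.Marshal(m)
	if err != nil {
		return ""
	}
	return string(out)
}

func main() {
	fmt.Println(encode(modelEntry{Name: "deepseek-chat"}))
	// {"name":"deepseek-chat"}
	fmt.Println(encode(modelEntry{Name: "deepseek-v4-pro", ContextLength: 1000000}))
	// {"name":"deepseek-v4-pro","context-length":1000000}
}
```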

Test Plan

GOWORK=off go test ./internal/watcher/diff ./sdk/cliproxy -run 'TestComputeOpenAICompatModelsHash|TestBuildOpenAICompatibilityConfigModelsIncludesContextLength' -count=1
GOWORK=off go build -o /tmp/cliproxyapi-pr-build ./cmd/server

github-actions · 2026-06-10T07:18:11Z

This pull request targeted main.

The base branch has been automatically changed to dev.

gemini-code-assist

Code Review

This pull request introduces a ContextLength configuration option to override the advertised model context window for OpenAI compatibility models. This field is integrated into the model hashing mechanism, configuration mapping, and verified with unit tests. Feedback suggests validating and clamping negative ContextLength values to zero to prevent downstream issues.
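The clamping suggested here could look roughly like the following. normalizeContextLength is a hypothetical helper name; where the PR actually applies the clamp inside service.go is not shown in this thread.

```go
package main

import "fmt"

// normalizeContextLength maps negative configured values to 0, which the
// rest of the pipeline can treat as "no override". Non-negative values
// pass through unchanged.
func normalizeContextLength(n int) int {
	if n < 0 {
		return 0
	}
	return n
}

func main() {
	for _, n := range []int{-1, 0, 1000000} {
		fmt.Println(n, "->", normalizeContextLength(n))
	}
}
```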

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 329e758e01

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 610dc37785

EricLi404 · 2026-06-11T06:26:35Z

/gemini Please review the latest head commit b1427c7. The PR now exposes context-length in both the Codex client catalog and the standard OpenAI-compatible /v1/models response, with tests covering both paths.

gemini-code-assist

Code Review

This pull request introduces support for overriding the advertised model context window (context-length) in OpenAI compatibility configurations, updating the configuration schema, hashing logic, API response filtering, and model building processes, along with adding corresponding unit tests. Feedback highlights a backward compatibility issue and an overwrite bug in applyCodexClientContextWindowOverride and applyCodexClientModelMetadata where user-configured overrides could be ignored or template defaults could be unintentionally overwritten.
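The precedence this feedback asks for can be sketched as below, under the assumption that a positive configured value should win and that a missing (zero) value should leave the Codex template default alone. resolveContextWindow is a hypothetical function, not the signature of applyCodexClientContextWindowOverride.

```go
package main

import "fmt"

// resolveContextWindow returns the context window to advertise.
// A positive override replaces the template default; a zero or negative
// override means "not configured", so the template default is preserved.
func resolveContextWindow(templateDefault, override int) int {
	if override > 0 {
		return override
	}
	return templateDefault
}

func main() {
	fmt.Println(resolveContextWindow(272000, 1000000)) // override wins
	fmt.Println(resolveContextWindow(272000, 0))       // template default kept
}
```

The template default of 272000 is an arbitrary illustrative number, not a value taken from the repository.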

EricLi404 · 2026-06-11T14:38:28Z

/gemini Please review the latest head commit 95116e0. The PR history was rewritten only to normalize commit author/committer identity; the code diff is unchanged from the prior head.

EricLi404 · 2026-06-11T14:38:28Z

@codex review latest head commit 95116e0. The PR history was rewritten only to normalize commit author/committer identity; the code diff is unchanged from the prior head.

gemini-code-assist

Code Review

This pull request introduces support for overriding the advertised context window (context length) for OpenAI compatibility models. It updates configuration structures, hashing logic, model metadata builders, and API handlers to support and expose this new property, backed by comprehensive unit tests. The review feedback suggests two key improvements: first, to only include context_length in the /v1/models response if it is strictly greater than zero to avoid misleading client applications; second, to simplify the signature of applyCodexClientContextWindowOverride by removing the unused id parameter.
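The first suggestion, emitting context_length only for positive values, might look like this in a response builder. buildModelEntry and the map-based entry shape are assumptions for illustration; the real handler in openai_handlers.go may build its response differently.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// buildModelEntry assembles one /v1/models list item. The context_length
// key is added only when the value is strictly positive, so clients never
// see a misleading "context_length": 0.
func buildModelEntry(id string, contextLength int) map[string]any {
	entry := map[string]any{
		"id":     id,
		"object": "model",
	}
	if contextLength > 0 {
		entry["context_length"] = contextLength
	}
	return entry
}

func main() {
	for _, n := range []int{0, 1000000} {
		out, err := json.Marshal(buildModelEntry("deepseek-v4-pro", n))
		if err != nil {
			fmt.Println("marshal error:", err)
			continue
		}
		fmt.Println(string(out))
	}
}
```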

chatgpt-codex-connector · 2026-06-11T14:43:22Z

Codex Review: Didn't find any major issues. You're on a roll.

EricLi404 · 2026-06-12T10:30:56Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a new context-length configuration option for OpenAI compatibility models, allowing users to override the advertised context window. The changes propagate this setting through the configuration, hashing, and API response handlers, and include clamping logic for negative values. Comprehensive unit tests have been added to verify the new behavior. There are no review comments, and I have no additional feedback to provide.

feat(config): allow context length for openai compat models

329e758

github-actions Bot changed the base branch from main to dev June 10, 2026 07:18

gemini-code-assist Bot reviewed Jun 10, 2026

Comment thread sdk/cliproxy/service.go

chatgpt-codex-connector Bot reviewed Jun 10, 2026

Comment thread internal/config/config.go Outdated

EricLi404 added 4 commits June 10, 2026 15:22

fix(config): clamp negative context length overrides

2e9e227

docs(config): add context-length example to openai-compat models section

48f9549

fix(config): align context-length JSON tag to kebab-case

4982613

Match the management API convention where all config keys are kebab-case (base-url, api-key-entries). Using snake_case here would silently ignore 'context-length' in JSON payloads.
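The silent-ignore behavior described in this commit message follows from how encoding/json matches keys to struct tags. A minimal demonstration, with hypothetical struct and function names:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// snakeTagged mimics the earlier, mismatched JSON tag.
type snakeTagged struct {
	ContextLength int `json:"context_length"`
}

// kebabTagged mimics the corrected tag that matches the config key.
type kebabTagged struct {
	ContextLength int `json:"context-length"`
}

// decodeSnake parses payload into the snake_case-tagged struct.
func decodeSnake(payload string) (int, error) {
	var m snakeTagged
	err := json.Unmarshal([]byte(payload), &m)
	return m.ContextLength, err
}

// decodeKebab parses payload into the kebab-case-tagged struct.
func decodeKebab(payload string) (int, error) {
	var m kebabTagged
	err := json.Unmarshal([]byte(payload), &m)
	return m.ContextLength, err
}

func main() {
	payload := `{"context-length": 1000000}`
	snake, errSnake := decodeSnake(payload)
	kebab, errKebab := decodeKebab(payload)
	fmt.Println(snake, errSnake) // 0 <nil>: unknown key dropped without error
	fmt.Println(kebab, errKebab) // 1000000 <nil>
}
```

Unknown keys are discarded without an error by default, which is why the mismatch would go unnoticed unless a test asserts on the decoded value.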

Fix Codex template context override

aa0ed0f

chatgpt-codex-connector Bot reviewed Jun 11, 2026

Comment thread sdk/cliproxy/service.go

Expose context length in models response

185a2df

gemini-code-assist Bot reviewed Jun 11, 2026

Comment thread sdk/api/handlers/openai/codex_client_models.go Outdated

Preserve Codex template context defaults

95116e0

sususu98 force-pushed the dev branch from 80d61a3 to ac4017e Compare June 11, 2026 13:49

EricLi404 force-pushed the ericli404/openai-compat-context-length branch from 84182e4 to 95116e0 Compare June 11, 2026 14:36

gemini-code-assist Bot reviewed Jun 11, 2026

Comment thread sdk/api/handlers/openai/openai_handlers.go Outdated

Comment thread sdk/api/handlers/openai/codex_client_models.go Outdated

Comment thread sdk/api/handlers/openai/codex_client_models.go Outdated

Address model context review

6f9fdcd

Only advertise context_length in the standard models response when the parsed value is positive. Also removes the unused Codex context helper parameter and adds coverage for omitted zero values.

gemini-code-assist Bot reviewed Jun 12, 2026
