2 changes: 1 addition & 1 deletion dist/all.json

Large diffs are not rendered by default.

4 changes: 2 additions & 2 deletions dist/custom-provider.json
@@ -1020,7 +1020,7 @@
"cache_write": 0
},
"limit": {
-"context": 204800,
+"context": 200000,
"output": 131072
},
"metadata": {
@@ -1089,7 +1089,7 @@
"cache_write": 0
},
"limit": {
-"context": 204800,
+"context": 200000,
"output": 131072
},
"metadata": {
4 changes: 2 additions & 2 deletions manual-templates/custom-provider-overrides.json
@@ -843,7 +843,7 @@
"id": "glm-5.1",
"name": "GLM-5.1",
"family": "glm",
-"contextLength": 204800,
+"contextLength": 200000,

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

🧩 Analysis chain

🌐 Web query:

What is the maximum context length for Zhipu glm-5.1 and glm-5 models?

💡 Result:

As of June 2, 2026, both the Zhipu GLM-5 and GLM-5.1 models support a context window of 200,000 (200K) tokens [1][2][3]. While some technical documentation or specific API implementations (such as the NVIDIA NIM reference for GLM-5.1) may list 131,072 tokens for specific integration environments [4], the models' native architecture is designed for and widely recognized as supporting a 200K-token context window [5][1][6][3]. This capacity is supported by the models' use of Sparse Attention mechanisms (specifically DeepSeek Sparse Attention) to maintain coherence across extended sequences, which is a key feature for their intended use in long-horizon agentic and engineering tasks [1][6][3].

Citations:


🌐 Web query:

What is the maximum context length for Zhipu glm-5v-turbo and glm-4.7 models?

💡 Result:

Both the Zhipu GLM-4.7 and GLM-5V-Turbo models support a maximum context window of approximately 203,000 tokens (often cited in documentation as 200K tokens) [1][2][3][4][5][6]. Specifically:

  • GLM-4.7: Features a context window of 203,000 tokens [2][3].
  • GLM-5V-Turbo: Features a context window of 202,752 tokens [7][6].

For both models, the maximum output token limit is generally 128,000 to 131,072 tokens [3][8][7][5][6].

Citations:


Align contextLength values with Zhipu platform specs.

  • manual-templates/custom-provider-overrides.json: glm-5.1 (line 846) and glm-5 (line 878) contextLength: 200000 matches the current Zhipu docs.
  • glm-5v-turbo (line 910) and glm-4.7 (line 942) should not remain 204800; Zhipu docs list ~202,752 for glm-5v-turbo and ~203,000 for glm-4.7—update these values or document why the override intentionally differs.
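
The figures quoted in this thread differ partly because "K" is sometimes read as 1024 and sometimes as 1000. The sketch below only illustrates that arithmetic. The `describe` helper and the labels are hypothetical and not part of this repository, and reading each value as a "binary" or "decimal" K is an interpretation, not something the Zhipu docs state.

```python
# Values copied from this review thread; labels are illustrative only.
CANDIDATES = {
    "previous override": 204800,
    "new override (glm-5, glm-5.1)": 200000,
    "glm-5v-turbo (per query result)": 202752,
    "glm-4.7 (per query result, approximate)": 203000,
    "maxTokens": 131072,
}


def describe(tokens: int) -> str:
    """Express a token count as a multiple of 1024 or of 1000, if it is one."""
    if tokens % 1024 == 0:
        return f"{tokens} = {tokens // 1024} x 1024"
    if tokens % 1000 == 0:
        return f"{tokens} = {tokens // 1000} x 1000"
    return f"{tokens} (not a round multiple of 1000 or 1024)"


if __name__ == "__main__":
    for label, value in CANDIDATES.items():
        print(f"{label}: {describe(value)}")
```

Under this reading, 204800 and 200000 are both "200K" in different units, which may explain how the original value ended up in the overrides.
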
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@manual-templates/custom-provider-overrides.json` at line 846, The
contextLength entries in manual-templates/custom-provider-overrides.json for
model overrides are inconsistent with Zhipu specs: adjust the "contextLength"
values for the glm-5v-turbo and glm-4.7 overrides (currently 204800) to match
Zhipu's documented values (~202752 for glm-5v-turbo and ~203000 for glm-4.7) or
add an inline comment/README note near those model override objects explaining
why the override intentionally differs; update the JSON entries for the model
keys "glm-5v-turbo" and "glm-4.7" (and verify "glm-5.1" and "glm-5" remain at
200000) so the file consistently reflects the platform specs.
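
One way to carry out the "validate" step is a small check that compares `contextLength` values in the overrides file against an expected table. This is a minimal sketch, not the project's tooling: `EXPECTED_CONTEXT`, `iter_models`, and `find_mismatches` are hypothetical names, the expected values are simply the ones agreed in this thread, and the file's overall layout is assumed unknown, so the walk is recursive.

```python
import json

# Expected values taken from this review thread (an assumption, not an official spec).
EXPECTED_CONTEXT = {"glm-5.1": 200000, "glm-5": 200000}


def iter_models(node):
    """Yield every dict that has both 'id' and 'contextLength', at any depth."""
    if isinstance(node, dict):
        if "id" in node and "contextLength" in node:
            yield node
        for value in node.values():
            yield from iter_models(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_models(item)


def find_mismatches(data, expected=EXPECTED_CONTEXT):
    """Return (id, actual, expected) for each model whose contextLength differs."""
    mismatches = []
    for model in iter_models(data):
        want = expected.get(model["id"])
        if want is not None and model["contextLength"] != want:
            mismatches.append((model["id"], model["contextLength"], want))
    return mismatches


if __name__ == "__main__":
    with open("manual-templates/custom-provider-overrides.json", encoding="utf-8") as fh:
        for model_id, actual, want in find_mismatches(json.load(fh)):
            print(f"{model_id}: contextLength is {actual}, expected {want}")
```

Models missing from the expected table are skipped rather than flagged, so entries such as glm-5v-turbo and glm-4.7 can be added once their intended values are settled.
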

"maxTokens": 131072,
"vision": false,
"functionCall": true,
@@ -875,7 +875,7 @@
"id": "glm-5",
"name": "GLM-5",
"family": "glm",
-"contextLength": 204800,
+"contextLength": 200000,
"maxTokens": 131072,
"vision": false,
"functionCall": true,