Commit 3a68b78

codebam and claude committed
fix: raise output token budget to 65536, drop undersized models
8K still truncated kimi-k2.6 mid-reasoning. Bump to 65536 (Gemini's hard cap, which all retained CF native models accept since they have ≥128K context). Drop hermes (8K context) and llama-3.3-70b-fp8-fast (24K context) from AVAILABLE_MODELS since they would 400 with this budget.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent b64a246 commit 3a68b78

2 files changed

Lines changed: 1 addition & 7 deletions

Submodule bot updated 1 file

packages/shared/src/index.ts

Lines changed: 0 additions & 6 deletions
@@ -289,7 +289,6 @@ export const AVAILABLE_MODELS: Record<
   string,
   { id: string; cost: number; supportsTools?: boolean; supportsVision?: boolean }
 > = {
-  hermes: { id: '@hf/nousresearch/hermes-2-pro-mistral-7b', cost: 5, supportsTools: true },
   gemma4: { id: '@cf/google/gemma-4-26b-a4b-it', cost: 10, supportsTools: true, supportsVision: true },
   'google/gemini-3-flash': { id: 'google/gemini-3-flash', cost: 15, supportsTools: true, supportsVision: true },
   'google/gemini-3.1-flash-lite': {
@@ -307,11 +306,6 @@ export const AVAILABLE_MODELS: Record<
   },
   'kimi-k2.6': { id: '@cf/moonshotai/kimi-k2.6', cost: 40, supportsTools: true, supportsVision: true },
   'glm-4.7-flash': { id: '@cf/zai-org/glm-4.7-flash', cost: 10, supportsTools: true, supportsVision: true },
-  'llama-3.3-70b': {
-    id: '@cf/meta/llama-3.3-70b-instruct-fp8-fast',
-    cost: 40,
-    supportsTools: true
-  },
   'deepseek-r1-32b': {
     id: '@cf/deepseek-ai/deepseek-r1-distill-qwen-32b',
     cost: 60,
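
The rule the commit message applies (drop any model whose context window is smaller than the output token budget) could be sketched roughly as below. This is only an illustration, not code from the repository: `MAX_OUTPUT_TOKENS`, `contextWindows`, and `modelsTooSmallFor` are hypothetical names, the 8K and 24K windows are taken from the commit message and assumed to mean 8 * 1024 and 24 * 1024 tokens, and 128K is used as the stated lower bound for the retained models.

```typescript
// Hypothetical constant: the commit sets the budget to 65536, but this name is assumed.
const MAX_OUTPUT_TOKENS = 65536;

// Context windows in tokens (assumed values, see the note above).
// The two removed models use the sizes quoted in the commit message;
// the retained ones use 128K, the lower bound the message states.
const contextWindows: Record<string, number> = {
  hermes: 8 * 1024,
  'llama-3.3-70b': 24 * 1024,
  'kimi-k2.6': 128 * 1024,
  'glm-4.7-flash': 128 * 1024,
};

// Return the names of models whose context window cannot hold the
// requested output budget, sorted alphabetically.
function modelsTooSmallFor(budget: number, windows: Record<string, number>): string[] {
  return Object.entries(windows)
    .filter(([, contextTokens]) => contextTokens < budget)
    .map(([name]) => name)
    .sort();
}

const undersized = modelsTooSmallFor(MAX_OUTPUT_TOKENS, contextWindows);
console.log(undersized);
```

Under these assumptions the check flags exactly the two entries removed by the diff. A check of this kind could also run at startup to catch a future budget bump that outgrows a listed model, though the commit itself handles it by editing the table by hand.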
