Commit 3a68b78
fix: raise output token budget to 65536, drop undersized models
An 8K output budget still truncated kimi-k2.6 mid-reasoning. Bump it to 65536
(Gemini's hard cap, which all retained CF native models also accept since they
have ≥128K context). Drop hermes (8K context) and llama-3.3-70b-fp8-fast (24K
context) from AVAILABLE_MODELS, since they would return HTTP 400 with this budget.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent b64a246 commit 3a68b78
2 files changed
Lines changed: 1 addition & 7 deletions
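
The rule in the commit message (keep only models that can accept the new output budget) can be sketched as below. This is a minimal illustration, not the repository's code: `CANDIDATE_MODELS`, `models_accepting_budget`, and the `example-retained-model` entry are hypothetical names, the context sizes are rounded from the figures in the message, and comparing the context window directly against the output budget is a simplifying assumption about how the provider validates requests.

```python
# Output token budget taken from the commit message.
MAX_OUTPUT_TOKENS = 65536

# Hypothetical registry mapping model name -> context window in tokens.
# The 8K and 24K figures come from the commit message (treated here as
# multiples of 1024, which is an assumption); the retained entry is an
# invented stand-in for any model with at least 128K context.
CANDIDATE_MODELS = {
    "example-retained-model": 128 * 1024,
    "hermes": 8 * 1024,
    "llama-3.3-70b-fp8-fast": 24 * 1024,
}


def models_accepting_budget(models: dict[str, int], max_output_tokens: int) -> list[str]:
    """Return the names of models whose context window can hold the output budget.

    Simplification: a real request also spends context on the prompt, so a
    production check would likely need headroom beyond the output budget.
    """
    return [name for name, context in models.items() if context >= max_output_tokens]


if __name__ == "__main__":
    print(models_accepting_budget(CANDIDATE_MODELS, MAX_OUTPUT_TOKENS))
```

Deriving the list from context sizes, rather than editing it by hand, would make a future budget change drop undersized models automatically; whether that fits this codebase is not something the commit shows.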
[Diff body not captured. The surviving line numbers show two hunks: one line removed at original line 292 (context 289-295) and five lines removed at original lines 310-314 (context 307-317).]