You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add real Cerebras provider with OpenPaths fallback
Add a direct Cerebras provider (CEREBRAS_API_KEY, https://api.cerebras.ai)
instead of routing Cerebras models through OpenRouter. The cerebras/* models
(gpt-oss-120b, zai-glm-4.7) auto-detect to a direct Cerebras key when present
and otherwise fall back to OpenPaths (OPENPATHS_API_KEY), which also serves the
Cerebras-hosted open-weight models.
- model-provider-info: add Cerebras provider, base-url override (CEREBRAS_BASE_URL),
prefix normalization, and env-backed auto-detection with OpenPaths fallback
- models.json: rename slugs openpaths/* -> cerebras/*
- README: document provider env-var auto-detection
- scripts/e2e_cerebras_openpaths.sh + .env.example: e2e smoke test via OpenPaths
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: README.md
+28Lines changed: 28 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -54,6 +54,34 @@ export OPENAI_API_KEY=sk-...
54
54
codex-infinity "your prompt"
55
55
```
56
56
57
+
### Model providers (auto-detected)
58
+
59
+
`codex-infinity` auto-detects which provider to use from the model slug and the API keys present in your environment — no `config.toml` edits required. Export a key and select a matching model with `-m`:
| Local (OSS) | — (`--oss`) | LM Studio / Ollama models |
72
+
73
+
**Cerebras** runs the fast open-weight coding models (`gpt-oss-120b`, `zai-glm-4.7`). A `cerebras/*` model prefers a direct Cerebras key (`CEREBRAS_API_KEY`, `https://api.cerebras.ai`) and otherwise falls back to **OpenPaths** ([openpaths.io](https://openpaths.io)), a router that also serves the Cerebras-hosted models — so a single `OPENPATHS_API_KEY` is enough to reach them. Override the endpoints with `CEREBRAS_BASE_URL` / `OPENPATHS_BASE_URL` if needed.
74
+
75
+
```shell
76
+
# Direct Cerebras
77
+
export CEREBRAS_API_KEY=csk-...
78
+
codex-infinity -m cerebras/gpt-oss-120b "refactor this module"
79
+
80
+
# Or via OpenPaths (also serves Cerebras models)
81
+
export OPENPATHS_API_KEY=op-...
82
+
codex-infinity -m cerebras/zai-glm-4.7 "explain this bug"
Copy file name to clipboardExpand all lines: codex-rs/models-manager/models.json
+94Lines changed: 94 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -588,6 +588,100 @@
588
588
"supports_search_tool": true,
589
589
"additional_speed_tiers": [],
590
590
"supports_reasoning_summaries": true
591
+
},
592
+
{
593
+
"slug": "cerebras/gpt-oss-120b",
594
+
"display_name": "GPT-OSS 120B (Cerebras)",
595
+
"description": "OpenAI gpt-oss-120b on Cerebras (~3000 tok/s open-weight coding model). Auto-routes to a direct Cerebras key (CEREBRAS_API_KEY) or falls back to OpenPaths (OPENPATHS_API_KEY).",
596
+
"default_reasoning_level": "medium",
597
+
"supported_reasoning_levels": [
598
+
{
599
+
"effort": "low",
600
+
"description": "Fast responses with lighter reasoning"
601
+
},
602
+
{
603
+
"effort": "medium",
604
+
"description": "Balances speed and reasoning depth"
605
+
},
606
+
{
607
+
"effort": "high",
608
+
"description": "Greater reasoning depth for complex problems"
609
+
}
610
+
],
611
+
"shell_type": "shell_command",
612
+
"visibility": "list",
613
+
"supported_in_api": true,
614
+
"priority": 42,
615
+
"upgrade": null,
616
+
"base_instructions": "You are Codex, a coding agent. Follow the user's instructions and use available tools to complete software engineering tasks.",
617
+
"supports_reasoning_summaries": false,
618
+
"support_verbosity": false,
619
+
"default_verbosity": null,
620
+
"apply_patch_tool_type": "freeform",
621
+
"web_search_tool_type": "text",
622
+
"truncation_policy": {
623
+
"mode": "tokens",
624
+
"limit": 10000
625
+
},
626
+
"supports_parallel_tool_calls": true,
627
+
"supports_image_detail_original": false,
628
+
"context_window": 131072,
629
+
"max_context_window": 131072,
630
+
"auto_compact_token_limit": null,
631
+
"experimental_supported_tools": [],
632
+
"input_modalities": [
633
+
"text"
634
+
],
635
+
"availability_nux": null,
636
+
"additional_speed_tiers": [],
637
+
"supports_search_tool": false
638
+
},
639
+
{
640
+
"slug": "cerebras/zai-glm-4.7",
641
+
"display_name": "GLM-4.7 (Cerebras)",
642
+
"description": "Zhipu GLM-4.7 on Cerebras — strong reasoning and coding. Auto-routes to a direct Cerebras key (CEREBRAS_API_KEY) or falls back to OpenPaths (OPENPATHS_API_KEY).",
643
+
"default_reasoning_level": "medium",
644
+
"supported_reasoning_levels": [
645
+
{
646
+
"effort": "low",
647
+
"description": "Fast responses with lighter reasoning"
648
+
},
649
+
{
650
+
"effort": "medium",
651
+
"description": "Balances speed and reasoning depth"
652
+
},
653
+
{
654
+
"effort": "high",
655
+
"description": "Greater reasoning depth for complex problems"
656
+
}
657
+
],
658
+
"shell_type": "shell_command",
659
+
"visibility": "list",
660
+
"supported_in_api": true,
661
+
"priority": 43,
662
+
"upgrade": null,
663
+
"base_instructions": "You are Codex, a coding agent. Follow the user's instructions and use available tools to complete software engineering tasks.",
0 commit comments