You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OrcaRouter (https://www.orcarouter.ai) is an OpenAI-compatible API gateway
that aggregates ~120 chat models from OpenAI, Anthropic, Google, DeepSeek,
xAI, Qwen, Kimi, MiniMax, Z-AI and others behind a single sk-orca- key. It
also exposes an orcarouter/auto virtual model with configurable adaptive
routing (cheapest / balanced / quality / contextual bandit / difficulty-gated).
Provider class extends OpenAI and injects attribution headers (HTTP-Referer,
X-Title, User-Agent, X-Continue-Provider) and reuses the existing Anthropic
cache_control pass-through for Claude models. Also registers orcarouter in
the AI SDK provider map for use via `provider: ai-sdk`.
Follows the same patterns as CometAPI (#7809), ClawRouter (#11751), and
Tensorix (#11184).
Disclosure: I am an engineer on the OrcaRouter team.
Copy file name to clipboardExpand all lines: extensions/vscode/config_schema.json
+2Lines changed: 2 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -216,6 +216,7 @@
216
216
"msty",
217
217
"watsonx",
218
218
"openrouter",
219
+
"orcarouter",
219
220
"clawrouter",
220
221
"sambanova",
221
222
"nvidia",
@@ -269,6 +270,7 @@
269
270
"### Msty\nMsty is the simplest way to get started with online or local LLMs on all desktop platforms - Windows, Mac, and Linux. No fussing around, one-click and you are up and running. To get started, follow these steps:\n1. Download from [Msty.app](https://msty.app/), open the application, and click 'Setup Local AI'.\n2. Go to the Local AI Module page and download a model of your choice.\n3. Once the model has finished downloading, you can start asking questions through Continue.\n> [Reference](https://continue.dev/docs/reference/Model%20Providers/Msty)",
270
271
"### IBM watsonx\nwatsonx, developed by IBM, offers a variety of pre-trained AI foundation models that can be used for natural language processing (NLP), computer vision, and speech recognition tasks.",
271
272
"### OpenRouter\nOpenRouter offers a single API to access almost any language model. To get started, obtain an API key from [their console](https://openrouter.ai/settings/keys).",
273
+
"### OrcaRouter\nOrcaRouter is an OpenAI-compatible API gateway that aggregates ~120 chat models from OpenAI, Anthropic, Google, DeepSeek, xAI, Qwen, Kimi, MiniMax, Z-AI, and others behind a single `sk-orca-` key. It also exposes an `orcarouter/auto` virtual model with configurable adaptive routing (cheapest / balanced / quality / contextual bandit / difficulty-gated).\nTo get started, sign up at [orcarouter.ai](https://www.orcarouter.ai) and obtain an API key from your [console](https://www.orcarouter.ai/console).\n> [Reference](https://docs.orcarouter.ai)",
272
274
"### ClawRouter\nClawRouter is an open-source LLM router that automatically selects the cheapest capable model for each request based on prompt complexity, providing 78-96% cost savings. To get started, run `npx clawrouter` to start the router at localhost:1337. A wallet is auto-generated on first run - fund it with USDC (Solana/Base) to access premium models, or use `blockrun/free` tier without payment.\n> [Reference](https://github.com/BlockRunAI/ClawRouter)",
273
275
"### SambaNova\n SambaNova provides fast inference of open-source language models with zero data retention. To get started, obtain an API key in [SambaNova Cloud](https://cloud.sambanova.ai/apis?utm_source=continue&utm_medium=external&utm_campaign=cloud_signup ).",
274
276
"### NVIDIA NIMs\nNVIDIA offers a single API to access almost any language model. To find out more, visit the [LLM APIs Documentation](https://docs.api.nvidia.com/nim/reference/llm-apis).\nFor information specific to getting a key, please check out the [docs here](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#option-1-from-api-catalog)",
"Adaptive routing across upstream models with configurable strategy (cheapest / balanced / quality / contextual bandit / difficulty-gated). Routing pools and weights are tunable from the OrcaRouter console.",
2810
+
params: {
2811
+
title: "OrcaRouter Auto",
2812
+
model: "orcarouter/auto",
2813
+
contextLength: 128_000,
2814
+
},
2815
+
icon: "orcarouter.png",
2816
+
providerOptions: ["orcarouter"],
2817
+
isOpenSource: false,
2818
+
},
2819
+
orcarouterGpt55: {
2820
+
title: "OpenAI: GPT-5.5",
2821
+
description: "OpenAI GPT-5.5 routed through OrcaRouter.",
2822
+
params: {
2823
+
title: "OpenAI: GPT-5.5",
2824
+
model: "openai/gpt-5.5",
2825
+
contextLength: 400_000,
2826
+
},
2827
+
icon: "orcarouter.png",
2828
+
providerOptions: ["orcarouter"],
2829
+
isOpenSource: false,
2830
+
},
2831
+
orcarouterClaudeOpus47: {
2832
+
title: "Anthropic: Claude Opus 4.7",
2833
+
description:
2834
+
"Anthropic Claude Opus 4.7 routed through OrcaRouter. Reasoning model — set `requestOptions.extraBodyProperties.thinking` to control thinking budget.",
2835
+
params: {
2836
+
title: "Anthropic: Claude Opus 4.7",
2837
+
model: "anthropic/claude-opus-4.7",
2838
+
contextLength: 200_000,
2839
+
},
2840
+
icon: "orcarouter.png",
2841
+
providerOptions: ["orcarouter"],
2842
+
isOpenSource: false,
2843
+
},
2844
+
orcarouterGemini3Flash: {
2845
+
title: "Google: Gemini 3 Flash Preview",
2846
+
description:
2847
+
"Google Gemini 3 Flash Preview routed through OrcaRouter. Reasoning model — chat may appear blank for several seconds while the model reasons. Set `reasoning_effort: 'minimal'` in `requestOptions.extraBodyProperties` for fast responses.",
2848
+
params: {
2849
+
title: "Google: Gemini 3 Flash",
2850
+
model: "google/gemini-3-flash-preview",
2851
+
contextLength: 1_000_000,
2852
+
},
2853
+
icon: "orcarouter.png",
2854
+
providerOptions: ["orcarouter"],
2855
+
isOpenSource: false,
2856
+
},
2857
+
orcarouterDeepseekV4Pro: {
2858
+
title: "DeepSeek: DeepSeek V4 Pro",
2859
+
description: "DeepSeek V4 Pro routed through OrcaRouter.",
2860
+
params: {
2861
+
title: "DeepSeek: DeepSeek V4 Pro",
2862
+
model: "deepseek/deepseek-v4-pro",
2863
+
contextLength: 128_000,
2864
+
},
2865
+
icon: "orcarouter.png",
2866
+
providerOptions: ["orcarouter"],
2867
+
isOpenSource: true,
2868
+
},
2869
+
orcarouterGrok43: {
2870
+
title: "xAI: Grok 4.3",
2871
+
description: "xAI Grok 4.3 routed through OrcaRouter.",
2872
+
params: {
2873
+
title: "xAI: Grok 4.3",
2874
+
model: "grok/grok-4.3",
2875
+
contextLength: 256_000,
2876
+
},
2877
+
icon: "orcarouter.png",
2878
+
providerOptions: ["orcarouter"],
2879
+
isOpenSource: false,
2880
+
},
2881
+
orcarouterQwen36Flash: {
2882
+
title: "Alibaba: Qwen 3.6 Flash",
2883
+
description: "Alibaba Qwen 3.6 Flash routed through OrcaRouter.",
2884
+
params: {
2885
+
title: "Alibaba: Qwen 3.6 Flash",
2886
+
model: "qwen/qwen3.6-flash",
2887
+
contextLength: 128_000,
2888
+
},
2889
+
icon: "orcarouter.png",
2890
+
providerOptions: ["orcarouter"],
2891
+
isOpenSource: true,
2892
+
},
2893
+
orcarouterMinimaxM27: {
2894
+
title: "MiniMax: MiniMax M2.7",
2895
+
description: "MiniMax M2.7 routed through OrcaRouter.",
0 commit comments