[TEP-953] Tier / Priority-based model/provider routing with fallback retry | Priority-based routing and load balancing #953

@sfu2

Description

Version

v0.26

System Information

Ubuntu 22.04

Category

Other

Scenario

Keywords: Model/provider routing, load balancing, and failover

Tingly-Box currently uses round-robin routing to switch between models and providers. I would like to request priority-based routing, similar to the behavior in Claude Code Hub.

Each model/provider should be configurable with a priority. When handling a request, Tingly-Box should always choose the available model/provider with the highest priority. If the highest-priority option is unavailable, the system should automatically fall back to the next highest-priority available option.
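To make the selection rule concrete, here is a minimal sketch of "pick the highest-priority available option". The `Provider` record, the `pick_provider` function, and the convention that a larger number means higher priority are all assumptions for illustration; they are not Tingly-Box's actual configuration schema or API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Provider:
    # Hypothetical record; the real Tingly-Box config may look different.
    name: str
    priority: int           # assumption: larger number = higher priority
    available: bool = True   # assumption: availability is tracked per provider


def pick_provider(providers: list[Provider]) -> Optional[Provider]:
    """Return the available provider with the highest priority, or None."""
    candidates = [p for p in providers if p.available]
    if not candidates:
        return None
    return max(candidates, key=lambda p: p.priority)


providers = [
    Provider("primary", priority=100),
    Provider("secondary", priority=50),
    Provider("backup", priority=10),
]

first_choice = pick_provider(providers).name      # highest priority wins
providers[0].available = False                    # simulate an outage
fallback_choice = pick_provider(providers).name   # next highest available
```

With all three providers up, the request goes to `primary`; once `primary` is marked unavailable, the same call returns `secondary` without any other change.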

After a configurable timeout, for example X minutes, Tingly-Box should check whether a higher-priority model/provider has become available again. If so, routing should switch back to the higher-priority option.
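One possible way to get the "switch back after X minutes" behavior is a per-provider cooldown: a failed provider is skipped until its cooldown expires, and the next request after that tries it again. The sketch below is only an illustration under that assumption. `PriorityRouter`, `report_failure`, `cooldown_seconds`, and the injectable `clock` are hypothetical names, and an active background health check would be an equally valid design.

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Provider:
    # Hypothetical record, not Tingly-Box's real schema.
    name: str
    priority: int                     # assumption: larger number = higher priority
    retry_at: Optional[float] = None  # skip this provider until the clock reaches this


class PriorityRouter:
    def __init__(self, providers, cooldown_seconds: float,
                 clock: Callable[[], float] = time.monotonic):
        # Keep providers sorted from highest to lowest priority.
        self.providers = sorted(providers, key=lambda p: p.priority, reverse=True)
        self.cooldown_seconds = cooldown_seconds
        self.clock = clock

    def choose(self) -> Optional[Provider]:
        """Return the highest-priority provider that is not cooling down."""
        now = self.clock()
        for p in self.providers:
            if p.retry_at is None or now >= p.retry_at:
                return p
        return None

    def report_failure(self, provider: Provider) -> None:
        """Take a provider out of rotation until the cooldown has elapsed."""
        provider.retry_at = self.clock() + self.cooldown_seconds


# Usage with a fake clock so the timeline is explicit.
fake_now = [0.0]
router = PriorityRouter(
    [Provider("primary", 100), Provider("secondary", 50)],
    cooldown_seconds=300,               # the "X minutes" from the request
    clock=lambda: fake_now[0],
)
before_failure = router.choose().name   # primary is preferred
router.report_failure(router.choose())  # primary fails
during_cooldown = router.choose().name  # traffic falls back to secondary
fake_now[0] = 300.0                     # cooldown has elapsed
after_cooldown = router.choose().name   # primary is tried again
```

If the retried provider fails again, calling `report_failure` once more restarts its cooldown, so traffic returns to the fallback until the next retry window.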

In short, the gateway should always prefer the best available model/provider, fall back when necessary, and periodically retry higher-priority options.

Proposed Solution

No response

Metadata

Assignees

Labels

done, enhancement (New feature or request), feedback (Comes from user feedback)
