Skip to content

fix(vllm-router): allow using prefill-decode for subset of models (by checking labels) and add a fallback routing strategy#3

Merged
nejch merged 1 commit into
deployfrom
fix/allow-using-prefill-decode-for-subset-of-models-by-checking-labels-in-vllm-router
Jun 3, 2026
Merged

fix(vllm-router): allow using prefill-decode for subset of models (by checking labels) and add a fallback routing strategy#3
nejch merged 1 commit into
deployfrom
fix/allow-using-prefill-decode-for-subset-of-models-by-checking-labels-in-vllm-router