You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Oct 15, 2025. It is now read-only.
In templates/sample-application/httproutes.yaml we construct a reference to the inference pool name:
name: "{{ include "sampleApplication.sanitizedModelName" . }}-inference-pool"
However, the actual inference pool name has elsewhere been truncated to 63 characters because the name needs to be a valid DNS label. The net outcome is that the HTTPRoute object is unable to find the backend, so the the gateway returns cluster_not_found for any request.
Steps to reproduce
Deploy llm-d with a longish model name, e.g. using these overrides:
Component
I don't know
Describe the bug
In templates/sample-application/httproutes.yaml we construct a reference to the inference pool name:
However, the actual inference pool name has elsewhere been truncated to 63 characters because the name needs to be a valid DNS label. The net outcome is that the HTTPRoute object is unable to find the backend, so the the gateway returns cluster_not_found for any request.
Steps to reproduce
Additional context or screenshots
No response