Skip to content

[FEATURE]: Standardize decode worker component name across framework adapters #9319

@stevie-35

Description

@stevie-35

Feature request

Standardize the component name that decode workers register with in the /health endpoint across all framework adapters (dynamo.sglang, dynamo.trtllm, dynamo.vllm).

Currently, dynamo.sglang decode workers register as component: "backend", while dynamo.trtllm decode workers register as component: "tensorrt_llm". Prefill workers consistently use component: "prefill" across both frameworks.

A consistent component name (e.g. "backend" for all decode workers) would allow consumers of the /health endpoint to identify prefill vs decode workers without framework-specific logic.

Describe the problem you're encountering

When polling the Dynamo frontend /health endpoint to validate worker registration, the response uses different component values for decode workers depending on the framework:

SGLang decode worker response:

{"component": "backend", "endpoint": "generate", ...}

TRT-LLM decode worker response:

{"component": "tensorrt_llm", "endpoint": "generate", ...}

This means any orchestration or health-checking tool that needs to count registered decode workers must maintain a list of all possible component names per framework, rather than using a single generic filter.

Discovered while integrating dynamo.trtllm disaggregated serving into NVIDIA's Benchmarking Toolkit (BTK). Our health check plugin polls /health and filters by component name to validate worker registration.

Describe alternatives you've tried

We added an OR clause to our health check's jq filter to accept both "backend" and "tensorrt_llm" as decode components. This works but requires updating the filter every time a new framework adapter uses a different component name.

Metadata

Metadata

Assignees

Labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions