Skip to content

Add keda-gpu-scaler to Inference Platform section#438

Open
pmady wants to merge 1 commit into
InftyAI:mainfrom
pmady:add-keda-gpu-scaler
Open

Add keda-gpu-scaler to Inference Platform section#438
pmady wants to merge 1 commit into
InftyAI:mainfrom
pmady:add-keda-gpu-scaler

Conversation

@pmady

@pmady pmady commented May 27, 2026

Copy link
Copy Markdown

Adding keda-gpu-scaler to the Inference Platform section.

It's a KEDA external scaler that reads NVIDIA GPU metrics directly from NVML C-bindings and autoscales Kubernetes inference workloads — including scale-to-zero. Works alongside vLLM, Triton, and other inference engines.

Apache 2.0, CI, Helm chart, docs, OpenSSF Best Practices badge.

@InftyAI-Agent InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels May 27, 2026
@InftyAI-Agent InftyAI-Agent requested review from cr7258 and samzong May 27, 2026 00:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

do-not-merge/needs-kind Indicates a PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants