From dd0b19a4c9c00d674c8258d041274dbe6284bcf3 Mon Sep 17 00:00:00 2001 From: pmady Date: Tue, 26 May 2026 19:41:47 -0500 Subject: [PATCH] Add keda-gpu-scaler to Inference Platform section --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 7eb7a46..19e565f 100644 --- a/README.md +++ b/README.md @@ -78,6 +78,7 @@ * **[Kaito](https://github.com/kaito-project/Kaito)**: Kubernetes operator for large-model inference and fine-tuning, with GPU auto-provisioning, container-based hosting, and CRD-based orchestration. ![Stars](https://img.shields.io/github/stars/kaito-project/Kaito.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/kaito-project/Kaito?color=green) ![LastCommit](https://img.shields.io/github/last-commit/kaito-project/Kaito?color=green) * **[Kserve](https://github.com/kserve/kserve)**: Standardized Serverless ML Inference Platform on Kubernetes. ![Stars](https://img.shields.io/github/stars/kserve/kserve.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/kserve/kserve?color=green) ![LastCommit](https://img.shields.io/github/last-commit/kserve/kserve?color=green) * **[KubeAI](https://github.com/substratusai/kubeai)**: AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text. ![Stars](https://img.shields.io/github/stars/substratusai/kubeai.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/substratusai/kubeai?color=green) ![LastCommit](https://img.shields.io/github/last-commit/substratusai/kubeai?color=green) +* **[keda-gpu-scaler](https://github.com/pmady/keda-gpu-scaler)**: KEDA external scaler for GPU-based autoscaling of inference workloads (vLLM, Triton) using NVML, with scale-to-zero support. ![Stars](https://img.shields.io/github/stars/pmady/keda-gpu-scaler.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/pmady/keda-gpu-scaler?color=green) ![LastCommit](https://img.shields.io/github/last-commit/pmady/keda-gpu-scaler?color=green) * **[llm-d](https://github.com/llm-d/llm-d)**: llm-d is a Kubernetes-native high-performance distributed LLM inference framework ![Stars](https://img.shields.io/github/stars/llm-d/llm-d.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/llm-d/llm-d?color=green) ![LastCommit](https://img.shields.io/github/last-commit/llm-d/llm-d?color=green) * **[llmaz](https://github.com/InftyAI/llmaz)**: ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ![Stars](https://img.shields.io/github/stars/inftyai/llmaz.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/inftyai/llmaz?color=green) ![LastCommit](https://img.shields.io/github/last-commit/inftyai/llmaz?color=green) * **[Modular](https://github.com/modular/modular)**: The Modular Platform (includes MAX & Mojo) ![Stars](https://img.shields.io/github/stars/modular/modular.svg?style=flat&color=green) ![Contributors](https://img.shields.io/github/contributors/modular/modular?color=green) ![LastCommit](https://img.shields.io/github/last-commit/modular/modular?color=green)