Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .github/workflows/build-push-images.yml
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@ jobs:
- chat
- image-analysis
- flux-image-gen
- omni
permissions:
contents: read
id-token: write # needed for signing the images with GitHub OIDC Token
Expand Down
33 changes: 33 additions & 0 deletions charts/azimuth-omni-backend/.helmignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
# Patterns to ignore when building packages.
# This supports shell glob matching, relative path matching, and
# negation (prefixed with !). Only one pattern per line.
.DS_Store
# Common VCS dirs
.git/
.gitignore
.bzr/
.bzrignore
.hg/
.hgignore
.svn/
# Common backup files
*.swp
*.bak
*.tmp
*.orig
*~
# Various IDEs
.project
.idea/
*.tmproj
.vscode/

# Others
README.md
Dockerfile
*kubeconfig.y[a]ml
venv/
__pycache__/
images/
*.secret
ci/
16 changes: 16 additions & 0 deletions charts/azimuth-omni-backend/Chart.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
apiVersion: v2
name: azimuth-llm-omni-backend
description: |
In-cluster vLLM backends for the Omni multimodal interface
(text-to-text / chat, text-to-speech, text-to-image).
maintainers:
- name: "Victor HANG"
email: victor@stackhpc.com

type: application

# The version and appVersion are updated by the chart build script
version: 0.1.0
appVersion: local

icon: https://huggingface.co/datasets/huggingface/brand-assets/resolve/main/hf-logo.svg
13 changes: 13 additions & 0 deletions charts/azimuth-omni-backend/ci/test-values.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# CI: only exercise the TTT backend with the smallest possible model.
ttt:
enabled: true
huggingface:
model: HuggingFaceTB/SmolLM2-135M-Instruct
api:
monitoring:
enabled: false
gpus: 0
tts:
enabled: false
tti:
enabled: false
17 changes: 17 additions & 0 deletions charts/azimuth-omni-backend/templates/NOTES.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
Azimuth Omni backends provide one or more in-cluster vLLM model deployments
(text-to-text / chat, text-to-speech, text-to-image) for the Omni interface.

Enabled backends in this release:
{{- range $alias := list "ttt" "tts" "tti" }}
{{- $sub := index $.Values $alias }}
{{- if and $sub $sub.enabled }}
- {{ $alias }}: {{ $sub.huggingface.model }} (in-cluster at http://{{ $.Release.Name }}-{{ $alias }}.{{ $.Release.Namespace }}.svc)
{{- else }}
- {{ $alias }}: disabled (toggle with `{{ $alias }}.enabled=true`)
{{- end }}
{{- end }}

Each enabled backend downloads its model weights from HuggingFace on first
start, which can take a while.

Release notes: https://github.com/stackhpc/azimuth-llm/releases
Loading
Loading