One-click deployment for Live VLM WebUI (real-time Vision Language Model streaming via WebRTC) on NVIDIA Jetson devices, powered by Ollama as the inference backend.
- NVIDIA Jetson Orin or Thor series
- At least 8 GB system memory
- At least 15 GB available disk space
- USB or CSI camera for live streaming (optional)
| JetPack | L4T | Platform |
|---|---|---|
| 6.0 | 36.3.0 | Orin |
| 6.1 | 36.4.0 | Orin |
| 6.2 | 36.4.3 | Orin |
| 6.2.1 | 36.4.4 | Orin |
| 7.0 | 38.1.0 | Thor |
| 7.1 | 38.2.0 | Thor |
- No custom Dockerfile — uses official GHCR pre-built images
- Dual-container architecture — Ollama backend + WebUI frontend via docker compose
- Interactive model selection — choose from 7 pre-configured VLM models, or skip
- Smart checks — skips pulling already-downloaded models and re-creating existing containers
- Automatic GPU mode detection — supports both
--runtime nvidiaand--gpus all
Install the jetson-examples CLI (recommended):
pip install jetson-examplesOr use the source directly:
git clone https://github.com/Seeed-Projects/jetson-examples
cd jetson-examples
pip install .reComputer run live-vlm-webuiThe script will:
- Verify the JetPack version, disk space, and memory
- Probe available Docker GPU mode (
--runtime nvidia) - Start the Ollama container and wait for it to be healthy
- Ask you to select a VLM model (or skip)
- Pull the selected model if needed
- Start the live-vlm-webui container
- Wait for the HTTPS server on port 8090 to become ready
- Print access URLs
The script presents 7 pre-configured models:
| # | Model | Parameters | VRAM | Use Case |
|---|---|---|---|---|
| 1 | gemma3:4b |
4B | 6GB | Entry-level |
| 2 | gemma3:12b |
12B | 10GB | Balanced |
| 3 | llava:7b |
7B | 6GB | Vision |
| 4 | llama3.2-vision:11b |
11B | 14GB | Vision |
| 5 | moondream:latest |
~1B | 1GB | Ultra-light vision |
| 6 | gemma3:4b |
4B | 6GB | Entry-level |
| 7 | nomic-embed-text:latest |
— | — | Embedding (opt.) |
| 0 | Skip (no model) | — | — | Manual pull later |
OLLAMA_MODEL=qwen2.5-vl:7b reComputer run live-vlm-webuiOpen in browser:
https://<jetson-ip>:8090
Or test with curl:
curl -k https://localhost:8090docker logs -f live-vlm-webui
docker logs -f ollamadocker exec ollama ollama list
docker exec ollama ollama pull llama3.2-vision:11b
docker exec ollama ollama rm <model-name>Remove containers (keep images for faster next startup):
reComputer clean live-vlm-webuiTo also remove downloaded models:
docker volume rm live-vlm-webui_ollama-datareComputer run live-vlm-webui
|
+-- run.sh (docker access, GPU mode detection, container management)
|
+-- Ollama container :11434 (VLM inference backend)
+-- live-vlm-webui container :8090 HTTPS (WebRTC streaming UI)
| Variable | Description | Default |
|---|---|---|
OLLAMA_MODEL |
Pre-select a model, skip interactive prompt | — |