This Docker Compose configuration provides an AI stack with three main services:
- Ollama: AI model server with NVIDIA GPU support
- Open WebUI: Web interface for interacting with Ollama models
- Uptime Kuma: Monitoring solution for service health
Prerequisites:
- Docker and Docker Compose
- NVIDIA GPU with compatible drivers
- NVIDIA Container Toolkit installed
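A quick way to confirm the prerequisites before starting anything is to check that the relevant commands are on `PATH`. The sketch below is only an illustration: `check_cmd` is a hypothetical helper name, and `just` is included on the assumption that the `just start` command in this README refers to the `just` command runner.

```shell
#!/bin/sh
# Report whether a command is available on PATH.
check_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "missing: $1"
  fi
}

# docker and nvidia-smi correspond to the prerequisites listed above;
# just is an assumption based on the `just start` command.
for cmd in docker nvidia-smi just; do
  check_cmd "$cmd"
done

# `docker compose version` fails when the Compose plugin is not installed.
if command -v docker >/dev/null 2>&1; then
  if docker compose version >/dev/null 2>&1; then
    echo "ok: docker compose"
  else
    echo "missing: docker compose"
  fi
fi
```

If everything reports `ok`, running `docker run --rm --gpus all ubuntu nvidia-smi` is a common smoke test that the NVIDIA Container Toolkit can expose the GPU to containers.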
- Start the services:
  just start
- Access the services:
  - Open WebUI: http://localhost:8080
  - Uptime Kuma: http://localhost:3001
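To check from the host that both web interfaces are answering, something like the following may help. It is a minimal sketch: `probe` is a hypothetical helper, and it assumes `curl` is installed and that Open WebUI serves its `/health` endpoint on the published port 8080.

```shell
#!/bin/sh
# Print "up" if the URL answers with a non-error HTTP status, else "down".
probe() {
  if curl -fs -o /dev/null --max-time 5 "$1" 2>/dev/null; then
    echo "up: $1"
  else
    echo "down: $1"
  fi
}

probe http://localhost:8080/health   # Open WebUI health endpoint
probe http://localhost:3001          # Uptime Kuma web UI
```

A `down` result right after startup does not necessarily mean a failure; the containers may still be initializing.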
After starting the services, configure Uptime Kuma to monitor both Ollama and Open WebUI:
- Access Uptime Kuma at http://localhost:3001
- Create a new account during first login (not really yet)
- Add monitors for both services:
  - Click "Add New Monitor"
  - Configure:
    - Monitor Type: HTTP(s)
    - Name: Open WebUI Health
    - URL: http://open-webui:8080/health
    - Monitoring Interval: 60 seconds
    - Retry Count: 3
  - Click "Add New Monitor"
  - Configure:
    - Monitor Type: HTTP(s)
    - Name: Ollama Health
    - URL: http://ollama:11434/ (Ollama's root endpoint, which answers "Ollama is running")
    - Monitoring Interval: 60 seconds
    - Retry Count: 3
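The monitor URLs use the Compose service names (`open-webui`, `ollama`), which resolve only inside the Compose network, not from the host. One way to test them as Uptime Kuma would see them is to run a throwaway `curlimages/curl` container on the same network. This is a sketch under assumptions: `net_probe` is a hypothetical helper, and `ai-stack_default` is a placeholder network name (Compose normally names the default network `<project>_default`; `docker network ls` shows the real one). The Ollama check targets the root URL, which replies "Ollama is running".

```shell
#!/bin/sh
# Placeholder network name; override with NETWORK=<name> after checking `docker network ls`.
NETWORK="${NETWORK:-ai-stack_default}"

# Probe a URL from a temporary container attached to the Compose network.
net_probe() {
  if ! command -v docker >/dev/null 2>&1; then
    echo "skipped: $1"
    return 0
  fi
  if docker run --rm --network "$NETWORK" curlimages/curl \
       -fs -o /dev/null --max-time 5 "$1" >/dev/null 2>&1; then
    echo "up: $1"
  else
    echo "down: $1"
  fi
}

net_probe http://open-webui:8080/health
net_probe http://ollama:11434/
```

If a URL reports `up` here, the same URL should work as a monitor target in Uptime Kuma.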
All services use persistent volumes:
- Ollama data: /root/.ollama
- Open WebUI data: /app/backend/data
- Uptime Kuma data: /app/data
For more advanced monitoring options, including Model Connectivity and Response Testing, refer to the Open WebUI Monitoring Documentation.