Installation & Maintenance Guide

Prerequisites

Python 3.11+ (3.12 recommended)
Docker & Docker Compose (for PostgreSQL, Qdrant, Redis)
Node.js (optional, for Newman CLI testing)

Quick Start

1. Clone & Install

git clone <repo-url> && cd mini-chat-rag
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

2. Configure Environment

cp .env.example .env

Edit .env with your settings:

Variable	Description	Default
`DATABASE_URL`	PostgreSQL async connection string	`postgresql+asyncpg://minirag:changeme@localhost:5432/minirag`
`REDIS_URL`	Redis connection string	`redis://localhost:6379/0`
`QDRANT_URL`	Qdrant REST URL	`http://localhost:6333`
`ENCRYPTION_KEY`	Fernet key for field encryption	required
`JWT_SECRET_KEY`	Secret for JWT signing	required
`JWT_ALGORITHM`	JWT algorithm	`HS256`
`JWT_EXPIRE_MINUTES`	JWT token TTL	`60`
`ALLOWED_ORIGINS`	CORS allowed origins (comma-separated)	`*`
`DEFAULT_LLM_MODEL`	Default LLM model	`gpt-4o-mini`
`DEFAULT_EMBEDDING_MODEL`	Default embedding model	`text-embedding-3-small`

Generate keys:

# Fernet key
python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"

# JWT secret
python -c "import secrets; print(secrets.token_urlsafe(32))"

3. Start Infrastructure

docker compose up -d postgres qdrant redis

4. Run the API

uvicorn app.main:app --reload

The API is now available at http://localhost:8000. Docs at /docs.

5. Run the Worker

In a separate terminal:

source .venv/bin/activate
python -m app.workers.main

The worker handles:

Ingestion tasks — Processing sources into chunks and vectors
Auto-refresh cron — Checks every 15 minutes for URL sources with scheduled re-ingestion (hourly/daily/weekly)

6. Access the Dashboard

Open http://localhost:8000/dashboard in your browser.

First-Run Bootstrap

Create your first tenant (no auth required):

curl -X POST http://localhost:8000/v1/tenants \
  -H "Content-Type: application/json" \
  -d '{
    "tenant_name": "My Company",
    "tenant_slug": "my-company",
    "owner_email": "admin@example.com",
    "owner_password": "supersecret123"
  }'

This returns a one-time API token. You can now log into the dashboard with the email/password above.

Docker Compose (Full Stack)

To run everything including the web server and worker:

docker compose up -d

Services:

postgres — localhost:5432 (metadata, auth)
qdrant — localhost:6333 (REST), localhost:6334 (gRPC) — vector storage
redis — localhost:6379 (task queue)
web — localhost:8000 (API + Dashboard), with Traefik labels for production reverse proxy
worker — ARQ background worker (ingestion + auto-refresh cron)

Database Migrations (Alembic)

MiniRAG uses Alembic for database schema migrations in production:

# Run pending migrations
docker compose exec web alembic upgrade head

# Check current migration status
docker compose exec web alembic current

# View migration history
docker compose exec web alembic history

# Generate a new migration after model changes
docker compose exec web alembic revision --autogenerate -m "description"

The Alembic configuration uses async SQLAlchemy and loads the database URL from the application settings automatically.

Important: After deploying new code that includes model changes, always run alembic upgrade head before restarting the web service. For Docker deployments, this can be added to your startup script or CI/CD pipeline.

Running Tests

Tests use SQLite in-memory (no Docker needed):

# Run all tests
pytest tests/ -v

# Run a specific test file
pytest tests/test_webhooks.py -v

# Run a single test
pytest tests/test_chat.py::test_chat_new_conversation -v

Currently 144 tests covering auth, CRUD, ingestion, chat, streaming, webhooks, export, analytics, NyxCore integration, and more.

Production Considerations

Run alembic upgrade head after every deployment with schema changes
Set strong ENCRYPTION_KEY and JWT_SECRET_KEY values
Configure CORS allow_origins to your domain (not *)
Use Traefik or nginx for TLS termination (Traefik handles auto-TLS via Let's Encrypt)
Set JWT_EXPIRE_MINUTES to an appropriate value
Monitor with the /v1/system/health endpoint
Set up webhook endpoints to receive notifications for ingestion and chat events
Consider log aggregation for the web and worker containers
Back up PostgreSQL regularly; Qdrant vectors can be rebuilt from source content via re-ingestion

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Installation & Maintenance Guide

Prerequisites

Quick Start

1. Clone & Install

2. Configure Environment

3. Start Infrastructure

4. Run the API

5. Run the Worker

6. Access the Dashboard

First-Run Bootstrap

Docker Compose (Full Stack)

Database Migrations (Alembic)

Running Tests

Production Considerations

Uh oh!

FilesExpand file tree

installation.md

Latest commit

History

installation.md

File metadata and controls

Installation & Maintenance Guide

Prerequisites

Quick Start

1. Clone & Install

2. Configure Environment

3. Start Infrastructure

4. Run the API

5. Run the Worker

6. Access the Dashboard

First-Run Bootstrap

Docker Compose (Full Stack)

Database Migrations (Alembic)

Running Tests

Production Considerations