| Metric | Value | Notes |
|---|---|---|
| Typical RAM | 1.5-2GB | 50K documents |
| Query CPU | 15-30% | Single query |
| Query Time (uncached) | 250ms | Average |
| Query Time (cached) | 5ms | 65-80% hit rate |
| Index Time | 15-20 min | 50K documents |
Total RAM: ~1.8GB
ββ ChromaDB: 800MB (44%)
ββ Embedding Model: 300MB (17%)
ββ Query Cache: 150MB (8%)
ββ TF-IDF: 100MB (6%)
ββ BM25 (dynamic): 150MB (8%)
ββ Application: 300MB (17%)
dense_top_k: 30
sparse_top_k: 30
max_bm25_docs: 10
cache_max_memory_entries: 500Result: ~1.2-1.5GB RAM, +50ms query time
dense_top_k: 50
sparse_top_k: 50
max_bm25_docs: 15
cache_max_memory_entries: 1000Result: ~1.8-2.2GB RAM, 250ms query time
dense_top_k: 100
sparse_top_k: 100
max_bm25_docs: 50
cache_max_memory_entries: 2000Result: ~2.5-3GB RAM, 180ms query time
| Documents | Min RAM | Recommended RAM | Query Time | Index Time |
|---|---|---|---|---|
| 10K | 1GB | 2GB | 180ms | 3 min |
| 50K | 1.5GB | 4GB | 250ms | 15 min |
| 100K | 2GB | 8GB | 350ms | 35 min |
| 500K+ | 4GB | 16GB+ | 600ms | 3+ hours |
resources:
limits:
memory: "2Gi"
cpu: "1"
reservations:
memory: "1Gi"
cpu: "0.5"resources:
limits:
memory: "4Gi"
cpu: "2"
reservations:
memory: "2Gi"
cpu: "1"resources:
limits:
memory: "8Gi"
cpu: "4"
reservations:
memory: "4Gi"
cpu: "2"Last Updated: October 27, 2024
Quick Access: Keep this handy for production deployments!