BubbleHead RAG Pipeline

BubbleHead is a local retrieval-augmented generation (RAG) application built on Ollama, ChromaDB, BM25 reranking, and an iterative gap-analysis loop. The current repo ships a FastAPI backend, a static HTML/JavaScript frontend, and a CLI for batch ingestion and ad hoc queries.

Highlights

Hybrid retrieval with ChromaDB vector search and BM25 reranking
Gap analysis loop that retries low-confidence queries with refined wording
Token-budget-aware chunking and retrieval safeguards
Multi-format ingestion for PDF, DOCX, PPTX, TXT, HTML, and CSV files
Local-first runtime with Ollama models and on-disk Chroma persistence
Browser UI plus CLI entry points for ingestion and querying

Architecture

retrieve_node -> generate_node -> gap_analysis_node
                                      |
                                   PASS -> end
                                   RETRY -> retrieve_node

Requirements

Python 3.11 or 3.12
Ollama installed locally
A pulled embedding model: nomic-embed-text
A pulled generation model: mistral:latest

Quick Start

Clone the repository and move into it.
```
git clone <your-repo-url>
cd BubbleHead
```

Create and activate a virtual environment.

python -m venv .venv

Windows:

.venv\Scripts\activate

macOS/Linux:

source .venv/bin/activate

Install Python dependencies.
```
pip install -r requirements.txt
```
Start Ollama and pull the models used by config.py.
```
ollama serve
```
In another terminal:
```
ollama pull nomic-embed-text
ollama pull mistral:latest
```

Running The App

Web UI

Start the FastAPI server with either helper script or directly:

# Windows
start_ui.bat

# macOS / Linux
./start_ui.sh

# direct
python ui.py

Open http://localhost:7860 after the server reports that the pipeline is ready.

CLI

Ingest every supported document under a directory:

python main.py ingest ./data

Run a query against the stored collection:

python main.py query "What are the main findings?"

API Surface

The FastAPI app in ui.py exposes:

GET /api/status for warm-up state
POST /api/ingest for single-file ingestion
POST /api/query for question answering
GET /api/collection for collection stats

Configuration

Project defaults live in config.py.

OLLAMA_BASE_URL = "http://localhost:11434"
EMBED_MODEL = "nomic-embed-text"
LLM_MODEL = "mistral:latest"

CHUNK_SIZE = 512
CHUNK_OVERLAP = 50
TOP_K_CANDIDATES = 10
TOP_K_FINAL = 6
TOKEN_BUDGET = 5000

GAP_CONFIDENCE_THRESHOLD = 0.6
GAP_MAX_ITERATIONS = 2

Project Layout

BubbleHead/
|-- config.py
|-- main.py
|-- ui.py
|-- frontend/
|   `-- index.html
|-- ingestion/
|   |-- Chunker.py
|   |-- Embedder.py
|   `-- parsers/Parser.py
|-- retrieval/
|   |-- Retriever.py
|   `-- gap_analysis_agent.py
|-- pipeline/
|   |-- generator.py
|   `-- pipeline.py
`-- data/
    `-- chroma/

Retrieval Flow

Documents are parsed into sections, chunked, validated, embedded, and stored in ChromaDB.
Queries are embedded with the search_query: prefix and matched against stored vectors.
The retriever reranks candidates with BM25 and trims the final context to the configured token budget.
The generator produces an answer, and the gap-analysis step decides whether retrieval should retry.

Troubleshooting

Ollama connection errors

Make sure ollama serve is running and the configured models are pulled locally.

Pipeline warming up

The UI loads before heavy pipeline imports finish. Retry once GET /api/status reports "ready": true.

Missing parser dependencies

Reinstall with pip install -r requirements.txt. PDF fallback parsing requires pdfplumber.

Collection issues

Delete data/chroma/ and re-ingest if the local collection becomes inconsistent.

Development

Suggested local checks:

black --check .
flake8 .
python -m compileall .

License

MIT. See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BubbleHead RAG Pipeline

Highlights

Architecture

Requirements

Quick Start

Running The App

Web UI

CLI

API Surface

Configuration

Project Layout

Retrieval Flow

Troubleshooting

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
agents		agents
data		data
frontend		frontend
ingestion		ingestion
pipeline		pipeline
retrieval		retrieval
.env.example		.env.example
.gitignore		.gitignore
Architecture.txt		Architecture.txt
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Parser.py		Parser.py
README.md		README.md
README_UI.md		README_UI.md
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt
start_ui.bat		start_ui.bat
start_ui.sh		start_ui.sh
ui.py		ui.py

Folders and files

Latest commit

History

Repository files navigation

BubbleHead RAG Pipeline

Highlights

Architecture

Requirements

Quick Start

Running The App

Web UI

CLI

API Surface

Configuration

Project Layout

Retrieval Flow

Troubleshooting

Development

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages