-**ConDB** is the storage and retrieval engine behind [PageIndex](https://github.com/VectifyAI/PageIndex). It stores hierarchical document trees (generated by PageIndex or other sources) in a SQLite database, and provides LLM-powered **reasoning-based retrieval** to query them — no vector DB, no chunking.
+**ConDB** stores hierarchical document trees in a SQLite database and provides LLM-powered **reasoning-based retrieval** to query them — no vector DB, no chunking. It accepts pageindex-compatible trees, chat trees, and custom hierarchical JSON without taking a runtime code dependency on PageIndex itself.

 **Key capabilities:**

-- **Hierarchical storage** — store tree-structured documents (PDFs, Markdown, custom JSON) in SQLite
+- **Hierarchical storage** — store document trees, chat trees, and custom hierarchical JSON in SQLite
 - **Reasoning-based retrieval** — LLM navigates the tree to find relevant content, like a human expert
 - **Multiple retrieval strategies** — beam search for small trees, block retrieval for large documents
 - **Multi-provider LLM support** — works with Anthropic (Claude) and OpenAI (GPT) out of the box
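
As a rough illustration of the hierarchical storage capability listed above, here is a minimal sketch using only Python's standard `sqlite3` module. It is not ConDB's actual schema or API: the `nodes` table, its columns, and the helpers `create_schema`, `insert_tree`, and `subtree_titles` are hypothetical names chosen for this example. The sketch stores a nested JSON-like tree as parent/child rows and reads a subtree back with a recursive query.

```python
import sqlite3


def create_schema(conn):
    # Hypothetical schema: one row per tree node, linked to its parent.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS nodes (
            id        INTEGER PRIMARY KEY,
            parent_id INTEGER REFERENCES nodes(id),
            title     TEXT NOT NULL,
            content   TEXT NOT NULL DEFAULT ''
        )
        """
    )


def insert_tree(conn, node, parent_id=None):
    # Insert a node, then its children depth-first; returns the node's row id.
    cur = conn.execute(
        "INSERT INTO nodes (parent_id, title, content) VALUES (?, ?, ?)",
        (parent_id, node["title"], node.get("content", "")),
    )
    node_id = cur.lastrowid
    for child in node.get("children", []):
        insert_tree(conn, child, node_id)
    return node_id


def subtree_titles(conn, root_id):
    # Walk the subtree under root_id with a recursive common table expression.
    # Rows are ordered by id, which matches insertion (pre-order) here.
    return conn.execute(
        """
        WITH RECURSIVE sub(id, title, depth) AS (
            SELECT id, title, 0 FROM nodes WHERE id = ?
            UNION ALL
            SELECT n.id, n.title, sub.depth + 1
            FROM nodes AS n JOIN sub ON n.parent_id = sub.id
        )
        SELECT title, depth FROM sub ORDER BY id
        """,
        (root_id,),
    ).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    doc = {
        "title": "Report",
        "children": [
            {"title": "Intro", "content": "Background."},
            {"title": "Methods", "children": [{"title": "Data"}]},
        ],
    }
    root_id = insert_tree(conn, doc)
    for title, depth in subtree_titles(conn, root_id):
        print("  " * depth + title)
```

An adjacency-list layout like this keeps inserts simple and lets a retrieval step load only the children of the node it is currently inspecting, which is the access pattern a tree-navigating retriever would plausibly need.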
@@ -38,7 +32,6 @@

 ```bash
 pip install -r requirements.txt
-pip install pageindex # optional, for PDF/Markdown indexing
 ```

 ### Basic Usage
@@ -52,29 +45,28 @@ db = contextdb.open("my_docs.sqlite")
`Block` is the best default on this `context7` snapshot: same retrieval quality as `Vertical`, with lower latency and fewer model calls. `Beam` is still workable, but it trails clearly on retrieval accuracy.
+
+These numbers are benchmark snapshots, not hard guarantees; exact cost and latency will vary with model choice, provider pricing, prompt-cache behavior, and corpus shape.
+
+---
+
 ## Architecture

 ```
 contextdb/
 ├── api/
 │   ├── condb.py          # ConDB — main entry point
-│   └── context_tree.py   # ContextTree — file indexing API
+│   └── context_tree.py   # ContextTree — tree indexing + query API