VectifyAI
diff --git a/‎README.md‎
Lines changed: 59 additions & 20 deletions b/‎README.md‎
Lines changed: 59 additions & 20 deletions
diff --git a/‎assets/banner.png‎
830 KB b/‎assets/banner.png‎
830 KB
@@ -1,32 +1,53 @@
 <div align="center">
 
-# ConDB
+<img src="assets/banner.png" alt="ConDB Banner" />
 
-<p align="center"><b>Context Database for Hierarchical Document Trees</b></p>
+<br/>
+
+# ConDB: The KV-Cache Native Context Database
 
 <p align="center">
-  Store, navigate, and query hierarchical document structures with LLM-powered reasoning retrieval.
+  A new type of database optimized for reasoning-driven tree search — fast, context-aware retrieval at scale with up to 70% less token cost.
 </p>
 
 </div>
 
 ---
 
-## What is ConDB?
+## 🌲 What is ConDB?
+
+**ConDB** (Context Database) is a tree-structured context database that uses LLM-powered **reasoning-based retrieval** via tree search instead of vector similarity — no vector DB, no chunking. It accepts [PageIndex](https://github.com/VectifyAI/PageIndex)-compatible document trees, [ChatIndex](https://github.com/VectifyAI/ChatIndex) conversation trees, filesystem trees, and custom hierarchical JSON — with no runtime dependency on either. The LLM reasons over the tree, like a human expert using a table of contents, to locate relevant content.
+
+### Why not vector search?
+
+- **Similarity ≠ relevance** — vector search retrieves what looks similar, not what is truly relevant. Similar-looking chunks may differ in intent (low accuracy), while truly relevant information may be expressed in very different language and get missed entirely (low recall). True relevance requires reasoning
+- **Chunking breaks semantic continuity** — documents must be split into fixed-size segments to fit embedding models, causing context fragmentation that destroys their natural structure and cross-section relationships
+- **Retrieval is blind to context** — embedding models encode the query alone, ignoring conversational history, user intent, and other contextual signals
+
+ConDB replaces this with **reasoning-based tree search**: the LLM performs node-level relevance classification over a hierarchical index, incorporating full context — making retrieval adaptive, explainable, and traceable.
 
-**ConDB** stores hierarchical document trees in a SQLite database and provides LLM-powered **reasoning-based retrieval** to query them — no vector DB, no chunking. It accepts pageindex-compatible trees, chat trees, and custom hierarchical JSON without taking a runtime code dependency on PageIndex itself.
+### What makes ConDB different
 
-**Key capabilities:**
+- **Fast tree search at scale** — reasoning-driven tree search with block partitioning and parallel processing, supporting complex, context-aware retrieval over large hierarchical structures
+- **KV-cache native** — the first database designed around LLM KV-cache reuse. By caching intermediate results during tree search, ConDB reduces token usage by up to 70% with no loss in accuracy. The same efficiency gains extend to memory systems for long-context reasoning at scale
+- **Unified long-context infrastructure** — a single system for both static and dynamic long-context workloads
 
-- **Hierarchical storage** — store document trees, chat trees, and custom hierarchical JSON in SQLite
-- **Reasoning-based retrieval** — LLM navigates the tree to find relevant content, like a human expert
+### Static long context
+Structured, persistent knowledge — documents (via [PageIndex](https://github.com/VectifyAI/PageIndex)), file systems, and codebases. Scalable retrieval within large, organized hierarchies.
+
+### Dynamic long context
+Evolving, runtime context — agent memory, long conversations (via [ChatIndex](https://github.com/VectifyAI/ChatIndex)), and autoresearch. Systems can continuously update, retrieve, and reason over newly generated information.
+
+### Key capabilities
+
+- **Hierarchical storage** — document trees, chat trees, and custom hierarchical JSON in SQLite
 - **Multiple retrieval strategies** — beam search for small trees, block retrieval for large documents
-- **Multi-provider LLM support** — works with Anthropic (Claude) and OpenAI (GPT) out of the box
+- **Multi-provider LLM support** — Anthropic (Claude) and OpenAI (GPT) out of the box
 - **Extensible** — plug in custom storage backends, LLM providers, or retrieval strategies
 
 ---
 
-## Quick Start
+## ⚡ Quick Start
 
 ### Install
 
@@ -73,7 +94,7 @@ ct.close()
 
 ---
 
-## Configuration
+## ⚙️ Configuration
 
 Create a `.env` file:
 
@@ -93,14 +114,14 @@ llm = Config.get_llm_client()
 
 ---
 
-## Retrieval Strategies
+## 🔍 Retrieval Strategies
 
 ConDB automatically selects the best retrieval strategy based on tree size:
 
 | Strategy | Best for | How it works |
 |----------|----------|--------------|
 | **Beam** | Small trees (< 50 nodes) | LLM evaluates and selects promising branches at each depth level |
-| **Block** | Large documents (50+ nodes) | Splits tree into token-bounded blocks, LLM reasons over each block |
+| **Block** | Large documents (50+ nodes) | Splits tree into token-bounded blocks, LLM reasons over each block. KV-cache native — caches intermediate block results to cut token usage by up to 70% |
 
 You can also specify a strategy explicitly:
 
@@ -110,7 +131,7 @@ result = db.query(tree_id, "question", strategy="block", beam_size=3)
 
 ---
 
-## Benchmark Snapshot
+## 📈 Benchmark Snapshot
 
 Current filesystem benchmark summary lives in [bench/fs_block_beam_vertical.md](bench/fs_block_beam_vertical.md).
 
@@ -130,7 +151,7 @@ These numbers are benchmark snapshots, not hard guarantees; exact cost and laten
 
 ---
 
-## Architecture
+## 🧩 Architecture
 
 ```
 contextdb/
@@ -151,7 +172,7 @@ contextdb/
 
 ---
 
-## Extending
+## 🔌 Extending
 
 <details>
 <summary><b>Custom Storage Backend</b></summary>
@@ -184,21 +205,39 @@ ct = ContextTree("db.sqlite", llm=MyLLM())
 
 ---
 
-## Testing
+## 🧪 Testing
 
 ```bash
 ./run_tests.sh all
 ```
 
 ---
 
-## Related Projects
+## 🧭 Related Projects
 
-- [**PageIndex**](https://github.com/VectifyAI/PageIndex) — one possible external producer of pageindex-compatible document trees
+- [**PageIndex**](https://github.com/VectifyAI/PageIndex) — vectorless, reasoning-based RAG that builds hierarchical tree indexes from long documents
+- [**ChatIndex**](https://github.com/VectifyAI/ChatIndex) — tree indexing for long conversations, enabling reasoning-based retrieval over chat histories
 - [**AgentFS**](https://github.com/anthropics/agentfs) — filesystem for AI agents
 
 ---
 
-## License
+## 📄 License
 
 Apache-2.0
+
+---
+
+### Connect with Us
+
+<div align="center">
+
+[![Twitter](https://img.shields.io/badge/Twitter-000000?style=for-the-badge&logo=x&logoColor=white)](https://x.com/PageIndexAI)&ensp;
+[![LinkedIn](https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/company/vectify-ai/)&ensp;
+[![Discord](https://img.shields.io/badge/Discord-5865F2?style=for-the-badge&logo=discord&logoColor=white)](https://discord.com/invite/VuXuf29EUj)&ensp;
+[![Contact Us](https://img.shields.io/badge/Contact_Us-3B82F6?style=for-the-badge&logo=envelope&logoColor=white)](https://ii2abc2jejf.typeform.com/to/tK3AXl8T)
+
+</div>
+
+---
+
+© 2026 [Vectify AI](https://vectify.ai)