Merge branch 'main' into v0.3

carsontung666 · web-flow · commit e6c9e0f1189a · 2026-04-23T17:56:58.000+08:00
diff --git a/README.md b/README.md
@@ -1,32 +1,52 @@
 <div align="center">
 
-# ConDB
+<img src="https://docs.pageindex.ai/images/condb.png" alt="ConDB Banner" />
 
-<p align="center"><b>Context Database for Hierarchical Document Trees</b></p>
+<br/>
 
-<p align="center">
-  Store, navigate, and query hierarchical document structures with LLM-powered reasoning retrieval.
-</p>
+# ConDB: The KV-Cache Native Context Database
+
+<p align="center"><i>A new context database for reasoning-driven retrieval via tree search.<br/>
+Fast, context-aware retrieval at scale with up to 70% less token cost.</i></p>
 
 </div>
 
 ---
 
-## What is ConDB?
+## 🌲 What is ConDB?
+
+**ConDB** (Context Database) is a tree-structured context database that uses LLM-powered **reasoning-based retrieval** via tree search instead of vector similarity — no vector DB, no chunking. It accepts [PageIndex](https://github.com/VectifyAI/PageIndex)-compatible document trees, [ChatIndex](https://github.com/VectifyAI/ChatIndex) conversation trees, filesystem trees, and custom hierarchical JSON — with no runtime dependency on either. The LLM reasons over the tree, like a human expert using a table of contents, to locate relevant content.
+
+### Why not vector search?
+
+- **Similarity ≠ relevance** — vector search retrieves what looks similar, not what is truly relevant. Similar-looking chunks may differ in intent (low accuracy), while truly relevant information may be expressed in very different language and get missed entirely (low recall). True relevance requires reasoning
+- **Chunking breaks semantic continuity** — documents must be split into fixed-size segments to fit embedding models, causing context fragmentation that destroys their natural structure and cross-section relationships
+- **Retrieval is blind to context** — embedding models encode the query alone, ignoring conversational history, user intent, and other contextual signals
+
+ConDB replaces this with **reasoning-based tree search**: the LLM performs node-level relevance classification over a hierarchical index, incorporating full context — making retrieval adaptive, explainable, and traceable.
 
-**ConDB** stores hierarchical document trees in a SQLite database and provides LLM-powered **reasoning-based retrieval** to query them — no vector DB, no chunking. It accepts pageindex-compatible trees, chat trees, and custom hierarchical JSON without taking a runtime code dependency on PageIndex itself.
+### What makes ConDB different
 
-**Key capabilities:**
+- **Fast tree search at scale** — reasoning-driven tree search with block partitioning and parallel processing, supporting complex, context-aware retrieval over large hierarchical structures
+- **KV-cache native** — the first database designed around LLM KV-cache reuse. By caching intermediate results during tree search, ConDB reduces token usage by up to 70% with no loss in accuracy. The same efficiency gains extend to memory systems for long-context reasoning at scale
+- **Unified long-context infrastructure** — a single system for both static and dynamic long-context workloads
 
-- **Hierarchical storage** — store document trees, chat trees, and custom hierarchical JSON in SQLite
-- **Reasoning-based retrieval** — LLM navigates the tree to find relevant content, like a human expert
+### Static long context
+Structured, persistent knowledge — documents (via [PageIndex](https://github.com/VectifyAI/PageIndex)), file systems, and codebases. Scalable retrieval within large, organized hierarchies.
+
+### Dynamic long context
+Evolving, runtime context — agent memory, long conversations (via [ChatIndex](https://github.com/VectifyAI/ChatIndex)), and autoresearch. Systems can continuously update, retrieve, and reason over newly generated information.
+
+### Key capabilities
+
+- **Hierarchical storage** — document trees, chat trees, and custom hierarchical JSON in SQLite
 - **Multiple retrieval strategies** — beam search for small trees, block retrieval for large documents
-- **Multi-provider LLM support** — works with Anthropic (Claude) and OpenAI (GPT) out of the box
+- **Multi-provider LLM support** — Anthropic (Claude) and OpenAI (GPT) out of the box
 - **Extensible** — plug in custom storage backends, LLM providers, or retrieval strategies
 
 ---
 
-## Quick Start
+## 🚀 Getting Started
 
 ### Install
 
@@ -71,9 +91,7 @@ tree_id = ct.index_markdown_file("doc.md", tree_builder=build_markdown_tree)
 ct.close()
 ```
 
----
-
-## Configuration
+### Configuration
 
 Create a `.env` file with your API keys:
 
@@ -104,14 +122,14 @@ LLM_MODEL=claude-opus-4-6 python your_script.py
 
 ---
 
-## Retrieval Strategies
+## 🔍 Retrieval Strategies
 
 ConDB automatically selects the best retrieval strategy based on tree size:
 
 | Strategy | Best for | How it works |
 |----------|----------|--------------|
-| **Beam** | Small trees (< 50 nodes) | LLM evaluates and selects promising branches at each depth level |
-| **Block** | Large documents (50+ nodes) | Splits tree into token-bounded blocks, LLM reasons over each block |
+| **Beam** | Small trees <br/> (< 50 nodes) | LLM evaluates and selects promising branches at each depth level |
+| **Block** | Large documents <br/> (50+ nodes) | Splits tree into token-bounded blocks, LLM reasons over each block. KV-cache native — caches intermediate block results to cut token usage by up to 70% |
 
 You can also specify a strategy explicitly:
 
@@ -121,7 +139,7 @@ result = db.query(tree_id, "question", strategy="block", beam_size=3)
 
 ---
 
-## Benchmark
+## 📈 Benchmark Snapshot
 
 Two benchmarks live under `bench/`.
 
@@ -171,7 +189,9 @@ any `--doc` and any `--config` to benchmark a different document.
 
 ---
 
-## Architecture
+## 🧩 Learn More
+
+### Architecture
 
 ```
 contextdb/
@@ -190,12 +210,9 @@ contextdb/
 └── prompts/              # Jinja2 prompt templates
 ```
 
----
+### Extending
 
-## Extending
-
-<details>
-<summary><b>Custom Storage Backend</b></summary>
+**Custom Storage Backend**
 
 ```python
 from contextdb import StorageProtocol
@@ -207,10 +224,8 @@ class MyStorage:
 
 ct = ContextTree(storage=MyStorage())
 ```
-</details>
 
-<details>
-<summary><b>Custom LLM Provider</b></summary>
+**Custom LLM Provider**
 
 ```python
 from contextdb import LLMProtocol
@@ -221,25 +236,32 @@ class MyLLM:
 
 ct = ContextTree("db.sqlite", llm=MyLLM())
 ```
-</details>
 
----
-
-## Testing
+### Testing
 
 ```bash
 ./run_tests.sh all
 ```
 
 ---
 
-## Related Projects
+## 💬 Community
+
+### Related Projects
 
-- [**PageIndex**](https://github.com/VectifyAI/PageIndex) — one possible external producer of pageindex-compatible document trees
+- [**PageIndex**](https://github.com/VectifyAI/PageIndex) — vectorless, reasoning-based RAG that builds hierarchical tree indexes from long documents
+- [**ChatIndex**](https://github.com/VectifyAI/ChatIndex) — tree indexing for long conversations, enabling reasoning-based retrieval over chat histories
 - [**AgentFS**](https://github.com/anthropics/agentfs) — filesystem for AI agents
 
+### Connect with Us
+
+[![Twitter](https://img.shields.io/badge/Twitter-000000?style=for-the-badge&logo=x&logoColor=white)](https://x.com/PageIndexAI)&ensp;
+[![LinkedIn](https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/company/vectify-ai/)&ensp;
+[![Discord](https://img.shields.io/badge/Discord-5865F2?style=for-the-badge&logo=discord&logoColor=white)](https://discord.com/invite/VuXuf29EUj)&ensp;
+[![Contact Us](https://img.shields.io/badge/Contact_Us-3B82F6?style=for-the-badge&logo=envelope&logoColor=white)](https://ii2abc2jejf.typeform.com/to/tK3AXl8T)
+
 ---
 
-## License
+Licensed under [Apache 2.0](LICENSE).
 
-Apache-2.0
+© 2026 [Vectify AI](https://vectify.ai)