|
13 | 13 |
|
14 | 14 | # PageIndex: Vectorless, Reasoning-based RAG |
15 | 15 |
|
16 | | -<p align="center"><b>Reasoning-based RAG ◦ No Vector DB ◦ No Chunking ◦ Human-like Retrieval</b></p> |
| 16 | +<p align="center"><b>Reasoning-based RAG ◦ No Vector DB or Chunking ◦ Context-Aware ◦ Human-like Retrieval</b></p> |
17 | 17 |
|
18 | 18 | <h4 align="center"> |
19 | 19 | <a href="https://vectify.ai">🌐 Homepage</a> • |
|
33 | 33 | - 🔥 [**Agentic Vectorless RAG**](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py) — A simple *agentic, vectorless RAG* [example](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py) with self-hosted PageIndex, using OpenAI Agents SDK. |
34 | 34 | - [**Scale PageIndex to Millions of Documents**](https://pageindex.ai/blog/pageindex-filesystem) — *PageIndex File System* is a file-level tree layer that lets PageIndex reason over an entire corpus, not just a single document, enabling massive-scale document search. |
35 | 35 | - [PageIndex Chat](https://chat.pageindex.ai) — Human-like document analysis agent [platform](https://chat.pageindex.ai) for professional long documents. Also available via [MCP](https://pageindex.ai/developer) or [API](https://pageindex.ai/developer). |
36 | | -- [PageIndex Framework](https://pageindex.ai/blog/pageindex-intro) — Deep dive into PageIndex: an *agentic, in-context tree index* that enables LLMs to perform *reasoning-based, human-like retrieval* over long documents. |
| 36 | +- [PageIndex Framework](https://pageindex.ai/blog/pageindex-intro) — Deep dive into PageIndex: an *agentic, in-context tree index* that enables LLMs to perform *reasoning-based, context-aware retrieval* over long documents. |
37 | 37 |
|
38 | 38 | <!-- **🧪 Cookbooks:** |
39 | 39 | - [Vectorless RAG](https://docs.pageindex.ai/cookbook/vectorless-rag-pageindex): A minimal, hands-on example of reasoning-based RAG using PageIndex. No vectors, no chunking, and human-like retrieval. |
@@ -64,8 +64,9 @@ It simulates how *human experts* navigate and extract knowledge from complex doc |
64 | 64 | Compared to traditional vector-based RAG, **PageIndex** features: |
65 | 65 | - **No Vector DB**: Uses document structure and LLM reasoning for retrieval, instead of vector similarity search. |
66 | 66 | - **No Chunking**: Documents are organized into natural sections, not artificial chunks. |
| 67 | +- **Better Explainability and Traceability**: Retrieval is based on reasoning, traceable and interpretable, with page and section references. No more opaque, approximate vector search (“vibe retrieval”). |
| 68 | +- **Context-Aware Retrieval**: Retrieval depends on your full context (e.g., conversation history and domain knowledge), and easily incorporates new context. |
67 | 69 | - **Human-like Retrieval**: Simulates how human experts navigate and extract knowledge from complex documents. |
68 | | -- **Better Explainability and Traceability**: Retrieval is based on reasoning — traceable and interpretable, with page and section references. No more opaque, approximate vector search (“vibe retrieval”). |
69 | 70 |
|
70 | 71 | PageIndex powers a reasoning-based RAG system that achieved **state-of-the-art** [98.7% accuracy](https://github.com/VectifyAI/Mafin2.5-FinanceBench) on FinanceBench, demonstrating superior performance over vector-based RAG solutions in professional document analysis. See our [blog post](https://vectify.ai/blog/Mafin2.5) for details. |
71 | 72 |
|
|
0 commit comments