docs: add RagCode vs Generic RAG comparison section

doITmagic · doITmagic · commit 73ec24fd2774 · 2025-11-26T15:34:51.000+02:00
Explains key differentiators:
- Semantic chunking (functions/classes) vs arbitrary text splits
- Rich metadata (name, type, params, deps, lines) vs filename only
- Relationship tracking (inheritance, method calls, imports)
- AST parsing per language vs treating code as plain text
- Exact + semantic search combined
- Complete runnable code units vs fragments
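
The semantic chunking and metadata points above can be illustrated with a minimal sketch built on Go's standard `go/parser` and `go/ast` packages. This is not RagCode's actual implementation: the `Chunk` type, the `ChunkGoSource` function, and the `sample` source are hypothetical names introduced for illustration, and the sketch only handles top-level functions and methods.

```go
package main

import (
	"fmt"
	"go/ast"
	"go/parser"
	"go/token"
)

// Chunk is a hypothetical record for one semantic unit.
// RagCode's real schema is an assumption here and may differ.
type Chunk struct {
	Name      string
	Kind      string // "function" or "method"
	Params    []string
	StartLine int
	EndLine   int
	Code      string
}

// sample is made-up source text used only to exercise the sketch.
const sample = `package auth

func AuthenticateUser(token string) (*User, error) {
	user, err := db.FindByToken(token)
	if err != nil {
		return nil, ErrInvalidToken
	}
	return user, nil
}

func (s *Session) Expired() bool { return s.ttl <= 0 }
`

// ChunkGoSource parses src and returns one Chunk per top-level
// function or method, each holding the complete declaration text.
func ChunkGoSource(filename, src string) ([]Chunk, error) {
	fset := token.NewFileSet()
	file, err := parser.ParseFile(fset, filename, src, 0)
	if err != nil {
		return nil, err
	}
	var chunks []Chunk
	for _, decl := range file.Decls {
		fn, ok := decl.(*ast.FuncDecl)
		if !ok {
			continue // type, var, const and import declarations are skipped in this sketch
		}
		kind := "function"
		if fn.Recv != nil {
			kind = "method"
		}
		var params []string
		for _, field := range fn.Type.Params.List {
			for _, name := range field.Names {
				params = append(params, name.Name)
			}
		}
		start := fset.Position(fn.Pos())
		end := fset.Position(fn.End())
		chunks = append(chunks, Chunk{
			Name:      fn.Name.Name,
			Kind:      kind,
			Params:    params,
			StartLine: start.Line,
			EndLine:   end.Line,
			Code:      src[start.Offset:end.Offset],
		})
	}
	return chunks, nil
}

func main() {
	chunks, err := ChunkGoSource("auth.go", sample)
	if err != nil {
		panic(err)
	}
	for _, c := range chunks {
		fmt.Printf("%s %s params=%v lines=%d-%d\n", c.Kind, c.Name, c.Params, c.StartLine, c.EndLine)
	}
}
```

Because chunk boundaries come from the AST rather than a token count, a chunk never ends in the middle of a function body, and the name, kind, parameters, and line range come from the same parse.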

Includes visual example showing the difference in AI assistant responses.
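
The "exact + semantic search combined" point can be sketched as a simple scoring rule: cosine similarity over embeddings plus a fixed bonus when the query matches an entity name exactly. This is an assumption about how such a combination might work, not a description of RagCode's ranking. The `Entry` type, the `HybridSearch` function, the bonus value, and the two-dimensional vectors are all made up for illustration.

```go
package main

import (
	"fmt"
	"math"
	"sort"
	"strings"
)

// Entry is a hypothetical indexed code entity.
type Entry struct {
	Name   string
	Vector []float64 // embedding; the values below are invented
}

// demoEntries and demoQuery are made-up data for the example.
var demoEntries = []Entry{
	{Name: "UserController", Vector: []float64{0.2, 0.9}},
	{Name: "UserService", Vector: []float64{0.9, 0.3}},
	{Name: "AuthMiddleware", Vector: []float64{0.8, 0.5}},
}

var demoQuery = []float64{1.0, 0.4}

// cosine returns the cosine similarity of a and b,
// or 0 if the lengths differ or either vector is all zeros.
func cosine(a, b []float64) float64 {
	if len(a) != len(b) {
		return 0
	}
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// HybridSearch ranks entries by cosine similarity to queryVec and adds
// exactBonus to any entry whose name equals queryName (case-insensitive).
// It returns entry names, best match first.
func HybridSearch(entries []Entry, queryName string, queryVec []float64, exactBonus float64) []string {
	type scored struct {
		name  string
		score float64
	}
	results := make([]scored, 0, len(entries))
	for _, e := range entries {
		s := cosine(e.Vector, queryVec)
		if strings.EqualFold(e.Name, queryName) {
			s += exactBonus
		}
		results = append(results, scored{name: e.Name, score: s})
	}
	sort.SliceStable(results, func(i, j int) bool {
		return results[i].score > results[j].score
	})
	names := make([]string, len(results))
	for i, r := range results {
		names[i] = r.name
	}
	return names
}

func main() {
	fmt.Println("semantic only:", HybridSearch(demoEntries, "UserController", demoQuery, 0))
	fmt.Println("hybrid:       ", HybridSearch(demoEntries, "UserController", demoQuery, 1.0))
}
```

With the invented vectors, `UserController` is the least similar entry by embedding alone, so a purely semantic ranking puts it last; the exact-name bonus moves it to the top.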
diff --git a/README.md b/README.md
@@ -107,6 +107,43 @@ Without RagCode, AI assistants must:
 
 **Bottom Line:** RagCode gives you enterprise-grade AI code search with zero privacy concerns and zero ongoing costs.
 
+### 🧠 RagCode vs Generic RAG Systems
+
+Most RAG systems treat code as plain text: they split files into fixed-size chunks of N tokens with no regard for code structure. **RagCode is different.**
+
+| Aspect | Generic RAG | RagCode |
+|--------|-------------|---------|
+| **Chunking** | Arbitrary text splits (e.g. 512 tokens) | Semantic units (functions, classes, methods) |
+| **Context** | Random text fragments | Complete code entities with full context |
+| **Metadata** | Filename only | Name, type, parameters, return type, dependencies, line numbers |
+| **Relationships** | None | Tracks method calls (e.g. `UserController` uses `UserService`), inheritance chains, and imports |
+| **Search** | Semantic similarity only | Exact name match + semantic similarity combined |
+| **Language Awareness** | Treats all text the same | AST parsing per language (Go, PHP, Python) |
+| **Results** | May return partial functions | Returns complete code units (whole functions, classes, methods) |
+
+**Why this matters for AI assistants:**
+
+```
+❌ Generic RAG returns:
+"...the user authentication logic checks the token
+and validates against the database. If valid, it..."
+→ AI sees fragment, must guess the rest
+
+✅ RagCode returns:
+func AuthenticateUser(token string) (*User, error) {
+    // Complete function with all context
+    user, err := db.FindByToken(token)
+    if err != nil { return nil, ErrInvalidToken }
+    return user, nil
+}
+File: auth/middleware.go, Lines: 45-50
+Called by: LoginHandler, RefreshToken
+Depends on: db.FindByToken, ErrInvalidToken
+→ AI sees complete picture, responds accurately
+```
+
+**The result:** AI assistants using RagCode give more accurate answers because they see complete code units with their context, not random text fragments.
+
 ---
 
 ## ✨ Core Features & Capabilities
diff --git a/llms.txt b/llms.txt
@@ -11,6 +11,16 @@
 - **Zero cost** - no API fees, runs on local Ollama + Qdrant
 - **Privacy-first** - code never leaves your machine
 
+## RagCode vs Generic RAG
+
+Unlike generic RAG, which splits code into arbitrary text chunks, RagCode provides:
+- **Semantic chunking** - extracts complete functions, classes, methods (not random 512-token splits)
+- **Rich metadata** - name, type, parameters, return type, dependencies, line numbers
+- **Relationships** - knows class inheritance, method calls, imports
+- **AST parsing** - understands Go, PHP, Python syntax (not regex/heuristics)
+- **Exact + semantic search** - find "UserController" exactly, not just similar text
+- **Complete code units** - AI sees whole functions, classes, and methods, not fragments
+
 ## Compatibility
 
 - **IDEs:** Windsurf, Cursor, VS Code + GitHub Copilot, Claude Desktop, Antigravity