Skip to content

Commit 07df6e0

Browse files
committed
Update README numbers: 25 → 35 languages, fresh benchmark stats
- Feature line: list all 35 languages including Obj-C, Swift, Dart, Perl, Groovy, Erlang, R, SCSS, SQL, Dockerfile - Performance section: updated with Django (49K nodes) and Linux kernel stress test numbers from the v0.3.0 benchmark - Architecture section: 25 → 35 language specs - Token efficiency: reference 35 real-world repos instead of single project
1 parent a8abb46 commit 07df6e0

1 file changed

Lines changed: 7 additions & 6 deletions

File tree

README.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ Parses source code with [tree-sitter](https://tree-sitter.github.io/tree-sitter/
88

99
## Features
1010

11-
- **25 languages**: Python, Go, JavaScript, TypeScript, TSX, Rust, Java, C++, C#, C, PHP, Lua, Scala, Kotlin, Ruby, Bash, Zig, Elixir, Haskell, OCaml, HTML, CSS, YAML, TOML, HCL
11+
- **35 languages**: Python, Go, JavaScript, TypeScript, TSX, Rust, Java, C++, C#, C, PHP, Lua, Scala, Kotlin, Ruby, Bash, Zig, Elixir, Haskell, OCaml, Objective-C, Swift, Dart, Perl, Groovy, Erlang, R, HTML, CSS, SCSS, YAML, TOML, HCL, SQL, Dockerfile
1212
- **One-command install**: `codebase-memory-mcp install` auto-detects Claude Code and Codex CLI, registers the MCP server, and installs task-specific skills
1313
- **Self-update**: `codebase-memory-mcp update` downloads the latest release, verifies checksums, and atomically swaps the binary
1414
- **Task-specific skills**: 4 skills (exploring, tracing, quality, reference) that prescribe exact tool sequences — Claude Code automatically uses graph tools instead of defaulting to grep
@@ -51,20 +51,21 @@ Claude Code formats and explains the results.
5151

5252
**Why no built-in LLM?** Other code graph tools embed an LLM to translate natural language into graph queries. This means extra API keys, extra cost per query, and another model to configure. With MCP, the AI assistant you're already talking to *is* the query translator — no duplication needed.
5353

54-
**Token efficiency**: Compared to having an AI agent grep through your codebase file by file, graph queries return precise results in a single tool call. In benchmarks on a multi-service project (2,348 nodes, 3,853 edges), five structural queries consumed ~3,400 tokens via codebase-memory-mcp versus ~412,000 tokens via file-by-file exploration — a **99.2% reduction**.
54+
**Token efficiency**: Compared to having an AI agent grep through your codebase file by file, graph queries return precise results in a single tool call. In benchmarks across 35 real-world repos (78 to 49K nodes), five structural queries consumed ~3,400 tokens via codebase-memory-mcp versus ~412,000 tokens via file-by-file exploration — a **99.2% reduction**.
5555

5656
## Performance
5757

58-
Benchmarked on a multi-service project (~37K nodes, ~35K edges) with Go 1.26 on Apple Silicon:
58+
Benchmarked on Apple M3 Pro, macOS Darwin 25.3.0:
5959

6060
| Operation | Time | Notes |
6161
|-----------|------|-------|
62-
| Fresh index (full codebase) | ~6.3s | 37K nodes, 35K edges |
62+
| Fresh index (full codebase) | ~6s | 49K nodes, 196K edges (Django) |
6363
| Incremental reindex | ~1.2s | Content-hash skip for unchanged files |
6464
| Cypher query (relationship traversal) | <1ms | Up to 600x faster than v0.1.3 for pattern queries |
6565
| Name search (regex) | <10ms | SQL LIKE pre-filtering narrows before Go regex |
6666
| Dead code detection | ~150ms | Full graph scan with degree filtering |
67-
| Trace call path (3 hops) | <10ms | BFS traversal with batch degree counting |
67+
| Trace call path (depth=5) | <10ms | BFS traversal; 129K-char result on Linux kernel with zero timeouts |
68+
| Linux kernel stress test | 20K nodes | `drivers/net/ethernet/intel/` — 387 files, 67K edges |
6869

6970
**Token efficiency**: Five structural queries consumed ~3,400 tokens via codebase-memory-mcp versus ~412,000 tokens via file-by-file grep exploration — a **99.2% reduction**.
7071

@@ -594,7 +595,7 @@ See [`BENCHMARK.md`](BENCHMARK.md) for the full 35-language benchmark with per-q
594595
cmd/codebase-memory-mcp/ Entry point (MCP stdio server + CLI mode + install/update commands)
595596
internal/
596597
store/ SQLite graph storage (nodes, edges, traversal, search)
597-
lang/ Language specs (25 languages, tree-sitter node types)
598+
lang/ Language specs (35 languages, tree-sitter node types)
598599
parser/ Tree-sitter grammar loading and AST parsing
599600
pipeline/ 4-pass indexing (structure -> definitions -> calls -> HTTP links)
600601
httplink/ Cross-service HTTP route/call-site matching

0 commit comments

Comments
 (0)