Competitive Analysis — Code Graph / Code Intelligence Tools

Date: 2026-02-25 Scope: 137+ code analysis tools evaluated, 82+ ranked against @optave/codegraph

Overall Ranking

Ranked by weighted score across 6 dimensions (each 1–5):

Tier 1: Direct Competitors (score ≥ 3.0)

#	Score	Project	Stars	Lang	License	Summary
1	4.5	joernio/joern	2,956	Scala	Apache-2.0	Full CPG analysis platform for vulnerability discovery, Scala query DSL, multi-language, daily releases
2	4.5	postrv/narsil-mcp	101	Rust	Apache-2.0	90 MCP tools, 32 languages, taint analysis, SBOM, dead code, neural semantic search, single ~30MB binary
3	4.5	vitali87/code-graph-rag	1,916	Python	MIT	Graph RAG with Memgraph, multi-provider AI, code editing, semantic search, MCP
4	4.2	Fraunhofer-AISEC/cpg	411	Kotlin	Apache-2.0	CPG library for 8+ languages with MCP module, Neo4j visualization, formal specs, LLVM IR support
5	4.2	seatedro/glimpse	349	Rust	MIT	Clipboard-first codebase-to-LLM tool with call graphs, token counting, LSP resolution
6	4.0	SimplyLiz/CodeMCP (CKB)	59	Go	Custom	SCIP-based indexing, compound operations (83% token savings), CODEOWNERS, secret scanning
7	4.0	abhigyanpatwari/GitNexus	—	TS/JS	PolyForm NC	Knowledge graph with precomputed structural intelligence, 7 MCP tools, hybrid BM25+semantic search, clustering, process tracing, KuzuDB. Non-commercial only
8	4.0	@optave/codegraph	—	JS/Rust	Apache-2.0	Sub-second incremental rebuilds, dual engine (native Rust + WASM), 11 languages, 18-tool MCP, qualified call resolution, `context`/`explain`/`where` AI-optimized commands, structure/hotspot analysis, node role classification (entry/core/utility/adapter/dead/leaf), dead code detection, zero-cost core + optional LLM enhancement
9	3.9	harshkedia177/axon	421	Python	MIT	11-phase pipeline, KuzuDB, Leiden community detection, dead code, change coupling, 7 MCP tools
10	3.8	anrgct/autodev-codebase	111	TypeScript	None	40+ languages, 7 embedding providers, Cytoscape.js visualization, LLM reranking
11	3.8	ShiftLeftSecurity/codepropertygraph	564	Scala	Apache-2.0	CPG specification + Tinkergraph library, Scala query DSL, protobuf serialization (Joern foundation)
12	3.8	Jakedismo/codegraph-rust	142	Rust	None	100% Rust GraphRAG, SurrealDB, LSP-powered dataflow analysis, architecture boundary enforcement
13	3.7	Anandb71/arbor	85	Rust	MIT	Native GUI, confidence scoring, architectural role classification, fuzzy search, MCP
14	3.7	JudiniLabs/mcp-code-graph	380	JavaScript	MIT	Cloud-hosted MCP server by CodeGPT, semantic search, dependency links (requires account)
15	3.7	entrepeneur4lyf/code-graph-mcp	80	Python	MIT	ast-grep for 25+ languages, complexity metrics, code smells, circular dependency detection
16	3.7	cs-au-dk/jelly	417	TypeScript	BSD-3	Academic-grade JS/TS points-to analysis, call graphs, vulnerability exposure, 5 published papers
17	3.5	er77/code-graph-rag-mcp	89	TypeScript	MIT	26 MCP methods, 11 languages, tree-sitter, semantic search, hotspot analysis, clone detection
18	3.5	MikeRecognex/mcp-codebase-index	25	Python	AGPL-3.0	18 MCP tools, zero runtime deps, auto-incremental reindexing via git diff
19	3.5	nahisaho/CodeGraphMCPServer	7	Python	MIT	GraphRAG with Louvain community detection, 16 languages, 14 MCP tools, 334 tests
20	3.5	colbymchenry/codegraph	165	TypeScript	MIT	tree-sitter + SQLite + MCP, Claude Code token reduction benchmarks, npx installer
21	3.5	dundalek/stratify	102	Clojure	MIT	Multi-backend extraction (LSP/SCIP/Joern), 10 languages, DGML/CodeCharta output, architecture linting
22	3.5	kraklabs/cie	9	Go	AGPL-3.0	Code Intelligence Engine: 20+ MCP tools, tree-sitter, semantic search (Ollama), Homebrew, single Go binary
23	3.4	Durafen/Claude-code-memory	72	Python	None	Memory Guard quality gate, persistent codebase memory, Voyage AI + Qdrant
24	3.3	NeuralRays/codexray	2	TypeScript	MIT	16 MCP tools, TF-IDF semantic search (~50MB), dead code, complexity, path finding
25	3.3	DucPhamNgoc08/CodeVisualizer	475	TypeScript	MIT	VS Code extension, tree-sitter WASM, flowcharts + dependency graphs, 5 AI providers, 9 themes
26	3.3	helabenkhalfallah/code-health-meter	34	JavaScript	MIT	Formal health metrics (MI, CC, Louvain modularity), published in ACM TOSEM 2025
27	3.3	JohT/code-graph-analysis-pipeline	27	Cypher	GPL-3.0	200+ CSV reports, ML anomaly detection, Leiden/HashGNN, jQAssistant + Neo4j for Java
28	3.3	Lekssays/codebadger	43	Python	GPL-3.0	Containerized MCP server using Joern CPG, 12+ languages
29	3.2	al1-nasir/codegraph-cli	11	Python	MIT	CrewAI multi-agent system, 6 LLM providers, browser explorer, DOCX export
30	3.1	anasdayeh/claude-context-local	0	Python	None	100% local, Merkle DAG incremental indexing, sharded FAISS, hybrid BM25+vector, GPU accel
31	3.0	Vasu014/loregrep	12	Rust	Apache-2.0	In-memory index library, Rust + Python bindings, AI-tool-ready schemas
32	3.0	xnuinside/codegraph	438	Python	MIT	Python-only interactive HTML dependency diagrams with zoom/pan/search
33	3.0	Adrninistrator/java-all-call-graph	551	Java	Apache-2.0	Complete Java bytecode call graphs, Spring/MyBatis-aware, SQL-queryable DB
34	3.0	Technologicat/pyan	395	Python	GPL-2.0	Python 3 call graph generator, module import analysis, cycle detection, interactive HTML
35	3.0	GaloisInc/MATE	194	Python	BSD-3	DARPA-funded interactive CPG-based bug hunting for C/C++ via LLVM
36	3.0	clouditor/cloud-property-graph	28	Kotlin	Apache-2.0	Connects code property graphs with cloud runtime security assessment

Tier 2: Niche & Single-Language Tools (score 2.0–2.9)

#	Score	Project	Stars	Lang	License	Summary
37	2.9	rahulvgmail/CodeInteliMCP	8	Python	None	DuckDB + ChromaDB (zero Docker), multi-repo, lightweight embedded DBs
38	2.8	paul-gauthier/aider	41,664	Python	Apache-2.0	AI pair programming CLI; tree-sitter repo map with PageRank-style graph ranking for LLM context selection, 100+ languages, multi-provider LLM support, git-integrated auto-commits
39	2.8	scottrogowski/code2flow	4,528	Python	MIT	Call graphs for Python/JS/Ruby/PHP via AST, DOT output, 100% test coverage
40	2.8	ysk8hori/typescript-graph	200	TypeScript	None	TypeScript file-level dependency Mermaid diagrams, code metrics (MI, CC), watch mode
41	2.8	nuanced-dev/nuanced-py	126	Python	MIT	Python call graph enrichment designed for AI agent consumption
42	2.8	Bikach/codeGraph	6	TypeScript	MIT	Neo4j graph, Claude Code slash commands, Kotlin support, 40-50% cost reduction
43	2.8	ChrisRoyse/CodeGraph	65	TypeScript	None	Neo4j + MCP, multi-language, framework detection (React, Tailwind, Supabase)
44	2.8	Symbolk/Code2Graph	48	Java	None	Multilingual code → language-agnostic graph representation
45	2.7	yumeiriowl/repo-graphrag-mcp	3	Python	MIT	LightRAG + tree-sitter, entity merge (code ↔ docs), implementation planning tool
46	2.7	davidfraser/pyan	712	Python	GPL-2.0	Python call graph generator (stable fork), DOT/SVG/HTML output, Sphinx integration
47	2.7	mamuz/PhpDependencyAnalysis	572	PHP	MIT	PHP dependency graphs, cycle detection, architecture verification against defined layers
48	2.7	faraazahmad/graphsense	35	TypeScript	MIT	MCP server providing code intelligence via static analysis
49	2.7	JonnoC/CodeRAG	14	TypeScript	MIT	Enterprise code intelligence with CK metrics, Neo4j, 23 analysis tools, MCP server
50	2.6	0xjcf/MCP_CodeAnalysis	7	Python/TS	None	Stateful tools (XState), Redis sessions, socio-technical analysis, dual language impl
51	2.5	koknat/callGraph	325	Perl	GPL-3.0	Multi-language (22+) call graph generator via regex, GraphViz output
52	2.5	RaheesAhmed/code-context-mcp	0	Python	MIT	Security pattern detection, auto architecture diagrams, code flow tracing
53	2.5	league1991/CodeAtlasVsix	265	C#	GPL-2.0	Visual Studio plugin, Doxygen-based call graph navigation (VS 2010-2015 era)
54	2.5	beicause/call-graph	105	TypeScript	Apache-2.0	VS Code extension generating call graphs via LSP call hierarchy API
55	2.5	Thibault-Knobloch/codebase-intelligence	44	Python	None	Code indexing + call graph + vector DB + natural language queries (requires OpenAI)
56	2.5	darkmacheken/wasmati	31	C++	Apache-2.0	CPG infrastructure for scanning vulnerabilities in WebAssembly
57	2.5	sutragraph/sutracli	28	Python	GPL-3.0	AI-powered cross-repo dependency graphs for coding agents
58	2.5	julianjensen/ast-flow-graph	69	JavaScript	Other	JavaScript control flow graphs from AST analysis
59	2.5	yoanbernabeu/grepai-skills	14	—	MIT	27 AI agent skills for semantic code search and call graph analysis
60	2.4	shantham/codegraph	0	TypeScript	MIT	Polished `npx` one-command installer, sqlite-vss, 7 MCP tools
61	2.3	ozyyshr/RepoGraph	251	Python	Apache-2.0	SWE-bench code graph research (ctags + networkx for LLM context)
62	2.3	emad-elsaid/rubrowser	644	Ruby	MIT	Ruby-only interactive D3 force-directed dependency graph
63	2.3	Chentai-Kao/call-graph-plugin	87	Kotlin	None	IntelliJ plugin for visualizing call graphs in IDE
64	2.3	ehabterra/apispec	72	Go	Apache-2.0	OpenAPI 3.1 spec generator from Go code via call graph analysis
65	2.3	huoyo/ko-time	61	Java	LGPL-2.1	Spring Boot call graph with runtime durations
66	2.3	Fraunhofer-AISEC/codyze	91	Kotlin	None	CPG-based analyzer for cryptographic API misuse (archived, merged into cpg repo)
67	2.3	CartographAI/mcp-server-codegraph	17	JavaScript	MIT	Lightweight MCP code graph (3 tools only, Python/JS/Rust)
68	2.3	YounesBensafia/DevLens	21	Python	None	Repo scanner with AI summaries, dead code detection (dep graph not yet implemented)
69	2.3	0xd219b/codegraph	0	Rust	None	Pure Rust, HTTP server mode, Java + Go support
70	2.3	aryx/codegraph	6	OCaml	Other	Multi-language source code dependency visualizer (the original "codegraph" name)
71	2.2	jmarkowski/codeviz	144	Python	MIT	C/C++ `#include` header dependency graph visualization
72	2.2	juanallo/vscode-dependency-cruiser	76	JavaScript	MIT	VS Code wrapper for dependency-cruiser (JS/TS)
73	2.2	hidva/as2cfg	63	Rust	GPL-3.0	Intel assembly → control flow graph
74	2.2	microsoft/cmd-call-graph	55	Python	MIT	Call graphs for Windows CMD batch files
75	2.2	siggy/gographs	52	Go	MIT	Go package dependency graph generator
76	2.2	henryhale/depgraph	33	Go	MIT	Go-focused codebase dependency analysis
77	2.2	2015xli/clangd-graph-rag	28	Python	Apache-2.0	C/C++ Neo4j GraphRAG via clangd (scales to Linux kernel)
78	2.1	floydw1234/badger-graph	0	Python	None	Dgraph backend (Docker), C struct field access tracking
79	2.0	crubier/code-to-graph	382	JavaScript	None	JS code → Mermaid flowchart (single-function, web demo)
80	2.0	khushil/code-graph-rag	0	Python	MIT	Fork of vitali87/code-graph-rag with no modifications
81	2.0	FalkorDB/code-graph-backend	26	Python	MIT	FalkorDB (Redis-based graph) code analysis demo
82	2.0	jillesvangurp/spring-depend	46	Java	MIT	Spring bean dependency graph extraction
83	2.0	ivan-m/SourceGraph	27	Haskell	GPL-3.0	Haskell graph-theoretic code analysis (last updated 2022)
84	2.0	brutski/go-code-graph	13	Go	MIT	Go codebase analyzer with MCP integration

Tier 3: Minimal or Inactive (score < 2.0)

Score	Project	Stars	Summary
1.8	m3et/CodeRAG	0	Iterative RAG with self-reflection, ChromaDB, Azure OpenAI dependent
1.8	getyourguide/spmgraph	239	Swift Package Manager dependency graph + architecture linting
1.8	mvidner/code-explorer	53	Ruby call graph and class dependency browser
1.8	ytsutano/jitana	41	Android DEX static+dynamic hybrid analysis
1.8	ShiftLeftSecurity/fuzzyc2cpg	37	[ARCHIVED] Fuzzy C/C++ parser to CPG (Joern ecosystem)
1.8	mufasadb/code-grapher	10	MCP code graph server (early stage)
1.8	dtsbourg/codegraph-fmt	7	Annotated AST graph representations from Python
1.8	mloncode/codegraph	5	Git/UAST graph experiments
1.7	ashishb/python_dep_generator	22	Python dependency graph generator
1.7	LaurEars/codegrapher	15	Python call graph visualizer
1.7	AdilZouitine/ouakha.rs	7	LLM-based Rust code analysis for suspicious code
1.7	ensozos/geneci	6	UML diagrams and call graphs from source
1.7	spullara/codegraph	5	Java JARs → Neo4j loader
1.5	z7zmey/codegraph	10	PHP code visualization (last updated 2020)
1.5	marcusva/cflow	10	C/assembler call graph generator
1.5	beacoder/call-graph	5	Emacs-based C/C++ call graph

Scoring Breakdown (Tier 1)

#	Project	Features	Analysis Depth	Deploy Simplicity	Lang Support	Code Quality	Community
1	joern	5	5	3	4	5	5
2	narsil-mcp	5	5	5	5	4	3
3	code-graph-rag	5	4	3	4	4	5
4	cpg	5	5	2	5	5	3
5	glimpse	4	4	5	3	5	5
6	CKB	5	5	4	3	4	3
7	GitNexus	5	5	4	4	4	2
8	codegraph (us)	5	4	5	4	4	2
9	axon	5	5	4	2	4	2
10	autodev-codebase	5	3	3	5	3	4
11	codepropertygraph	4	5	2	4	5	3
12	codegraph-rust	5	5	2	4	4	3
13	arbor	4	4	5	4	5	3
14	mcp-code-graph	4	3	4	4	3	4
15	code-graph-mcp	4	4	4	5	3	2
16	jelly	4	5	4	1	5	3
17	code-graph-rag-mcp	5	4	3	4	3	2
18	mcp-codebase-index	4	3	5	3	4	2
19	CodeGraphMCPServer	4	4	4	5	3	1
20	colbymchenry/codegraph	4	3	5	3	3	3
21	stratify	4	4	2	5	4	2
22	cie	5	4	4	3	4	1
23	Claude-code-memory	4	3	3	3	4	3
24	codexray	5	4	4	4	3	1
25	CodeVisualizer	4	3	5	3	3	2
26	code-health-meter	3	5	5	1	4	2
27	code-graph-analysis-pipeline	5	5	1	2	5	2
28	codebadger	4	4	3	5	3	1
29	codegraph-cli	5	3	3	2	3	2
30	claude-context-local	4	3	3	4	4	1
31	loregrep	3	3	4	3	5	2
32	xnuinside/codegraph	3	2	5	1	3	4
33	java-all-call-graph	4	4	3	1	3	3
34	pyan	3	3	5	1	4	2
35	MATE	3	5	1	1	3	2
36	cloud-property-graph	4	4	2	2	4	2

Scoring criteria:

Features (1-5): breadth of tools, MCP integration, search, visualization, export
Analysis Depth (1-5): how deep the code analysis goes (dead code, complexity, flow tracing, coupling)
Deploy Simplicity (1-5): ease of setup — zero Docker = 5, requires Docker = 3, complex multi-service = 1
Lang Support (1-5): number of well-supported programming languages
Code Quality (1-5): architecture, performance characteristics, engineering rigor
Community (1-5): stars, contributors, activity, documentation quality

Where Codegraph Wins

Strength	Details
Always-fresh graph (incremental rebuilds)	Three-tier change detection (journal → mtime+size → hash) means only changed files are re-parsed. Change 1 file in a 3,000-file project → rebuild in under a second. No other tool in this space offers this. Competitors re-index everything from scratch — making them unusable in commit hooks, watch mode, or agent-driven loops
Qualified call resolution	Import-aware resolution distinguishes method calls (`obj.method()`) from standalone function calls, filters 28+ built-in receivers (`console`, `Math`, `JSON`, `Array`, `Promise`, etc.), deduplicates edges, and respects import scope. A call to `foo()` only resolves to functions actually imported or in-scope — eliminating the false positives that plague tree-sitter-based tools. Confidence scoring (1.0 → 0.5) on every edge lets agents trust the graph
AI-optimized compound commands	`context` returns source + deps + callers + signature + related tests for a function in one call. `explain` gives structural summaries of files (public API, internals, data flow) or functions without reading the source. These save AI agents 50-80% of the token budget they'd otherwise spend navigating code. No competitor offers purpose-built compound context commands
Zero-cost core, LLM-enhanced when you choose	The full graph pipeline (parse, resolve, query, impact analysis) runs with no API keys, no cloud, no cost. LLM features (richer embeddings, semantic search) are an optional layer on top — using whichever provider the user already works with. Competitors either require cloud APIs for core features (code-graph-rag, autodev-codebase, mcp-code-graph) or offer no AI enhancement at all (CKB, axon). Nobody else offers both modes in one tool
Data goes only where you send it	Your code reaches exactly one place: the AI agent you already chose (via MCP). No additional third-party services, no surprise cloud calls. Competitors like code-graph-rag, autodev-codebase, mcp-code-graph, and Claude-code-memory send your code to additional AI providers beyond the agent you're using
Dual engine architecture	Only project with native Rust (napi-rs) + automatic WASM fallback. Others are pure Rust (narsil-mcp, codegraph-rust) OR pure JS/Python — never both
Standalone CLI + MCP	Full CLI experience (`context`, `explain`, `where`, `fn`, `diff-impact`, `map`, `deps`, `search`, `structure`, `hotspots`, `roles`) alongside 18-tool MCP server. Many competitors are MCP-only (narsil-mcp, code-graph-mcp, CodeGraphMCPServer) with no standalone query interface
Single-repo MCP isolation	Security-conscious default: tools have no `repo` property unless `--multi-repo` is explicitly enabled. Most competitors default to exposing everything
Zero-dependency deployment	`npm install` and done. No Docker, no external databases, no Python, no SCIP toolchains, no JVM. Published platform-specific binaries (`@optave/codegraph-{platform}-{arch}`) resolve automatically. Joern requires JDK 21, cpg requires Gradle + language-specific deps, codegraph-rust requires SurrealDB + LSP servers
Structure & quality analysis	`structure` shows directory cohesion scores, `hotspots` finds files with extreme fan-in/fan-out/density, `stats` includes a graph quality score (0-100) with false-positive warnings. These give agents architectural awareness without requiring external tools
Node role classification	Every symbol is auto-tagged as `entry`/`core`/`utility`/`adapter`/`dead`/`leaf` based on fan-in/fan-out patterns with adaptive median thresholds. Agents instantly know a function's architectural role without reading surrounding code. Inspired by arbor's role classification — but we compute roles automatically during graph build rather than requiring manual tagging, and we surface roles across all query commands (`where`, `explain`, `context`, `stats`, `list-functions`). Dead code detection comes free as a byproduct
Callback pattern extraction	Extracts symbols from Commander `.command().action()` (as `command:build`), Express route handlers (as `route:GET /api/users`), and event emitter listeners (as `event:data`). No competitor extracts symbols from framework callback patterns

Where Codegraph Loses

vs joern (#1, 2,956 stars)

Full Code Property Graph: AST + CFG + PDG combined for deep vulnerability analysis; our tree-sitter extraction captures structure but not control/data flow
Scala query DSL: purpose-built query language for arbitrary graph traversals vs our fixed SQL queries
Binary analysis: Ghidra frontend can analyze compiled binaries — we're source-only
Enterprise backing: ShiftLeft/Fraunhofer support, daily automated releases, Discord community, professional documentation at joern.io
Community: 2,956 stars, 389 forks — massive traction

vs narsil-mcp (#2, 101 stars)

Feature breadth: 90 MCP tools vs our 17; covers taint analysis, SBOM, license compliance, control flow graphs, data flow analysis
Language count: 32 languages (including Verilog, Fortran, PowerShell, Nix) vs our 11
Security analysis: vulnerability scanning with OWASP/CWE coverage — we have no security features
Dead code detection: built-in — (Gap closed: our roles --role dead now surfaces unreferenced non-exported symbols)
Single-binary deployment: ~30MB Rust binary via brew/scoop/cargo/npm — as easy as ours

vs code-graph-rag (#3, 1,916 stars)

Graph query expressiveness: Memgraph + Cypher enables arbitrary graph traversals; our SQL queries are more rigid
AI-powered code editing: they can surgically edit functions via AST targeting with visual diffs
Provider flexibility: they support Gemini/OpenAI/Claude/Ollama and can mix providers per task
Community: 1,916 stars — orders of magnitude more traction

vs cpg (#4, 411 stars)

Formal CPG specification: academic-grade graph representation (AST + CFG + PDG + DFG) with published specs
MCP module: built-in MCP support now, matching our integration
LLVM IR support: extends language coverage to any LLVM-compiled language (Rust, Swift, etc.)
Type inference: can analyze incomplete/partial code — our tree-sitter requires syntactically valid input

vs glimpse (#5, 349 stars)

LLM workflow optimization: clipboard-first output + token counting + XML output mode — purpose-built for "code → LLM context"
LSP-based call resolution: compiler-grade accuracy vs our tree-sitter heuristic approach
Web content processing: can fetch URLs and convert HTML to markdown for context

vs CKB (#6, 59 stars)

Indexing accuracy: SCIP provides compiler-grade cross-file references (type-aware), fundamentally more accurate than tree-sitter for supported languages
Compound operations: explore/understand/prepareChange batch multiple queries into one call — 83% token reduction. (Gap narrowed: our context and explain commands now serve the same purpose, returning full function context or file summaries in one call)
CODEOWNERS + secret scanning: enterprise features we lack entirely

vs GitNexus (#7)

Precomputed structural intelligence: 6-phase pipeline (structure, parsing, resolution, clustering, processes, search) precomputes everything at index time — queries return complete context in a single call. Our queries traverse the graph at query time
Clustering and process tracing: Leiden-style community detection groups related symbols into functional clusters; execution flow tracing from entry points. We have neither
Hybrid search: BM25 + semantic + RRF with process-grouped results — our semantic search lacks the BM25/process grouping layer
Multi-file coordinated rename: validated against graph structure and text — we have no refactoring tools
Auto-generated context files: LLM-powered wiki and AGENTS.md/CLAUDE.md generation from the knowledge graph
Tradeoff: Full pipeline re-run on changes (no incremental builds), KuzuDB graph DB (heavier than SQLite), browser mode limited to ~5,000 files

vs axon (#9, 29 stars)

Analysis depth: their 11-phase pipeline includes community detection (Leiden), execution flow tracing, git change coupling, dead code detection — (Gap narrowed: we now have dead code detection via node role classification)
Graph database: KuzuDB with native Cypher is more expressive for complex graph queries than our SQLite
Branch structural diff: compares code structure between branches using git worktrees

vs codegraph-rust (#12, 142 stars)

LSP-powered analysis: compiler-grade cross-file references via rust-analyzer, pyright, gopls vs our tree-sitter heuristics
Dataflow edges: defines/uses/flows_to/returns/mutates relationships we don't capture
Architecture boundary enforcement: configurable rules for detecting violations — we have no architectural awareness
Tiered indexing: fast/balanced/full modes for different use cases — we have one mode

vs jelly (#16, 417 stars)

Points-to analysis: flow-insensitive analysis with access paths for JS/TS — fundamentally more precise than our tree-sitter-based call resolution
Academic rigor: 5 published papers backing the methodology (Aarhus University)
Vulnerability exposure analysis: library usage pattern matching specific to the JS/TS ecosystem

vs aider (#38, 41,664 stars)

Different product category: Aider is an AI pair programming CLI, not a code graph tool — but its tree-sitter repo map with PageRank-style graph ranking is a lightweight alternative to our full graph for LLM context selection
Massive community: 41,664 stars, 3,984 forks — orders of magnitude more traction than any tool in this space. Aider is the category leader for AI-assisted coding in the terminal
100+ languages: tree-sitter parsing covers far more languages than our 11, though only for identifier extraction (not full symbol/call resolution)
Multi-provider LLM: works with Claude, GPT-4, Gemini, DeepSeek, Ollama, and virtually any LLM out of the box
Built-in code editing: Aider's core loop is "understand code → edit code → commit." We provide the understanding layer but don't edit
Where we win: Aider's repo map is shallow — file-level dependency graph with identifier ranking, no function-level call resolution, no impact analysis, no dead code detection, no complexity metrics, no MCP server, no standalone queryable graph. It answers "what's relevant?" but not "what breaks if I change this?" Our graph is deeper and persistent; Aider rebuilds its map per-request

vs colbymchenry/codegraph (#20, 165 stars)

No role classification: they lack node role classification or dead code detection — we now have both
Naming competitor: same name, same tech stack (tree-sitter + SQLite + MCP + Node.js) — marketplace confusion risk
Published benchmarks: 67% fewer tool calls and measurable Claude Code token reduction — compelling marketing angle we lack. (Gap narrowed: our context and explain compound commands now provide similar token savings by batching multiple queries into one call)
One-liner setup: npx @colbymchenry/codegraph with interactive installer auto-configures Claude Code

Features to Adopt — Priority Roadmap

Tier 1: High impact, low effort

Feature	Inspired by	Why	Status
~~Dead code detection~~	narsil-mcp, axon, codexray, CKB	~~We have the graph — find nodes with zero incoming edges (minus entry points/exports). Agents constantly ask "is this used?"~~	DONE — Delivered via node classification. `roles --role dead` lists all unreferenced, non-exported symbols
~~Fuzzy symbol search~~	arbor	~~Add Levenshtein/Jaro-Winkler to `fn` command. Currently requires exact match~~	DONE — `fn` now has relevance scoring (exact > prefix > word-boundary > substring) with fan-in tiebreaker, plus `--file` and `--kind` filters
~~Expose confidence scores~~	arbor	~~Already computed internally in import resolution — just surface them~~	DONE — confidence scores stored on every call edge, surfaced in `stats` graph quality score
Shortest path A→B	codexray, arbor	BFS on existing edges table. We have `fn` for single chains but no A→B pathfinding	TODO

Tier 2: High impact, medium effort

Feature	Inspired by	Why	Status
Optional LLM provider integration	code-graph-rag, autodev-codebase	Bring-your-own provider (OpenAI, etc.) for richer embeddings and AI-powered search. Enhancement layer only — core graph never depends on it. No other tool offers both zero-cost local and LLM-enhanced modes in one package	TODO
~~Compound MCP tools~~	CKB, colbymchenry/codegraph	~~`explore`/`understand` meta-tools that batch deps + fn + map into single responses~~	DONE — `context` returns source + deps + callers + signature + tests in one call; `explain` returns structural summaries of files or functions
Token counting on responses	glimpse, arbor	tiktoken-based counts so agents know context budget consumed	TODO
~~Node classification~~	arbor	~~Auto-tag Entry Point / Core / Utility / Adapter from in-degree/out-degree patterns~~	DONE — `classifyNodeRoles()` tags every symbol as `entry`/`core`/`utility`/`adapter`/`dead`/`leaf`. New `roles` CLI command, `node_roles` MCP tool (18 tools), `--role`/`--file` filters. Roles surfaced in `where`/`explain`/`context`/`stats`/`list-functions`
TF-IDF lightweight search	codexray	SQLite FTS5 + TF-IDF as a middle tier (~50MB) between "no search" and full transformers (~500MB)	TODO
OWASP/CWE pattern detection	narsil-mcp, CKB	Security pattern scanning on the existing AST — hardcoded secrets, SQL injection patterns, XSS	TODO
Formal code health metrics	code-health-meter	Cyclomatic complexity, Maintainability Index, Halstead metrics per function — we already parse the AST	TODO

Tier 3: High impact, high effort

Feature	Inspired by	Why	Status
Interactive HTML visualization	autodev-codebase, CodeVisualizer	`codegraph viz` → opens interactive vis.js/Cytoscape.js graph in browser	TODO
Git change coupling	axon	Analyze git history for files that always change together — enhances `diff-impact`	TODO
Community detection	axon, GitNexus, CodeGraphMCPServer	Leiden/Louvain algorithm to discover natural module boundaries vs actual file organization	TODO
Execution flow tracing	axon, GitNexus, code-context-mcp	Framework-aware entry point detection + BFS flow tracing	TODO
Dataflow analysis	codegraph-rust	Define/use chains and flows_to/returns/mutates edges — major analysis depth increase	TODO
Architecture boundary rules	codegraph-rust, stratify	User-defined rules for allowed/forbidden dependencies between modules	TODO

Paid Solutions

Sourcegraph (sourcegraph.com)

What it is: Enterprise code intelligence platform. Cloud-hosted and self-hosted. Proprietary, paid per user (free tier for individuals).

Core features:

Feature	Description	Codegraph equivalent	Gap
Code Search	Full-text regex search across all repos, branches, commits, and diffs. RE2 engine with boolean operators (`AND`/`OR`/`NOT`), compound filters (`repo:`, `file:`, `lang:`, `author:`, `before:`/`after:`), output shaping (`select:repo`, `select:symbol.function`, `select:file.owners`), and `rev:at.time()` for historical point-in-time search. Search Contexts define reusable named scopes	`codegraph search` (hybrid BM25+semantic), `where`, `list-functions` with `-f`/`-k`/`-T` filters	Partial — we have semantic+keyword search but lack boolean compound queries, diff/commit content search, output reshaping, and named search contexts. Backlog IDs 75, 79
Deep Search	Agentic natural-language search: an AI agent iteratively uses Code Search + Code Navigation tools, refining its understanding each loop until confident. Returns markdown answers with source citations. Conversational follow-ups	`codegraph search` (semantic mode) finds conceptual matches but returns raw results, not synthesized answers	Yes — we do semantic search but not agentic iterative search with synthesized answers. This is an LLM-layer feature — could be built on top of our MCP tools by an orchestrating agent rather than built into codegraph itself
Code Navigation	Go-to-definition, find-references, find-implementations across repositories. Two tiers: search-based (heuristic, instant) and precise (SCIP compiler-accurate indexers). Popover type signatures and docs inline	`codegraph where` (search-based), `codegraph query` (callers/callees), `codegraph context` (full context). No find-implementations	Partial — we have search-based navigation and caller/callee chains. We lack interface→implementation tracking (backlog ID 74) and cross-repo reference resolution (backlog ID 78)
Code Monitoring	Persistent watch rules on `type:diff`/`type:commit` queries. Fires email, Slack webhook, or custom HTTP webhook when new commits match. No limit on monitor count or monitored code volume	`codegraph build --watch` (incremental rebuild), `codegraph check --staged` (CI predicates)	Partial — we have watch-mode rebuilds and CI predicates but no persistent query-based commit monitors with notification actions. Backlog ID 76
Code Ownership	CODEOWNERS as a first-class search dimension: `file:has.owner()`, `select:file.owners`, owner-scoped queries. Resolves CODEOWNERS entries against user profiles	`codegraph owners` with `--owner`, `--boundary` filters. Integrated into `diff-impact` (affected owners + suggested reviewers). `code_owners` MCP tool	No gap — feature parity. We parse CODEOWNERS, match patterns, integrate into impact analysis, and expose via CLI + MCP. They have richer owner-as-search-filter syntax; our backlog ID 79 (advanced query language) would close this
Code Insights	Track any search query as a time-series metric on dashboards. Automatic historical backfill from git history — years of data immediately. Migration progress, tech debt trends, codebase composition over time	`codegraph stats` (point-in-time), `codegraph snapshot` (manual checkpoints)	Yes — we have point-in-time metrics and manual snapshots but no automated historical trend tracking. Backlog ID 77
Batch Changes	Declarative YAML spec → automated code changes across hundreds of repos. Creates PRs on all affected repos, tracks merge status, CI checks, review approvals. Burndown charts for migration progress	None — codegraph is read-only by design (Foundation P8: we don't edit code or make decisions)	By design — we're a graph query tool, not a code modification tool. This is out of scope per Foundation principles
CLI (`src`)	Terminal search, batch change creation, SBOM generation, repo/user/team admin, code intelligence ops, CODEOWNERS management	`codegraph` CLI with 25+ commands, MCP server	Partial — our CLI is richer for graph queries; theirs is richer for admin/batch/SBOM operations. Different focus areas

Where Sourcegraph wins over codegraph:

Advantage	Details
Scale	Designed for 100,000+ repo enterprises. Indexed search across all repos, branches, and history simultaneously. Our multi-repo mode works but is designed for tens of repos, not thousands
Precise navigation (SCIP)	Compiler-accurate go-to-definition and find-references via language-specific SCIP indexers. Our tree-sitter resolution is heuristic — good enough for most cases but fundamentally less accurate for typed languages
Diff/commit content search	First-class search within git diffs and commit messages with author/date filters. We have `co-change` (statistical correlation) but can't search actual diff content
Code monitoring	Persistent query-based alerts on new commits with webhook/Slack/email actions. Our `--watch` mode rebuilds the graph but doesn't evaluate persistent query triggers
Historical insights	Automatic time-series tracking of any metric over git history with dashboard visualization. We have manual snapshots but no automated trend tracking
Enterprise ecosystem	SSO, RBAC, audit logs, IDE extensions (VS Code, JetBrains, Neovim), browser extension for GitHub/GitLab code review. We're a CLI + MCP tool
Boolean query language	Rich boolean operators, compound filters, output reshaping, and named search contexts. Our search is either semantic (fuzzy) or exact-name (`where`)

Where codegraph wins over Sourcegraph:

Advantage	Details
Zero infrastructure	`npm install` and done. No server, no Docker, no cloud, no accounts. Sourcegraph requires either a cloud subscription or a self-hosted instance (Kubernetes/Docker Compose)
Function-level graph	We build and query at function/method/class granularity with call edges, dataflow, CFG, and impact analysis. Sourcegraph operates at file/symbol level — search finds symbols but doesn't build a persistent dependency graph with blast radius analysis
Impact analysis	`diff-impact`, `fn-impact`, `branch-compare` trace transitive blast radius through the call graph. Sourcegraph's `find-references` shows direct references but not transitive impact chains
Complexity & health metrics	Cognitive, cyclomatic, Halstead, MI per function with CI gates. Sourcegraph has no built-in code health metrics
Community detection & drift	Louvain clustering reveals architectural drift between directory structure and actual dependencies. Sourcegraph has no equivalent
Dataflow analysis	`flows_to`/`returns`/`mutates` edges track how data moves through functions. Sourcegraph doesn't do dataflow analysis
Control flow graphs	Per-function CFG with basic blocks stored in the graph. Sourcegraph doesn't build CFGs
Node role classification	Every symbol auto-tagged as entry/core/utility/adapter/dead/leaf. Sourcegraph has no architectural role concept
Cost	Completely free and open source (Apache-2.0). Sourcegraph's paid plans start at $49/user/month for enterprise features
Privacy	Your code never leaves your machine (unless you choose to connect an LLM). Sourcegraph Cloud processes your code on their infrastructure; self-hosted requires significant ops investment
AI-optimized output	`context`, `audit`, `triage`, `batch` commands are purpose-built for AI agent consumption with structured JSON. Sourcegraph's output is designed for human developers in a web UI

Not worth copying

Feature	Why skip
Memgraph/Neo4j/KuzuDB/SurrealDB	Our SQLite = zero Docker, simpler deployment. Query gap matters less than simplicity. codegraph-rust's SurrealDB requirement is its biggest weakness
SCIP indexing	Would require maintaining SCIP toolchains per language. Tree-sitter + native Rust is the right bet
Full CPG (AST+CFG+PDG)	Joern/cpg's approach requires fundamentally different parsing — we'd be rebuilding Joern. Tree-sitter gives us AST-level graphs; adding lightweight dataflow on top is the pragmatic path
Points-to analysis	Academic-grade JS analysis (jelly) — overkill for our use case and limited to JS/TS
Cloud-hosted graph service	mcp-code-graph (CodeGPT) requires accounts and cloud dependency — goes against our local-first philosophy
CrewAI multi-agent	Overengineered for a code analysis tool. Keep the scope focused
Clipboard/LLM-dump mode	Different product category (glimpse). We're a graph tool, not a context-packer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Competitive Analysis — Code Graph / Code Intelligence Tools

Overall Ranking

Tier 1: Direct Competitors (score ≥ 3.0)

Tier 2: Niche & Single-Language Tools (score 2.0–2.9)

Tier 3: Minimal or Inactive (score < 2.0)

Scoring Breakdown (Tier 1)

Where Codegraph Wins

Where Codegraph Loses

vs joern (#1, 2,956 stars)

vs narsil-mcp (#2, 101 stars)

vs code-graph-rag (#3, 1,916 stars)

vs cpg (#4, 411 stars)

vs glimpse (#5, 349 stars)

vs CKB (#6, 59 stars)

vs GitNexus (#7)

vs axon (#9, 29 stars)

vs codegraph-rust (#12, 142 stars)

vs jelly (#16, 417 stars)

vs aider (#38, 41,664 stars)

vs colbymchenry/codegraph (#20, 165 stars)

Features to Adopt — Priority Roadmap

Tier 1: High impact, low effort

Tier 2: High impact, medium effort

Tier 3: High impact, high effort

Paid Solutions

Sourcegraph (sourcegraph.com)

Not worth copying

Uh oh!

FilesExpand file tree

COMPETITIVE_ANALYSIS.md

Latest commit

History

COMPETITIVE_ANALYSIS.md

File metadata and controls

Competitive Analysis — Code Graph / Code Intelligence Tools

Overall Ranking

Tier 1: Direct Competitors (score ≥ 3.0)

Tier 2: Niche & Single-Language Tools (score 2.0–2.9)

Tier 3: Minimal or Inactive (score < 2.0)

Scoring Breakdown (Tier 1)

Where Codegraph Wins

Where Codegraph Loses

vs joern (#1, 2,956 stars)

vs narsil-mcp (#2, 101 stars)

vs code-graph-rag (#3, 1,916 stars)

vs cpg (#4, 411 stars)

vs glimpse (#5, 349 stars)

vs CKB (#6, 59 stars)

vs GitNexus (#7)

vs axon (#9, 29 stars)

vs codegraph-rust (#12, 142 stars)

vs jelly (#16, 417 stars)

vs aider (#38, 41,664 stars)

vs colbymchenry/codegraph (#20, 165 stars)

Features to Adopt — Priority Roadmap

Tier 1: High impact, low effort

Tier 2: High impact, medium effort

Tier 3: High impact, high effort

Paid Solutions

Sourcegraph (sourcegraph.com)

Not worth copying