
Commit bdca654

docs: redesign roadmap phases based on self-analysis and ck competitor comparison
- Reorder phases by competitive priority (close UX/perf gaps first)
- Phase 32: Search UX & Output Modes (scores, full-section, JSONL, inspect)
- Phase 33: Precise Token Management (moved up — ck already has this)
- Phase 34: Performance & Smart Indexing (content-hash, parallel embedding)
- Phase 35: Advanced Embedding & Model Selection (multi-model, RRF weights)
- Phase 36: CI/CD Deep Integration (unique strength, kept)
- Phase 37: VS Code Extension & Editor Integration (merged old 37+39)
- Phase 38: Async Web & Real-Time Streaming (kept)
- Phase 39: Cross-Language Intelligence (moved down)
- Plugin Marketplace moved to Low Priority
- Removed old Phase 33 (Team & Cloud Mode) — local-only tool
1 parent 932116a commit bdca654

1 file changed

Lines changed: 67 additions & 47 deletions

docs/guide/roadmap.md

@@ -56,82 +56,102 @@ Planned improvements for CodexA, organized by priority.
5656

5757
## Upcoming Improvements
5858

59-
### Phase 32 — Cross-Language Intelligence
59+
> Phases redesigned after self-analysis with CodexA tools and competitive
60+
> comparison with [ck](https://github.com/BeaconBay/ck) (Rust-based semantic
61+
> search, v0.7.4). Priorities: close visible UX/performance gaps first, then
62+
> double down on CodexA's unique AI-powered strengths.
6063
61-
Unified code intelligence across language boundaries:
64+
### Phase 32 — Search UX & Output Modes
6265

63-
- Cross-language symbol resolution (e.g., Python calling Rust via FFI)
64-
- Polyglot dependency graphs linking imports across languages
65-
- Language-aware search boosting (prefer results in the query's context language)
66-
- Universal call graph spanning multiple languages in a workspace
66+
Close the biggest visible gaps in search ergonomics:
67+
68+
- `--scores` flag to display similarity scores with color highlighting
69+
- `--full-section` flag to return complete function/class bodies, not just chunk snippets
70+
- `--threshold` flag to filter results below a minimum similarity score
71+
- JSONL streaming output mode (`--jsonl`) for piping into downstream tools
72+
- `codexa search --inspect <file>` to visualize chunks, token counts, and embeddings for a file
73+
- `.codexaignore` auto-generation from detected binary/vendored/generated files
74+
- Smart binary detection to skip non-text files during indexing
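To make the planned `--threshold` and `--jsonl` behavior concrete, here is a minimal Python sketch of score filtering followed by line-delimited JSON output. It is not CodexA's implementation: the result shape (`file`, `score`) and the helper names `filter_by_threshold` and `write_jsonl` are assumptions made for illustration.

```python
import json
import sys
from typing import Iterable, Iterator, TextIO


def filter_by_threshold(results: Iterable[dict], threshold: float) -> Iterator[dict]:
    """Yield only results whose similarity score meets the threshold."""
    for result in results:
        # "score" is an assumed field name, not a documented CodexA key.
        if result["score"] >= threshold:
            yield result


def write_jsonl(results: Iterable[dict], out: TextIO = sys.stdout) -> int:
    """Write one compact JSON object per line and return the line count."""
    count = 0
    for result in results:
        out.write(json.dumps(result, separators=(",", ":")) + "\n")
        count += 1
    return count
```

Because each result is written as soon as it is produced, a downstream tool reading the pipe can start processing before the search finishes.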
6775

68-
### Phase 33 — Team & Cloud Mode
76+
### Phase 33 — Precise Token Management
6977

70-
Optional team collaboration features (privacy-first, opt-in):
78+
Replace rough token estimation with model-specific counting (ck already ships exact tokenization):
7179

72-
- Shared search indices with team-scoped access control
73-
- Remote index hosting for large monorepos (gRPC or HTTP)
74-
- Index sharding and distributed search across machines
75-
- Audit logging for compliance-sensitive environments
80+
- `tiktoken` for OpenAI models, HuggingFace `tokenizers` for local/Ollama models
81+
- Accurate context window budgeting with overflow protection in RAG pipeline
82+
- Token usage reporting and cost estimation per query
83+
- Smart context truncation preserving semantic boundaries (function/class edges)
84+
- `codexa search --tokens` to show token count per result
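The budgeting and boundary-preserving truncation items above could look roughly like the following sketch. It packs whole code units (for example, complete functions) into a context until the next unit would exceed the budget. The function names are hypothetical, and `approx_token_count` is a deliberately crude whitespace placeholder; a real build would pass in a model-specific tokenizer as the `count_tokens` argument.

```python
from typing import Callable, List


def approx_token_count(text: str) -> int:
    """Placeholder counter (whitespace-separated words), not a real tokenizer."""
    return len(text.split())


def pack_within_budget(
    units: List[str],
    budget: int,
    count_tokens: Callable[[str], int] = approx_token_count,
) -> List[str]:
    """Keep whole units, in order, until the next one would exceed the budget."""
    kept: List[str] = []
    used = 0
    for unit in units:
        cost = count_tokens(unit)
        if used + cost > budget:
            break  # stop at a unit boundary instead of cutting mid-function
        kept.append(unit)
        used += cost
    return kept
```

Stopping at the first unit that does not fit keeps the retained context contiguous; a variant could instead skip oversized units and keep scanning, at the cost of reordering gaps.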
7685

77-
### Phase 34 — CI/CD Deep Integration
86+
### Phase 34 — Performance & Smart Indexing
7887

79-
First-class CI pipeline integration beyond quality gates:
88+
Content-aware incremental indexing to close the speed gap:
8089

81-
- PR diff-aware indexing — only re-index changed files in CI
82-
- Automated PR review comments via GitHub Actions / GitLab CI
83-
- Quality trend dashboards exported as CI artifacts
84-
- Breaking-change detection based on call graph + reference analysis
85-
- Configurable CI profiles (fast/thorough/security-only)
90+
- Content-hash (blake3) per chunk — skip re-embedding unchanged code
91+
- Parallel embedding with configurable worker count
92+
- Batch FAISS insertion instead of one-by-one vector adds
93+
- Memory-mapped FAISS indices for low-RAM machines
94+
- `codexa index --diff` to index only git-changed files
95+
- Indexing progress bar with ETA and throughput stats
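As an illustration of the content-hash idea, the sketch below records a digest per chunk and reports only the chunks whose content changed since the last run. It is an assumption-laden sketch, not CodexA code: the roadmap names blake3, while this example uses the standard library's `hashlib.blake2b` as a stand-in so it runs without extra dependencies, and the `chunk_id`/`seen` structures are hypothetical.

```python
import hashlib
from typing import Dict, Iterable, List, Tuple


def chunk_digest(text: str) -> str:
    # Stand-in hash: blake2b from the standard library, not blake3.
    return hashlib.blake2b(text.encode("utf-8"), digest_size=16).hexdigest()


def chunks_to_embed(
    chunks: Iterable[Tuple[str, str]], seen: Dict[str, str]
) -> List[str]:
    """Return IDs of new or changed chunks, updating `seen` in place."""
    changed: List[str] = []
    for chunk_id, text in chunks:
        digest = chunk_digest(text)
        if seen.get(chunk_id) != digest:
            changed.append(chunk_id)  # content is new or differs: re-embed
            seen[chunk_id] = digest
    return changed
```

Only the IDs returned here would need to be sent to the embedding model; unchanged chunks keep their existing vectors.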
8696

87-
### Phase 35 — Advanced Embedding & Search
97+
### Phase 35 — Advanced Embedding & Model Selection
8898

89-
Next-generation search infrastructure:
99+
Multiple embedding models and smarter search infrastructure:
90100

91-
- Fine-tuned code embedding models (CodeBERT, StarEncoder)
101+
- Support BGE, mxbai-embed, nomic-embed, jina-code-v2 alongside current MiniLM
102+
- Model switching at query time without full re-index (dual-index mode)
92103
- GPU-accelerated FAISS with IVF-PQ indices for million-file repos
93104
- Field-scoped search filters (`--lang`, `--symbol-type`, `--file`)
94105
- Configurable RRF weights for hybrid search tuning
95-
- Re-ranking with cross-encoders for precision-critical queries
106+
- `codexa models compare` to benchmark models on the user's actual codebase
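For readers unfamiliar with RRF (reciprocal rank fusion), the following sketch shows one common way to add per-retriever weights: each document scores `weight / (k + rank)` in every ranking that contains it, and the scores are summed. The function name, the retriever labels, and the default `k = 60` are assumptions for illustration; the roadmap does not specify how CodexA will expose or compute these weights.

```python
from collections import defaultdict
from typing import Dict, List


def weighted_rrf(
    rankings: Dict[str, List[str]],
    weights: Dict[str, float],
    k: int = 60,
) -> List[str]:
    """Fuse ranked ID lists; a higher weight gives a retriever more influence."""
    scores: Dict[str, float] = defaultdict(float)
    for name, ranked_ids in rankings.items():
        weight = weights.get(name, 1.0)  # unlisted retrievers default to 1.0
        for rank, doc_id in enumerate(ranked_ids, start=1):
            scores[doc_id] += weight / (k + rank)
    # Sort by descending score, breaking ties by ID for deterministic output.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
```

Raising the weight of the semantic ranking relative to the keyword ranking shifts the fused order toward the semantic results, which is the kind of tuning the bullet above describes.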
96107

97-
### Phase 36 — Async Web & Real-Time Streaming
108+
### Phase 36 — CI/CD Deep Integration
98109

99-
Migrate the web server to a modern async framework:
110+
First-class CI pipeline integration — a unique CodexA strength:
100111

101-
- WebSocket streaming for live search results
102-
- Non-blocking request handling with connection pooling
103-
- Server-sent events for long-running operations (indexing progress)
104-
- Real-time collaboration widgets in the web UI
112+
- PR diff-aware indexing — only re-index changed files in CI
113+
- Automated PR review comments via GitHub Actions / GitLab CI
114+
- Quality trend dashboards exported as CI artifacts (HTML + JSON)
115+
- Breaking-change detection based on call graph + reference analysis
116+
- Configurable CI profiles (`fast` / `thorough` / `security-only`)
105117

106-
### Phase 37 — Plugin Marketplace & Sandboxing
118+
### Phase 37 — VS Code Extension & Editor Integration
107119

108-
Mature the plugin ecosystem:
120+
Marketplace-ready VS Code extension with deep editor features:
109121

110-
- Plugin sandboxing with resource limits and restricted filesystem access
111-
- Community plugin registry with versioning and discovery
112-
- Plugin dependency resolution and conflict detection
113-
- Visual plugin configuration in the web UI
122+
- Inline code explanations as CodeLens / inlay hints
123+
- Semantic go-to-definition across indexed repos
124+
- Live quality annotations in the editor gutter
125+
- Multi-root workspace support with cross-repo navigation
126+
- Extension marketplace publishing and auto-update
114127

115-
### Phase 38 — Precise Token Management
128+
### Phase 38 — Async Web & Real-Time Streaming
116129

117-
Replace rough token estimation with model-specific counting:
130+
Migrate the web server to a modern async framework:
118131

119-
- `tiktoken` for OpenAI models, model-specific tokenizers for Ollama
120-
- Accurate context window budgeting with overflow protection
121-
- Token usage reporting and cost estimation per query
122-
- Smart context truncation preserving semantic boundaries
132+
- WebSocket streaming for live search results
133+
- Non-blocking request handling with connection pooling
134+
- Server-sent events for long-running operations (indexing progress)
135+
- Real-time dashboard with quality trends and search analytics
123136

124-
### Phase 39 — LSP 2.0 & Editor Deep Integration
137+
### Phase 39 — Cross-Language Intelligence
125138

126-
Enhanced editor integration beyond current LSP:
139+
Unified code intelligence across language boundaries:
127140

128-
- Inline code explanations as CodeLens / inlay hints
129-
- Semantic go-to-definition across indexed repos
130-
- Live quality annotations in the editor gutter
131-
- Multi-root workspace support with cross-repo navigation
141+
- Cross-language symbol resolution (e.g., Python calling Rust via FFI)
142+
- Polyglot dependency graphs linking imports across languages
143+
- Language-aware search boosting (prefer results in the query's context language)
144+
- Universal call graph spanning multiple languages in a workspace
132145

133146
## Low Priority (Future)
134147

148+
### Plugin Marketplace & Sandboxing
149+
150+
- Plugin sandboxing with resource limits and restricted filesystem access
151+
- Community plugin registry with versioning and discovery
152+
- Plugin dependency resolution and conflict detection
153+
- Visual plugin configuration in the web UI
154+
135155
### Fine-Tuned Embedding Models
136156

137157
- Domain-specific vocabulary handling
