@@ -56,82 +56,102 @@ Planned improvements for CodexA, organized by priority.

 ## Upcoming Improvements

-### Phase 32 — Cross-Language Intelligence
+> Phases redesigned after self-analysis with CodexA tools and competitive
+> comparison with [ck](https://github.com/BeaconBay/ck) (Rust-based semantic
+> search, v0.7.4). Priorities: close visible UX/performance gaps first, then
+> double down on CodexA's unique AI-powered strengths.

-Unified code intelligence across language boundaries:
+### Phase 32 — Search UX & Output Modes

-- Cross-language symbol resolution (e.g., Python calling Rust via FFI)
-- Polyglot dependency graphs linking imports across languages
-- Language-aware search boosting (prefer results in the query's context language)
-- Universal call graph spanning multiple languages in a workspace
+Close the biggest visible gaps in search ergonomics:
+
+- `--scores` flag to display similarity scores with color highlighting
+- `--full-section` flag to return complete function/class bodies, not just chunk snippets
+- `--threshold` flag to filter results below a minimum similarity score
+- JSONL streaming output mode (`--jsonl`) for piping into downstream tools
+- `codexa search --inspect <file>` to visualize chunks, token counts, and embeddings for a file
+- `.codexaignore` auto-generation from detected binary/vendored/generated files
+- Smart binary detection to skip non-text files during indexing
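The proposed `--threshold` and `--jsonl` behaviours could be prototyped roughly as follows. This is only a sketch: the `Hit` record, its field names, and both function names are hypothetical and are not taken from CodexA's code.

```python
import json
from dataclasses import asdict, dataclass
from typing import Iterable, Iterator


@dataclass
class Hit:
    """Hypothetical search result; the field names are assumptions."""
    path: str
    score: float
    snippet: str


def filter_by_threshold(hits: Iterable[Hit], threshold: float) -> Iterator[Hit]:
    """Yield only hits whose similarity score is at least `threshold`."""
    for hit in hits:
        if hit.score >= threshold:
            yield hit


def to_jsonl(hits: Iterable[Hit]) -> Iterator[str]:
    """Serialize each hit as one JSON object per line (JSONL)."""
    for hit in hits:
        yield json.dumps(asdict(hit), ensure_ascii=False)
```

Because both functions are generators, results can be written to stdout one line at a time, which is what makes the output usable in a shell pipeline before the search has finished.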

-### Phase 33 — Team & Cloud Mode
+### Phase 33 — Precise Token Management

-Optional team collaboration features (privacy-first, opt-in):
+Replace rough token estimation with model-specific counting (ck already ships exact tokenization):

-- Shared search indices with team-scoped access control
-- Remote index hosting for large monorepos (gRPC or HTTP)
-- Index sharding and distributed search across machines
-- Audit logging for compliance-sensitive environments
+- `tiktoken` for OpenAI models, HuggingFace `tokenizers` for local/Ollama models
+- Accurate context window budgeting with overflow protection in RAG pipeline
+- Token usage reporting and cost estimation per query
+- Smart context truncation preserving semantic boundaries (function/class edges)
+- `codexa search --tokens` to show token count per result
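One way to read "context window budgeting" together with "truncation preserving semantic boundaries" is to keep whole chunks in ranked order and stop before the first one that would overflow the budget. The sketch below assumes each chunk is already a complete function or class. The function names are hypothetical, and the whitespace counter is a deliberate stand-in for a model-specific tokenizer.

```python
from typing import Callable, List


def count_tokens_whitespace(text: str) -> int:
    """Stand-in tokenizer (an assumption): counts whitespace-separated words."""
    return len(text.split())


def pack_chunks(
    chunks: List[str],
    budget: int,
    count_tokens: Callable[[str], int] = count_tokens_whitespace,
) -> List[str]:
    """Keep whole chunks, in ranked order, until the next one would exceed `budget`."""
    kept: List[str] = []
    used = 0
    for chunk in chunks:
        cost = count_tokens(chunk)
        if used + cost > budget:
            break  # stop at a chunk boundary instead of cutting mid-function
        kept.append(chunk)
        used += cost
    return kept
```

Passing the counter as a parameter keeps the budgeting logic independent of the tokenizer, so an exact per-model counter can replace the stand-in without touching the packing loop.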

-### Phase 34 — CI/CD Deep Integration
+### Phase 34 — Performance & Smart Indexing

-First-class CI pipeline integration beyond quality gates:
+Content-aware incremental indexing to close the speed gap:

-- PR diff-aware indexing — only re-index changed files in CI
-- Automated PR review comments via GitHub Actions / GitLab CI
-- Quality trend dashboards exported as CI artifacts
-- Breaking-change detection based on call graph + reference analysis
-- Configurable CI profiles (fast/thorough/security-only)
+- Content-hash (blake3) per chunk — skip re-embedding unchanged code
+- Parallel embedding with configurable worker count
+- Batch FAISS insertion instead of one-by-one vector adds
+- Memory-mapped FAISS indices for low-RAM machines
+- `codexa index --diff` to index only git-changed files
+- Indexing progress bar with ETA and throughput stats
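The per-chunk content-hash idea can be illustrated with a small cache keyed by digest. This is a sketch under assumptions: the roadmap names blake3, but the example uses BLAKE2b from Python's standard library so it runs without extra dependencies, and `chunk_digest`, `embed_changed`, and the in-memory `cache` dict are hypothetical names.

```python
import hashlib
from typing import Callable, Dict, List, Sequence


def chunk_digest(text: str) -> str:
    """Hash chunk content; stdlib BLAKE2b stands in for blake3 here."""
    return hashlib.blake2b(text.encode("utf-8"), digest_size=16).hexdigest()


def embed_changed(
    chunks: Sequence[str],
    cache: Dict[str, List[float]],
    embed: Callable[[str], List[float]],
) -> List[List[float]]:
    """Return one vector per chunk, calling `embed` only for unseen content."""
    vectors: List[List[float]] = []
    for text in chunks:
        key = chunk_digest(text)
        if key not in cache:
            cache[key] = embed(text)  # cache miss: content is new or changed
        vectors.append(cache[key])
    return vectors
```

Keying on content rather than on file path or modification time means a chunk that merely moved within a file, or a file that was touched without changes, does not trigger re-embedding.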

-### Phase 35 — Advanced Embedding & Search
+### Phase 35 — Advanced Embedding & Model Selection

-Next-generation search infrastructure:
+Multiple embedding models and smarter search infrastructure:

-- Fine-tuned code embedding models (CodeBERT, StarEncoder)
+- Support BGE, mxbai-embed, nomic-embed, jina-code-v2 alongside current MiniLM
+- Model switching at query time without full re-index (dual-index mode)
 - GPU-accelerated FAISS with IVF-PQ indices for million-file repos
 - Field-scoped search filters (`--lang`, `--symbol-type`, `--file`)
 - Configurable RRF weights for hybrid search tuning
-- Re-ranking with cross-encoders for precision-critical queries
+- `codexa models compare` to benchmark models on the user's actual codebase
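For the "configurable RRF weights" item, weighted reciprocal rank fusion is commonly written as a per-list weight divided by a constant plus the rank. The sketch below shows that general technique, not CodexA's implementation. The function name is hypothetical, and `k = 60` is the conventional default rather than a value taken from the roadmap.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def weighted_rrf(
    rankings: Sequence[List[str]],
    weights: Sequence[float],
    k: int = 60,
) -> List[str]:
    """Fuse ranked ID lists: each list adds weight / (k + rank) to an ID's score."""
    if len(rankings) != len(weights):
        raise ValueError("need one weight per ranking")
    scores: Dict[str, float] = defaultdict(float)
    for ranking, weight in zip(rankings, weights):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += weight / (k + rank)
    # Highest fused score first; ties are broken by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))
```

Raising the weight of one list (for example the vector ranking relative to the keyword ranking) shifts the fused order toward that list without needing the two scoring scales to be comparable, since only ranks enter the formula.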

-### Phase 36 — Async Web & Real-Time Streaming
+### Phase 36 — CI/CD Deep Integration

-Migrate the web server to a modern async framework:
+First-class CI pipeline integration — a unique CodexA strength:

-- WebSocket streaming for live search results
-- Non-blocking request handling with connection pooling
-- Server-sent events for long-running operations (indexing progress)
-- Real-time collaboration widgets in the web UI
+- PR diff-aware indexing — only re-index changed files in CI
+- Automated PR review comments via GitHub Actions / GitLab CI
+- Quality trend dashboards exported as CI artifacts (HTML + JSON)
+- Breaking-change detection based on call graph + reference analysis
+- Configurable CI profiles (`fast`/`thorough`/`security-only`)

-### Phase 37 — Plugin Marketplace & Sandboxing
+### Phase 37 — VS Code Extension & Editor Integration

-Mature the plugin ecosystem:
+Marketplace-ready VS Code extension with deep editor features:

-- Plugin sandboxing with resource limits and restricted filesystem access
-- Community plugin registry with versioning and discovery
-- Plugin dependency resolution and conflict detection
-- Visual plugin configuration in the web UI
+- Inline code explanations as CodeLens / inlay hints
+- Semantic go-to-definition across indexed repos
+- Live quality annotations in the editor gutter
+- Multi-root workspace support with cross-repo navigation
+- Extension marketplace publishing and auto-update

-### Phase 38 — Precise Token Management
+### Phase 38 — Async Web & Real-Time Streaming

-Replace rough token estimation with model-specific counting:
+Migrate the web server to a modern async framework:

-- `tiktoken` for OpenAI models, model-specific tokenizers for Ollama
-- Accurate context window budgeting with overflow protection
-- Token usage reporting and cost estimation per query
-- Smart context truncation preserving semantic boundaries
+- WebSocket streaming for live search results
+- Non-blocking request handling with connection pooling
+- Server-sent events for long-running operations (indexing progress)
+- Real-time dashboard with quality trends and search analytics

-### Phase 39 — LSP 2.0 & Editor Deep Integration
+### Phase 39 — Cross-Language Intelligence

-Enhanced editor integration beyond current LSP:
+Unified code intelligence across language boundaries:

-- Inline code explanations as CodeLens / inlay hints
-- Semantic go-to-definition across indexed repos
-- Live quality annotations in the editor gutter
-- Multi-root workspace support with cross-repo navigation
+- Cross-language symbol resolution (e.g., Python calling Rust via FFI)
+- Polyglot dependency graphs linking imports across languages
+- Language-aware search boosting (prefer results in the query's context language)
+- Universal call graph spanning multiple languages in a workspace

 ## Low Priority (Future)

+### Plugin Marketplace & Sandboxing
+
+- Plugin sandboxing with resource limits and restricted filesystem access
+- Community plugin registry with versioning and discovery
+- Plugin dependency resolution and conflict detection
+- Visual plugin configuration in the web UI
+
 ### Fine-Tuned Embedding Models

 - Domain-specific vocabulary handling