|
1 | | -# CodexA v0.4.5 — Release Notes |
| 1 | +# CodexA v0.5.0 — Release Notes |
2 | 2 |
|
3 | | -> **Released:** 2025 · **License:** MIT · **Docs:** [codex-a.dev](https://codex-a.dev) |
| 3 | +> **Released:** March 2026 · **License:** MIT · **Docs:** [codex-a.dev](https://codex-a.dev) |
4 | 4 |
|
5 | 5 | **CodexA** is a developer intelligence engine for semantic code search, AI-assisted code understanding, and agent tooling. |
6 | 6 |
|
7 | 7 | --- |
8 | 8 |
|
9 | | -## What's New in v0.4.5 |
| 9 | +## What's New in v0.5.0 |
10 | 10 |
|
11 | | -This release implements **Phase 31 — RAG Pipeline for LLM Commands**, replacing the old "dump context into prompt" approach with a proper Retrieval-Augmented Generation pipeline. |
| 11 | +The biggest release yet -- 12 phases of development in a single version. CodexA now ships with a Rust-powered search engine, a full RAG pipeline, editor plugins for 9 editors, multi-agent orchestration, and cross-language intelligence. |
| 12 | + |
| 13 | +### Rust Search Engine Core |
| 14 | +- `codexa-core` crate compiled via PyO3/maturin |
| 15 | +- HNSW vector index with `instant-distance` for O(log n) nearest-neighbour search |
| 16 | +- Memory-mapped persistence for near-instant startup |
| 17 | +- AST-aware chunker splitting code at function/class/method boundaries (10 languages) |
| 18 | +- Rust BM25 index, parallel scanner with blake3 hashing, RRF fusion |
| 19 | +- ONNX embedder for ONNX Runtime inference |
| 20 | +- All Rust components optional -- Python fallback when crate is not installed |
12 | 21 |
|
13 | 22 | ### RAG Pipeline |
14 | | -- **4-stage pipeline**: Retrieve → Deduplicate → Re-rank → Assemble — each stage optimized for precision and token efficiency |
15 | | -- **Retrieval strategies**: `semantic` (vector), `keyword` (BM25), `hybrid` (RRF merge), `multi` (parallel with diversity) |
16 | | -- **Cross-encoder re-ranking**: Optional `ms-marco-MiniLM-L-6-v2` model for high-precision re-ranking (set `rag_use_cross_encoder: true`) |
17 | | -- **Token-aware assembly**: Context is assembled within a configurable token budget (default 3000), preventing prompt overflow |
18 | | -- **Source citations**: Responses include numbered `[N]` markers citing exact file paths and line ranges |
19 | | - |
20 | | -### Configuration |
21 | | -Three new fields in `llm` config (`.codexa/config.json`): |
22 | | -```json |
23 | | -{ |
24 | | - "llm": { |
25 | | - "rag_budget_tokens": 3000, |
26 | | - "rag_strategy": "hybrid", |
27 | | - "rag_use_cross_encoder": false |
28 | | - } |
29 | | -} |
| 23 | +- 4-stage retrieval: Retrieve, Deduplicate, Re-rank, Assemble with token budget |
| 24 | +- Configurable strategies: `semantic`, `keyword`, `hybrid`, `multi` |
| 25 | +- Cross-encoder re-ranking with `ms-marco-MiniLM-L-6-v2` |
| 26 | +- Source citations with numbered `[N]` markers in LLM responses |
| 27 | + |
| 28 | +### Search Dominance |
| 29 | +- JSONL streaming, scored output, snippet control |
| 30 | +- Incremental indexing with Ctrl+C partial-save safety |
| 31 | +- Tantivy full-text engine via PyO3 |
| 32 | +- MCP Server v2 with cursor-based pagination |
| 33 | +- `.codexaignore` auto-create on first index |
| 34 | + |
| 35 | +### Model Hub and Distribution |
| 36 | +- `--switch-model` for hot-swapping embedding models |
| 37 | +- Model verification with `codexa models download --verify` |
| 38 | +- Pre-built wheels for Linux (x86_64, aarch64), macOS (universal2), Windows (x64) |
| 39 | +- Scoop and Chocolatey manifests, standalone PyInstaller binaries, Docker image |
| 40 | + |
| 41 | +### Editor Plugins |
| 42 | +First-class support for 9 editors: |
| 43 | +VS Code, Zed, JetBrains (IntelliJ/PyCharm/WebStorm), Neovim, Vim, Sublime Text, Emacs, Helix, Eclipse. MCP configs for Cursor and Windsurf. |
| 44 | + |
| 45 | +### Multi-Agent Orchestration |
| 46 | +- Thread-safe concurrent agent sessions with TTL cleanup |
| 47 | +- Shared discovery pool across multiple AI agents |
| 48 | +- Semantic diff: AST-level detection of renames, moves, signature changes |
| 49 | +- RAG-grounded code generation |
| 50 | +- Bridge session endpoints: `/sessions`, `/sessions/create`, `/sessions/close` |
| 51 | + |
| 52 | +### Cross-Language Intelligence |
| 53 | +- FFI pattern detection: Python-Rust, Python-C, JS-WASM, Java-JNI |
| 54 | +- Polyglot dependency graphs for multi-language import tracking |
| 55 | +- Language-aware search boosting with configurable boost factor |
| 56 | +- Universal call graph across languages |
| 57 | + |
| 58 | +### Install |
| 59 | +```bash |
| 60 | +pip install codexa==0.5.0 |
30 | 61 | ``` |
31 | 62 |
|
32 | | -### Integration |
33 | | -- `codexa ask` — RAG-powered context retrieval with citations |
34 | | -- `codexa chat` — RAG context injection into conversation |
35 | | -- `codexa suggest` — RAG-enhanced improvement suggestions |
36 | | -- `codexa investigate` — RAG-powered search actions with citation markers |
37 | | -- Web UI and REST API — RAG config passed through |
| 63 | +### Stats |
| 64 | +- 2657 tests passing |
| 65 | +- 42 CLI commands |
| 66 | +- 13 built-in AI agent tools |
| 67 | +- 12 language parsers |
| 68 | +- 9 editor plugins |
38 | 69 |
|
39 | 70 | --- |
40 | 71 |
|
|
0 commit comments