Skip to content

Commit 8e7bc81

Browse files
committed
refactor: split core comment heuristics
Separate comment signals, classification, confidence scoring, and tag extraction so the synthesizer root stays focused on composing comment fields and follow-up heuristic changes stay localized. Made-with: Cursor
1 parent 46cc0c8 commit 8e7bc81

File tree

7 files changed

+772
-782
lines changed

7 files changed

+772
-782
lines changed

TODO.md

Lines changed: 21 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -8,24 +8,35 @@
88
- Prefer extracting pure helpers and formatter/parsing boundaries before moving async orchestration.
99
- Keep module roots thin; if a root becomes mostly re-exports, let children carry the logic.
1010

11+
## Immediate Queue
12+
13+
- [ ] `src/core/semantic.rs`
14+
- Split semantic index/store model types and defaults from persistence I/O helpers.
15+
- Split embedding metadata compatibility and adapter/fallback embedding generation.
16+
- Split source-file discovery and excerpt/query builders from index refresh bookkeeping.
17+
- Split semantic diff retrieval and feedback-example matching from feedback-store maintenance.
18+
- [ ] `src/core/symbol_index.rs`
19+
- Split LSP command detection and extension scanning from index-building entry points.
20+
- Split regex-based symbol extraction and dependency-hint parsing from graph/file-summary registration.
21+
- Split `LspClient` protocol transport from symbol-result decoding and path/URI utilities.
22+
- Keep `build()` and `build_with_lsp()` as thin orchestration entry points.
23+
1124
## Core Backlog
1225

13-
- [ ] `src/core/comment.rs`
14-
- Split category/severity/tag heuristics from confidence/fix-effort heuristics.
1526
- [ ] `src/core/semantic.rs`
16-
- Split semantic index/store model types and embedding metadata compatibility.
17-
- Split JSON/file persistence helpers from path derivation helpers.
18-
- Split embedding adapter/fallback logic from source discovery/chunk extraction.
19-
- Split semantic matching/ranking from semantic feedback example workflows.
20-
- Split index refresh/update orchestration from low-level chunk/state transforms.
27+
- Split JSON/file persistence helpers from default path derivation and atomic write helpers.
28+
- Split semantic chunk hashing/key generation from summary/excerpt assembly.
29+
- Split changed-range filtering and per-query match scoring from context chunk rendering.
30+
- Split feedback embedding-text/fingerprint helpers from feedback-store reconciliation.
2131
- [ ] `src/config.rs`
2232
- Split defaults/model-role conversion from load/deserialize paths.
2333
- Split env/path resolution from validation/migration logic.
2434
- Split serialization-focused test helpers from production config code.
2535
- [ ] `src/core/symbol_index.rs`
26-
- Split language-specific extraction/parsing from index construction.
27-
- Split retrieval/query helpers from persistence/cache helpers.
28-
- Split ranking/path-selection helpers from graph-aware expansion helpers.
36+
- Split language-pattern tables and path candidate expansion from dependency resolution.
37+
- Split file collection and byte-size filtering from index population.
38+
- Split symbol graph and reverse-dependency registration from symbol storage.
39+
- Split LSP symbol collection/range extraction from request/notification plumbing.
2940
- [ ] `src/core/symbol_graph.rs`
3041
- Split graph construction from traversal/query helpers.
3142
- Split serialization/persistence helpers from graph algorithms.

0 commit comments

Comments
 (0)