
Commit 1a2f566

prosdev and claude committed
docs(plans): drop git/github indexing from Phase 2
GitHub has its own MCP server, not everyone uses GH (Linear, Jira, etc.), and git CLI is excellent. Focus on code search — our unique value.

- 9 MCP tools → 6 (drop dev_history, dev_gh, dev_plan)
- 3 Antfly tables → 1 (code only)
- 2 indexing phases removed
- User stories US-10, US-11 deprecated

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent c7cc88d commit 1a2f566

2 files changed

Lines changed: 30 additions & 23 deletions


.claude/da-plans/core/phase-2-indexing-rethink/overview.md

Lines changed: 23 additions & 12 deletions
@@ -113,33 +113,44 @@ Problems:
 
 - **Scanner pipeline** — ts-morph, tree-sitter, remark (proven, well-tested)
 - **Document preparation** — `prepareDocumentsForEmbedding()` (pure transform)
-- **Git indexing** — as a separate command (`dev git index`)
-- **GitHub indexing** — as a separate command (`dev github index`)
 - **MCP adapter layer** — unchanged, consumes search results
 
+### What we deprecate
+
+- **Git history indexing** (`dev_history`, `dev git index`) — `git log`, `git blame`,
+and AI tools can run git commands directly. Semantic commit search is a nice-to-have
+but not worth the indexing cost.
+- **GitHub indexing** (`dev_gh`, `dev github index`) — GitHub's own MCP server handles
+issues, PRs, and repo context natively. `gh` CLI is excellent. No reason to maintain
+a separate index of the same data.
+- **`dev_plan`** context assembly — was valuable when it bundled issue + code + commits.
+With git/github dropped, this becomes just a code search wrapper. Can revisit if needed.
+
+This reduces from 3 Antfly tables to 1 (code only), and removes 2 indexing phases.
+
 ## Decisions
 
 | Decision | Rationale | Alternatives |
 |----------|-----------|-------------|
 | Use `@parcel/watcher` | Native, `getEventsSince()` survives restarts, VS Code uses it | chokidar (no historical queries), watchman (requires daemon) |
 | Use Antfly Linear Merge | Server-side content hashing eliminates state file entirely | Keep state file + manual upsert (more code, same result) |
 | Watch from MCP server process | MCP server is the long-running process; watcher lives there | Separate daemon (more complexity), CLI-only (no auto-update) |
-| Decouple git/github from `dev index .` | Different update patterns, different data sources | Keep bundled (slower `dev index .`, coupled concerns) |
+| Drop git/github indexing entirely | GitHub has its own MCP server; `gh` and `git` CLIs are excellent; AI tools call them directly. Not everyone uses GitHub — teams use Linear, Jira, Notion, Shortcut. By not coupling to GH, we stay tool-agnostic. Focus on code search — our unique value. | Keep as optional plugins (future, if demand) |
 | Debounce file changes (500ms) | Avoid re-indexing mid-save; batch rapid changes | Per-file immediate (too many API calls), longer debounce (stale data) |
 | Drop indexer-state.json | Antfly + watcher replace all its functions | Keep for backward compat (dead code) |
 
 ## Parts
 
 | Part | Description | User stories | Risk |
 |------|-------------|-------------|------|
-| 2.1 | Replace batch insert with Antfly Linear Merge | US-3, US-5, US-6 | Low |
-| 2.2 | Add `@parcel/watcher` to MCP server | US-4, US-12 | Medium |
-| 2.3 | Debounce + incremental re-index on file change | US-4 | Medium |
-| 2.4 | `getEventsSince` on MCP server startup | US-5, US-12 | Low |
-| 2.5 | Decouple git/github from `dev index .` | US-10, US-11 | Low |
-| 2.6 | Drop indexer-state.json, simplify RepositoryIndexer | US-3, US-6 | Medium |
+| 2.1 | Spike: verify Antfly Linear Merge API + `@parcel/watcher` | | Low |
+| 2.2 | Replace batch insert with Antfly Linear Merge | US-3, US-5, US-6 | Low |
+| 2.3 | Simplify RepositoryIndexer, drop state file | US-3, US-6 | Medium |
+| 2.4 | Add `@parcel/watcher` + debounced auto-index to MCP server | US-4, US-12 | Medium |
+| 2.5 | `getEventsSince` on MCP server startup | US-5, US-12 | Low |
+| 2.6 | Deprecate git/github indexing, remove adapters | US-12 | Low |
 | 2.7 | `dev status` rework — Antfly table stats + watcher status | US-13 | Low |
-| 2.8 | E2E tests: index real repo, search, verify results | US-3, US-8, US-9 | Low |
+| 2.8 | E2E tests: index this repo, search, verify results | US-3, US-8, US-9 | Low |
 
 ## Risk register
 
@@ -159,10 +170,10 @@ Problems:
 - [ ] `dev_search "validateUser"` returns exact match (BM25)
 - [ ] `dev_search "authentication middleware"` returns semantic matches (vector)
 - [ ] `dev index . --force` clears and rebuilds
-- [ ] `dev git index` works independently
-- [ ] `dev github index` works independently
 - [ ] `dev status` shows fresh Antfly stats + watcher status
 - [ ] No `indexer-state.json` written or read
+- [ ] Git/GitHub adapters removed (dev_history, dev_gh, dev_plan)
+- [ ] MCP tools reduced from 9 to 6 (search, refs, map, inspect, status, health)
 - [ ] Works on this repo (dev-agent) end-to-end
 
 ## Dependencies
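
Reviewer note on the "Debounce file changes (500ms)" decision in the hunk above: the intent (wait for a quiet period, then re-index the changed paths as one batch) could look roughly like the sketch below. This is only an illustration under assumptions. `ChangeBatcher`, `Scheduler`, and `realTimers` are hypothetical names that do not appear in the plan, and the `flush` callback stands in for whatever incremental re-index call the MCP server ends up using.

```typescript
// Hypothetical sketch: none of these names come from the dev-agent codebase.

/** Minimal timer interface so the batching logic can be exercised without real time. */
interface Scheduler {
  set(fn: () => void, ms: number): unknown;
  clear(handle: unknown): void;
}

/** Default scheduler backed by the global timers. */
const realTimers: Scheduler = {
  set: (fn, ms) => setTimeout(fn, ms),
  clear: (handle) => clearTimeout(handle as ReturnType<typeof setTimeout>),
};

class ChangeBatcher {
  private readonly pending = new Set<string>();
  private handle: unknown = undefined;
  private readonly flush: (paths: string[]) => void;
  private readonly delayMs: number;
  private readonly scheduler: Scheduler;

  constructor(
    flush: (paths: string[]) => void,
    delayMs: number = 500, // the debounce window named in the Decisions table
    scheduler: Scheduler = realTimers,
  ) {
    this.flush = flush;
    this.delayMs = delayMs;
    this.scheduler = scheduler;
  }

  /** Record a changed path and restart the quiet-period timer. */
  notify(path: string): void {
    this.pending.add(path); // the Set collapses repeated saves of one file
    if (this.handle !== undefined) {
      this.scheduler.clear(this.handle);
    }
    this.handle = this.scheduler.set(() => this.fire(), this.delayMs);
  }

  /** Hand the de-duplicated, sorted batch to the flush callback. */
  private fire(): void {
    const batch = Array.from(this.pending).sort();
    this.pending.clear();
    this.handle = undefined;
    this.flush(batch);
  }
}
```

A watcher callback would call `notify(path)` for each event and do the actual re-index inside `flush`. Injecting the scheduler is a design choice for testability only and is not something the plan prescribes.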

.claude/da-plans/core/phase-2-indexing-rethink/user-stories.md

Lines changed: 7 additions & 11 deletions
@@ -60,17 +60,13 @@ when my AI tool calls `dev_search`,
 the exact function should be the top result (BM25 keyword match),
 so that exact lookups are instant and precise.
 
-**US-10: Search git history**
-As a developer asking "when did we add rate limiting?",
-when my AI tool calls `dev_history`,
-I should get relevant commits with context,
-so that the AI understands the evolution of the code.
-
-**US-11: Search GitHub issues**
-As a developer asking "what bugs are open about search?",
-when my AI tool calls `dev_gh`,
-I should get matching issues/PRs,
-so that the AI can reference project context.
+**~~US-10: Search git history~~ (DEPRECATED)**
+Git history is better served by `git log`, `git blame`, and AI tools running git
+commands directly. Removed in Phase 2.
+
+**~~US-11: Search GitHub issues~~ (DEPRECATED)**
+GitHub issues/PRs are better served by GitHub's own MCP server and the `gh` CLI.
+Removed in Phase 2.
 
 ## Lifecycle
 