stainless-code
diff --git a/‎.changeset/codemap-richer-index.md‎
Lines changed: 19 additions & 0 deletions b/‎.changeset/codemap-richer-index.md‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎bun.lock‎
Lines changed: 149 additions & 158 deletions b/‎bun.lock‎
Lines changed: 149 additions & 158 deletions
diff --git a/‎docs/architecture.md‎
Lines changed: 190 additions & 22 deletions b/‎docs/architecture.md‎
Lines changed: 190 additions & 22 deletions
diff --git a/‎docs/glossary.md‎
Lines changed: 40 additions & 0 deletions b/‎docs/glossary.md‎
Lines changed: 40 additions & 0 deletions
diff --git a/‎docs/golden-queries.md‎
Lines changed: 7 additions & 7 deletions b/‎docs/golden-queries.md‎
Lines changed: 7 additions & 7 deletions
@@ -0,0 +1,19 @@
+---
+"@stainless-code/codemap": minor
+---
+
+`codemap-richer-index` — substrate extraction across 12 tiers. **Schema bump** (`SCHEMA_VERSION` 10 → 26) — first run after upgrade rebuilds `.codemap/index.db` from source.
+
+**10 new substrate tables**: `import_specifiers`, `scopes`, `references`, `bindings`, `function_params`, `file_metrics`, `re_export_chains`, `module_cycles`, `runtime_markers`, `test_suites`.
+
+**Column additions**: `symbols.{name_column_start, name_column_end, scope_local_id, body_line_count, param_count, nesting_depth}` · `calls.{line_start, column_start, column_end}` · `exports.{is_re_export, line_start, line_end, column_start, column_end}` · `markers.{column_start, column_end}`.
+
+**12 new recipes**: `find-references` · `find-symbol-references` · `find-write-sites` · `find-by-param-type` · `large-functions` · `deeply-nested-functions` · `circular-imports` · `barrel-chains` · `find-leftover-console` · `env-var-audit` · `find-skipped-tests` · `tests-by-file`.
+
+**Architecture**: modular extractor pattern (R.17) splits `parser.ts` into per-tier extractors under `src/extractors/` with a shared `ExtractContext`. Targeted reindex stays sub-100ms; full reindex includes bindings resolution + Tarjan SCC + re-export chain materialisation.
+
+**Reference precision**: `references` table emits every identifier USE with column-precise positions; `kind='member'` rows distinguish non-computed property access from bindings. Native JSX tags + JSXAttribute names + long-hand object-literal keys are suppressed. `TSQualifiedName` (e.g. `React.ReactNode`) splits into namespace head (`kind='type'`) + member tail (`kind='member'`). Bindings resolver (full-rebuild only) walks same-file scope → imports → globals → unresolved with deduped TypeScript / DOM / Node / ES global sets. Re-export chains followed up to 10 hops with cycle detection.
+
+**Dependency bumps**: `oxc-parser` 0.127 → 0.130 · `zod` 4.3 → 4.4 (dedupe override added so the MCP SDK keeps a single `$ZodType` identity) · `tsdown` 0.21 → 0.22 (declared `unrun` as devDep to unblock CI build under Node's tsdown binstub).
+
+**Docs sync**: `docs/architecture.md` § Schema reflects every new table + column; `docs/glossary.md` gains 10 new entries; `docs/golden-queries.md` + `fixtures/golden/` regenerated. Templates (`templates/agents/`) updated with the new schema overview + trigger patterns.
@@ -55,6 +55,10 @@ In Codemap usage: a file with a high number of `exports` rows — typically a pu
 
 The shared `batchInsert<T>()` helper in `src/db.ts`. Splits inserts into multi-row `INSERT … VALUES (…),(…)` statements of `BATCH_SIZE` (500) rows each, with pre-computed placeholder strings. Used by every `insertX` function.
 
+### `bindings` (table) / bindings resolver
+
+Per-reference resolution to the originating symbol per [R.12]. One row per non-`member`-kind `references` row, with `resolution_kind` in `{same-file, imported, global, unresolved}` and a nullable `resolved_symbol_id` joining `symbols(id)`. Resolved in a single in-memory pass (`src/application/bindings-engine.ts`) after files/symbols/imports settle. Full-rebuild only — targeted reindex skips per [R.10]. Powers `find-symbol-references` (bindings-precise rename substrate).
+
 ### `boundaries` (config) / `boundary_rules` (table) / `boundary-violations` (recipe)
 
 Architecture-boundary substrate. Users declare `boundaries: [{name, from_glob, to_glob, action?}]` in `.codemap/config.ts`; the resolver fills `action` to `"deny"` when omitted. Every index pass calls `reconcileBoundaryRules` (in `src/db.ts`) which clears `boundary_rules` and re-inserts from the resolved config — config is the single source of truth, the table is a denormalised lookup. Bundled `boundary-violations` recipe joins `dependencies` × `boundary_rules` via SQLite `GLOB` and surfaces forbidden import edges; `--format sarif` lights up automatically because the recipe row aliases `dependencies.from_path` to `file_path`. CHECK constraint pins `action ∈ {'deny','allow'}`. v1 only honours `'deny'`; `'allow'` reserves the slot for future whitelist semantics. See [architecture.md § `boundary_rules`](./architecture.md#boundary_rules--architecture-boundary-rules-config-derived-strict-without-rowid).
@@ -107,6 +111,14 @@ CI-aggregate flag on `codemap query` and `codemap audit`. Aliases `--format sari
 
 CLI subcommand comparing on-disk SHA-256 against `files.content_hash`. Statuses: `stale | missing | unindexed`. Exits `1` on any drift.
 
+### `module_cycles` (table) / circular imports
+
+Strongly-connected components of the import dependency graph computed via Tarjan after the full index pass. Only cyclic files appear (SCC size ≥ 2, or size-1 with a self-edge). Powers `circular-imports` recipe.
+
+### `re_export_chains` (table)
+
+Materialised re-export resolution. One row per `(from_file, from_name)` walked through barrel files to the terminal definition site. Bounded at 10 hops with cycle detection. `truncated = 1` flags chains that hit the cap or an unindexed file mid-walk. Powers `barrel-chains` recipe.
+
 ### `components` (table)
 
 React components (PascalCase + JSX return or hook usage). PascalCase functions that neither return JSX nor call hooks stay in `symbols` only — never `components`. `hooks_used` is JSON-encoded. See `ComponentRow`.
@@ -275,6 +287,10 @@ Index mode that diffs against `last_indexed_commit` (git) and only re-indexes ch
 
 Symbol or file blast-radius walker. CLI: `codemap impact <target> [--direction up|down|both] [--depth N] [--via dependencies|calls|imports|all] [--limit N] [--summary] [--json]`. MCP: `impact` tool. HTTP: `POST /tool/impact`. Replaces hand-composed `WITH RECURSIVE` queries that agents struggle to write reliably. Walks compatible graphs based on resolved target kind: **symbol** targets walk `calls` (callers / callees by name); **file** targets walk `dependencies` + `imports` (`resolved_path` only). Mismatched explicit `--via` choices land in `skipped_backends` instead of failing. Cycle-detected via path-string `instr` check inside the recursive CTE; bounded by `--depth` (default 3, 0 = unbounded but still cycle-detected and limit-capped) and `--limit` (default 500). Result envelope: `{target, direction, via, depth_limit, matches: [{depth, direction, edge, kind, name?, file_path}], summary: {nodes, max_depth_reached, by_kind, terminated_by: 'depth'|'limit'|'exhausted'}}`. `--summary` trims `matches` for cheap CI gate consumption (`jq '.summary.nodes'`) but preserves the count. Pure transport-agnostic engine in `application/impact-engine.ts`; CLI / MCP / HTTP all dispatch the same `findImpact` function. `sarif` / `annotations` formats not supported (impact rows are graph traversals, not findings).
 
+### `import_specifiers` (table)
+
+Per-specifier breakdown of the `imports.specifiers` JSON blob. One row per imported binding — `imported_name` (original) and `local_name` (renamed via `as`), `kind` in `{named, default, namespace}`, `is_type_only`, column-precise position. Powers specifier-precise rewrites and the `find-import-sites` recipe.
+
 ### `imports` (table)
 
 Raw `import` statements. `specifiers` is JSON-encoded; `resolved_path` is non-null only when the resolver could map `source` to an indexed file. See `ImportRow` and the resolved view `dependencies`.
@@ -319,6 +335,14 @@ Stdio MCP (Model Context Protocol) server exposing codemap's structural-query su
 
 MCP tool with no CLI counterpart — runs N read-only SQL statements in one round-trip. Items are `string | {sql, summary?, changed_since?, group_by?}`: bare strings inherit batch-wide flag defaults; object form overrides on a per-key basis. Output is an N-element array; per-element shape mirrors single-`query`'s output for that statement's effective flag set. Per-statement errors are isolated (failed statement returns `{error}` in its slot; siblings still execute). Distinct from making `query` accept `;`-delimited batches (rejected — would need a SQL tokenizer and would diverge `query`'s output shape from its CLI counterpart). SQL-only (no `recipe` polymorphism); `query_recipe_batch` is an additive future change if a real consumer asks.
 
+### `file_metrics` (table)
+
+Per-file aggregate metrics (one row per indexed TS/JS file): `total_lines`, `code_lines`, `blank_lines`, `comment_lines`, plus symbol-kind counts (`function_count`, `class_count`, `interface_count`, `export_count`). Line classification is regex-light per [Tier 11 ship report](./plans/substrate-extraction.md#tier-11--metrics-expansion-per-symbol--per-file).
+
+### `function_params` (table)
+
+First-class function parameters — one row per leaf parameter binding, ordered by `position`. Keyed by `(file_path, owner_name, owner_kind)` to disambiguate same-name function vs method. `type_text` is the stringified annotation; `default_text` is the raw default-expression source. Powers `find-by-param-type` recipe.
+
 ### markers
 
 `TODO` / `FIXME` / `HACK` / `NOTE` comments extracted from any indexed file (TS, CSS, Markdown, JSON, YAML, …). Stored in the `markers` table; surfaced by the `markers-by-kind` recipe. See `MarkerRow`.
@@ -371,6 +395,14 @@ Code that turns source bytes into structured rows. Three implementations: `parse
 
 A `docs/plans/<feature-name>.md` file tracking in-flight work. Created on commit; deleted when the feature ships per [README § Rule 3](./README.md#rules-for-agents).
 
+### `references` (table)
+
+Every identifier USE per [R.11], column-precise per [R.6]. `kind` in `{value, type, jsx, member}`; `is_write` distinguishes reads from writes per [R.13]; `scope_local_id` joins `scopes` in the same file. Native HTML JSX tags, attribute names, and long-hand object-literal keys are NOT emitted (they're not bindings). `kind='member'` rows DO emit for non-computed property access — they're skipped by the bindings resolver but available for consumers that want member-name positions.
+
+### `runtime_markers` (table)
+
+Operational signals worth auditing: `console.*` calls, `debugger` statements, `throw` statements, `process.env.X` accesses. `kind` in `{console, debugger, throw, process-env}`; `detail` carries the qualifier (method name, env-var name, truncated throw expression). Powers `find-leftover-console` + `env-var-audit` recipes.
+
 ### `pr-comment` (CLI verb)
 
 Markdown PR-summary renderer. `codemap pr-comment <input>` (or `-` for stdin) reads a `codemap audit --json` envelope or a `codemap query --format sarif` doc and emits a markdown comment suitable for `gh pr comment <PR> -F -`. Auto-detects shape via `runs[]` (SARIF) vs `deltas` (audit); `--shape audit|sarif` overrides. Audit-mode groups by delta with collapsed `<details>` for added + removed rows; SARIF-mode groups by `ruleId`. Lists >50 entries collapse to `… and N more`. `--json` envelope `{markdown, findings_count, kind}` is the structured form action.yml consumers read. Targets the surfaces SARIF → Code Scanning doesn't cover (private repos without GHAS, aggregate audit deltas without `file:line` anchors, bot-context seeding). v1.0 ships the (b) summary-comment shape; (c) inline-review comments deferred per Q4 of [`plans/github-marketplace-action.md`](./plans/github-marketplace-action.md). Engine: `application/pr-comment-engine.ts` (pure transport-agnostic).
@@ -440,6 +472,10 @@ One record in a SQLite table. Each table has a corresponding TS interface (`File
 
 ## S
 
+### `scopes` (table) / scope tracker
+
+Lexical scope graph per [R.11]. Composite PK `(file_path, local_id)` with module scope at `local_id = 0` and nested scopes incrementing. Kinds: `module / function / arrow / class / method / interface / type-alias / for / catch`. `parent_local_id` walks the chain. Anonymous scopes (callbacks, catch, for) get owner*symbol_name=NULL; the tracker (`createScopeTracker`) emits `$anon*<localId>`segments in`scopeStr`so sibling anons don't collide on`calls.caller_scope`. Foundation for bindings resolution.
+
 ### schema
 
 Conceptually, the structure of the SQLite database — every table, column, constraint, and index. Defined by **DDL** in `src/db.ts`. Versioned by **`SCHEMA_VERSION`**. Documented in [architecture § Schema](./architecture.md#schema).
@@ -498,6 +534,10 @@ TS shape for one row of the `symbols` table.
 
 ## T
 
+### `test_suites` (table)
+
+Test metadata — describe / it / test / suite / context blocks with skip/only/todo flags and detected `framework` in `{vitest, jest, bun-test, node-test, mocha, unknown}`. Framework detection is per-file from imports (mixed-framework codebases handled automatically). `parent_suite_id` resolves nested describes. Powers `find-skipped-tests` + `tests-by-file` recipes.
+
 ### targeted reindex
 
 Index mode that re-parses only the explicit file paths passed to `--files`. Skips git diff and the full glob. See `targetedReindex` in `src/application/index-engine.ts`.
 
@@ -69,13 +69,13 @@ Scenarios live in **`fixtures/golden/scenarios.json`** (Tier A) or optional **`s
 
 ## Status
 
-| Area                          | State                                                                                                                                                                                                                                                                                                                                                                       |
-| ----------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| Tier A runner + CI            | **`bun run test:golden`** in `check`                                                                                                                                                                                                                                                                                                                                        |
-| Tier A scenario coverage      | Scenarios across every indexed table — files, symbols, imports, exports, components, dependencies, markers, type_members, calls, CSS vars/classes/keyframes — plus bundled-recipe smoke tests. Current count: `bun src/index.ts query --json "SELECT COUNT(*) AS n FROM markers"` against the fixture, or count rows in [scenarios.json](../fixtures/golden/scenarios.json) |
-| Tier B external + schema      | **`test:golden:external`**, Zod in **`scripts/query-golden/schema.ts`**                                                                                                                                                                                                                                                                                                     |
-| Subset matchers + budgets     | **`match`**, **`budgetMs`**, **`--strict-budget`**                                                                                                                                                                                                                                                                                                                          |
-| Optional CI for public corpus | Deferred — [roadmap § Backlog](./roadmap.md#backlog)                                                                                                                                                                                                                                                                                                                        |
+| Area                          | State                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
+| ----------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| Tier A runner + CI            | **`bun run test:golden`** in `check`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
+| Tier A scenario coverage      | Scenarios across every indexed table — files, symbols, imports, import_specifiers, exports, components, dependencies, markers, type_members, calls, scopes, references, bindings, function_params, runtime_markers, test_suites, re_export_chains, module_cycles, file_metrics, CSS vars/classes/keyframes — plus bundled-recipe smoke tests. Current count: `bun src/index.ts query --json "SELECT COUNT(*) AS n FROM markers"` against the fixture, or count rows in [scenarios.json](../fixtures/golden/scenarios.json) |
+| Tier B external + schema      | **`test:golden:external`**, Zod in **`scripts/query-golden/schema.ts`**                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
+| Subset matchers + budgets     | **`match`**, **`budgetMs`**, **`--strict-budget`**                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
+| Optional CI for public corpus | Deferred — [roadmap § Backlog](./roadmap.md#backlog)                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
 
 ---