Commit 0d4ceb9
omc-grep: alpha-rename-invariant code archaeology CLI
A standalone binary that walks a tree, extracts every top-level fn,
canonicalizes each one, and clusters by canonical hash. The new
primitive is `--body-only` mode: hash only the fn body (drop name
and signature) to find fns with IDENTICAL code under DIFFERENT
names — something text-grep, ast-grep, and tree-sitter queries
can't do.
Findings on OMC's own examples tree (151 files, 2388 fns):
- 31.7% redundancy (name-sensitive canonical hash)
- 33.0% redundancy (--body-only)
- largest cluster: assert_eq @ 64 copies (test helper)
- alpha-renamed clusters surfaced by --body-only that the
name-sensitive pass missed include:
is_digit / is_digit_b / is_digit_t (19 fns, 3 names)
is_alpha / is_alpha_b (16 fns)
tkind / tok_kind (15 fns — refactor leftover)
arr_concat / arr_concat_b (14 fns)
_bucket_discrete / endpoint_bucket /
status_bucket (5 fns, 3 unrelated names)
The bucket-family cluster is the proof case: three domain-specific
names sharing NO token, but the canonical body matches exactly.
That's only findable via substrate-canonical addressing.
Implementation:
- omnimcode-cli/src/bin/omc_grep.rs (new bin target)
- extract_top_level_fns made pub in omnimcode-core
- Skips target/, node_modules/, .git/, __pycache__/, omc_modules/
- Flags: --body-only, --near N, --min-cluster K
- docs/omc_grep.md with the findings table
Builds without JIT or Python deps (clean omnimcode-core only).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>1 parent 84a4a16 commit 0d4ceb9
5 files changed
Lines changed: 502 additions & 3 deletions
File tree
- docs
- omnimcode-cli
- src/bin
- omnimcode-core/src
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
| 37 | + | |
| 38 | + | |
37 | 39 | | |
38 | 40 | | |
39 | 41 | | |
| |||
262 | 264 | | |
263 | 265 | | |
264 | 266 | | |
265 | | - | |
| 267 | + | |
266 | 268 | | |
267 | 269 | | |
268 | 270 | | |
| |||
351 | 353 | | |
352 | 354 | | |
353 | 355 | | |
| 356 | + | |
354 | 357 | | |
355 | 358 | | |
356 | 359 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
24 | 31 | | |
25 | 32 | | |
26 | 33 | | |
| |||
0 commit comments