You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: website/src/pages/docs/comparisons.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -150,7 +150,7 @@ The remaining gaps after v1.5:
150
150
151
151
-**Data-flow / taint analysis** — requires full CPG; not in scope for LIP's current architecture. SCIP remains the right tool for taint tracking and security audit workflows.
152
152
-**Generics / trait bounds / overload resolution for TypeScript, Python, Dart** — rust-analyzer exposes this via Tier 2 for Rust. The other three languages rely on hover text parsing; deep generic instantiation is not yet extracted as structured relationships.
153
-
-**Language coverage** — Tier 2 covers 4 languages; SCIP indexers exist for 15+. Go, Java, Kotlin, and C# are on the v2.0 roadmap.
153
+
-**Language coverage** — Tier 2 covers 8 languages (Rust, Go, TypeScript, Python, Dart, Kotlin, Swift, C/C++); SCIP indexers exist for 15+. Javaand C# remain on the roadmap.
These tools require `LIP_EMBEDDING_URL` to be set unless otherwise noted.
360
+
These tools require `LIP_EMBEDDING_URL` to be set unless otherwise noted. The full table above lists every tool; the most commonly-used ones are spelled out below.
339
361
340
-
### lip_reindex_files
362
+
### lip_nearest_by_contrast
341
363
342
-
Force a re-index of specific file URIs from disk. Does not require embeddings — reads each file, detects its language, and updates the symbol graph.
364
+
Contrastive search — find files similar to `like_uri` but different from `unlike_uri`. Useful when two concepts are close in raw vector space and you want to disambiguate.
Use this after out-of-band changes that the daemon's file watcher didn't catch (e.g. selective `git checkout` or CI-generated files). Returns `DeltaAck`.
350
-
351
371
---
352
372
353
-
### lip_similarity
373
+
### lip_similarity_matrix
354
374
355
-
Pairwise cosine similarity of two stored embeddings.
375
+
Pairwise cosine similarities for a list of files in a single call. Returns a labelled N×N matrix.
Returns `null` when either URI has no cached embedding — call `lip_embedding_batch` first. Accepts both `file://` (file embeddings) and `lip://` (symbol embeddings) URIs. Safe inside `lip_batch_query`.
368
-
369
382
---
370
383
371
-
### lip_query_expansion
384
+
### lip_outliers
372
385
373
-
Embed a short query string and return the display names of the nearest symbols — useful before `lip_workspace_symbols` when the exact symbol name isn't known.
386
+
Identify semantically misplaced files within a set. For each URI, computes its leave-one-out mean cosine similarity to the other set members and ranks the lowest as outliers.
Requires symbols to have embeddings in the symbol store (populate with `lip_embedding_batch` using `lip://` URIs).
390
-
391
393
---
392
394
393
-
### lip_cluster
395
+
### lip_find_boundaries
394
396
395
-
Group a list of URIs into clusters based on embedding proximity.
397
+
Detect semantic boundaries within a single file by chunking it into line-windows and embedding each window. Returns the line positions where adjacent windows diverge most — useful for splitting overgrown files or finding logical sections.
Group 1: file:///src/auth.rs file:///src/session.rs
413
-
Group 2: file:///src/payments.rs file:///src/invoices.rs
414
-
```
415
-
416
-
`radius` is the cosine-similarity threshold. Two URIs land in the same group when their similarity is ≥ the radius. URIs without a cached embedding are silently excluded.
417
-
418
404
---
419
405
420
-
### lip_export_embeddings
406
+
### lip_explain_match
421
407
422
-
Return the raw stored embedding vectors for a list of URIs. Useful for passing to external re-ranking, custom clustering, or visualization tools.
408
+
Explain *why* a result was a strong semantic match for a query. Chunks the result file into windows, embeds each, and returns the top-scoring chunks with their line ranges.
Report embedding coverage under a filesystem path: how many indexed files have embeddings, how many don't. Useful as a CI gate or before running semantic queries.
URIs with no cached embedding are omitted from the result. Safe inside `lip_batch_query`.
426
+
---
427
+
428
+
### lip_prune_deleted
429
+
430
+
Remove index entries for files that no longer exist on disk. On long-running daemons in repos with high churn, ghost embeddings accumulate and pollute nearest-neighbour searches.
431
+
432
+
**Input:**`{}` (no arguments)
438
433
439
434
---
440
435
@@ -503,8 +498,8 @@ lip_file_status(target_uri) # confirm specific file is indexed and fresh
503
498
|`findReferences`|`lip_references`| Same semantics |
0 commit comments