MinishLab
diff --git a/astro.config.mjs b/astro.config.mjs
Lines changed: 10 additions & 10 deletions
diff --git a/src/content/docs/packages/overview/index.mdx b/src/content/docs/packages/overview/index.mdx
Lines changed: 22 additions & 22 deletions
diff --git a/src/content/docs/packages/semble/installation.mdx b/src/content/docs/packages/semble/installation.mdx
Lines changed: 40 additions & 11 deletions
diff --git a/src/content/docs/packages/semble/introduction.mdx b/src/content/docs/packages/semble/introduction.mdx
Lines changed: 43 additions & 40 deletions
@@ -73,6 +73,16 @@ gtag('config', 'G-LQWDNXKF2X');`,
                 { label: 'Integrations',  link: '/packages/model2vec/integrations/' },
               ],
             },
+            {
+              label: 'Semble',
+              items: [
+                { label: 'Introduction',   link: '/packages/semble/introduction/' },
+                { label: 'Installation',   link: '/packages/semble/installation/' },
+                { label: 'MCP Server',     link: '/packages/semble/mcp-server/' },
+                { label: 'CLI / AGENTS.md', link: '/packages/semble/usage/' },
+                { label: 'Benchmarks',     link: '/packages/semble/benchmarks/' },
+              ],
+            },
             {
               label: 'SemHash',
               items: [
@@ -94,16 +104,6 @@ gtag('config', 'G-LQWDNXKF2X');`,
                 { label: 'Supported Backends', link: '/packages/vicinity/supported-backends/' },
               ],
             },
-            {
-              label: 'Semble',
-              items: [
-                { label: 'Introduction',  link: '/packages/semble/introduction/' },
-                { label: 'Installation',  link: '/packages/semble/installation/' },
-                { label: 'Usage',         link: '/packages/semble/usage/' },
-                { label: 'MCP Server',    link: '/packages/semble/mcp-server/' },
-                { label: 'Benchmarks',    link: '/packages/semble/benchmarks/' },
-              ],
-            },
             {
               label: 'Tokenlearn',
               items: [
 
@@ -28,6 +28,28 @@ tableOfContents: false
     </div>
   </article>
 
+  <article class="overview-item">
+    <div class="overview-item-top">
+      <img class="overview-icon" src="/images/logos/semble_logo.webp" alt="Semble" loading="lazy" />
+      <div class="overview-copy">
+        <h2><a href="/packages/semble/introduction/">Semble</a></h2>
+        <p>Fast and accurate code search for agents.</p>
+      </div>
+    </div>
+    <div class="overview-item-bottom">
+      <div class="overview-tags">
+        <span class="overview-tag">Code Search</span>
+        <span class="overview-tag">MCP Server</span>
+        <span class="overview-tag">Agents</span>
+        <span class="overview-tag">Python</span>
+      </div>
+      <div class="overview-actions">
+        <a class="overview-link overview-link-primary" href="/packages/semble/introduction/">Docs</a>
+        <a class="overview-link" href="https://github.com/minishlab/semble">Repo</a>
+      </div>
+    </div>
+  </article>
+
   <article class="overview-item">
     <div class="overview-item-top">
       <img class="overview-icon" src="/images/logos/semhash_logo.webp" alt="SemHash" loading="lazy" />
@@ -71,28 +93,6 @@ tableOfContents: false
     </div>
   </article>
 
-  <article class="overview-item">
-    <div class="overview-item-top">
-      <img class="overview-icon" src="/images/logos/semble_logo.webp" alt="Semble" loading="lazy" />
-      <div class="overview-copy">
-        <h2><a href="/packages/semble/introduction/">Semble</a></h2>
-        <p>Fast and accurate code search for agents.</p>
-      </div>
-    </div>
-    <div class="overview-item-bottom">
-      <div class="overview-tags">
-        <span class="overview-tag">Code Search</span>
-        <span class="overview-tag">MCP Server</span>
-        <span class="overview-tag">Agents</span>
-        <span class="overview-tag">Python</span>
-      </div>
-      <div class="overview-actions">
-        <a class="overview-link overview-link-primary" href="/packages/semble/introduction/">Docs</a>
-        <a class="overview-link" href="https://github.com/minishlab/semble">Repo</a>
-      </div>
-    </div>
-  </article>
-
   <article class="overview-item">
     <div class="overview-item-top">
       <img class="overview-icon" src="/images/logos/tokenlearn_logo.webp" alt="Tokenlearn" loading="lazy" />
 
@@ -1,37 +1,66 @@
 ---
 title: Installation
-description: How to install Semble
+description: Install Semble, set up the MCP server, and scaffold a sub-agent
 sidebar:
   icon: seti:config
 ---
 
+Setting up Semble involves three steps that are independent of each other. We recommend doing all three, but you can pick and choose based on your needs:
+
+1. [Install Semble](#1-install-semble) (for the CLI and AGENTS.md flow).
+2. [Set up the MCP server](#2-mcp-server) (so your top-level agent can call Semble as a tool).
+3. [Install the sub-agent](#3-sub-agent) (so sub-agents, which can't call MCP tools, can still search).
+
 ## Requirements
 
-- Python 3.10 or higher
+- Python 3.10 or higher.
+- [uv](https://docs.astral.sh/uv/getting-started/installation/) (recommended for all three flows).
 - No GPU, API keys, or external services required. Runs fully on CPU.
 
-## Install
+## 1. Install Semble
+
+Install Semble with [`uv`](https://docs.astral.sh/uv/) (recommended) or `pip`:
 
 ```bash
-pip install semble
+uv tool install semble   # Recommended
+pip install semble       # Or with pip
 ```
 
-Or with [uv](https://docs.astral.sh/uv/):
+This gives you the [`semble` CLI](/packages/semble/usage/).
+
+### Optional: wire it into AGENTS.md
+
+Once installed, drop the [AGENTS.md snippet](/packages/semble/usage/#agentsmd-snippet) into your `AGENTS.md`, `CLAUDE.md`, `GEMINI.md`, or equivalent. This teaches any agent (including sub-agents) when to reach for `semble` instead of grep, and is the only setup needed for harnesses without MCP support.
+
+## 2. MCP Server
+
+Install Semble as an [MCP server](/packages/semble/mcp-server/) for Claude Code:
 
 ```bash
-uv add semble
+claude mcp add semble -s user -- uvx --from "semble[mcp]" semble
 ```
 
-## MCP Server Extra
+For other agents (Cursor, Codex, OpenCode, VS Code, Copilot CLI, Windsurf, Gemini, Kiro, Zed), see [MCP Server](/packages/semble/mcp-server/) for the per-harness config snippet.
 
-To use Semble as an [MCP server](/packages/semble/mcp-server/) with agents like Claude Code, Cursor, or OpenCode, install the `mcp` extra:
+## 3. Sub-agent
+
+Sub-agents typically cannot call MCP tools directly. To give a sub-agent access to Semble, run `semble init` once in your project root to scaffold a dedicated search sub-agent for your harness:
 
 ```bash
-pip install "semble[mcp]"
+semble init                      # Claude Code  → .claude/agents/semble-search.md
+semble init --agent gemini       # Gemini CLI   → .gemini/agents/semble-search.md
+semble init --agent cursor       # Cursor       → .cursor/agents/semble-search.md
+semble init --agent opencode     # OpenCode     → .opencode/agents/semble-search.md
+semble init --agent copilot      # Copilot CLI  → .github/agents/semble-search.md
+semble init --agent kiro         # Kiro         → .kiro/agents/semble-search.md
 ```
 
-Or, use [uvx](https://docs.astral.sh/uv/guides/tools/) to run it without a permanent install:
+If `semble` is not on `$PATH`, prefix the command with `uvx --from "semble[mcp]"`.
+
+## Updating Semble
 
 ```bash
-uvx --from "semble[mcp]" semble
+uv tool upgrade semble         # with uv
+pip install --upgrade semble   # with pip
+uv cache clean semble          # for MCP users (restart your MCP client after)
 ```
@@ -5,65 +5,68 @@ sidebar:
   icon: open-book
 ---
 
-[Semble](https://github.com/MinishLab/semble) is a code search library built for agents. It returns the exact code snippets they need instantly, using ~98% fewer tokens than grep+read and cutting latency on every step. Indexing and searching a full codebase end-to-end takes under a second, with ~200x faster indexing and ~10x faster queries than a code-specialized transformer, at 99% of its retrieval quality (see [benchmarks](/packages/semble/benchmarks/)). Everything runs on CPU with no API keys, GPU, or external services.
+[Semble](https://github.com/MinishLab/semble) is a code search library built for agents. It returns the exact code snippets they need instantly, using ~98% fewer tokens than grep+read. Indexing and searching a full codebase end-to-end takes under a second, with ~200x faster indexing and ~10x faster queries than a code-specialized transformer, at 99% of its retrieval quality (see [benchmarks](/packages/semble/benchmarks/)). Everything runs on CPU with no API keys, GPU, or external services. Run it as an [MCP server](/packages/semble/mcp-server/) or call it from the shell via [AGENTS.md](/packages/semble/usage/), and any agent (Claude Code, Cursor, Codex, OpenCode, etc.) gets instant access to any repo.
 
-Run it as an [MCP server](/packages/semble/mcp-server/) and any agent (Claude Code, Cursor, Codex, OpenCode, etc.) gets instant access to any repo, cloned and indexed on demand.
+## Quickstart
 
-## Quick Start
+Your agent queries Semble in natural language (e.g. `"How is authentication handled?"`) and gets back only the relevant code snippets, without grepping or reading full files. You can set it up as an MCP server or via AGENTS.md. First, install [uv](https://docs.astral.sh/uv/getting-started/installation/) if you don't have it yet.
 
-Install Semble:
 
-```bash
-pip install semble  # Install with pip
-uv add semble       # Install with uv
-```
-
-Index a repo and search it:
+### MCP (Claude Code)
 
-```python
-from semble import SembleIndex
+Add Semble to Claude Code (requires [uv](https://docs.astral.sh/uv/getting-started/installation/)):
 
-# Index a local directory
-index = SembleIndex.from_path("./my-project")
+```bash
+claude mcp add semble -s user -- uvx --from "semble[mcp]" semble
+```
 
-# Index a remote git repository
-index = SembleIndex.from_git("https://github.com/MinishLab/model2vec")
+Using another agent harness? See [MCP Server](/packages/semble/mcp-server/) for per-agent setup.
 
-# Search with a natural-language or code query
-results = index.search("save model to disk", top_k=3)
+### Bash / AGENTS.md
 
-# Find code similar to a specific result
-related = index.find_related(results[0], top_k=3)
+[Install Semble](/packages/semble/installation/), then add the [AGENTS.md snippet](/packages/semble/usage/#agentsmd-snippet) to your `AGENTS.md`, `CLAUDE.md`, or equivalent. This works for any agent and is the only option for sub-agents, which typically cannot call MCP tools directly.
 
-# Each result exposes the matched chunk
-result = results[0]
-result.chunk.file_path   # "model2vec/model.py"
-result.chunk.start_line  # 127
-result.chunk.end_line    # 150
-result.chunk.content     # "def save_pretrained(self, path: PathLike, ..."
+```bash
+uv tool install semble   # Install with uv (recommended)
+pip install semble       # Or install with pip
 ```
 
+
 ## Main Features
 
-- **Fast**: indexes a repo in ~250 ms and answers queries in ~1.5 ms, all on CPU.
+- **Fast**: indexes an average repo in ~250 ms and answers queries in ~1.5 ms, all on CPU.
 - **Accurate**: NDCG@10 of 0.854 on the [benchmarks](/packages/semble/benchmarks/), on par with code-specialized transformer models at a fraction of the size and cost.
-- **Local and remote**: pass a local path or a git URL; indexes are cached for the session.
-- **MCP server**: drop-in tool for Claude Code, Cursor, Codex, OpenCode, and any other MCP-compatible agent.
+- **Token-efficient**: returns only the relevant chunks, using [~98% fewer tokens than grep+read](/packages/semble/benchmarks/#token-efficiency).
 - **Zero setup**: runs on CPU with no API keys, GPU, or external services required.
+- **MCP server**: works with Claude Code, Cursor, Codex, OpenCode, VS Code, and any other MCP-compatible agent.
+- **Local and remote**: pass a local path or a git URL.
 
-## How It Works
-
-Semble splits each file into code-aware chunks using [Chonkie](https://github.com/chonkie-inc/chonkie), then scores every query with two complementary retrievers:
+## How it works
 
-- **Semantic**: static [Model2Vec](https://github.com/MinishLab/model2vec) embeddings from the code-specialized [potion-code-16M](https://huggingface.co/minishlab/potion-code-16M) model.
-- **Lexical**: [BM25](https://github.com/xhluca/bm25s) for exact matches on identifiers and API names.
+Semble splits each file into code-aware chunks using [tree-sitter](https://github.com/tree-sitter/py-tree-sitter), then scores every query against the chunks with two complementary retrievers: static [Model2Vec](https://github.com/MinishLab/model2vec) embeddings using the code-specialized [potion-code-16M](https://huggingface.co/minishlab/potion-code-16M) model for semantic similarity, and [BM25](https://github.com/xhluca/bm25s) for lexical matches on identifiers and API names. The two score lists are fused with Reciprocal Rank Fusion (RRF).
 
-The two score lists are fused with Reciprocal Rank Fusion (RRF) and then reranked with a set of code-aware signals:
+After fusing, results are reranked with a set of code-aware signals:
 
-- **Adaptive weighting**: symbol-like queries (`Foo::bar`, `getUserById`) get more lexical weight; natural-language queries stay balanced.
-- **Definition boosts**: a chunk that defines the queried symbol (`class`, `def`, `func`) ranks above chunks that merely reference it.
-- **Identifier stems**: query tokens are stemmed and matched against identifier stems, so `parse config` boosts chunks containing `parseConfig`, `ConfigParser`, or `config_parser`.
-- **File coherence**: when multiple chunks from the same file match, the file is boosted so the top result reflects broad file-level relevance.
-- **Noise penalties**: test files, `compat`/`legacy` shims, example code, and `.d.ts` stubs are down-ranked so canonical implementations surface first.
+- **Adaptive weighting.** Symbol-like queries (`Foo::bar`, `_private`, `getUserById`) get more lexical weight, while natural-language queries stay balanced between semantic and lexical retrievers.
+- **Definition boosts.** A chunk that defines the queried symbol (a `class`, `def`, `func`, etc.) is ranked above chunks that merely reference it.
+- **Identifier stems.** Query tokens are stemmed and matched against identifier stems in a chunk, giving additional weight to chunks that contain them. For example, querying `parse config` boosts chunks containing `parseConfig`, `ConfigParser`, or `config_parser`.
+- **File coherence.** When multiple chunks from the same file match the query, the file is boosted so the top result reflects broad file-level relevance rather than a single out-of-context chunk.
+- **Noise penalties.** Test files, `compat/`/`legacy/` shims, example code, and `.d.ts` declaration stubs are down-ranked so canonical implementations surface first.
 
 Because the embedding model is static with no transformer forward pass at query time, all of this runs in milliseconds on CPU.
+
+## Citing
+
+If you use Semble in your research, please cite the following:
+
+```bibtex
+@software{minishlab2026semble,
+  author       = {{van Dongen}, Thomas and Stephan Tulkens},
+  title        = {Semble: Fast and Accurate Code Search for Agents},
+  year         = {2026},
+  publisher    = {Zenodo},
+  doi          = {10.5281/zenodo.19785932},
+  url          = {https://github.com/MinishLab/semble},
+  license      = {MIT}
+}
+```
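
Note on the "How it works" text in `introduction.mdx`: the paragraph says the semantic and lexical score lists are fused with Reciprocal Rank Fusion (RRF). For readers unfamiliar with the technique, the following is a minimal sketch of plain RRF. It is not Semble's implementation: the function name `reciprocal_rank_fusion`, the chunk identifiers, and the constant `k=60` (a commonly used default for RRF) are assumptions chosen for illustration.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of chunk ids into one list, best first.

    Each chunk receives 1 / (k + rank) from every list it appears in,
    where rank starts at 1. Chunks missing from a list get nothing from it.
    The default k=60 is an assumed, conventional value.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id so output is deterministic.
    return sorted(scores, key=lambda chunk_id: (-scores[chunk_id], chunk_id))


# Hypothetical rankings from a semantic and a lexical retriever.
semantic = ["model.py:127", "utils.py:10", "io.py:42"]
lexical = ["io.py:42", "model.py:127", "README.md:1"]

fused = reciprocal_rank_fusion([semantic, lexical])
print(fused)
```

Because RRF uses only ranks, the two retrievers do not need comparable score scales; a chunk ranked well by both lists rises above one ranked well by only one.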