docs: add detailed execute_cell security paradigm to ARCHITECTURE.md

snehshah22 · snehshah22 · commit fe33d4826e7b · 2026-04-09T21:00:28.000-07:00
diff --git a/notebook_mcp/ARCHITECTURE.md b/notebook_mcp/ARCHITECTURE.md
@@ -0,0 +1,139 @@
+# Notebook MCP Server - Architectural Design Rationale
+
+This document outlines the design necessities, modes of operation, crucial architectural decisions, and limitations of the Notebook MCP server.
+
+---
+
+## Overview & Purpose
+
+The Notebook MCP (Model Context Protocol) server acts as a bridge between Agentic AI capabilities and Jupyter Notebook operations (read, write, execute). Its primary goal is to allow AI agents to safely inspect, edit, and execute code within notebooks, whether running in a standalone local environment or deeply integrated with an active IDE session.
+
+---
+
+## Design Necessity: Zero-Config Static Distribution
+
+A core requirement of this design is a **simple, static `mcpserver` configuration** that we can ship directly with our Agent CLI plugins.
+
+We plan to support plugins for various CLI agents, and those agents accept only static `mcpserver` configurations inside a plugin. To keep setup automatic, end users must not have to clone repositories manually or configure absolute system paths. The desired configuration looks like this:
+
+```json
+"notebook-tools": {
+    "command": "npx",
+    "args": [
+        "-y",
+        "github:gemini-cli-extensions/data-cloud-extension"
+    ]
+}
+```
+
+### How Our Solution Solves This
+To support this static configuration running via `npx` (which downloads the code into a cache directory whose location is not known in advance), we:
+1. **Bundled the Proxy:** We placed the compiled `mcp_proxy_bundle.cjs` directly inside the repository (`notebook_mcp/bin/`).
+2. **Relative Path Resolution:** The server computes the path to the proxy dynamically using `import.meta.url` (ESM mode).
+
+This allows the server to find its own resources inside the random `npx` cache without any absolute path strings being passed from the host machine.
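+
+A minimal sketch of the relative path resolution described above. The helper name `resolveProxyPath` and the `../bin/` layout relative to the server module are assumptions for illustration, not taken from the actual server code:
+
+```typescript
+import { fileURLToPath } from "node:url";
+
+// Hypothetical helper: resolve the bundled proxy relative to the module's own URL.
+// In the real server, `moduleUrl` would be `import.meta.url`.
+export function resolveProxyPath(moduleUrl: string): string {
+  // Assumed layout: <pkg>/dist/index.js alongside <pkg>/bin/mcp_proxy_bundle.cjs
+  return fileURLToPath(new URL("../bin/mcp_proxy_bundle.cjs", moduleUrl));
+}
+```
+
+Calling `resolveProxyPath(import.meta.url)` from an ES module yields a path inside whatever directory `npx` unpacked the package into, so no host-specific path has to appear in the configuration.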
+
+---
+
+## Modes of Operation
+
+The server supports two distinct modes depending on how the CLI agent is being executed:
+
+### 1. CLI Agent running in a Standalone Terminal
+* **Context:** The user opens a normal terminal and runs the agent. No IDE extension is wrapping the execution.
+* **Tools Served:** The server serves **Standalone Tools**. These tools directly read and write `.ipynb` files on the disk using Node filesystem APIs.
+* **Scope:** Can only target saved files. Cannot see or manipulate unsaved memory buffers in an IDE tab.
+
+### 2. CLI Agent running in an IDE's Terminal
+* **Context:** The user triggers the agent from within an extension tab or a terminal spawned by the IDE (like VS Code, Antigravity, or Jetski).
+* **Tools Served:** The server serves **Proxied Tools**. It acts as a client to a domain socket/named pipe opened by the IDE extension, forwarding all tool calls to the IDE.
+* **Scope:** Powerful. Can see unsaved changes in active tabs, trigger executions in the IDE kernel, and interact with visual elements.
+
+---
+
+## Key Features & Tools
+
+The server exposes a rich set of tools for notebook manipulation:
+* **Cell CRUD Operations:** Add, update, delete, and list cells in a notebook.
+* **Content Inspection:** Retrieve the full content of specific cells or the entire notebook.
+* **Execution (IDE Mode only):** Trigger execution of cells and capture results via the IDE's active kernel.
+
+---
+
+## Resilience: Standalone Fallback
+
+To ensure the agent always has access to tools, the server implements a fallback mechanism:
+* When triggered in IDE mode, it attempts to connect to the IDE's domain socket.
+* It retries up to **5 times** with a **1-second delay** between attempts.
+* If the connection still fails (e.g., extension not yet loaded or IDE not running), it **automatically falls back** to serving the Standalone Tools over stdio.
+* This bounds the startup wait to a few seconds, so the agent is never left hanging while the IDE is unavailable.
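+
+The fallback above could be sketched roughly as follows. This is an illustration only: `chooseServingMode` and the `connect` callback are hypothetical names, and the sketch assumes "5 times" means five attempts in total.
+
+```typescript
+// Hypothetical sketch: `connect` stands in for whatever opens the IDE socket.
+export type ServingMode = "proxied" | "standalone";
+
+const sleep = (ms: number): Promise<void> =>
+  new Promise((resolve) => setTimeout(resolve, ms));
+
+export async function chooseServingMode(
+  connect: () => Promise<void>,
+  maxAttempts = 5, // assumption: five attempts in total
+  delayMs = 1000,
+): Promise<ServingMode> {
+  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
+    try {
+      await connect();
+      return "proxied"; // socket reachable: forward tool calls to the IDE
+    } catch {
+      // Wait before the next attempt, but not after the final one.
+      if (attempt < maxAttempts) await sleep(delayMs);
+    }
+  }
+  return "standalone"; // every attempt failed: serve filesystem tools over stdio
+}
+```
+
+Because the loop is bounded, the worst-case wait is the sum of the delays plus however long each failed `connect` call takes.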
+
+---
+
+## 🔒 Limitations of Standalone Mode
+
+Users and developers must be aware of the strict boundaries of the Standalone Mode:
+* **No IDE API Access:** Standalone tools run as a plain child process that operates on the filesystem. They have **no access to VS Code or IDE platform APIs**.
+* **Saved Files Only:** They cannot read or write to unsaved memory buffers in the user's editor. They only process the **last saved copy** found on disk. Unsaved edits in the IDE will be invisible to the standalone tools until the user saves them.
+
+---
+
+## Crucial Design Decisions
+
+### Decision 1: Target Detection (Process Tree vs Env Vars)
+
+The server must decide whether to connect to the socket (IDE mode) or fall back to standalone file manipulation.
+
+* **Option A: Process Tree Inspection**
+  * **Pros:** Theoretically zero-config; doesn't rely on terminal environment inheritance.
+  * **Cons:** Fragile. Matching arbitrary wrapper shell names is complex, and detection fails if intermediate tools obscure the process ancestry.
+* **Option B: Environment Variables**
+  * **Pros:** Unambiguous when the variable is set. Simple. Zero overhead.
+  * **Cons:** Relies on the host IDE extension injecting the environment variables (like `DATA_CLOUD_CURR_IDE_NAME`) into the spawned terminal session.
+* **Choice:** **Environment Variables**. Since our IDE extension spawns the agent terminals itself, it can easily set the required indicator variable. The server falls back to Standalone Mode if the variable is missing.
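+
+A minimal sketch of the environment-variable check, assuming the presence of `DATA_CLOUD_CURR_IDE_NAME` is the only signal. The function name, the `DetectedMode` type, and the treatment of an empty value as "missing" are assumptions made for illustration:
+
+```typescript
+// The variable name comes from the text above; everything else is illustrative.
+const IDE_NAME_VAR = "DATA_CLOUD_CURR_IDE_NAME";
+
+export type DetectedMode =
+  | { kind: "ide"; ideName: string }
+  | { kind: "standalone" };
+
+export function detectMode(
+  env: Record<string, string | undefined>,
+): DetectedMode {
+  const ideName = env[IDE_NAME_VAR]?.trim();
+  // Missing or empty variable: treat the session as a plain terminal.
+  return ideName ? { kind: "ide", ideName } : { kind: "standalone" };
+}
+```
+
+In a Node process this would typically be called as `detectMode(process.env)`. Taking the environment as a parameter keeps the check trivially testable.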
+
+### Decision 2: Location of the Proxy Bundle
+
+How should the server locate the `mcp_proxy_bundle` script needed to connect to the socket?
+
+* **Option A: Read from the local machine path (passed by env var)**
+  * **Pros:** Ensures the proxy matches exactly the version of the extension installed on the machine.
+  * **Cons:** Breaks zero-config `npx` setups. Requires the extension to find its own installation path and pass it down.
+* **Option B: Bundle the Proxy in the Repository**
+  * **Pros:** Completely self-contained. Works immediately with `npx` retrieval.
+  * **Cons:** Requires engineering effort to keep the bundled proxy in sync with changes in the extension.
+* **Choice:** **Bundle in the Repository**. The benefit of instant zero-configuration execution via `npx` outweighed the burden of occasional synchronization maintenance.
+
+---
+
+## Security Constraints
+
+* **No Process Scanning:** The server does not scan the host machine's process tree, which protects user privacy and avoids failures in environments that restrict such access.
+* **Isolated Env Tags:** It relies only on the environment variables explicitly declared and passed to it by the IDE.
+
+### Security of `execute_cell` Tool
+
+The `execute_cell` tool involves running code on the user's compute resources, requiring a strict security paradigm:
+
+* **IDE Exclusive:** The standalone MCP server does **not** offer this tool. It is exclusively provided by the MCP server running in the IDE through the Data Cloud extension.
+* **Server-Side Enforcement:** Operations obey all security checks put in place on the server side by the Data Cloud extension.
+* **Opt-In (Disabled by Default):** The tool is **disabled by default** and is not served by the MCP server to its clients until the user manually enables it in the extension's settings panel.
+* **Runtime Verification:** The server re-checks this setting every time the tool is invoked. If the tool is disabled after having been enabled (and the user has not refreshed the window, so the tool is still listed in the served registry), it refuses to execute.
+* **Explicit Elicitation (User Consent):**
+  * We support **elicitation** for the `execute_cell` tool on a best-effort basis (depending on MCP client compatibility).
+  * **Client Compatibility:** If the third-party MCP client does not support elicitation (consent prompts), the tool will **not work**.
+  * **Forced Consent Prompts:** If the client supports elicitation, the server asks for **explicit user consent** every time the agent calls the tool (even if the agent is operating in autonomous "YOLO" mode).
+  * **Override Toggle:** There is a secondary toggle setting in the extension that can be used to disable these forced execution elicitations. However, elicitations are strictly **on by default**.
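+
+The rules above can be read as a small decision function. The sketch below is illustrative only: all names are hypothetical, the consent prompt is modeled as a synchronous callback (a real elicitation round-trip would be asynchronous), and it assumes that turning off forced elicitation skips the prompt, and the client-support check, entirely.
+
+```typescript
+// All names here are hypothetical; they model the rules listed above.
+export interface ExecuteCellContext {
+  toolEnabled: boolean; // extension setting, re-read on every call
+  elicitationRequired: boolean; // on by default; the override toggle turns it off
+  clientSupportsElicitation: boolean;
+  askUser: () => boolean; // stands in for the elicitation round-trip
+}
+
+export type Verdict =
+  | "run"
+  | "refuse:disabled"
+  | "refuse:no-elicitation"
+  | "refuse:declined";
+
+export function authorizeExecuteCell(ctx: ExecuteCellContext): Verdict {
+  // Runtime verification: a disabled tool refuses even if still registered.
+  if (!ctx.toolEnabled) return "refuse:disabled";
+  // Assumption: the override toggle removes the consent requirement entirely.
+  if (!ctx.elicitationRequired) return "run";
+  // Consent is required but the client cannot prompt: the tool does not work.
+  if (!ctx.clientSupportsElicitation) return "refuse:no-elicitation";
+  // Ask on every call, regardless of the agent's autonomy mode.
+  return ctx.askUser() ? "run" : "refuse:declined";
+}
+```
+
+Ordering the checks this way means the enablement setting always wins, and the user is never prompted for a tool that is already switched off.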
+
+---
+
+## Development & Maintenance Guide
+
+### Building the Bundle
+The project uses `esbuild` to compile `server.ts` into a single executable file at `dist/index.js`.
+```bash
+npm run bundle
+```
+
+### ESM vs CommonJS
+* The server uses **ESM (ES Modules)**, which lets it locate its own files via `import.meta.url`.
+* The bundled proxy (`mcp_proxy_bundle.cjs`) stays **CommonJS**. The `.cjs` extension makes Node load it as CommonJS regardless of the package's module type, which avoids conflicts with the ESM server.