Add MCP server; support selfextracted API by doomedraven · Pull Request #2917 · kevoreilly/CAPEv2

doomedraven · 2026-02-17T11:56:39Z

Introduce a FastMCP-based MCP server (web/mcp_server.py) to interact with CAPE via tools for submitting files/URLs, querying tasks, downloading artifacts, and listing machines; add MCP docs (MCP.md) and optional dependency group (pyproject.toml). Add support for downloading "selfextracted" files: new API endpoints and handler (web/apiv2/urls.py, web/apiv2/views.py), default config toggles (conf/default/api.conf.default), and UI entries (web/templates/apiv2/index.html). Implement Windows analyzer reboot persistence/handler (analyzer/windows/analyzer.py) using RunOnce registry key and reboot trigger. Update guest docs with AutoLogon guidance (docs/.../additional_configuration.rst).

gemini-code-assist · 2026-02-17T11:57:05Z

Summary of Changes

Hello @doomedraven, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances CAPE Sandbox's automation and integration capabilities. It introduces a new Model Context Protocol (MCP) server, allowing external AI agents to programmatically control and query the sandbox, thereby streamlining analysis workflows. Concurrently, the API has been expanded to facilitate the retrieval of 'selfextracted' files, crucial for in-depth malware analysis. Furthermore, the changes improve the resilience of Windows analysis by enabling the analyzer to persist across reboots, paving the way for more comprehensive and multi-stage analysis scenarios.

Highlights

New MCP Server for AI Integration: A new Model Context Protocol (MCP) server has been introduced, built on FastMCP, enabling AI agents to interact directly with CAPE Sandbox. This server exposes a wide range of tools for task submission, management, reporting, and artifact downloads.
Self-Extracted Files API Support: New API endpoints and a dedicated handler have been added to support the downloading of 'selfextracted' files from analysis tasks. This includes updates to API configuration and the web UI to expose this functionality.
Windows Analyzer Reboot Persistence: The Windows analyzer now includes functionality to persist across system reboots. This is achieved by utilizing the RunOnce registry key to ensure the analyzer restarts and continues its operation after a VM reboot.
Guest VM AutoLogon Documentation: Documentation for guest VM configuration has been updated to include instructions on enabling AutoLogon for Windows, which is essential for features like reboot analysis to function correctly.
New Optional Dependencies: The pyproject.toml file has been updated to include a new optional dependency group mcp, adding fastmcp and httpx as required libraries for the new MCP server.

Changelog

MCP.md
- Documented the new CAPE Sandbox MCP server, detailing its features for task submission, management, reporting, and various file downloads.
- Provided installation instructions for the mcp extra dependency group.
- Outlined configuration requirements, including CAPE_API_URL and CAPE_API_TOKEN environment variables.
- Included examples for running the server and integrating it with Claude Desktop.
- Added a security note regarding the exposure of sandbox capabilities to AI agents.
analyzer/windows/analyzer.py
- Implemented a handle_reboot method to establish analyzer persistence across reboots using Windows RunOnce registry keys.
- Added logic to formulate the command for re-executing the analyzer after a reboot, ensuring the correct working directory.
- Included error handling for setting the persistence key and initiated a system reboot using shutdown command.
- Introduced a _handle_reboot method to process reboot requests received from the monitored process, calling the new handle_reboot functionality.
conf/default/api.conf.default
- Added a new configuration section [taskselfextracted] to control the API endpoint for downloading self-extracted files.
- Set default values for enabled, auth_only, rps, and rpm for the taskselfextracted endpoint.
docs/book/src/installation/guest/additional_configuration.rst
- Appended a new section titled 'Enable AutoLogon' to the guest configuration documentation.
- Provided command-line instructions for configuring Windows AutoLogon via registry keys.
- Mentioned the Microsoft Sysinternals tool Autologon as an alternative method for enabling AutoLogon.
pyproject.toml
- Added a new optional dependency group mcp under [project.optional-dependencies].
- Included fastmcp and httpx as dependencies for the mcp group.
web/apiv2/urls.py
- Registered two new re_path patterns for /tasks/get/selfextracted/ to handle requests for self-extracted files.
- One pattern allows downloading all self-extracted files for a task, and the other allows filtering by a specific tool.
web/apiv2/views.py
- Implemented the tasks_selfextracted view function to handle requests for downloading self-extracted files.
- Added checks for API enablement, task validation, and TLP restrictions.
- Included logic to retrieve self-extraction data from MongoDB, Elasticsearch, or the report.json file.
- Developed functionality to create a ZIP archive of self-extracted files, either all or filtered by a specific tool, handling file naming and pathing within the archive.
- Returned the ZIP file as a StreamingHttpResponse.
web/mcp_server.py
- Created a new Python script that implements a FastMCP server to expose CAPE Sandbox API functionalities.
- Defined helper functions get_headers for API authentication and _request for asynchronous HTTP requests to the CAPE API.
- Implemented _download_file helper for streaming file downloads from API endpoints.
- Decorated numerous asynchronous functions with @mcp.tool() to expose them as MCP tools, covering task submission (file, URL, dlnexec, static), task management (search, list, view, reschedule, reprocess, delete, status, latest, statistics), reports (report, iocs, config), and various file downloads (screenshot, pcap, tlspcap, evtx, dropped, self_extracted_files, surifile, mitmdump, payloadfiles, procdumpfiles, procmemory, fullmemory).
- Included tools for infrastructure and file management (view_file, download_sample, list_machines, view_machine, list_exitnodes, get_cuckoo_status).
web/templates/apiv2/index.html
- Added a new table row for 'Task Self Extracted Files' in the API documentation section.
- Displayed the enabled status, RPS, and RPM for the taskselfextracted API endpoint.
- Provided curl examples for accessing the new /tasks/get/selfextracted/ API endpoints, including the optional tool parameter.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a new MCP server for AI agent interaction, adds support for downloading self-extracted files, and implements reboot persistence for the Windows analyzer. While these features enhance the platform's capabilities, several high-severity security vulnerabilities were identified, primarily related to improper handling of user-supplied paths and unvalidated data from analysis reports. These issues could lead to arbitrary file read/write on the server and host systems. Additionally, there are some suggestions for improving code quality and documentation clarity.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Add input validation and security hardening plus submission data helper. - MCP.md: update example to run mcp_server via poetry and use /opt/CAPEv2 path. - web/apiv2/views.py: enforce SHA256 format check when iterating extracted files to skip invalid values. - web/mcp_server.py: introduce ALLOWED_SUBMISSION_DIR (env var) and enforce files are submitted only from that directory; sanitize filenames from Content-Disposition with os.path.basename; add _build_submission_data helper to normalize/skip empty values and stringify booleans/ints; replace repeated data population with helper calls across submit_file/submit_url/submit_dlnexec/submit_static; narrow JSON parsing exceptions to json.JSONDecodeError and return clearer security violation messages.

…features

Replace legacy MCP.md with a new docs page (docs/book/src/usage/mcp.rst) and add it to the usage index. Extend conf/default/api.conf.default with per-section mcp = no toggles (defaulting to disabled) so MCP tools can be enabled/disabled via config. Refactor web/mcp_server.py to load CAPE config, derive API URL from api.conf when unset, add CAPE root to sys.path, handle imports gracefully, introduce per-request token support and global auth enforcement, and register MCP tools conditionally based on api.conf mcp flags. Also improve download/submission helpers and add token parameters to most tool endpoints.

Revise MCP authentication docs to recommend storing CAPE_API_TOKEN in client config (not global env), add clear Local (stdio) vs Remote (SSE) deployment guidance, and document authentication priority. In web/mcp_server.py import additional web_utils helpers, add a startup warning when token auth is enabled but no default token is set, and expose two new MCP tools: `extendedtasksearch` (returns available advanced search terms/filters/hash types) and `verify_auth` (lightweight token validity check using cuckoo status). These changes improve security guidance and provide runtime helpers for building and validating authenticated requests.

Update poetry.lock

4d620fd

gemini-code-assist Bot reviewed Feb 17, 2026

View reviewed changes

doomedraven and others added 10 commits February 17, 2026 13:03

Update web/apiv2/views.py

5f5175b

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update analyzer/windows/analyzer.py

2ef3c73

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Merge branch 'few_features' of github.com:kevoreilly/CAPEv2 into few_…

c2e9865

…features

Update mcp_server.py

bab25d7

Update views.py

dced750

Update views.py

c2560a1

Update mcp_server.py

cc887cb

doomedraven merged commit 7af8021 into master Feb 23, 2026
7 checks passed

doomedraven deleted the few_features branch February 23, 2026 08:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MCP server; support selfextracted API#2917

Add MCP server; support selfextracted API#2917
doomedraven merged 12 commits into
masterfrom
few_features

doomedraven commented Feb 17, 2026

Uh oh!

gemini-code-assist Bot commented Feb 17, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

doomedraven commented Feb 17, 2026

Uh oh!

gemini-code-assist Bot commented Feb 17, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant