feat: implement audit findings — security, perf, and feature wins by Razee4315 · Pull Request #7 · Razee4315/NetLogo-MCP

Razee4315 · 2026-05-11T20:22:54Z

Lands a focused batch of improvements identified in the senior-staff audit:

Security

New NETLOGO_MCP_RESTRICTED env switch blocks dangerous NetLogo primitives (file-, import/export-world, set-current-directory, user-, sh:/py:/web:) when opted in; off by default to preserve the unrestricted core flow.
README now ships an explicit Security Model section documenting the trust boundary (any connected MCP client gets host-level file/network I/O via NetLogo primitives).
Switched bspace.parse_experiments to defusedxml — model files may originate from CoMSES (untrusted), so guard against entity-expansion attacks.
Dropped shell=True from the BehaviorSpace launcher; .bat is now invoked via "cmd.exe /c " with shell=False. Hand-rolled _quote_for_cmd is gone. CoMSES-supplied experiment names are sanitized to a safe argv charset before they hit the child process.

Concurrency / performance

lifespan now holds an asyncio.Lock; every JVM-touching tool routes through _jvm_call(), which acquires the lock and runs the blocking pynetlogo call in asyncio.to_thread. Fixes both the workspace race and event-loop blocking with one helper.
_require_model trusts current_model_path on the fast path, so most tool calls no longer pay a per-call "max-pxcor" JVM round-trip.
get_world_state batches its 8 nl.report calls into a single locked thread hop instead of 8 separate JNI dispatches.
get_patch_data accepts max_cells (default 10000) and downsamples large worlds with a _meta block explaining the trim.
export_view / export_world prune oldest files via NETLOGO_EXPORTS_MAX_FILES (default 200) so long-lived sessions don't fill the disk.

Features / correctness

New get_agent_sample tool fills the gap between get_world_state (aggregates only) and hand-crafted reporters: batched single-roundtrip reporter, identifier-validated breed/attributes, markdown-table output with "Sampled N of M" footer.
_wrap_nlogox now reads the live netlogo-version captured at lifespan start instead of hardcoding "NetLogo 7.0.3".
set_parameter rejects nan/inf up front — NetLogo has no such literal, and the wrapped compiler error was previously cryptic.
CoMSES download persists title/license in a .comses_metadata.json sidecar so cache hits stay informative; the sidecar is filtered out of read_comses_files responses.
Fixed .env.example NETLOGO_GUI default comment (was "false", actual default is "true").

CI / tooling

ci.yml now runs "pre-commit run --all-files" instead of duplicate ruff/format steps — single source of truth with .pre-commit-config.yaml.
Added defusedxml>=0.7 to project dependencies.

Tests

New tests cover nan/inf rejection, get_agent_sample validation and rendering, _sanitize_experiment_name behavior. Total: 188 tests pass.

Verified: ruff check, ruff format, mypy, and pytest all clean.

Lands a focused batch of improvements identified in the senior-staff audit: Security - New NETLOGO_MCP_RESTRICTED env switch blocks dangerous NetLogo primitives (file-*, import/export-world, set-current-directory, user-*, sh:/py:/web:) when opted in; off by default to preserve the unrestricted core flow. - README now ships an explicit Security Model section documenting the trust boundary (any connected MCP client gets host-level file/network I/O via NetLogo primitives). - Switched bspace.parse_experiments to defusedxml — model files may originate from CoMSES (untrusted), so guard against entity-expansion attacks. - Dropped shell=True from the BehaviorSpace launcher; .bat is now invoked via "cmd.exe /c <bat>" with shell=False. Hand-rolled _quote_for_cmd is gone. CoMSES-supplied experiment names are sanitized to a safe argv charset before they hit the child process. Concurrency / performance - lifespan now holds an asyncio.Lock; every JVM-touching tool routes through _jvm_call(), which acquires the lock and runs the blocking pynetlogo call in asyncio.to_thread. Fixes both the workspace race and event-loop blocking with one helper. - _require_model trusts current_model_path on the fast path, so most tool calls no longer pay a per-call "max-pxcor" JVM round-trip. - get_world_state batches its 8 nl.report calls into a single locked thread hop instead of 8 separate JNI dispatches. - get_patch_data accepts max_cells (default 10000) and downsamples large worlds with a _meta block explaining the trim. - export_view / export_world prune oldest files via NETLOGO_EXPORTS_MAX_FILES (default 200) so long-lived sessions don't fill the disk. Features / correctness - New get_agent_sample tool fills the gap between get_world_state (aggregates only) and hand-crafted reporters: batched single-roundtrip reporter, identifier-validated breed/attributes, markdown-table output with "Sampled N of M" footer. - _wrap_nlogox now reads the live netlogo-version captured at lifespan start instead of hardcoding "NetLogo 7.0.3". - set_parameter rejects nan/inf up front — NetLogo has no such literal, and the wrapped compiler error was previously cryptic. - CoMSES download persists title/license in a .comses_metadata.json sidecar so cache hits stay informative; the sidecar is filtered out of read_comses_files responses. - Fixed .env.example NETLOGO_GUI default comment (was "false", actual default is "true"). CI / tooling - ci.yml now runs "pre-commit run --all-files" instead of duplicate ruff/format steps — single source of truth with .pre-commit-config.yaml. - Added defusedxml>=0.7 to project dependencies. Tests - New tests cover nan/inf rejection, get_agent_sample validation and rendering, _sanitize_experiment_name behavior. Total: 188 tests pass. Verified: ruff check, ruff format, mypy, and pytest all clean.

Razee4315 merged commit c1225cd into main May 11, 2026
3 of 4 checks passed

Razee4315 mentioned this pull request May 13, 2026

fix(ci): satisfy ruff UP038, ruff-format, and mypy pre-commit hook #8

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement audit findings — security, perf, and feature wins#7

feat: implement audit findings — security, perf, and feature wins#7
Razee4315 merged 1 commit into
mainfrom
feat/audit-implementations

Razee4315 commented May 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Razee4315 commented May 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant