fix: resolve MCP protocol compatibility issues

SummerOneTwo · claude · SummerOneTwo · commit 6421f5d47ce3 · 2026-04-09T18:03:47.000+08:00
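P1-1: Add real MCP end-to-end tests
- Add test_e2e_mcp.py, which launches the MCP server process over stdio
- Cover the MCP handshake, tool listing, tool calls, JSON output format, and Chinese text encoding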
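
The tests send each JSON-RPC message as one newline-terminated JSON line on the server's stdin and read replies from stdout the same way. Below is a minimal sketch of that framing. The helper names `encode_message` and `match_response` are hypothetical and not part of this commit. Unlike the test client, which assumes the next line is the reply, this sketch skips any line whose `id` does not match (for example, a server notification).

```python
from __future__ import annotations

import json


def encode_message(method: str, request_id: int, params: dict | None = None) -> bytes:
    """Serialize one JSON-RPC 2.0 request as a newline-terminated line."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": method,
        "params": params or {},
    }
    return (json.dumps(request) + "\n").encode("utf-8")


def match_response(lines: list[bytes], request_id: int) -> dict:
    """Return the result of the first message whose id equals request_id."""
    for line in lines:
        message = json.loads(line.decode("utf-8"))
        if message.get("id") != request_id:
            continue  # a notification, or a reply to a different request
        if "error" in message:
            raise RuntimeError(f"MCP error: {message['error']}")
        return message.get("result", {})
    raise RuntimeError("no matching response")
```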

P1-2: Emit standard JSON in TextContent
- server.py now uses json.dumps instead of str()
- Output is standard JSON (double quotes) rather than a Python repr (single quotes)
- Chinese text is emitted unescaped (ensure_ascii=False)
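
A small illustration of the difference, using a made-up result dictionary in the `{success, error, data}` shape (the values are assumptions for demonstration only):

```python
import json

result = {"success": True, "error": None, "data": {"content": "你好"}}

as_repr = str(result)                             # Python repr, not JSON
as_json = json.dumps(result, ensure_ascii=False)  # what server.py now emits
as_ascii = json.dumps(result)                     # default: non-ASCII escaped

print(as_repr)   # {'success': True, 'error': None, 'data': {'content': '你好'}}
print(as_json)   # {"success": true, "error": null, "data": {"content": "你好"}}
print(as_ascii)  # {"success": true, "error": null, "data": {"content": "\u4f60\u597d"}}
```

Only the two `json.dumps` strings can be parsed back by a JSON parser. The repr form fails because of its single quotes and the `True`/`None` literals.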

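P2: Document transport and client compatibility
- Add a Transport & Compatibility table to the README
- State explicitly that only stdio transport is currently supported
- List the verification status for Claude Code, Cursor, and OpenCode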

Incidental fix:
- problem.py type error: returncode → return_code

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
diff --git a/README.md b/README.md
@@ -19,7 +19,7 @@ AutoCode MCP Server provides 15 atomic tools that enable AI assistants to create
 - **Multi-Strategy Generation** — Four generation strategies: tiny (exhaustive), random, extreme (edge cases), and TLE-inducing
 - **Stress Testing** — Automated comparison between optimal and brute-force solutions with configurable trial counts
 - **MCP Protocol** — Native support for Claude Code, Cursor, and other MCP-compatible AI tools
-- **Safe Execution** — Timeout control, memory limits (Linux), and temporary directory isolation
+- **Execution Control** — Timeout control, memory limits (Linux), and temporary directory isolation (local trusted environments only)
 - **Polygon Packaging** — Export problems in Polygon format for Codeforces-style platforms
 
 ## Installation
@@ -110,6 +110,18 @@ stress_test_run(problem_dir="problems/ab", trials=100)
 
 ## MCP Client Setup
 
+### Transport & Compatibility
+
+**Current Support**: Local stdio transport only. The server communicates via standard input/output streams and is designed for local trusted environments.
+
+| Client | Status | Notes |
+|--------|--------|-------|
+| Claude Code | ✅ Verified | Primary development environment |
+| Cursor | ⚠️ Config provided | Not yet tested end-to-end |
+| OpenCode | ⚠️ Config provided | Not yet tested end-to-end |
+
+**Not Supported**: HTTP/SSE transport, remote connections, or multi-tenant environments.
+
 ### Claude Code
 
 Edit `~/.config/claude-code/config.json`:
@@ -436,7 +448,18 @@ problem_pack_polygon(
 
 3. **Unified Return Format** — All tools return `{success, error, data}` for consistent error handling.
 
-4. **Safe Execution** — Timeout control, memory limits (Linux via prlimit), and temporary directory isolation.
+4. **Execution Control** — Timeout control, memory limits (Linux via prlimit), and temporary directory isolation.
+
+### Security Boundaries
+
+⚠️ **Important: This tool is designed for local trusted environments only**
+
+- **File Operations**: `file_read` and `file_save` can read/write arbitrary paths (use `problem_dir` parameter to limit scope)
+- **Code Execution**: Compiles and executes AI-generated C++ code with only time/memory limits, no sandbox isolation
+- **Use Cases**: Local development, competitive programming problem creation, AI-assisted coding in trusted environments
+- **Not Suitable For**: Multi-tenant environments, untrusted code execution, production-grade code execution platforms
+
+For stronger isolation, run inside a container or virtual machine.
 
 ### Generation Strategies
 
diff --git a/README_CN.md b/README_CN.md
@@ -19,7 +19,7 @@ AutoCode MCP Server 提供 15 个原子工具，让 AI 助手能够创建、验
 - **多策略生成** — 四种生成策略：tiny（穷举）、random（随机）、extreme（边界情况）、tle（诱导超时）
 - **压力测试** — 自动比较最优解和暴力解，可配置测试轮数
 - **MCP 协议** — 原生支持 Claude Code、Cursor 等 MCP 兼容的 AI 工具
-- **安全执行** — 超时控制、内存限制（Linux）、临时目录隔离
+- **执行控制** — 超时控制、内存限制（Linux）、临时目录隔离（仅限本地可信环境）
 - **Polygon 打包** — 导出为 Polygon 格式，适用于 Codeforces 等平台
 
 ## 安装
@@ -110,6 +110,18 @@ stress_test_run(problem_dir="problems/ab", trials=100)
 
 ## MCP 客户端配置
 
+### 传输层与兼容性
+
+**当前支持**：仅支持本地 stdio 传输。服务器通过标准输入/输出流通信，适用于本地可信环境。
+
+| 客户端 | 状态 | 说明 |
+|--------|------|------|
+| Claude Code | ✅ 已验证 | 主要开发环境 |
+| Cursor | ⚠️ 配置已提供 | 尚未端到端测试 |
+| OpenCode | ⚠️ 配置已提供 | 尚未端到端测试 |
+
+**不支持**：HTTP/SSE 传输、远程连接或多租户环境。
+
 ### Claude Code
 
 编辑 `~/.config/claude-code/config.json`：
@@ -189,7 +201,7 @@ stress_test_run(problem_dir="problems/ab", trials=100)
 
 ### 验证安装
 
-配置完成后，重启 MCP 客户端并检查工具是否可用。你应该能看到 15 个以 `autocode_` 为前缀的工具。
+配置完成后，重启 MCP 客户端并检查工具是否可用。你应该能看到 15 个工具，包括 `solution_build`、`validator_build`、`generator_build` 等。
 
 ## 工具参考
 
@@ -438,6 +450,17 @@ problem_pack_polygon(
 
 4. **安全执行** — 超时控制、内存限制（Linux 通过 prlimit）、临时目录隔离。
 
+### 安全边界
+
+⚠️ **重要提示：本工具仅适用于本地可信环境**
+
+- **文件操作**：`file_read` 和 `file_save` 可读写任意路径（需显式指定 `problem_dir` 参数限制范围）
+- **代码执行**：编译并执行 AI 生成的 C++ 代码，仅提供时间/内存限制，无沙箱隔离
+- **适用场景**：本地开发、竞赛编程出题、可信环境下的 AI 辅助编程
+- **不适用场景**：多租户环境、不可信代码执行、生产级代码运行平台
+
+如需更强的安全隔离，建议在容器或虚拟机中运行。
+
 ### 生成策略
 
 | 策略 | 类型码 | 用途 |
diff --git a/src/autocode_mcp/server.py b/src/autocode_mcp/server.py
@@ -7,6 +7,7 @@
 from __future__ import annotations
 
 import asyncio
+import json
 from typing import Any
 
 from mcp.server import Server
@@ -109,15 +110,15 @@ async def call_tool(name: str, arguments: dict[str, Any]) -> CallToolResult:
         result = await tool.execute(**arguments)
         result_dict = result.to_dict()
         return CallToolResult(
-            content=[TextContent(type="text", text=str(result_dict))],
+            content=[TextContent(type="text", text=json.dumps(result_dict, ensure_ascii=False))],
             structuredContent=result_dict,
             isError=not result.success,
         )
     except Exception as e:
         error_result = ToolResult.fail(str(e))
         error_dict = error_result.to_dict()
         return CallToolResult(
-            content=[TextContent(type="text", text=str(error_dict))],
+            content=[TextContent(type="text", text=json.dumps(error_dict, ensure_ascii=False))],
             structuredContent=error_dict,
             isError=True,
         )
diff --git a/src/autocode_mcp/tools/problem.py b/src/autocode_mcp/tools/problem.py
@@ -387,7 +387,7 @@ async def execute(
                 # Validator 过滤
                 if validator_available:
                     val_result = await run_binary(val_exe, input_data, timeout=timeout)
-                    if val_result.returncode != 0:
+                    if val_result.return_code != 0:
                         # 输入无效，跳过
                         seed += 1
                         continue
diff --git a/tests/test_e2e_mcp.py b/tests/test_e2e_mcp.py
@@ -0,0 +1,240 @@
+"""Real end-to-end MCP compatibility tests.
+
+Launches the MCP server process over stdio and verifies the full protocol handshake and tool calls.
+"""
+
+import asyncio
+import json
+import os
+import sys
+import tempfile
+
+import pytest
+
+
+class MCPClient:
+    """Minimal MCP client for end-to-end tests."""
+
+    def __init__(self, process: asyncio.subprocess.Process):
+        self.process = process
+        self.request_id = 0
+
+    async def send_request(self, method: str, params: dict | None = None) -> dict:
+        """Send a JSON-RPC request and wait for the response."""
+        self.request_id += 1
+        request = {
+            "jsonrpc": "2.0",
+            "id": self.request_id,
+            "method": method,
+            "params": params or {},
+        }
+
+        # Send the request
+        message = json.dumps(request) + "\n"
+        self.process.stdin.write(message.encode())
+        await self.process.stdin.drain()
+
+        # Read the response
+        response_line = await self.process.stdout.readline()
+        if not response_line:
+            raise RuntimeError("MCP server closed connection")
+
+        response = json.loads(response_line.decode())
+
+        if "error" in response:
+            raise RuntimeError(f"MCP error: {response['error']}")
+
+        return response.get("result", {})
+
+    async def initialize(self) -> dict:
+        """Perform the MCP initialization handshake."""
+        # Send the initialize request
+        result = await self.send_request(
+            "initialize",
+            {
+                "protocolVersion": "2024-11-05",
+                "capabilities": {},
+                "clientInfo": {"name": "test-client", "version": "1.0.0"},
+            },
+        )
+
+        # Send the initialized notification
+        notification = {"jsonrpc": "2.0", "method": "notifications/initialized"}
+        message = json.dumps(notification) + "\n"
+        self.process.stdin.write(message.encode())
+        await self.process.stdin.drain()
+
+        return result
+
+    async def list_tools(self) -> list[dict]:
+        """Fetch the tool list."""
+        result = await self.send_request("tools/list")
+        return result.get("tools", [])
+
+    async def call_tool(self, name: str, arguments: dict) -> dict:
+        """Call a tool."""
+        return await self.send_request("tools/call", {"name": name, "arguments": arguments})
+
+    async def close(self) -> None:
+        """Close the connection."""
+        if self.process.stdin:
+            self.process.stdin.close()
+        try:
+            self.process.kill()
+        except ProcessLookupError:
+            pass
+
+
+@pytest.fixture
+async def mcp_client():
+    """Start the MCP server and yield a client instance."""
+    # Launch autocode_mcp.server as a module with the current Python interpreter
+    process = await asyncio.create_subprocess_exec(
+        sys.executable,
+        "-m",
+        "autocode_mcp.server",
+        stdin=asyncio.subprocess.PIPE,
+        stdout=asyncio.subprocess.PIPE,
+        stderr=asyncio.subprocess.PIPE,
+        env={**os.environ, "PYTHONIOENCODING": "utf-8"},
+    )
+
+    client = MCPClient(process)
+
+    try:
+        yield client
+    finally:
+        await client.close()
+
+
+@pytest.mark.asyncio
+async def test_mcp_handshake(mcp_client: MCPClient):
+    """Test the MCP protocol handshake."""
+    result = await mcp_client.initialize()
+
+    assert "protocolVersion" in result
+    assert "serverInfo" in result
+    assert result["serverInfo"]["name"] == "autocode-mcp"
+
+
+@pytest.mark.asyncio
+async def test_mcp_list_tools(mcp_client: MCPClient):
+    """Test fetching the tool list."""
+    await mcp_client.initialize()
+
+    tools = await mcp_client.list_tools()
+
+    # Verify there are 15 tools
+    assert len(tools) == 15
+
+    # Verify the key tools are present
+    tool_names = {t["name"] for t in tools}
+    expected_tools = {
+        "file_read",
+        "file_save",
+        "solution_build",
+        "solution_run",
+        "validator_build",
+        "generator_build",
+        "checker_build",
+        "stress_test_run",
+        "problem_create",
+        "problem_generate_tests",
+    }
+    assert expected_tools.issubset(tool_names)
+
+
+@pytest.mark.asyncio
+async def test_mcp_call_file_read(mcp_client: MCPClient):
+    """Test calling the file_read tool over MCP."""
+    await mcp_client.initialize()
+
+    with tempfile.NamedTemporaryFile(mode="w", suffix=".txt", delete=False, encoding="utf-8") as f:
+        f.write("hello world")
+        temp_path = f.name
+
+    try:
+        result = await mcp_client.call_tool("file_read", {"path": temp_path})
+
+        # Verify the response structure
+        assert "content" in result
+        assert not result.get("isError", True)
+
+        # Verify content is a list containing a single TextContent item
+        content = result["content"]
+        assert isinstance(content, list)
+        assert len(content) == 1
+        assert content[0]["type"] == "text"
+
+        # Verify the text is valid JSON
+        text = content[0]["text"]
+        parsed = json.loads(text)
+        assert parsed["success"] is True
+        assert "data" in parsed
+        assert parsed["data"]["content"] == "hello world"
+
+        # Verify structuredContent is present
+        assert "structuredContent" in result
+        assert result["structuredContent"]["success"] is True
+    finally:
+        os.unlink(temp_path)
+
+
+@pytest.mark.asyncio
+async def test_mcp_call_unknown_tool(mcp_client: MCPClient):
+    """Test that calling a nonexistent tool returns an error."""
+    await mcp_client.initialize()
+
+    result = await mcp_client.call_tool("nonexistent_tool", {})
+
+    assert result.get("isError") is True
+    assert "Unknown tool" in result["content"][0]["text"]
+
+
+@pytest.mark.asyncio
+async def test_mcp_text_content_is_valid_json(mcp_client: MCPClient):
+    """Test that the TextContent text is valid JSON (not a Python repr)."""
+    await mcp_client.initialize()
+
+    with tempfile.NamedTemporaryFile(mode="w", suffix=".txt", delete=False, encoding="utf-8") as f:
+        f.write("test")
+        temp_path = f.name
+
+    try:
+        result = await mcp_client.call_tool("file_read", {"path": temp_path})
+
+        text = result["content"][0]["text"]
+
+        # Must be valid JSON
+        parsed = json.loads(text)
+
+        # Must not be Python repr format (e.g. {'success': True})
+        # Python repr uses single quotes; JSON uses double quotes
+        assert "'" not in text  # holds as long as the payload itself contains no apostrophes
+        assert parsed["success"] is True
+    finally:
+        os.unlink(temp_path)
+
+
+@pytest.mark.asyncio
+async def test_mcp_chinese_text_encoding(mcp_client: MCPClient):
+    """Test that Chinese text is encoded correctly."""
+    await mcp_client.initialize()
+
+    chinese_content = "你好世界"
+    with tempfile.NamedTemporaryFile(mode="w", suffix=".txt", delete=False, encoding="utf-8") as f:
+        f.write(chinese_content)
+        temp_path = f.name
+
+    try:
+        result = await mcp_client.call_tool("file_read", {"path": temp_path})
+
+        text = result["content"][0]["text"]
+        parsed = json.loads(text)
+
+        # Verify the Chinese text round-trips correctly (ensure_ascii=False)
+        assert parsed["data"]["content"] == chinese_content
+        # The raw text should contain the Chinese characters, not \uXXXX escapes
+        assert chinese_content in text
+    finally:
+        os.unlink(temp_path)