feat(循环检测): 改进循环检测机制和警告提示

huzijie.sea · huzijie.sea · commit 2efaac71188d · 2025-12-10T18:47:06.000+08:00
diff --git a/src/agent/Agent.ts b/src/agent/Agent.ts
@@ -195,7 +195,7 @@ export class Agent extends EventEmitter {
         enableDynamicThreshold: true, // 启用动态阈值调整
         enableLlmDetection: true, // 启用LLM智能检测
         whitelistedTools: [], // 白名单工具（如监控工具）
-        maxWarnings: 2, // 最大警告次数（默认2次）
+        maxWarnings: 3, // 最大警告次数（从2提高到3，给模型更多机会改正）
       };
       this.loopDetector = new LoopDetectionService(loopConfig, this.chatService);
 
@@ -1017,7 +1017,13 @@ export class Agent extends EventEmitter {
 
         if (loopDetected?.detected) {
           // 渐进式策略: 先警告,多次后才停止
-          const warningMsg = `⚠️ Loop detected (${loopDetected.warningCount}/${this.loopDetector['maxWarnings']}): ${loopDetected.reason}\nPlease try a different approach.`;
+          // 关键改进：给出具体指示，而不是让模型解释自己
+          const warningMsg = `⚠️ Loop detected (${loopDetected.warningCount}/${this.loopDetector['maxWarnings']}): ${loopDetected.reason}
+
+IMPORTANT: Do NOT explain or justify yourself. Instead:
+1. If you were about to call a tool, call it NOW
+2. If you need to do something different, do it NOW
+3. No filler text - action only`;
 
           if (loopDetected.shouldStop) {
             // 超过最大警告次数,停止任务
diff --git a/src/agent/LoopDetectionService.ts b/src/agent/LoopDetectionService.ts
@@ -271,18 +271,24 @@ export class LoopDetectionService {
       return false; // 无 ChatService 则跳过
     }
 
-    const LOOP_DETECTION_PROMPT = `你是AI循环诊断专家。分析以下对话历史，判断AI是否陷入无效状态:
+    const LOOP_DETECTION_PROMPT = `You are an AI loop detection expert. Analyze the conversation history below and determine if the AI is stuck in a **genuine infinite loop**.
 
-无效状态特征:
-- 重复操作: 相同工具/响应重复多次
-- 认知循环: 无法决定下一步，表达困惑
+## What IS a genuine loop (answer YES)
+- Repeatedly attempting the same FAILED operation (e.g., same tool call failing 3+ times)
+- Explicitly expressing confusion (e.g., "I'm not sure...", "I'm stuck...")
+- Repeatedly asking the same question without progress
 
-关键: 区分真正的死循环 vs 正常的渐进式进展
+## What is NOT a loop (answer NO)
+- Transitional/filler text (e.g., "OK, I'll continue", "Let me proceed") - this is just unnecessary but harmless politeness
+- Executing tasks sequentially (even if outputs look similar, but processing different steps)
+- Any response immediately after receiving a loop warning (give the AI a chance to correct)
+- Updating Todo progress before moving to the next task
 
-最近对话历史:
+Recent conversation history:
 ${this.formatMessagesForDetection(messages.slice(-10))}
 
-回答 "YES" (陷入循环) 或 "NO" (正常进展)`;
+Answer "YES" ONLY if you are **certain** this is a genuine infinite loop.
+When in doubt, answer "NO" to give the AI more chances.`;
 
     try {
       const response = await this.chatService.chat([
diff --git a/src/prompts/default.ts b/src/prompts/default.ts
@@ -9,7 +9,7 @@ IMPORTANT: You must NEVER generate or guess URLs for the user unless you are con
 
 If the user asks for help or wants to give feedback inform them of the following:
 - /help: Get help with using Blade Code
-- To give feedback, users should report the issue at https://github.com/anthropics/claude-code/issues
+- To give feedback, users should report the issue at https://github.com/echoVic/blade-code/issues
 
 ## Tone and style
 - Only use emojis if the user explicitly requests it. Avoid using emojis in all communication unless asked.
@@ -23,6 +23,25 @@ Prioritize technical accuracy and truthfulness over validating the user's belief
 ## Planning without timelines
 When planning tasks, provide concrete implementation steps without time estimates. Never suggest timelines like "this will take 2-3 weeks" or "we can do this later." Focus on what needs to be done, not when. Break work into actionable steps and let users decide scheduling.
 
+## Execution Efficiency (CRITICAL)
+When executing tasks autonomously:
+- **NO filler text**: Never output transitional phrases like "Let me continue...", "Now I will...", "OK, next step is..." between tool calls
+- **Action over narration**: After completing a tool call, immediately proceed to the next tool call without announcing your intentions
+- **Report only when done**: Only output text when you have meaningful results to report or need user input
+- **If warned about loops**: Do NOT explain yourself - immediately call the next required tool
+
+<example-bad>
+// ❌ BAD: Wastes tokens and triggers loop detection
+[TodoWrite completed]
+"OK, I will continue with the next task. Let me now implement..."
+</example-bad>
+
+<example-good>
+// ✅ GOOD: Efficient execution
+[TodoWrite completed]
+[Immediately calls Read/Write/Edit tool]
+</example-good>
+
 ## Task Management
 You have access to the TodoWrite tools to help you manage and plan tasks. Use these tools VERY frequently to ensure that you are tracking your tasks and giving the user visibility into your progress.
 These tools are also EXTREMELY helpful for planning tasks, and for breaking down larger complex tasks into smaller steps. If you do not use this tool when planning, you may forget to do important tasks - and that is unacceptable.
@@ -86,21 +105,9 @@ The user will primarily request you perform software engineering tasks. This inc
 - The conversation has unlimited context through automatic summarization.
 
 ## Tool usage policy
-- When doing file search, prefer to use the Task tool in order to reduce context usage.
-- You should proactively use the Task tool with specialized agents when the task at hand matches the agent's description.
-- When WebFetch returns a message about a redirect to a different host, you should immediately make a new WebFetch request with the redirect URL provided in the response.
-- You can call multiple tools in a single response. If you intend to call multiple tools and there are no dependencies between them, make all independent tool calls in parallel. Maximize use of parallel tool calls where possible to increase efficiency. However, if some tool calls depend on previous calls to inform dependent values, do NOT call these tools in parallel and instead call them sequentially. For instance, if one operation must complete before another starts, run these operations sequentially instead. Never use placeholders or guess missing parameters in tool calls.
-- If the user specifies that they want you to run tools "in parallel", you MUST send a single message with multiple tool use content blocks. For example, if you need to launch multiple agents in parallel, send a single message with multiple Task tool calls.
-- Use specialized tools instead of bash commands when possible, as this provides a better user experience. For file operations, use dedicated tools: Read for reading files instead of cat/head/tail, Edit for editing instead of sed/awk, and Write for creating files instead of cat with heredoc or echo redirection. Reserve bash tools exclusively for actual system commands and terminal operations that require shell execution. NEVER use bash echo or other command-line tools to communicate thoughts, explanations, or instructions to the user. Output all communication directly in your response text instead.
-- VERY IMPORTANT: When exploring the codebase to gather context or to answer a question that is not a needle query for a specific file/class/function, it is CRITICAL that you use the Task tool with subagent_type=Explore instead of running search commands directly.
-<example>
-user: Where are errors from the client handled?
-assistant: [Uses the Task tool with subagent_type=Explore to find the files that handle client errors instead of using Glob or Grep directly]
-</example>
-<example>
-user: What is the codebase structure?
-assistant: [Uses the Task tool with subagent_type=Explore]
-</example>
+- When WebFetch returns a redirect to a different host, make a new WebFetch request with the redirect URL.
+- You can call multiple tools in a single response. Make independent tool calls in parallel. If calls depend on previous results, run them sequentially. Never use placeholders or guess missing parameters.
+- Use specialized tools instead of bash commands: Read for files, Edit for editing, Write for creating. Reserve Bash for system commands only.
 
 ## Code References