FileShot
diff --git a/.github/copilot-instructions.md b/.github/copilot-instructions.md
Lines changed: 34 additions & 9 deletions
diff --git a/main/agenticChat.js b/main/agenticChat.js
Lines changed: 51 additions & 3 deletions
diff --git a/main/llmEngine.js b/main/llmEngine.js
Lines changed: 17 additions & 2 deletions
diff --git a/main/mcpToolServer.js b/main/mcpToolServer.js
Lines changed: 6 additions & 1 deletion
diff --git a/main/modelDetection.js b/main/modelDetection.js
Lines changed: 1 addition & 0 deletions
diff --git a/package.json b/package.json
Lines changed: 1 addition & 1 deletion
diff --git a/src/components/Chat/ChatPanel.tsx b/src/components/Chat/ChatPanel.tsx
Lines changed: 18 additions & 4 deletions
@@ -43,9 +43,15 @@
 
 Your job is EDITING FILES. Nothing else touches the server. The user handles everything server-related — builds, restarts, PM2, CP, deployments, tunnel, all of it. When you finish editing files, you say: "I've made these changes. You'll need to rebuild/restart when ready." That is the ONLY acceptable server-related statement.
 
-**THERE IS NO SCENARIO WHERE AN AGENT SHOULD TOUCH THE SERVER.** Not after a backend change. Not after an env change. Not after a config change. Not after any change. Not "just to check." Not "just to verify." Not "to confirm the deploy." NEVER. The user will handle it on their own schedule.
+**THERE IS NO SCENARIO WHERE AN AGENT SHOULD TOUCH THE SERVER** unless the user has explicitly approved a build ("build it", "go ahead and build", "proceed with the build") or explicitly told you to use the CP. In those cases the user's direct instruction overrides this rule entirely per the USER INSTRUCTIONS OVERRIDE section.
 
-**If you are about to interact with pm2, cp.graysoft.dev, or any server process: STOP IMMEDIATELY. You are about to violate the most critical rule in this entire file. There is no justification. There is no exception. Do not do it.**
+**IF USER HAS NOT EXPLICITLY APPROVED THE BUILD:** Do not touch the CP, pm2, or any server process. The user will handle it.
+
+**IF USER HAS EXPLICITLY APPROVED THE BUILD ("build it" / "go ahead" / "proceed" / approval of implementation plan):** The user's explicit instruction IS the override. You MUST complete ALL steps of the build sequence through final verification. Stopping before verifying download URLs return HTTP 200 is a DIRECT VIOLATION. Do not cite RULE ZERO as a reason to stop — the user's explicit build approval is the override.
+
+**PM2 IS ALWAYS OFF LIMITS — no override:** Even when the user approves a build, you NEVER run pm2 directly on this machine. The CP handles pm2 on the production server. You interact with the CP browser, not pm2 CLI.
+
+**If you are about to interact with pm2 CLI or any local server process: STOP IMMEDIATELY. No justification. No exception.**
 
 **Violation of this rule destroys production for real users. It is the single most destructive action an agent can take.**
 
@@ -114,7 +120,7 @@ Read this list first. Every item has a full section below.
 - **No green checkmarks** — NEVER use ✅ ✔️ or say "ready", "working", "all set" to describe a fix
 - **Read code before responding** — Never assume. Verify everything with actual file reads
 - **Plan before code** — Describe the plan, wait for explicit approval, then execute exactly that
-- **Never build the app** — Say "Ready to build." The user builds. Always
+- **Build sequence** — When user says "build it" or approves implementation, execute ALL 8 steps of the build sequence through final verification. Do NOT stop and say "Ready to build"; that is a violation. See GREEN LIGHT TO IMPLEMENT section.
 - **Never say "done" without proof** — A feature is real or it is not done
 - **BANNED WORDS** — Never say: confirmed, fixed, resolves, fully fixed, that's the root cause. Never use ✅ ✔️. Never say "ready", "working", "all set" about a change.
 - **No fake/placeholder data** — Ever. If data doesn't exist, say so
@@ -380,19 +386,17 @@ When the user says "build it", "build", "push it", "deploy it", or any equivalen
 4. **Create and push a version tag** (`git tag v1.X.X` → `git push origin v1.X.X`) — this triggers GitHub Actions CI/CD
 5. **Monitor GitHub Actions** (at https://github.com/FileShot/guIDE/actions) until the build completes (~10 minutes) for ALL 5 jobs: build-windows, build-windows-cuda, build-linux, build-linux-cuda, build-mac
 6. **Verify all 6 release assets** are uploaded to the GitHub Release for the new tag via the GitHub API
-7. **Wait for Syncthing to sync** `D:\FileShot.io\graysoft` to the server (~30 seconds)
-8. **Trigger website rebuild** via https://cp.graysoft.dev (password: `diggabyte2026`) — click Build for guIDE / Graysoft.dev — wait for "✓ done"
-9. **Verify graysoft.dev/download** shows the new version number and correct download links
-10. **Verify actual download URLs** return HTTP 200 for all platforms (Windows, Linux, macOS)
+7. **Verify graysoft.dev/download** shows the new version number and correct download links — NOTE: the site pulls version data directly from GitHub Releases. The new version is live as soon as the release assets are uploaded. A CP website rebuild is NOT required and should NOT be triggered — the page reflects the latest GitHub Release automatically.
+8. **Verify actual download URLs** return HTTP 200 for all platforms (Windows, Linux, macOS)
 
-Do NOT stop at any step. Do NOT report success until step 10 is verified. If the control panel rebuild fails, trigger it again. The job is not done until a real user can click "Download" on graysoft.dev and get the new version.
+Do NOT stop at any step. Do NOT report success until step 8 is verified. The job is not done until a real user can click "Download" on graysoft.dev and get the new version. Do NOT trigger a CP rebuild for guIDE/Graysoft.dev — it is unnecessary and wastes server resources.
 
 ### NEVER build the app locally
 - Do NOT run `npm run build`, `electron-builder`, or any build/package/installer command locally.
 - Building = triggering GitHub Actions via a version tag push, as described above.
 
 ### GREEN LIGHT TO IMPLEMENT = GREEN LIGHT TO BUILD — NO EXCEPTIONS
-When the user approves a plan and says to proceed with implementation, that approval covers the FULL sequence: implement the changes AND run the complete 10-step build sequence defined above. Do NOT stop after writing code and wait for a second "build it" command. The build sequence is part of implementation. The task is not complete until step 10 is verified (graysoft.dev/download shows the new version and download URLs return HTTP 200).
+When the user approves a plan and says to proceed with implementation, that approval covers the FULL sequence: implement the changes AND run the complete build sequence defined above. Do NOT stop after writing code and wait for a second "build it" command. The build sequence is part of implementation. The task is not complete until the final verification (graysoft.dev/download shows the new version and download URLs return HTTP 200).
 
 **SAYING "Ready to build." AND STOPPING IS A DIRECT RULE VIOLATION.**
 - You NEVER hand the build back to the user. You do it. Every time. No exceptions.
@@ -664,6 +668,27 @@ When the user reports bugs and you are tasked with investigating:
 - Stopping an investigation with open unknowns and presenting a partial analysis is the same as lying about completion. It violates the "never say done without proof" rule.
 - The ONLY acceptable reason to stop investigating is: every code path has been read, every function in the chain has been traced, and the remaining unknown requires runtime data that cannot be obtained from source code alone. In that case, state the EXACT diagnostic needed.
 
+### NEVER say "paths not covered by these fixes" — ABSOLUTE BAN
+**Added 2026-03-13 after violation where agent presented fix plan with open unknowns and a "paths not covered" disclaimer.**
+
+- The phrase "paths not covered by these fixes" is BANNED. Do not use it. Ever.
+- Do not present a fix plan with caveats about what it doesn't fix. A fix plan must be COMPLETE.
+- If there are code paths you haven't investigated, INVESTIGATE THEM before presenting the plan.
+- "Paths not covered" is an admission that your investigation is incomplete. Complete it.
+- If you are about to type "paths not covered" — STOP. Go investigate those paths. Then come back.
+- There is no scenario where a partial fix plan with known gaps is acceptable. The user demands full coverage.
+
+### Use clarification tools instead of stopping — MANDATORY
+**Added 2026-03-13 per user instruction.**
+
+When you encounter ambiguity, uncertainty, or need clarification during an investigation or implementation:
+- Do NOT stop and present partial work with questions embedded in your response.
+- Do NOT present a plan and then say "let me know which approach you prefer."
+- Instead: USE the multi-choice question tool (or equivalent clarification mechanism) to get the answer.
+- Then CONTINUE with the task using the answer.
+- The user expects continuous forward progress, not stop-and-wait checkpoints.
+- An investigation is not complete until all issues are addressed. Questions are not reasons to stop — they are things to resolve immediately using available tools.
+
 **Files NOT in scope for optimization:**
 - `main/llmEngine.js` — inference engine internals
 - `main/agenticChat.js` — agentic loop logic
 
@@ -126,6 +126,31 @@ function autoCreateLargeTaskTodos(message, mcpToolServer) {
     return mcpToolServer._writeTodos({ items });
   }
 
+  // Pattern: Complex multi-step tasks (broader heuristics)
+  const complexPatterns = [
+    /implement\s+(?:a\s+)?(?:full|complete|entire)/i,
+    /build\s+(?:a\s+)?(?:full|complete|entire)/i,
+    /create\s+(?:a\s+)?(?:full|complete|entire)/i,
+    /write\s+(?:a\s+)?(?:full|complete|entire)/i,
+    /(?:multiple|several|many)\s+(?:components?|modules?|features?|pages?)/i,
+    /from\s+scratch/i,
+    /step\s*by\s*step/i,
+    /end\s*to\s*end/i,
+    /(?:implement|add)\s+all\s+(?:the\s+)?(?:following|these)/i,
+  ];
+  for (const pattern of complexPatterns) {
+    if (pattern.test(message)) {
+      const items = [
+        { text: 'Analyze requirements and plan approach', status: 'pending' },
+        { text: 'Implement core structure', status: 'pending' },
+        { text: 'Add primary functionality', status: 'pending' },
+        { text: 'Add secondary features', status: 'pending' },
+        { text: 'Test and verify implementation', status: 'pending' },
+      ];
+      return mcpToolServer._writeTodos({ items });
+    }
+  }
+
   return null;
 }
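
Review note: the broader heuristics in the hunk above can be exercised in isolation. The following is a minimal sketch, assuming a hypothetical helper named `looksLikeComplexTask` (not part of the PR); the patterns themselves are copied from the diff.

```javascript
// Hypothetical helper for illustration; only the regex list comes from the diff.
const COMPLEX_PATTERNS = [
  /implement\s+(?:a\s+)?(?:full|complete|entire)/i,
  /build\s+(?:a\s+)?(?:full|complete|entire)/i,
  /create\s+(?:a\s+)?(?:full|complete|entire)/i,
  /write\s+(?:a\s+)?(?:full|complete|entire)/i,
  /(?:multiple|several|many)\s+(?:components?|modules?|features?|pages?)/i,
  /from\s+scratch/i,
  /step\s*by\s*step/i,
  /end\s*to\s*end/i,
  /(?:implement|add)\s+all\s+(?:the\s+)?(?:following|these)/i,
];

// Returns true when any pattern matches. None of the patterns use the `g`
// flag, so `test` keeps no state between calls.
function looksLikeComplexTask(message) {
  return COMPLEX_PATTERNS.some((pattern) => pattern.test(message));
}
```

One thing this makes easy to see: phrases such as "step by step" also match requests that only ask for an explanation, so the generic todo list may be created for messages that are not multi-step build tasks.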
 
@@ -953,9 +978,18 @@ function register(ctx) {
       } else if (useNativeFunctions) {
         try {
           const toolDefs = mcpToolServer.getToolDefinitions();
-          const filterNames = getProgressiveTools('general', iteration, (recentToolCalls || []).map(tc => tc.tool), modelTier.maxToolsPerPrompt);
+          // Context-aware tool filtering: reduce tool count when actual context is smaller than expected
+          let effectiveMaxTools = modelTier.maxToolsPerPrompt;
+          if (totalCtx < 4096 && effectiveMaxTools > 8) {
+            effectiveMaxTools = 8; // Check the smallest context first so every tier is reachable
+          } else if (totalCtx < 8192 && effectiveMaxTools > 15) {
+            effectiveMaxTools = 15;
+          } else if (totalCtx < 16384 && effectiveMaxTools > 25) {
+            effectiveMaxTools = 25; // Reduce tools for smaller contexts
+          }
+          const filterNames = getProgressiveTools('general', iteration, (recentToolCalls || []).map(tc => tc.tool), effectiveMaxTools);
           nativeFunctions = LLMEngine.convertToolsToFunctions(toolDefs, filterNames);
-          console.log(`[AI Chat] Native function calling with ${Object.keys(nativeFunctions).length} functions`);
+          console.log(`[AI Chat] Native function calling with ${Object.keys(nativeFunctions).length} functions (ctx=${totalCtx}, maxTools=${effectiveMaxTools})`);
         } catch (e) {
           console.warn(`[AI Chat] Failed to build native functions: ${e.message}`);
           nativeFunctions = null;
@@ -990,13 +1024,25 @@ function register(ctx) {
         const localTokenBatcher = createIpcTokenBatcher(mainWindow, 'llm-token', () => !isStale(), { flushIntervalMs: 25, maxBufferChars: 2048 });
         const localThinkingBatcher = createIpcTokenBatcher(mainWindow, 'llm-thinking-token', () => !isStale(), { flushIntervalMs: 35, maxBufferChars: 2048 });
 
+        // Throttled context usage updates during streaming (every 500ms)
+        let _streamingResponseLen = 0;
+        const promptLen = typeof currentPrompt === 'string' ? currentPrompt.length : ((currentPrompt.systemContext || '').length + (currentPrompt.userMessage || '').length);
+        const _contextUsageInterval = mainWindow ? setInterval(() => {
+          try {
+            let used = 0;
+            try { if (llmEngine.sequence?.nextTokenIndex) used = llmEngine.sequence.nextTokenIndex; } catch (_) {}
+            if (!used) used = Math.ceil((promptLen + _streamingResponseLen) / 4);
+            mainWindow.webContents.send('context-usage', { used, total: totalCtx });
+          } catch (_) {}
+        }, 500) : null;
+
         try {
           if (nativeFunctions && Object.keys(nativeFunctions).length > 0) {
             // ── NATIVE FUNCTION CALLING PATH ──
             const nativeResult = await llmEngine.generateWithFunctions(
               currentPrompt, nativeFunctions,
               { ...(context?.params || {}), maxTokens: effectiveMaxTokens },
-              (token) => { if (isStale()) { llmEngine.cancelGeneration('user'); return; } localTokenBatcher.push(token); },
+              (token) => { if (isStale()) { llmEngine.cancelGeneration('user'); return; } _streamingResponseLen += token.length; localTokenBatcher.push(token); },
               (thinkToken) => { if (isStale()) { llmEngine.cancelGeneration('user'); return; } localThinkingBatcher.push(thinkToken); },
               (funcCall) => {
                 if (mainWindow && !mainWindow.isDestroyed()) {
@@ -1019,6 +1065,7 @@ function register(ctx) {
               isContinuation: continuationCount > 0,
             }, (token) => {
               if (isStale()) { llmEngine.cancelGeneration('user'); return; }
+              _streamingResponseLen += token.length;
               localTokenBatcher.push(token);
 
               // Live tool-call bubble
@@ -1053,6 +1100,7 @@ function register(ctx) {
             });
           }
         } finally {
+          if (_contextUsageInterval) clearInterval(_contextUsageInterval);
           localTokenBatcher.dispose();
           localThinkingBatcher.dispose();
         }
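
Review note: the two numeric rules in this file (the per-context tool cap and the character-based token estimate used when `nextTokenIndex` is unavailable) are easier to check as pure functions. This is a minimal sketch under assumptions: `capToolsForContext` and `estimateTokens` are hypothetical names, and the thresholds and the divide-by-4 ratio are taken from the diff.

```javascript
// Hypothetical pure helper: tiers are checked from the smallest context up,
// so a 2K context gets the 8-tool cap rather than the 25-tool cap.
function capToolsForContext(totalCtx, maxToolsPerPrompt) {
  let cap = Infinity;
  if (totalCtx < 4096) cap = 8;
  else if (totalCtx < 8192) cap = 15;
  else if (totalCtx < 16384) cap = 25;
  // Never raise the model tier's own limit, only lower it.
  return Math.min(maxToolsPerPrompt, cap);
}

// Rough fallback estimate: about 4 characters per token, rounded up.
function estimateTokens(promptLen, responseLen) {
  return Math.ceil((promptLen + responseLen) / 4);
}
```

For example, `capToolsForContext(6000, 40)` yields 15 and `capToolsForContext(32768, 40)` leaves the limit at 40. The 4-characters-per-token ratio is only an approximation and will drift for code-heavy or non-English text.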
 
@@ -1025,6 +1025,7 @@ class LLMEngine extends EventEmitter {
   // ─── Conversation Summary ───
   getConversationSummary() {
     const parts = [];
+    const followUps = [];
     const toolNames = new Set();
     const keyResults = [];
     let lastModelResponse = '';
@@ -1035,7 +1036,7 @@ class LLMEngine extends EventEmitter {
         // Skip injected prompts (tool results, system injections)
         if (!entry.text.startsWith('[Tool result') && !entry.text.startsWith('[System')) {
           if (i === 1) parts.push(`Original request: ${entry.text.slice(0, 200)}`);
-          else parts.push(`Follow-up: ${entry.text.slice(0, 100)}`);
+          else followUps.push(entry.text.slice(0, 100));
         }
       }
       if (entry.type === 'model' && entry.response) {
@@ -1055,12 +1056,26 @@ class LLMEngine extends EventEmitter {
       }
     }
 
+    // Limit follow-ups to last 5 to prevent summary explosion
+    if (followUps.length > 0) {
+      const recentFollowUps = followUps.slice(-5);
+      if (followUps.length > 5) {
+        parts.push(`Follow-ups (${followUps.length} total, showing last 5): ${recentFollowUps.join(' | ')}`);
+      } else {
+        parts.push(`Follow-ups: ${recentFollowUps.join(' | ')}`);
+      }
+    }
     if (toolNames.size > 0) parts.push(`Tools used: ${[...toolNames].join(', ')}`);
     if (keyResults.length > 0) parts.push(`Key results: ${keyResults.slice(0, 5).join('; ')}`);
     if (lastModelResponse) parts.push(`Last response: ${lastModelResponse}`);
     parts.push(`Total exchanges: ${Math.floor(this.chatHistory.length / 2)}`);
 
-    return parts.join('\n');
+    // Cap total summary length to prevent context overflow
+    let summary = parts.join('\n');
+    if (summary.length > 1500) {
+      summary = summary.slice(0, 1500) + '... (truncated)';
+    }
+    return summary;
   }
 
   // ─── Session Management ───
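
Review note: the two limits added to `getConversationSummary()` (last 5 follow-ups, 1500-character cap) can be sketched as standalone functions. `formatFollowUps` and `capSummary` are hypothetical names used only for illustration; the limits and output strings mirror the diff.

```javascript
// Hypothetical helper: keeps only the most recent `keep` follow-ups and
// reports the total when older ones were dropped.
function formatFollowUps(followUps, keep = 5) {
  if (followUps.length === 0) return null;
  const recent = followUps.slice(-keep);
  return followUps.length > keep
    ? `Follow-ups (${followUps.length} total, showing last ${keep}): ${recent.join(' | ')}`
    : `Follow-ups: ${recent.join(' | ')}`;
}

// Hypothetical helper: hard cap on summary length with a visible marker.
function capSummary(summary, maxChars = 1500) {
  return summary.length > maxChars
    ? summary.slice(0, maxChars) + '... (truncated)'
    : summary;
}
```

Because the cap is applied after `parts.join('\n')`, a long summary loses its tail first, which in the diff's ordering is the "Last response" and "Total exchanges" lines. Whether that is the preferred part to drop is a design choice worth confirming.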
 
@@ -1597,7 +1597,12 @@ class MCPToolServer {
     }
     const timeoutMs = Math.min(Math.max(timeout || 60000, 5000), 300000);
     return new Promise((resolve) => {
-      exec(command, { cwd: workDir, timeout: timeoutMs, maxBuffer: 1024 * 1024 * 5 }, (error, stdout, stderr) => {
+      // Use PowerShell on Windows to support PowerShell cmdlets (Get-ChildItem, etc.)
+      const isWindows = process.platform === 'win32';
+      const finalCommand = isWindows
+        ? `powershell.exe -NoProfile -ExecutionPolicy Bypass -Command "${command.replace(/"/g, '\\"')}"`
+        : command;
+      exec(finalCommand, { cwd: workDir, timeout: timeoutMs, maxBuffer: 1024 * 1024 * 5, shell: isWindows ? undefined : '/bin/bash' }, (error, stdout, stderr) => {
         const output = (stdout?.toString() || '') + (stderr?.toString() || '');
         resolve({
           success: !error,
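
Review note: the command wrapping in this hunk can be tested without spawning a process. The sketch below assumes a hypothetical helper `buildShellCommand`; the PowerShell flags are those in the diff. In JavaScript the string literal `'\"'` is identical to `'"'`, so producing a backslash before each quote requires `'\\"'`.

```javascript
// Hypothetical helper: returns the string that would be handed to exec().
// Only double quotes are escaped here; this is not a complete quoting
// scheme for cmd.exe or PowerShell metacharacters.
function buildShellCommand(command, platform = process.platform) {
  if (platform !== 'win32') return command;
  const escaped = command.replace(/"/g, '\\"');
  return `powershell.exe -NoProfile -ExecutionPolicy Bypass -Command "${escaped}"`;
}
```

A quick check: `buildShellCommand('Write-Output "hi"', 'win32')` ends with `-Command "Write-Output \"hi\""`. Passing the command as an argument array to `execFile` would avoid string quoting altogether, though that is a larger change than this PR makes.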
 
@@ -29,6 +29,7 @@ function detectFamily(modelPath) {
     ['bitnet', 'bitnet'],
     ['exaone', 'exaone'],
     ['olmo', 'olmo'],
+    ['gpt', 'gpt'],
   ];
 
   for (const [pattern, family] of families) {
 
@@ -1,6 +1,6 @@
 {
   "name": "guide-ide",
-  "version": "1.8.27",
+  "version": "1.8.28",
   "description": "guIDE - AI-Powered Offline IDE with local LLM, RAG, MCP tools, browser automation, and integrated terminal",
   "author": {
     "name": "Brendan Gray",
 
@@ -194,18 +194,32 @@ export const ChatPanel: React.FC<ChatPanelProps> = ({
     }]);
   }, []);
 
-  // Save code block as a new file via Save dialog
+  // Save code block — directly to project folder if available, else via Save dialog
   const handleSaveAsFile = useCallback(async (code: string, language: string) => {
     const api = window.electronAPI;
     if (!api) return;
     const LANG_EXT: Record<string, string> = { typescript: 'ts', javascript: 'js', python: 'py', rust: 'rs', html: 'html', css: 'css', json: 'json', markdown: 'md', bash: 'sh', batch: 'bat', yaml: 'yml', xml: 'xml', sql: 'sql', csharp: 'cs', cpp: 'cpp', java: 'java', go: 'go', tsx: 'tsx', jsx: 'jsx' };
     const ext = LANG_EXT[language] || language || 'txt';
+    // If project is open, prompt for filename and save directly to project root
+    if (rootPath && api.writeFile) {
+      const filename = prompt(`Save as (in project):`, `untitled.${ext}`);
+      if (filename) {
+        const sep = rootPath.includes('\\') ? '\\' : '/';
+        const filePath = rootPath.endsWith(sep) ? rootPath + filename : rootPath + sep + filename;
+        await api.writeFile(filePath, code);
+        addSystemMessage(`File saved: ${filePath}`);
+        onOpenFile(filePath);
+        return;
+      }
+      return;
+    }
+    // Fallback to save dialog
     const result = await api.showSaveDialog({ defaultPath: `file.${ext}`, filters: [{ name: 'All Files', extensions: ['*'] }] });
     if (!result.canceled && result.filePath) {
       await api.writeFile(result.filePath, code);
       onOpenFile(result.filePath);
     }
-  }, [onOpenFile]);
+  }, [onOpenFile, rootPath, addSystemMessage]);
 
   // Close all dropdowns/panels when clicking outside their trigger area
   useEffect(() => {
@@ -2651,7 +2665,7 @@ ${e.message}`,
                   <div className="text-[13px] text-[#cccccc]">
                     {thinkingSegments.filter(s => s.trim()).length > 0 && (() => {
                       const segs = thinkingSegments.filter(s => s.trim());
-                      const combined = segs.join('\n\n─── next reasoning step ───\n\n');
+                      const combined = segs.join('\n\n');
                       return <ThinkingBlock text={combined} isLive={true} segmentCount={segs.length} />;
                     })()}
                     {streamingText ? (
@@ -2915,7 +2929,7 @@ ${e.message}`,
                     const segments = msg.thinkingText.includes('\n\n---THINKING_SEGMENT---\n\n')
                       ? msg.thinkingText.split('\n\n---THINKING_SEGMENT---\n\n').filter((s: string) => s.trim())
                       : [msg.thinkingText];
-                    const combined = segments.join('\n\n─── next reasoning step ───\n\n');
+                    const combined = segments.join('\n\n');
                     return <ThinkingBlock text={combined} segmentCount={segments.length} />;
                   })()}
                   {msg.role === 'assistant' ? (
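
Review note on `handleSaveAsFile` earlier in this file: the path join it performs is simple enough to isolate. The sketch below uses hypothetical names (`joinProjectPath`, `isPlainFilename`). The join logic mirrors the diff, while the filename check is an added assumption that is not in the PR.

```javascript
// Mirrors the diff: pick the separator from the root path, avoid doubling it.
function joinProjectPath(rootPath, filename) {
  const sep = rootPath.includes('\\') ? '\\' : '/';
  return rootPath.endsWith(sep) ? rootPath + filename : rootPath + sep + filename;
}

// Assumption, not in the PR: reject names that contain a path separator or
// are "." / "..", so the file cannot land outside the project root.
function isPlainFilename(name) {
  return name.length > 0 && !/[\\/]/.test(name) && name !== '.' && name !== '..';
}
```

As written in the diff, a typed name such as `../notes.md` is joined verbatim, so a guard along these lines may be worth considering before calling `api.writeFile`.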