merge from t3code by totaland · Pull Request #2626 · pingdotgg/t3code

totaland · 2026-05-10T07:17:46Z

What Changed

Why

UI Changes

Checklist

This PR is small and focused
I explained what changed and why
I included before/after screenshots for any UI changes
I included a video for animation/interaction changes

Note

Add push-to-talk voice input and Codex MCP server configuration to the desktop app

Adds a native macOS Swift helper (macos-push-to-talk-helper.swift) that monitors Control+Option key state and emits start/stop events, compiled automatically via a new build step.
Adds a floating overlay window in main.ts that follows the cursor and reflects push-to-talk status; users can trigger voice input via mic button, Control+Option, or the global shortcut Control+Alt+Space.
Transcription runs locally using whisper-cli and ffmpeg, with on-demand model download to the app state directory if not bundled or configured via environment variables.
Extends CodexSettings and CodexSessionRuntime to accept and forward MCP server definitions (with placeholder interpolation) into Codex thread start parameters.
Behavioral Change: Desktop dev builds now use browser history instead of hash history, and backend readiness is awaited before the main window is created in development.

📊 Macroscope summarized b999587. 14 files reviewed, 4 issues evaluated, 0 issues filtered, 2 comments posted

🗂️ Filtered Issues

coderabbitai · 2026-05-10T07:17:54Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 82b0384c-e4fe-4ca6-bf68-f76da3d2a567

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b999587b5b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-10T07:21:11Z

+      window.sessionStorage.setItem("t3code:voice-autostart", "1");
+      void handleNewThread(defaultProjectRef, { envMode: defaultThreadEnvMode });


Cancel voice autostart after an early release

When push-to-talk is started with no active thread, this flag is set before handleNewThread finishes mounting the composer. If the user releases Ctrl+Option while the thread is still being created, the route-level listener ignores the stop event (it returns unless event.type === "start"), but the new composer still consumes t3code:voice-autostart and starts recording with no remaining stop event, leaving voice capture active until the user manually stops it.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-10T07:21:11Z

+        }
+
+        try {
+          const stream = await navigator.mediaDevices.getUserMedia({ audio: true });


Handle releases during microphone startup

For an existing composer, releasing the push-to-talk keys before getUserMedia resolves is lost because stopVoiceListening() only stops voiceRecorderRef.current, which is not assigned until after this await. On a permission prompt or slow device, the recorder can start after the keys are already up and then never receives a stop event, so the mic remains recording until another manual toggle.

Useful? React with 👍 / 👎.

macroscopeapp · 2026-05-10T07:22:28Z

+              ? "audio/webm"
+              : "";
+          const chunks: Blob[] = [];
+          const recorder = new MediaRecorder(stream, mimeType ? { mimeType } : undefined);


🟠 High chat/ChatComposer.tsx:1162

If new MediaRecorder(stream, ...) throws an exception (e.g., NotSupportedError), the stream obtained from getUserMedia leaks because the ref hasn't been set yet and the catch block only cleans up voiceRecorderRef.current?.stream. The stream tracks are never stopped in this error path.

🤖 Copy this AI Prompt to have your agent fix this:

In file apps/web/src/components/chat/ChatComposer.tsx around line 1162: If `new MediaRecorder(stream, ...)` throws an exception (e.g., `NotSupportedError`), the `stream` obtained from `getUserMedia` leaks because the ref hasn't been set yet and the catch block only cleans up `voiceRecorderRef.current?.stream`. The stream tracks are never stopped in this error path. Evidence trail: apps/web/src/components/chat/ChatComposer.tsx lines 1156 (getUserMedia), 1163 (new MediaRecorder — potential throw site), 1165-1170 (voiceRecorderRef.current assignment — not yet executed), 1275 (catch block cleanup uses voiceRecorderRef.current?.stream which is null). Reviewed at REVIEWED_COMMIT.

macroscopeapp · 2026-05-10T07:22:28Z

+function registerPushToTalkShortcuts(): void {
+  if (!app.isReady()) {
+    return;
+  }
+  globalShortcut.unregister(PUSH_TO_TALK_GLOBAL_SHORTCUT);
+  const registered = globalShortcut.register(PUSH_TO_TALK_GLOBAL_SHORTCUT, () => {
+    if (pushToTalkActive) {
+      stopPushToTalkFromShortcut();
+    } else {
+      startPushToTalkFromShortcut();
+    }
+  });
+  if (!registered) {
+    writeDesktopLogHeader(
+      `push-to-talk failed to register shortcut accelerator=${PUSH_TO_TALK_GLOBAL_SHORTCUT}`,
+    );
+  }
+  showIdlePushToTalkOverlay();
+}


🟡 Medium src/main.ts:2657

showIdlePushToTalkOverlay() is called unconditionally at line 2674, even when globalShortcut.register returns false at line 2662. This displays "Hold Ctrl+Option" to users when the shortcut failed to register, so the on-screen instructions don't actually work. Consider only showing the overlay when registration succeeds.

function registerPushToTalkShortcuts(): void { if (!app.isReady()) { return; } globalShortcut.unregister(PUSH_TO_TALK_GLOBAL_SHORTCUT); const registered = globalShortcut.register(PUSH_TO_TALK_GLOBAL_SHORTCUT, () => { if (pushToTalkActive) { stopPushToTalkFromShortcut(); } else { startPushToTalkFromShortcut(); } }); if (!registered) { writeDesktopLogHeader( `push-to-talk failed to register shortcut accelerator=${PUSH_TO_TALK_GLOBAL_SHORTCUT}`, ); + return; } showIdlePushToTalkOverlay(); }

🤖 Copy this AI Prompt to have your agent fix this:

In file apps/desktop/src/main.ts around lines 2657-2675: `showIdlePushToTalkOverlay()` is called unconditionally at line 2674, even when `globalShortcut.register` returns `false` at line 2662. This displays "Hold Ctrl+Option" to users when the shortcut failed to register, so the on-screen instructions don't actually work. Consider only showing the overlay when registration succeeds. Evidence trail: apps/desktop/src/main.ts lines 2657-2675 (at REVIEWED_COMMIT): `registerPushToTalkShortcuts()` function. `globalShortcut.register()` result stored in `registered` (line 2662), failure logged in `if (!registered)` block (lines 2669-2672), then `showIdlePushToTalkOverlay()` called unconditionally at line 2674.

macroscopeapp · 2026-05-10T07:23:00Z

Approvability

Verdict: Needs human review

2 blocking correctness issues found. This PR introduces a complete push-to-talk voice input feature including native macOS keyboard event handling, audio recording, local transcription, and MCP server configuration. The significant new capability combined with multiple unresolved bugs (stream leaks, race conditions, timing issues) identified in review comments warrants human review.

^{You can customize Macroscope's approvability policy. Learn more.}

Copilot

Pull request overview

This PR expands Codex configuration support by adding MCP server definitions to server settings (and injecting them into Codex session config), and introduces a desktop push-to-talk voice flow (global shortcut + overlay + local transcription) integrated into the web UI.

Changes:

Add providers.codex.mcpServers settings schema + patch support, plus docs and server-side session config injection.
Add desktop push-to-talk IPC surface, Electron main/preload implementation (overlay, native helper, local STT), and web UI hooks to start voice capture/new threads from push-to-talk events.
Adjust Electron routing history behavior to use browser history in HTTP (dev) shells.

Reviewed changes

Copilot reviewed 20 out of 21 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
packages/contracts/src/settings.ts	Adds Codex MCP server settings + patch schema.
packages/contracts/src/ipc.ts	Extends `DesktopBridge` contract with push-to-talk + transcription types/APIs.
docs/codex-mcp-integrations.md	Documents MCP server settings shape and placeholders.
apps/web/src/routes/_chat.tsx	Subscribes to push-to-talk events to auto-start a new thread for voice.
apps/web/src/main.tsx	Refines history strategy for Electron dev vs packaged shells.
apps/web/src/localApi.test.ts	Updates test desktop bridge stub for new methods.
apps/web/src/components/settings/SettingsPanels.browser.tsx	Updates browser stub for new desktop bridge methods.
apps/web/src/components/KeybindingsToast.browser.tsx	Updates base server config fixture to include `mcpServers`.
apps/web/src/components/chat/ChatComposer.tsx	Implements voice recording, transcription, overlay updates, and UI controls.
apps/server/src/serverSettings.test.ts	Adds coverage for normalizing MCP server updates.
apps/server/src/provider/Layers/ProviderInstanceRegistryLive.test.ts	Updates Codex config fixtures to include `mcpServers`.
apps/server/src/provider/Layers/CodexSessionRuntime.ts	Builds MCP session config and injects into Codex thread start params.
apps/server/src/provider/Layers/CodexSessionRuntime.test.ts	Adds tests for MCP session config rendering/interpolation.
apps/server/src/provider/Layers/CodexAdapter.ts	Passes MCP server settings into Codex session runtime options.
apps/server/src/provider/Layers/CodexAdapter.test.ts	Updates adapter validation test expectations for `mcpServers`.
apps/desktop/src/preload.ts	Exposes new push-to-talk/transcription IPC methods to the renderer.
apps/desktop/src/main.ts	Implements global shortcut, overlay window, native helper integration, and local STT transcription.
apps/desktop/scripts/build-native-helpers.mjs	Adds a build step to compile the macOS native helper.
apps/desktop/package.json	Wires native helper build into dev/build scripts.
apps/desktop/native/macos-push-to-talk-helper.swift	Adds macOS accessibility-based modifier detection helper.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+  const targetWindow = mainWindow ?? BrowserWindow.getAllWindows()[0] ?? null;
+  targetWindow?.webContents.send(PUSH_TO_TALK_EVENT_CHANNEL, { type });


+                    <TooltipPopup side="top">
+                      {isVoiceSupported
+                        ? "Hold Ctrl+Option to speak. Ctrl+Option+Space works globally."
+                        : "Voice input is unavailable in this runtime."}
+                    </TooltipPopup>


+        if (voiceRecorderRef.current || isVoiceListening) {
+          if (voiceRecorderRef.current) {
+            voiceRecorderRef.current.autoSubmit =
+              voiceRecorderRef.current.autoSubmit || options.autoSubmit;
+          }


+  updatePushToTalkOverlayPosition();
+  pushToTalkOverlayFollowTimer = setInterval(updatePushToTalkOverlayPosition, 33);
+  pushToTalkOverlayFollowTimer.unref();


+  const temporaryPath = `${modelPath}.${Crypto.randomUUID()}.tmp`;
+  const response = await fetch(LOCAL_STT_MODEL_URL);
+  if (!response.ok) {
+    throw new Error(`Failed to download local STT model: HTTP ${String(response.status)}.`);
+  }


totaland added 5 commits April 30, 2026 23:44

Add Codex MCP server settings

6a8ce8b

Add local push-to-talk voice assistant

2b04520

Fix Codex MCP provider test config

1fed778

Merge branch 'pingdotgg:main' into main

128f183

Merge branch 'pingdotgg:main' into main

b999587

Copilot AI review requested due to automatic review settings May 10, 2026 07:17

github-actions Bot added vouch:unvouched PR author is not yet trusted in the VOUCHED list. size:XXL 1,000+ changed lines (additions + deletions). labels May 10, 2026

Copilot started reviewing on behalf of totaland May 10, 2026 07:18 View session

totaland closed this May 10, 2026

chatgpt-codex-connector Bot reviewed May 10, 2026

View reviewed changes

macroscopeapp Bot reviewed May 10, 2026

View reviewed changes

Copilot AI reviewed May 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

merge from t3code #2626

merge from t3code #2626
totaland wants to merge 5 commits into
pingdotgg:mainfrom
totaland:main

totaland commented May 10, 2026 •

edited by macroscopeapp Bot

Loading

Uh oh!

coderabbitai Bot commented May 10, 2026

Review skipped

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 10, 2026

Uh oh!

chatgpt-codex-connector Bot May 10, 2026

Uh oh!

macroscopeapp Bot May 10, 2026

Uh oh!

macroscopeapp Bot May 10, 2026

Uh oh!

macroscopeapp Bot commented May 10, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		window.sessionStorage.setItem("t3code:voice-autostart", "1");
		void handleNewThread(defaultProjectRef, { envMode: defaultThreadEnvMode });

		const targetWindow = mainWindow ?? BrowserWindow.getAllWindows()[0] ?? null;
		targetWindow?.webContents.send(PUSH_TO_TALK_EVENT_CHANNEL, { type });

Conversation

totaland commented May 10, 2026 • edited by macroscopeapp Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What Changed

Why

UI Changes

Checklist

Add push-to-talk voice input and Codex MCP server configuration to the desktop app

🗂️ Filtered Issues

Uh oh!

coderabbitai Bot commented May 10, 2026

Review skipped

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp Bot commented May 10, 2026

Approvability

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

totaland commented May 10, 2026 •

edited by macroscopeapp Bot

Loading