fix(claude-code): make subscription mode a true pass-through by edenreich · Pull Request #749 · inference-gateway/cli

edenreich · 2026-07-04T22:48:55Z

Summary

Claude Code mode (claude_code.enabled: true) is now a true pass-through: infer executes claude headless and streams its output without modifying the conversation.

No injections: infer's system prompt, context blocks (git/tools/skills/etc.), and system reminders are no longer added in claude_code mode — claude uses its own system prompt and native tools. infer debug agent system_prompt reports the pass-through status in this mode.
No double execution: claude executes its tools internally; their results are now captured from the stream (domain.ToolCallResultProvider / ChatSyncResponse.ToolResults) and used verbatim instead of re-running every tool call through infer's registry and approval gate. This was the root cause of the "denied writes" / broken-todo behavior in [FEATURE] Team/department attribution for gateway and pushed metrics inference-gateway#412 — blocked fake results were being replayed to claude.
TodoWrite mapping moved to the output layer: the stored/replayed conversation keeps TaskCreate/TaskUpdate verbatim; headless stdout renders one synthesized TodoWrite call with the full accumulated todo list (statuses included), so infer-action mirroring keeps working. TaskUpdate is now mapped too.
Configurable append prompt: new prompts.agent.system_prompt_claude_code (empty default = pure pass-through), passed via --append-system-prompt (never --system-prompt), settable with INFER_PROMPTS_AGENT_SYSTEM_PROMPT_CLAUDE_CODE.
Extra CLI args: new claude_code.extra_args (config), --claude-code-extra-args flag, and INFER_CLAUDE_CODE_EXTRA_ARGS env (env > flag), appended before the trailing -p.
Robust parsing: tool_result.content now accepts both string and content-block-array shapes (fixes unmarshal errors in logs).
Debug logging: the claude invocation is logged with individually quoted args; -p is always the last argument so no flag can be swallowed as its optional prompt value.

Note: the headless continuation-check <system-reminder> in cmd/agent.go is intentionally kept — it is loop control, not the reminders feature.

Verification

task test, task lint, task build all pass; mocks regenerated.
E2E via a stdin-capturing claude shim: first-request stdin contains only the user message (no system messages/reminders); replay carries TaskCreate/TaskUpdate (never TodoWrite) with claude's real tool results; stdout shows one TodoWrite call with live statuses; --append-system-prompt and extra args appear in argv only when configured.

Cross-repo impact

infer-action: should be re-tested against this build (stdout TodoWrite shape unchanged, statuses now update) — re-run a job like [FEATURE] Team/department attribution for gateway and pushed metrics inference-gateway#412.
docs: behavior change worth a [DOCS] ticket (claude_code mode is now documented pass-through; new config keys system_prompt_claude_code, extra_args).

Fixes the behavior reported in inference-gateway/inference-gateway#412.

- No infer system prompt, context blocks, or system reminders are injected in claude_code mode; claude uses its own system prompt and native tools - Stop double-executing claude's already-executed tool calls: their results are captured from the stream (domain.ToolCallResultProvider) and replayed verbatim instead of re-running them through infer's registry/approval gate - Move the TaskCreate/TaskUpdate -> TodoWrite mapping to the output layer only: headless stdout shows one synthesized TodoWrite with the full accumulated todo list; the stored/replayed conversation stays verbatim - Add prompts.agent.system_prompt_claude_code (empty default) passed via --append-system-prompt, settable with INFER_PROMPTS_AGENT_SYSTEM_PROMPT_CLAUDE_CODE - Add claude_code.extra_args (config, --claude-code-extra-args flag, INFER_CLAUDE_CODE_EXTRA_ARGS env) appended before the trailing -p - Accept tool_result content as string or content-block array (fixes unmarshal errors in logs) - Quote claude CLI args in the debug log and keep -p as the last argument Fixes the todo-mirroring and denied-writes behavior seen in inference-gateway/inference-gateway#412

…pt_claude_code and extra_args

## [0.132.1](v0.132.0...v0.132.1) (2026-07-04) ### 🐛 Bug Fixes * allow agent to read plans from userspace config dir ([#748](#748)) ([d1c1be7](d1c1be7)), closes [#746](#746) * **config:** correct GLM 5.2 context window to 1M tokens ([#747](#747)) ([164ffaf](164ffaf)), closes [#745](#745) * **claude-code:** make subscription mode a true pass-through ([#749](#749)) ([10117b4](10117b4)), closes [inference-gateway/inference-gateway#412](inference-gateway/inference-gateway#412) ### 👷 CI/CD * **deps:** update inference workflow to version 0.15.0 ([c8e1522](c8e1522))

inference-gateway-releaser · 2026-07-04T23:13:22Z

🎉 This PR is included in version 0.132.1 🎉

The release is available on:

Your semantic-release bot 📦🚀

edenreich requested a review from a team as a code owner July 4, 2026 22:48

edenreich mentioned this pull request Jul 4, 2026

[DOCS] Document Claude Code pass-through mode, system_prompt_claude_code and extra_args inference-gateway/docs#322

Open

3 tasks

docs(readme): document Claude Code pass-through behavior, system_prom…

7d6bc5a

…pt_claude_code and extra_args

edenreich mentioned this pull request Jul 4, 2026

[TASK] Refactor chat state machine to skip local tool execution in Claude Code mode #750

Open

4 tasks

refactor: replace em dahses with regular dashes

094acba

edenreich merged commit 10117b4 into main Jul 4, 2026
9 checks passed

edenreich deleted the fix/claude-code-passthrough branch July 4, 2026 23:08

edenreich mentioned this pull request Jul 4, 2026

[FEATURE] Expose Claude Code system prompt and extra args as action inputs inference-gateway/infer-action#165

Closed

4 tasks

inference-gateway-releaser Bot added the released label Jul 4, 2026

inference-gateway-maintainer Bot mentioned this pull request Jul 4, 2026

feat: expose Claude Code system prompt and extra args as action inputs inference-gateway/infer-action#166

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(claude-code): make subscription mode a true pass-through#749

fix(claude-code): make subscription mode a true pass-through#749
edenreich merged 3 commits into
mainfrom
fix/claude-code-passthrough

edenreich commented Jul 4, 2026

Uh oh!

Uh oh!

inference-gateway-releaser Bot commented Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

edenreich commented Jul 4, 2026

Summary

Verification

Cross-repo impact

Uh oh!

Uh oh!

inference-gateway-releaser Bot commented Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants