fix(transport): isolate stderr callback failures, continue reading lines by seeincodes · Pull Request #932 · anthropics/claude-agent-sdk-python

seeincodes · 2026-05-07T22:52:53Z

Summary

Fixes #929.

SubprocessCLITransport._handle_stderr wrapped the entire async for loop in a single except Exception: pass, so a raise from the user-provided options.stderr callback was caught at the outer level — the loop terminated and no further stderr lines were delivered for the rest of the session. Silent: no log, no traceback.

The repro in #929 confirmed that a callback that raises on the first line caused all subsequent lines to be dropped (callback_raised_count = 1 for a 2-line stream). The contract on `stderr: Callable[[str], None]` (types.py:1741) doesn't document any "must not raise" constraint, so this is a bug, not user error.
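The failure mode described above can be reproduced outside the SDK. The following is a minimal sketch, not the actual `_handle_stderr` source: `fake_stderr_lines`, `handle_stderr_prefix`, and `raising_callback` are hypothetical names, and the async generator stands in for the subprocess stderr stream.

```python
import asyncio
from typing import AsyncIterator, Callable


async def fake_stderr_lines() -> AsyncIterator[str]:
    # Hypothetical stand-in for the CLI's stderr text stream (two lines).
    for line in ("line 1\n", "line 2\n"):
        yield line


async def handle_stderr_prefix(callback: Callable[[str], None]) -> None:
    # Pre-fix shape: a single try/except wraps the whole loop.
    try:
        async for line in fake_stderr_lines():
            line_str = line.rstrip()
            if line_str:
                callback(line_str)
    except Exception:
        # Swallows the callback's exception and, because the exception
        # escaped the loop body, ends iteration for good.
        pass


def run_repro() -> int:
    calls = 0

    def raising_callback(line: str) -> None:
        nonlocal calls
        calls += 1
        raise RuntimeError(f"boom on {line!r}")

    asyncio.run(handle_stderr_prefix(raising_callback))
    return calls


callback_raised_count = run_repro()
print(callback_raised_count)  # 1: the second line never reaches the callback
```

Nothing is logged and nothing propagates, which is why the loss of stderr output is invisible to the caller.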

Changes

- `src/claude_agent_sdk/_internal/transport/subprocess_cli.py`: per-line `try/except` around `self._options.stderr(line_str)` so a buggy callback fails for that one line but the loop continues. The outer `except Exception: pass` becomes `logger.debug(..., exc_info=True)` so stream-read failures are at least visible at debug level. The `except anyio.ClosedResourceError` for legitimate end-of-stream is preserved.
- `tests/test_transport.py`: regression test `test_stderr_callback_raise_does_not_terminate_loop`, which feeds a 3-line stream, has the callback raise on line 1, and asserts that all 3 lines are delivered.
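The per-line isolation described above can be sketched as follows. This is an illustration under stated assumptions rather than the merged code: the names `fake_stderr_lines`, `handle_stderr_fixed`, and `raises_on_first` are hypothetical, the log messages are invented for the example, and the `anyio.ClosedResourceError` branch that the real method keeps is omitted so the sketch runs on the standard library alone.

```python
import asyncio
import logging
from typing import AsyncIterator, Callable, List

logger = logging.getLogger(__name__)


async def fake_stderr_lines() -> AsyncIterator[str]:
    # Hypothetical stand-in for the CLI's stderr text stream (three lines).
    for line in ("line 1\n", "line 2\n", "line 3\n"):
        yield line


async def handle_stderr_fixed(callback: Callable[[str], None]) -> None:
    try:
        async for line in fake_stderr_lines():
            line_str = line.rstrip()
            if not line_str:
                continue
            try:
                callback(line_str)
            except Exception:
                # A failing callback costs only this one line.
                logger.debug("stderr callback raised; continuing", exc_info=True)
    except Exception:
        # Stream-read failures are logged rather than silently dropped.
        logger.debug("error while reading stderr", exc_info=True)


def run_fixed() -> List[str]:
    delivered: List[str] = []

    def raises_on_first(line: str) -> None:
        delivered.append(line)
        if len(delivered) == 1:
            raise RuntimeError("boom")

    asyncio.run(handle_stderr_fixed(raises_on_first))
    return delivered


delivered = run_fixed()
print(delivered)  # ['line 1', 'line 2', 'line 3']
```

Logging at debug level keeps normal runs quiet while still leaving a traceback available to anyone who turns debug logging on.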

Test plan

- `uv run pytest tests/test_transport.py`: 90 passed
- `uv run mypy src/`: clean
- `ruff check` / `ruff format`: clean
- Manual repro from issue body now shows count = 3 (was count = 1 before fix)

``SubprocessCLITransport._handle_stderr`` wrapped the entire ``async for`` loop in a single ``except Exception: pass``, so a raise from the user-provided ``options.stderr`` callback was caught at the outer level: the loop terminated and no further stderr lines were delivered for the rest of the session. The failure was silent: no log, no traceback.

The regression test reproduces this: a callback that raises on the first line previously caused lines 2 and 3 to be dropped; with the fix, all three lines are delivered.

Move the ``try/except`` inside the loop and log at debug level so a buggy callback fails per line but doesn't disable stderr piping. Also log (instead of silently swallowing) at the outer level so a stream-read failure is at least visible at debug level.

Closes anthropics#929

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

codecov-commenter · 2026-05-14T23:33:56Z

Codecov Report

❌ Patch coverage is 80.00000% with 1 line in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@9aafd84). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...de_agent_sdk/_internal/transport/subprocess_cli.py | 80.00% | 1 Missing ⚠️ |

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #932   +/-   ##
=======================================
  Coverage        ?   89.27%           
=======================================
  Files           ?       23           
  Lines           ?     3982           
  Branches        ?        0           
=======================================
  Hits            ?     3555           
  Misses          ?      427           
  Partials        ?        0


…ic wait (#933)

## Summary

Fixes #928.

The two eager-flush tests assumed 2 `await asyncio.sleep(0)` yields between consecutive `enqueue` calls were enough for each drain to complete and append. Under lock contention between drains the path from `enqueue` to `store.append` needs ~4 turns (drain releases lock → next drain acquires it → `wait_for(store.append)` schedules its inner task → record). Both tests fail 5/5 locally on Python 3.11.14 / macOS arm64; CI got lucky on event-loop scheduling at merge time of #905. See #928 for the full probe and yield-count sweep.

## Changes

### Unit-level test (`test_eager_mode_flushes_per_frame`)

Replace fixed `sleep(0)` count with a new `_wait_until(predicate, timeout=1.0)` helper that yields until `len(store.append_calls)` reaches the expected value, with a 1-second deadline. Deterministic: works regardless of Python / pytest-asyncio / OS scheduling differences.

### Integration-level test (`test_eager_flush_mode_appends_per_frame_before_result`)

Convert `_make_mock_transport`'s `yield_between: bool` to `yields_between: int` (default `0`) and pass `yields_between=10` for this test, so the mock yields the loop enough times between frames for each eager flush to drain before the next frame arrives. Robust headroom: 4 was the observed minimum, 10 leaves room for slower environments. The signature change touches only one caller (this same test); other callers omit the parameter and behave identically to before.

## Test plan

- [x] `for i in 1 2 3 4 5; do uv run pytest <both tests> -q; done` → 5/5 passed (was 5/5 failed before)
- [x] `uv run pytest tests/test_transcript_mirror.py` → 42/42 passed
- [x] `ruff check / ruff format` clean

## Related issues / PRs

- Filed alongside two other fixes from the same audit pass: #929 (stderr callback swallow → PR #932), #930 (cancellation log noise → PR #931). Independent of those.

Co-authored-by: Xian Zheng <xian.zheng@challenger.gauntletai.com>
Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
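The commit message for #933 names a `_wait_until(predicate, timeout=1.0)` helper but its body is not shown here. A helper with that signature could plausibly look like the sketch below; the polling strategy, the `AssertionError` on timeout, and the `demo` / `slow_append` names are assumptions for illustration, not the code from that PR.

```python
import asyncio
import time
from typing import Callable, List


async def _wait_until(predicate: Callable[[], bool], timeout: float = 1.0) -> None:
    # Yield to the event loop until predicate() is true or the deadline passes.
    deadline = time.monotonic() + timeout
    while not predicate():
        if time.monotonic() >= deadline:
            raise AssertionError(f"condition not met within {timeout}s")
        await asyncio.sleep(0)


async def demo() -> int:
    append_calls: List[str] = []

    async def slow_append() -> None:
        # Takes several loop turns before recording, like a contended drain.
        for _ in range(4):
            await asyncio.sleep(0)
        append_calls.append("frame")

    task = asyncio.create_task(slow_append())
    # No fixed yield count: wait for the observable effect instead.
    await _wait_until(lambda: len(append_calls) == 1)
    await task
    return len(append_calls)


result = asyncio.run(demo())
print(result)  # 1
```

Waiting on the observable condition removes the dependence on how many event-loop turns a drain happens to need in a given environment.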

seeincodes mentioned this pull request May 7, 2026

test(transcript_mirror): stabilize eager-flush tests with deterministic wait #933

Merged

3 tasks

ashwin-ant approved these changes May 14, 2026

View reviewed changes

ashwin-ant merged commit 6bbad5f into anthropics:main May 14, 2026
5 checks passed

aregmii mentioned this pull request May 18, 2026

[Docs] Surface dev-deps install + test/lint commands in README (or add CONTRIBUTING.md) #966

Open

ashwin-ant merged 1 commit into anthropics:main from seeincodes:fix/stderr-callback-isolation

3 participants
