Add event hooks for streaming responses in Agent.stream()

## Context

The `Agent.stream()` method currently streams token-by-token responses but provides no hooks for applications to react to streaming events (e.g., token received, stream started, stream completed, error occurred). This forces developers to either:

1. Parse streamed text manually post-completion for state updates
2. Wrap the Agent class to inject custom streaming logic
3. Sacrifice streaming benefits to use synchronous `.run()` calls with post-processing

## Problem

Looking at the current implementation in `strands/agent.py`, the `stream()` method yields tokens but doesn't expose lifecycle events. Real-world applications need to:

- Update UI/progress indicators as tokens arrive
- Track token counts for cost monitoring
- Implement cancellation handlers
- Log or cache streaming completions
- Handle provider-specific streaming errors gracefully

Without event hooks, these become bolted-on wrappers that duplicate stream iteration logic.

## What Good Looks Like

LangChain's `BaseCallbackHandler` and LiteLLM's streaming callbacks demonstrate this pattern:

```python
class StreamingCallback:
    def on_token_received(self, token: str, **kwargs): pass
    def on_stream_start(self, **kwargs): pass
    def on_stream_end(self, **kwargs): pass
    def on_stream_error(self, error: Exception, **kwargs): pass

# Usage
agent = Agent(tools=[...], streaming_callbacks=[MyCallback()])
for chunk in agent.stream("query"):
    # Callbacks fire automatically
    pass
```

## Proposed Solution

1. Add `streaming_callbacks` parameter to `Agent.__init__()` accepting a list of callback objects
2. Define a `StreamingCallback` base class with optional methods:
   - `on_token_received(token: str, full_response: str, **kwargs)`
   - `on_stream_start(**kwargs)`
   - `on_stream_end(full_response: str, **kwargs)`
   - `on_stream_error(error: Exception, **kwargs)`
3. Update `stream()` to invoke callbacks at appropriate points
4. Document with example: token counter for billing, UI progress bar, and error recovery

## Why This Matters

Streaming is a core feature for responsive agent UX. Without events, developers can't build production-grade applications that rely on streaming (e.g., real-time dashboards, cost tracking, cancellation). This addresses issue #1819 and unblocks downstream tooling.


---
*Contributed by [Klement Gunndu](https://github.com/KlementMultiverse)*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add event hooks for streaming responses in Agent.stream() #1824

Context

Problem

What Good Looks Like

Proposed Solution

Why This Matters

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add event hooks for streaming responses in Agent.stream() #1824

Description

Context

Problem

What Good Looks Like

Proposed Solution

Why This Matters

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions