Refactor LangGraph documentation on StreamIO integration and retry policies

codekiln · codekiln · commit f46587239231 · 2025-05-02T10:35:22.000-04:00
- Updated the section on streaming messages into StreamIO with exponential backoff, clarifying the use of configurable retry policies.
- Added context on node-level retry policies in LangGraph, including code examples for implementing exponential backoff.
- Enhanced explanations of handling rate limits and optimizing retry logic for better clarity and usability.
diff --git a/pages/Programming___Error Handling___Retry Logic___Exponential Backoff.md b/pages/Programming___Error Handling___Retry Logic___Exponential Backoff.md
@@ -19,13 +19,29 @@ alias:: [[Exponential Backoff]]
 		- Reduces network congestion
 		- More efficient than fixed-interval retries
 	- ## Example Formula
-		- delay = min(max_delay, initial_delay * (base ^ attempt_number))
+		- `delay = min(max_delay, initial_delay * (base ^ attempt_number))`
 		- For base=2:
 			- 1st retry: 1s
 			- 2nd retry: 2s
 			- 3rd retry: 4s
 			- 4th retry: 8s
 			- etc.
+		- Note:
+			- The key property of *exponential backoff* is **multiplicative growth** of the wait-time:
+			  
+			  \[
+			  t_n = t_0 \times f^{\,n},
+			  \]
+			  
+			  where \(f>1\) is a constant factor and \(n\) is the retry count.  
+			  Doubling (\(f = 2\)) yields the classic binary-exponential sequence
+			  
+			  \[
+			  1\text{ s} \;\to\; 2\text{ s} \;\to\; 4\text{ s} \;\to\; 8\text{ s} \;\dots
+			  \]
+			  
+			  which is what the code does with `delay = initial_delay * (base ^ attempt_number)`.  
+			  Multiplying by a fixed constant on each failure therefore produces the exponential curve \(t_n = t_0\,2^{\,n}\).
 	- ## Implementation Examples
 		- [[LangChain/Blog/25/04/17 LangChain Python Improved Content Blocks Retry Logic and More]] - LangChain's Runnable.with_retry implementation
 	- ## See also
diff --git a/pages/langgraph___How To___Stream messages into StreamIO with Exponential Backoff.md b/pages/langgraph___How To___Stream messages into StreamIO with Exponential Backoff.md
@@ -17,24 +17,10 @@
 			- **Batching Chunks:** If rate limits are frequently hit, consider batching multiple chunks together and updating StreamIO less frequently. This reduces the number of API calls and better utilizes the allowed quota.
 			- **Skipping Redundant Updates:** In scenarios where only the latest chunk matters (e.g., replacing message content), it may be optimal to skip intermediate updates that failed due to rate limits and only update with the most recent chunk after the backoff period.
 			- **Inspect Rate Limit Headers:** Always inspect the `X-RateLimit-Remaining` and `X-RateLimit-Reset` headers in StreamIO responses to dynamically adjust backoff timing and avoid unnecessary retries.
-			- **Configurable Retry Policy:** In LangGraph, you can configure retry policies for nodes (see [How to add node retry policies](https://langchain-ai.github.io/langgraph/how-tos/node-retries/)), allowing for exponential backoff and custom retry logic on API errors like 429.
-		- ### Example Retry Policy in LangGraph
-			- Use the `RetryPolicy` when adding a node that updates StreamIO, specifying `initial_interval`, `backoff_factor`, `max_interval`, and `max_attempts`.
-			- Example:
-			  ~~~python
-			  from langgraph.pregel import RetryPolicy
-			  builder.add_node(
-			     "update_streamio",
-			     update_streamio_fn,
-			     retry=RetryPolicy(initial_interval=1.0, backoff_factor=2.0, max_interval=32.0, max_attempts=5)
-			  )
-			  ~~~
-			- This ensures that if a rate limit error occurs, the node will retry with exponential backoff, up to the specified maximum attempts.
 		- ### Summary
 			- When streaming from LangGraph to StreamIO, design your update logic to:
 				- Handle 429 errors with exponential backoff
 				- Consider batching or skipping redundant updates
-				- Use LangGraph's retry policies for robust error handling
 				- Monitor rate limit headers to optimize retry timing
 			- This approach balances responsiveness, efficiency, and compliance with StreamIO's rate limits.
 	- ## Algorithms
@@ -91,15 +77,16 @@
 			  ```
 			- Uses Stream's **partial-update** endpoint so you never overwrite undeclared fields ([Build an AI Assistant Using Python - getstream.io](https://getstream.io/blog/python-assistant/?utm_source=chatgpt.com)).
 			- Works with any LangGraph streaming mode; just adapt the buffer strategy for "replace" vs "append".
-		- ### Node-Level Retry Policy (optional)
-			- ```python
-			  from langgraph.pregel import RetryPolicy
-			  builder.add_node(
-			      "update_streamio",
-			      lambda state: stream_to_streamio(state["run_id"], state["msg_id"]),
-			      retry=RetryPolicy(initial_interval=1.0, backoff_factor=2.0,
-			                        max_interval=32.0, max_attempts=5)
-			  )
-			  
-			  ```
-			- This lets LangGraph itself re-invoke the node when a 429 bubbles up. ([Streaming](https://langchain-ai.github.io/langgraph/concepts/streaming/))
+	- ## Additional Context: Node-Level Retry Policies in LangGraph
+		- **Configurable Retry Policy:** In LangGraph, you can configure retry policies for nodes (see [How to add node retry policies](https://langchain-ai.github.io/langgraph/how-tos/node-retries/)), allowing for exponential backoff and custom retry logic on API errors like 429.
+		- **Note:** Node-level retry policies in LangGraph are only relevant if a LangGraph node is directly responsible for updating StreamIO. In the main scenario discussed above, streaming from LangGraph and updating StreamIO are decoupled, so node-level retry policies do not apply. If, however, you architect your graph such that a node performs the StreamIO update, you can use a retry policy as shown below.
+		- ```python
+		  from langgraph.pregel import RetryPolicy
+		  builder.add_node(
+		      "update_streamio",
+		      lambda state: stream_to_streamio(state["run_id"], state["msg_id"]),
+		      retry=RetryPolicy(initial_interval=1.0, backoff_factor=2.0,
+		                        max_interval=32.0, max_attempts=5)
+		  )
+		  ```
+		- This lets LangGraph itself re-invoke the node when a 429 bubbles up. ([Streaming](https://langchain-ai.github.io/langgraph/concepts/streaming/))