fix: implement real-time streaming for Agent.start() method by MervinPraison · Pull Request #1028 · MervinPraison/PraisonAI

MervinPraison · 2025-07-22T07:53:33Z

Summary

Fixes the real-time streaming issue where Agent.start() was simulating streaming by chunking complete responses instead of yielding actual real-time chunks from the underlying LLM.

Root Cause

The _start_stream() method called self.chat() which returns a complete response string, then artificially chunks it. The underlying LLM was already streaming internally but the Agent layer wasn't exposing it as a generator.

Solution

Added get_response_stream() generator method in llm.py that yields real-time chunks directly from litellm.completion()
Modified Agent._start_stream() to use the new streaming generator for custom LLM models
Maintained backward compatibility for OpenAI-style models with temporary fallback
Enhanced error handling with proper chat history management
Preserved all features including tools, knowledge search, and MCP integration

Changes Made

src/praisonai-agents/praisonaiagents/llm/llm.py
- Added get_response_stream() generator method (lines 1560-1685)
- Yields chunks directly from litellm.completion() with stream

Summary by CodeRabbit

New Features
- Introduced real-time streaming responses for AI agents, allowing users to receive incremental output as it is generated.
Bug Fixes
- Restored backward compatibility by updating the default streaming behavior for agents.
Tests
- Added new test scripts to verify streaming functionality and ensure correct behavior of streaming options.

- Add get_response_stream() generator method to LLM class for true real-time streaming - Modify Agent._start_stream() to use new streaming generator for custom LLMs - Replace simulated streaming with actual LiteLLM streaming chunks - Maintain backward compatibility for OpenAI-style models (temporary fallback) - Add comprehensive error handling and chat history management - Include test files for validation Fixes the issue where streaming was simulated by chunking complete responses instead of yielding real-time chunks from the underlying LLM. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

- Change Agent constructor default: stream=True -> stream=False - Update documentation to clarify backward compatibility priority - Ensure existing code agent.start(''prompt'') returns string not generator - Maintain streaming capability when explicitly enabled with stream=True Fixes backward compatibility issue where basic usage would return generator instead of expected string response, breaking existing codebases. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

coderabbitai · 2025-07-22T07:53:40Z

"""

Walkthrough

This change refactors the streaming logic for AI agents by updating the default stream parameter to False, introducing a robust streaming generator method in the LLM class, and extensively rewriting the agent's _start_stream method to support both true and simulated streaming. Three new standalone test scripts are added to verify streaming functionality, import correctness, and backward compatibility.

Changes

File(s)	Change Summary
src/praisonai-agents/praisonaiagents/agent/agent.py	Refactored `_start_stream` to support true/simulated streaming; changed Agent's `stream` default to `False`; updated docstrings.
src/praisonai-agents/praisonaiagents/llm/llm.py	Added `get_response_stream` generator to LLM for real-time streaming; includes error handling and streaming fallback.
test_streaming.py	New test script: runs real streaming test with an Agent, printing chunked output and reporting chunk count.
test_streaming_basic.py	New test script: verifies imports, method existence, and Agent streaming setup without external dependencies or API calls.
test_streaming_logic.py	New test script: checks default and explicit `stream` parameter values, and streaming logic for backward compatibility.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant Agent
    participant LLM

    User->>Agent: start(prompt, stream=True)
    alt Custom LLM
        Agent->>LLM: get_response_stream(...)
        loop For each chunk
            LLM-->>Agent: yield response chunk
            Agent-->>User: yield response chunk
        end
        Agent->>LLM: (append full response to history)
    else Non-custom LLM
        Agent->>LLM: chat(...)
        LLM-->>Agent: full response
        loop For each simulated chunk
            Agent-->>User: yield chunk
        end
    end

Estimated code review effort

4 (~90 minutes)

Possibly related PRs

Develop #448: Both PRs modify streaming-related parameters and behavior in the Agent class, focusing on different methods.
fix: implement real-time streaming for Agent.start() method #1027: The main PR builds upon and enhances the streaming implementation first introduced in this PR.
fix: add stream parameter to disable streaming for custom LLMs #701: The main PR extends the streaming configurability added in this earlier PR, with more extensive code changes.

Suggested labels

Review effort 4/5

Poem

In the warren, code streams anew,
Chunks of thought hop swiftly through.
Agents and LLMs, side by side,
Delivering answers, chunkified.
With tests that leap and logic tight,
This bunny cheers, "Streaming's right!"
🐇✨
"""

📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4f3276b and 662e155.

📒 Files selected for processing (1)

src/praisonai-agents/praisonaiagents/llm/llm.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

src/praisonai-agents/praisonaiagents/llm/llm.py

✨ Finishing Touches

📝 Generate Docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

MervinPraison · 2025-07-22T07:53:42Z

@claude review this pull request and do a detailed analysis and fix this if the existing code doesn't have the solution implemented. Making sure it has backward compatibility, no existing features removed. After making those changes, again review the applied changes. Use @web to search if you dont know any information or to find the latest documentation or to find the latest version. Run the code if you think you need to run it to test it. Minimal code change to start with if required any changes.

github-actions · 2025-07-22T07:53:52Z

gemini-code-assist

Summary of Changes

Hello @MervinPraison, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly upgrades the Agent.start() method to provide genuine real-time streaming from large language models, moving away from a simulated chunking approach. It introduces a dedicated streaming generator in the LLM layer and integrates it seamlessly into the Agent's workflow, while carefully preserving backward compatibility for existing implementations and enhancing error resilience.

Highlights

Real-time Streaming Implementation: Implemented true real-time streaming for the Agent.start() method by introducing a new get_response_stream() generator in llm.py that directly yields chunks from the underlying LLM (via litellm.completion(stream=True)), replacing the previous simulated chunking of complete responses.
Agent Integration and Logic: Modified Agent._start_stream() to leverage the new get_response_stream() for custom LLM models, ensuring proper handling of knowledge search, tool integration (including MCP conversion), and robust chat history management with rollback on errors.
Backward Compatibility & Defaults: Changed the default value of the stream parameter in the Agent constructor from True to False to maintain backward compatibility. For OpenAI-style models, a temporary fallback to the previous simulated streaming behavior is maintained.
Enhanced Error Handling: Improved error handling within the streaming process, specifically for custom LLM models, by implementing chat history rollback in case of streaming failures and ensuring the verbose mode is correctly restored.
New Test Coverage: Added three new test files (test_streaming.py, test_streaming_basic.py, test_streaming_logic.py) to validate the real-time streaming functionality, test the underlying infrastructure, and confirm the correct behavior of the stream parameter and backward compatibility.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments or fill out our survey to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a significant improvement by implementing true real-time streaming for the Agent.start() method, which was previously simulated. The changes are well-structured, and the inclusion of new tests is commendable.

I've found one potential issue in llm.py that could lead to unexpected behavior. Additionally, there are opportunities to improve code maintainability by refactoring duplicated code in agent.py and adhering to Python's import conventions. I've also noted some minor style issues in the new test files.

gemini-code-assist · 2025-07-22T07:55:41Z

src/praisonai-agents/praisonaiagents/llm/llm.py

+            messages, original_prompt = self._build_messages(
+                prompt=prompt,
+                system_prompt=system_prompt,
+                chat_history=chat_history,
+                output_json=output_json,
+                output_pydantic=output_pydantic,
+                temperature=temperature
+            )


The _build_messages method is being called within get_response_stream without passing the temperature argument, even though it's specified as a parameter in get_response_stream. This inconsistency could lead to unexpected behavior or the use of a default temperature value when a specific one is intended.

Consider explicitly passing the temperature argument to _build_messages to ensure consistency and avoid potential issues.

gemini-code-assist · 2025-07-22T07:55:42Z

src/praisonai-agents/praisonaiagents/agent/agent.py

+                # Handle knowledge search
+                actual_prompt = prompt
+                if self.knowledge:
+                    search_results = self.knowledge.search(prompt, agent_id=self.agent_id)
+                    if search_results:
+                        if isinstance(search_results, dict) and 'results' in search_results:
+                            knowledge_content = "\n".join([result['memory'] for result in search_results['results']])
+                        else:
+                            knowledge_content = "\n".join(search_results)
+                        actual_prompt = f"{prompt}\n\nKnowledge: {knowledge_content}"
+
+                # Handle tools properly
+                tools = kwargs.get('tools', self.tools)
+                if tools is None or (isinstance(tools, list) and len(tools) == 0):
+                    tool_param = self.tools
+                else:
+                    tool_param = tools

-                for i in range(0, len(words), chunk_size):
-                    chunk_words = words[i:i + chunk_size]
-                    chunk = ' '.join(chunk_words)
+                # Convert MCP tools if needed
+                if tool_param is not None:
+                    from ..mcp.mcp import MCP
+                    if isinstance(tool_param, MCP) and hasattr(tool_param, 'to_openai_tool'):
+                        openai_tool = tool_param.to_openai_tool()
+                        if openai_tool:
+                            if isinstance(openai_tool, list):
+                                tool_param = openai_tool
+                            else:
+                                tool_param = [openai_tool]
+


This block of code for handling knowledge search, tools, and MCP tool conversion is very similar to the logic in the chat method (lines 1212-1249). This duplication can make the code harder to maintain, as any future changes will need to be made in both places.

To improve maintainability, I recommend refactoring this duplicated logic into a shared private helper method. This method could take the prompt and tools as input and return the processed actual_prompt and tool_param.

gemini-code-assist · 2025-07-22T07:55:42Z

src/praisonai-agents/praisonaiagents/agent/agent.py

-                    chunk = ' '.join(chunk_words)
+                # Convert MCP tools if needed
+                if tool_param is not None:
+                    from ..mcp.mcp import MCP


According to PEP 8, imports should be at the top of the file. This local import makes the code less readable and can hide dependency issues.

Please move this import to the top of the file with the other imports.

from ..mcp.mcp import MCP

gemini-code-assist · 2025-07-22T07:55:42Z

src/praisonai-agents/praisonaiagents/llm/llm.py

+            Exception: If streaming fails or LLM call encounters an error
+        """
+        try:
+            import litellm


Per PEP 8, imports should be at the top of the file. Placing imports inside functions or methods can lead to circular dependency issues and makes it harder to see the file's dependencies at a glance.

Please move import litellm to the top of llm.py.

import litellm

gemini-code-assist · 2025-07-22T07:55:42Z

test_streaming.py

+except Exception as e:
+    print(f"❌ Error during streaming test: {e}")
+    import traceback
+    traceback.print_exc()


The file is missing a final newline character. It's a standard convention (part of POSIX standards) to end text files with a newline. This can prevent issues with some tools and file processing scripts.

gemini-code-assist · 2025-07-22T07:55:42Z

test_streaming_basic.py

+        print("✅ Streaming infrastructure is properly set up")
+        print("📝 Note: Real streaming tests require API keys and will be tested later")
+    else:
+        print("❌ Some tests failed - check the implementation")


This file is missing a final newline character. It's a common convention to end files with a newline to ensure file integrity and compatibility with various tools.

gemini-code-assist · 2025-07-22T07:55:42Z

test_streaming_logic.py

+
+if __name__ == "__main__":
+    success = test_streaming_logic()
+    sys.exit(0 if success else 1)


This file is missing a final newline character. It's a good practice to end all text files with a newline for consistency and to avoid issues with certain command-line tools.

cursor

Bug: Incorrect Parameter Passed in Streaming Method

The get_response_stream method incorrectly passes a temperature parameter to self._build_messages(). This will cause a TypeError because the _build_messages method's signature does not accept a temperature argument. This issue prevents the new streaming functionality from working. The temperature parameter should be removed from this call, as it is correctly used later in the litellm.completion call.

src/praisonai-agents/praisonaiagents/llm/llm.py#L1613-L1622

PraisonAI/src/praisonai-agents/praisonaiagents/llm/llm.py

Lines 1613 to 1622 in 6663b9b

    
           # Build messages using existing logic 
        
           messages, original_prompt = self._build_messages( 
        
               prompt=prompt, 
        
               system_prompt=system_prompt, 
        
               chat_history=chat_history, 
        
               output_json=output_json, 
        
               output_pydantic=output_pydantic, 
        
               temperature=temperature 
        
           )

Fix in Cursor • Fix in Web

BugBot free trial expires on July 29, 2025
Learn more in the Cursor dashboard.

Was this report helpful? Give feedback by reacting with 👍 or 👎

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (9)

src/praisonai-agents/praisonaiagents/llm/llm.py (3)
1614-1622: Remove unused temperature parameter from _build_messages call.

The temperature parameter is passed to _build_messages but this helper method doesn't use or need this parameter. This creates unnecessary coupling.
            messages, original_prompt = self._build_messages(
                prompt=prompt,
                system_prompt=system_prompt,
                chat_history=chat_history,
                output_json=output_json,
-               output_pydantic=output_pydantic,
-               temperature=temperature
+               output_pydantic=output_pydantic
            )
1633-1657: Streaming implementation is sound but consider adding more context to error logging.

The real-time streaming logic correctly processes chunks and yields content. However, the error handling could provide more context about what specifically failed.
                except Exception as e:
-                   logging.error(f"Streaming failed: {e}")
+                   logging.error(f"Streaming failed for model {self.model} with tools={bool(formatted_tools)}: {e}")
                    # Fall back to non-streaming if streaming fails
                    use_streaming = False
1560-1685: Consider limitations compared to full get_response method.

This streaming method focuses solely on content generation and doesn't implement the complex features of get_response like:

Tool calling iterations and execution

Self-reflection loops

Reasoning steps handling

Ollama-specific sequential logic

This is acceptable for the stated purpose of real-time streaming, but consider documenting these limitations in the docstring or adding a note about when to use get_response vs get_response_stream.

Add to the docstring:
        Args:
            prompt: The prompt to send to the LLM
            system_prompt: Optional system prompt
            chat_history: Optional chat history
            temperature: Sampling temperature
            tools: Optional list of tools for function calling
+           
+        Note:
+            This method focuses on real-time streaming and does not support
+            complex features like tool execution, self-reflection, or reasoning
+            steps. Use get_response() for full functionality with tools.
test_streaming.py (2)
18-18: Consider parameterizing the LLM model for flexibility.

The hardcoded Gemini model may not be available in all environments. Consider making it configurable or providing fallback options.
+# Allow model override via environment variable
+DEFAULT_MODEL = "gemini/gemini-2.5-flash"
+test_model = os.getenv('TEST_LLM_MODEL', DEFAULT_MODEL)
+
 agent = Agent(
     instructions="You are a helpful assistant",
-    llm="gemini/gemini-2.5-flash",
+    llm=test_model,
     stream=True,
     verbose=False  # Reduce noise during testing
 )
35-38: Enhance success criteria for streaming validation.

The current logic only checks chunk count, but real streaming should also consider timing. Consider adding temporal validation to ensure chunks arrive progressively.
+import time
+
 # Test the streaming functionality
 chunk_count = 0
+start_time = time.time()
+first_chunk_time = None
+
 for chunk in agent.start("Write a short paragraph about the benefits of real-time streaming in AI applications"):
+    if chunk_count == 0:
+        first_chunk_time = time.time()
     print(chunk, end="", flush=True)
     chunk_count += 1

+end_time = time.time()
+total_time = end_time - start_time
+time_to_first_chunk = first_chunk_time - start_time if first_chunk_time else 0

 print("\n" + "=" * 50)
-print(f"✅ Streaming test completed! Received {chunk_count} chunks.")
+print(f"✅ Streaming test completed! Received {chunk_count} chunks in {total_time:.2f}s")
+print(f"⏱️  Time to first chunk: {time_to_first_chunk:.2f}s")

-if chunk_count > 1:
+if chunk_count > 1 and time_to_first_chunk < total_time * 0.8:
     print("✅ SUCCESS: Real-time streaming is working - received multiple chunks!")
 else:
-    print("⚠️  WARNING: Only received 1 chunk - may still be using simulated streaming")
+    print("⚠️  WARNING: Streaming behavior suggests simulated streaming")
test_streaming_logic.py (3)
25-32: Fix f-string usage and boolean comparison style.

Static analysis correctly identifies unnecessary f-string prefix and non-Pythonic boolean comparison.
-        print(f"   ✅ Agent created successfully")
+        print("   ✅ Agent created successfully")
         print(f"   📊 stream attribute: {getattr(agent1, 'stream', 'NOT SET')}")
         
-        if agent1.stream == False:
+        if not agent1.stream:
             print("   ✅ CORRECT: stream defaults to False (backward compatible)")
         else:
             print("   ❌ INCORRECT: stream should default to False")
             return False
42-49: Fix f-string usage and boolean comparison style.

Apply similar fixes for consistency.
-        print(f"   ✅ Agent created successfully")  
+        print("   ✅ Agent created successfully")
         print(f"   📊 stream attribute: {agent2.stream}")
         
-        if agent2.stream == False:
+        if not agent2.stream:
             print("   ✅ CORRECT: stream=False works")
         else:
             print("   ❌ INCORRECT: stream=False not working")
             return False
59-66: Fix f-string usage and boolean comparison style.

Apply similar fixes for consistency.
-        print(f"   ✅ Agent created successfully")
+        print("   ✅ Agent created successfully")
         print(f"   📊 stream attribute: {agent3.stream}")
         
-        if agent3.stream == True:
+        if agent3.stream:
             print("   ✅ CORRECT: stream=True works")
         else:
             print("   ❌ INCORRECT: stream=True not working")
             return False
src/praisonai-agents/praisonaiagents/agent/agent.py (1)

1956-1959: Consider documenting the verbose mode suppression during streaming.

The implementation correctly prevents console output conflicts by temporarily disabling verbose mode during streaming. However, this creates a behavioral difference where verbose output is suppressed during streaming but not during regular chat. Consider documenting this behavior or providing an alternative that maintains verbose output compatibility.

Also applies to: 2058-2059, 2062-2063

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ca420fe and 6663b9b.

📒 Files selected for processing (5)

src/praisonai-agents/praisonaiagents/agent/agent.py (3 hunks)
src/praisonai-agents/praisonaiagents/llm/llm.py (1 hunks)
test_streaming.py (1 hunks)
test_streaming_basic.py (1 hunks)
test_streaming_logic.py (1 hunks)

📓 Path-based instructions (4)

src/praisonai-agents/praisonaiagents/llm/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Files:

src/praisonai-agents/praisonaiagents/llm/llm.py

src/praisonai-agents/praisonaiagents/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

src/praisonai-agents/praisonaiagents/**/*.py: All major components should support async execution, providing both synchronous and asynchronous methods (e.g., execute and aexecute).
Global error logging should use the error_logs list, and a callback system should be provided for real-time error reporting.
Context length exceptions should be handled with automatic retry logic.
Graceful degradation should be implemented for optional dependencies.

Files:

src/praisonai-agents/praisonaiagents/llm/llm.py
src/praisonai-agents/praisonaiagents/agent/agent.py

src/praisonai-agents/praisonaiagents/agent/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

Agent creation should use the Agent class with parameters such as name, role, goal, backstory, llm, self_reflect, min_reflect, max_reflect, tools, guardrail, and max_guardrail_retries.

Files:

src/praisonai-agents/praisonaiagents/agent/agent.py

src/praisonai-agents/praisonaiagents/{agent,task}/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

src/praisonai-agents/praisonaiagents/{agent,task}/**/*.py: Function-based guardrails for tasks should return either a GuardrailResult or a tuple (bool, Any).
LLM-based guardrails can be specified as a string description in the guardrail parameter for tasks or agents.

Files:

src/praisonai-agents/praisonaiagents/agent/agent.py

🧠 Learnings (6)

📓 Common learnings

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in `praisonaiagents/llm/` and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/llm/llm.ts : The LLM class in 'llm.ts' should wrap 'aisdk.generateText' calls for language model responses

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/agent/**/*.py : Agent creation should use the `Agent` class with parameters such as `name`, `role`, `goal`, `backstory`, `llm`, `self_reflect`, `min_reflect`, `max_reflect`, `tools`, `guardrail`, and `max_guardrail_retries`.

test_streaming.py (4)

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Use example-driven testing rather than a formal test runner; each test file should be runnable individually.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Test files should be placed in the tests/ directory and demonstrate specific usage patterns, serving as both test and documentation.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/main.ts : The main script 'src/main.ts' should set up logging, callbacks, and integrate with 'aisdk' if needed, mirroring the Python 'main.py'

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.cursorrules:0-0
Timestamp: 2025-07-18T14:39:32.711Z
Learning: Applies to src/praisonai-ts/src/tools/test.ts : The 'src/tools/test.ts' file should import and test each tool's main functions or classes for quick local validation.

test_streaming_logic.py (4)

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Use example-driven testing rather than a formal test runner; each test file should be runnable individually.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/agent/**/*.py : Agent creation should use the Agent class with parameters such as name, role, goal, backstory, llm, self_reflect, min_reflect, max_reflect, tools, guardrail, and max_guardrail_retries.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Test files should be placed in the tests/ directory and demonstrate specific usage patterns, serving as both test and documentation.

test_streaming_basic.py (8)

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Use example-driven testing rather than a formal test runner; each test file should be runnable individually.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/tests/**/*.py : Test files should be placed in the tests/ directory and demonstrate specific usage patterns, serving as both test and documentation.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/main.ts : The main script 'src/main.ts' should set up logging, callbacks, and integrate with 'aisdk' if needed, mirroring the Python 'main.py'

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.cursorrules:0-0
Timestamp: 2025-07-18T14:39:32.711Z
Learning: Applies to src/praisonai-ts/src/main.ts : The main setup file 'src/main.ts' should handle logging, display callbacks, and integration with 'aisdk' as needed, mirroring the Python 'main.py' functionality.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/agent/**/*.py : Agent creation should use the Agent class with parameters such as name, role, goal, backstory, llm, self_reflect, min_reflect, max_reflect, tools, guardrail, and max_guardrail_retries.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.cursorrules:0-0
Timestamp: 2025-07-18T14:39:32.711Z
Learning: Applies to src/praisonai-ts/src/index.ts : The main entry point 'src/index.ts' should re-export key classes and functions (such as Agent, Agents, Task, etc.) for simple import paths.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.cursorrules:0-0
Timestamp: 2025-07-18T14:39:32.711Z
Learning: Applies to src/praisonai-ts/src/tools/test.ts : The 'src/tools/test.ts' file should import and test each tool's main functions or classes for quick local validation.

src/praisonai-agents/praisonaiagents/llm/llm.py (2)

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/llm/llm.ts : The LLM class in 'llm.ts' should wrap 'aisdk.generateText' calls for language model responses

src/praisonai-agents/praisonaiagents/agent/agent.py (3)

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/agent/**/*.py : Agent creation should use the Agent class with parameters such as name, role, goal, backstory, llm, self_reflect, min_reflect, max_reflect, tools, guardrail, and max_guardrail_retries.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/llm/llm.ts : The LLM class in 'llm.ts' should wrap 'aisdk.generateText' calls for language model responses

🪛 Ruff (0.12.2)

test_streaming_logic.py

25-25: f-string without any placeholders

Remove extraneous f prefix

(F541)

28-28: Avoid equality comparisons to False; use not agent1.stream: for false checks

Replace with not agent1.stream

(E712)

42-42: f-string without any placeholders

Remove extraneous f prefix

(F541)

45-45: Avoid equality comparisons to False; use not agent2.stream: for false checks

Replace with not agent2.stream

(E712)

59-59: f-string without any placeholders

Remove extraneous f prefix

(F541)

62-62: Avoid equality comparisons to True; use agent3.stream: for truth checks

Replace with agent3.stream

(E712)

test_streaming_basic.py

19-19: praisonaiagents.llm.llm.LLM imported but unused

Remove unused import: praisonaiagents.llm.llm.LLM

(F401)

23-23: praisonaiagents.agent.agent.Agent imported but unused

Remove unused import: praisonaiagents.agent.agent.Agent

(F401)

🧰 Additional context used

📓 Path-based instructions (4)

src/praisonai-agents/praisonaiagents/llm/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

LLM integration should use the unified wrapper in praisonaiagents/llm/ and support multiple providers via LiteLLM.

Files:

src/praisonai-agents/praisonaiagents/llm/llm.py

src/praisonai-agents/praisonaiagents/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

src/praisonai-agents/praisonaiagents/**/*.py: All major components should support async execution, providing both synchronous and asynchronous methods (e.g., execute and aexecute).
Global error logging should use the error_logs list, and a callback system should be provided for real-time error reporting.
Context length exceptions should be handled with automatic retry logic.
Graceful degradation should be implemented for optional dependencies.

Files:

src/praisonai-agents/praisonaiagents/llm/llm.py
src/praisonai-agents/praisonaiagents/agent/agent.py

src/praisonai-agents/praisonaiagents/agent/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

Agent creation should use the Agent class with parameters such as name, role, goal, backstory, llm, self_reflect, min_reflect, max_reflect, tools, guardrail, and max_guardrail_retries.

Files:

src/praisonai-agents/praisonaiagents/agent/agent.py

src/praisonai-agents/praisonaiagents/{agent,task}/**/*.py

📄 CodeRabbit Inference Engine (src/praisonai-agents/CLAUDE.md)

src/praisonai-agents/praisonaiagents/{agent,task}/**/*.py: Function-based guardrails for tasks should return either a GuardrailResult or a tuple (bool, Any).
LLM-based guardrails can be specified as a string description in the guardrail parameter for tasks or agents.

Files:

src/praisonai-agents/praisonaiagents/agent/agent.py

🧠 Learnings (6)

📓 Common learnings

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/llm/**/*.py : LLM integration should use the unified wrapper in `praisonaiagents/llm/` and support multiple providers via LiteLLM.

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-ts/.windsurfrules:0-0
Timestamp: 2025-07-18T14:40:53.719Z
Learning: Applies to src/praisonai-ts/src/llm/llm.ts : The LLM class in 'llm.ts' should wrap 'aisdk.generateText' calls for language model responses

Learnt from: CR
PR: MervinPraison/PraisonAI#0
File: src/praisonai-agents/CLAUDE.md:0-0
Timestamp: 2025-07-21T11:20:34.197Z
Learning: Applies to src/praisonai-agents/praisonaiagents/agent/**/*.py : Agent creation should use the `Agent` class with parameters such as `name`, `role`, `goal`, `backstory`, `llm`, `self_reflect`, `min_reflect`, `max_reflect`, `tools`, `guardrail`, and `max_guardrail_retries`.