miiflow-agent

A lightweight, unified Python SDK for LLM providers with built-in agentic patterns

miiflow-agent gives you a unified API across LLM providers, with built-in support for ReAct agents, tool calling, and streaming — all in ~15K lines of focused code.

from miiflow_agent import LLMClient, Message

# Same interface for any provider
client = LLMClient.create("openai", model="gpt-4o-mini")
response = client.chat([Message.user("Hello!")])

# Switch providers with one line
client = LLMClient.create("anthropic", model="claude-sonnet-4-20250514")

Demo of an Agentic Run

1214.mov

Why miiflow-agent?

	miiflow-agent	LangChain	LiteLLM
Codebase size	~15K lines	~500K lines	~50K lines
Dependencies	8 core	50+	20+
Built-in agents	ReAct + sub-agent hand-off	Requires setup	None
Tool system	@tool decorator	Chains	None
Learning curve	Hours	Weeks	Hours
Type safety	Full generics	Partial	Basic

The LangChain Problem

LangChain is powerful but complex. For production apps, you often fight its abstractions more than use them. miiflow-agent gives you what you actually need:

Unified provider interface — swap OpenAI → Claude → Gemini with one line
Agentic patterns built-in — a single ReAct loop with emergent planning and multi-agent hand-off, not bolted on
Simple tool system — decorate any function with @tool
Real streaming — event-based, not just token callbacks
Type-safe — full generics, proper error types

The LiteLLM Gap

LiteLLM unifies provider APIs but stops there. miiflow-agent adds:

ReAct agents with multi-hop reasoning
Sub-agent hand-off for complex multi-step tasks
Tool calling with automatic schema generation
Context injection (Pydantic AI compatible)

Installation

pip install miiflow-agent

# With optional providers
pip install miiflow-agent[groq,google]

# Everything
pip install miiflow-agent[all]

Quick Start

Basic Chat

from miiflow_agent import LLMClient, Message

client = LLMClient.create("openai", model="gpt-4o-mini")
response = client.chat([
    Message.system("You are a helpful assistant."),
    Message.user("What is Python?")
])
print(response.message.content)

Streaming

async for chunk in client.astream_chat([Message.user("Tell me a story")]):
    print(chunk.delta, end="", flush=True)

ReAct Agent with Tools

from miiflow_agent import Agent, AgentType, LLMClient, tool

@tool("calculate", "Evaluate mathematical expressions")
def calculate(expression: str) -> str:
    return str(eval(expression))

@tool("search", "Search for information")
def search(query: str) -> str:
    return f"Results for '{query}': ..."

# Create agent
agent = Agent(
    LLMClient.create("openai", model="gpt-4o"),
    agent_type=AgentType.REACT,
    max_iterations=10
)
agent.add_tool(calculate)
agent.add_tool(search)

# Run with automatic reasoning
result = await agent.run("What is 25 * 4 + the population of France?")
print(result.data)  # Agent reasons, calls tools, synthesizes answer

Context Injection (Pydantic AI Style)

from dataclasses import dataclass
from miiflow_agent import Agent, RunContext, tool

@dataclass
class UserContext:
    user_id: str
    permissions: list[str]

@tool("get_user_data")
def get_user_data(ctx: RunContext[UserContext], field: str) -> str:
    """Fetch data for the current user."""
    if "read" not in ctx.deps.permissions:
        return "Permission denied"
    return f"User {ctx.deps.user_id} data for {field}"

agent = Agent(client, deps_type=UserContext)
agent.add_tool(get_user_data)

result = await agent.run(
    "What's my account status?",
    deps=UserContext(user_id="alice", permissions=["read"])
)

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                         Your Application                         │
└─────────────────────────────┬───────────────────────────────────┘
                              │
┌─────────────────────────────▼───────────────────────────────────┐
│                          LLMClient                               │
│  • Unified interface for all providers                          │
│  • Automatic tool schema generation                             │
│  • Metrics collection & observability                           │
└─────────────────────────────┬───────────────────────────────────┘
                              │
        ┌─────────────────────┼─────────────────────┐
        │                     │                     │
        ▼                     ▼                     ▼
┌───────────────┐   ┌───────────────┐   ┌───────────────┐
│    Agent      │   │   Provider    │   │    Tools      │
│               │   │   Clients     │   │               │
│ • SINGLE_HOP  │   │               │   │ • @tool       │
│ • REACT       │   │ • OpenAI      │   │ • FunctionTool│
│ • SubAgents   │   │ • Anthropic   │   │ • HTTPTool    │
│               │   │ • Gemini      │   │ • Registry    │
│ ┌───────────┐ │   │ • More...     │   │               │
│ │Orchestrator│ │   │               │   │ ┌───────────┐ │
│ │ • plan    │ │   │               │   │ │ Schemas   │ │
│ │ • handoff │ │   │               │   │ │ • Auto-gen│ │
│ └───────────┘ │   │               │   │ │ • Validate│ │
└───────────────┘   │               │   │ └───────────┘ │
                    └───────────────┘   └───────────────┘
                              │
                              ▼
                    ┌───────────────┐
                    │   Message     │
                    │   Unified     │
                    │   Format      │
                    │               │
                    │ • Text        │
                    │ • Images      │
                    │ • Tool calls  │
                    └───────────────┘

Supported Providers

Provider	Streaming	Tool Calling	Vision	Status
OpenAI	✅	✅	✅	Stable
Anthropic	✅	✅	✅	Stable
Google Gemini	✅	✅	✅	Stable
Groq	✅	✅	-	Beta
Amazon Bedrock	✅	✅	✅	Beta
Mistral	✅	✅	-	Beta
OpenRouter	✅	✅	✅	Beta
Ollama	✅	✅	-	Beta
xAI	✅	✅	-	Beta

Stable providers are production-tested with full feature support. Beta providers are functional but may have edge cases.

Agentic Patterns

miiflow-agent runs a single, unified ReAct loop. Each turn the model emits exactly one of: tool calls (the loop continues) or a text answer (the loop exits). Planning and parallel multi-agent execution are emergent behaviors inside this one loop — there are no separate "plan & execute" or "multi-agent" orchestrators to choose between. Complex tasks are handled by letting the model plan over multiple turns and dispatch work to sub-agents as normal tool calls.

Migrating from a pre-1.8 release? The standalone PlanAndExecuteOrchestrator, MultiAgentOrchestrator, and the AgentType.PLAN_AND_EXECUTE / PARALLEL_PLAN / MULTI_AGENT enum values have been removed. Express the same outcomes with a single Agent plus sub_agents=[SubAgent(...)] — see Sub-agent hand-off below. AgentType.REACT (the default) and AgentType.SINGLE_HOP are the only modes.

ReAct (Reasoning + Acting)

The agent thinks step-by-step, deciding when to use tools, and plans across turns for multi-step tasks:

agent = Agent(client, agent_type=AgentType.REACT)

# Agent internally:
# Thought: I need to search for this information
# Action: search("topic")
# Observation: Results...
# Thought: Now I can answer
# Final Answer: ...

Sub-agent hand-off

For complex work that benefits from specialization, give the agent one or more sub-agents. The parent's ReAct loop decides when to hand off, and the dispatch happens as an ordinary tool call — no separate orchestrator. Sub-agents are wired in through AgentConfig; the framework synthesizes a dispatch_assistant-shaped tool that routes calls through the dispatch lifecycle (depth, cycle, budget, and event bubbling):

from miiflow_agent import Agent
from miiflow_agent.core.config import AgentConfig

# `sub_agents` holds objects implementing the SubAgent protocol — each one a
# parent-side "edge" to a specialist child, carrying per-edge policy such as
# `when_to_use`, `handoff_schema`, and `clarification_policy`.
lead = Agent(config=AgentConfig(
    client=client,
    system_prompt="Coordinate research and writing.",
    sub_agents=[researcher_subagent, writer_subagent],
))

result = await lead.run(
    "Research Python web frameworks, compare them, and write a summary"
)
# The ReAct loop plans, dispatches to each specialist, and synthesizes the result.

See examples/subagents.py for a complete, runnable hand-off example, and miiflow_agent/core/subagent.py for the SubAgent protocol and its per-edge policy fields.

Event Streaming

Stream real-time events during agent execution:

from miiflow_agent import Agent, AgentType, RunContext
from miiflow_agent.core.react import ReActEventType

agent = Agent(client, agent_type=AgentType.REACT)
context = RunContext(deps=None)

async for event in agent.stream_react("What is 2+2?", context):
    match event.event_type:
        case ReActEventType.THINKING_CHUNK:
            print(event.data.get("delta", ""), end="")
        case ReActEventType.TOOL_START:
            print(f"\nCalling: {event.data['tool_name']}")
        case ReActEventType.OBSERVATION:
            print(f"Result: {event.data['observation']}")
        case ReActEventType.FINAL_ANSWER:
            print(f"\nAnswer: {event.data['answer']}")

Observability

Built-in Phoenix tracing support:

from miiflow_agent.core import setup_tracing

setup_tracing(phoenix_endpoint="http://localhost:6006")

# All LLM calls are now traced

Error Handling

Comprehensive error hierarchy:

from miiflow_agent import (
    MiiflowLLMError,    # Base
    ProviderError,      # Provider-specific
    RateLimitError,     # Rate limited
    AuthenticationError, # Invalid API key
    TimeoutError,       # Request timeout
    ToolError,          # Tool execution failed
)

try:
    response = client.chat(messages)
except RateLimitError as e:
    print(f"Rate limited, retry after {e.retry_after}s")
except AuthenticationError:
    print("Check your API key")
except ProviderError as e:
    print(f"{e.provider} error: {e.message}")

Error Handling

Quickstart Guide — Get started in 5 minutes
API Reference — Complete API documentation
Tool Tutorial — Build custom tools
Agent Tutorial — Build ReAct agents
Provider Guide — Provider-specific configuration
Observability — Tracing and debugging

Contributing

We welcome contributions! Here's how to get started:

# Clone and install
git clone https://github.com/Miiflow/miiflow-agent.git
cd miiflow-agent
pip install -e ".[all]"

# Run tests
pytest tests/

# Format code
black miiflow_agent/ tests/
isort miiflow_agent/ tests/

Ways to Contribute

Report bugs — Open an issue with reproduction steps
Request features — Describe your use case
Add providers — See CONTRIBUTING.md for the provider guide
Improve docs — Fix typos, add examples
Write tests — Increase coverage

See CONTRIBUTING.md for detailed guidelines.

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.github		.github
docs		docs
examples		examples
miiflow_agent		miiflow_agent
scripts		scripts
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

miiflow-agent

Why miiflow-agent?

The LangChain Problem

The LiteLLM Gap

Installation

Quick Start

Basic Chat

Streaming

ReAct Agent with Tools

Context Injection (Pydantic AI Style)

Architecture

Supported Providers

Agentic Patterns

ReAct (Reasoning + Acting)

Sub-agent hand-off

Event Streaming

Observability

Error Handling

Error Handling

Contributing

Ways to Contribute

License

About

Uh oh!

Releases 14

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

miiflow-agent

Why miiflow-agent?

The LangChain Problem

The LiteLLM Gap

Installation

Quick Start

Basic Chat

Streaming

ReAct Agent with Tools

Context Injection (Pydantic AI Style)

Architecture

Supported Providers

Agentic Patterns

ReAct (Reasoning + Acting)

Sub-agent hand-off

Event Streaming

Observability

Error Handling

Error Handling

Contributing

Ways to Contribute

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 14

Contributors

Uh oh!

Languages