AI Intelligence Engine System

System Overview
Architecture
Marcus Ecosystem Integration
Workflow Integration
Core Features
Technical Implementation
Pros and Cons
Design Rationale
Future Evolution
Task Complexity Handling
Board-Specific Considerations
Cato Integration

System Overview

The AI Intelligence Engine is Marcus's AI-powered analysis and decision engine that provides intelligent task assignment, blocker resolution, and project risk analysis using Claude API. The system gracefully falls back to rule-based approaches when AI is unavailable, ensuring Marcus continues functioning even without AI access.

Primary Location: src/integrations/ai_analysis_engine.py

Core Principle

AI-enhanced decision making with graceful degradation.

The system operates on a pragmatic principle: use AI when available for intelligent analysis, fall back to deterministic rules when AI is unavailable. This ensures reliability while leveraging AI's semantic understanding capabilities when possible.

Key Capabilities

Intelligent Task Assignment: AI-powered task-to-agent matching based on skills, capacity, and project context
Task Instruction Generation: Context-aware, detailed task instructions tailored to developer experience
Blocker Analysis: Root cause analysis and resolution suggestions for task blockers
Project Risk Assessment: Proactive risk identification with mitigation strategies
Project Health Analysis: Overall project health monitoring and recommendations
Feature Request Analysis: Intelligent analysis of feature requests and implementation planning
Graceful Fallback: Automatic fallback to rule-based logic when AI unavailable

Architecture

Single-Class Design

The AI Intelligence Engine is implemented as a single, cohesive class:

graph TB
    subgraph "AIAnalysisEngine"
        Init[Initialization]
        Claude[Claude API Client]
        Prompts[Prompt Templates]
        Analysis[Analysis Methods]
        Fallback[Fallback Logic]
        Tracking[Token Tracking]
    end

    Init --> Claude
    Init --> Prompts
    Analysis --> Claude
    Analysis --> Fallback
    Claude --> Tracking

    style Claude fill:#e1f5fe
    style Fallback fill:#fff3e0
    style Analysis fill:#e8f5e9

Component Structure:

AIAnalysisEngine: Single class providing all AI analysis functionality
- Claude API client (Anthropic SDK)
- Prompt template system
- Analysis methods for different use cases
- Fallback methods for when AI unavailable
- Token usage tracking integration

Core Class Structure

class AIAnalysisEngine:
    """
    AI-powered analysis and decision engine using Claude API.

    Provides intelligent analysis for project management decisions with
    graceful fallback to rule-based approaches when AI is unavailable.
    """

    def __init__(self) -> None:
        # Initialize Anthropic client
        self.client: Optional[anthropic.Anthropic] = None
        self.current_project_id: Optional[str] = None
        self.current_agent_id: Optional[str] = None

        # Load API key from config or environment
        api_key = config.get("ai.anthropic_api_key") or os.getenv("ANTHROPIC_API_KEY")
        if api_key:
            self.client = anthropic.Anthropic(api_key=api_key)

        # Model configuration
        self.model: str = "claude-3-5-sonnet-20241022"

        # Prompt templates (4 keys)
        self.prompts: Dict[str, str] = {
            "task_assignment": "...",
            "task_instructions": "...",
            "blocker_analysis": "...",
            "project_risk": "..."
        }
        # Note: project_health and feature_analysis methods build prompts inline
        # and are NOT stored as keys in self.prompts

Marcus Ecosystem Integration

Position in Marcus Architecture

The AI Intelligence Engine provides cognitive capabilities across the Marcus ecosystem:

graph TB
    subgraph "Marcus Ecosystem"
        MCP[MCP Tools]
        TM[Task Management]
        AIE[AI Intelligence Engine]
        Memory[Memory System]
        Kanban[Kanban Integration]
        Tokens[Token Tracking]
    end

    MCP --> AIE
    TM --> AIE
    AIE --> Memory
    AIE --> Kanban
    AIE --> Tokens

    style AIE fill:#e8f5e8,stroke:#4caf50,stroke-width:3px

Integration Points

Consumers of AI Engine:

MCP Tools: create_project, request_next_task, report_blocker
Task Management: Task assignment optimization
Project Analysis: Risk assessment and health monitoring

Dependencies:

Anthropic SDK: Direct Claude API integration
Token Tracking: Cost monitoring via ai_usage_middleware
Config System: API key and model configuration
Memory System: Historical context for analysis

Data Flow

Input: Task data, agent profiles, project state, blocker descriptions
Processing:
- Check if Claude API available
- If available: AI analysis via Claude
- If unavailable: Fallback to rule-based logic
Output: Structured decisions, instructions, risk assessments

Workflow Integration

The AI Intelligence Engine is invoked at key points in the Marcus workflow:

1. Project Creation (`create_project`)

MCP Tool Integration:

# Called from create_project MCP tool
# Handles PRD parsing and task generation
# See: src/intelligence/prd_parser.py and intelligent_task_generator.py

Note: PRD parsing is handled by separate intelligence modules (PRDParser, IntelligentTaskGenerator), not directly by AIAnalysisEngine. See System 23 (Task Management Intelligence) for details.

2. Task Assignment (`request_next_task`)

AI Engine Method:

async def match_task_to_agent(
    self,
    available_tasks: List[Task],
    agent: WorkerStatus,
    project_state: ProjectState
) -> Optional[Task]:
    """Find optimal task for agent using AI analysis."""

    if not self.client:
        return self._fallback_task_matching(available_tasks, agent)

    # Prepare data for AI (limit to 10 tasks to avoid context limits)
    tasks_data = [serialize(t) for t in available_tasks[:10]]

    # Call Claude with task assignment prompt
    prompt = self.prompts["task_assignment"].format(
        tasks=json.dumps(tasks_data),
        agent=json.dumps(serialize(agent)),
        project_state=project_state.value
    )

    response = await self._call_claude(prompt)
    result = json.loads(response)

    # Find and return the recommended task
    return find_task_by_id(result["recommended_task_id"])

Fallback Logic:

def _fallback_task_matching(
    self, tasks: List[Task], agent: WorkerStatus
) -> Optional[Task]:
    """Rule-based task matching when AI unavailable."""

    # Score tasks by priority and skill match
    priority_scores = {
        Priority.URGENT: 10,
        Priority.HIGH: 3,
        Priority.MEDIUM: 2,
        Priority.LOW: 1,
    }

    # Find best scoring task
    best_task = max(tasks, key=lambda t: (
        priority_scores.get(t.priority, 0) +
        skill_match_score(t, agent)
    ))

    return best_task

3. Task Instructions (`generate_task_instructions`)

AI Engine Method:

async def generate_task_instructions(
    self, task: Task, agent: WorkerStatus
) -> str:
    """Generate detailed instructions for a task."""

    if not self.client:
        return self._generate_fallback_instructions(task, agent)

    # AI-generated instructions tailored to task type
    prompt = self.prompts["task_instructions"].format(
        task=json.dumps(serialize(task)),
        agent=json.dumps(serialize(agent))
    )

    instructions = await self._call_claude(prompt)
    return instructions

Fallback Logic:

def _generate_fallback_instructions(
    self, task: Task, agent: WorkerStatus
) -> str:
    """Template-based instructions when AI unavailable."""

    instructions = f"""Task: {task.name}

Description:
{task.description}

Steps:
1. Review the task requirements
2. Implement the necessary changes
3. Test your implementation
4. Submit for review

Acceptance Criteria:
{task.acceptance_criteria or 'Complete the task as described'}
"""
    return instructions

4. Blocker Analysis (`report_blocker`)

AI Engine Method:

async def analyze_blocker(
    self,
    task_id: str,
    description: str,
    severity: str,
    agent: Optional["WorkerStatus"] = None,
    task: Optional["Task"] = None
) -> Dict[str, Any]:
    """Analyze blocker and suggest resolution steps."""

    if not self.client:
        return self._generate_fallback_blocker_analysis(...)

    # AI analysis with agent context
    prompt = self.prompts["blocker_analysis"].format(
        task_id=task_id,
        description=description,
        severity=severity,
        agent_context=f"Agent: {agent.name}, Skills: {agent.skills}",
        task_context=f"Task: {task.name}"
    )

    response = await self._call_claude(prompt)
    return json.loads(response)

Fallback Logic:

def _generate_fallback_blocker_analysis(
    self, description: str, severity: str, ...
) -> Dict[str, Any]:
    """Rule-based blocker analysis."""

    return {
        "root_cause": f"Blocker: {description}",
        "impact_assessment": f"Severity: {severity}",
        "resolution_steps": [
            "Review the error message or issue",
            "Check documentation for similar issues",
            "Consider asking team for assistance"
        ],
        "estimated_hours": 2.0,
        "escalation_needed": severity == "high"
    }

5. Project Risk Analysis (`analyze_project_risks`)

AI Engine Method:

async def analyze_project_risks(
    self,
    project_state: ProjectState,
    recent_blockers: List[BlockerReport],
    team_status: List[WorkerStatus]
) -> List[ProjectRisk]:
    """Analyze and identify project risks."""

    if not self.client:
        return self._generate_fallback_risk_analysis(project_state)

    # AI-powered risk identification
    prompt = self.prompts["project_risk"].format(
        project_state=project_state.value,
        recent_blockers=json.dumps([serialize(b) for b in recent_blockers[:5]]),
        team_status=json.dumps([serialize(a) for a in team_status])
    )

    response = await self._call_claude(prompt)
    risks_data = json.loads(response)

    # Convert to ProjectRisk objects
    return [
        ProjectRisk(
            risk_level=RiskLevel[r["impact"].upper()],
            description=r["description"],
            mitigation_strategy=r["mitigation"]
        )
        for r in risks_data["risks"]
    ]

Core Features

1. AI-Enhanced Decision Making

What makes it special: When Claude API is available, provides semantic understanding of tasks, agents, and context for intelligent decisions.

Capabilities:

Semantic task-agent matching (not just keyword matching)
Context-aware instruction generation
Intelligent blocker root cause analysis
Risk pattern identification across project history
Recommendations tailored to agent skill level

Example AI Analysis:

# AI considers:
# - Agent's Python skills match backend task requirements
# - Agent has bandwidth (40% capacity remaining)
# - Task is on critical path for milestone
# - Agent previously succeeded on similar tasks
# → Confidence: 0.85, Recommended: Yes

2. Graceful Fallback System

What makes it special: Marcus continues functioning even when AI is completely unavailable.

Fallback Strategy:

if not self.client:  # AI unavailable
    return self._fallback_method(...)  # Use rules instead

Fallback Methods:

_fallback_task_matching() - Priority + skill-based scoring
_generate_fallback_instructions() - Template-based instructions
_generate_fallback_blocker_analysis() - Rule-based analysis
_generate_fallback_risk_analysis() - Metric-based assessment

Fallback Quality:

Basic but functional - allows project to continue
Deterministic and predictable
No API costs
Instant response (no network latency)

3. Prompt Template System

What makes it special: Maintainable, version-controlled prompts for different analysis types.

Template Categories:

self.prompts = {
    "task_assignment": "...",      # Task-to-agent matching
    "task_instructions": "...",     # Detailed task guidance
    "blocker_analysis": "...",      # Problem resolution
    "project_risk": "...",          # Risk identification
}
# Note: project_health and feature_analysis build prompts inline — not stored in self.prompts

Template Design:

Clear instructions to Claude
Structured JSON output format
Context-specific guidance
Examples where helpful

4. Token Usage Tracking

What makes it special: Automatic cost monitoring for all Claude API calls.

Implementation:

async def _call_claude(self, prompt: str) -> str:
    """Call Claude API with error handling and token tracking."""

    response = self.client.messages.create(
        model=self.model,
        max_tokens=2000,
        temperature=0.7,
        messages=[{"role": "user", "content": prompt}]
    )

    # Token tracking happens automatically via middleware
    # See: ai_usage_middleware.wrap_ai_provider()

    return str(response.content[0].text)

Tracking Benefits:

Cost per project/task/agent
Usage trends over time
Budget alerts
Optimization opportunities

Technical Implementation

Initialization

class AIAnalysisEngine:
    def __init__(self) -> None:
        """Initialize with Anthropic client."""

        # Load API key
        from src.config.marcus_config import get_config
        config = get_config()
        api_key = config.ai.anthropic_api_key or os.getenv("ANTHROPIC_API_KEY")

        # Initialize client (None if no key)
        if api_key:
            self.client = anthropic.Anthropic(api_key=api_key)
        else:
            print("⚠️  No API key - AI features will use fallback mode")
            self.client = None

        # Model selection (config uses attribute access, not .get())
        self.model = getattr(getattr(config, "ai", None), "model", None) or "claude-3-5-sonnet-20241022"

Claude API Integration

async def _call_claude(self, prompt: str) -> str:
    """
    Call Claude API with error handling.

    Parameters
    ----------
    prompt : str
        The prompt to send to Claude

    Returns
    -------
    str
        Claude's response text

    Raises
    ------
    Exception
        If API call fails after retries
    """
    try:
        response = self.client.messages.create(
            model=self.model,
            max_tokens=2000,
            temperature=0.7,
            messages=[{"role": "user", "content": prompt}]
        )
        return str(response.content[0].text)

    except Exception as e:
        print(f"Claude API error: {e}", file=sys.stderr)
        raise

Error Handling

Approach: Fail gracefully to fallback methods rather than crash.

async def match_task_to_agent(self, tasks, agent, project_state):
    """AI task matching with fallback."""

    # Check if AI available
    if not self.client:
        return self._fallback_task_matching(tasks, agent)

    try:
        # Try AI analysis
        response = await self._call_claude(prompt)
        return parse_and_find_task(response)

    except Exception as e:
        # AI failed - use fallback
        print(f"AI analysis failed: {e}", file=sys.stderr)
        return self._fallback_task_matching(tasks, agent)

Context Serialization

Challenge: Convert Marcus domain objects to JSON for Claude.

Solution: Serialization helpers that extract relevant fields.

def serialize(obj: Any) -> Dict[str, Any]:
    """Convert domain object to JSON-serializable dict."""

    if isinstance(obj, Task):
        return {
            "id": obj.id,
            "name": obj.name,
            "description": obj.description,
            # Note: Task has no 'type' or 'skills_required' fields
            "priority": obj.priority.value,
            "estimated_hours": obj.estimated_hours,
            "labels": obj.labels,
        }
    elif isinstance(obj, WorkerStatus):
        return {
            "name": obj.name,
            "skills": obj.skills,
            "capacity": obj.capacity,
            # Note: WorkerStatus has no 'current_load' or 'experience_level' fields
        }
    # ... other types

Undocumented Methods

The following methods exist in src/integrations/ai_analysis_engine.py but are not covered in the sections above:

generate_clarification(task, questions) — generates clarifying questions for ambiguous tasks
analyze_integration_points(tasks) — identifies integration points between tasks
generate_structured_response(prompt, schema) — calls Claude and parses the response against a given JSON schema
initialize() — async initialization hook (sets up client and prompt templates after construction)

Pros and Cons

Advantages

Simple Architecture: Single class, easy to understand and maintain
Direct Integration: No abstraction layers, minimal overhead
Graceful Degradation: Works without AI (fallback mode)
Cost Tracking: Built-in token usage monitoring
Fast Task Assignment: Typically < 1 second with AI, instant with fallback
Flexible Prompts: Easy to update and version control
No Complex Dependencies: Just Anthropic SDK + Marcus core
Battle-Tested: Claude 3.5 Sonnet proven for this use case

Limitations

Single AI Provider: Anthropic/Claude only (no OpenAI, local models, etc.)
No Rule-Based Safety Layer: AI can make any decision (no validation layer)
No Learning/Pattern Storage: Doesn't persist insights across sessions
No Provider Abstraction: Tightly coupled to Anthropic SDK
Limited Context: Max 10 tasks per analysis (context window limits)
No Hybrid Confidence: Simple AI-or-fallback, no weighted combination
Basic Fallback: Rule-based fallbacks are functional but not sophisticated
No Streaming: Waits for complete response (no real-time output)

Trade-offs Made

Chose Simplicity Over Sophistication:

Single provider vs multi-provider abstraction
Direct API calls vs complex framework
Simple fallbacks vs advanced rule engine
Template prompts vs dynamic prompt generation

Reasoning: Get Marcus working with AI quickly, iterate based on real usage.

Design Rationale

Why This Approach Was Chosen

1. Single Class Design

Decision: Implement as one cohesive AIAnalysisEngine class.

Rationale:

Easier to understand and maintain
Faster development iteration
Sufficient for current needs
Can refactor later if needed

Trade-off: Less modular than multi-class architecture, harder to extend certain features.

2. Claude API Direct Integration

Decision: Use Anthropic SDK directly, no abstraction layer.

Rationale:

Claude 3.5 Sonnet excellent for these tasks
Anthropic SDK stable and well-documented
No need for multi-provider support yet
Faster to implement

Trade-off: Vendor lock-in, can't easily switch to OpenAI or local models.

3. Graceful Fallback

Decision: Every AI method has a fallback version.

Rationale:

Marcus must work even if API down or key missing
Fallbacks provide basic functionality
Users can use Marcus without AI budget
Development/testing doesn't require API access

Trade-off: Maintaining two code paths for each feature.

4. Template-Based Prompts

Decision: Store prompts as string templates in code.

Rationale:

Easy to version control
Quick to iterate and improve
No external prompt management needed
Reviewable in code reviews

Trade-off: Less flexible than dynamic prompt construction.

Future Evolution

Planned Enhancements

See docs/source/systems/intelligence/07-ai-intelligence-engine-FUTURE.md for the aspirational multi-component hybrid architecture.

Near-term improvements (Current System):

Streaming Responses: Real-time output for long analyses
Better Fallbacks: More sophisticated rule-based logic
Prompt Versioning: A/B test different prompt styles
Context Expansion: Handle more tasks per analysis

Medium-term improvements: 5. Learning System: Persist AI insights across sessions 6. Multi-Model Support: Use different models for different tasks 7. Confidence Calibration: Better confidence score accuracy 8. Cost Optimization: Smarter decisions about when to use AI

Long-term vision (see aspirational doc):

Multi-provider LLM abstraction
Hybrid rule-based + AI architecture
Contextual learning system
Advanced PRD parsing with dependency inference

Task Complexity Handling

Simple Tasks

Approach: Fast AI analysis or instant fallback.

# For simple tasks:
# - AI response typically < 1 second
# - Fallback response instant
# - Simple prompt, concise analysis

Example: "Implement login button"

AI quickly identifies: frontend task, junior-friendly, 2 hours
Fallback: Check labels, match to frontend dev, assign

Complex Tasks

Approach: More detailed AI analysis with richer context.

# For complex tasks:
# - Include project history
# - Reference similar past tasks
# - Detailed prompt with examples
# - Longer max_tokens for thorough analysis

Example: "Design and implement payment system with Stripe"

AI analyzes: security implications, integration complexity, testing needs
Considers: agent experience level, needs senior dev, estimates 40 hours
Fallback: Priority-based assignment, generic instructions

Board-Specific Considerations

Kanban Integration

Data Sources:

Task metadata from Kanban cards
Agent availability from board state
Project progress from board metrics

Adaptations:

Different task types → different analysis depth
Board structure affects priority scoring
Team size impacts assignment strategy

Board Quality Impact

High-Quality Boards (clear descriptions, good labels):

AI analysis more accurate
Better task-agent matching
Richer instructions possible

Low-Quality Boards (sparse descriptions, poor labels):

AI does best with limited data
Fallback methods work same regardless
Recommendations to improve board quality

Cato Integration

Current Integration

Marcus Role: Provides AI analysis capabilities via MCP tools.

Cato Role: Consumes analysis results for visualization and UI.

Data Exchange: JSON-formatted analysis results via MCP protocol.

API Patterns

# Marcus MCP tool calls AI Engine
result = await ai_engine.analyze_blocker(...)

# Returns structured data
{
    "root_cause": "...",
    "resolution_steps": [...],
    "estimated_hours": 2.0
}

# Cato displays in UI

Shared Context

Project state
Team information
Historical data
Analysis results

This documentation represents the current state of the AI Intelligence Engine as implemented in src/integrations/ai_analysis_engine.py. For the aspirational multi-component hybrid architecture, see 07-ai-intelligence-engine-FUTURE.md.

FilesExpand file tree

07-ai-intelligence-engine.md

Latest commit

History

07-ai-intelligence-engine.md

File metadata and controls

AI Intelligence Engine System

Table of Contents

System Overview

Core Principle

Key Capabilities

Architecture

Single-Class Design

Core Class Structure

Marcus Ecosystem Integration

Position in Marcus Architecture

Integration Points

Data Flow

Workflow Integration

1. Project Creation (create_project)

2. Task Assignment (request_next_task)

3. Task Instructions (generate_task_instructions)

4. Blocker Analysis (report_blocker)

5. Project Risk Analysis (analyze_project_risks)

Core Features

1. AI-Enhanced Decision Making

2. Graceful Fallback System

3. Prompt Template System

4. Token Usage Tracking

Technical Implementation

Initialization

Claude API Integration

Error Handling

Context Serialization

Undocumented Methods

Pros and Cons

Advantages

Limitations

Trade-offs Made

Design Rationale

Why This Approach Was Chosen

1. Single Class Design

2. Claude API Direct Integration

3. Graceful Fallback

4. Template-Based Prompts

Future Evolution

Planned Enhancements

Task Complexity Handling

Simple Tasks

Complex Tasks

Board-Specific Considerations

Kanban Integration

Board Quality Impact

Cato Integration

Current Integration

API Patterns

Shared Context

1. Project Creation (`create_project`)

2. Task Assignment (`request_next_task`)

3. Task Instructions (`generate_task_instructions`)

4. Blocker Analysis (`report_blocker`)

5. Project Risk Analysis (`analyze_project_risks`)