Lessons Learned

Overview

This document captures the lessons learned, challenges, and trade-offs encountered during the development of the Task Agent project - an AI-powered task management system built with Microsoft Agent Framework and Clean Architecture.

Project Architecture

Clean Architecture (4 Layers)

The project implements Clean Architecture with strict dependency flow:

┌───────────────────────────────────────────────────────────────┐
│                      Presentation Layer                       │
│                    (TaskAgent.WebApi)                         │
│  • REST API Controllers                                       │
│  • SSE Streaming Services                                     │
│  • Configuration Validation                                   │
│  • DI Registration                                            │
└───────────────────────────────────────────────────────────────┘
                            │ depends on
                            ▼
┌───────────────────────────────────────────────────────────────┐
│                    Infrastructure Layer                       │
│                 (TaskAgent.Infrastructure)                    │
│  • Database Contexts (SQL Server + PostgreSQL)                │
│  • Repositories (TaskRepository)                              │
│  • External Services (AgentStreamingService)                  │
│  • Thread Persistence (PostgresThreadPersistenceService)      │
└───────────────────────────────────────────────────────────────┘
                            │ depends on
                            ▼
┌───────────────────────────────────────────────────────────────┐
│                     Application Layer                         │
│                  (TaskAgent.Application)                      │
│  • DTOs (Request/Response models)                             │
│  • Interfaces (ITaskRepository, IThreadPersistenceService)    │
│  • Function Tools (6 AI agent functions)                      │
│  • Telemetry (AgentMetrics, AgentActivitySource)              │
└───────────────────────────────────────────────────────────────┘
                            │ depends on
                            ▼
┌───────────────────────────────────────────────────────────────┐
│                       Domain Layer                            │
│                    (TaskAgent.Domain)                         │
│  • Entities (TaskItem, ConversationThread)                    │
│  • Enums (TaskStatus, TaskPriority)                           │
│  • Business Rules & Validation                                │
│  • NO external dependencies                                   │
└───────────────────────────────────────────────────────────────┘

Packaging: Monolithic deployment (single deployable unit), but with Clean Architecture separation for maintainability and testability.

Content Safety Migration

Background

The project originally implemented a custom 2-layer defense architecture using Azure.AI.ContentSafety SDK:

Layer 1: Azure Prompt Shield (REST API) - Custom REST calls to /contentsafety/text:shieldPrompt
Layer 2: Azure Content Safety SDK - Azure.AI.ContentSafety NuGet package for content moderation

This was in addition to Azure OpenAI's built-in content filtering at the model level.

Migration Decision

Decision: Remove custom Azure.AI.ContentSafety implementation and rely solely on Azure OpenAI's built-in content filtering.

Rationale: Azure OpenAI's built-in filter already provides comprehensive protection:

Hate speech detection
Violence detection
Sexual content detection
Self-harm detection
Prompt injection attacks (Jailbreak detection)

The custom implementation added:

Maintenance overhead
Additional Azure resource costs
Configuration complexity
Redundant validation (same content checked twice)

Key Challenges in Content Safety Migration

1. Understanding Azure OpenAI's Built-in Content Filtering

Challenge: Initially unclear whether Azure OpenAI's built-in content filtering was sufficient or if the separate Content Safety SDK provided additional capabilities.

Resolution: Research via Microsoft Learn documentation confirmed:

"Azure OpenAI Service includes a content filtering system that works alongside core models. This system detects and takes action on specific categories of potentially harmful content in both input prompts and output completions."

Key Insight: The separate Azure.AI.ContentSafety SDK is designed for scenarios where you need:

Content moderation outside of Azure OpenAI
Custom category detection
Image/multimodal content analysis
Fine-grained threshold control beyond Azure OpenAI's settings

2. Error Response Format Differences

Challenge: The custom middleware returned structured error responses (ContentSafetyResult DTOs), while Azure OpenAI returns HTTP 400 with code: "content_filter".

Resolution: Created a new SSE event type (CONTENT_FILTER) to handle Azure OpenAI's error format gracefully:

// Infrastructure/Services/AgentStreamingService.cs
catch (ClientResultException ex) when (IsContentFilterError(ex))
{
    _contentFilterException = new ContentFilterException(ex.Message);
    yield break;
}

private static bool IsContentFilterError(ClientResultException ex)
{
    return ex.Status == 400 && 
           ex.Message.Contains("content_filter", StringComparison.OrdinalIgnoreCase);
}

3. UX Continuity for Blocked Messages

Challenge: The original implementation showed blocked messages as error toasts, breaking the conversation flow.

Resolution: Implemented ChatGPT-like UX where blocked messages appear as assistant responses:

// Frontend: lib/api/chat-service.ts
if (event.type === "CONTENT_FILTER") {
  fullMessage = event.message || "I'm unable to assist with that request...";
  onTextChunk(fullMessage); // Display in chat, not as error
}

Trade-off: This approach treats content filter blocks as "successful" responses from the AI (just with a refusal message), which maintains conversation continuity but may mask the distinction between intentional refusals and policy blocks.

Clean Architecture Lessons

Trade-offs Analysis

Benefits of Removing Custom Content Safety

Benefit	Impact
Reduced complexity	Removed ~500 lines of middleware, service, and DTO code
Fewer dependencies	Removed `Azure.AI.ContentSafety` NuGet package
Simplified configuration	No separate Content Safety endpoint/API key required
Lower cost	No additional Azure Content Safety resource charges
Reduced latency	No pre-validation overhead before sending to OpenAI

Trade-offs

Trade-off	Mitigation
Less granular control	Azure OpenAI portal allows threshold configuration per deployment
No custom categories	For task management, built-in categories are sufficient
Can't moderate non-OpenAI content	Not needed in this architecture (all chat goes through OpenAI)
Error messages less detailed	User-friendly messages are preferred anyway (security best practice)

Clean Architecture Challenges

1. Layer Dependency Discipline

Challenge: Maintaining strict dependency flow in a growing codebase. Easy to accidentally reference Infrastructure from Application layer.

Solution:

Project references enforce layer boundaries at compile time
Each layer has its own {Layer}ServiceExtensions.cs for DI registration
SonarAnalyzer enforces code quality rules

// Application layer defines interfaces
public interface ITaskRepository { ... }

// Infrastructure layer implements them
public class TaskRepository : ITaskRepository { ... }

2. Scoped Services in Singleton Contexts

Challenge: The AI Agent is registered as singleton, but needs access to scoped services like DbContext.

Solution: Inject IServiceProvider and create scopes per-operation:

// Application/Functions/TaskFunctions.cs
public class TaskFunctions
{
    private readonly IServiceProvider _serviceProvider;
    
    public async Task<string> CreateTaskAsync(string title, string description)
    {
        using IServiceScope scope = _serviceProvider.CreateScope();
        var repository = scope.ServiceProvider.GetRequiredService<ITaskRepository>();
        // Use repository with fresh DbContext per call
    }
}

3. Domain Layer Purity

Challenge: Keeping Domain layer free of external dependencies while still having rich validation.

Solution:

Factory methods encapsulate validation logic
Private setters enforce invariants
Constants for magic numbers/strings

// Domain/Entities/TaskItem.cs
public class TaskItem
{
    public string Title { get; private set; }
    
    private TaskItem() { } // EF Core only
    
    public static TaskItem Create(string title, string description, TaskPriority priority)
    {
        if (string.IsNullOrWhiteSpace(title))
            throw new ArgumentException(ValidationMessages.TITLE_REQUIRED);
            
        if (title.Length > TaskConstants.MAX_TITLE_LENGTH)
            throw new ArgumentException(ValidationMessages.TITLE_TOO_LONG);
            
        return new TaskItem { Title = title, /* ... */ };
    }
}

Dual Database Architecture Lessons

Challenge: SQL Server + PostgreSQL Coexistence

Background: The project uses two databases:

SQL Server - Task entities (structured CRUD operations)
PostgreSQL - Conversation threads (JSON blob storage)

Key Lessons

1. Separate DbContexts for Each Database

// Infrastructure/Data/TaskDbContext.cs - SQL Server
public class TaskDbContext : DbContext
{
    public DbSet<TaskItem> Tasks { get; set; }
}

// Infrastructure/Data/ConversationDbContext.cs - PostgreSQL
public class ConversationDbContext : DbContext
{
    public DbSet<ConversationThreadMetadata> Conversations { get; set; }
}

2. PostgreSQL `json` vs `jsonb` Type

Challenge: jsonb reorders properties alphabetically, breaking polymorphic deserialization that requires $type as the first property.

Solution: Use json type (preserves order) instead of jsonb:

// Entity configuration
entity.Property(e => e.SerializedThread)
    .HasColumnType("json"); // NOT jsonb!

3. Thread Serialization Preservation

Challenge: JsonSerializer.Serialize() reorders properties, breaking deserialization.

Solution: Use GetRawText() to preserve exact JSON structure:

// ✅ CORRECT - Preserves structure
JsonElement threadJson = thread.Serialize();
string serialized = threadJson.GetRawText();

// ❌ WRONG - Reorders properties
string serialized = JsonSerializer.Serialize(threadJson);

4. Two Serialization Formats for Conversation State

Challenge: The project ended up with two different formats for serializedState:

Full AgentThread JSON - From normal streaming flow (complex object with chat history)
Simple ThreadDbKey GUID - From conversation list API (stored in PostgreSQL metadata)

When loading a conversation from sidebar and sending a new message, the backend received a GUID but expected full JSON, causing it to create a new conversation.

Solution: The AgentStreamingService.DeserializeThread() now detects both formats:

// Check if string (ThreadDbKey) vs object (AgentThread JSON)
if (stateElement.ValueKind == JsonValueKind.String)
{
    // Simple GUID - load history from database
    _pendingThreadId = stateElement.GetString();
    return _agent.GetNewThread();
}
// Full JSON - deserialize normally
return _agent.DeserializeThread(stateElement);

Frontend Fix: Use serializedState from API response, not threadId:

// ✅ CORRECT
setSerializedState(response.serializedState ?? null);

// ❌ WRONG - Used threadId directly
setSerializedState(threadId);

Best Practices Identified

1. Leverage Platform Capabilities First

Lesson: Before implementing custom solutions, investigate what the platform provides out-of-the-box.

Azure OpenAI's built-in content filtering was sufficient. The custom implementation added complexity without significant benefit.

2. SSE Events for Graceful Error Handling

Lesson: For streaming protocols, use dedicated event types for different error categories rather than breaking the stream.

The CONTENT_FILTER SSE event pattern allows:

Clear distinction between network errors and policy blocks
Conversation continuity (thread state still sent)
Frontend can handle each case appropriately

3. Pin Preview Package Versions

Lesson: Preview NuGet packages can change without documentation. Always pin exact versions.

<!-- Directory.Packages.props -->
<PackageVersion Include="Microsoft.Agents.AI.OpenAI" Version="1.0.0-preview.251125.1" />

The Microsoft Agent Framework is in preview. Auto-updates can break your build without warning. Pin versions and test thoroughly before upgrading.

4. Dual Serialization Format Handling

Lesson: When integrating multiple persistence systems, handle format differences gracefully at deserialization time.

Problem: The project has two conversation persistence mechanisms:

AG-UI /agui endpoint: Uses PostgresChatMessageStore with simple ThreadDbKey (GUID string)
Custom /api/agent/chat endpoint: Uses AgentStreamingService with full AgentThread JSON

When loading a conversation from the sidebar, the frontend receives a simple ThreadDbKey, but AgentStreamingService.DeserializeThread() expected full AgentThread JSON.

Solution: Detect format and handle both cases:

// Infrastructure/Services/AgentStreamingService.cs
public object DeserializeThread(string? serializedState)
{
    JsonElement stateElement = JsonSerializer.Deserialize<JsonElement>(serializedState);
    
    // Check if it's a simple ThreadDbKey string (from loadConversation)
    if (stateElement.ValueKind == JsonValueKind.String)
    {
        _pendingThreadId = stateElement.GetString();
        return _agent.GetNewThread(); // Load history separately
    }
    
    // Full AgentThread JSON - deserialize normally
    return _agent.DeserializeThread(stateElement);
}

Key Insight: When the serializedState is a simple GUID, store it and load conversation history from PostgreSQL in StreamResponseAsync():

if (!string.IsNullOrEmpty(_pendingThreadId))
{
    List<ChatMessage> historyMessages = await LoadMessagesFromDatabaseAsync(_pendingThreadId);
    messageList = historyMessages.Concat(messageList).ToList();
}

5. Security Through Obscurity (Appropriate Here)

Lesson: For content safety blocks, generic user-facing messages are preferred over detailed error information.

const CONTENT_FILTER_MESSAGE = 
  "I'm unable to assist with that request as it may violate content policies. " +
  "Please try rephrasing your message.";

This prevents:

Attackers from learning filter thresholds
Users from crafting bypass attempts
Exposure of internal error details

5. Documentation-Driven Development

Lesson: Update documentation as part of the code change, not after.

The Content Safety migration touched 5 documentation files. Keeping docs in sync prevents:

Developers following outdated patterns
Configuration confusion
Support burden from incorrect setup instructions

6. Central Package Management (CPM)

Lesson: Use Directory.Packages.props for consistent dependency versions across projects.

<!-- Directory.Packages.props - Single source of truth -->
<PackageVersion Include="Microsoft.EntityFrameworkCore" Version="10.0.0" />

<!-- Individual .csproj - No version needed -->
<PackageReference Include="Microsoft.EntityFrameworkCore" />

Benefits:

No version mismatches between projects
Easy upgrades (change one file)
Clear audit trail of all dependencies

7. Fail-Fast Database Strategy

Lesson: For applications with critical database dependencies, fail fast on startup.

// Both databases MUST be available
await app.ApplyDatabaseMigrationsAsync(); // Throws if either fails

Why: This application cannot function without both SQL Server (tasks) and PostgreSQL (conversations). Silent degradation would cause confusing errors later.

Technology-Specific Lessons

AG-UI Protocol Integration

Challenge: Integrating Microsoft's AG-UI protocol with existing REST API architecture.

Lessons:

Single endpoint mapping - app.MapAGUI("/agui", agent) handles everything
Message store factory pattern - Pass factory function for per-thread persistence
Tool injection via closure - Capture IServiceProvider for scoped dependencies

.NET Aspire Orchestration

Challenge: Managing multi-project debugging with databases.

Lessons:

AppHost at root level - Keep orchestrator separate from backend solution
ServiceDefaults for shared config - Telemetry, health checks, resilience
Dashboard URL - https://localhost:17198 for OTLP visualization

Next.js Frontend Patterns

Challenge: TypeScript strict mode with API responses.

Lessons:

Type guards for API responses - Never trust as type assertions
Error boundaries per route - error.tsx for graceful failures
pnpm enforcement - Lock file incompatibility with npm/yarn

Future Considerations

When to Re-Add Custom Content Safety

Consider adding Azure.AI.ContentSafety back if:

Multi-modal content - Need to analyze images or audio
Non-OpenAI models - Using models without built-in filtering
Custom categories - Need domain-specific content detection
Regulatory requirements - Need audit logs of all safety checks
Pre-processing validation - Want to reject before sending to OpenAI (cost savings)

Monitoring Recommendations

With the migration complete, monitor:

Azure OpenAI metrics - Track content filter trigger rate in Azure portal
Frontend analytics - Count CONTENT_FILTER events received
User feedback - Watch for complaints about over-blocking

CI/CD Pipeline Lessons

GitHub Actions Architecture

1. Node.js Runtime for Actions (Not for Your Code)

Challenge: Confusion about why GitHub Actions like actions/checkout require Node.js updates when your project is .NET.

Key Insight: GitHub Actions are written in JavaScript/TypeScript and run on Node.js - this is the Actions runtime, not your application runtime.

┌────────────────────────────────────────────────────────┐
│              GitHub Actions Runner (ubuntu-latest)     │
├────────────────────────────────────────────────────────┤
│  Node.js Runtime (for executing Actions themselves)    │
│  ┌─────────────────┐  ┌─────────────────┐              │
│  │ checkout@v5     │  │ setup-dotnet@v4 │              │
│  │ (JavaScript)    │  │ (JavaScript)    │              │
│  └─────────────────┘  └─────────────────┘              │
├────────────────────────────────────────────────────────┤
│  .NET SDK 10.0.x (for YOUR code)                       │
│  ┌─────────────────────────────────────┐               │
│  │ dotnet build, dotnet test, etc.     │               │
│  └─────────────────────────────────────┘               │
└────────────────────────────────────────────────────────┘

Lesson: Keep Actions updated (v4→v5) to avoid Node.js deprecation warnings, even though your .NET code doesn't use Node.js.

2. dotnet restore Requires Project/Solution Files

Challenge: dotnet restore "directory/" fails with MSB1003: Specify a project or solution file.

Root Cause: The dotnet restore command does NOT accept directories - it requires explicit .sln or .csproj files.

# ❌ WRONG - Directory path
- name: Restore
  run: dotnet restore "src/backend/services/TaskAgent/tests"

# ✅ CORRECT - Solution file path
- name: Restore
  run: dotnet restore "src/backend/TaskAgentWeb.sln"

Lesson: Always point to the solution file for restore/build operations. Define SOLUTION_PATH environment variable for consistency.

Testcontainers in GitHub Actions

3. Docker Availability on Runners

Challenge: Will Testcontainers work in CI? Does GitHub Actions have Docker?

Key Insight: Ubuntu runners (ubuntu-latest) come with Docker Engine preinstalled. Testcontainers works out-of-the-box.

Runner	Docker Available	Testcontainers Support
`ubuntu-latest`	✅ Preinstalled	✅ Full support
`windows-latest`	❌ Not available	❌ Won't work
`macos-latest`	⚠️ Requires setup	⚠️ Limited

First Run Consideration: Initial workflow execution may be slower due to Docker image pulls:

SQL Server image: ~1.5GB
PostgreSQL image: ~80MB

Lesson: Always use ubuntu-latest for integration tests with Testcontainers. Set adequate timeout-minutes (30+) for first runs.

Code Coverage Aggregation

4. Combining Multiple Coverage Reports

Challenge: How to show unified coverage metrics when tests are split across Domain, Application, and Infrastructure projects?

Solution: Use ReportGenerator to merge Cobertura XML files:

- name: Generate Combined Coverage Report
  run: |
    dotnet tool install -g dotnet-reportgenerator-globaltool
    reportgenerator \
      -reports:"./TestResults/**/coverage.cobertura.xml" \
      -targetdir:"./TestResults/CoverageReport" \
      -reporttypes:"Html;JsonSummary;Cobertura"

Output Formats:

Html - Interactive report for download
JsonSummary - Parseable for GitHub Job Summary
Cobertura - Combined XML for external tools

Reading JSON Summary in Bash:

LINE_COV=$(cat ./TestResults/CoverageReport/Summary.json | jq -r '.summary.linecoverage // 0')
echo "| Lines | ${LINE_COV}% |" >> $GITHUB_STEP_SUMMARY

Lesson: Use jq (preinstalled on Ubuntu runners) to parse JSON and display metrics in GitHub Job Summary tables.

Action Version Management

5. Keeping Actions Updated

Challenge: How to know when GitHub Actions need updates?

Resolution: Use Context7 MCP or check official GitHub documentation for latest versions.

Action	Purpose	Current Stable
`actions/checkout`	Clone repository	v5
`actions/setup-dotnet`	Install .NET SDK	v4
`actions/upload-artifact`	Upload build artifacts	v4
`actions/download-artifact`	Download artifacts	v5
`azure/login`	Azure authentication	v2
`azure/webapps-deploy`	Deploy to App Service	v2

Lesson: Check action versions periodically. Major version bumps (v4→v5) often include important changes like Node.js runtime updates.

Frontend Testing Lessons

React 19 Compatibility

1. Testing Library Version Requirements

Challenge: Tests failed with cryptic errors after upgrading to React 19.

Root Cause: @testing-library/react versions below 16 are incompatible with React 19's new architecture.

// ❌ WRONG - Will fail with React 19
"@testing-library/react": "^14.0.0"

// ✅ CORRECT - Required for React 19
"@testing-library/react": "^16.3.0"

Lesson: When using React 19, always use @testing-library/react v16+. The compatibility matrix is not always clearly documented.

2. Testing forwardRef Components

Challenge: Testing a component that uses forwardRef for external focus control. TypeScript error: "Type 'RefObject' is not assignable to type 'Ref'".

Symptom:

// ❌ FAILS - ref typing mismatch
const ref = { current: null };
render(<ChatInput ref={ref} {...props} />);

Root Cause: When passing refs to forwardRef components in tests, you need a properly typed React ref from useRef, not a plain object.

Solution: Create a wrapper component that uses useRef:

// ✅ CORRECT - Wrapper with useRef
function TestWrapper({ onRef }: { onRef: (ref: HTMLTextAreaElement | null) => void }) {
  const ref = useRef<HTMLTextAreaElement>(null);

  useEffect(() => {
    onRef(ref.current);
  }, [onRef]);

  return <ChatInput ref={ref} {...defaultProps} />;
}

it('should be focusable via ref', () => {
  let textareaRef: HTMLTextAreaElement | null = null;
  render(<TestWrapper onRef={(ref) => { textareaRef = ref; }} />);
  
  expect(textareaRef).toBeInstanceOf(HTMLTextAreaElement);
  textareaRef?.focus();
  expect(textareaRef).toHaveFocus();
});

Lesson: For testing forwardRef components, use a wrapper component with useRef hook rather than plain object refs. This ensures proper TypeScript typing and React behavior.

GitHub Actions for Frontend CI/CD

3. pnpm Setup Order Matters

Challenge: Workflow failed with cache errors when using cache: 'pnpm' in setup-node.

Root Cause: actions/setup-node requires pnpm to be installed BEFORE it runs to detect the cache directory.

# ❌ WRONG - Cache won't work
- uses: actions/setup-node@v4
  with:
    cache: 'pnpm'
- uses: pnpm/action-setup@v4

# ✅ CORRECT - pnpm must be installed first
- uses: pnpm/action-setup@v4
  with:
    version: 9
- uses: actions/setup-node@v4
  with:
    cache: 'pnpm'
    cache-dependency-path: src/frontend/task-agent-web/pnpm-lock.yaml

Lesson: Always install pnpm (pnpm/action-setup) BEFORE setup-node when using pnpm caching. Also specify cache-dependency-path for monorepo structures.

4. GitHub Actions Has No Built-in Test Report Viewer

Challenge: Expected GitHub Actions to display test results like Azure DevOps does with its Test tab.

Key Insight: Unlike Azure DevOps, GitHub Actions does NOT have a built-in test report viewer. Test results must be handled differently.

Feature	Azure DevOps	GitHub Actions
Built-in test report viewer	✅ Yes (Test tab)	❌ No
Test annotations in code	✅ Yes	✅ Yes (with reporter)
Coverage visualization	✅ Yes (Coverage tab)	❌ No (artifacts only)
Job Summary markdown	❌ No	✅ Yes (`GITHUB_STEP_SUMMARY`)

Solution: Combine multiple techniques:

Job Summary (GITHUB_STEP_SUMMARY) - Display metrics directly in workflow summary
GitHub Reporter - Generate code annotations for failures
Artifacts - Upload HTML reports for detailed viewing

# Job Summary with coverage table
- name: Generate Unit Test Summary
  run: |
    echo "## 🧪 Unit Test Results" >> $GITHUB_STEP_SUMMARY
    echo "| Metric | Coverage |" >> $GITHUB_STEP_SUMMARY
    echo "|--------|----------|" >> $GITHUB_STEP_SUMMARY
    echo "| Lines | $(cat coverage/coverage-summary.json | jq -r '.total.lines.pct')% |" >> $GITHUB_STEP_SUMMARY

Lesson: GitHub Actions requires explicit configuration for test visualization. Use Job Summaries for quick metrics and artifacts for detailed reports.

5. Playwright GitHub Reporter for Code Annotations

Challenge: Want test failures to appear as annotations directly in PR code, like Azure DevOps.

Solution: Configure Playwright to use the github reporter in CI:

// playwright.config.ts
reporter: process.env.CI
  ? [['github'], ['html', { open: 'never' }], ['list']]
  : [['html', { open: 'never' }], ['list']],

Result: When E2E tests fail in CI, GitHub shows annotations directly on the affected code lines in PRs.

Lesson: The github reporter is specifically designed for GitHub Actions - it uses workflow commands to create annotations.

6. Vitest Coverage JSON Summary

Challenge: Wanted to display coverage metrics in GitHub Job Summary, but only had HTML reports.

Root Cause: Vitest's json reporter creates coverage-final.json (detailed per-file), but Job Summary scripts need aggregated totals from coverage-summary.json.

// ❌ INCOMPLETE - No summary file
reporter: ['text', 'json', 'html']

// ✅ COMPLETE - Includes summary for CI
reporter: ['text', 'json', 'json-summary', 'html']

Generated files:

json → coverage-final.json (detailed, per-file)
json-summary → coverage-summary.json (aggregated totals)

Lesson: Always include json-summary reporter when you need to extract coverage metrics programmatically (CI/CD, badges, etc.).

SSE Status Events Architecture Lessons

Background: Real-Time Status Updates

Challenge: The initial implementation used hardcoded status message constants (TaskFunctionStatusMessages.cs) mapped to function names. This worked but had a critical scalability limitation.

// ❌ ORIGINAL - Hardcoded constants (not scalable)
public static class TaskFunctionStatusMessages
{
    public static readonly Dictionary<string, string> StatusMessages = new()
    {
        { "CreateTaskAsync", "Creating task..." },
        { "ListTasksAsync", "Retrieving tasks..." },
        { "GetTaskByIdAsync", "Looking up task..." },
        // Must add new entries for EVERY function in EVERY agent
    };
}

Problem: In a multi-agent system, each new agent would require:

A new constants file with all its function mappings
Updates to SseStreamingService to handle each agent type
Duplicate maintenance when function descriptions change

Solution: Dynamic Status from `[Description]` Attributes

Hybrid Approach: Combine AG-UI standard lifecycle events with dynamic message extraction.

1. AG-UI Lifecycle Events (Standard Protocol)

Following the AG-UI protocol, implemented three event types:

Event	Purpose	When Sent
`STEP_STARTED`	Tool execution begins	Before function invocation
`STATUS_UPDATE`	Human-readable progress	During function execution
`STEP_FINISHED`	Tool execution completes	After function returns

// WebApi/Services/SseStreamingService.cs
await WriteEventAsync("STEP_STARTED", new { 
    stepName = functionName 
});

await WriteEventAsync("STATUS_UPDATE", new { 
    status = _functionDescriptionProvider.GetStatusMessage(functionName)
});

// ... function executes ...

await WriteEventAsync("STEP_FINISHED", new { 
    stepName = functionName 
});

2. FunctionDescriptionProvider (Dynamic Extraction)

Created a service that extracts [Description] attributes via reflection and converts them to gerund form:

// WebApi/Services/FunctionDescriptionProvider.cs
public class FunctionDescriptionProvider
{
    private readonly ConcurrentDictionary<string, string> _statusMessages = new();

    public void RegisterFunctionType(Type functionType)
    {
        foreach (var method in functionType.GetMethods())
        {
            var descAttr = method.GetCustomAttribute<DescriptionAttribute>();
            if (descAttr != null)
            {
                string status = ConvertToGerund(descAttr.Description);
                _statusMessages.TryAdd(method.Name, status);
            }
        }
    }

    private static string ConvertToGerund(string description)
    {
        // "Creates a new task" → "Creating a new task..."
        // "Lists all tasks" → "Listing all tasks..."
        // "Gets a task by ID" → "Getting a task by ID..."
    }
}

Key Challenges Encountered

1. Gerund Conversion Edge Cases

Challenge: English verb conjugation has many irregular patterns.

Examples handled:

"Creates" → "Creating" (drop 's', add 'ing')
"Deletes" → "Deleting" (drop 'es', add 'ing')
"Gets" → "Getting" (double consonant)
"Lists" → "Listing" (standard)

Solution: Pattern-based conversion with special cases:

private static string ConvertToGerund(string description)
{
    var match = Regex.Match(description, @"^(\w+)(.*)$");
    string verb = match.Groups[1].Value;
    string rest = match.Groups[2].Value;

    string gerund = verb.ToLower() switch
    {
        "gets" => "Getting",
        "creates" => "Creating",
        "updates" => "Updating",
        "deletes" => "Deleting",
        "lists" => "Listing",
        "marks" => "Marking",
        _ => verb.EndsWith("es") 
            ? verb[..^2] + "ing"  // "deletes" → "delet" + "ing"
            : verb.EndsWith("s") 
                ? verb[..^1] + "ing"  // "creates" → "creat" + "ing"
                : verb + "ing"
    };

    return $"{gerund}{rest}...";
}

2. Singleton vs Scoped Service Lifetime

Challenge: FunctionDescriptionProvider needed to be accessed from the singleton SseStreamingService, but also needed to be registered at startup.

Solution: Register as singleton and initialize during DI configuration:

// WebApi/Extensions/AgentServiceExtensions.cs
services.AddSingleton<FunctionDescriptionProvider>(sp =>
{
    var provider = new FunctionDescriptionProvider();
    provider.RegisterFunctionType(typeof(TaskFunctions));
    // Future: provider.RegisterFunctionType(typeof(CalendarFunctions));
    return provider;
});

Benefit: Registration happens once at startup; lookups are O(1) from ConcurrentDictionary.

3. Pairing STEP_STARTED with STEP_FINISHED

Challenge: Needed to track which function started so we could send the correct STEP_FINISHED event.

Solution: Track active step name in service state:

private string? _activeStepName;

// On function start
_activeStepName = functionName;
await WriteEventAsync("STEP_STARTED", new { stepName = functionName });

// On function complete
if (_activeStepName != null)
{
    await WriteEventAsync("STEP_FINISHED", new { stepName = _activeStepName });
    _activeStepName = null;
}

4. Frontend Event Handling

Challenge: Frontend needed to handle the new event structure without breaking existing functionality.

Solution: Update chat-service.ts to process all three event types:

// lib/api/chat-service.ts
case 'STEP_STARTED':
  onStepStarted?.(parsed.stepName);
  break;

case 'STATUS_UPDATE':
  onStatusUpdate?.(parsed.status);
  break;

case 'STEP_FINISHED':
  onStepFinished?.(parsed.stepName);
  break;

Benefits of Final Architecture

Aspect	Before (Hardcoded)	After (Dynamic)
New agent support	Requires new constants file	Just register `FunctionType`
Description changes	Update 2 places (attribute + constant)	Update 1 place (attribute only)
Type safety	String-based dictionary	Reflection-based, compile-time attributes
Maintenance	O(n) per agent	O(1) - single registration call
Protocol compliance	Custom events only	AG-UI standard lifecycle events

Testing Considerations

Unit Tests Added:

FunctionDescriptionProvider gerund conversion for all verb patterns
Status message retrieval for registered functions
Fallback behavior for unregistered functions

Integration Test Update:

SSE event mocks updated to include STEP_STARTED/STEP_FINISHED wrapper events

// Frontend test mock
const mockSSEResponse = `
event: STEP_STARTED
data: {"stepName":"ListTasksAsync"}

event: STATUS_UPDATE
data: {"status":"Listing all tasks..."}

event: STEP_FINISHED
data: {"stepName":"ListTasksAsync"}
`;

Lesson Learned

Key Insight: When designing status/progress systems, leverage existing metadata (like [Description] attributes) rather than creating parallel data structures. This follows DRY principle and ensures consistency.

The [Description] attribute was already required for the AI agent to understand function purposes. Reusing it for user-facing status messages:

Eliminates duplicate maintenance
Ensures AI understanding and user messaging stay in sync
Scales automatically to new agents/functions

Conclusion

This document captures key learnings from building TaskAgent-AgenticAI:

Architecture Decisions:

4-layer Clean Architecture provides excellent separation of concerns
Monolithic deployment simplifies operations while maintaining code quality
Dual-database strategy (SQL Server + PostgreSQL) works well for different data patterns

Integration Lessons:

Azure OpenAI's built-in content filtering is production-ready
Preview packages require careful version management
Platform capabilities should be leveraged before custom implementations

Code Quality:

Central Package Management prevents dependency conflicts
Factory methods in Domain layer enforce invariants
SSE events enable graceful error handling in streaming protocols

CI/CD Lessons:

GitHub Actions run on Node.js runtime (separate from your application runtime)
dotnet restore requires explicit .sln or .csproj files, not directories
Ubuntu runners include Docker preinstalled (Testcontainers works out-of-the-box)
ReportGenerator combines multiple coverage reports into unified metrics
Keep GitHub Actions updated to avoid Node.js deprecation warnings

Frontend Testing Lessons:

React 19 requires @testing-library/react v16+ (not documented clearly)
Testing forwardRef components requires wrapper with useRef hook
pnpm must be installed BEFORE setup-node for cache to work
GitHub Actions has no built-in test viewer (use Job Summaries + artifacts)
Playwright github reporter creates code annotations in PRs
Vitest json-summary reporter needed for CI coverage metrics

FilesExpand file tree

LESSONS_LEARNED.md

Latest commit

History

LESSONS_LEARNED.md

File metadata and controls

Lessons Learned

Overview

Project Architecture

Clean Architecture (4 Layers)

Content Safety Migration

Background

Migration Decision

Key Challenges in Content Safety Migration

1. Understanding Azure OpenAI's Built-in Content Filtering

2. Error Response Format Differences

3. UX Continuity for Blocked Messages

Clean Architecture Lessons

Trade-offs Analysis

Benefits of Removing Custom Content Safety

Trade-offs

Clean Architecture Challenges

1. Layer Dependency Discipline

2. Scoped Services in Singleton Contexts

3. Domain Layer Purity

Dual Database Architecture Lessons

Challenge: SQL Server + PostgreSQL Coexistence

Key Lessons

1. Separate DbContexts for Each Database

2. PostgreSQL json vs jsonb Type

3. Thread Serialization Preservation

4. Two Serialization Formats for Conversation State

Best Practices Identified

1. Leverage Platform Capabilities First

2. SSE Events for Graceful Error Handling

3. Pin Preview Package Versions

4. Dual Serialization Format Handling

5. Security Through Obscurity (Appropriate Here)

5. Documentation-Driven Development

6. Central Package Management (CPM)

7. Fail-Fast Database Strategy

Technology-Specific Lessons

AG-UI Protocol Integration

.NET Aspire Orchestration

Next.js Frontend Patterns

Future Considerations

When to Re-Add Custom Content Safety

Monitoring Recommendations

CI/CD Pipeline Lessons

GitHub Actions Architecture

1. Node.js Runtime for Actions (Not for Your Code)

2. dotnet restore Requires Project/Solution Files

Testcontainers in GitHub Actions

3. Docker Availability on Runners

Code Coverage Aggregation

4. Combining Multiple Coverage Reports

Action Version Management

5. Keeping Actions Updated

Frontend Testing Lessons

React 19 Compatibility

1. Testing Library Version Requirements

2. Testing forwardRef Components

GitHub Actions for Frontend CI/CD

3. pnpm Setup Order Matters

4. GitHub Actions Has No Built-in Test Report Viewer

5. Playwright GitHub Reporter for Code Annotations

6. Vitest Coverage JSON Summary

SSE Status Events Architecture Lessons

Background: Real-Time Status Updates

Solution: Dynamic Status from [Description] Attributes

1. AG-UI Lifecycle Events (Standard Protocol)

2. FunctionDescriptionProvider (Dynamic Extraction)

Key Challenges Encountered

1. Gerund Conversion Edge Cases

2. Singleton vs Scoped Service Lifetime

3. Pairing STEP_STARTED with STEP_FINISHED

4. Frontend Event Handling

Benefits of Final Architecture

Testing Considerations

Lesson Learned

2. PostgreSQL `json` vs `jsonb` Type

Solution: Dynamic Status from `[Description]` Attributes