Sherlock: Agentic Security Analysis System

Project Overview

Sherlock is an agentic AI system built on AWS Bedrock that analyzes system documentation and logs to identify security vulnerabilities, investigate potential breaches, and provide actionable security insights. The system leverages multiple specialized AI agents working together to perform comprehensive security analysis.

System Architecture

High-Level Architecture

graph TD
    User[User] <--> UI[Web UI]
    UI <--> API[API Gateway]
    API <--> Orchestrator[Agent Orchestrator]
    Orchestrator <--> DocAnalyzer[Documentation Analyzer Agent]
    Orchestrator <--> LogAnalyzer[Log Analyzer Agent]
    Orchestrator <--> InvestigationAgent[Investigation Agent]
    DocAnalyzer <--> PDFProcessor[PDF Processor]
    DocAnalyzer <--> NVDTool[NVD API Tool]
    LogAnalyzer <--> LogProcessor[Log Processor]
    LogAnalyzer <--> RegexTool[Regex Tool]
    InvestigationAgent <--> TreeGenerator[Tree Generator]
    PDFProcessor <--> S3Storage[S3 Storage]
    LogProcessor <--> S3Storage
    NVDTool <--> ExternalAPI[NVD External API]

AWS Integration Architecture

graph TD
    User[User] <--> CloudFront[CloudFront]
    CloudFront <--> S3WebUI[S3 - Web UI]
    CloudFront <--> APIGateway[API Gateway]
    APIGateway <--> LambdaOrchestrator[Lambda - Orchestrator]
    LambdaOrchestrator <--> LambdaDocAnalyzer[Lambda - Doc Analyzer]
    LambdaOrchestrator <--> LambdaLogAnalyzer[Lambda - Log Analyzer]
    LambdaOrchestrator <--> LambdaInvestigation[Lambda - Investigation]
    LambdaDocAnalyzer <--> Bedrock[AWS Bedrock]
    LambdaLogAnalyzer <--> Bedrock
    LambdaInvestigation <--> Bedrock
    LambdaDocAnalyzer <--> S3Storage[S3 - Storage]
    LambdaLogAnalyzer <--> S3Storage
    LambdaInvestigation <--> S3Storage
    LambdaDocAnalyzer <--> DynamoDB[DynamoDB]
    LambdaLogAnalyzer <--> DynamoDB
    LambdaInvestigation <--> DynamoDB
    LambdaDocAnalyzer <--> NVDAPIGateway[API Gateway - NVD Proxy]
    LambdaInvestigation <--> NVDAPIGateway
    NVDAPIGateway <--> LambdaNVD[Lambda - NVD API Client]
    LambdaNVD <--> ExternalNVD[External NVD API]

Agentic Design

Agent Roles and Responsibilities

1. Orchestrator Agent

Role: Coordinates the activities of all specialized agents and manages the overall workflow.

Tools:

Agent Communication Manager
Task Scheduler
State Manager

Inputs:

User requests
System documentation (PDF)
Log files
Agent outputs

Outputs:

Coordinated agent responses
Overall system state

2. Documentation Analyzer Agent

Role: Analyzes system documentation to identify potential security weakpoints and vulnerabilities.

Tools:

PDF Text Extractor
NVD API Client
Vulnerability Analyzer
Report Generator

Inputs:

System documentation (PDF)
NVD API responses

Outputs:

Vulnerability analysis report (Markdown)
Identified system components
Potential security weakpoints

3. Log Analyzer Agent

Role: Analyzes system logs to identify anomalies and potential security issues.

Tools:

Log Parser
Regex Pattern Matcher
Component Identifier

Inputs:

System logs from ./logs directory
System component information from Documentation Analyzer

Outputs:

Log analysis results
Identified anomalies
Relevant log entries

4. Investigation Agent

Role: Creates and manages the investigation tree for breach analysis.

Tools:

Tree Generator
NVD API Client
Hypothesis Generator
Tree Visualizer

Inputs:

Initial breach information
User feedback on plausible nodes
NVD API responses
Information from other agents

Outputs:

Interactive investigation tree
Breach analysis report

Agent Interaction Flow

sequenceDiagram
    participant User
    participant Orchestrator
    participant DocAnalyzer
    participant LogAnalyzer
    participant Investigation
    
    User->>Orchestrator: Upload documentation & logs
    Orchestrator->>DocAnalyzer: Analyze documentation
    DocAnalyzer->>Orchestrator: Return vulnerability report
    Orchestrator->>LogAnalyzer: Analyze relevant logs
    LogAnalyzer->>Orchestrator: Return log analysis
    
    User->>Orchestrator: Initiate investigation
    Orchestrator->>Investigation: Create investigation tree
    Investigation->>User: Present initial tree nodes
    User->>Investigation: Select plausible nodes
    Investigation->>Investigation: Generate next level nodes
    Investigation->>User: Present updated tree
    User->>Investigation: Continue selection process
    Investigation->>Orchestrator: Final investigation report
    Orchestrator->>User: Complete security analysis

Technical Components

1. PDF Processing Pipeline

graph TD
    Upload[PDF Upload] --> S3Store[Store in S3]
    S3Store --> TextractProcess[Process with Textract]
    TextractProcess --> TextStorage[Store Extracted Text]
    TextStorage --> LLMAnalysis[LLM Analysis]

2. Log Analysis Pipeline

graph TD
    LogUpload[Log Upload] --> S3LogStore[Store in S3]
    S3LogStore --> ComponentIdentify[Identify Component]
    ComponentIdentify --> SampleExtract[Extract Sample Lines]
    SampleExtract --> PatternIdentify[Identify Patterns]
    PatternIdentify --> RegexExecution[Execute Regex]
    RegexExecution --> ResultsAnalysis[Analyze Results]

3. Investigation Tree Generation

graph TD
    BreachInfo[Breach Information] --> InitialNodes[Generate Initial Nodes]
    InitialNodes --> UserSelection[User Selection]
    UserSelection --> NextLevel[Generate Next Level]
    NextLevel --> NVDEnrichment[Enrich with NVD Data]
    NVDEnrichment --> UserSelection
    UserSelection --> FinalReport[Generate Final Report]

4. NVD API Integration

graph TD
    Query[Query Formation] --> APIRequest[API Request to NVD]
    APIRequest --> ResponseParsing[Parse Response]
    ResponseParsing --> DataEnrichment[Enrich Analysis Data]
    DataEnrichment --> ReportIntegration[Integrate into Reports]

UI Framework Selection

For a hackathon with a need for a visually appealing UI that can be quickly implemented and is well-known by LLMs, I recommend React with AWS Amplify and Material-UI (MUI) for the following reasons:

React is widely used and well-documented, making it easy to get help from LLMs during development
AWS Amplify provides seamless integration with AWS services and simplifies authentication, API integration, and deployment
Material-UI (MUI) offers beautiful, pre-designed components that can be quickly assembled into a professional-looking interface
This combination allows for rapid development while still producing a visually impressive result for the demo/pitch

For the interactive tree visualization specifically, I recommend using react-d3-tree or react-orgchart libraries, which provide ready-to-use interactive tree visualizations that can be easily customized.

Implementation Roadmap

Phase 1: Setup & Infrastructure (Day 1 - Morning)

Set up GitHub repository with proper structure
Initialize AWS resources:
- S3 buckets for storage
- Lambda functions for each agent
- API Gateway for endpoints
- DynamoDB for state management
- Bedrock model access
Set up React application with AWS Amplify
Configure CI/CD pipeline

Phase 2: Core Agent Implementation (Day 1 - Afternoon)

Implement PDF processing pipeline
Implement NVD API integration
Implement basic agent communication
Set up log analysis framework

Phase 3: UI Development (Day 1 - Evening)

Implement main dashboard UI
Create file upload components
Implement report viewing components
Set up basic navigation

Phase 4: Investigation Tree Implementation (Day 2 - Morning)

Implement tree data structure
Create tree visualization component
Implement node generation logic
Set up user interaction handlers

Phase 5: Integration & Testing (Day 2 - Afternoon)

Integrate all components
Test end-to-end workflows
Fix bugs and optimize performance
Prepare demo data

Phase 6: Finalization & Presentation Prep (Day 2 - Evening)

Polish UI and user experience
Create demonstration script
Prepare PowerPoint presentation
Document code and update GitHub repository

AWS Bedrock Model Selection

For optimal performance across different components, we recommend the following AWS Bedrock models:

Documentation Analysis: Claude 3 Opus
- Reasoning: Best for complex document understanding and detailed analysis
- Use case: Analyzing system documentation to identify vulnerabilities
Log Analysis: Claude 3 Sonnet
- Reasoning: Good balance of performance and cost for pattern recognition
- Use case: Analyzing log files and identifying relevant components
Investigation Tree Generation: Claude 3 Opus
- Reasoning: Superior reasoning capabilities for generating plausible attack paths
- Use case: Creating the investigation tree and generating hypotheses
User Interaction: Claude 3 Haiku
- Reasoning: Fast response times for interactive elements
- Use case: Handling user queries and providing explanations

Technical Considerations

1. Performance Optimization

Implement caching for NVD API responses
Use AWS Lambda provisioned concurrency for critical functions
Optimize PDF processing for large documents
Implement pagination for large log files

2. Security Considerations

Implement proper IAM roles and permissions
Encrypt sensitive data at rest and in transit
Validate all user inputs
Implement rate limiting for API endpoints

3. Scalability

Design for horizontal scaling of Lambda functions
Use DynamoDB for scalable state management
Implement efficient S3 storage patterns
Design UI components to handle varying amounts of data

4. Hackathon Success Factors

Focus on visual impact for the demo
Prepare compelling examples that showcase the system's capabilities
Ensure smooth transitions between components during the presentation
Highlight AWS integration points in the presentation

Conclusion

The Sherlock system architecture provides a comprehensive approach to security analysis using agentic AI powered by AWS Bedrock. By leveraging multiple specialized agents, the system can analyze documentation, logs, and create investigation trees to provide valuable security insights.

The implementation roadmap is designed for rapid development in a hackathon environment, with a focus on creating a visually impressive and functional prototype that showcases the integration of AWS services and AI capabilities.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sherlock: Agentic Security Analysis System

Project Overview

System Architecture

High-Level Architecture

AWS Integration Architecture

Agentic Design

Agent Roles and Responsibilities

1. Orchestrator Agent

2. Documentation Analyzer Agent

3. Log Analyzer Agent

4. Investigation Agent

Agent Interaction Flow

Technical Components

1. PDF Processing Pipeline

2. Log Analysis Pipeline

3. Investigation Tree Generation

4. NVD API Integration

UI Framework Selection

Implementation Roadmap

Phase 1: Setup & Infrastructure (Day 1 - Morning)

Phase 2: Core Agent Implementation (Day 1 - Afternoon)

Phase 3: UI Development (Day 1 - Evening)

Phase 4: Investigation Tree Implementation (Day 2 - Morning)

Phase 5: Integration & Testing (Day 2 - Afternoon)

Phase 6: Finalization & Presentation Prep (Day 2 - Evening)

AWS Bedrock Model Selection

Technical Considerations

1. Performance Optimization

2. Security Considerations

3. Scalability

4. Hackathon Success Factors

Conclusion

FilesExpand file tree

project_scope.md

Latest commit

History

project_scope.md

File metadata and controls

Sherlock: Agentic Security Analysis System

Project Overview

System Architecture

High-Level Architecture

AWS Integration Architecture

Agentic Design

Agent Roles and Responsibilities

1. Orchestrator Agent

2. Documentation Analyzer Agent

3. Log Analyzer Agent

4. Investigation Agent

Agent Interaction Flow

Technical Components

1. PDF Processing Pipeline

2. Log Analysis Pipeline

3. Investigation Tree Generation

4. NVD API Integration

UI Framework Selection

Implementation Roadmap

Phase 1: Setup & Infrastructure (Day 1 - Morning)

Phase 2: Core Agent Implementation (Day 1 - Afternoon)

Phase 3: UI Development (Day 1 - Evening)

Phase 4: Investigation Tree Implementation (Day 2 - Morning)

Phase 5: Integration & Testing (Day 2 - Afternoon)

Phase 6: Finalization & Presentation Prep (Day 2 - Evening)

AWS Bedrock Model Selection

Technical Considerations

1. Performance Optimization

2. Security Considerations

3. Scalability

4. Hackathon Success Factors

Conclusion