AI Services Documentation

Overview

LogseqSpringThing integrates multiple AI services to provide intelligent features including chat, content generation, and speech processing. The system supports RAGFlow for chat interactions, Perplexity for content queries, and speech-to-text/text-to-speech capabilities.

Architecture

graph LR
    Client[Client] --> Handler[AI Handlers]
    Handler --> RS[RAGFlow Service]
    Handler --> PS[Perplexity Service]
    Handler --> SS[Speech Service]
    
    RS --> RAG[RAGFlow API]
    PS --> PERP[Perplexity API]
    SS --> AP[Audio Processor]
    
    style Handler fill:#f9f,stroke:#333,stroke-width:2px
    style RS fill:#bbf,stroke:#333,stroke-width:2px
    style PS fill:#bfb,stroke:#333,stroke-width:2px
    style SS fill:#fbf,stroke:#333,stroke-width:2px

RAGFlow Service

Location: src/services/ragflow_service.rs
Handler: src/handlers/ragflow_handler.rs

RAGFlow provides chat capabilities with support for streaming responses and text-to-speech.

Configuration

ragflow:
  api_key: "your-api-key"
  base_url: "https://api.ragflow.com"
  timeout: 30

Features

Chat Sessions: Create and manage conversation sessions
Streaming Responses: Real-time streaming of AI responses
Text-to-Speech: Convert responses to audio
Rate Limiting: Built-in rate limiting and retry logic

API Endpoints

Create Chat Session

POST /api/ragflow/sessions
Content-Type: application/json

{
  "name": "Session Name"
}

Send Message

POST /api/ragflow/completion
Content-Type: application/json

{
  "conversation_id": "session-id",
  "message": "User message"
}

Stream Response

POST /api/ragflow/stream_answer
Content-Type: application/json

{
  "question": "User question",
  "session_id": "session-id"
}

Returns: Server-Sent Events stream

Error Handling

pub enum RAGFlowError {
    ReqwestError(reqwest::Error),
    StatusError(StatusCode, String),
    ParseError(String),
    IoError(std::io::Error),
}

Perplexity Service

Location: src/services/perplexity_service.rs
Handler: src/handlers/perplexity_handler.rs

Perplexity service handles intelligent content queries and generates markdown files.

Configuration

perplexity:
  api_key: "your-api-key"
  base_url: "https://api.perplexity.ai"
  timeout: 30
  model: "mixtral-8x7b-instruct"
  max_tokens: 2048
  temperature: 0.7

Features

Content Generation: Generate markdown content based on queries
Metadata Integration: Automatically creates metadata for generated content
File Management: Saves responses as markdown files with proper structure
Context-Aware: Uses conversation history for context

API Endpoints

Query and Save

POST /api/perplexity/query_and_save
Content-Type: application/json

{
  "query": "Explain quantum computing",
  "filename": "quantum-computing",
  "metadata": {
    "tags": ["physics", "computing"],
    "category": "technology"
  }
}

Process Files

POST /api/perplexity/process
Content-Type: application/json

{
  "files": ["file1.md", "file2.md"],
  "action": "summarize"
}

Generated File Structure

# [Title from Query]

[Generated content]

---
Metadata:
- Generated: [timestamp]
- Model: [model-name]
- Query: [original-query]

Speech Service

Location: src/services/speech_service.rs
Handler: src/handlers/speech_socket_handler.rs
Audio Processor: src/utils/audio_processor.rs

Provides speech-to-text and text-to-speech capabilities via WebSocket.

Features

Real-time STT: WebSocket-based speech recognition
TTS Integration: Convert text responses to speech
Audio Processing:
- Sample rate conversion
- Format conversion (WebM to WAV)
- Chunked processing for streaming

WebSocket Protocol

Connection

const ws = new WebSocket('ws://localhost:8080/ws/speech');

Speech-to-Text

// Send audio chunks
ws.send(audioBlob);

// Receive transcription
ws.onmessage = (event) => {
  const result = JSON.parse(event.data);
  if (result.type === 'transcription') {
    console.log(result.text);
  }
};

Audio Processing Pipeline

graph LR
    Audio[Audio Input] --> WebM[WebM Format]
    WebM --> Proc[Audio Processor]
    Proc --> WAV[WAV Format]
    WAV --> STT[Speech Recognition]
    STT --> Text[Text Output]
    
    Text2[Text Input] --> TTS[TTS Engine]
    TTS --> Audio2[Audio Output]

Configuration

speech:
  sample_rate: 16000
  channels: 1
  chunk_size: 4096
  vad_enabled: true
  vad_threshold: 0.5

Integration Patterns

Service Initialization

// In app_state.rs
let ragflow_service = RAGFlowService::new(settings.clone()).await?;
let perplexity_service = PerplexityService::new(settings.clone()).await?;
let speech_service = SpeechService::new(settings.clone()).await?;

Error Handling

All services follow a consistent error pattern:

match service.process(request).await {
    Ok(response) => Ok(HttpResponse::Ok().json(response)),
    Err(e) => {
        error!("Service error: {}", e);
        Ok(HttpResponse::InternalServerError().json(json!({
            "error": e.to_string()
        })))
    }
}

Rate Limiting

Services implement exponential backoff for rate limiting:

let mut retry_count = 0;
loop {
    match client.send().await {
        Ok(resp) => return Ok(resp),
        Err(e) if retry_count < MAX_RETRIES => {
            let delay = Duration::from_millis(100 * 2_u64.pow(retry_count));
            tokio::time::sleep(delay).await;
            retry_count += 1;
        }
        Err(e) => return Err(e.into()),
    }
}

Performance Considerations

Connection Pooling: All services use connection pooling via reqwest::Client
Streaming: Large responses use streaming to reduce memory usage
Caching: Consider implementing response caching for repeated queries
Timeouts: Configurable timeouts prevent hanging requests

Security

API Keys: Stored in environment variables, never in code
Input Validation: All user input is validated before processing
Rate Limiting: Prevent abuse through request throttling
CORS: Properly configured for cross-origin requests

Testing

Unit Tests

#[cfg(test)]
mod tests {
    use super::*;

    #[tokio::test]
    async fn test_ragflow_session() {
        let settings = create_test_settings();
        let service = RAGFlowService::new(settings).await.unwrap();
        // Test implementation
    }
}

Integration Tests

Mock external API responses
Test error scenarios
Verify timeout behavior
Test streaming responses

Monitoring

Logging

info!("RAGFlow request: session={}, message_length={}", session_id, message.len());
error!("Perplexity error: {}", e);

Metrics to Track

Response times
Error rates
Token usage
Active sessions

Uh oh!

FilesExpand file tree

ai-services.md

Latest commit

History

ai-services.md

File metadata and controls

AI Services Documentation

Overview

Architecture

RAGFlow Service

Configuration

Features

API Endpoints

Create Chat Session

Send Message

Stream Response

Error Handling

Perplexity Service

Configuration

Features

API Endpoints

Query and Save

Process Files

Generated File Structure

Speech Service

Features

WebSocket Protocol

Connection

Speech-to-Text

Audio Processing Pipeline

Configuration

Integration Patterns

Service Initialization

Error Handling

Rate Limiting

Performance Considerations

Security

Testing

Unit Tests

Integration Tests

Monitoring

Logging

Metrics to Track

Related Documentation