Skip to content

Latest commit

 

History

History
29 lines (23 loc) · 1.23 KB

File metadata and controls

29 lines (23 loc) · 1.23 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[1.0.0] - 2025-09-10

Added

  • Initial public release of Document Loader MCP Server for AWS Labs MCP repository
  • PDF text extraction using pdfplumber
  • Word document processing (DOCX/DOC) using markitdown
  • Excel spreadsheet reading (XLSX/XLS) using markitdown
  • PowerPoint presentation processing (PPTX/PPT) using markitdown
  • Image loading support for PNG, JPG, JPEG, GIF, BMP, TIFF, WEBP formats
  • Comprehensive test suite with sample document generation
  • FastMCP framework integration for MCP protocol support

Features

  • read_document: Unified document processing tool supporting PDF, Word, Excel, and PowerPoint formats
  • read_image: Load and display image files for LLM analysis

Technical Details

  • Built with FastMCP framework for MCP protocol compliance
  • Uses pdfplumber for reliable PDF text extraction
  • Uses markitdown library for Office document conversion
  • Supports Python 3.10+ with comprehensive error handling
  • Includes automated test generation for validation