(introduction)=
(requirements)=
-
Python
- CPython 3.10, 3.11, 3.12, 3.13, 3.14
(installation)=
$ pip install PyAthenaExtra packages:
| Package | Install command | Version |
|---|---|---|
| SQLAlchemy | pip install PyAthena[SQLAlchemy] |
>=1.0.0 |
| AioSQLAlchemy | pip install PyAthena[AioSQLAlchemy] |
>=2.0.0 |
| Pandas | pip install PyAthena[Pandas] |
>=1.3.0 |
| Arrow | pip install PyAthena[Arrow] |
>=10.0.0 |
| Polars | pip install PyAthena[Polars] |
>=1.0.0 |
(features)=
PyAthena provides comprehensive support for Amazon Athena's data types and features:
Core Features:
- DB API 2.0 Compliance: Full PEP 249 compatibility for database operations
- SQLAlchemy Integration: Native dialect support with table reflection and ORM capabilities
- Multiple Cursor Types: Standard, Pandas, Arrow, Polars, S3FS and Spark cursor implementations
- Async Support: Asynchronous query execution for non-blocking operations
Data Type Support:
- STRUCT/ROW Types: {ref}
Complete support <sqlalchemy>for complex nested data structures - ARRAY Types: {ref}
Complete support <sqlalchemy>for ordered collections with automatic Python list conversion - MAP Types: {ref}
Complete support <sqlalchemy>for key-value dictionary-like data structures - JSON Integration: Seamless JSON data parsing and conversion
- Performance Optimized: Smart format detection for efficient data processing
Additional Features:
- Connection Management: Efficient connection pooling and configuration
- Result Caching: Athena query result reuse capabilities
- Error Handling: Comprehensive exception handling and recovery
- S3 Integration: Direct S3 data access and staging support
(license)=
Many of the implementations in this library are based on PyHive, thanks for PyHive.