An AWS-based data pipeline to collect, process, store, and monitor Twitter streaming data thoughout the COVID-19 pandemic in support of local, regional, and national public health initiatives.
python nlp aws text-mining automation twitter twitter-streaming-api stream-processing operations public-health high-performance-computing topic-modeling tweepy data-pipelines operations-research public-health-care academic-research multi-threaded-programming real-time-surveillance public-health-insights
-
Updated
Mar 12, 2025 - Jupyter Notebook