This repository contains code and analysis for my STAT 4915 research project on forecasting when protests escalate into political violence.
The project constructs a country–month panel dataset using ACLED event data and macro-political indicators, then applies statistical and machine learning models to estimate short-term escalation risk.
This project relies on three datasets:
- ACLED Event Data (2015–2025) – not included in the repository due to size and licensing restrictions. Data can be downloaded from https://acleddata.com/
- Democracy Index data
- GINI inequality data
All datasets should be placed in a data/ folder before running the analysis.
No additional filtering is required at the download stage for the ACLED dataset, as all event classifications are constructed during data processing. While not necessary, restricting the download to "Central America" and "South America" in the location field can reduce file size and improve download and processing efficiency. Access to ACLED data requires an account, which can typically be obtained through an institutional or academic affiliation.
code/datacleaning.qmd– Cleans and aggregates ACLED event-level data into a country–month panel and merges macro variablescode/modeling.qmd– Constructs the escalation variable, fits models, and evaluates predictive performancedata/– Input datasets (not included for ACLED)output/– Generated figures, tables, and model results
To reproduce the analysis:
- Download and place all required datasets in the
data/folder - Run the data cleaning script:
quarto render code/datacleaning.qmd- Run the modeling script:
quarto render code/modeling.qmdAll figures and results used in the final paper will be generated in the output/ folder.
The analysis uses time-ordered training, validation, and test splits (2015–2025) to preserve forecasting integrity. ACLED data are excluded from the repository due to licensing and size constraints. Results may vary slightly depending on data version and updates.
Sonia Lucey
B.S. Statistical Data Science, University of Connecticut