The original wildfire image dataset is not included in this repository.
The project expects the following directories when reproducing the full pipeline:
data/raw/: the unmodified source dataset after extractiondata/splits/: canonical train, validation, and test folders created byscripts/prepare_dataset.py
Expected split layout:
data/splits/
train/
fire/
no_fire/
val/
fire/
no_fire/
test/
fire/
no_fire/
The preparation script also writes data/splits/SPLIT_INFO.json with split counts, seed values, and source file metadata for reproducibility.