haskellcraigz/TEE-dataset

TEE-dataset

Code for generating the TEE (Temperature Extremes in Europe) dataset. The TEE datasets can be downloaded from Figshare (link here). Detailed information on the data-generating process and the final datasets is available here.

Recommended Citation: Ronnkvist, S.R., Haskell-Craig, Z., Robinson, A., Conte Keivabu, R., Hauer, M.E., Bovienzo, D., & Zagheni, E. (2025) What’s the TEE: Metrics of Temperature Extremes in Europe NUTS Regions (1980-2024). Scientific Data. https://doi.org/10.1038/s41597-025-05352-7

Abstract


We generate datasets quantifying extreme temperature exposure in Europe using a variety of metrics at two sub-national spatial scales (NUTS 2 and NUTS 3) and three temporal scales (daily, extreme temperature wave, and yearly) from 1980-2024. These datasets capture the breadth of temperature metrics used in the epidemiology, demography, and environmental literature with 67 different metrics, including regionally unusual temperature events (defined as temperatures above/below the 95th/5th percentile of historical temperatures) and periods of sustained (consecutive-day) exposure to extreme temperatures. Although climate data are publicly available, their format and spatial resolution rarely match the structure, scale, and extent used to disseminate government statistics on health, economic, and demographic variables, and manipulating raw data is computationally expensive. Here we provide temperature data in a user-friendly format that can easily be linked to EuroStat. Our open-source code and reproducible methods can be extended to produce similar datasets at the global scale.


Code description

The code produces three datasets containing measures of extreme temperatures (hot and cold) across Europe at the NUTS2 and NUTS3 spatial resolutions. Figure 2 (in the paper) outlines the data analysis pipeline. Briefly, the pipeline has three stages. First, we convert daily temperature, which is available only on a raster grid, to the daily mean for each region (polygon data). Second, we use the daily temperature to calculate seven commonly used metrics of extreme exposure at a variety of thresholds. Third, we use these metrics to define runs of consecutive days of extreme exposure as heatwaves or coldsnaps.
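The third stage, grouping consecutive extreme days into waves, can be illustrated with a minimal base R sketch. This is not the code used in the repository: the temperatures, the fixed threshold, the minimum wave length, and the function name find_waves are all assumptions made for illustration. In practice the threshold would come from a historical reference period, for example a 95th percentile.

```r
# Hypothetical daily mean temperatures (deg C) for one region; not TEE data.
temps <- c(21, 24, 31, 32, 33, 25, 22, 34, 20, 35, 36, 19)

# Assumed threshold; in practice this could be a percentile of historical
# temperatures, e.g. quantile(historical_temps, 0.95).
threshold <- 30

# Assumed minimum number of consecutive days that counts as a wave.
min_length <- 2

# Hypothetical helper: returns one row per run of consecutive days above
# the threshold that lasts at least min_length days.
find_waves <- function(temps, threshold, min_length) {
  extreme <- temps > threshold
  runs <- rle(extreme)                  # run-length encode TRUE/FALSE days
  ends <- cumsum(runs$lengths)          # last day index of each run
  starts <- ends - runs$lengths + 1     # first day index of each run
  keep <- runs$values & runs$lengths >= min_length
  data.frame(
    start_day = starts[keep],
    end_day = ends[keep],
    duration = runs$lengths[keep]
  )
}

waves <- find_waves(temps, threshold, min_length)
print(waves)
```

The isolated hot day at index 8 is dropped because it does not meet the assumed minimum length. A cold-snap version would flip the comparison to temps < threshold.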

Replication notes:

Steps

  • Download the raw data from the publicly available repositories and save it in the Data folder
    • Daily temperature and UTCI data can be downloaded from Copernicus using the scripts in 00_datadownload/
  • Run scripts in folder 01_rasterprocessing/ to process raster files and generate daily temperature measures (TEEdaily)
  • Run the following scripts in folder 02_metricconstruction/ to compute yearly temperature measures (used in TEEyearly):
    • METRIC-cdd.R
    • METRIC-expected_daysoverthreshold.R
    • METRIC-extremetempwavedays_global.R
    • METRIC-extremetempwavedays_local.R
    • METRIC-percentilethresholdtemp_days_local.R
    • METRIC-tropicalnights.R
    • METRIC-utci.R
  • Run scripts in folder 03_compiledataset to generate final datasets for TEE-daily, TEE-yearly, and TEE-wave
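The metric construction step could be driven from a single R session with a small helper like the sketch below. The script names and folder are taken from the list above. The helper run_metric_scripts, its repo_root and dry_run arguments, and the assumption that the scripts can run in any order and in isolated environments are illustrative and not confirmed by the repository.

```r
# Script names as listed in the replication steps.
metric_scripts <- c(
  "METRIC-cdd.R",
  "METRIC-expected_daysoverthreshold.R",
  "METRIC-extremetempwavedays_global.R",
  "METRIC-extremetempwavedays_local.R",
  "METRIC-percentilethresholdtemp_days_local.R",
  "METRIC-tropicalnights.R",
  "METRIC-utci.R"
)

# Hypothetical helper: builds the script paths and, unless dry_run is TRUE,
# sources each one in a fresh environment.
run_metric_scripts <- function(repo_root = ".", dry_run = TRUE) {
  paths <- file.path(repo_root, "02_metricconstruction", metric_scripts)
  missing <- paths[!file.exists(paths)]
  if (!dry_run && length(missing) > 0) {
    stop("Scripts not found: ", paste(missing, collapse = ", "))
  }
  for (p in paths) {
    message("Running ", p)
    if (!dry_run) source(p, local = new.env())
  }
  invisible(paths)
}

# Dry run: only reports which scripts would be executed.
paths <- run_metric_scripts(repo_root = ".", dry_run = TRUE)
```

Setting dry_run = FALSE would execute the scripts. Whether they expect a particular working directory or shared objects is not stated here, so check the SET UP section of each file first.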

Raw data

Raw climate data are available from the Copernicus Data Store.

Shapefile polygons delineating the European NUTS regions are available from EuroStat.

Gridded population estimates were produced by the Center for International Earth Science Information Network (CIESIN) and are available as the Gridded Population of the World, Version 4 (GPWv4): Population Count, Revision 11.

Replicating code for other regions or metrics

While we only publish data for Europe, our code can be adapted to compute temperature extremes for other regions or at different spatial scales. One caveat is that users must have access to high-throughput computing to process the raster data. Because we used global raster data, the main change required is to swap in a different shapefile and update the associated variables (e.g. the spatial unit identifier) used to compute spatial means. Other minor revisions include changing the names of output files and variables. Users wishing to generate metrics for a different time span must also change the time frame listed in the SET UP section of each code file. Users may also wish to use a different measure of daily temperature to construct the yearly temperature metrics. Since we generated daily measures of the minimum, mean, and maximum temperature, users can simply switch the temperature variable used in the metric construction.
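Switching the temperature variable can be as simple as passing a different column name to the metric function, as in the base R sketch below. The data frame, the column names (tmin, tmean, tmax), the thresholds, and the function count_days_over are all hypothetical and do not reflect the actual TEE variable names or metric definitions.

```r
# Hypothetical daily data for one region spanning two calendar years.
daily <- data.frame(
  date = as.Date("2020-12-30") + 0:3,
  tmin = c(18, 21, 19, 19),
  tmean = c(24, 26, 27, 25),
  tmax = c(29, 31, 33, 32)
)

# Hypothetical helper: counts, per year, the days on which the chosen
# temperature variable exceeds the threshold.
count_days_over <- function(daily, temp_var, threshold) {
  stopifnot(temp_var %in% names(daily))
  year <- as.integer(format(daily$date, "%Y"))
  over <- daily[[temp_var]] > threshold
  counts <- tapply(over, year, sum, na.rm = TRUE)
  data.frame(
    year = as.integer(names(counts)),
    days_over = as.integer(counts)
  )
}

# Same metric, different temperature variable (thresholds are assumed).
hot_days <- count_days_over(daily, "tmax", 30)
warm_nights <- count_days_over(daily, "tmin", 20)
print(hot_days)
print(warm_nights)
```

Keeping the variable name as an argument means one function serves the minimum, mean, and maximum temperature series.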

R environment

R version 4.4.1 (2024-06-14)
Platform: x86_64-pc-linux-gnu
Running under: AlmaLinux 8.10 (Cerulean Leopard)

Matrix products: default
BLAS/LAPACK: /usr/lib64/libopenblas-r0.3.15.so;  LAPACK version 3.9.0

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C               LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8     LC_MONETARY=en_US.UTF-8   
 [6] LC_MESSAGES=en_US.UTF-8    LC_PAPER=en_US.UTF-8       LC_NAME=C                  LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       

time zone: America/Chicago
tzcode source: system (glibc)

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] foreign_0.8-86       sf_1.0-16            exactextractr_0.10.0 terra_1.7-78         fs_1.6.4             lubridate_1.9.3     
 [7] forcats_1.0.0        stringr_1.5.1        dplyr_1.1.4          purrr_1.0.2          readr_2.1.5          tidyr_1.3.1         
[13] tibble_3.2.1         ggplot2_3.5.1        tidyverse_2.0.0     

loaded via a namespace (and not attached):
 [1] s2_1.1.6           utf8_1.2.4         generics_0.1.3     lwgeom_0.2-14      class_7.3-22       KernSmooth_2.23-24 stringi_1.8.4     
 [8] lattice_0.22-6     hms_1.1.3          magrittr_2.0.3     grid_4.4.1         timechange_0.3.0   pkgload_1.4.0      e1071_1.7-14      
[15] DBI_1.2.3          fansi_1.0.6        scales_1.3.0       codetools_0.2-20   cli_3.6.3          rlang_1.1.4        crayon_1.5.3      
[22] units_0.8-5        munsell_0.5.1      withr_3.0.0        tools_4.4.1        raster_3.6-26      tzdb_0.4.0         colorspace_2.1-0  
[29] vctrs_0.6.5        R6_2.5.1           zoo_1.8-12         proxy_0.4-27       lifecycle_1.0.4    classInt_0.4-10    pkgconfig_2.0.3   
[36] pillar_1.9.0       gtable_0.3.5       glue_1.7.0         Rcpp_1.0.13        tidyselect_1.2.1   rstudioapi_0.16.0  farver_2.1.2      
[43] wk_0.9.1           compiler_4.4.1     sp_2.1-3
