Skip to main content
GeoHealth logoLink to GeoHealth
. 2025 Sep 8;9(9):e2025GH001451. doi: 10.1029/2025GH001451

Improved High Resolution Heat Exposure Assessment With Personal Weather Stations and Spatiotemporal Bayesian Models

Eva Marquès 1,, Kyle P Messier 1
PMCID: PMC12415529  PMID: 40927409

Abstract

Most of the United States (US) population resides in cities, where they are subjected to the urban heat island effect. In this study, we develop a method to estimate hourly air temperatures at 0.01°×0.01° resolution, improving exposure assessment of US population when compared to existing gridded products. We use an extensive network of personal weather stations to capture the intra‐urban variability. The uncertainty associated with this crowdsourced data set is addressed through a spatiotemporal Bayesian model implemented with the Integrated Nested Laplace Approximation‐Stochastic Partial Differential Equation approach. We evaluate the model on Philadelphia (PA), New York City (NY), Phoenix (AZ), and the Triangle area (NC). These case studies span different climatic zones and urban landscapes. They cover several meteorological events including a deadly heatwave in Phoenix and a snowstorm hitting part of the US in winter 2021. We obtain an overall root mean square error of 1.06°C, demonstrating the versatility of our model, and its applicability across various regions in the US. The high granularity of our model allows for the precise identification of hotspots that were previously undetected with daymet and gridMET products. Using the data generated by our method, we show that neighborhoods with high population concentration are more likely to experience elevated temperatures and prolonged hot nights, thus encouraging the use of our model for further epidemiological investigations on the impact of heat or cold stress on human health.

Keywords: 2 m‐air temperature, personal weather station, spatiotemporal, Bayesian Hierarchical Model, urban heat island, INLA‐SPDE

Plain Language Summary

Heat exposure is an increasing concern across the United States (US), especially in cities where residents are exposed to the urban heat island effect. In this study, we introduce a novel methodology designed to accurately measure air temperature exposure within all US cities. We demonstrate how this method can enhance the exposure assessment of population compared to existing air‐temperature products. Our approach is applied to four cities with different climate and urban layouts: Philadelphia (PA), New York City (NY), Phoenix (AZ), and the Triangle area (NC). It is validated against several meteorological events including a deadly heatwave in Phoenix and a snowstorm hitting part of the US in winter 2021. Our findings indicate that neighborhoods with high population concentration are more likely to experience high temperatures and prolonged hot nights. This research aims to facilitate further epidemiological investigations on the impact of heat or cold stress on human health.

Key Points

  • Novel spatiotemporal Bayesian approach to estimate hourly evolution of air temperatures at high resolution with personal weather stations

  • Urban heat island detection allows for the identification of urban hotspots that were previously undetected with daymet and gridMET products

  • Neighborhoods with high population concentration are more likely to experience both elevated temperatures and prolonged hot nights

1. Introduction

In recent decades, the United States (US) have witnessed a wide range of devastating natural disasters. As shown in Bell et al. (2024), the number of heatwaves in the US has tripled between the 1960s and 2020s, and their duration moved from an average of 3 days in the 1960s to 4 nowadays. Modeled projections state that temperatures will continue to increase throughout the 21st century (IPCC, 2023), with varying intensity depending on the scenario of greenhouse gas emission.

Extreme temperatures hit human health in multiple ways (Bell et al., 2024), increasing the risk of cardiovascular (Alahmad et al., 2023; Lane et al., 2024; Zafeiratou et al., 2021), respiratory (Konstantinoudis et al., 2023; Zafeiratou et al., 2023) and renal diseases (Lee et al., 2024; Tasian et al., 2014; Zhang et al., 2024). Son et al. (2022) showed evidence that heat exposure during pregnancy increases the odds ratio of preterm births. Extensive literature worldwide on excess mortality summarizes the danger posed by extreme heat events (Vicedo‐Cabrera et al., 2021; Weinberger et al., 2020). Heat strokes primarily affect people with comorbidities and the elderly (S. Chen et al., 2024; de Schrijver et al., 2022), the latter causing a growing public health concern while US population is aging (Bureau, 2023). Intense physical activity can also lead to heat exhaustion, even in young people (Vanos et al., 2023; Vecellio et al., 2023), making outdoor workers particularly vulnerable during heatwaves (Borg et al., 2021; De Sario et al., 2023; Spector et al., 2023).

In the urban environment, temperatures can exceed surrounding rural's by several degrees Celsius. This phenomenon known as the urban heat island (UHI) is mainly caused by a combination of the three‐dimensional layout of the streets and the thermal and radiative properties of the urban infrastructure. It is also exacerbated by anthropogenic heat emissions associated with industries, building thermal regulation, and transportation. Solar radiation stored during the day is released with delay as heat during the night, causing the UHI to vary both intra‐ and inter‐daily. It generally reaches its peak at the end of the night, and is exacerbated under sunny and low‐windy conditions. As a striking example, K. Chen et al. (2022) estimated that the nighttime UHI of Chicago reached more than 2.8°C during 2012 intense heatwave. In summers 2006–2013, Hardin et al. (2018) showed that the average nighttime UHI magnitude in New York City (NYC) was 3.51°C. When added to high regional temperatures, urban nocturnal thermal discomfort can turn into a life‐threatening risk (He et al., 2022; Murage et al., 2017; Royé et al., 2021). Neighborhoods affected by high UHI often coincide with those concentrating population with low socioeconomic status (S. Chen et al., 2024; Hsu et al., 2021; Li et al., 2022; Wilson, 2020). Population suffering from poor housing or homelessness are more likely to be exposed to thermal discomfort (Bezgrebelna et al., 2021; du Bray et al., 2023). In urban areas, UHI also acts in combination with other stressors, such as air pollution, noise, and absence of nature (Choi et al., 2022; Piracha & Chaudhary, 2022; Stafoggia et al., 2023; Wang et al., 2023).

According to the United Nations (UN, 2018), 83% of North Americans were living in urban areas in 2018, and it is projected to increase to 89% in 2050. In the US, the effect of the UHI on health and mortality might be underestimated: the urban temperature reference is most often derived from weather stations located at airports, which is not representative of dense city centers (WMO, 2018). Environmental epidemiological studies require high resolution and reliable reconstruction of past 2‐m air temperature (T2M). In the US, there are mainly three state‐of‐the‐art daily gridded products available: Daymet (Thornton et al., 2021), gridMET (Abatzoglou, 2013) or PRISM (Daly et al., 1997). Spangler et al. (2019) evaluated and compared these three products for their use in environmental epidemiology, but as they mentioned, weather stations used for the evaluation are not representative of the urban complex territory. Spatial prediction methods differ from one product to the other, but while most of the US population lives in urban areas, as far as we know none of these specifically included urban characteristics in their land use regression. The proper detection of the spatial variation of the UHI has not been demonstrated. Based on this statement, Newman et al. (2024) started to address this issue by creating a new gridded product. They accounted for the land surface heterogeneity, and added observations from local networks available nationwide.

Still, most of US cities remain unmonitored at high resolution, partly due to standards set by the World Meteorological Organization (WMO, 2018). They state that the weather stations should be placed at a height of 2 m, on a flat, low grass surface and at a distance from obstacles like buildings or trees. These criteria are challenging to meet in cities outside of parks, although they have been relaxed for urban environments in T. R. Oke (2004). To address this issue, there is increased global effort to incorporate opportunistic data. In particular, a growing number of meteorological studies incorporates personal weather stations (PWS), also called citizen weather stations when used for the urban environment (Meier et al., 2017). While not rigorously calibrated and controlled, they offer an unrivaled coverage in densely populated areas. After being pre‐processed (Fenner et al., 2021; Napoly et al., 2018), they can be used to capture the intra‐daily variability of T2M. However, despite the efficiency of quality control routines, measurement error and uncertainty remain the main limitation to their use in gridded products.

This paper introduces a new hourly T2M model that properly reconstructs the observed fine‐scale UHI and its intra‐daily evolution. We use PWS data and account for their uncertainty through a spatiotemporal Bayesian approach. We select a set of spatial covariates to have a good representation of terrain and land cover, and reanalysis of hourly meteorological variables at 2 m. The approach is applied and evaluated on four US urban areas with different layouts, population densities, and climatic zones: New York City (NY), Philadelphia (PA), Phoenix (AZ), and the Triangle (NC). Performances are evaluated with independent local state networks and compared to gridMET and daymet state‐of‐the‐art gridded products (Abatzoglou, 2013; Thornton et al., 2021). We run a heat stress exposure analysis as well to emphasize the need to incorporate products that properly detect UHI in health impact studies.

2. Methods

2.1. Study Area Descriptions

Here we describe the four urban areas selected to explore the heterogeneity of the city layout, size, population density, and climatic zones. Figure 1 shows the Local Climate Zones classification that is widely used in urban climatology to characterize the urban land cover types (Demuzere et al., 2020; Stewart & Oke, 2012).

Figure 1.

Figure 1

Local Climate Zones map on Philadelphia—New York City, Phoenix and the Triangle urban areas with personal weather stations (pws) and state reference network (ref) spatial locations.

The first area encompasses both NYC and Philadelphia, two of the most populated areas in the northeast of the US. The inclusion of these two neighbor cities in our analysis aims at demonstrating the capacity of the model to run on reasonably large scale, and simultaneously deals with cities of different characteristics. NYC is a coastal city that encompasses several bays and has dense urban zones. Philadelphia is located 130 km southwest of NYC and approximately 80 km inland from the Atlantic coast. It is mainly covered with a low‐ and mid‐rise built environment while NYC has grown vertically. The two cities are related by the I‐95 corridor, a major highway crossing several urban and suburban areas.

Second, is Phoenix, capital city of the southwestern state of Arizona (US). It is located in the Salt River Valley in the northern part of the Sonoran desert. Defying its extreme hot climate, 5.1 million people were reported in 2023 to live in the metropolitan area, including Phoenix, Mesa and Chandler. Its surroundings are very seldom inhabited and are mainly covered with sparse and low vegetation, with mountainous area to the north east.

The Triangle describes the metropolitan area delimited by the cities of Raleigh, Durham and Chapel Hill (NC). Since the 1950s, the region has attracted many tech companies and environmental health research laboratories, causing the area to grow particularly fast in recent years (U.S. Census Bureau, 2024). The Piedmont region, lying in between the Atlantic coastal plain and the Appalachian mountains, is characterized by a mild topography. The Triangle is located in the Neuse and Cape Fear river basins and accounts for lots of man‐made lakes and reservoirs, notable ones being Falls lake in north Raleigh and Jordan Lake in the south west visible on Figure 1. The urbanization is not very dense, and suburban areas are mixed with hardwood forests with a large variety of trees.

Both NYC, Philadelphia and the Triangle have a humid subtropical climate (Köppen climate classification Cfa, Peel et al., 2007), characterized by hot summers and cold winters. While Philadelphia and the Triangle are further inland, NYC has direct maritime influences, which can help moderate temperature extremes. In Phoenix, hot extremes are usual as the city has a hot desert climate (BWh in Köppen climate classification, Peel et al. (2007)). We specifically pay attention to the summer season, where weather conditions are favorable to extreme heat and high magnitude of UHI. In Philadelphia—NYC, we study July 2024 which was representative of a classical summer month. In Phoenix, we choose July 2023 as it broke records: according to the National Weather Service, it was the hottest month ever recorded at that time in the history of the city, and one of the driest as well. The average monthly temperature reached 38.8°C. According to the 2023 Heat Related Deaths Report of Maricopa County (Batchelor, 2024), 645 people died from heat in Phoenix's county that year. In the Triangle, we study July 2021 when temperature average was around normals. Notably, tropical storm Elsa hit the region on the eighth and brought 50–75 mm of rainfall (Badgett & Danco, 2021). As our method aims at being generalizable, we include the winter month of January 2021 in Philadelphia—NYC as well, when an extreme cold event hit the region.

2.2. Weather Station Data

The main objective of our methodology is to provide a tool applicable anywhere in the US at high spatial resolution. To this end, our model is trained on PWS from Weather Underground and validated against local or regional monitoring networks. PWS from the Weather Underground platform are widely spread out across the world, especially in western countries and in the US (see https://www.wunderground.com/wundermap for real‐time map of worldwide Weather Underground stations). The selected regions are covered by state networks visible on the map and depicted in Table 1. As opposed to PWS, these stations have been installed by meteorologists and are expected to comply with the World Meteorological Organization standards (WMO, 2018). We use them as reference stations, both to estimate the PWS measurement error in a preliminary analysis and to evaluate our model results.

Table 1.

State Reference Networks Used to Estimate Personal Weather Stations Measurement Error and Evaluate the Bayesian Hierarchical Model Approach

Name State Reference # Stations used for evaluation
MESONET New York Brotzge et al. (2020) 5
MESONET New Jersey Office of the NJ State Climatologist (2024) 30
AZMET Arizona Arizona Meteorological Network (2023) 5
ECONET North Carolina North Carolina State Climate Office (2025) 9

PWS are abundant in Phoenix and in the Triangle, and slightly sparser in the north east region of Philadelphia—NYC (Figure 1). As mentioned in Brousse et al. (2024), Calhoun et al. (2024), spatial coverage disparity also occurs at the intra‐urban level. It is partly explained by sociodemographic indicators but also by housing type. Few PWS are available in Manhattan, likely due to high‐rise buildings not suitable for outdoor PWS deployment. In Phoenix, while the urbanized area is densely covered, PWS in the desert are very scarce. A similar pattern is observed in the Triangle, even though the area surrounding the city is not as extreme.

There are many personal or low‐cost monitor calibration methods (Beele et al., 2022; Cornes et al., 2020; Fenner et al., 2021). Here we apply a statistical cleaning technique outlined in Fenner et al. (2021) using the CrowdQCplus R package. We eliminate observations failing at main routines checking for invalid metadata or unlikely measurements when compared to their neighbors. Isolated PWS are kept to improve rural coverage. We are unable to get rid of PWS located on high floors since the height of the Weather Underground stations is not provided.

Statistical quality control is efficient for detecting part of the erroneous observations, but some uncertainty remains in the data. In a preliminary analysis, we estimate the measurement error with the help of stations coming from independent state networks. Only PWS located less than 5 km from reference stations are kept for the estimate. A smaller radius would not provide a sufficient number of pairs for a robust error estimation. We take the precaution to remove reference stations when their surrounding PWS are not classified with similar land cover: the differential between measurements would likely be caused by the difference between their respective environments, and not by the sensors' quality itself.

Figure 2 shows the distribution of the estimated measurement error throughout the course of the day on the different territories and seasons. While this error is quite low and centered in NYC—Philadelphia in winter, PWS tend to overestimate in summer. In particular, T2M is underestimated in the Triangle mornings and overestimated in the Phoenix afternoons. For practical reason, we deduce beforehand the hourly average error in each region. The dispersion of the error is directly accounted for in the model itself (see Section 2.4).

Figure 2.

Figure 2

Empirical distribution of the estimated measurement error of personal weather stations after CrowdQC + quality control and calculated against independent state networks. Each plot corresponds to a case study and the distributions are vertically displayed for each hour of the day.

2.3. Covariates

PWS observation sparsity is partly accommodated by the inclusion of relevant geographic covariates. All covariates are listed in Table 2.

Table 2.

Spatial and Spatiotemporal Covariates Included in the Bayesian Hierarchical Model of 2 m‐Air Temperature

Full description Resolution
elev Terrain elevation (m) (Danielson & Gesch, 2011) 7.5 arc‐s
fch Forest canopy height (m) (Potapov et al., 2021) 30 m
imp Imperviousness (%) (Dewitz, 2024) 30 m
t2m ERA5 reanalysis of 2 m‐air temperature (°C) (Hersbach et al., 2020) 0.25°× 0.25° hourly
rh Relative humidity derived from ERA5 reanalysis of 2 m‐air temperature and 2 m‐dew point temperature (%) 0.25°× 0.25° hourly
tp ERA5 reanalysis of total precipitation (m) (Hersbach et al., 2020) 0.25°× 0.25° hourly
u10 ERA5 reanalysis of 10 m U wind component (m) (Hersbach et al., 2020) 0.25°× 0.25° hourly
v10 ERA5 reanalysis of 10 m V wind component (m) (Hersbach et al., 2020) 0.25°× 0.25° hourly
tcc ERA5 reanalysis of total cloud cover (%) (Hersbach et al., 2020) 0.25°× 0.25° hourly

To model terrain and land cover variations, we add altitude from the Global multi‐resolution terrain elevation data 2010 (Danielson & Gesch, 2011), the forest canopy height from the Global Land Analysis and Discovery laboratory (Potapov et al., 2021), and the imperviousness from the National Land Cover Database (Dewitz, 2024). The latter is essential to represent the level of urbanization. Atmospheric variables from reanalysis are included to account for the temporal evolution of T2M. The use of meteorological products often requires to arbitrate between high spatial and high temporal resolution. We prioritize the hourly availability of the Copernicus ERA5 reanalysis on single level (Hersbach et al., 2020) and select six relevant covariates listed in Table 2. Relative humidity is not directly provided in the ERA5 data set and has been derived from 2 m‐air temperature and 2 m‐dewpoint air temperature (Kraus, 2007).

We initially included tree canopy cover, building footprint (Heris et al., 2020) and solar radiation variables. After several experimental trials, we removed them to avoid colinearity in the additive model (see Section 2.4) and help interpret covariate contribution to the output (Section 3.2).

2.4. Spatiotemporal Bayesian Hierarchical Model Definition

We develop a spatiotemporal geostatistical model using a Bayesian Hierarchical Model (BHM) designed to leverage both information contained in the data and additional knowledge brought by covariates and prior definitions, while being interpretable and providing complete uncertainty quantification (Wikle et al., 1998). Let z, y and θ be random vectors representing the latent field of T2M, PWS observations, and model parameters, respectively. The BHM utilizes Bayes's theorem to estimate the most likely outcome distribution given the observations plus prior information on the data and parameter space through the following structure:

p(z,θy)p(yz,θ)×p(zθ)×p(θ). (1)

The latent field z is observed at S spatial locations and T hourly timestamps:

y(s,t)=z(s,t)+ϵ(s,t),s=1,,S,t=1,,T. (2)

where ϵ(s,t) represents the measurement error, with ϵNμy,σy2I. The prior on the precision σy21 is a log‐Gamma distribution with hyperparameters derived from the measurement error estimation (Section 2.2). Each observation follows a Gaussian distribution yNη+μy,σy2, with

η(s,t)=β0+i=13h=1241tDhαi,hxi(s)+j=16βjxj(s,t)+f(s,t). (3)

β0 is a Gaussian intercept with prior mean 0 and a high precision of 10. xii{13} is the set of spatial covariates encompassing elevation, imperviousness and forest canopy height. Dh is the set of timestamps with local time hour h (e.g., on July 2024 case study, D8={2024/07/01:0800,2024/07/02:0800,,2024/07/31:0800}). The term 1tDhαi,h means that we have different coefficients for the spatial covariates for each hour of the day. This choice has been made based on the assumption that the influence of topography and land cover on T2M varies throughout the course of the day. For example, T2M at time t partly results from the energy balance of the surrounding surfaces integrated through the past hours. Physically, a coefficient for each hour of the day makes more sense where a single linear relationship would miss part of the mechanism. Some atmospheric circulation processes induced by topography also change with diurnal—nocturnal cycle. We choose low informative priors but make sure to keep coefficients within physically‐likely ranges. For the elevation, we set a Gaussian prior with mean 0.006°C.m1 and a high precision as it is commonly admitted that T2M in the troposphere is linearly decreasing by 5.5–6.5°C every 1,000 m. This allows sparse high‐elevation mountain observations to inform the BHM.

x(j)j{16} is the set of meteorological factors coming or derived from ERA5 reanalysis. They help the model to adapt with meteorological conditions and seasonality. As for spatial covariates, we set low informative priors for β. For ERA5 T2M, despite its low spatial resolution, we consider that it should be very close to the outcome field of T2M. We set a Gaussian prior with mean 1 and a precision of 10.

f(s,t) can be seen as the remaining part of the spatiotemporal field that has not been modeled through the first additive terms. It is a non‐linear effect modeled through a centered Gaussian field f=MVN0,ΣST, with a separable spatiotemporal covariance matrix:

ΣST=ΣSΣT (4)

where ΣS is the spatial covariance of a Matérn field (Gneiting et al., 2010). With Integrated Nested Laplace Approximation (INLA), priors on its range r and variance σs can be set with penalized complexity priors (Fuglstad et al., 2019): here p(r<0.1)=0.99, and pσS>4=0.01. The smoothness parameter ν=α1 is fixed and is equal to 1, as we keep the default parametrization for α=2. ΣT is the covariance of an autoregressive process of order 1 on time t (Blangiardo & Cameletti, 2015):

yt=ρyt1+ωt,t=2,,T (5)

with ρ<1 and ωt a white noise. The bigger ρ is, the stronger the correlation from one timestamp to the next will be. To force the model to provide coherent sequences of T2M map, we set the following penalized complexity hyperprior on ρ:

p(ρ>0.6)=0.9. (6)

2.5. Inference and Prediction With INLA

We have defined each term of Equation 1. The likelihood is known thanks to Equation 2: conditional on the latent process z and on a set of likelihood parameters μy,σy2, observations are i.i.d with Gaussian distribution ymtz,μy,σy2Nμy,mt,σy2. Based on Equation 2 as well, zθMVNη,ΣST. All priors on parameters and hyperparameters are independent and are multiplied to get p(θ). The posterior distribution of T2M is derived through the hierarchical structure enounced in Equation 1, and this process is known as the inference of the BHM.

The inference is commonly done through Markov Chain Monte Carlo (MCMC) methods which sample the joint posterior distribution of model parameters. These algorithms are often impractical on large‐scale and high‐resolution spatiotemporal fields due to the computational complexity of inverting and storing the large covariance matrix. The INLA approach employs a sequence of approximation and optimization techniques to provide accurate deterministic approximations to the posterior marginals of latent Gaussian processes (Rue et al., 2009). For this specific class of models, extensively employed in practice, this alternative to MCMC significantly accelerates the inference. Additionally, the Spatial Partial Differential Equation (SPDE) method (Krainski et al., 2018; Lindgren et al., 2011) complements INLA to approximate spatial fields with Matérn covariance. Another big advantage of the INLA‐SPDE approach is that its complex algorithms have been synthesized in functions available in INLA R library (Lindgren & Rue, 2015).

2.6. Evaluation Method

INLA‐SPDE provides inference and access to the full posterior distribution of the T2M spatiotemporal field. For conciseness, we will focus on the mean of that distribution, subsequently referred to as T2MBHM. The mean is the most likely value, and the most relevant as well to assess heat exposure. As a quantitative evaluation (Section 3.1), we compare reference measurements to predicted values. We define residuals as the difference between measurement and T2MBHM at the same location. These will be used to calculate the root mean square error (RMSE) for each case study.

We detail in Section 3.2 how materials provided through the INLA‐SPDE approach can be analyzed to better understand both the model itself and physical processes at stake. The standard deviation σBHM(s,t) is an indicator of the uncertainty of the BHM at each point in the spatiotemporal domain. We also have access to posterior marginals for all model parameters. These distributions, compared to prior settings, help to understand covariate contributions to T2M for each case study and at different times of the day.

In Section 3.3, we compare T2MBHM to daymet and gridMET. Daymet applies a spatial and temporal interpolation of the preprocessed Global Historical Climatology Network—daily (Menne et al., 2012). GridMET merges PRISM, a land‐use regression on US observational networks (Daly et al., 2008), with North American Land Data Assimilation System (Mitchell et al., 2004). As opposed to our BHM, developed with specific attention to densely populated areas, these products were not developed for human health applications. Still, they are often used in epidemiological studies as they have a better spatial coverage than observational networks (e.g., Castro et al., 2023; Healy et al., 2023; Thomas et al., 2021). We compare the two products to daily minima T2Mmin and maxima T2Mmax derived from our BHM hourly predictions at each spatial location. In accordance with the two products, extrema are calculated from midnight to midnight (local standard time), for each day.

The last phase of our model evaluation focuses on its capacity to provide an accurate heat exposure assessment (Section 3.4). We first analyze the exposure spatial disparities by mapping the UHI, calculated by removing the spatial domain mean of T2MBHM. We also compare T2MBHM at two locations, one rural and the other urban, to have an insight on its intra‐daily evolution. Additionally, we take advantage of the hourly resolution to calculate exposure duration to nighttime extreme heat. For each night n, the hot night duration τn is defined as follows:

τn=h=10pm6am1T2Mhδ,τn19, (7)

with δ the average of the 0.9‐quantile of the minimum temperature recorded in July on the reference period 1991–2020:

δ=131d=07/0107/31q0.9minT2Md1991−2020. (8)

T2M on the reference period are measured by a reference station from the neighboring, daily Global Historical Climatology Network (Menne et al., 2012). High percentile of T2M generally comes with public health concerns (Royé et al., 2021), therefore we fix the threshold to q0.9. We then cross both the UHI and the hot night duration with population density data (Warszawski et al., 2017). A linear relationship between the UHI amplitude and the logarithm of the total population of cities has long been established (T. Oke, 1973). With our approach, we can investigate if the positive correlation remains valid at the intra‐city level, and look at the relationship with the duration of extreme exposure as well.

3. Results

3.1. Model Performances

The overall RMSE is of 1.06°C, with slight variations between case studies: 0.93°C and 1.08°C for January 2021 and July 2024 respectively in Philadelphia‐NYC, 1.33°C for July 2023 in Phoenix and 1.23°C January 2021 in the Triangle. RMSE on the 81 reference weather stations used for the evaluation ranges between 0.44 and 1.69°C, with an outlier of 2.76°C for a coastal station located near the beach.

Figure 3 helps to evaluate the consistency of the BHM in space and time for July 2024 case study in Philadelphia—NYC. On panel a, the residual distribution for each hour of the day consistently spreads around zero, showing that the performances barely vary with the time of day. The circles in plot B correspond to reference stations and are filled with their RMSE. The model performs better on average in some areas than others. In Durham and Phoenix, urban reference stations score better than rural ones (see Supporting Information S1). Even if the BHM is built to extrapolate to unobserved zones, its performances might still be downgraded in those areas because of the lower amount of information available. These critical areas can at least be detected through the additional material supplied by our Bayesian inference approach.

Figure 3.

Figure 3

Bayesian Hierarchical Model (BHM) diagnostic plots for the output in Philadelphia—New York City urban area on July 2023. (a) Overall residuals per local time hour. (b) Average standard deviation of the BHM and root mean square error at each MESONET weather station, with black dots corresponding to each personal weather stations.

3.2. BHM Output Interpretation

The background of the map in Figure 3b shows the average σBHM(t) at each location extracted from the posterior distribution. Areas with high standard deviation generally correspond to neighborhoods with few PWS or to areas where neighboring PWS do not agree. Phoenix for example, is surrounded by a desert and mountainous areas where PWS coverage is very low (Figure 1). The T2M field is inferred in those areas as well thanks to covariates, but it has a higher uncertainty than in the extensively covered urban area (see Supporting Information S1). Similarly to all statistical models, performances of the BHM both vary in space and time because of sampling, but this uncertainty aspect is monitored through the BHM standard deviation.

On Figure 4a, marginal posterior mean and 95% confidence intervals are displayed for elevation, forest canopy height and imperviousness coefficients. As mentioned in Equation 3, these change for each hour of the day. Priors for these coefficients are plotted on the left with same scale, to show how the precision and mean evolved through the inference process. The coefficient for the height of the forest canopy is not included in Phoenix, as the covariate barely varies throughout that region. Although we set a prior based on the known relationship between elevation and T2M in the troposphere, the posterior coefficient does not necessarily stay around the prior mean. It suggests that other meteorological effects related to terrain are at stake. The imperviousness mostly has a positive contribution, but as opposed to what we expect with our knowledge on UHI, it is not peaking at the end of the night. In our model, the forest canopy height seems to play a decisive role at this moment of the day. In the Triangle, imperviousness coefficient can even be negative, especially at night. For this specific region, the urbanized area slightly overlooks three basins located in the rural surroundings. This particular topography might partly explain why the nocturnal UHI is mild (see Supporting Information S1). Two lakes located in these reservoirs are particularly hot: the eye of an expert would help to determine if the storage and release of heat makes physically sense, or if it is an artifact of our probabilistic model. This specific case illustrates that even if they are informative, it is always delicate to draw definitive physics conclusions from the marginal posteriors.

Figure 4.

Figure 4

(a) Spatial covariate coefficient marginal priors (left) and posterior marginal's mean and 95% confidence interval evolution throughout the day. (b) ERA5 coefficients posterior marginals. Color corresponds to each case study (nps = Philadelphia—New York City (NYC) July 2024, npw = Philadelphia—NYC January 2021, phoe = Phoenix July 2023, tri = Triangle July 2021).

In Figure 4b, we plot the whole posterior marginals for each hourly meteorological variable (β in Equation 3). Its prior mean being set to 1 (Section 2.4), we notice that the BHM relies less than expected on the reanalysis of temperature, especially in Phoenix where the coefficient has been offset to less than 0.75. Total precipitation has a negative contribution on the Triangle and a positive one on summer 2024 in Philadelphia—NYC. Marginals are centered around zero for the other cases, suggesting no significant effect.

The analysis of the marginal posterior reveals that the contribution of covariates changes with the region and the season. With the Bayesian approach, the model is fit to each case study through the inference process. The effect of covariates should not be generalized, but rather interpreted conditional on the layout of the city, its geography and the meteorological event under consideration. Covariates are also incorporated through a relatively simple linear additive function: interpretation must be made accordingly.

We also analyze the parametrization of the non‐linear spatiotemporal term f(s,t) (Equation 3). For every case study, the posterior mean of ρ, the coefficient of the AR1 process, is between 0.86 and 0.95, with very high precision. It means that the field at time t1 strongly contributes to the next time step t. Thanks to this property, T2MBHM field is a coherent sequence that can be observed like a movie, partly revealing atmospheric processes like horizontal advection (movies of the case studies are shared in Supporting Information S1). The spatial range of the Gaussian field varies greatly from one case study to the next. It is larger for Philadelphia—NYC area. The variance of the field is between 1 and 1.8°C: the spatial field can be seen as a mold for the non‐negligible information contained in the data that the additive linear terms missed.

3.3. Comparison to Daily Gridded Products

Figure 5 shows one example of T2Mmin maps obtained with GridMET, daymet and our BHM approach on January 2021, where the overall RMSE is of 0.93°C. It is representative of all the maps obtained on each of our case studies: the BHM brings a higher granularity and detects some hot spots that do not appear on other products. It has a higher spatial resolution than gridMET, and even though daymet has a similar grid cell size, their T2M field looks smoothed across the region. In that example, the UHI of Philadelphia is shifted to the south west with gridMET, and it is simply not detected with daymet. In NYC, the BHM shows a T2Mmin up to 3 to 4°C in some neighborhoods of the city when it does not exceed 1°C for daymet and 3°C for gridMET.

Figure 5.

Figure 5

From left to right: minimal 2 m‐air temperature estimation in Philadelphia—New York City on 2021 14 January with gridMET, daymet, and the Bayesian Hierarchical Model approach.

When averaged on all the four urban areas explored in this study, it appears that the T2Mmax is almost systematically 0.5 to 3°C higher for existing products than for our BHM. For the T2Mmin, the three products globally agree, except on the winter case study where the BHM estimates hotter T2Mmin in rural areas. Further investigation on other cities is necessary to see if these patterns are the same in every climatic region and for all seasons. Daymet and gridMET have known and documented uncertainties in mountainous areas. Even though this analysis is not sufficient to draw conclusions about their performances in the urban environment, it encourages the user to be cautious when using these products for epidemiological studies. In contrast, the BHM approach is designed to optimize urban exposure assessment and shows better accuracy in densely populated areas.

3.4. Intra‐Daily Thermal Exposure Assessment

Figure 6a is a map of the UHI on Philadelphia—NYC averaged over July 2024, as defined in Section 2.6 (see Supporting Information S1 for other summer case studies). Philadelphia and NYC urban areas are up to 4°C hotter than the average of the region. The UHI is milder in the Triangle, where the urban area is mainly composed of open low‐rise local climate zone with a lot of vegetation patches (Figure 1). On Philadelphia and NYC, the UHI amplitude is smaller in winter than in summertime. The T2M gradient between the Atlantic ocean coastal area and the inland territories appears to be stronger in July 2024 than in January 2021, suggesting seasonal oceanic influences on T2M. Even if it is a good tool to target hot spots, the map of the average UHI smooths the intra‐daily variation. In Figure 6c, the intra‐daily variation on July 2024 can be read with a sweep from left to right. The UHI is stronger at night in Philadelphia, NYC, and Phoenix, as opposed to Durham and Raleigh where it reaches its maximum during the day. We observe a cool island in the afternoon in NYC and in the very late night and morning in Phoenix, where the rural point is located at the southwest of the city. The results on Philadelphia—NYC are coherent with those obtained in Hardin et al. (2018) in terms of UHI shape, intra‐daily amplitude and seasonal variation.

Figure 6.

Figure 6

Heat exposure analysis in July 2024 on Philadelphia—New York City with (a) average differential to regional hourly mean (urban heat island, UHI), (b) hot night average duration, (c) UHI evolution in the two city throughout the month, and (d) empirical density of the UHI and hot night duration per population density ranges. Legend is common to every plot. Points represent urban and rural locations used to create plot (c).

In Figure 6b, we map the median duration of hot nights in July 2024 in Philadelphia—NYC. For all of the four cities, we observe that hot nights in July were much longer within intensively urbanized areas. By crossing heat exposure data with population density (Figure 6d), we show that neighborhoods where a lot of people are living are more likely exposed to both higher amplitude and longer heat exposure.

4. Discussion

4.1. Urban Heat Island Detection

The BHM is designed to be applicable to any territory in the US, with a specific focus on urban areas. Our BHM properly detects the UHI thanks to three components: it uses a dense observational network of PWS, it has a high 0.01°×0.01°C spatial resolution and it incorporates urban land cover covariates. With this high spatial resolution, we detect for instance the cooling effect of Central Park in NYC and other urban parks. Both in Philadelphia and in the Triangle, our model detects hot spots in zones near Delaware River and other water bodies. Some urban neighborhoods are hotter than others.

While being densely inhabited, cities are often misrepresented in observed T2M reconstruction. In that respect, our approach fills a gap when compared to existing gridded products. Further work is needed on that topic to explore the added value of BHM in comparison to concurrent approaches to assess adverse health outcomes.

4.2. Intra‐Daily Evolution

The use of a spatiotemporal covariance allows to keep a certain amount of information from one time step to the next. It provides a temporal coherence that can be exploited to study the path of storms and other atmospheric events. Movies of the UHI hourly evolution on all case studies are shared in Supporting Information S1. Some physical processes however might be missed by the current BHM. The latter would gain to have non‐separable spatiotemporal structure. Covariances designed for advection and/or diffusion representation have already been implemented for the INLA‐SPDE approach (Clarotto et al., 2024; Lindgren et al., 2023). They are highly promising as they tend to hybrid the physics properties to the probabilistic approach, and deserve to be investigated in future versions of our BHM.

4.3. Model Performances and Crowdsourced Data

Our model is fitted with PWS data because of their dense urban coverage. Despite this advantage, PWS are not always reliable and their measurement error is not fully understood. The quality control routines applied before the BHM are not always sufficient, but they are also challenging to improve. Unlike institutional networks, Weather Underground data do not come with precise metadata such as weather station model or location environment. It is not possible to know if they are installed in gardens, balconies, or against obstacles. One of the strengths of our approach is the incorporation of the estimated measurement uncertainty in the BHM. Despite several modeling tricks, the BHM standard deviation is larger in the desert, in the mountains, and in rural areas where sensors are sparse. At the intra‐city level, some types of neighborhood might have fewer sensors than others (Brousse et al., 2024; Calhoun et al., 2024), and the consequences of that coverage inequity on exposure assessment would deserve more exploration. PWS are also a relatively recent source of data. Their availability does not cover the entire climatological period 1991–2020. The number of available stations increases with time. The accuracy of the products derived from PWS might increase as well in future years in covered areas.

4.4. Scalability Problem

Environmental health studies have better precision and can detect smaller effect sizes within noisy data when applied to large cohorts and domains. Weather Underground data is extensively available across the whole contiguous US, and the covariates cover the entire territory. From this perspective, and as shown with the different case studies, we can apply the BHM anywhere. Through Phoenix's heatwave and cold winter in Philadelphia——NYC case studies, we also illustrate the ability to properly capture extreme temperature events. Our approach, which can be applied on any US city, offers a new tool for urban planning and public health actors.

Scaling the BHM to the entire contiguous US on long periods is not trivial, even though INLA‐SPDE approach is one of the fastest to estimate Bayesian spatiotemporal Gaussian fields. We ran our experiments on a high computing platform, with 10 CPUs with 6 Gb memory each. PWS cleaning, covariates extraction, and data formatting before running the model take a lot of time, but these steps are algorithmically simple and can easily be parallelized. The call to INLA function in itself takes 41 min total on Phoenix 11,000km2, 17 min in the Triangle 4,500km2 and around 1h10 on the largest area encompassing NYC and Philadelphia 22,000km2. On larger scales, promising work showed that running time can significantly be reduced by using the PARDISO library for sparse linear algebra (Gaedke‐Merzhäuser et al., 2023) or a novel block tridiagonal arrowhead solver to leverage the power of GPUs Gaedke‐Merzhäuser et al. (2024). These solutions could be considered to accelerate future versions of our model.

Additional scientific challenges also arise with the creation of a country‐wide and hourly model: how to optimize the SPDE mesh with PWS coverage inequities across the whole US territory? How to generalize the coefficients of static covariates with the different seasons, timezones and geographies? Such a model would probably require to be complexified with additional covariates like solar angles, or to be stratified by climatic region and season. The current version of our model has the advantage to be fitted to a specific region and season each time, as shown in Figure 4. For the time being, the code of our approach is available in three open‐source R libraries (see Section 5). It makes it easier to use in a parallel framework to quickly generate large data sets of US cities for climatic events of interest. This strategy implies having zones and periods remaining uncovered. It still has the advantage of creating a decent learning set to potentially train a downscaling model for ERA5 or MERRA2 hourly reanalysis products. For example, a deep learning approach would ideally make profit of parallel computing on GPUs and CPUs to generate hourly data at 0.01°×0.01° resolution for several years on the contiguous US 7,700,000km2.

5. Conclusion

Heat exposure is a major public health concern in the US and the epidemiological literature on this topic is very active. In urban areas, people are exposed to the UHI effect, whose full amplitude is sometimes missed in state‐of‐the‐art gridded products. The UHI phenomenon varies by several degrees Celsius throughout the day, typically peaking at the end of the night. The impact of hot night exposure on human health is not fully understood in the US, partly because accurate 2 m‐air temperature (T2M) data for populated areas is difficult to obtain.

The Bayesian spatiotemporal model we propose in this article addresses several of these concerns. First, it leverages PWS data to achieve a spatial resolution of 0.01°×0.01°C and successfully detects the UHI amplitude. Our approach also provides coherent hourly estimations of the T2M, allowing us to examine intra‐daily exposure evolution. It is applicable anywhere in the contiguous US and can be adapted with equivalent data sets to other countries.

We have applied the BHM to four urban areas with very different climates and geographies. It has been tested under normal weather conditions, as well as extreme hot and cold events. Its performance has been evaluated against independent reference networks, with a consistent RMSE ranging from 0.93 to 1.33°C depending on the case study. As most of studies using PWS do, we run a quality control before the model inference. We go further with our approach by incorporating the remaining measurement uncertainty in the Bayesian framework itself. The inference provides the entire distribution of the field, making the model's uncertainty accessible through its dispersion measures.

We provide an overview of the implications that our model could have on heat exposure assessment. In comparison to gridMET and daymet, our approach offers a significantly better granularity, allowing to target local hotspots that were not detected before. Both exposure duration and amplitude can be calculated at high spatial resolution. Notably, we show that neighborhoods with high population concentration are more likely to experience high temperatures and longer hot nights. These preliminary findings raise the question of the accuracy of extreme heat risk assessment regarding urban environments. The model we propose will contribute to the ongoing effort to tailor heatwave alerts to each territory, thereby reducing the number of victims. It will offer an accurate gridded product to feed heat stress research and assist public health initiatives to help vulnerable populations.

Conflict of Interest

The authors declare no conflicts of interest relevant to this study.

Supporting information

Supporting Information S1

Acknowledgments

This work is supported by the National Institute of Environmental Health Sciences, Division of Translational Toxicology, Division of Intramural Research, and the Spatiotemporal Exposures and Toxicology group under project number ZIA ES103368‐02. The Weather Underground data has been pulled with the help of the IBM Expert Labs. We would like to thank Daniel Zilber and Mariana Alifa Kassien for their constructive comments.

Marquès, E. , & Messier, K. P. (2025). Improved high resolution heat exposure assessment with personal weather stations and spatiotemporal Bayesian models. GeoHealth, 9, e2025GH001451. 10.1029/2025GH001451

Data Availability Statement

T2MBHM rasters on Philadelphia—NYC (July 2024 and January 2021), Phoenix (July 2023) and the Triangle (July 2023) are archived on the Chemical Effects in Biological Systems database (Marques & Messier, 2025a). Additional data generated with our approach is available on the top largest US Urban Census areas, covering different climatic events (heatwave, blizzard, typical weather) and seasons. The T2MBHM rasters are stored in an open‐access versioned repository in the CAFE Climate and Health Research Coordinating Center Collection of the Harvard Dataverse (Marques & Messier, 2025b).

Software: R codes used to prepare Weather Underground data in the preliminary analysis are available in https://github.com/NIEHS/brassens/ (Marques & Messier, 2025c). R codes for the Bayesian Hierarchical Model and 2‐m air temperature rasters generated on the four urban areas can be found in https://github.com/NIEHS/samba/ (Marques & Messier, 2025e). R codes for the gridded products comparison are shared in https://github.com/NIEHS/mercury/ (Marques & Messier, 2025d).

References

  1. Abatzoglou, J. T. (2013). Development of gridded surface meteorological data for ecological applications and modelling. International Journal of Climatology, 33(1), 121–131. 10.1002/joc.3413 [DOI] [Google Scholar]
  2. Alahmad, B. , Khraishah, H. , Royé, D. , Vicedo‐Cabrera, A. M. , Guo, Y. , Papatheodorou, S. I. , et al. (2023). Associations between extreme temperatures and cardiovascular cause‐specific mortality: Results from 27 countries. Circulation, 147(1), 35–46. 10.1161/CIRCULATIONAHA.122.061832 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arizona Meteorological Network . (2023). Arizona meteorological network (AZMet) data. University of Arizona. Retrieved from https://azmet.arizona.edu [Google Scholar]
  4. Badgett, P. , & Danco, J. (2021). 2021 central NC climate summary.
  5. Batchelor, M. (2024). 2023 heat related deaths report.
  6. Beele, E. , Reyniers, M. , Aerts, R. , & Somers, B. (2022). Quality control and correction method for air temperature data from a citizen science weather station network in Leuven, Belgium. Earth System Science Data, 14(10), 4681–4717. 10.5194/essd-14-4681-2022 [DOI] [Google Scholar]
  7. Bell, M. L. , Gasparrini, A. , & Benjamin, G. C. (2024). Climate change, extreme heat, and health. New England Journal of Medicine, 390(19), 1793–1801. 10.1056/NEJMra2210769 [DOI] [PubMed] [Google Scholar]
  8. Bezgrebelna, M. , McKenzie, K. , Wells, S. , Ravindran, A. , Kral, M. , Christensen, J. , et al. (2021). Climate change, weather, housing precarity, and homelessness: A systematic review of reviews. International Journal of Environmental Research and Public Health, 18(11), 5812. 10.3390/ijerph18115812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Blangiardo, M. , & Cameletti, M. (2015). Spatial and spatio‐temporal Bayesian models with R‐INLA. John Wiley & Sons. 10.1002/9781118950203 [DOI] [Google Scholar]
  10. Borg, M. A. , Xiang, J. , Anikeeva, O. , Pisaniello, D. , Hansen, A. , Zander, K. , et al. (2021). Occupational heat stress and economic burden: A review of global evidence. Environmental Research, 195, 110781. 10.1016/j.envres.2021.110781 [DOI] [PubMed] [Google Scholar]
  11. Brotzge, J. A. , Wang, J. , Thorncroft, C. D. , Joseph, E. , Bain, N. , Bassill, N. , et al. (2020). A technical overview of the New York state mesonet standard network. Journal of Atmospheric and Oceanic Technology, 37(10), 1827–1845. 10.1175/JTECH-D-19-0220.1 [DOI] [Google Scholar]
  12. Brousse, O. , Simpson, C. H. , Poorthuis, A. , & Heaviside, C. (2024). Unequal distributions of crowdsourced weather data in England and wales. Nature Communications, 15(1), 4828. 10.1038/s41467-024-49276-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bureau, U. C. (2023). 2023 national population projections tables: Main series [section: Government]. Retrieved from https://www.census.gov/data/tables/2023/demo/popproj/2023‐summary‐tables.html
  14. Calhoun, Z. D. , Black, M. S. , Bergin, M. , & Carlson, D. (2024). Refining citizen climate science: Addressing preferential sampling for improved estimates of urban heat. Environmental Science and Technology Letters, 11(8), 845–850. 10.1021/acs.estlett.4c00296 [DOI] [Google Scholar]
  15. Castro, E. , Healy, J. , Liu, A. , Wei, Y. , Kosheleva, A. , & Schwartz, J. (2023). The interaction between extreme temperature and PM2.5 on mortality in nine US States. In ISEE conference abstracts (Vol. 1). 10.1289/isee.2023.EP-051 [DOI] [Google Scholar]
  16. Chen, K. , Newman, A. J. , Huang, M. , Coon, C. , Darrow, L. A. , Strickland, M. J. , & Holmes, H. A. (2022). Estimating heat‐related exposures and urban heat island impacts: A case study for the 2012 Chicago heatwave. GeoHealth, 6(1), e2021GH000535. 10.1029/2021GH000535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chen, S. , Bruhn, S. , & Seto, K. C. (2024). Trends in socioeconomic disparities in urban heat exposure and adaptation options in mid‐sized U.S. cities. Remote Sensing Applications: Society and Environment, 36, 101313. 10.1016/j.rsase.2024.101313 [DOI] [Google Scholar]
  18. Choi, H. M. , Lee, W. , Roye, D. , Heo, S. , Urban, A. , Entezari, A. , et al. (2022). Effect modification of greenness on the association between heat and mortality: A multi‐city multi‐country study. EBioMedicine, 84, 104251. 10.1016/j.ebiom.2022.104251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Clarotto, L. , Allard, D. , Romary, T. , & Desassis, N. (2024). The SPDE approach for spatio‐temporal datasets with advection and diffusion. Spatial Statistics, 62, 100847. 10.1016/j.spasta.2024.100847 [DOI] [Google Scholar]
  20. Cornes, R. C. , Dirksen, M. , & Sluiter, R. (2020). Correcting citizen‐science air temperature measurements across The Netherlands for short wave radiation bias. Meteorological Applications, 27(1), e1814. 10.1002/met.1814 [DOI] [Google Scholar]
  21. Daly, C. , Halbleib, M. , Smith, J. I. , Gibson, W. P. , Doggett, M. K. , Taylor, G. H. , et al. (2008). Physiographically sensitive mapping of climatological temperature and precipitation across the conterminous United States. International Journal of Climatology, 28(15), 2031–2064. 10.1002/joc.1688 [DOI] [Google Scholar]
  22. Daly, C. , Taylor, G. , & Gibson, W. (1997). The PRISM approach to mapping precipitation and temperature. In Proceedings of the 10th AMS conference on applied climatology (pp. 20–23). [Google Scholar]
  23. Danielson, J. J. , & Gesch, D. B. (2011). Global multi‐resolution terrain elevation data 2010 (GMTED2010). US Geological Survey. [Google Scholar]
  24. Demuzere, M. , Hankey, S. , Mills, G. , Zhang, W. , Lu, T. , & Bechtel, B. (2020). Combining expert and crowd‐sourced training data to map urban form and functions for the continental US. Scientific Data, 7(1), 264. 10.1038/s41597-020-00605-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. De Sario, M. , de’Donato, F. K. , Bonafede, M. , Marinaccio, A. , Levi, M. , Ariani, F. , et al. (2023). Occupational heat stress, heat‐related effects and the related social and economic loss: A scoping literature review. Frontiers in Public Health, 11, 1173553. 10.3389/fpubh.2023.1173553 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. de Schrijver, E. , Bundo, M. , Ragettli, M. S. , Sera, F. , Gasparrini, A. , Franco, O. H. , & Vicedo‐Cabrera, A. M. (2022). Nationwide analysis of the heat‐ and cold‐related mortality Trends in Switzerland between 1969 and 2017: The role of population aging. Environmental Health Perspectives, 130(3), 037001. 10.1289/EHP9835 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Dewitz, J. (2024). National land cover database (NLCD) 2019 products (ver. 3.0, February 2024). 10.5066/P9KZCM54 [DOI]
  28. du Bray, M. V. , Stotts, R. , Southee, R. , & Wutich, A. (2023). Beyond extreme: Heat emergency and water Insecurity for people experiencing Houselessness in Phoenix, Arizona, USA during and after the heatwave of 2023. Human Ecology, 51(5), 799–808. 10.1007/s10745-023-00447-4 [DOI] [Google Scholar]
  29. Fenner, D. , Bechtel, B. , Demuzere, M. , Kittner, J. , & Meier, F. (2021). CrowdQC+—A quality‐control for crowdsourced air‐temperature observations Enabling world‐wide urban climate applications. Frontiers in Environmental Science, 9, 720747. 10.3389/fenvs.2021.720747 [DOI] [Google Scholar]
  30. Fuglstad, G.‐A. , Simpson, D. , Lindgren, F. , & Rue, H. (2019). Constructing priors that penalize the complexity of Gaussian random fields. Journal of the American Statistical Association, 114(525), 445–452. 10.1080/01621459.2017.1415907 [DOI] [Google Scholar]
  31. Gaedke‐Merzhäuser, L. , Krainski, E. , Janalik, R. , Rue, H. , & Schenk, O. (2024). Integrated nested Laplace approximations for large‐scale spatiotemporal Bayesian modeling. SIAM Journal on Scientific Computing, 46(4), B448–B473. 10.1137/23M1561531 [DOI] [Google Scholar]
  32. Gaedke‐Merzhäuser, L. , van Niekerk, J. , Schenk, O. , & Rue, H. (2023). Parallelized integrated nested Laplace approximations for fast Bayesian inference. Statistics and Computing, 33(1), 25. 10.1007/s11222-022-10192-1 [DOI] [Google Scholar]
  33. Gneiting, T. , Kleiber, W. , & Schlather, M. (2010). Matérn cross‐covariance functions for multivariate random fields. Journal of the American Statistical Association, 105(491), 1167–1177. 10.1198/jasa.2010.tm09420 [DOI] [Google Scholar]
  34. Hardin, A. W. , Liu, Y. , Cao, G. , & Vanos, J. K. (2018). Urban heat island intensity and spatial variability by synoptic weather type in the northeast U.S. Urban Climate, 24, 747–762. 10.1016/j.uclim.2017.09.001 [DOI] [Google Scholar]
  35. He, C. , Kim, H. , Hashizume, M. , Lee, W. , Honda, Y. , Kim, S. E. , et al. (2022). The effects of night‐time warming on mortality burden under future climate change scenarios: A modelling study. The Lancet Planetary Health, 6(8), e648–e657. 10.1016/S2542-5196(22)00139-5 [DOI] [PubMed] [Google Scholar]
  36. Healy, J. P. , Danesh Yazdi, M. , Wei, Y. , Qiu, X. , Shtein, A. , Dominici, F. , et al. (2023). Seasonal temperature variability and mortality in the medicare population. Environmental Health Perspectives, 131(7), 077002. 10.1289/EHP11588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Heris, M. P. , Foks, N. L. , Bagstad, K. J. , Troy, A. , & Ancona, Z. H. (2020). A rasterized building footprint dataset for the United States. Scientific Data, 7(1), 207. 10.1038/s41597-020-0542-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hersbach, H. , Bell, B. , Berrisford, P. , Hirahara, S. , Horányi, A. , Muñoz‐Sabater, J. , et al. (2020). The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730), 1999–2049. 10.1002/qj.3803 [DOI] [Google Scholar]
  39. Hsu, A. , Sheriff, G. , Chakraborty, T. , & Manya, D. (2021). Disproportionate exposure to urban heat island intensity across major US cities. Nature Communications, 12(1), 2721. 10.1038/s41467-021-22799-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. IPCC . (2023). Climate change 2022—Impacts, adaptation and vulnerability: Working group II contribution to the sixth assessment report of the intergovernmental panel on climate change. Cambridge University Press. [Google Scholar]
  41. Konstantinoudis, G. , Minelli, C. , Lam, H. , Fuertes, E. , Ballester, J. , Davies, B. , et al. (2023). Asthma hospitalisations and heat exposure in England: A case–crossover study during 2002–2019. Thorax, 78(9), 875–881. 10.1136/thorax-2022-219901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Krainski, E. , Gómez‐Rubio, V. , Bakka, H. , Lenzi, A. , Castro‐Camilo, D. , Simpson, D. , et al. (2018). Advanced spatial modeling with stochastic partial differential equations using R and INLA. Hall/CRC. 10.1201/9780429031892 [DOI] [Google Scholar]
  43. Kraus, H. (2007). Die Atmosphäre der Erde: Eine Einführung in die Meteorologie. Springer‐Verlag. [Google Scholar]
  44. Lane, M. , Ebelt, S. , Wu, Z. , Scovronick, N. , D’Souza, R. R. , & Chang, H. H. (2024). Time‐series analysis of temperature variability and cardiovascular emergency department visits in Atlanta over a 27‐year period. Environmental Health, 23(1), 9. 10.1186/s12940-024-01048-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lee, J. , Oh, S. , Byon, J.‐Y. , Lee, W. , Weon, B. , Ko, A. , et al. (2024). Long‐term exposure to high perceived temperature and risk of mortality among patients with chronic kidney disease. Heliyon, 10(3), e25222. 10.1016/j.heliyon.2024.e25222 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Li, D. , Newman, G. D. , Wilson, B. , Zhang, Y. , & Brown, R. D. (2022). Modeling the relationships between historical Redlining, urban heat, and heat‐related emergency department visits: An Examination of 11 Texas cities. Environment and Planning B: Urban Analytics and City Science, 49(3), 933–952. 10.1177/23998083211039854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Lindgren, F. , Bakka, H. , Bolin, D. , Krainski, E. , & Rue, H. (2023). A diffusion‐based spatio‐temporal extension of Gaussian Matérn fields [arXiv:2006.04917 [stat]]. 10.48550/arXiv.2006.04917 [DOI]
  48. Lindgren, F. , & Rue, H. (2015). Bayesian spatial modelling with R‐INLA. Journal of Statistical Software, 63(19), 1–25. 10.18637/jss.v063.i19 [DOI] [Google Scholar]
  49. Lindgren, F. , Rue, H. , & Lindström, J. (2011). An explicit link between Gaussian fields and Gaussian Markov random fields: The stochastic partial differential equation approach. Journal of the Royal Statistical Society Series B: Statistical Methodology, 73(4), 423–498. 10.1111/j.1467-9868.2011.00777.x [DOI] [Google Scholar]
  50. Marques, E. , & Messier, K. (2025a). Urban air temperature at high spatiotemporal resolution on major US cities (Version 1) [Dataset]. Chemical Effects in Biological Systems. 10.22427/NTP-DATA-500-021-001-000-9 [DOI]
  51. Marques, E. , & Messier, K. (2025b). Urban air temperature at high spatiotemporal resolution on major US cities (Version V1) [Dataset]. Harvard Dataverse. 10.7910/DVN/HNVCBR [DOI]
  52. Marques, E. , & Messier, K. (2025c). Brassens (version 1.0) [Software]. Zenodo. 10.5281/zenodo.15595882 [DOI]
  53. Marques, E. , & Messier, K. (2025d). Mercury (version 1.0.0) [Software]. Zenodo. 10.5281/zenodo.15596059 [DOI]
  54. Marques, E. , & Messier, K. (2025e). Samba (version 1.0) [Software]. Zenodo. 10.5281/zenodo.15595977 [DOI]
  55. Meier, F. , Fenner, D. , Grassmann, T. , Otto, M. , & Scherer, D. (2017). Crowdsourcing air temperature from citizen weather stations for urban climate research. Urban Climate, 19, 170–191. 10.1016/j.uclim.2017.01.006 [DOI] [Google Scholar]
  56. Menne, M. J. , Durre, I. , Vose, R. S. , Gleason, B. E. , & Houston, T. G. (2012). An overview of the global historical climatology network‐daily database. Journal of Atmospheric and Oceanic Technology, 29(7), 897–910. 10.1175/JTECH-D-11-00103.1 [DOI] [Google Scholar]
  57. Mitchell, K. E. , Lohmann, D. , Houser, P. R. , Wood, E. F. , Schaake, J. C. , Robock, A. , et al. (2004). The multi‐institution North American Land Data Assimilation System (NLDAS): Utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. Journal of Geophysical Research, 109(D7), D07S90. 10.1029/2003JD003823 [DOI] [Google Scholar]
  58. Murage, P. , Hajat, S. , & Kovats, R. S. (2017). Effect of night‐time temperatures on cause and age‐specific mortality in London. Environ Epidemiol, 1(2), e005. 10.1097/EE9.0000000000000005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Napoly, A. , Grassmann, T. , Meier, F. , & Fenner, D. (2018). Development and application of a statistically‐based quality control for crowdsourced air temperature data. Frontiers in Earth Science, 6, 118. 10.3389/feart.2018.00118 [DOI] [Google Scholar]
  60. Newman, A. J. , Kalb, C. , Chakraborty, T. C. , Fitch, A. , Darrow, L. A. , Warren, J. L. , et al. (2024). The high‐resolution urban meteorology for impacts dataset (HUMID) daily for the conterminous United States. Scientific Data, 11(1), 1321. 10.1038/s41597-024-04086-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. North Carolina State Climate Office . (2025). Cardinal. [Data retrieval interface]. NC State University. Retrieved from https://products.climate.ncsu.edu/cardinal/request [Google Scholar]
  62. Office of the NJ State Climatologist . (2024). Rutgers NJ weather network. Rutgers University. Retrieved from https://www.njweather.org/data [Google Scholar]
  63. Oke, T. (1973). City size and the urban heat island. Atmospheric Environment, 7(8), 769–779. 10.1016/0004-6981(73)90140-6 [DOI] [Google Scholar]
  64. Oke, T. R. (2004). Initial guidance to obtain representative meteorological observations at urban sites.
  65. Peel, M. C. , Finlayson, B. L. , & McMahon, T. A. (2007). Updated world map of the Köppen‐Geiger climate classification. Hydrology and Earth System Sciences, 11(5), 1633–1644. 10.5194/hess-11-1633-2007 [DOI] [Google Scholar]
  66. Piracha, A. , & Chaudhary, M. T. (2022). Urban air pollution, urban heat island and human health: A review of the literature [number: 15 Publisher: Multidisciplinary Digital Publishing Institute]. Sustainability, 14(15), 9234. 10.3390/su14159234 [DOI] [Google Scholar]
  67. Potapov, P. , Li, X. , Hernandez‐Serna, A. , Tyukavina, A. , Hansen, M. C. , Kommareddy, A. , et al. (2021). Mapping global forest canopy height through integration of GEDI and Landsat data. Remote Sensing of Environment, 253, 112165. 10.1016/j.rse.2020.112165 [DOI] [Google Scholar]
  68. Royé, D. , Sera, F. , Tobías, A. , Lowe, R. , Gasparrini, A. , Pascal, M. , et al. (2021). Effects of hot nights on mortality in southern Europe. Epidemiology, 32(4), 487–498. 10.1097/EDE.0000000000001359 [DOI] [PubMed] [Google Scholar]
  69. Rue, H. , Martino, S. , & Chopin, N. (2009). Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society Series B: Statistical Methodology, 71(2), 319–392. 10.1111/j.1467-9868.2008.00700.x [DOI] [Google Scholar]
  70. Son, J.‐Y. , Choi, H. M. , Miranda, M. L. , & Bell, M. L. (2022). Exposure to heat during pregnancy and preterm birth in North Carolina: Main effect and disparities by residential greenness, urbanicity, and socioeconomic status. Environmental Research, 204, 112315. 10.1016/j.envres.2021.112315 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Spangler, K. R. , Weinberger, K. R. , & Wellenius, G. A. (2019). Suitability of gridded climate datasets for use in environmental epidemiology. Journal of Exposure Science and Environmental Epidemiology, 29(6), 777–789. 10.1038/s41370-018-0105-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Spector, J. T. , Sampson, L. , Flunker, J. C. , Adams, D. , & Bonauto, D. K. (2023). Occupational heat‐related illness in Washington state: A descriptive study of day of illness and prior day ambient temperatures among cases and clusters, 2006–2021. American Journal of Industrial Medicine, 66(8), 623–636. 10.1002/ajim.23506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Stafoggia, M. , Michelozzi, P. , Schneider, A. , Armstrong, B. , Scortichini, M. , Rai, M. , et al. (2023). Joint effect of heat and air pollution on mortality in 620 cities of 36 countries. Environment International, 181, 108258. 10.1016/j.envint.2023.108258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Stewart, I. D. , & Oke, T. R. (2012). Local climate zones for urban temperature studies. Bulletin of the American Meteorological Society, 93(12), 1879–1900. 10.1175/BAMS-D-11-00019.1 [DOI] [Google Scholar]
  75. Tasian, G. E. , Pulido, J. E. , Gasparrini, A. , Saigal, C. S. , Horton, B. P. , Landis, J. R. , et al. (2014). Daily mean temperature and clinical kidney stone presentation in five U.S. Metropolitan areas: A time‐series analysis. Environmental Health Perspectives, 122(10), 1081–1087. 10.1289/ehp.1307703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Thomas, N. , Ebelt, S. T. , Newman, A. J. , Scovronick, N. , D’Souza, R. R. , Moss, S. E. , et al. (2021). Time‐series analysis of daily ambient temperature and emergency department visits in five US cities with a comparison of exposure metrics derived from 1‐km meteorology products. Environmental Health, 20(1), 55. 10.1186/s12940-021-00735-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Thornton, P. E. , Shrestha, R. , Thornton, M. , Kao, S.‐C. , Wei, Y. , & Wilson, B. E. (2021). Gridded daily weather data for North America with comprehensive uncertainty quantification. Scientific Data, 8(1), 190. 10.1038/s41597-021-00973-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. UN . (2018). The world’s cities in 2018. United Nations. [Google Scholar]
  79. U.S. Census Bureau . (2024). U.S. Census Bureau QuickFacts: Raleigh city, North Carolina. Retrieved from https://www.census.gov/quickfacts/fact/table/raleighcitynorthcarolina/PST045224
  80. Vanos, J. , Guzman‐Echavarria, G. , Baldwin, J. W. , Bongers, C. , Ebi, K. L. , & Jay, O. (2023). A physiological approach for assessing human survivability and liveability to heat in a changing climate. Nature Communications, 14(1), 7653. 10.1038/s41467-023-43121-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Vecellio, D. J. , Cottle, R. M. , Wolf, S. T. , & Kenney, W. L. (2023). Critical environmental Limits for human Thermoregulation in the Context of a changing climate. Exercise, Sport, and Movement, 1(2), e00008. 10.1249/ESM.0000000000000008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Vicedo‐Cabrera, A. M. , Scovronick, N. , Sera, F. , Royé, D. , Schneider, R. , Tobias, A. , et al. (2021). The burden of heat‐related mortality attributable to recent human‐induced climate change. Nature Climate Change, 11(6), 492–500. 10.1038/s41558-021-01058-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wang, P. , Xiang, H. , Guo, M. , Shi, Y. , Chong, K. C. , Huang, J. , & Ho, H. C. (2023). Indoor and roadside exposures to traffic noise and cardiovascular mortality and the role of urban environmental stressors across a high‐rise, high‐density environment: A case study in Hong Kong. Building and Environment, 229, 109945. 10.1016/j.buildenv.2022.109945 [DOI] [Google Scholar]
  84. Warszawski, L. , Frieler, K. , Huber, V. , Piontek, F. , Serdeczny, O. , Zhang, X. , et al. (2017). Center for international earth science information network—Ciesin—Columbia University (2016). Gridded population of the world, version 4 (GPWV4): Population density (p. 228). NASA Socioeconomic Data and Applications Center (Sedac), Atlas of Environmental Risks Facing China Under Climate Change. 10.7927/h4np22dq [DOI] [Google Scholar]
  85. Weinberger, K. R. , Harris, D. , Spangler, K. R. , Zanobetti, A. , & Wellenius, G. A. (2020). Estimating the number of excess deaths attributable to heat in 297 United States counties. Environmental Epidemiology, 4(3), e096. 10.1097/EE9.0000000000000096 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Wikle, C. K. , Berliner, L. M. , & Cressie, N. (1998). Hierarchical Bayesian space‐time models. Environmental and Ecological Statistics, 5(2), 117–154. 10.1023/a:1009662704779 [DOI] [Google Scholar]
  87. Wilson, B. (2020). Urban heat Management and the Legacy of redlining. Journal of the American Planning Association, 86(4), 443–457. 10.1080/01944363.2020.1759127 [DOI] [Google Scholar]
  88. WMO . (2018). Guide to instruments and methods of observation (WMO‐No. 8).
  89. Zafeiratou, S. , Samoli, E. , Analitis, A. , Gasparrini, A. , Stafoggia, M. , De’ Donato, F. K. , et al. (2023). Assessing heat effects on respiratory mortality and location characteristics as modifiers of heat effects at a small area scale in Central‐Northern Europe. Environmental Epidemiology, 7(5), e269. 10.1097/EE9.0000000000000269 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Zafeiratou, S. , Samoli, E. , Dimakopoulou, K. , Rodopoulou, S. , Analitis, A. , Gasparrini, A. , et al. (2021). A systematic review on the association between total and cardiopulmonary mortality/morbidity or cardiovascular risk factors with long‐term exposure to increased or decreased ambient temperature. Science of the Total Environment, 772, 145383. 10.1016/j.scitotenv.2021.145383 [DOI] [PubMed] [Google Scholar]
  91. Zhang, Z. , Heerspink, H. J. L. , Chertow, G. M. , Correa‐Rotter, R. , Gasparrini, A. , Jongs, N. , et al. (2024). Ambient heat exposure and kidney function in patients with chronic kidney disease: A post‐hoc analysis of the DAPA‐CKD trial. The Lancet Planetary Health, 8(4), e225–e233. 10.1016/S2542-5196(24)00026-3 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Marques, E. , & Messier, K. (2025a). Urban air temperature at high spatiotemporal resolution on major US cities (Version 1) [Dataset]. Chemical Effects in Biological Systems. 10.22427/NTP-DATA-500-021-001-000-9 [DOI]
  2. Marques, E. , & Messier, K. (2025b). Urban air temperature at high spatiotemporal resolution on major US cities (Version V1) [Dataset]. Harvard Dataverse. 10.7910/DVN/HNVCBR [DOI]
  3. Marques, E. , & Messier, K. (2025c). Brassens (version 1.0) [Software]. Zenodo. 10.5281/zenodo.15595882 [DOI]
  4. Marques, E. , & Messier, K. (2025d). Mercury (version 1.0.0) [Software]. Zenodo. 10.5281/zenodo.15596059 [DOI]
  5. Marques, E. , & Messier, K. (2025e). Samba (version 1.0) [Software]. Zenodo. 10.5281/zenodo.15595977 [DOI]

Supplementary Materials

Supporting Information S1

Data Availability Statement

T2MBHM rasters on Philadelphia—NYC (July 2024 and January 2021), Phoenix (July 2023) and the Triangle (July 2023) are archived on the Chemical Effects in Biological Systems database (Marques & Messier, 2025a). Additional data generated with our approach is available on the top largest US Urban Census areas, covering different climatic events (heatwave, blizzard, typical weather) and seasons. The T2MBHM rasters are stored in an open‐access versioned repository in the CAFE Climate and Health Research Coordinating Center Collection of the Harvard Dataverse (Marques & Messier, 2025b).

Software: R codes used to prepare Weather Underground data in the preliminary analysis are available in https://github.com/NIEHS/brassens/ (Marques & Messier, 2025c). R codes for the Bayesian Hierarchical Model and 2‐m air temperature rasters generated on the four urban areas can be found in https://github.com/NIEHS/samba/ (Marques & Messier, 2025e). R codes for the gridded products comparison are shared in https://github.com/NIEHS/mercury/ (Marques & Messier, 2025d).


Articles from GeoHealth are provided here courtesy of Wiley

RESOURCES