Abstract
Worldwide, the COVID-19 pandemic has been influenced by a combination of environmental and sociodemographic drivers. To date, population studies have overwhelmingly focused on the impact of societal factors. In New Mexico, the rate of COVID-19 infection and mortality varied significantly among the state’s geographically dispersed, and racially and ethnically diverse populations who are exposed to unique environmental contaminants related to resource extraction industries (e.g. fracking, mining, oil and gas exploration). By looking at local patterns of COVID-19 disease severity, we sought to uncover the spatially varying factors underlying the pandemic. We further explored the compounding role of potential long-term exposures to various environmental contaminants on COVID-19 mortality prior to widespread applications of vaccinations. To illustrate the spatial heterogeneity of these complex associations, we leveraged multiple modeling approaches to account for spatial non-stationarity in model terms. Multiscale geographically weighted regression (MGWR) results indicate that increased potential exposure to fugitive mine waste is significantly associated with COVID-19 mortality in areas of the state where socioeconomically disadvantaged populations were among the hardest hit in the early months of the pandemic. This relationship is paradoxically reversed in global models, which fail to account for spatial relationships between variables. This work contributes both to environmental health sciences and the growing body of literature exploring the implications of spatial nonstationarity in health research.
Keywords: SARS-CoV-2, COVID-19, environmental exposure, rural areas, multiscale geographically weighted regression, Simpson’s paradox
Introduction
The impact of the COVID-19 pandemic worldwide has been influenced by a combination of environmental and sociodemographic drivers (Wang et al. 2022). In New Mexico, the rate COVID-19 infection and mortality varied significantly among the state’s geographically dispersed, and racially and ethnically diverse populations who are exposed to unique environmental contaminants related to resource extraction. While many studies focus on the relationship between social vulnerabilities and COVID-19 mortality, somewhat fewer explore the role of environmental pollution (e.g., Yu et al. 2018; Meng et al. 2021, 2022). However, a knowledge gap remains in understanding how environmental and social drivers together coproduce increased COVID-19 mortality and none have been systematically conducted in rural western states. One such state, New Mexico, is home to a potentially high exposure burden, as well as geographically and racially/ethnically diverse populations.
In previous environmental health research, members of our team conducted immune response modeling using estimated annual consumption of toxicants via drinking water sources among study participants – uranium, arsenic, nickel, mercury, and total radium (Ra-226 and −228). These metal contaminants were selected due to their prevalence in New Mexico water supplies (principally arsenic, uranium, and radium) (Hoover et al. 2017) and on a priori knowledge of their potential immunotoxic effects (principally arsenic, mercury, and nickel) (Aranyi et al. 1985; Haley et al. 1987; Ahmed et al. 2014). For example, arsenic has also been shown to be immunosuppressive for the development of autoantibodies. In addition, women may convert arsenic to particularly immunosuppressive metabolites, and long-term exposure of Navajo men to arsenic during aging may suppress these autoantibodies. These findings are consistent with the association between urinary arsenic and decreased protective antibody response to the mumps vaccine (Raqib et al. 2017) and against tuberculosis among Bangledesh children (Ahmed et al. 2014), and that sex-specific differences may complicate this effect (Lindberg et al. 2008). In direct tests arsenic trioxide inhibited interferon-γ synthesis in vitro by lymphocytes from patients with systemic lupus erythematosus (Hu et al. 2018) and prolonged survival and decreased anti-nDNA production in lupus-prone mice (Xia, Lin, and Zhou 2007), possibly by inhibiting inflammasome activation (Maier et al. 2014). Despite significant knowledge gaps in the immunotoxicity of arsenic and other metals/metalloids, these findings strongly suggest that such environmental exposures should be further investigated in the context of COVID-19 disease severity and death in New Mexico.
Impacts on the immune system may be especially pronounced and diverse among New Mexico populations based on proximity to mining- and fracking-related environmental exposures. For example, highly remote, rural populations such as those living on the Navajo Nation have documented urinary uranium concentrations at the 99th percentile of the National Health and Nutrition Examination Survey (NHANES) among non-occupationally exposed community members across the lifespan (Harmon et al. 2017; Hoover et al. 2020; Scammell et al. 2020; Erdei et al. 2022), indicating significant exposure risk to uranium and other metals through multiple environmental pathways. Resource extraction processes disrupt the natural distribution of soils, increasing the dispersion of particle-bound metals into environmental media (e.g., air, water, and soils) and the exposure risk to communities. Exposure to environmental metals has demonstrated impacts on immune system, which may render exposed individuals and communities at greater risk for COVID-19 mortality (Erdei et al. 2020; Wu et al. 2020; Faruk et al. 2023).
Sociodemographic factors are implicated in higher SARS-CoV-2 transmission rates, particularly among minority communities and people of color (Erdei et al 2020). The relationship between social vulnerability and COVID-19-related hardships in the US is well-documented (Dasgupta et al. 2020; Gaynor and Wilson 2020; Karaye and Horney 2020; Khazanchi et al. 2020; Nayak et al. 2020; Tieskens et al. 2021). In New Mexico, social vulnerabilities including household crowding, diminished access to running water or telecommunications infrastructure, and structural mechanisms of institutionalized racism have all been implicated in disproportionate adverse impacts from COVID-19 on communities of color, particularly among Indigenous populations in the western half of the state (Yellow Horse, Deschine Parkhurst, and Huyser 2020; Gorris et al. 2021; Rifat and Liu 2022). Rural New Mexican populations are also disproportionately underserved by healthcare and other crucial services (Soto Mas et al. 2019). We argue that the simultaneous comparison of environmental exposure and social determinants of health (SDOH) may shed light on their relative contribution to COVID-19 mortality.
The extent to which social and environmental drivers of COVID-19 morbidity and mortality are locally compounded has not been established and commonly applied ecologic models may be insufficient for capturing how multiple factors heterogeneously interact in localized contexts to exert influence on COVID-19 mortality. Because environmental and social contexts vary across space, the drivers of COVID-19 infection and death may be locally compounded. The heterogeneous distribution of health determinants and outcomes are thus spatially nonstationary. Spatial nonstationarity is the variability of causal relationships and statistical associations across space (Kwan 2021). Model coefficients are subject to spatial nonstationarity, meaning that global regression models may not adequately capture the complexity and nuance of spatial associations because they frame health-environment relationships as unchanging across space (Brunsdon, Fotheringham, and Charlton 1996).
Spatial regression models like geographically weighted regression (GWR) have been widely adopted in environmental health and justice research to account for spatial nonstationarity. For example, using GWR, Gilbert and Chakrborty (2011) demonstrated significant spatial patterns of socioeconomic disparities and cumulative cancer risks in Florida, and Grineski et al. (2015) revealed spatial associations between PM2.5 and social characteristics with children’s respiratory health in Texas. Other research has demonstrated the spatial heterogeneity of determinants of obesity (Oshan, Smith, and Fotheringham 2020) and associations between green spaces and COVID-19 transmission (Huang et al. 2020).
We hypothesize that spatial heterogeneity of predictor variables indicates spatial associations between potential exposure to environmental contaminants and COVID-19 mortality when simultaneously considering localized sociodemographic population characteristics. To that end, we expect to see the relationship between documented social and environmental exposures and COVID-19 mortality to manifest in different directions and magnitudes across the state, with significant positive associations in underserved regions. Those drivers include potential social vulnerabilities derived from the ATSDR-CDC Social Vulnerability Index (SVI) and the American Community Survey (ACS), as well as potential environmental exposure to environmental toxicants stemming from sites of resource extraction.
We specifically aim to address how coefficients reflecting the relationship between potential predictors and COVID-19-related mortality may be subject to Simpson’s paradox when spatial stationarity of model terms is assumed. In spatial models, Simpson’s paradox may occur when models calibrated on the same aggregated data contradict each other when population samples are subset into logical subgroups at the local level (Sachdeva and Fotheringham 2023). Our methods apply combinations of geographically weighted regressions (GWR) and multiscale GWR (MGWR) modeling techniques to account for local heterogeneity in explanatory environmental and social variables and COVID-19 mortality, examining linear relationships within spatial neighborhoods. The application of our methodology provides a simultaneous comparison of environmental exposure and SDOH on their relative contribution to COVID-19 mortality. Our analysis supports the ongoing planning of state and local public health response by promoting New Mexico and place-specific understanding of environmental risks and COVID-19 disease associations. These are critical considerations in a rural and underserved state in the Southwestern U.S. By asking where and to what degree do complex and complementary social and environmental processes related to COVID-19 mortality exist across the state, the purpose of the study is to demonstrate the analytical nuance provided by local regression modeling and increase understanding of the health challenges faced by diverse communities in New Mexico.
Additionally, we aim to address the effect of potential underreporting of COVID-19 cases on regression results. The State of New Mexico Department of Health (NMDOH) collected and compiled all available data on SARS-CoV-2 infection diagnoses since the beginning of the pandemic. Medical facts-based public health measures were implemented swiftly in New Mexico in the first week of March 2020 and significantly curbed the virus spreading primarily at hospital and primary health care settings. However, this process became increasingly challenging due to actions taken by US Centers for Disease Control and Prevention (CDC) and other governmental agencies via the distribution of household-based, free of charge, individual testing kits. The actual reporting of cases was potentially hindered by these diverse testing opportunities and COVID-19 reporting requirements (Castro et al. 2021). People who tested positive but who were unable or unwilling to seek medical care might be underrepresented in state datasets (Katzman et al. 2021; Sood et al. 2021). Furthermore, at the beginning of the pandemic as health care provider capacities were shut down limited, reporting may not reflect the actual numbers of cases.
Materials and Methods
Our exploratory study design develops and evaluates models at contrasting spatial specificity to test our hypothesis. Given that our data was available in a range of spatial units (e.g., zip code tabulation area [ZCTA] and census tract), we started by spatially harmonizing environmental and sociodemographic data related to environmental exposure and COVID-19. Next, we developed and compared linear and logistic regression models at both global and local levels. Finally, we rely on a priori knowledge of the study area to evaluate how sociodemographic and environmental factors differentially play out and coalesce as lurking variables in global regressions in rural and underserved regions of New Mexico compared to urban centers.
Study Area
New Mexico is a minority-majority state located in the southwest US (Figure 1) with 50.2% Hispanic or Latino and 11.2% American Indian/Alaska Native populations (U.S. Census Bureau 2021). The population in 2020 was approximately 2.1 million (U.S. Census Bureau 2021). The state is highly rural – an estimated 12% of the total population resides in areas classified as either rural or small towns according to the USDA Rural-Urban Commuting Area (RUCA) Codes, which account for approximately 52% of total land area. Significant natural resource production, refinement, and waste disposal activities occur in predominantly rural areas across the state, potentially rendering local communities disproportionately exposed to environmental contaminants. Figure 1 shows the locations of uranium and other hard rock mining sites, and unconventional oil and natural gas wells across the state. Many of the hard rock mine features include ore tailings disposal sites. Also included on the map is the Waste Isolation Pilot Project (WIPP) – one of three large underground storage facilities for radioactive waste in the world, which is located in the southeastern corner of the state.
Figure 1.

Study area map
Outcomes
A total of 3,863 deaths were recorded in the State of New Mexico COVID-19 disease registry and public health database between March 1, 2020, and March 26, 2021. The total number of COVID-19 deaths, tests, and vaccinations are provided by NMDOH aggregated at the ZCTA level. A total of 202 ZCTAs (55%) had at least one reported COVID-19-related death as of March 26, 2021. The estimate provided by NMDOH may be low due to underrepresenting the number of cases and deaths in ZCTAs on Tribal lands, at correctional facilities, and at military bases.
This study was carried out in collaboration with the Division of Epidemiology and Response of the NMDOH with approval from the UNM Health Sciences Center Human Research Protections Office.
Sociodemographic Covariates
To account for the extent of social determinants of COVID-19 deaths, we incorporated data from the CDC-ATSDR Social Vulnerability Index (SVI) reporting census tract-level sociodemographic characteristics, household characteristics, racial composition, and housing type. More granular estimates of racial composition at the ZCTA level were derived from the 2020 American Community Survey (ACS). Rurality or urbanity of ZCTA units were derived from both the USDA RUCA and the Frontier and Remote Area (FAR) codes.
Data Harmonization
Because COVID-19-related deaths were aggregated at ZCTA level, we applied dasymetric mapping, which creates population-weighted coefficients to reapportion data aggregated at different geospatial configurations and scales (Eicher and Brewer 2001), such as SVI data, which are aggregated at the census tract level. Granular population distributions were obtained from the Oak Ridge National Laboratory LandScan estimates of 24-hour ambient population for every square kilometer (Oak Ridge National Laboratory 2020). RUCA and FAR data are both available at the ZCTA level. Summary statistics of input data aggregated by ZCTA are provided in supplementary Table S1.
Environmental Pollution Estimates
To estimate potential environmental pollution from metals, we applied a geospatial approach for predicting the release of metal contaminants from abandoned and active mining sites and fracking sites as described in Lin et al. (2020). The models here and those described in Lin et al. (2020) specifically characterize the theoretical distribution of fugitive mine waste through various environmental processes. We constructed additional GIS layers proximity to mines by type and oil/natural gas production and aerial radiometric scans.
The locations of mine sites and features were derived from the Mineral Resources Data System (MRDS), a compendium of records curated by the USGS containing the point locations and primary commodities of mines across the coterminous U.S. (Mason and Arndt 1996). All point locations were used in the present analysis because the dataset does not provide sufficient information to discriminate mine activity or reclamation status.
The point locations within New Mexico of all reported unconventional oil and natural gas production wells via hydraulic fracturing, or fracking, were obtained from the FracFocus Chemical Disclosure Registry (2023). Fracking sites are primarily grouped in the southeast and northwest portions of the state (Figure 1). We modeled the density of fracking wells per ZCTA as the number of wells divided by the area in square kilometers of every ZCTA plus a 50-km buffer.
The potential aeolian (windblown) distribution of mine and fracking waste in dust was modeled two ways. The wind index (WI) was derived to estimate downwind exposure from mine sites at every location based on prevailing wind direction using the following equation (Ryan et al. 2008; Lin et al. 2020):
| (1) |
where is the straight-line distance from contaminant sources – in this case, active and abandoned mine and fracking sites – is the 30-year hourly average prevailing wind direction at any geographical location – in this case, every 30m grid cell in a digital elevation model (DEM) (U.S. Geological Survey 2022) – derived from the North American Regional Reanalysis (NARR) dataset (NOAA Physical Sciences Laboratory 2023). The output is a scaled value ranging from 0 to 1, where a WI of 1 indicates locations directly downwind from sources, 0.5 are locations directly perpendicular to sources, and 0 are locations directly upwind from sources.
Topographic wind exposure (TWE) expands on the wind index by considering terrain variability. For example, the leeward slope of a hill or mountain is less exposed to contaminant-bearing winds than the windward side. TWE calculates the angle of incidence between wind and surficial features and is also dependent on prevailing wind averages according to the following equation (Antonić and Legović 1999; Böhner and Antonić 2009; Lin et al. 2020):
| (2) |
We adapted the methodology described in Lin et al. (2020) such that is the angle of exposure, is the slope and is the aspect of a land surface, and and are the horizontal angle and 30-year prevailing wind direction, respectively. TWE values range from −1 to 1 where values approaching −1 indicate low TWE and those approaching 1 are highest.
Next, following work detailed in Lin et al. (2020) we classified landforms using the topographic position index (TPI) as valleys, steep slopes, lower slopes, moderate slopes, flat land, and ridges. Finally, we computed the median value per ZCTA of estimated uranium-238 from aerial gamma-ray surveys conducted as part of the National Uranium Resource Evaluation (NURE) program (Kucks 2005), as well as median concentrations of arsenic, cadmium, copper, mercury, nickel, lead, uranium, and uranium in NURE soil and sediment samples. All environmental variables were normalized from 0–1 for statistical modeling.
Statistical Modeling
The common approach for determining the suitability of spatial models is to evaluate the effect of spatial nonstationarity in global linear models before proceeding to local ones (Comber et al. 2020; Chen et al. 2024). We first modeled the number of deaths due to COVID-19 using zero-inflated models to account for the large number of NM ZCTAs (145 of 396 as of March 31, 2021) reporting zero deaths.
When such a model is applied with the number of deaths as outcome, we expect the population size and the interaction of population size with other variables to be strong predictors. Therefore, the outcome of interest was the relative number of COVID-19 related deaths in a ZCTA rather than the absolute number of such deaths using:
| (3) |
For the 226 observations where was defined, we fit models to estimate 1) the association between exposures and when ; and 2) the association between exposures and the probability that . The first question we sought to answer was thus how the rate of COVID-19-related mortality differed from the absolute number of reported deaths. To answer this question, we model , using only observations for which ). However, we cannot rule out the possibility of instances where the reporting of COVID-19-related deaths per ZCTA is 0 may be inaccurate and that the number of COVID-19-related deaths in some ZCTAs is in fact at least 1. To address this uncertainty, we then defined a binary variable, Y, where:
| (4) |
and modeled using all observations for which the predictor variables have measured values.
Predictor Variables
The variables we considered as potential predictors in all models included:
The proportion of the population living in poverty obtained from the SVI,
The proportion of the population over 65 years in age obtained from the SVI,
The logarithm of ZCTA land area,
- A variable denoting a combination of rurality (RUCA) and remoteness (FAR), , adjusted for difference in location and scale in, where
(5) The logarithm of uranium in parts per million,
Proximity to mining features,
Wind index,
Topographic wind exposure
The logarithm of the landforms variable, ,
The proportion of land that is held in trust by the federal government for Native American reservations, , and
Proportion of populations grouped by race as defined by the 2020 American Community Survey as: non-Hispanic white), b) Native American or Pacific Islander () c) Hispanic (), or d) other ().
Estimated total population derived from LandScan () was also considered as a potential predictor of the binary outcome defined in Equation 1.
The percentage of ZCTA land area that is Tribal reservation land and the proportion of population that is Native American were zero for several observations, and were transformed respectively with:
| (6) |
| (7) |
Due to the large number of covariates and predictors that we considered, and to avoid having predictor variables that were highly correlated, Principal Component Analysis (PCA) was used on both the sociodemographic covariates and soil metal concentrations. PCA is a technique for reducing the dimensionality of a dataset in order to ease interpretability while preserving as much statistical information as possible (Jolliffe and Cadima 2016). In PCA, an orthogonal linear transformation is applied to reframe the original potentially correlated data into a smaller number of uncorrelated variables called principal components (PCs). The first four PCs of sociodemographic covariates contained more than 90% of the total variation (Table 1). Two separate PCA were conducted for soil metal concentrations – one containing arsenic and another without due to significant missingness in arsenic measurements. PC loadings for soil concentrations are reported in supplemental table S2.
Table 1.
Loadings of the first four principal components from the PCA of sociodemographic covariates
| PC 1 | PC 2 | PC 3 | PC 4 | |
|---|---|---|---|---|
|
| ||||
| The proportion of the population living in poverty, | 0.4075 | 0.21 | 0.4148 | 0.771 |
| The proportion of the population over age 65, | −0.2634 | −0.2664 | 0.8846 | −0.2355 |
| Log Proportion Tribal lands, | 0.5172 | −0.1834 | 0.06024 | −0.3472 |
| Log Percent Native American, | 0.5126 | −0.2566 | 0.05473 | −0.2553 |
| Percent non-Hispanic White , | −0.4436 | −0.4833 | −0.08783 | 0.2774 |
| Percent Hispanic, | −0.1935 | 0.7429 | 0.1766 | −0.2958 |
These PCs will be labeled . , where .
Global Modeling
is continuous and was modeled with ordinary linear regression. The of Equation 4 is binary and was modeled using logistic regression.
The number of observations that can be used is further diminished if we attempt to use data on metals in soil/sediment. As stated above, if we attempt to include the effects of five metals, we would include as potential covariates the four principal components in Supplemental Table S2a; if we try to include the effects of all metals but arsenic we would include as potential covariates the three principal components in Supplemental Table S2b.
Model 1 tests the binary outcome, Y, model 2 tests the continuous outcome, . Each test took the following approach: first a full model was fitted, using all variables in the set of potential predictors. Then we applied backward stepwise model selection using the Akaike information criterion (AIC) to remove predictor variables that were not significantly associated with the outcome using a 95% confidence level (p>0.05). When starting with full models with metal-based variables (Table 3), the reduced models either included no metal-based PCs, or in one case (binary outcome, using PCs based on four metals, 191 observations), one metal PC appeared in the reduced model, but with high uncertainty (p>0.1). Therefore, we concluded that the models based on smaller numbers of observations with available soil and sediment metal concentration variables do not show results that are both different from the models with no metal data and also significant. The final reduced models are shown in Tables 3 and 4.
Table 3.
Models for binary outcome, . This model uses 333 observations, with no data for five metals
| Estimate | Std. Error | z-Value | FDR Test (j/m)* : | ||||
|---|---|---|---|---|---|---|---|
| Reject | |||||||
| (Intercept) | −20.520 | 10.440 | −1.967 | 0.049 | 0.030 | 0.006 | 0 |
| logpop | 3.640 | 1.808 | 2.014 | 0.440 | 0.027 | 0.005 | 1 |
| FMM | −0.197 | 0.064 | −3.091 | 0.002 | 0.013 | 0.003 | 0 |
| logarea | 0.655 | 0.166 | 3.935 | 0.000 | 0.003 | 0.001 | 1 |
| MineProx | 12.570 | 11.190 | 1.123 | 0.262 | 0.043 | 0.009 | 0 |
| WindIndex | −1.895 | 0.932 | −2.033 | 0.042 | 0.023 | 0.005 | 0 |
| logLandForms | −0.625 | 0.380 | −1.646 | 0.100 | 0.033 | 0.007 | 0 |
| logeU.PPM | −1.068 | 0.653 | −1.636 | 0.102 | 0.037 | 0.007 | 0 |
| densityFracking | −2.735 | 2.658 | −1.029 | 0.304 | 0.050 | 0.010 | 0 |
| PropRacCatComp.1 | 0.472 | 0.129 | 3.651 | 0.000 | 0.007 | 0.001 | 1 |
| PropRacCatComp.2 | 2.586 | 1.096 | −2.359 | 0.018 | 0.017 | 0.002 | 0 |
| PropRacCatComp.3 | 0.769 | 0.243 | 3.170 | 0.002 | 0.010 | 0.003 | 1 |
| logpop:MineProx | −2.628 | 1.967 | −1.336 | 0.182 | 0.040 | 0.008 | 0 |
| logpop:densityFracking | 0.558 | 0.500 | 1.115 | 0.265 | 0.047 | 0.009 | 0 |
|
| |||||||
| AIC | 225.8 | ||||||
| p-value of statistic | 0 | ||||||
| Tjur’s COD | 0.6141 | ||||||
Table 4.
Models for logit r. This model uses 202 observations, with no data for five metals.
| a) Model after variable selection | b) Full model | |||||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Est. | SE | t-Val | Est. | SE | t-Val | |||
| (Intercept) | −3.773 | 0.184 | −20.480 | 0.000 | −4.385 | 0.788 | −5.562 | 0.000 |
| FMM | 0.054 | 0.018 | 3.018 | 0.003 | 0.045 | 0.021 | 2.082 | 0.039 |
| WindIndex | −1.152 | 0.243 | −4.750 | 0.000 | −1.473 | 0.356 | −4.139 | 0.000 |
| TWE | −0.314 | 0.222 | −1.414 | 0.159 | −0.307 | 0.245 | −1.253 | 0.212 |
| PropRacCatComp.4 | 0.400 | 0.097 | 4.120 | 0.000 | 0.383 | 0.106 | 3.607 | 0.000 |
| logarea | − | − | − | − | 0.025 | 0.054 | 0.453 | 0.651 |
| MineProx | − | − | − | − | 0.204 | 0.412 | 0.496 | 0.620 |
| Landforms | − | − | − | − | −0.159 | 0.144 | −1.104 | 0.271 |
| logeU.PPM | − | − | − | − | −0.061 | 0.231 | −0.264 | 0.792 |
| PropRacCatComp.1 | − | − | − | − | −0.020 | 0.053 | −0.384 | 0.702 |
| PropRacCatComp.2 | − | − | − | − | 0.044 | 0.037 | 1.169 | 0.244 |
| PropRacCatComp.3 | − | − | − | − | −0.028 | 0.055 | −0.504 | 0.615 |
| Logit V | − | − | − | − | 0.133 | 0.114 | 1.171 | 0.243 |
|
| ||||||||
| AIC | 510.4 | 524.8 | ||||||
| Adjusted R2 | 0.2757 | 0.2581 | ||||||
| p-val. of F statistic | 6.48E-14 | 1.05E-09 | ||||||
Once a model was fit, we evaluated how well the fitted values approximate the known outcomes. For all models, AIC was calculated at each step, and AIC is shown for all models. The models for are linear models. For these, a standard measure of goodness of fit is the adjusted R2.
The outcome defined in Equation 2 is binary. For a binary outcome, R2, adjusted or not, is not an appropriate measure for goodness of fit. One measure that is suited to binary outcomes and is often used is Tjur’s coefficient of discrimination (COD). This uses the fact that observations can be divided into two groups: those for which the outcome is 0, and those for which is 1. When modeling a binary outcome, we are estimating the probability that the outcome is 1. That is, is an estimate of . When , should be close to 1; when , should be close to 0. Tjur’s COD is the difference of the means of fitted values for the two groups:
| (6) |
Local modeling
Global regression models fail to account for spatial nonstationarity (Brunsdon, Fotheringham, and Charlton 1996; Siordia, Saenz, and Tom 2012; Kwan 2021), whereby place-based health relationships vary across geographic space. In the current study, we argue that exposure to environmental contaminants is largely influenced by spatially varying wind patterns, and that the relationship between windborne metals and population-level exposure is dependent on proximity to mining activities – the distribution of which is heterogeneous.
We tested four commonly used modeling approaches to account for spatial relationships between the predictor variables and the outcome, : two spatial autoregressive (SAR) models – a spatial lag model and spatial error model – a geographically weighted regression (GWR), and a multiscale GWR (MGWR) of the reduced models described in Table 5. Spatial lag models violate the assumption that error terms are independent and introduces a lag coefficient () to the model and spatial error models violate the assumptions that both error and observations are independent by introducing a spatial error term () to account for spatial diffusion between variables. GWR is a form of linear regression that models relationships between all points within a local neighborhood for each observation in the full set (Fotheringham 2011). MGWR models differ from GWR by selecting a unique neighborhood size for each predictor variable – this is particularly useful when the spatial influence of predictor variables is nonstationary.
Table 5.
Local model results
| Model | R2 | AIC | Effective DF | Test for spatial dependence | Stat. | p |
|---|---|---|---|---|---|---|
| Spatial error | 0.25 | 184.171 | 197 | Likelihood ratio | 3.302 | 0.069 |
|
| ||||||
| Lag coefficient () | 0.173 | |||||
| Std. Error | 0.085 | |||||
| z-value | 2.04 | |||||
| Prob | 0.041* | |||||
|
| ||||||
| Spatial lag | 0.25 | 185.686 | 196 | Likelihood ratio | 3.787 | 0.052 |
|
| ||||||
| Lag coefficient () | 0.145 | |||||
| Std. Error | 0.075 | |||||
| z-value | 1.937 | |||||
| Prob | 0.052 | |||||
|
| ||||||
| GWR | 0.423 | 166.221 | 176.838 | Moran’s I | −0.002 | 0.92 |
|
| ||||||
| Anselin Local | See Figure 2 | |||||
| Moran’s I | ||||||
|
| ||||||
| Bandwidth [fixed] | 109 | |||||
|
| ||||||
| MGWR | 0.88 | 656.125 | 63.534 | Moran’s I | 0.041 | 0.407 |
|
| ||||||
| Anselin Local | See Figure 2 | |||||
| Moran’s I | ||||||
|
| ||||||
| Bandwidth [TWE] | 201 | |||||
| Bandwidth [WindIndex] | 30 | |||||
| Bandwidth [PropRacCatComp.4] | 9 | |||||
| Bandwidth [FMM] | 12 | |||||
We calculated the spatial autocorrelation of model residuals using the Moran’s I global test of spatial autocorrelation, which is defined as the correlation of a variable with itself across two-dimensional space (Barber 1988). If model residuals are spatially autocorrelated, then we conclude that the global model is poorly specified in that it fails to account for spatial dependence between variables. Spatial dependence may stem from measurement and approximation error, or from phenomena where the spatial or directional (anisotropic) nature of a characteristic are important defining features of that characteristic.
Results
Global models
Logistic model for binary outcome
Table 3 lists the variables used to predict any reported deaths due to COVID-19 after variable selection. All covariates representing soil metal concentrations were automatically removed in this step. The model described in Table 3 seems quite complex. However, many of the covariates were not statistically significant (p > 0.05). Therefore, their actual effect on the outcome is uncertain. According to the model, and have positive coefficients with low p-values. So, (COVID-19-deaths) increases with population and area of ZCTA. The coefficient of is negative, so, increasing exposure to environmental contamination via wind tends to decrease the probability that there will be COVID-19 deaths. The variable has a negative coefficient. So, zip codes that are more rural and remote tend to be more likely to have zero deaths reported as being due to COVID-19. Finally, principal components 1–3 of the PCA of Table 1 all appear in the model with small p-values. PCs 1 and 3 have positive coefficients, PC 2 has a negative coefficient. However, the interaction between PC 2 and has a positive coefficient. Additionally, we calculated the false discovery rate (FDR) to account for the expected proportion of false positives among the declared significant results. The FDR shows that the significant findings (, , , and ) are likely true effects rather than artifacts of multiple comparisons.
PC 1 is high when the proportion of Native Americans in the population and the proportion living in poverty are high, but the proportion of non-Hispanic Whites is low. PC 3 is dominated by the proportion over 65; the PC is high when this proportion is high, but to a lesser extent when proportion living in poverty is high. PC 2 is high when the proportion of the population that is Hispanic is high, and the proportions of non-Hispanic whites or Native Americans are low. So, the probability of having any deaths recorded as due to COVID-19 is higher for zip codes that have a large proportion of population that is Native American, a small proportion that is non-Hispanic white, and/or a large proportion of population living in poverty, a large proportion of population over 65, a small proportion that is non-Hispanic white, and/or a large proportion of population living in poverty, or a large proportion of population that is Hispanic, and small proportions of population that were non-Hispanic white or Native American. However, this is true only if the total population is fairly large. A zip code with a largely Hispanic population, but a small total population, is likely to show zero deaths due to COVID-19.
Linear model for logit r
Now consider the model for (Table 4). Similarly, all soil metal concentration covariates were removed following variable selection in this model, which only has four covariates: , , , and the fourth component of the sociodemographic PCA described in Table 1. This fourth PC is dominated by the proportion in poverty. It is positive for the non-Hispanic white racial proportion, negative for other racial proportions. The coefficient of this PC in the model is positive. The model says that ZCTAs that tend to be downwind of environmental contamination tend to have lower rates of COVID-19 deaths per number of tests, ZCTAs that are more remote tend to have higher rates of COVID-19 deaths per number of tests, ZCTAs with higher proportions of the population living in poverty tend to have higher rates of COVID-19 deaths, and that the effect in item 3 is stronger for ZCTAs with higher proportions of non-Hispanic white, and not as strong for ZCTAs with higher proportions of Hispanics or Native Americans.
The numbers of vaccinated individuals per zip code were estimated from the numbers of vaccinated per county as of 3/3/2021. From this we can find the vaccination rate:
| (7) |
and calculate . There was one ZCTA for which the estimated number of vaccinated was larger than the estimated population. This is likely because of a centralized vaccination clinic serving a large rural Tribal population. The single dose vaccination rate for the county in which the ZCTA is located is 95%. To deal with this case, is assumed to be 0.95 for this ZCTA.
Models for were fit in which was included as a potential predictor. When model selection was applied using backwards stepwise regression, was always removed. This included interactions in models that allowed interactions. So, there did not seem to be a significant association between the rate of deaths attributed to COVID-19 () and the vaccination rate , as of March 31, 2021.
Additionally, we considered the effect of confounders on the reduced linear model by including results from the full model in Table 4. The reduced model, after stepwise variable selection, appears to perform similarly to the full model in terms of explaining the variance in the outcome (as indicated by adjusted R2) but with fewer predictors and a lower AIC. This suggests that the reduced model provides a more parsimonious representation of the relationship between the predictors and the outcome. In terms of adjusting for confounders, both models seem to appropriately account for the significant predictors. The reduced model retains the significant variables identified in the full model, ensuring that key confounders are still adjusted for while eliminating unnecessary predictors.
Local models
Before conducting local regression models, we determine appropriateness by testing the spatial autocorrelation of global model residuals using the Moran’s test of spatial autocorrelation. If p-values from this test are significant (e.g., <0.05) we conclude that model residuals are spatially random, indicating no significant structure. On the other hand, if p-values are significant, we conclude that underlying spatial relationships are not adequately explained by the global modeling approach. Results indicate that the residuals of the binary model for are spatially random (I = −0.018, z-score = −0.608, p = 0.543), indicating the global model was correctly specified. Conversely, the residuals of the linear model for were significantly spatially autocorrelated (I = 0.079, z-score = 2.599, p = 0.009). Additionally, we measured the degree to which residuals from the GWR and MGWR models cluster using the Anselin Local Moran’s I test of spatial association (Anselin 1995). If geographical features demonstrate significantly similar values to other features within a spatial neighborhood, they are clusters of either high or low values. Conversely, if significantly dissimilar values are either high or low outliers. Outliers or clusters of residuals suggest some structure in spatial relationships that are not fully accounted for by local models. Figure 2 shows that some ZCTAs demonstrate clustering and outlier relationships in both the GWR and MGWR models (p < 0.05) in some regions of the study area. These clusters and outliers must be interpreted alongside the spatial distribution of other significant model terms. Therefore, all local models test the outcome, , using the same predictor variables described in Table 4: , , , and on 202 observations. A test for the variance inflation factor (VIF) in the linear model indicates that there is no significant multicollinearity between these model terms.
Figure 2.

Local clustering of model residuals in MGWR (left) and GWR (right).
We conducted four spatial regression models to account for spatial dependencies between observations in the linear model (Table 5). The first two spatial autoregressive (SAR) models add an additional term to account for spatial dependencies between terms and the outcome. First, a spatial error model was conducted in GeoDa (version 1.20.0.36). Results suggest that the global model explains only slightly more variability than the global linear model, but a likelihood ratio test for spatial dependence indicates that the spatial effects are accounted for. Second, a spatial lag model was also conducted using GeoDa. While spatial dependence in the model was likely the result of more than measurement error, the results did not indicate significantly improved model performance from the spatial lag model.
Third, because neither nor were significant at p < 0.05 in either SAR model (Table 5) and because model strength was reduced compared to the global model, a GWR was used to account for spatial dependence between variables using the GWModel package (version2.3–1) in R (version 4.2.2). We defined neighborhood size by optimizing the AIC by the number of neighbors from each ZCTA. Neighbors are defined using edge and corner continuity between ZCTA polygons and spatial weights are assigned using a bisquare kernel. The results indicate significant model improvement in terms of variation explained and a lower overall AIC. Moreover, the Moran’s test demonstrates that residuals are randomly distributed (, p = 0.920), suggesting adequate model specification in terms of underlying spatial relationships.
Fourth, while the GWR appears to be well-specified in terms of spatial autocorrelation of residuals, R2, and AIC (Table 5), the wind index coefficient demonstrates unexpected results in that it is negative for every observation. Suspecting that the spatial influence of each variable varies, we applied an MGWR model in R to account for differing degrees of influence within the model. Neighborhood sizes for each explanatory variable were determined by backfitting the changing values of the residual sum of squares over 600 iterations. Residuals in this model are similarly randomly distributed (I = 0.041, p = 0.407). The MGWR explains slightly more overall variance than the GWR, and despite a higher AIC the model effectively captures spatial heterogeneity of model coefficients. Table 6 and Figure 3 provide the distribution of model coefficients. Because the MGWR model calculates a linear relationship between the outcome and predictor variables at every observation, the significance of relationships is also spatially heterogeneous. We see that the coefficient for the wind index is negative in large portions of the study area, but positive and significant in regions heavily impacted by mining activity (Figure 4).
Table 6.
Summary statistics of MGWR model coefficients.
| Min. | 1st Qu. | Median | 3rd Qu. | Max. | |
|---|---|---|---|---|---|
| Intercept | −3.21 | −2.14 | −1.82 | −1.62 | −0.09 |
| TWE | 1.08 | 1.10 | 1.11 | 1.11 | 1.12 |
| WindIndex | −3.63 | −0.91 | −0.58 | 0.48 | 1.20 |
| PropRacCatComp.4 | 0.39 | 0.44 | 0.70 | 0.71 | 0.77 |
| FMM | −0.31 | −0.11 | −0.07 | 0.00 | 0.19 |
Figure 3.
Boxplots showing distribution of MGWR coefficients.
Figure 4.

Local wind index coefficients from MGWR model.
Discussion
The logistic model for the binary outcome, , does not provide information about the impact of COVID-19 but rather evaluates the completeness of reporting. This model identified four ZCTAs where the predicted number of COVID-19-related deaths is at least 1 but where the reported number of deaths is 0. Each of these ZCTAs is highly rural with a population of less than 600. We cannot rule out inaccurate reporting and conclude that these data have been suppressed. Therefore, the logistic model suggests that rural and remote portions of New Mexico are somewhat more likely to have reported 0 COVID-19-related deaths than more urbanized ones. However, we also see that the size of zip codes is positively associated with increased mortality. On the surface these findings may appear to be contradictory, but we do also see a positive relationship with sociodemographic PCs that are dominated by elderly Native American populations, who were among some of the hardest hit in the early stages of the pandemic (Hatcher et al. 2020; Rodriguez-Lonebear et al. 2020; Sequist 2020; Gorris et al. 2021).
Soil metal concentrations were automatically removed in both the logistic model and the linear model for , the relative number COVID-19-related deaths to tests per ZCTA, after stepwise model selection. However, both models do include potential sociodemographic and environmental drivers of disease mortality. Despite this, we see negative relationships between potential windblown exposure to fugitive waste from resource extraction sites in both models. Beyond this negative association, the linear model for similarly indicates positive associations between COVID-19-related deaths and rural or remote zip codes, as well as poverty – this finding is consistent with the literature (Hirko et al. 2020; Sequist 2020; Gorris et al. 2021). Conversely, stepwise model selection consistently removed the estimated ZCTA-level vaccination rate, , despite meta-analysis showing significant positive odds ratios of pooled vaccine effectiveness against incidence rate, hospitalization, and mortality after both first and second doses (Rahmani et al. 2022).
Overall, results from the MGWR model are different from the linear model for . To wit, while sociodemographic characteristics captured by are consistently positive, coefficients are generally negative. Environmental transport of dust captured by and both have positive coefficients in the MGWR but are negative in the global linear model. We are primarily interested in the distribution of coefficients, of which a large proportion are positive and negative. Furthermore, by calculating a t-statistic at each observation we can map which ones are statistically significant (Figure 4) and demonstrate that the model is significant for coefficients in both directions.
The spatial heterogeneity of coefficients is likely driven by the physical geography of New Mexico. The state is bisected by the Rio Grande (Figure 1), and the western portion has more variable mountainous terrain than the east. This mountainous terrain has been the setting for decades of hard rock mining that frequently occurred on lands largely inhabited by Native American populations (Hoover et al. 2020). The southeastern plains of New Mexico sit on the Permian Basin, a geologic formation with significant petroleum reserves (Frenzel et al. 1988). This topographic and industrial variability is highlighted by the variable, which explicitly considers terrain variability in chorus with proximity to and density of point sources of environmental exposures, be those hard rock mines or oil and natural gas wells.
We determined the suitability of the MGWR model by its ability to effectively capture spatial heterogeneity of environmental coefficients. MGWR is an ideal modeling approach when the scale of influence of predictor variables varies greatly, as is the case here. By not forcing a predetermined neighborhood on all variables a priori, as GWR does, MGWR consumes fewer degrees of freedom by allowing some modeled relationships to be global (Oshan, Smith, and Fotheringham 2020). This is the case with TWE in the present model, which has a neighborhood of 201 observations.
Conclusion
The present analysis demonstrates complex associations between socio-environmental predictors and large proportions of COVID-19-related deaths in New Mexico during the first year of the pandemic. These associations are spatially nonstationary, meaning that potential exposures vary across space and geographic context. Specifically, the wind index, which captures the redistribution of toxic metal-bearing fugitive dust from hard rock mines and fracking wells is both positively and negatively correlated with COVID-19-related mortality in different regions across New Mexico. Regression models that attend to spatial nonstationarity by examining local relationships within a larger study area have the potential to enhance health-focused research and are highly applicable in contexts with complex geographical properties.
We show that New Mexican populations living both in closer proximity to extractive industries and in complex mountainous terrain are more likely to have experienced increased incidences of COVID-19-related deaths in the first year of the pandemic. New Mexico is a highly rural state with large Tribal populations segregated into disproportionately underserved and mostly rural regions (Farrigan and Parker 2012; Yellow Horse, Deschine Parkhurst, and Huyser 2020). Disproportionate environmental and social burdens experienced by Indigenous communities across the state are deeply entrenched and rendered these populations particularly vulnerable to adverse effects of the pandemic (Yellow Horse, Deschine Parkhurst, and Huyser 2020).
Our models do not contradict numerous findings showing the relationship between social vulnerability and COVID-19-related hardships across the U.S. However, the local effect of environmental metals exposures was almost impossible to capture using traditional global modeling techniques. For example, using MGWR we demonstrate that the effect of potential windborne exposures to metal-bearing dusts is only significant and positive in pockets across the state – local variability in this relationship is so pronounced that the overall effect is negative in global models. Local spatial relationships between sociodemographic disparities and disproportionate environmental exposures are masked in global modeling approaches in what is a clear example of Simpson’s paradox (Sachdeva and Fotheringham 2023). Specifically, we see coefficients change direction primarily in the wind index variable when aggregated by ZCTA.
Limitations
There are several limitations of the current study. First, nearly all health data are protected by an array of laws and regulations at multiple scales of authority – including state, federal, and institutional oversight. One of the most common methods for protecting individual privacy is to group data into spatial regions – typically this occurs along census designations like tracts or – as in the current analysis – ZCTA. Because these spatial configurations vary in both scale and zoning patterns, particular and challenging methodological problems characterize their use. One of the most persistent is the modifiable areal units problem (MAUP), which refers to the sensitivity in model calibration introduced by aggregating data into areal units (Openshaw 1984; Fotheringham and Wong 1991). Failure to account for and understand the extent of impact from the MAUP on analyses can have profound implications on both environmental and health research and policy.
We also faced challenges surrounding data quality that cannot be fully resolved. For example, fracking locations are voluntarily disclosed by well operators. The Shale Gas Production Subcommittee of the Congressional Secretary of Energy Advisory Board (SEAB) recommends public disclosure of fracking activity but there is currently no law or regulation specifically requiring such disclosures. All disclosures in New Mexico were freely obtained from the FracFocus Chemical Disclosure Registry (FracFocus 2023) dating back to January 1, 2011. All wells, regardless of current production status, were included in the present analysis. Similarly, the point locations of all mine features derived from the MRDS is a compendium of mine records compiled over a multidecadal period. Each record has varying degrees of completeness, and it is difficult to ascertain mine size, production volume, or reclamation status.
Finally, measurements of metal concentrations in soil and sediment are provided by the NURE program, dating back to the 1960s. While the documentation of the laboratory methods is thorough and we are able to clean the data effectively, we remain uncertain about any biogeophysical and anthropogenic processes that have contributed to the resuspension or change in geographical extent of metals in the earth’s crust since samples were collected.
Supplementary Material
Table 2.
Numbers of observations used in different models, by outcomes and sets of covariates.
| Outcome | ||
|---|---|---|
| Type | continuous | binary |
| Model type | Linear | logistic |
|
|
||
| # Observations used | ||
|
|
||
| Basic covariates | 202 | 333 |
| + PCs for four metals | 104 | 191 |
| + PCs for five metals | 83 | 139 |
Highlights.
The association between COVID-19-related mortality and socio-environmental exposures is spatially heterogeneous
Underserved communities and racial minorities tend to experience a higher burden of environmental exposure, rendering them more vulnerable to COVID-19-related mortality.
Spatial relationships between COVID-19-related mortality and its environmental and sociodemographic drivers are masked by global modeling approaches.
Acknowledgements:
We thank Carmela Smith, MS, Dr. Smita Das, MD, PhD, MPH, and Joseph Bareta, MS at the New Mexico Department of Health, and Dr. Tracie Collins, MD, MPH, MHCDS at the University of New Mexico Health Sciences Center for support acquiring data and drafting a data transfer and use agreement between both entities. This work was supported in part by grants or awards from the National Institutes of Health National Institute of Environmental Health Science (1P50ES026102 and 1P30ES032755). The content is solely the responsibility of the authors and does not necessarily represent the official views of any funding agencies.
Footnotes
Declaration of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Declarations of interest: none.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Data Statement:
Data are from the NMDOH and are managed under a use agreement between NMDOH and UNM-HSC. Data will be made available to interested researchers pending approval for dissemination of sensitive data by both the University of New Mexico Sponsored Projects Office and the New Mexico Department of Health.
References
- Ahmed S, Moore SE, Kippler M, Gardner R, Hawlader MDH, Wagatsuma Y, Raqib R, and Vahter M. 2014. Arsenic Exposure and Cell-Mediated Immunity in Pre-School Children in Rural Bangladesh. Toxicological Sciences 141 (1):166–175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anselin L. 1995. Local indicators of spatial association—LISA. Geographical Analysis 27 (2):93–115. [Google Scholar]
- Antonić O, and Legović T. 1999. Estimating the direction of an unknown air pollution source using a digital elevation model and a sample of deposition. Ecological Modelling 124 (1):85–95. [Google Scholar]
- Aranyi C, Bradof JN, O’Shea WJ, Graham JA, and Miller FJ 1985. Effects of arsenic trioxide inhalation exposure on pulmonary antibacterial defenses in mice. Journal of Toxicology and Environmental Health 15 (1):163–172. [DOI] [PubMed] [Google Scholar]
- Barber GM 1988. Elementary statistics for geographers. New York, NY: Guilford Press. [Google Scholar]
- Böhner J, and Antonić O. 2009. Chapter 8 Land-Surface Parameters Specific to Topo-Climatology. In Developments in Soil Science, Geomorphometry., eds. Hengl T. and Reuter HI, 195–226. Elsevier; http://www.sciencedirect.com/science/article/pii/S0166248108000081 (last accessed 8 July 2019). [Google Scholar]
- Brunsdon C, Fotheringham AS, and Charlton ME 1996. Geographically Weighted Regression: A Method for Exploring Spatial Nonstationarity. Geographical Analysis 28 (4):281–298. [Google Scholar]
- Castro LA, Shelley CD, Osthus D, Michaud I, Mitchell J, Manore CA, and Valle SYD 2021. How New Mexico leveraged a COVID-19 case forecasting model to preemptively address the health care needs of the state: Quantitative analysis. JMIR Public Health and Surveillance 7 (6):e27888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen F, Leung Y, Wang Q, and Zhou Y. 2024. Spatial non-stationarity test of regression relationships in the multiscale geographically weighted regression model. Spatial Statistics :100846. [Google Scholar]
- Comber A, Brunsdon C, Charlton M, Dong G, Harris R, Lu B, Lü Y, Murakami D, Nakaya T, Wang Y, and Harris P. 2020. The GWR route map: a guide to the informed application of Geographically Weighted Regression. http://arxiv.org/abs/2004.06070 (last accessed 19 June 2024). [Google Scholar]
- Dasgupta S, Bowen VB, Leidner A, Fletcher K, Musial T, Rose C, Cha A, Kang G, Dirlikov E, Pevzner E, Rose D, Ritchey MD, Villanueva J, Philip C, Liburd L, and Oster AM 2020. Association Between Social Vulnerability and a County’s Risk for Becoming a COVID-19 Hotspot — United States, June 1–July 25, 2020. Morbidity and Mortality Weekly Report 69 (42):1535–1541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eicher CL, and Brewer CA 2001. Dasymetric Mapping and Areal Interpolation: Implementation and Evaluation. Cartography and Geographic Information Science 28 (2):125–138. [Google Scholar]
- Erdei E, Meltzer G, Avenbuan O, Awada C, Oyetade O, Blackman T, Kwon S, and Zelikoff J. 2020. Environmentally Marginalized Populations: the “perfect storm” for infectious disease pandemics, including COVID-19. The Journal of Health Disparities Research and Practice 13:68–77. [Google Scholar]
- Erdei E, Qeadan F, Miller CP, Kanda DA, Luo L, Gonzales M, Lewis JL, and MacKenzie D. 2022. Environmental uranium exposures and cytokine profiles among mother-newborn baby pairs from the Navajo Βirth Cohort Study. Toxicology and Applied Pharmacology 456:116292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farrigan TL, and Parker eds TS 2012. The Concentration of Poverty Is a Growing Rural Problem. Amber Waves: The Economics of Food, Farming, Natural Resources, and Rural America. [Google Scholar]
- Faruk MO, Rana Md. S., Jannat SN, Lisa F. Khanam, and Rahman Md. S. 2023. Impact of environmental factors on COVID-19 transmission: spatial variations in the world. International Journal of Environmental Health Research 33 (9):864–880. [DOI] [PubMed] [Google Scholar]
- Fotheringham AS 2011. Geographically weighted regression. In The SAGE Handbook of Spatial Analysis, eds. Fotheringham A. and Rogerson P, 243–253. 1 Oliver’s Yard, 55 City Road, London England EC1Y 1SP United Kingdom: SAGE Publications, Ltd. [Google Scholar]
- Fotheringham AS, and Wong DWS 1991. The modifiable areal unit problem in multivariate statistical analysis. Environment and Planning A: Economy and Space 23 (7):1025–1044. [Google Scholar]
- FracFocus. 2023. About Us. https://www.fracfocus.org/about-us (last accessed 16 November 2023).
- Frenzel HN, Bloomer RR, Cline RB, Cys JM, Galley JE, Gibson WR, Hills JM, King WE, Seager WR, Kottlowski FE, Thompson III S, Luff GC, Pearson BT, and Siclen DCV 1988. The Permian Basin region. In Sedimentary Cover—North American Craton: U.S, ed. Sloss LL, 0. Geological Society of America; 10.1130/DNAG-GNA-D2.261 (last accessed 14 August 2024). [DOI] [Google Scholar]
- Gaynor TS, and Wilson ME 2020. Social Vulnerability and Equity: The Disproportionate Impact of COVID-19. Public Administration Review 80 (5):832–838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilbert A, and Chakraborty J. 2011. Using geographically weighted regression for environmental justice analysis: Cumulative cancer risks from air toxics in Florida. Social Science Research 40 (1):273–286. [Google Scholar]
- Gorris ME, Shelley CD, Del Valle SY, and Manore CA 2021. A time-varying vulnerability index for COVID-19 in New Mexico, USA using generalized propensity scores. Health Policy OPEN 2:100052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grineski SE, Collins TW, and Olvera HA 2015. Local variability in the impacts of residential particulate matter and pest exposure on children’s wheezing severity: a geographically weighted regression analysis of environmental health justice. Population and Environment 37 (1):22–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haley PJ, Bice DE, Muggenburg BA, Hann FF, and Benjamin SA 1987. Immunopathologic effects of nickel subsulfide on the primate pulmonary immune system. Toxicology and Applied Pharmacology 88 (1):1–12. [DOI] [PubMed] [Google Scholar]
- Harmon ME, Lewis J, Miller C, Hoover J, Ali A-MS, Shuey C, Cajero M, Lucas S, Zychowski K, Pacheco B, Erdei E, Ramone S, Nez T, Gonzales M, and Campen MJ 2017. Residential proximity to abandoned uranium mines and serum inflammatory potential in chronically exposed Navajo communities. Journal of Exposure Science and Environmental Epidemiology 27 (4):365–371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hatcher SM, Agnew-Brune C, Anderson M, Zambrano LD, Rose CE, Jim MA, Baugher A, Liu GS, Patel SV, Evans ME, Pindyck T, Dubray CL, Rainey JJ, Chen J, Sadowski C, Winglee K, Penman-Aguilar A, Dixit A, Claw E, Parshall C, Provost E, Ayala A, Gonzalez G, Ritchey J, Davis J, Warren-Mears V, Joshi S, Weiser T, Echo-Hawk A, Dominguez A, Poel A, Duke C, Ransby I, Apostolou A, and McCollum J. 2020. COVID-19 Among American Indian and Alaska Native Persons — 23 States, January 31–July 3, 2020. Morbidity and Mortality Weekly Report 69 (34):1166–1169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hirko KA, Kerver JM, Ford S, Szafranski C, Beckett J, Kitchen C, and Wendling AL 2020. Telehealth in response to the COVID-19 pandemic: Implications for rural health disparities. Journal of the American Medical Informatics Association 27 (11):1816–1818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoover J, Gonzales M, Shuey C, Barney Y, and Lewis J. 2017. Elevated arsenic and uranium concentrations in unregulated water sources on the Navajo Nation, USA. Exposure and Health 9 (2):113–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoover JH, Erdei E, Begay D, Gonzales M, NBCS Study Team, J. M. Jarrett P.-Y. Cheng, and Lewis J. 2020. Exposure to uranium and co-occurring metals among pregnant Navajo women. Environmental Research 190:109943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu H, Chen E, Li Y, Xiaochun Zhu T. Zhang, and Xiaofang Zhu. 2018. Effects of Arsenic Trioxide on INF-gamma Gene Expression in MRL/lpr Mice and Human Lupus. Biological Trace Element Research 184 (2):391–397. [DOI] [PubMed] [Google Scholar]
- Huang J, Kwan M-P, Kan Z, Wong MS, Kwok CYT, and Yu X. 2020. Investigating the Relationship between the Built Environment and Relative Risk of COVID-19 in Hong Kong. ISPRS International Journal of Geo-Information 9 (11):624. [Google Scholar]
- Jolliffe IT, and Cadima J. 2016. Principal component analysis: A review and recent developments. Philosophical transactions Series A, Mathematical, physical, and engineering sciences 374 (2065):20150202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karaye IM, and Horney JA 2020. The Impact of Social Vulnerability on COVID-19 in the U.S.: An Analysis of Spatially Varying Relationships. American Journal of Preventive Medicine 59 (3):317–325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katzman JG, Tomedi LE, Thornton K, Menking P, Stanton M, Sosa N, Harkins M, Katzman N, Liu J, Archer GRD, and Arora S. 2021. Innovative COVID-19 programs to rapidly serve New Mexico: Project ECHO. Public Health Reports® 136 (1):39–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khazanchi R, Beiter ER, Gondi S, Beckman AL, Bilinski A, and Ganguli I. 2020. County-Level Association of Social Vulnerability with COVID-19 Cases and Deaths in the USA. Journal of General Internal Medicine 35 (9):2784–2787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kucks RP 2005. Terrestrial Radioactivity and Gamma-ray Exposure in the United States and Canada: Gridded geographic images. https://mrdata.usgs.gov/metadata/narad.faq.html (last accessed 11 November 2023). [Google Scholar]
- Kwan M-P 2021. The stationarity bias in research on the environmental determinants of health. Health & Place 70:102609. [DOI] [PubMed] [Google Scholar]
- Lin Y, Hoover J, Beene D, Erdei E, and Liu Z. 2020. Environmental risk mapping of potential abandoned uranium mine contamination on the Navajo Nation, USA, using a GIS-based multi-criteria decision analysis approach. Environmental Science and Pollution Research 27 (24):30542–30557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindberg A-L, Ekström E-C, Nermell B, Rahman M, Lönnerdal B, Persson L-Å, and Vahter M. 2008. Gender and age differences in the metabolism of inorganic arsenic in a highly exposed population in Bangladesh. Environmental Research 106 (1):110–120. [DOI] [PubMed] [Google Scholar]
- Maier NK, Crown D, Liu J, Leppla SH, and Moayeri M. 2014. Arsenic Trioxide and Other Arsenical Compounds Inhibit the NLRP1, NLRP3, and NAIP5/NLRC4 Inflammasomes. The Journal of Immunology 192 (2):763–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mason GT, and Arndt RE 1996. Mineral Resources Data System (MRDS). The Survey,. https://pubs.usgs.gov/publication/ds20 (last accessed 11 November 2023). [Google Scholar]
- Meng Y, Wong MS, Kwan M-P, and Zhu R. 2022. Association between Global Air Pollution and COVID-19 Mortality: A Study of Forty-Six Cities in the World. Annals of the American Association of Geographers 112 (6):1777–1793. [Google Scholar]
- Meng Y, Wong MS, Xing H, Kwan M-P, and Zhu R. 2021. Assessing the Country-Level Excess All-Cause Mortality and the Impacts of Air Pollution and Human Activity during the COVID-19 Epidemic. International Journal of Environmental Research and Public Health 18 (13):6883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nayak A, Islam SJ, Mehta A, Ko Y-A, Patel SA, Goyal A, Sullivan S, Lewis TT, Vaccarino V, Morris AA, and Quyyumi AA 2020. Impact of Social Vulnerability on COVID-19 Incidence and Outcomes in the United States. :2020.04.10.20060962. https://www.medrxiv.org/content/10.1101/2020.04.10.20060962v2 (last accessed 16 November 2023). [Google Scholar]
- NOAA Physical Sciences Laboratory. 2023. NCEP North American Regional Reanalysis (NARR). https://psl.noaa.gov/data/gridded/data.narr.html (last accessed 11 November 2023).
- Oak Ridge National Laboratory. 2020. LandScan. Documentation, LandScan. https://landscan.ornl.gov/documentation (last accessed 8 April 2021). [Google Scholar]
- Openshaw S. 1984. The modifiable areal unit problem. Concepts and Techniques in Modern Geography. https://ci.nii.ac.jp/naid/10024464407/ (last accessed 16 December 2019). [Google Scholar]
- Oshan TM, Smith JP, and Fotheringham AS 2020. Targeting the spatial context of obesity determinants via multiscale geographically weighted regression. International Journal of Health Geographics 19 (1):11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rahmani K, Shavaleh R, Forouhi M, Disfani HF, Kamandi M, Oskooi RK, Foogerdi M, Soltani M, Rahchamani M, Mohaddespour M, and Dianatinasab M. 2022. The effectiveness of COVID-19 vaccines in reducing the incidence, hospitalization, and mortality from COVID-19: A systematic review and meta-analysis. Frontiers in Public Health 10. https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2022.873596/full (last accessed 19 June 2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raqib R, Ahmed S, Ahsan KB, Kippler M, Akhtar E, Roy AK, Lu Y, Arifeen SE, Wagatsuma Y, and Vahter M. 2017. Humoral Immunity in Arsenic-Exposed Children in Rural Bangladesh: Total Immunoglobulins and Vaccine-Specific Antibodies. Environmental Health Perspectives 125 (6):067006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rifat SAA, and Liu W. 2022. One year into the pandemic: the impacts of social vulnerability on COVID-19 outcomes and urban–rural differences in the conterminous United States. International Journal of Environmental Health Research 32 (12):2601–2619. [DOI] [PubMed] [Google Scholar]
- Rodriguez-Lonebear D, Barceló NE, Akee R, and Carroll SR 2020. American Indian reservations and COVID-19: Correlates of early infection rates in the pandemic. Journal of Public Health Management and Practice 26 (4):371–377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ryan PH, LeMasters GK, Levin L, Burkle J, Biswas P, Hu S, Grinshpun S, and Reponen T. 2008. A land-use regression model for estimating microenvironmental diesel exposure given multiple addresses from birth through childhood. Science of The Total Environment 404 (1):139–147. [DOI] [PubMed] [Google Scholar]
- Sachdeva M, and Fotheringham AS 2023. A Geographical Perspective on Simpson’s Paradox. Journal of Spatial Information Science (26):1–25. [Google Scholar]
- Scammell MK, Sennett C, Laws RL, Rubin RL, Brooks DR, Amador JJ, López-Pilarte D, Ramirez-Rubio O, Friedman DJ, McClean MD, Cohort Study Team NB, Lewis J, and Erdei E. 2020. Urinary Metals Concentrations and Biomarkers of Autoimmunity among Navajo and Nicaraguan Men. International Journal of Environmental Research and Public Health 17 (15):5263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sequist TD 2020. The Disproportionate Impact of Covid-19 on Communities of Color. Catalyst nonissue content 1 (4). https://catalyst.nejm.org/doi/full/10.1056/CAT.20.0370 (last accessed 22 October 2023). [Google Scholar]
- Siordia C, Saenz J, and Tom SE 2012. An Introduction to Macro- Level Spatial Nonstationarity: a Geographically Weighted Regression Analysis of Diabetes and Poverty. Human geographies 6 (2):5–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sood A, Pollard C, Suer KL, Vlahovich K, and Walker J. 2021. Caring for miners during the Coronavirus Disease-2019 (COVID-19) Pandemic. The Journal of Rural Health 37 (1):165–168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soto Mas F, Iriart C, Pedroncelli R, Binder DS, Qualls CR, and Price B. 2019. Impact of Health Care and Socioeconomic Needs on Health Care Utilization and Disease Management: The University of New Mexico Hospital Care One Program. Population Health Management 22 (2):113–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tieskens K, Patil P, Levy JI, Brochu P, Lane KJ, Fabian MP, Carnes F, Haley BM, Spangler KR, and Leibler JH 2021. Time-varying associations between COVID-19 case incidence and community-level sociodemographic, occupational, environmental, and mobility risk factors in Massachusetts. Research Square :rs.3.rs-237622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- U.S. Census Bureau. 2021. QuickFacts: New Mexico. QuickFacts. https://www.census.gov/quickfacts/NM (last accessed 6 March 2023). [Google Scholar]
- U.S. Census Bureau, U. C. 2021. Race and Ethnicity in the United States: 2010 Census and 2020 Census. Race and Ethnicity in the United States: 2010 Census and 2020 Census. https://www.census.gov/library/visualizations/interactive/race-and-ethnicity-in-the-united-state-2010-and-2020-census.html (last accessed 6 March 2023). [Google Scholar]
- U.S. Geological Survey. 2022. Elevation Derivatives for National Applications (EDNA) | U.S. Geological Survey https://www.usgs.gov/special-topics/elevation-derivatives-for-national-applications (last accessed 11 November 2023).
- Wang H, Paulson KR, Pease SA, Watson S, Comfort H, Zheng P, Aravkin AY, Bisignano C, Barber RM, Alam T, Fuller JE, May EA, Jones DP, Frisch ME, Abbafati C, Adolph C, Allorant A, Amlag JO, Bang-Jensen B, Bertolacci GJ, Bloom SS, Carter A, Castro E, Chakrabarti S, Chattopadhyay J, Cogen RM, Collins JK, Cooperrider K, Dai X, Dangel WJ, Daoud F, Dapper C, Deen A, Duncan BB, Erickson M, Ewald SB, Fedosseeva T, Ferrari AJ, Frostad JJ, Fullman N, Gallagher J, Gamkrelidze A, Guo G, He J, Helak M, Henry NJ, Hulland EN, Huntley BM, Kereselidze M, Lazzar-Atwood A, LeGrand KE, Lindstrom A, Linebarger E, Lotufo PA, Lozano R, Magistro B, Malta DC, Månsson J, Herrera AMM, Marinho F, Mirkuzie AH, Misganaw AT, Monasta L, Naik P, Nomura S, O’Brien EG, O’Halloran JK, Olana LT, Ostroff SM, Penberthy L, Jr RCR, Reinke G, Ribeiro ALP, Santomauro DF, Schmidt MI, Shaw DH, Sheena BS, Sholokhov A, Skhvitaridze N, Sorensen RJD, Spurlock EE, Syailendrawati R, Topor-Madry R, Troeger CE, Walcott R, Walker A, Wiysonge CS, Worku NA, Zigler B, Pigott DM, Naghavi M, Mokdad AH, Lim SS, Hay SI, Gakidou E, and Murray CJL 2022. Estimating excess mortality due to the COVID-19 pandemic: a systematic analysis of COVID-19-related mortality, 2020–21. The Lancet 399 (10334):1513–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson V, and Mokhiber Z. 2016. 2016 ACS shows stubbornly high Native American poverty and different degrees of economic well-being for Asian ethnic groups. Bulgaria: Economic Policy Institute. https://policycommons.net/artifacts/1407215/2016-acs-shows-stubbornly-high-native-american-poverty-and-different-degrees-of-economic-well-being-for-asian-ethnic-groups/2021476/ (last accessed 24 January 2024). [Google Scholar]
- Wu X, Nethery RC, Sabath MB, Braun D, and Dominici F. 2020. Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis. Science Advances 6 (45):eabd4049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xia X, Lin S, and Zhou Y. 2007. Effects of arsenic trioxide on the autoimmunity and survival time in BXSB lupus mice. Zhongguo Zhong xi yi jie he za zhi Zhongguo Zhongxiyi jiehe zazhi = Chinese journal of integrated traditional and Western medicine 27 (2):138–141. [PubMed] [Google Scholar]
- Yellow Horse AJ, Deschine Parkhurst NA, and Huyser KR 2020. COVID-19 in New Mexico tribal lands: Understanding the role of social vulnerabilities and historical racisms. Frontiers in Sociology 5:610355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu H, Russell A, Mulholland J, and Huang Z. 2018. Using cell phone location to assess misclassification errors in air pollution exposure estimation. Environmental Pollution 233:261–266. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data are from the NMDOH and are managed under a use agreement between NMDOH and UNM-HSC. Data will be made available to interested researchers pending approval for dissemination of sensitive data by both the University of New Mexico Sponsored Projects Office and the New Mexico Department of Health.

