Abstract
In this correspondence, we emphasize methodological caveats of ecological studies assessing associations between COVID-19 and its physical and social environmental determinants. First, we stress that inference is error-prone due to the modifiable areal unit problem and the modifiable temporal unit problem. The possibility of confounding from using aggregated data is substantial due to the neglect of person-level factors. Second, studying the viral transmission of COVID-19 solely on people's residential neighborhoods is problematic because people are also exposed to nonhome locations and environments en-route along their daily mobility path. We caution against an uncritical application of aggregated data and reiterate the importance of stronger research designs (e.g., case-control studies) on an individual level. To address environmental contextual uncertainties due to people's day-to-day mobility, we call for people-centered studies with mobile phone data.
Keywords: SARS-CoV-2, Environmental determinants, Ecological research designs, Modifiable areal unit problem, Modifiable temporal unit problem, Uncertain geographic context problem
The rapid spread of COVID-19 across the globe has led scholars to speculate about the environment's role in the incidence and prevalence of this disease (Shakil et al., 2020). It makes intuitive sense that some environmental settings may facilitate transmission. An increasing number of studies has examined possible physical and social environmental determinants of COVID-19. Frequently applied are ecological study designs using grouped COVID-19 and (open) environmental data (e.g., Worldpop, Google Earth Engine) corresponding to an areal unit (Wakefield, 2008). In some cases, socio-demographics (Sannigrahi et al., 2020) and measures of exposure such as air pollution (Fattorini and Regoli, 2020), humidity (Ma et al., 2020), and temperature (Sarmadi et al., 2020) were found to be associated with infection rates. However, at least two methodological caveats – resonating well-established sources of bias in geospatial data analyses – are shared amongst many of these studies. To assist COVID-19 research to make more fruitful progress in the future, we highlight potential concerns and challenges with two aims: 1) emphasizing the critical role of acknowledging geographical principles related to the spatial and temporal unit of analysis; and 2) arguing for exposure assessments centering on people rather than areas when assessing COVID-19 transmission.
First, many (if not all) of ecological COVID-19 studies to date have utilized imprecise and spatially and temporally unaligned data on an ad hoc basis. We posit that data decisions are driven by what COVID-19 datasets were publically available and the paucity of readily available exposure data. Such datasets are often limited to the municipality, county, or even country level for privacy reasons, and utilizing secondary data provides rapid empirically-based recommendations for health policy. However, spatially aggregated data across space and over time susceptible to the modifiable areal unit problem (MAUP) (Openshaw, 1981) and the modifiable temporal unit problem (MTUP) (Cheng and Adepeju, 2014). The former emphasizes that analytical differences may occur depending on the size of the areal units used (the scale effect) and the way the study area is divided (e.g., municipalities vs. districts) (the zoning effect). The latter stresses that results also depend on the way data are temporally aggregated (e.g., weeks vs. months) and which temporal segmentation schemes are applied (e.g., first day per week/month vs. middle of the week/months) (Cressie, 1998). While some attempts have been made to assess daily COVID-19 patterns and reduce the risk of the MTUP (e.g., Badr et al., 2020), it is likely that COVID-19-associations remain dependent on the state of the epidemic. Different data periods considered in an ecological analysis (e.g., the first month vs. the first three months since the outbreak) likely manifest in different spatiotemporal COVID-19 incidence patterns and thus altered correlations.
Both the MAUP and MTUP represent fundamental methodological problems shown to bias statistical inference and result in spurious models (de Jong and de Bruin, 2012; Wang and Di, 2020). Limited space-time granularity can result not only in underpowered studies but also in conclusions based on geographic scales that are too large to uncover meaningful COVID-19-environment associations and effectively inform policy. Of similar importance, the possibility of confounding from using ecological inference is substantial due to the neglect of person-level factors (e.g., age) (Greenland et al., 1994). Thanks to considerable efforts undertaken to quickly understand COVID-19 risk factors, recent ecological analyses are more comprehensively adjusted for potentially important confounders including the epidemic stage, measures for containment, socio-demographic and meteorological conditions, etc. (Klompmaker et al., 2020). These efforts have considerably reduced residual confounding, but regardless of the strength of an observed (well-adjusted) ecological association, neither can conclusions on the cause-and-effect relationships be made nor can the associations be transferred to the individual-level (ecological fallacy) (Robinson, 2009; Wakefield, 2008). We, therefore, urge for conceptually-stronger observational research designs including retrospective case-control studies based on register linkage data enriched with high-resolution environmental determinants derived on a residential address level by means of geographic information systems. Prospective cohort studies may not be practical in light of ethical issues regarding the exposure of people to a health-threatening risk in which specific population groups (e.g., those with pre-existing illness) appear to be disproportionately affected.
Second, when studying highly contagious infectious diseases such as COVID-19 where viral transmission occurs through largely face-to-face interaction (Wiersinga et al., 2020), assessing transmission risk based solely on people's residential neighborhoods seems questionable. Despite partial lockdowns, people can be exposed to different environments at nonhome locations (e.g., home improvement stores, grocery stores) and en-route along their mobility path when traveling between their day-to-day activity places. Disregarding the timing and duration that people are exposed to environments beyond their home gives rise to another methodological concern, the uncertain geographic context problem (UGCoP) (Kwan, 2012). This concern describes misspecification of the geospatial context that manifests in the over- and underestimation of the “true” exposure to environments, and ultimately results in inferential errors. Relatedly, epidemiological and environmental data should be temporally aligned as closely as possible to reduce temporal context uncertainties; a temporal misalignment can obscure or distort COVID-19–exposure associations and falsely render environmental factors as correlates or not (Helbich, 2019). To address context uncertainties from people's day-to-day mobility behavior, we call for individual rather than area-centered approaches using location-tracking technologies.
With strict privacy regulations and data security in place and without the need for primary data collection, researchers should take advantage of mobile phone data. Movement patterns and activity locations can be inferred based on the specific cellular towers and the handovers from one tower to another assigned to phones (Oliver et al., 2020). Since such data can play a critical role in retrieving fine-grained information on space-time mobility and social interaction on a population-level, we urge for alliances between research labs and telecommunications operators for mobility-based exposure assessments.
Taken together, while a methodological and epistemological pluralism is inevitable in spatial analyses on COVID-19, we caution against an uncritical application of area-aggregated data and reiterate the importance of inherent scale-time dependencies and contextual uncertainties arising through human mobility.
Declaration of competing interest
The authors have no conflict of interest to declare.
Acknowledgments
Acknowledgements
We thank the editor-in-chief Jay Gan and the four anonymous reviewers for their suggestions to improve the original draft of the correspondence.
Funding
Marco Helbich received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No. 714993). This work was also supported by EXPOSOME-NL, which is funded through the Gravitation program of the Dutch Ministry of Education, Culture, and Science and the Netherlands Organization for Scientific Research (NWO grant number 024.004.017). Mei-Po Kwan was supported by a grant from the Research Committee on Research Sustainability of Major RGC Funding Schemes of the Chinese University of Hong Kong.
Editor: Jay Gan
References
- Badr H.S., Du H., Marshall M., Dong E., Squire M.M., Gardner L.M. Association between mobility patterns and COVID-19 transmission in the USA: a mathematical modelling study. Lancet Infect. Dis. 2020 doi: 10.1016/S1473-3099(20)30553-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng T., Adepeju M. Modifiable temporal unit problem (MTUP) and its effect on space-time cluster detection. PLoS One. 2014;9 doi: 10.1371/journal.pone.0100465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cressie N. Aggregation and interaction issues in statistical modeling of spatiotemporal processes. Geoderma. 1998;85:133–140. [Google Scholar]
- de Jong R., de Bruin S. Linear trends in seasonal vegetation time series and the modifiable temporal unit problem. Biogeosciences. 2012;9:71–77. [Google Scholar]
- Fattorini D., Regoli F. Role of the chronic air pollution levels in the Covid-19 outbreak risk in Italy. Environ. Pollut. 2020;114732 doi: 10.1016/j.envpol.2020.114732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenland S., Robins J., Piantadosi S., Cohen B.L. Ecologic studies-biases, misconceptions, and counterexamples. Invited commentaries. Authors replies. Am. J. Epidemiol. 1994;139:747–771. doi: 10.1093/oxfordjournals.aje.a117069. [DOI] [PubMed] [Google Scholar]
- Helbich M. Spatiotemporal contextual uncertainties in green space exposure measures: exploring a time series of the normalized difference vegetation indices. Int. J. Environ. Res. Public Health. 2019;16:852. doi: 10.3390/ijerph16050852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klompmaker J.O., Hart J.E., Holland I., Sabath M.B., Wu X., Laden F., Dominici F., James P. County-level exposures to greenness and associations with COVID-19 incidence and mortality in the United States. medRxiv. 2020 doi: 10.1101/2020.08.26.20181644. Version 1. Preprint. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwan M.-P. The uncertain geographic context problem. Ann. Assoc. Am. Geogr. 2012;102:958–968. [Google Scholar]
- Ma Y., Zhao Y., Liu J., He X., Wang B., Fu S., Yan J., Niu J., Zhou J., Luo B. Effects of temperature variation and humidity on the death of COVID-19 in Wuhan, China. Sci. Total Environ. 2020;138226 doi: 10.1016/j.scitotenv.2020.138226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oliver N., Lepri B., Sterly H., Lambiotte R., Deletaille S., De Nadai M., Letouzé E., Salah A.A., Benjamins R., Cattuto C. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv. 2020;6 doi: 10.1126/sciadv.abc0764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Openshaw S. The modifiable areal unit problem. Quant. Geogr. A Br. View. 1981:60–69. [Google Scholar]
- Robinson W.S. Ecological correlations and the behavior of individuals. Int. J. Epidemiol. 2009;38:337–341. doi: 10.1093/ije/dyn357. [DOI] [PubMed] [Google Scholar]
- Sannigrahi S., Pilla F., Basu B., Basu A.S., Molter A. Examining the association between socio-demographic composition and COVID-19 fatalities in the European region using spatial regression approach. Sustain. Cities Soc. 2020;102418 doi: 10.1016/j.scs.2020.102418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sarmadi M. Association of COVID-19 global distribution and environmental and demographic factors: an updated three-month study. Environ. Res. 2020;109748 doi: 10.1016/j.envres.2020.109748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shakil M.H., Munim Z.H., Tasnia M., Sarowar S. COVID-19 and the environment: a critical review and research agenda. Sci. Total Environ. 2020;141022 doi: 10.1016/j.scitotenv.2020.141022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wakefield J. Ecologic studies revisited. Annu. Rev. Public Heal. 2008;29:75–90. doi: 10.1146/annurev.publhealth.29.020907.090821. [DOI] [PubMed] [Google Scholar]
- Wang Y., Di Q. Modifiable areal unit problem and environmental factors of COVID-19 outbreak. Sci. Total Environ. 2020;139984 doi: 10.1016/j.scitotenv.2020.139984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiersinga W.J., Rhodes A., Cheng A.C., Peacock S.J., Prescott H.C. Pathophysiology, transmission, diagnosis, and treatment of coronavirus disease 2019 (COVID-19): a review. Jama. 2020;324(8):782–793. doi: 10.1001/jama.2020.12839. [DOI] [PubMed] [Google Scholar]