Abstract
The negative impacts of fine particulate matter (PM2.5) exposure on human health are a primary motivator for air quality research. However, estimates of the air pollution health burden vary considerably and strongly depend on the datasets and methodology. Satellite observations of aerosol optical depth (AOD) have been widely used to overcome limited coverage from surface monitoring and to assess the global population exposure to PM2.5 and the associated premature mortality. Here we quantify the uncertainty in determining the burden of disease using this approach, discuss different methods and datasets, and explain sources of discrepancies among values in the literature. For this purpose we primarily use the MODIS satellite observations in concert with the GEOS-Chem chemical transport model. We contrast results in the United States and China for the years 2004–2011. Using the Burnett et al. (2014) integrated exposure response function, we estimate that in the United States, exposure to PM2.5 accounts for approximately 2% of total deaths compared to 14% in China (using satellite-based exposure), which falls within the range of previous estimates. The difference in estimated mortality burden based solely on a global model vs. that derived from satellite is approximately 14% for the U.S. and 2% for China on a nationwide basis, although regionally the differences can be much greater. This difference is overshadowed by the uncertainty in the methodology for deriving PM2.5 burden from satellite observations, which we quantify to be on the order of 20% due to uncertainties in the AOD-to-surface-PM2.5 relationship, 10% due to the satellite observational uncertainty, and 30% or greater uncertainty associated with the application of concentration response functions to estimated exposure.
1 Introduction
By 2030, air pollution will be the leading environmentally-related cause of premature mortality worldwide (OECD, 2012). The World Health Organization (WHO) estimates that exposure to outdoor air pollution resulted in 3.7 million premature deaths in 2012. Many epidemiological studies have shown that chronic exposure to fine particulate matter (PM2.5) is associated with an increase in the risk of mortality from respiratory diseases, lung cancer, and cardiovascular disease, with the underlying assumption that a causal relationship exists between PM and health outcomes (Dockery et al., 1993; Jerrett et al., 2005; Krewski et al., 2009; Pope et al., 1995; 2002; 2004; 2006). This has been shown through single and multi-population time series analyses, long-term cohort studies, and meta-analyses.
In order to stress the negative impacts of air pollution on human health and inform policy development (particularly with regard to developing strategies for intervention and risk reduction), many studies have calculated the total number of premature deaths each year attributable to air pollution exposure or the “burden of disease,” through health impact assessment methods. One of the main obstacles in attributing specific health impacts of PM2.5 is determining exposure and linking this to specific health outcomes. Jerrett et al. (2005) suggest personal monitors would be the optimal method because it would be easier to attribute individual recorded health outcomes to specific particulate levels, but point out that the financial costs and time-intensiveness limit widespread use. Many studies have instead relied on fixed-site monitors within a certain radius to estimate population-level exposure. However, these monitoring networks are generally located in urban regions and provide no information on concentration gradients between sites. Thus, epidemiological studies typically have to quantify the aggregate population response to an area-average concentration. Additionally, health data can be limited and therefore the responses may be determined from a subset of individuals that may not be representative of the wider population.
Estimating the burden of disease associated with particulate air pollution requires robust estimates of PM2.5 exposure. Fixed-site monitoring networks can be costly to operate and maintain, and the sampling time period for many of these monitors in the United States is often only every third or sixth day. Due to the high spatial and temporal variability in aerosol concentrations, this makes it difficult to determine exposure and widespread health impacts. Worldwide, monitoring networks are even scarcer, with many developing countries lacking any long-term measurements. “Satellite-based” concentrations are now used extensively for estimating mortality burdens and health impacts (e.g. Crouse et al., 2012; Evans et al., 2013; Fu et al., 2015; Hystad et al., 2012; Villeneuve et al., 2015) Satellite observations of aerosol optical depth (AOD) can offer observational constraints for population-level exposure estimates in regions where surface air quality monitoring is limited; however they represent the vertically-integrated extinction of radiation due to aerosols, and thus additional information on the vertical distribution and the optical properties of particulate matter is required (often provided by a model) to translate these observations to surface air quality (van Donkelaar et al., 2006, 2010; Liu et al., 2004, 2005). Alternatively, studies have relied on model-based estimates of PM2.5 exposure. Table 1 shows that the resulting estimates of premature mortality vary widely. Here, we discuss these different methods and contrast the uncertainty in these approaches for estimating exposure for both the U.S., where air quality has improved due to regulations and control technology, and China, where air quality is a contemporary national concern. Our objective is to investigate the factors responsible for uncertainty in chronic PM2.5 burden of disease estimates, and use these uncertainties to contextualize the comparison of satellite-based and model-based estimates of premature mortality with previous work. As health impact assessment methods are becoming more popular in the scientific literature, a greater understanding of the uncertainties in these methods and the datasets that are used is important.
Table 1.
Premature mortality from PM2.5 exposure by all-cause (All), heart disease (heart), and lung cancer (LC) as estimated in other studies for the globe, U.S. (or North America), and China (or Asia). Values are for (× 1000 deaths per year).
| Study | U.S. (North America) | China (Asia/Western Pacific, East Asia) | Global | Year for estimate | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| All | Heart | LC | All | Heart | LC | All | Heart | LC | ||
| Evans et al., 2013* (WHO region) | 2640–4220 | 1123–1669 | 176–264 | 2004 | ||||||
| Fann et al., 2012* (U.S.) | 130–320 | 2005 | ||||||||
| Anenberg et al., 2010* (continents) | 141 | 124 | 17 | 2736 | 2584 | 152 | 2077–7714 | 3499 1800–4549 |
222 39–336 |
2000 |
| Lelieveld et al., 2013 (U.S. and China) | 55 | 46 | 9.1 | 1006 | 898 | 108 | 2200 | 2000 | 186 | 2005 |
| Cohen et al., 2004* (WHO region) | 28 | 3–55 | 1–12 | 355 | 192–504 | 22–53 | 799 | 474–1132 | 39–105 | 2000 |
| Lim et al., 2012 (GBD 2010, U.S. and China) | 86 | 58 | 20 | 858 | 563 | 185 | 3100 | 2010 | ||
| Forouzanfar et al., 2015 (GBD 2013, U.S. and China) | 78 | 54 | 17 | 916 | 600 | 201 | 2900 | 2013 | ||
| WHO, 2014 (WHO region) | 152 | 1669 | 3700 | 1505 | 227 | 2012 | ||||
| Fang et al., 2013 (North America and East Asia) | 38 | 4.4 | 661 | 53 | 1532 | 95 | 2000 | |||
| Silva et al., 2013** (continents) | 12.2–77 | 908–1240 | 1880–2380 | 2000 | ||||||
| U.S. EPA, 2010* (U.S.) | 26–360 | 2005 | ||||||||
| U.S. Environmental Protection Agency, 2009 (U.S.) | 144 | |||||||||
| Punger and West, 2013 (U.S.) | 66 | 61 | 9.9 | 2005 | ||||||
| Lelieveld et al., 2015 (U.S. and China) | 55 | 1357 | 3297 | 2010 | ||||||
| Sun et al., 2015 (U.S.) | 103.3 | 68.3 | 15.4 | 2000 | ||||||
| Rohde and Muller, 2015 (China) | 1600 | 2014 | ||||||||
|
| ||||||||||
| This Study: Satellite (U.S. and China) with Burnett et al., 2014 | 50 | 38 | 5 | 1271 | 9 | 138 | 2004–2011 | |||
| This Study: Model (U.S. and China) with Burnett et al., 2014 | 43 | 32 | 4 | 1300 | 931 | 144 | 2004–2011 | |||
All cause values for this study are calculated as the sum of heart disease, lung cancer, and respiratory disease deaths (as opposed to calculating this based on an all-cause CRF).
Study provides several estimates determined using different CRFs.
Study provides several estimates from 14 different models. Table 3 provides additional information on the data sources and concentrations response functions used in these studies.
2 Methods and Tools
2.1 General formulation to calculate the burden of disease
To estimate the burden of premature mortality due to a specific factor like PM2.5 exposure, we rely on equations 1 and 2 (equations 6 and 8 in Ostro, 2004 and as previously used in van Donkelaar et al., 2011; Evans et al., 2013; Marlier et al., 2013; Zheng et al., 2015). The attributable fraction (AF) of mortality due to PM2.5 exposure depends on the relative risk value (RR), which here is the ratio of the probability of mortality (all-cause or from a specific disease) occurring in an exposed population to the probability of mortality occurring in a non-exposed population. The total burden due to PM2.5 exposure (ΔM) can be estimated by convolving the AF with the baseline mortality (equal to the baseline mortality rate Mb × exposed population P). The relative risk is assumed to change (ΔRR) with concentration, so that, in general, exposure to higher concentrations of PM2.5 should pose a greater risk for premature mortality (section 2.4).
| (1) |
| (2) |
Application of this approach requires information on the baseline mortality rates and population, along with the RR, which is determined through a concentration response function (including a shape and initial relative risk, section 2.4), and ambient surface PM2.5 concentrations.
2.2 Baseline mortality and population
For population data, we use the Gridded Population of the World, Version 3 (GPWv3), created by the Center for International Earth Science Information Network (CIESIN) and available from the Socioeconomic Data and Applications Center (SEDAC). This gridded dataset has a native resolution of 2.5 arc-minutes (~5km at the equator) and provides population estimates for 1990, 1995, and 2000, and projections (made in 2004) for 2005, 2010, and 2015. We linearly interpolate between available years to get population estimates for years not provided. Population density for China and the United States for the year 2000 are shown in Figure 1 along with the projected change in population density by the year 2015, illustrating continued growth of urbanized areas (at the expense of rural regions in China). We also compare mortality estimates using only urban area population (similar to Lelieveld et al., 2013 which estimates premature mortality in mega-cities). For this, we rely on the populated places dataset (provided by Natural Earth, which gives values for a point location rather than a grid and includes all major cities and towns along with some smaller towns in sparsely inhabited regions) which is determined from LandScan population estimates (Bright et al., 2008). In the U.S., approximately 80% of the population lives in urban areas. For China, 36% of the population lived in urban areas in 2000, but this number rose to 53% in 2013 (World Bank, 2015).
Figure 1.

Population density [per km2] for the year 2000 from the GPWv3 data for (a) the continental U.S. and (c) China. The projection for increase in population density by the year 2015 for (b) the continental U.S. and (d) China.
To determine baseline mortalities in the U.S. for cardiovascular disease (ischemic heart disease and stroke), lung cancer, and respiratory disease, we use mortality rates for each cause of death for all ages from the Center for Disease Control (cdc.gov) for each year and each state. We multiply the gridded population by these state-level mortality rates to obtain the baseline mortality in each grid box. Other studies have also used country-wide (or regional) (e.g. Evans et al., 2013) or county-level (e.g. Fann et al., 2013) average deaths rates. Some studies use the mortality rate for all cardiovascular diseases, which would produce larger estimates than just using ischemic heart disease and cerebrovascular disease (stroke). Additionally, some studies also only consider respiratory deaths related to ozone exposure. Mortality values are not as readily available for China, so we rely on country-wide values for baseline mortality (WHO age-standardized mortality rates by cause). Therefore, in China spatial variations in Mb are only due to variations in population and not regional variations in actual death rates (i.e. we do not account for death-specific mortality rates varying between provinces). In order to account for some regional variability in mortality rates, we use a population threshold to distinguish between urban and rural regions for lung cancer mortality rates (Chen et al., 2013a).
2.3 Relative Risk
The relative risk (RR) is a ratio of the probability of a health endpoint (in this case premature mortality) occurring in a population exposed to a certain level of pollution to the probability of that endpoint occurring in a population that is not exposed. Values greater than one suggest an increased risk, while a value of one would suggest no change in risk. These values are determined through epidemiological studies which relate individual health impacts to changes in concentrations, and literature values span a large range (Figure 2). While these studies attempt to account for differences in populations, lifestyles, pre-existing conditions, and co-varying pollutants, relative risk ratios determined from each study still differ. This is likely due to variables not taken into consideration, errors in exposure estimates (“exposure misclassification”) (Sheppard et al., 2012), and because, although the long-term effects of exposure to atmospheric pollutants have been well-documented, the pathophysiological mechanisms linking exposure to mortality risk are still unclear (Chen and Goldberg, 2009; Pope and Dockery, 2013; Sun et al., 2010), which make it difficult to determine how transferable results are from the context in which they were generated.
Figure 2.

Relative risk ratios from select previous studies for mortality due to chronic exposure to PM2.5 (given as per 10μgm−3 increase) colored by cause of death. Studies applied in this work are highlighted in bold.
For our initial estimates, we use the integrated risk function from Burnett et al. (2014) for heart disease, respiratory disease, and lung cancer premature mortality due to chronic exposure. We also compare our results to premature mortality estimates using risk ratios determined by Krewski et al. (2009), which is an extended analysis of the American Cancer Society study (Pope et al., 1995), and from Laden et al. (2006) which is an updated and extended analysis of the Harvard Six Cities study (Pope et al., 2002). The updated Krewski et al. (2009) risk ratios have been widely used in similar studies due to the large study population with national coverage, 18 year time span, and extensive analysis of confounding variables (ecological covariates, gaseous pollutants, weather, medical history, age, smoking, etc.). However, the Burnett et al. (2014) function is becoming more widely used in the literature (e.g. Lelieveld et al., 2015; Lim et al., 2012, Apte et al., 2015) because it provides the shape of the mortality function for the global range of exposure concentrations. Using these different risk ratios can makes our results more directly comparable to studies in Table 1 which rely on the risk ratios from these four studies (Burnett et al., 2014; Krewski et al., 2009; Laden et al., 2006; Pope et al., 2002).
2.4 Concentration response function
In order to determine an attributable fraction, it is necessary to understand how the response changes with concentration (i.e. does the relative risk increase, decrease, or level off with higher concentrations?). The shape of this concentration response function is an area of ongoing epidemiological research (e.g. Burnett et al., 2014; Pope et al., 2015).
In the simplest form, it might be assumed that the change in relative risk (RR, given as per 10 μg/m3) linearly depends on the surface PM2.5 concentration (C, in μg/m3) as given in equation 4 (and as presented as an alternate form in Cohen et al., 2004; 2005).
| (4) |
In this equation, C0 can be considered the “policy relevant (PRB)/target”, “natural background” or “threshold”/ “counterfactual”/ “lowest effect level” surface PM2.5 concentration. Studies have shown that there is not a concentration level below which there is no adverse health effect for PM (e.g. Pope et al., 2002; Shi et al., 2015) and most experts in health impacts of ambient air quality agree that there is no population-level threshold (although there may be individual-level thresholds, e.g. Roman et al., 2008). However, there are few epidemiological studies in regions with very low annual average concentrations (Crouse et al., 2012 does records a 1.9 μg m−3 annual concentration in rural Canada) making it difficult to determine the health risks in relatively clean conditions. How to extrapolate the relationship out of the range of observed measurements is uncertain. Therefore, rather than assuming that the function is linear down to zero, studies often set C0 to the value of the lowest measured level (LML) observed in the epidemiology study from which the RRs are derived [e.g. Evans et al., 2013 use 5.8 μg/m3 with the RR from Krewski et al. 2009] or use the “policy relevant” background (PRB, generally 0–2 μgm−3) concentration. This is the level to which policies might be able to reduce concentration and is generally determined from model simulations in which domestic anthropogenic emissions have been turned off (e.g. Fann et al., 2012). Similarly, some studies have set this value to preindustrial (1850) pollution levels (e.g. Fang et al., 2013; Silva et al., 2013).
Linear response functions are generally a good fit to observed responses at lower concentrations (Pope et al., 2002). However, studies suggest that linear response functions can greatly overestimate RR at high concentrations (e.g. Pope et al., 2015), where responses may start to level off. There is uncertainty at high concentrations because most epidemiology studies of the health effects of air pollution exposure have generally been conducted under lower concentrations (i.e. in the U.S.). In order to determine the shape of this response at higher concentrations, smoking has been used as a proxy (Burnett et al., 2014; Pope et al., 2011, 2009), which does show a diminishing response at higher concentrations. Therefore, both log-linear (Equations 5 and 6, where β = 0.15515/0.23218 for heart disease/lung cancer from Pope et al. (2002) or β = 0.18878/0.21136 for heart disease/lung cancer from Krewski et al., 2009 in Equation 6 and β = 0.01205/0.01328 for heart disease/lung cancer from Krewski et al., 2009 in Equation 5) and power law (Equation 7, where I is the inhalation rate of 18m3day−1, β = 0.2730/0.3195, α = 0.2685/0.7433 for heart disease/lung cancer from Pope et al., 2011 and as used in Marlier et al., 2013) functions have been also been explored in this study.
| (5) |
| (6) |
| (7) |
We note that Cohen et al. (2005) and Anenberg et al. (2010) reference Equation 5 as a log-linear function, while Ostro (2004), Evans et al. (2013), and Giannadaki et al. (2014) use this as their linear function and instead use Equation 6 as their log-linear function, we will refer to these equation numbers for clarity in other sections. Another method to limit the response at high concentrations is to simply use a “ceiling,” “maximum exposure/high-concentration threshold,” or “upper truncation” value in which it is assumed that the response remains the same for any value above it (e.g. Anenberg et al., 2012; Cohen et al., 2005; Evans et al., 2013). This can be a somewhat arbitrary value or the highest observed concentration in the original epidemiological study.
Recently, Burnett et al. (2014) fit an integrated exposure response (IER) model using RRs from a variety of epidemiological studies on ambient and household air pollution, active smoking, and second hand tobacco smoke in order to determine RR functions over all global PM2.5 exposure ranges for ischemic heart disease, cerebrovascular disease, chronic obstructive pulmonary disease, and lung cancer (Equation 8). Monte Carlo simulations were conducted in order to derive the one thousand sets of coefficients for the IER function (the coefficients are available at http://ghdx.healthdata.org/record/global-burden-disease-study-2010-gbd-2010-ambient-air-pollution-risk-model-1990–2010). for C < C0, ΔRR= 0
| (8) |
This form is now being widely used (Apte et al., 2015; Lelieveld et al., 2015; Lim et al., 2012) and we use it here for our baseline estimates. In the following sections, we will discuss the uncertainty on the burden of disease associated with the shape of the concentration response function and threshold concentration.
2.5 Estimating surface PM2.5
We use both a global model and satellite observations to estimate surface PM2.5 concentrations and translate these to PM2.5 exposure and health burden. In addition, we use surface measurements of PM2.5 to test the accuracy of these estimates.
2.5.1 Unconstrained Model Simulation
We use the global chemical transport model GEOS-Chem (geos-chem.org) to simulate both surface PM2.5 and AOD. We use v9.01.03 of the model, driven by GEOS-5 meteorology, in the nested grid configuration over North America and Asia (0.5°×0.667° horizontal resolution) for 2004–2011. Using this longer time period gives greater confidence in our uncertainty results. The GEOS-Chem aerosol simulation includes sulfate, nitrate, ammonium (Park et al., 2004), primary carbonaceous aerosols (Park et al., 2003), dust (Fairlie et al., 2007; Ridley et al., 2012), sea salt (Alexander et al., 2005), and secondary organic aerosols (SOA) (Henze et al., 2008). There are several regional anthropogenic emission inventories used in the model, such as BRAVO over Mexico (Kuhns et al., 2003), EMEP over Europe (Vestreng et al., 2007), CAC for Canada (http://www.ec.gc.ca/pdb/cac/cac_home_e.cfm), the EPA NEI05 inventory (Hudman et al., 2007, 2008) over the U.S., and Streets et al. (2006) over Asia. Any location not covered by one of these regional inventories relies on the GEIA (Benkovitz et al., 1996) and EDGAR (Olivier and Berdowski, 2001; Olivier et al., 2001) inventories. Biofuel emissions over the U.S. are also from the EPA NEI05 inventory (Hudman et al., 2007, 2008) and anthropogenic emissions of black and organic carbon over North America follow Cooke et al. (1999) with the seasonality from Park et al. (2003). Biogenic VOC emissions are calculated interactively following MEGAN (Guenther et al., 2006), while year-specific biomass burning is specified according to the GFED2 inventory (van der Werf et al., 2006). Surface dry PM2.5 is calculated by combining sulfate, nitrate, ammonium, elemental carbon, organic matter, fine dust, and accumulation mode sea salt concentrations in the lowest model grid box. In the following discussion, these values are referred as the “unconstrained model.” Simulated AOD is calculated at 550 nm based on aerosol optical and size properties as described in Ford and Heald (2013).
2.5.2 Satellite-based
We also derive a satellite-based surface PM2.5 using satellite observed aerosol optical depth, with additional constraints from the GEOS-Chem model, in a similar manner to Liu et al. (2004, 2007) and van Donkelaar et al. (2006, 2010, 2011). This method relies on the following relationship:
| (9) |
Where the satellite-derived PM2.5 is estimated at the resolution of the unconstrained model by multiplying the satellite observed AOD by the value ƞ, which is the ratio of model simulated surface PM2.5 to simulated AOD at the time of the satellite overpass. This is then a combined product which relies on a chemical transport model to simulate the spatially and temporally varying relationship between AOD and surface PM2.5 by accounting for all the aerosol properties and varying physical distribution and then constraining these results by “real” (i.e. satellite) measurements of AOD. Using the satellite to constrain the model concentrations is extremely useful in regions where emissions inventories and model processes are less well known.
For AOD, we use observations from both of the Moderate Resolution Imaging Spectroradiometer (MODIS) instruments and from the Multi-angle Imaging SpectroRadiometer (MISR) instrument. For this work we use MODIS 550 nm Level 2, Collection 6, Atmosphere Products for Aqua as well as Level 2, Collection 5 for Aqua. We filter these data for cloud fraction (CF < 0.2) and remove observations with high AOD (>2.0), as cloud contamination causes known biases in the AOD (Zhang et al., 2005) as in Ford and Heald (2012) although we note that this could remove high pollution observations, particularly in China. For MISR, we also use the Level 2 AOD product (F12, version 22, 500nm). We note that this is a different wavelength than from the MODIS instrument, but we neglect that difference for these comparisons. We use both of these observations for comparison as MODIS has a greater number of observations while MISR is generally considered to better represent the spatial and temporal variability of AOD over China (Cheng et al., 2012; Qi et al., 2013; You et al., 2015). Satellite observations are gridded to the GEOS-Chem nested grid resolution. We sample GEOS-Chem to days and grid boxes with valid satellite observations to calculate the ƞ used to translate the AOD to surface PM2.5.
In Figure 3, we show the long-term average (2004–2011) of satellite-based PM2.5 for the U.S. and China using MODIS Aqua Collection 6 and compare this to model-only estimates. In the following sections, most of our results will be shown using Collection 6; but reference and comparisons will be made to other products as a measure of uncertainty. In general the unconstrained model and satellite-based estimates show similar spatial features and magnitudes, with stronger local features apparent in the satellite-based PM2.5. The satellite-based estimate suggests that concentrations should be higher over much of the western U.S., particularly over California, Nevada, and Arizona (comparisons with surface measurements are discussed in section 2.5.3). In China, the satellite-derived PM2.5 is higher in Eastern China, around Beijing and the Heibei province, Tianjin, and Shanghai, but lower in many of the central provinces. While many previous studies suggest that MODIS may be biased high (and MISR biased low) over China (e.g. Cheng et al., 2012; Qi et al., 2013; You et al., 2015) and the Indo-Gangetic Plain (Bibi et al., 2015); Wang et al. (2013) note that the GEOS-Chem model underestimates PM2.5 in the Sichuan basin, suggesting that the MODIS satellite-based estimate could reduce the bias in this province.
Figure 3.

Long-term average (2004–2011) unconstrained model simulation of PM2.5 for the (a) continental U.S. and (b) China, along with the (MODIS-Aqua Collection 6) satellite-based PM2.5 for the (c) continental U.S. and (d) China, and the difference between the satellite-constrained and unconstrained model PM2.5 concentrations.
2.5.3 Surface-based Observations
We use observations of PM2.5 mass from two networks in the United States (where long-term values are more readily available than in China) to evaluate the model and satellite-derived PM2.5: the Interagency Monitoring of Protected Visual Environments (IMPROVE) and the EPA Air Quality System (AQS) database. The IMPROVE network measures PM2.5 over a 24-hour period every third day and these measurements are then analyzed for concentrations of fine, total, and speciated particle mass (Malm et al., 1994). We use the reconstructed fine mass (RCFM) values, which are the sum of ammonium sulfate, ammonium nitrate, soil, sea salt, elemental carbon and organic matter.
Previous studies have generally shown good agreement between measurements and GEOS-Chem simulations of PM2.5 (e.g. Ford and Heald, 2013; van Donkelaar et al., 2010). In Figure 4, we show the long-term average of PM2.5 at AQS and IMPROVE sites in the U.S. overlaid on simulated concentrations. In general, GEOS-Chem agrees better with measurements at IMPROVE sites, likely because these are located in rural regions where simulated values will not be as impacted by the challenge of resolving urban plumes in a coarse Eulerian model. There are noted discrepancies in California (Schiferl et al., 2014) and the Appalachia/Ohio River Valley region where the model is biased low. The model has a low mean bias of −25% compared to measurements at the EPA AQS sites and a bias of −6% compared to measurements at IMPROVE sites. Annual mean bias at individual sites ranges from −100 to 150%. At these same AQS sites, the satellite-derived PM2.5 is less biased (−12% using MODIS C6 or −8% using MISR).
Figure 4.

GEOS-Chem simulated average surface PM2.5 mass for years 2004–2011 overlaid with measurements at IMPROVE (circles) and AQS sites (diamonds).
To estimate the uncertainty in satellite AOD, we also rely on surface-based measurements of AOD from the global AErosol RObotic NETwork (AERONET) of sun photometers. AOD and aerosol properties are recorded at eight wavelengths in the visible and near-infrared (0.34–1.64μm) and are often used to validate satellite measurements (e.g. Remer et al., 2005). AERONET AOD has an uncertainty of 0.01–0.015 (Holben et al., 1998). For this work, we use hourly Version 2 Level 2 measurements sampled to two hour windows around the times of the satellite overpasses. We also perform a least-square polynomial fit to interpolate measurements to 550 nm.
3 Estimated health burden associated with exposure to PM2.5
We compare national exposure estimates for the U.S. and China using unconstrained and satellite-based (MODIS and MISR) annual average PM2.5 concentrations in Figure 5, which is a cumulative distribution plot that is calculated as the sum of the population in each grid box which has an annual average concentration at or above each concentration level. For the U.S., satellite-based estimates suggest a slightly greater fraction of the population is exposed to higher annual average concentrations, while in China, the satellite-based estimates suggest a lower fraction. Using MISR AOD suggests higher annual average concentrations in the U.S. and much lower in China, as MISR has a high bias in regions of low AOD and a low bias in regions of high AOD (Jiang et al., 2007; Kahn et al., 2010). The large discrepancy between results from MISR and MODIS could be due to differencing in sampling, but studies have also shown that MODIS is biased high in China and MISR is biased low (Cheng et al., 2012; Qi et al., 2013; You et al., 2015). We discuss the uncertainties in these estimates in section 4.
Figure 5.

Percent of the population exposed to different annual PM2.5 concentrations in the U.S. (a) and China (b). Lines denote estimates using the unconstrained GEOS-Chem simulation (red) or using satellite-based estimates with MODIS (green) and MISR (blue). Shading represents potential uncertainty associated with the model η (described in Section 4.2) and dashed black lines represent national annual air quality standards.
These exposure estimates are used to calculate an attributable fraction of mortality associated with heart disease, lung cancer, and respiratory disease attributable to chronic exposure using both model and satellite-based annual average concentrations for the U.S. and China (Table 1). In the U.S., we estimate that exposure to PM2.5 accounts for approximately 2% of total deaths (6% of heart diseases and 5% of respiratory diseases) compared to 14% (33% of heart and 22% of respiratory) in China using satellite-based concentrations. The Global Burden of Disease estimates for 2010, that 10% of total deaths in China and 3% of total deaths in the U.S. are attributable to exposure to PM2.5 (Lim et al., 2012). We present these as an average over the 2004–2011 time period in order to provide more robust results that are not driven by an outlier year, as there is considerable year-to-year variability in AOD and surface PM2.5 concentrations (for example, heavy dust years in China). However, there are trends in population (Figure 1) and surface concentrations that can influence these results. For example, there is a significant decreasing trend in AOD over the northeastern U.S. simulated in the model which is also noticeable in the satellite observations and the surface concentrations (Hand et al., 2012). This decreasing trend can be attributed to declining SO2 emissions in the U.S. as noted in Leibensperger et al. (2012). Trends in China are more difficult to ascertain as emissions have been variable over this period in general [Lu et al., 2011; Zhao et al., 2013] with widespread increases from 2004 to 2008 followed by variable trends in different regions through 2011. The difference between mortality burden estimates using model or satellite concentrations is approximately 20% for the U.S. and 2% for China on a nationwide basis, although regionally the difference can be much greater. A question we aim to address here is whether these model and satellite-based estimates are significantly different.
We compare our results to premature mortality burden estimates from other studies in Table 1. In general, our estimates for China are higher than most previous estimates, except for Lelieveld et al. (2015) and Rohde and Muller (2015).However, these studies provide estimates for 2010 and 2014, respectively, and we did find an increasing trend in our mortality estimates over the study time period. For the U.S., our estimates are in the lower range of previous studies. The spread among these studies can be attributed to the data used (i.e. MODIS Collection 5 rather than Collection 6 or unconstrained model concentrations, choice of baseline mortality rates, and population), the resolution of the data, the years studied, as well as the risk ratios and response functions and the resoultion of these products. For example, Evans et al. (2013) also use satellite-based concentrations (using MISR/MODIS Collection 5 and GEOS-Chem), but use a different concentration response function… In the following sections, we delineate the uncertainty in these results and reasons for differences from previous studies.
4 Uncertainty in Satellite-based PM2.5
Uncertainties in the PM2.5 concentrations derived from satellite observations arise from the two pieces of information which inform this estimate: (1) satellite AOD and (2) model ƞ. Here we explore some of the limitations and uncertainties associated with each of these inputs.
4.1 Uncertainty Associated with Satellite AOD
While satellite observations of aerosols are often used for model validation (e.g. Ford and Heald, 2012), these are indirect measurements with their own limitations and errors. The uncertainty in satellite AOD can be due to a variety of issues such as the presence of clouds, the choice of optical model used in the retrieval algorithm, and surface properties (Toth et al., 2014; Zhang and Reid, 2006). For validation of satellite products, studies have often relied on comparisons against AOD measured with sun photometers at AERONET ground sites (e.g. Kahn et al., 2005; Levy et al., 2010; Remer et al., 2005, 2008; Zhang and Reid, 2006). The uncertainty in AOD over land from MODIS is estimated as 0.05 ± 15% (Remer et al., 2005), while Kahn et al. (2005) suggest that 70% of MISR AOD data are within 0.05 (or 20 % ×AOD) of AERONET AOD.
There are also discrepancies between AOD measured by the different instruments due to different observational scenarios and instrument design. The Aqua platform has an afternoon overpass while the Terra platform has a morning overpass. It might be expected that there would be some differences in retrieved AOD associated with diurnal variations in aerosol loading. However, the difference of 0.015 in the globally averaged AOD between MODIS onboard Terra and Aqua (Collection 5), although within the uncertainty range of the retrieval, is primarily attributed to uncertainties and a drift in the calibration of the Terra instrument, noted in Zhang and Reid (2010) and Levy et al. (2010). Collection 6 (as will be discussed further) reduces the AOD divergence between the two instruments (Levy et al., 2013). MISR employs a different multi-angle measurement technique with a smaller swath width; as a result the correlation between MISR AOD and MODIS AOD is only 0.7 over land (0.9 over ocean) (Kahn et al., 2005).
Not only are there discrepancies in AOD between instruments, there are also differences between product versions for the same instrument. The MODIS Collection 6 Level 2 AOD is substantially different from Collection 5.1 (Levy et al., 2013) and Figure 6. In general, AOD decreases over land and increases over ocean with Collection 6. These changes are due to a variety of algorithm updates including better detection of thin cirrus clouds, a wind speed correction, a cloud mask that now allows heavy smoke retrievals, better assignments of aerosol types, and updates to the Rayleigh optical depths and gas absorption corrections (Levy et al., 2013). These differences can also impact the derived PM2.5 (and can explain some differences between our results and previous studies). In particular, because Collection 6 suggests higher AOD over many of the urbanized regions, the derived PM2.5 and resulting exposure estimates (all other variables constant) are greater. The difference between these two retrieval products, given the same set of radiance measurements from the same platform, gives a sense of the uncertainty in the satellite AOD product (Figure 6a).
Figure 6.

(a) Percent difference between annual mean AOD from MODIS Collection 6 and Collection 5 and (b) simulated bias in satellite-derived annual average surface PM2.5 associated with satellite sampling.
We estimate the uncertainty in satellite AOD used here by comparing satellite observations to AERONET and determining the normalized mean bias (NMB) between AOD from each satellite instrument and AERONET for the U.S. and China (Figure 7). Although there a very limited number of sites in China, from these comparisons, we find that the satellites generally agree with AERONET better in the eastern U.S. and northeastern China than in the western U.S. and western and southeastern China. There are larger biases in the west near deserts and at coastal regions where it may be challenging to distinguish land and water in the retrieval algorithm. NMBs at each AERONET site are generally similar among the instruments (MISR comparison not shown), with greater differences at these western sites. While Collection 6 does reduce the bias at several sites along the East Coast in the U.S., it is generally more biased at the Four Corners region of the U.S. We use these NMBs to regionally “bias correct” our AOD values and estimate the associated range of uncertainty in our premature mortality estimates. Compared to the standard MODIS AOD retrieval uncertainty, our overall NMB is less in the eastern U.S. (−1%) and western China (11%) and higher in the western U.S. (40%) and eastern China (18%).
Figure 7.

Normalized mean bias in AOD between MODIS-Aqua Collection 6 and AERONET sites for (a) the U.S. and (b) China.
There may also be biases associated with the satellite sampling, should concentrations on days with available observations be skewed. In order to assess the sampling bias, we use the model and compare the annual mean to the mean of days with valid observations (Figure 6b). In general, sampling leads to an underestimation in AOD (average of 20% over the U.S.). This can partly be attributed to the presence of high aerosol concentrations below or within clouds which cannot be detected by the satellite, the mistaken identification of high aerosol loading as cloud in retrieval algorithms, as well as the removal of anomalously high AOD values (>2.0) from the observational record. This suggests that the average AOD values can also be influenced by the chosen filtering and data quality standards. Analysis of the impact of satellite data quality on the AOD to PM2.5 relationship is discussed in Toth et al. (2014). They find that using higher quality observations does tend to improve correlations between observed AOD and surface PM2.5 across the U.S. though in general correlations are low (<0.55).
4.2 Uncertainty Associated with Model ƞ
In general, the model simulates PM2.5 well (Figure 4) and represents important processes; but, satellite AOD can help to constrain these estimates to better represent measured concentrations (van Donkelaar et al., 2006). However, in specific regions or periods of time, errors in ƞ could lead to discrepancies between satellite-derived and actual surface mass and Snider et al. (2015) does show some regional biases in the GEOS-Chem model ƞ compared to ƞ determined from collocated surface measurements of AOD and PM2.5. In order to assess the potential uncertainty in model-based ƞ, we perform multiple sensitivity tests to determine the impact that different aerosol properties, grid-size resolution and time scales will have on ƞ and, ultimately, on the resulting satellite-based PM2.5 (listed in Table 2). These sensitivity tests are performed solely with model output, which can provide a complete spatial and temporal record, and results from the modified simulations are compared to the standard model simulation. We note that these are “errors” only with respect to our baseline simulation; we do not characterize how each sensitivity simulation may be “better” or “worse” compared to true concentrations of surface PM2.5, but rather how different they are from the baseline, thus characterizing the uncertainty in derived PM2.5 resulting from the model estimates of ƞ. We make these comparisons for both the U.S. and China and show results in Figure 8. Because mass concentrations in China are generally much higher, the absolute value of potential errors can also be much greater.
Table 2.
List of model sensitivity tests and descriptions with results shown in Figure 8.
| Sensitivity Test | Description |
|---|---|
| AvgAOD | AOD is held constant through season while Ƞ varies daily. |
| AvgEta | AOD varies daily, while Ƞ is held constant through season. |
| AvgProf | Column mass varies daily, but shape of vertical profile is held constant for season. AOD and Ƞ vary daily but are re-calculated for redistributed mass. |
| AvgRH | AOD and Ƞ vary daily but are re-calculated assuming relative humidity remains constant throughout season. |
| 2×2.5 | Ƞ values are calculated for simulation run at coarser (2°×2.5°) resolution and then regrid to nested resolution (0.5°×0.666°). |
| SO4 | Assume all mass in column is sulfate and recalculate Ƞ. |
| BC | Assume all mass in column is black carbon and recalculate Ƞ. |
| No NO3 | Calculate AOD and Ƞ without the contribution of nitrate. |
Figure 8.

Distribution of normalized mean biases in annual average PM2.5 for grid boxes in different regions of the U.S. (top row) and China (bottom row) determined from sensitivity tests to investigate the uncertainty in η. Sensitivity tests are described (and abbreviations defined) in Table 3.
The timescale of the estimated PM2.5 influences the error metric we choose for this analysis. We use the NMB for estimating error associated with annual PM2.5 exposure (the metric of interest for chronic exposure). This allows for the possibility that day-to-day errors may compensate, resulting in a more generally unbiased annual mean value. The error on any given day of satellite-estimated PM2.5 is likely larger, and not characterized by the NMB used here.
Our first sensitivity tests relate specifically to the methodology. To derive a satellite-based PM2.5 with this method requires model output for every day and that there are valid satellite observations. Running a model can be labor intensive, at the same time there are specific regions and time periods with poor satellite coverage. Therefore, it might be beneficial to be able to use a climatological ƞ or a climatological satellite AOD. To test the importance of daily variability in AOD, we compute daily ƞ values and then solve for daily surface PM2.5 values using a seasonally averaged model simulated AOD (AvgAOD). This mimics the error introduced by using seasonally averaged satellite observations, an attractive proposition to overcome limitations in coverage. This approximation often produces the greatest error (~20% in the U.S. and 0–50% in China) especially in regions where AOD varies more dramatically and specifically where transported layers aloft can significantly increase AOD (Figure 8). For the seasonally averaged ƞ test (AvgEta), we estimate daily PM2.5 values (which are averaged into the annual concentration) from the seasonally averaged ƞ and daily AOD values. As regional ƞ relationships can be more consistent over time than PM2.5 or AOD, this test evaluates the necessity of using daily model output to define the ƞ relationship. The error in the annual average of daily PM2.5 values determined using a seasonally averaged ƞ creates results that are very similar to the error found calculating an annual average of daily PM2.5 values calculated using a seasonally averaged AOD.
The model ƞ also inherently prescribes a vertical distribution of aerosol, which may be inaccurately represented by the model and introduce errors in the satellite-derived PM2.5. Previous studies have shown that an accurate vertical distribution is essential for using AOD to predict surface PM2.5. (e.g. Li et al., 2015; van Donkelaar et al., 2010). We test the importance of the variability of the vertical distribution in the ƞ relationship for predicting surface PM2.5 concentrations by comparing values from the standard simulation against using an ƞ from a seasonally averaged vertical distribution (AvgProf). For this comparison, we allow the column mass loading to vary day-to-day, but we assume that the profile shape does not change (i.e. we re-distribute the simulated mass to the same seasonally averaged vertical profile). We note that this is not the same as assuming a constant ƞ, as relative humidity and aerosol composition are allowed to vary. Additionally, this differs from other studies (van Donkelaar et al., 2010; Ford and Heald, 2013) in that we are not testing the representativeness of the seasonal average profile, but testing the importance of representing the daily variability in the vertical profile. From Figure 7, we see that using a seasonally averaged vertical distribution (AvgProf) can lead to large errors in surface concentrations. Information on how the pollutants are distributed is extremely important because changes in column AOD can be driven by changes in surface mass loading, but also by layers of lofted aerosols that result from production aloft or transport (and changes in the depth of the boundary layer). This is important in areas that are occasionally impacted by transported elevated biomass burning plumes or dust. Large errors often occur in China, especially during the spring when these regions are influenced by transported dust from the Taklamakan and Gobi Deserts (Wang et al., 2008). Southeastern China has the largest NMB due to not only transport from interior China, but also from other countries in Southeast Asia. There is a positive bias in most regions, because on average, most of the aerosol mass is located at the surface; therefore, using an average profile will overpredict the surface concentrations. Similar to the average AOD and ƞ (AvgAOD and AvgEta), average vertical distributions generally overpredict PM2.5 due to the presence of outliers. This stresses the importance of not only getting the mean profile correct, but the necessity of also simulating the variability in the profile on shorter timescales.
We also test the sensitivity of derived PM2.5 to aerosol water uptake. This is done by recalculating ƞ using a seasonally averaged relative humidity (RH) profile (AvgRH). This generally reduces the seasonally averaged AOD (less water uptake) in every season (because hygroscopic growth of aerosols is non-linear with RH). This leads to an overestimate of ƞ that when applied to the AOD values from the standard simulation and generally overestimates surface PM2.5 in regions with potentially higher RH and more hygroscopic aerosols (eastern U.S. and eastern China). This is because, for the same AOD, a higher ƞ value would suggest more mass at the surface in order to compensate for optically smaller particles aloft. Western China (and some of central China) has a negative bias, suggesting that using a mean relative humidity actually underestimates PM2.5. However, this is because the RH is generally low but can have large variability, and concentrations (outside of the desert regions) are also low so that the NMB may be large although the absolute error is not. A higher resolution model, although more computationally expensive, will likely better represent small scale variability and is better suited for estimating surface air quality. Punger and West (2013) find that coarse resolution models often drastically underestimate exposure in urban areas. We therefore investigate the grid-size dependence of our simulated ƞ. For this, we determine the ƞ values from a simulation running at 2°x 2.5° grid resolution (with the same emission inputs and time period), re-grid these values to the nested grid resolution (0.5°×0.666°) and solve for the derived PM2.5 concentrations using the AOD values from the nested simulation (noted as 2×2.5 in Figure 8). From Figure 8, we see larger discrepancies in regions which are dominated by more spatially variable emissions (Northeastern U.S. and China) rather than areas with broad regional sources (Southeastern U.S.). This is line with Punger and West (2013) who show smaller differences due to resolution in estimated premature mortality due to PM2.5 exposure in rural areas than in urban areas. Compared to the other sensitivity tests, using the coarser grid leads to mean errors of only 10–15% in the U.S. and in China, which suggests that spatially averaged ƞ are potentially more useful than temporally averaged ƞ for constraining surface PM2.5. Thompson and Selin (2012) and Thompson et al. (2014) show that coarse grids can overpredict pollutant concentrations and consequently health impacts, but using very fine grids does not significantly decrease the error in simulated concentrations compared to observations. This effect was more pronounced with ozone. Additionally, their coarsest grid resolution is 36 km which they compare to results at 2, 4, and 12 km. Punger and West (2013) compare health impacts at a variety of resolutions out to several 400 km and show that coarser resolutions underestimate health impacts because concentrations are diluted over larger areas rather than high concentrations being co-located with large urban populations.
The GEOS-Chem simulation of surface nitrate aerosol over the U.S. is biased high (Heald et al., 2012). This can be an issue in regions where nitrate has a drastically different vertical profile (or ƞ) from other species. To test how this nitrate bias could impact ƞ and the derived PM2.5, we compute ƞ without nitrate aerosol, and then derive PM2.5 using the standard AOD (No NO3). This is not a large source of potential error (<15%), with only slightly larger errors in winter and in regions where nitrate has a significant high bias (central U.S.). Furthermore, these errors are less than the bias between the model and surface observations of nitrate in the U.S. (1–2 μgm−3 compared to 2–7 μgm−3), suggesting that even though there is a known bias in the model, using satellite observations may largely correct for this by constraining the total AOD when estimating satellite-derived PM2.5. We also did this comparison for China. Measured nitrate concentrations are not widely available for evaluation, but Wang et al. (2014) suggests that model nitrate is also too high in eastern China. The NMB is even less in regions in China (<10%), with negative values in eastern China (where nitrate concentrations are high) and positive values in western and central China (where nitrate concentrations are lower and have less bias compared to observations).
To further explore the role of aerosol composition (and possible mischaracterization in the model), we take the simulated mass concentrations and compute the AOD assuming that the entire aerosol mass is sulfate (SO4 in Figure 8) or, alternatively, hydrophobic black carbon (BC in Figure 8). Black carbon has a high mass extinction efficiency, which is constant with RH given its hydrophobic nature; while sulfate is very hygroscopic, resulting in much higher extinction efficiencies at higher relative humidity values. Overall, assuming that all the mass is sulfate leads to low biases on the order of 15–20% as the AOD in many regions in the U.S. is dominated by inorganics. Errors are largest in regions and seasons with larger contributions of less hygroscopic aerosols (organic carbon and dust) and/or high relative humidity. Assuming the entire aerosol mass is black carbon can lead to greater errors than sulfate because BC has a larger mass extinction at lower relative humidity values and hydrophobic black carbon generally makes up a small fraction of the mass loading in all regions in the U.S. and China. When RH is low, this assumption increases the AOD, which leads to an under prediction in the derived PM2.5. When RH is high, this decreases the AOD and leads to an over prediction in derived PM2.5. The largest percentage changes occur in the southwestern U.S. and western China (~−30%) due to the low relative humidity, low mass concentrations, and large contribution of dust.
We also compare these sensitivity tests on daily timescales. We do not show the results here because we rely on chronic exposure (annual average concentrations) for calculating mortality burdens. The normalized mean biases in annual average concentrations (Figure 8) are generally much less (range of ±20% in U.S. and ±50% in China) than potential random errors in daily values as many of these daily errors cancel out in longer term means. This is the case for our sensitivity tests regarding the vertical profile and relative humidity, which have much larger errors on shorter timescales. However, because our method to test the sensitivity to aerosol type assumes that all aerosol mass is black carbon or sulfate, we introduce a systematic bias that is not significantly reduced in the annual NMB. This highlights the differing potential impacts due to systematic and random errors, which is an important distinction for determining the usefulness of this method. Systematic errors may not be as obvious on short timescales compared to random errors (related to meteorology and/or representation of plumes) that can lead to large biases in daily concentrations. However, these random errors have less impact when we examine annual average concentrations and mortality burdens. Systematic errors, potentially related to sources or processes, may be harder to counteract even on longer timescales and even when the model is constrained by satellite observations. However, we also show that random daily errors can bias the long term mean, stressing the importance of not only correcting regional biases, but also in accurately simulating daily variability.
We translate this potential uncertainty in ƞ to potential uncertainty in mortality estimates determined from the satellite-based PM2.5. We use the normalized mean bias in annual PM2.5 determined from the sensitivity tests for RH, the vertical profile, grid resolution, and aerosol composition for each grid box and then use these values to “bias correct” our satellite-based annual PM2.5 concentrations and re-calculate exposure (shown in Figure 5) and mortality (discussed in Section 6). From Figure 5, we see that the uncertainty in ƞ, when translated to an annual exposure level, are larger than the differences in exposure levels estimated from model and satellite-based PM2.5, suggesting that satellite-based products which rely strongly on the model or which do not account for the variability in the aforementioned variables, does not necessarily provide a definitively better estimate of exposure. Secondly, these uncertainties in many regions are greater than the difference between both the model and surface PM2.5 and the satellite-based and surface observations. While these comparisons are limited spatially and temporally, this highlights that constraining the model with the satellite observations can improve estimates of PM2.5 but there remains a large amount of uncertainty in these estimates.
4.3 Selection of concentration response function and relative risk
The choice of the shape of the concentration response function (CRF) and relative risk ratio value explains much of the difference in burden estimated in different studies listed in Table 1. In general, it is difficult to determine risks at the population level and studies have found that using ambient concentrations tend to under predict health effects (e.g. Hubbell et al., 2009). However, personal monitoring is costly and time-intensive, and therefore, epidemiology studies generally rely on determining population-level concentration response functions rather than personal-level exposure responses. However, populations also respond differently; and therefore the shape and magnitude of this response varies among studies.
For an initial metric of the uncertainty in the risk ratios, studies often include estimates generated using the 95% confidence intervals of the RR determined in the original study (as shown in Figure 2). A confidence interval shows the statistical range within which the true PM coefficient for the study population is likely to lie, which could be a single city, region, or population group. The Krewski et al. [2009] study, which is a reanalysis of the American Cancer Society (ACS) Cancer Prevention Study II (CPS-II), included 1.2 million people in the Los Angeles and New York City regions, whereas the Laden et al. (2006) study, an extended analysis of the Harvard Six Cities Studies, included 8,096 white participants. Using just these confidence intervals as a measure of uncertainty suggests that there exists a large range of uncertainty in population-level health responses to exposure and caution should be exercised when attempting to transfer these values beyond the population from which they were determined in order to estimate national-level mortality burdens based on ambient concentrations. The IER coefficients from Burnett et al. (2014) are generated using the risk ratios, threshold values, and confidence intervals from previous studies and therefore also provide a large range in premature mortality estimates. To depict this range, we also include the 5th and 95th percentile estimates in addition to the mean estimate and show the maximum value in our sensitivity tests.
To test the impact of methodological choices associated with the burden calculation, we compare results using different concentration response functions and relative risk ratios that previous studies have used. Table 3 lists the different choices that we explore regarding the CRF and relative risk, the study that used these values, and the resulting percent change in burden compared to our initial estimates using the IER from Burnett et al. (2014). In particular we compare our results using risk ratio values from Krewski et al. (2009), Pope et al. (2002) and Laden et al. (2006), and log-linear and power law relationships. Figure 9 shows that the largest difference in burden is associated with using the higher risk ratios from Laden et al. (2006) vs. using Krewski et al. (2009) or the mean estimates determined using the IER coefficients from Burnett et al. (2014), the former suggest a much greater mortality response to PM2.5 exposure.
Table 3.
Input for premature mortality burden estimate sensitivity tests and the resulting percent change in mortality due to chronic exposure determined from satellite-based concentrations. Parentheses are for values determined from model simulated concentrations.
| RR source | Threshold | CRF shape | % Change USA | % Change China | Study using method | In Fig. 8 |
|---|---|---|---|---|---|---|
| Krewski et al., 2009 | No | Linear | base | base | K-L | |
| Krewski et al., 2009 | lowest measured level | Linear | −56 (−62) | −8 (−8) | Anenberg et al., 2010 | K-L-LM |
| Krewski et al., 2009 | counterfactual | Linear | −38 (−43) | −6 (−5) | K-L-CF | |
| Krewski et al., 2009 | policy relevant background | Linear | −10 (−11) | K-L-PRB | ||
| Krewski et al., 2009 | No | Linear to 30 ugm−3 | 0 (0) | −30 (−33) | Anenberg et al., 2010 | K-L-30 |
| Krewski et al., 2009 | No | Linear to 50 ugm−3 | 0 (0) | −4 (−8) | K-L-50 | |
| Krewski et al., 2009 | lowest measured level | Log-Linear (Eq. 6) | −23 (−30) | −13 (−15) | Evans et al., 2013; Fann and Risley, 2013; U.S. EPA, 2010 | K-LL-LML |
| Krewski et al., 2009 | counterfactual | Log-Linear (Eq. 6) | 28 (23) | −2 (−4) | K-LL-CF | |
| Krewski et al., 2009 | No | Log-Linear (Eq. 5) | 10 (10) | 26 (26) | K-LL | |
| Krewski et al., 2009 | lowest measured level | Log-Linear (Eq. 5) | −53 (−59) | 14 (16) | U.S. EPA, 2010 | K-LL-LML |
| Pope et al., 2002 | No | Power Law | 44 (52) | −17 (−18) | Marlier et al., 2013 | K-PL |
| Pope et al., 2002 | lowest measured level | Power Law | −8 (−8) | −44 (−46) | K-PL-LML | |
| Pope et al., 2002 | policy relevant background | Power Law | 21 (35) | K-L-PRB | ||
| Laden et al., 2006 | No | Linear | 105 (108) | 47 (45) | L-L | |
| Laden et al., 2006 | lowest measured level | Linear | −14 (−24) | 38 (36) | Anenburg et al., 2010; U.S. EPA, 2010 | L-L-LML |
| Laden et al., 2006 | counterfactual | Linear | 32 (24) | 51 (49) | L-L-CF | |
| Burnett et al., 2014 | fitted | IER | base | base | Lim et al., 2012; Zheng et al., 2014; Lelieveld et al., 2015 | B-IER |
| Burnett et al., 2014 | fitted | IER | 167 (167) | 65 (64) | *maximum value detemined from set of coefficeints | B-IERmax |
| Krewski et al., 2009 | Lowest measured level (5.8 μg m−3) | Equation 5 | 18(15) | 18 (21) | Evans et al., 2013; Lelieveld et al., 2013 | K-L5.8 |
| Krewski et al., 2009 | Lowest measured level (5.8 μg m−3), ceiling (30 μg m−3) | Equation 5 | 18(15) | −24 (−26) | Anenberg et al., 2010 | K-Lc30 |
| Krewski et al., 2009 | Lowest measured level (5.8 μg m−3), ceiling (50 μg m−3) | Equation 5 | 90 (93) | 6 (4) | Cohen et al., 2004 | K-Lc50 |
| Krewski et al., 2009 | Lowest measured level (5.8 μg m−3) | Equation 6 | 143 (167) | −6 (−7) | Evans et al., 2013 | K-LL5.8 |
| Krewski et al., 2009 | Policy Relevant Background | Equation 5 | 134 (158) | U.S. EPA, 2010 | K-LPR | |
| Krewski et al., 2009 | No threshold | Equation 5 | 169 (200) | 52 (55) | Silva et al., 2013 | K-L0 |
| Pope et al., 2002 | Lowest measured level (5.8 μg m−3), ceiling (30 μg m−3) | Power Law | 134 (158) | −26 (−28) | Marlier et al., 2013 | P-PL5.8c30 |
| Pope et al., 2002 | Lowest measured level (7.5 μg m−3) | Power Law | 102 (105) | −15 (−15) | Pope et al., 2002 | P-PL7.5 |
| Laden et al., 2006 | lowest measured level (10 μg m−3) | Equation 5 | −58 (−68) | 126(130) | Anenburg et al., 2010; U.S. EPA, 2010 | L-L10 |
| Laden et al., 2006 | lowest measured level (10 μg m−3) | Equation 5 | 239 (275) | U.S. EPA, 2010 | L-LPR | |
| Laden et al., 2006 | lowest measured level (10 μg m−3); ceiling (30 μg m−3) | Equation 5 | 38 (33) | Anenburg et al., 2010 | L-Lc30 | |
| Pope et al., 2002 | lowest measured level (7.5 μg m−3) | Equation 5 | −55 (−58) | −25 (−27) | P-L7.5 | |
| Pope et al., 2002 | lowest measured level (7.5 μg m−3) | Equation 6 | −29 (−28) | 1 (1) | P-LL7.5 |
Figure 9.

Premature mortality estimates for (a) the U.S. and (b) China determined using different RR, CRFs, and threshold/ceiling values, as described in Table 3. Colors represent cause of death estimated using PM2.5 concentrations from unconstrained model simulations (solid) and satellite-based estimates (hatched).
Our estimates of Section 3 also use the same relative risk values for every location. However, studies have found that different populations have varied responses to exposure (potential for “effect modification”) (Dominici et al., 2003). One of the main uncertainties in these methods is relying on risk ratios that are primarily determined from epidemiology studies conducted in the United States, which may not represent the actual risks for populations in China. Long-term epidemiology studies examining exposure to PM2.5 across broad regions of China are scarce, but studies using acute exposure to PM2.5 or chronic exposure to PM10 or total suspended particles have suggested lower exposure-response coefficients than determined by studies conducted in the U.S. and Europe (Aunan and Pan, 2004; Chen et al., 2013b; Shang et al., 2013), indicating that assessments which use CRFs from studies conducted in the U.S. might overestimate the health effects in China.
We also explore using different “threshold” values. The IER function uses threshold values between 5.8 μgm−3 and 8.8 μgm−3. In the U.S., higher threshold values can significantly reduce burden estimates. When we compare sensitivity tests that use the same CRF (Krewski et al., 2009) but with a a regional PRB concentration instead of the lowest measured level (5.8 μgm−3), the premature mortality estimates are reduced, suggesting that the choice of this value is very important in the U.S. where annual mean concentrations are relatively low. However, in China these threshold values have less impact on our results because annual mean concentrations are high enough that subtracting a threshold makes little difference. Conversely, using a ceiling value of 30 μgm−3 or 50 μgm−3 produces no difference in the U.S. (0% of the population experiences annual concentration values greater than 30 μgm−3), while strongly reducing burden estimates in China.
We also see that the shape of the CRF produces different results between the U.S. and China. Using a power law or log-linear (Equation 6) function increase relative risks at low concentrations and decreases risk ratios at high concentrations such that total disease burden estimates increase in the U.S. and decrease in China. In the U.S., a log-linear CRF is almost equivalent to a linear response because of the low concentrations. In general, the shape of the concentration response function is more important at low or very high concentrations.
4.4 Comparison of uncertainty
Figure 10 provides a summary of the different sources of uncertainty discussed here is shown in Figure 10. We show the mortality burdens for respiratory disease, lung cancer and heart disease associated with chronic exposure to ambient PM2.5 and calculated using annual average model-based and “satellite-based” values (from MISR and MODIS) for both the U.S. and China. We show here that the satellite-based estimates suggest slightly higher national burdens in the U.S. and slightly lower in China. However, our values using these different annual average concentrations fall within the range of values found in the literature (Table 1).
Figure 10.

Burden of mortality due to outdoor exposure to fine particulate matter as determined in previous studies (Table 1, gray bars with values from individual studies designated by black lines), calculated using model (GEOS-Chem, solid) and satellite-based (hatched) annual concentrations (colored by disease, whiskers denote 5th and 95th percentile estimates generated using the Burnett et al., 2014 coefficients). The uncertainty range on the MODIS-based estimates due to satellite AOD (taupe), model ƞ (coral), and CRF (blue) are shown on the right.
We further contrast these estimates to the range in uncertainty associated with our observations and methodology. The difference between the burden calculated using strictly the model or the satellite-based approach is greater than the uncertainty range in the satellite AOD, suggesting that this difference is outside of the scope of measurement limitations and errors. However, the potential uncertainty in the satellite-based estimate due to the conversion from AOD to surface PM2.5 (represented by the model ƞ) is substantially larger, larger even than the difference between model-derived and satellite-derived estimates. Therefore, while constraining the model estimate of PM2.5 by actual observations should improve our health effect estimates, the uncertainty in the required model information may limit the accuracy of this approach. Again, we stress that these are “potential” model uncertainties which may overestimate the true uncertainty in regions where the model accurately represents the composition and distribution of aerosols. We also acknowledge that we have investigated a limited set of factors; additional biases may exacerbate these uncertainties. However, adding additional observational data and model estimates can also help to better constrain these satellite-based PM2.5 estimates (Brauer et al., 2012, 2015; van Donkelaar et al., 2015a, 2015b).
Figure 10 also conveys the range in mortality estimates for the U.S. and China that can result from varying choices for the risk ratio or shape of the concentration response. While epidemiology studies attempt to statistically account for differences in populations and confounding variables, there is still a large spread in determined risk ratios. Just as important, or perhaps more so than determining ambient concentrations, applying response functions is a determining factor in quantifying the burden of mortality due to outdoor air quality. Differences in exposure estimates can be overshadowed by these different approaches. As an added example, we calculated the mortality burden using only populated places, similar to Lelieveld et al. (2013) and Cohen et al. (2004) and find that for the U.S. this decreased the burden by 13%, (satellite-based, 18% for model). For China, this reduces the burden estimate by 72%. Differences in our estimates here and those found in the literature can be partly attributed to differences in application of the CRF function, along with differences in baseline mortalities and population estimates. Disease burdens estimated in various studies can therefore only be truly compared when the methodology is harmonized.
5 Conclusions
Calculating health burdens is an extremely important endeavor for informing air pollution policy, but literature estimates cover a large range due to differences in methodology regarding both the measurement of ambient concentrations and the health impact assessment. Satellite observations have proved useful in estimating exposure and the resulting health impacts (van Donkelaar et al., 2015b; Yao et al., 2013). However, there remain large uncertainties associated with these satellite measurements and the methods for translating them into surface air quality that needs to be further investigated. Our goal with this work is to explore how mortality burden estimates are made and how choices within this methodology can explain some of these discrepancies. We also aim to provide a context for interpreting the quantification of PM2.5 chronic exposure health burdens.
While we have discussed several potential sources for uncertainty in calculating health burdens with satellite-based PM2.5, there are still a significant number of other sources of uncertainty that we did not explore. There are processes that could impact the AOD to PM2.5 relationship in the model, such as different emissions and removal processes. Additionally, our sensitivity test results are likely partly tied to the spatial resolution of the model and the satellite AOD, and their ability to capture finer spatial variations in pollution in regions with high populations. However, Thompson et al., (2014) suggest that uncertainty in the CRF will likely still have a larger impact than uncertainties in population-weighted concentrations due to model resolution.
Satellite measurements have provided great advancements in monitoring global air quality, providing information in regions with previously few measurements. However, further progress still needs to be made in determining how to characterize exposure to ambient PM2.5 using these satellite observations, especially as they are becoming more widely used in epidemiological studies and health impact assessments. Reducing uncertainty, even at the lower concentrations observed in the U.S., is important if these methods and datasets are to be used for policy assessment or air quality standards. However, as air pollution is a leading environmentally-related cause of premature mortality, the difficulties in applying this data should not negate the importance of this endeavor. Overcoming sampling limitations in satellite observations and better accounting for regional biases could help to reduce the uncertainty in satellite-retrieved AOD and adding additional observational data and model estimates can help to better constrain satellite-based PM2.5 estimates (Brauer et al., 2012, 2015; van Donkelaar et al., 2015a, 2015b). Future geostationary satellites will also be critical to advance this methodology and will provide extremely valuable information for daily monitoring and tracking of air quality. Furthermore, these geostationary observations, in concert with greater surface monitoring, will offer new constraints for epidemiological studies to develop health risk assessments and lessen the uncertainty in applying concentration-response functions and determining health burdens.
Acknowledgments
This work was supported in part by the MIT Center for Environmental Health Science (via NIH grant P30-ES002109). We acknowledge useful discussions with Dr. Jennifer Peel.
References
- Alexander B, Park RJ, Jacob DJ, Li QB, Yantosca RM, Savarino J, Lee CCW, Thiemens MH. Sulfate formation in sea-salt aerosols: Constraints from oxygen isotopes. J Geophys Res. 2005;110(D10):D10307. doi: 10.1029/2004JD005659. [DOI] [Google Scholar]
- Anenberg SC, Horowitz LW, Tong DQ, West JJ. An Estimate of the Global Burden of Anthropogenic Ozone and Fine Particulate Matter on Premature Human Mortality Using Atmospheric Modeling, Environ. Health Perspect. 2010;118(9):1189–1195. doi: 10.1289/ehp.0901220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Apte JS, Marshall JD, Cohen AJ, Brauer M. Addressing Global Mortality from Ambient PM2.5 - Environmental Science & Technology (ACS Publications) Environ Sci Technol. 2015;49(13):8057–8066. doi: 10.1021/acs.est.5b01236. [DOI] [PubMed] [Google Scholar]
- Aunan K, Pan XC. Exposure-response functions for health effects of ambient air pollution applicable for China – a meta-analysis. Sci Total Environ. 2004;329(1–3):3–16. doi: 10.1016/j.scitotenv.2004.03.008. [DOI] [PubMed] [Google Scholar]
- Bibi H, Alam K, Chishtie F, Bibi S, Shahid I, Blaschke T. Intercomparison of MODIS, MISR, OMI, and CALIPSO aerosol optical depth retrievals for four locations on the Indo-Gangetic plains and validation against AERONET data. Atmos Environ. 2015;111:113–126. doi: 10.1016/j.atmosenv.2015.04.013. [DOI] [Google Scholar]
- Brauer M, Amann M, Burnett RT, Cohen A, Dentener F, Ezzati M, Henderson SB, Krzyzanowski M, Martin RV, Van Dingenen R, van Donkelaar A, Thurston GD. Exposure assessment for estimation of the global burden of disease attributable to outdoor air pollution. Environ Sci Technol. 2012;46(2):652–660. doi: 10.1021/es2025752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brauer M, Freedman G, Frostad J, van Donkelaar A, Martin RV, Dentener F, van Dingenen R, Estep K, Amini H, Apte JS, Balakrishnan K, Barregard L, Broday D, Feigin V, Ghosh S, Hopke PK, Knibbs LD, Kokubo Y, Liu Y, Ma S, Morawska L, Sangrador JLT, Shaddick G, Anderson HR, Vos T, Forouzanfar MH, Burnett RT, Cohen A. Ambient Air Pollution Exposure Estimation for the Global Burden of Disease 2013. Environ Sci Technol. 2015 doi: 10.1021/acs.est.5b03709. [DOI] [PubMed] [Google Scholar]
- Burnett RT, Pope CA, III, Ezzati M, Olives C, Lim SS, Mehta S, Shin HH, Singh G, Hubbell B, Brauer M, Anderson HR, Smith KR, Balmes JR, Bruce NG, Kan H, Laden F, Prüss-Ustün A, Turner MC, Gapstur SM, Diver WR, Cohen A. An Integrated Risk Function for Estimating the Global Burden of Disease Attributable to Ambient Fine Particulate Matter Exposure. Environ Health Perspect. 2014 doi: 10.1289/ehp.1307049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng T, Chen H, Gu X, Yu T, Guo J, Guo H. The inter-comparison of MODIS, MISR and GOCART aerosol products against AERONET data over China. J Quant Spectrosc Radiat Transf. 2012;113(16):2135–2145. doi: 10.1016/j.jqsrt.2012.06.016. [DOI] [Google Scholar]
- Chen H, Goldberg MS. The effects of outdoor air pollution on chronic illnesses. McGill J Med MJM. 2009;12(1):58–64. [PMC free article] [PubMed] [Google Scholar]
- Chen W, Zheng R, Zhang S, Zhao P, Li G, Wu L, He J. 2009 The incidences and mortalities of major cancers in China. Chin J Cancer. 2013a;32(3):106–112. doi: 10.5732/cjc.013.10018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen Y, Ebenstein A, Greenstone M, Li H. Evidence on the impact of sustained exposure to air pollution on life expectancy from China’s Huai River policy. Proc Natl Acad Sci. 2013b;110(32):12936–12941. doi: 10.1073/pnas.1300018110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen AJ, Anderson HR, Ostro B, Pandey KD, Krzyzanowski M, Künzli N, Gutschmidt K, Pope CA, III, Romieu I, Samet JM, Smith KR. In: Urban air pollution, in Comparative Quantification of Health Risks: Global and Regional Burden of Disease Attributable to Selected Major Risk Factors. Ezzati M, Lopez AD, Rodgers A, Murray CJL, editors. Vol. 1. World Health Organization; Geneva: 2004. pp. 1353–1434. [Google Scholar]
- Cohen AJ, Ross Anderson H, Ostro B, Pandey KD, Krzyzanowski M, Künzli N, Gutschmidt K, Pope A, Romieu I, Samet JM, Smith K. The global burden of disease due to outdoor air pollution. J Toxicol Environ Health A. 2005;68(13–14):1301–1307. doi: 10.1080/15287390590936166. [DOI] [PubMed] [Google Scholar]
- Cooke WF, Liousse C, Cachier H, Feichter J. Construction of a 1° × 1° fossil fuel emission data set for carbonaceous aerosol and implementation and radiative impact in the ECHAM4 model. J Geophys Res. 1999;104(D18):22137–22162. doi: 10.1029/1999JD900187. [DOI] [Google Scholar]
- Crouse DL, Peters PA, van Donkelaar A, Goldberg MS, Villeneuve PJ, Brion O, Khan S, Atari DO, Jerrett M, Pope CA, Brauer M, Brook JR, Martin RV, Stieb D, Burnett RT. Risk of Nonaccidental and Cardiovascular Mortality in Relation to Long-term Exposure to Low Concentrations of Fine Particulate Matter: A Canadian National-Level Cohort Study. Environ Health Perspect. 2012;120(5):708–714. doi: 10.1289/ehp.1104049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Donkelaar A, Martin RV, Park RJ. Estimating ground-level PM2.5 using aerosol optical depth determined from satellite remote sensing. J Geophys Res Atmospheres. 2006;111(D21) doi: 10.1029/2005JD006996. n/a–n/a. [DOI] [Google Scholar]
- van Donkelaar A, Martin RV, Brauer M, Kahn R, Levy R, Verduzco C, Villeneuve PJ. Global estimates of ambient fine particulate matter concentrations from satellite-based aerosol optical depth: development and application. Environ Health Perspect. 2010;118(6):847–855. doi: 10.1289/ehp.0901623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Donkelaar A, Martin RV, Levy RC, da Silva AM, Krzyzanowski M, Chubarova NE, Semutnikova E, Cohen AJ. Satellite-based estimates of ground-level fine particulate matter during extreme events: A case study of the Moscow fires in 2010. Atmos Environ. 2011;45(34):6225–6232. doi: 10.1016/j.atmosenv.2011.07.068. [DOI] [Google Scholar]
- van Donkelaar A, Martin RV, Spurr RJD, Burnett RT. High-Resolution Satellite-Derived PM2.5 from Optimal Estimation and Geographically Weighted Regression over North America. Environ Sci Technol. 2015a;49(17):10482–10491. doi: 10.1021/acs.est.5b02076. [DOI] [PubMed] [Google Scholar]
- van Donkelaar A, Martin RV, Brauer M, Boys B. Use of Satellite Observations for Long-Term Exposure Assessment of Global Concentrations of Fine Particulate Matter. Environ Health Perspect. 2015b;123:135–143. doi: 10.1289/ehp.1408646. http://dx.doi.org/10.1289/ehp.1408646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evans J, van Donkelaar A, Martin RV, Burnett R, Rainham DG, Birkett NJ, Krewski D. Estimates of global mortality attributable to particulate air pollution using satellite imagery. Environ Res. 2013;120:33–42. doi: 10.1016/j.envres.2012.08.005. [DOI] [PubMed] [Google Scholar]
- Fairlie DT, Jacob DJ, Park RJ. The impact of transpacific transport of mineral dust in the United States. Atmos Environ. 2007;41(6):1251–1266. doi: 10.1016/j.atmosenv.2006.09.048. [DOI] [Google Scholar]
- Fang Y, Naik V, Horowitz LW, Mauzerall DL. Air pollution and associated human mortality: the role of air pollutant emissions, climate change and methane concentration increases from the preindustrial period to present. Atmos Chem Phys. 2013;13(3):1377–1394. doi: 10.5194/acp-13-1377-2013. [DOI] [Google Scholar]
- Fann N, Lamson AD, Anenberg SC, Wesson K, Risley D, Hubbell BJ. Estimating the National Public Health Burden Associated with Exposure to Ambient PM2.5 and Ozone. Risk Anal. 2012;32(1):81–95. doi: 10.1111/j.1539-6924.2011.01630.x. [DOI] [PubMed] [Google Scholar]
- Fann N, Fulcher CM, Baker K. The Recent and Future Health Burden of Air Pollution Apportioned Across U.S. Sectors. Environ Sci Technol. 2013;47(8):3580–3589. doi: 10.1021/es304831q. [DOI] [PubMed] [Google Scholar]
- Ford B, Heald CL. An A-train and model perspective on the vertical distribution of aerosols and CO in the Northern Hemisphere. J Geophys Res. 2012;117(D6):D06211. doi: 10.1029/2011JD016977. [DOI] [Google Scholar]
- Ford B, Heald CL. Aerosol loading in the Southeastern United States: reconciling surface and satellite observations. Atmos Chem Phys. 2013;13(18):9269–9283. doi: 10.5194/acp-13-9269-2013. [DOI] [Google Scholar]
- Forouzanfar MH, Alexander L, Anderson HR, Bachman VF, Biryukov S, Brauer M, Burnett R, Casey D, Coates MM, Cohen A, Delwiche K, Estep K, Frostad JJ, KC A, Kyu HH, Moradi-Lakeh M, Ng M, Slepak EL, Thomas BA, Wagner J, Aasvang GM, Abbafati C, Ozgoren AA, Abd-Allah F, Abera SF, Aboyans V, Abraham B, Abraham JP, Abubakar I, Abu-Rmeileh NME, Aburto TC, Achoki T, Adelekan A, Adofo K, Adou AK, Adsuar JC, Afshin A, Agardh EE, Al Khabouri MJ, Al Lami FH, Alam SS, Alasfoor D, Albittar MI, Alegretti MA, Aleman AV, Alemu ZA, Alfonso-Cristancho R, Alhabib S, Ali R, Ali MK, Alla F, Allebeck P, Allen PJ, Alsharif U, Alvarez E, Alvis-Guzman N, Amankwaa AA, Amare AT, Ameh EA, Ameli O, Amini H, Ammar W, Anderson BO, Antonio CAT, Anwari P, Cunningham SA, Arnlöv J, Arsenijevic VSA, Artaman A, Asghar RJ, Assadi R, Atkins LS, Atkinson C, Avila MA, Awuah B, Badawi A, Bahit MC, Bakfalouni T, Balakrishnan K, Balalla S, Balu RK, Banerjee A, Barber RM, Barker-Collo SL, Barquera S, Barregard L, Barrero LH, Barrientos-Gutierrez T, Basto-Abreu AC, Basu A, Basu S, Basulaiman MO, Ruvalcaba CB, Beardsley J, Bedi N, Bekele T, Bell ML, Benjet C, Bennett DA, et al. Global, regional, and national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks in 188 countries, 1990–2013 a systematic analysis for the Global Burden of Disease Study 2013. The Lancet. 2015;386(10010):2287–2323. doi: 10.1016/S0140-6736(15)00128-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu J, Jiang D, Lin G, Liu K, Wang Q. An ecological analysis of PM2.5 concentrations and lung cancer mortality rates in China. BMJ Open. 2015;5(11):e009452. doi: 10.1136/bmjopen-2015-009452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giannadaki D, Pozzer A, Lelieveld J. Modeled global effects of airborne desert dust on air quality and premature mortality. Atmos Chem Phys. 2014;14(2):957–968. doi: 10.5194/acp-14-957-2014. [DOI] [Google Scholar]
- Guenther A, Karl T, Harley P, Wiedinmyer C, Palmer PI, Geron C. Estimates of global terrestrial isoprene emissions using MEGAN (Model of Emissions of Gases and Aerosols from Nature) Atmos Chem Phys. 2006;6(11):3181–3210. doi: 10.5194/acp-6-3181-2006. [DOI] [Google Scholar]
- Hand JL, Schichtel BA, Pitchford M, Malm WC, Frank NH. Seasonal composition of remote and urban fine particulate matter in the United States. J Geophys Res. 2012;117(D5):D05209. doi: 10.1029/2011JD017122. [DOI] [Google Scholar]
- Heald CL, Collett JL, Lee T, Benedict KB, Schwandner FM, Li Y, Clarisse L, Hurtmans DR, Van Damme M, Clerbaux C, Coheur PF, Pye HOT. Atmospheric ammonia and particulate inorganic nitrogen over the United States. Atmospheric Chem Phys Discuss. 2012;12(8):19455–19498. doi: 10.5194/acpd-12-19455-2012. [DOI] [Google Scholar]
- Henze DK, Seinfeld JH, Ng NL, Kroll JH, Fu TM, Jacob DJ, Heald CL. Global modeling of secondary organic aerosol formation from aromatic hydrocarbons: high- vs. low-yield pathways. Atmos Chem Phys. 2008;8(9):2405–2420. doi: 10.5194/acp-8-2405-2008. [DOI] [Google Scholar]
- Holben BN, Eck TF, Slutsker I, Tanré D, Buis JP, Setzer A, Vermote E, Reagan JA, Kaufman YJ, Nakajima T, Lavenu F, Jankowiak I, Smirnov A. AERONET—A Federated Instrument Network and Data Archive for Aerosol Characterization. Remote Sens Environ. 1998;66(1):1–16. doi: 10.1016/S0034-4257(98)00031-5. [DOI] [Google Scholar]
- Hubbell B, Fann N, Levy JI. Methodological considerations in developing local-scale health impact assessments: balancing national, regional, and local data. Air Qual Atmosphere Health. 2009;2(2):99–110. doi: 10.1007/s11869-009-0037-z. [DOI] [Google Scholar]
- Hudman RC, Jacob DJ, Turquety S, Leibensperger EM, Murray LT, Wu S, Gilliland AB, Avery M, Bertram TH, Brune W, Cohen RC, Dibb JE, Flocke FM, Fried A, Holloway J, Neuman JA, Orville R, Perring A, Ren X, Sachse GW, Singh HB, Swanson A, Wooldridge PJ. Surface and lightning sources of nitrogen oxides over the United States: Magnitudes, chemical evolution, and outflow. J Geophys Res. 2007;112(D12):D12S05. doi: 10.1029/2006JD007912. [DOI] [Google Scholar]
- Hudman RC, Murray LT, Jacob DJ, Millet DB, Turquety S, Wu S, Blake DR, Goldstein AH, Holloway J, Sachse GW. Biogenic versus anthropogenic sources of CO in the United States. Geophys Res Lett. 2008;35(4):L04801. doi: 10.1029/2007GL032393. [DOI] [Google Scholar]
- Hystad P, Demers PA, Johnson KC, Brook J, van Donkelaar A, Lamsal L, Martin R, Brauer M. Spatiotemporal air pollution exposure assessment for a Canadian population-based lung cancer case-control study. Environ Health. 2012;11(1):22. doi: 10.1186/1476-069X-11-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang X, Liu Y, Yu B, Jiang M. Comparison of MISR aerosol optical thickness with AERONET measurements in Beijing metropolitan area. Remote Sens Environ. 2007;107:45–53. [Google Scholar]
- Kahn RA, Gaitley BJ, Martonchik JV, Diner DJ, Crean KA, Holben B. Multiangle Imaging Spectroradiometer (MISR) global aerosol optical depth validation based on 2 years of coincident Aerosol Robotic Network (AERONET) observations. J Geophys Res Atmospheres. 2005;110(D10):D10S04. doi: 10.1029/2004JD004706. [DOI] [Google Scholar]
- Kahn RA, Gaitley BJ, Garay MJ, Diner DJ, Eck TF, Smirnov A, Holben BN. Multiangle Imaging SpectroRadiometer global aerosol product assessment by comparison with the Aerosol Robotic Network. J Geophys Res Atmospheres. 2010;115(D23):D23209. doi: 10.1029/2010JD014601. [DOI] [Google Scholar]
- Krewski D, Jerrett M, Burnett RT, Ma R, Hughes E, Shi Y, Turner MC, Pope CA, 3rd, Thurston G, Calle EE, Thun MJ, Beckerman B, DeLuca P, Finkelstein N, Ito K, Moore DK, Newbold KB, Ramsay T, Ross Z, Shin H, Tempalski B. Extended follow-up and spatial analysis of the American Cancer Society study linking particulate air pollution and mortality. Res Rep Health Eff Inst. 2009;(140):5–114. discussion 115–136. [PubMed] [Google Scholar]
- Laden F, Schwartz J, Speizer FE, Dockery DW. Reduction in Fine Particulate Air Pollution and Mortality. Am J Respir Crit Care Med. 2006;173(6):667–672. doi: 10.1164/rccm.200503-443OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leibensperger EM, Mickley LJ, Jacob DJ, Chen WT, Seinfeld JH, Nenes A, Adams PJ, Streets DG, Kumar N, Rind D. Climatic effects of 1950–2050 changes in US anthropogenic aerosols – Part 1: Aerosol trends and radiative forcing. Atmos Chem Phys. 2012;12(7):3333–3348. doi: 10.5194/acp-12-3333-2012. [DOI] [Google Scholar]
- Lelieveld J, Barlas C, Giannadaki D, Pozzer A. Model calculated global, regional and megacity premature mortality due to air pollution. Atmos Chem Phys. 2013;13(14):7023–7037. doi: 10.5194/acp-13-7023-2013. [DOI] [Google Scholar]
- Lelieveld J, Evans JS, Fnais M, Giannadaki D, Pozzer A. The contribution of outdoor air pollution sources to premature mortality on a global scale. Nature. 2015;525(7569):367–371. doi: 10.1038/nature15371. [DOI] [PubMed] [Google Scholar]
- Levy RC, Remer LA, Kleidman RG, Mattoo S, Ichoku C, Kahn R, Eck TF. Global evaluation of the Collection 5 MODIS dark-target aerosol products over land. Atmospheric Chem Phys. 2010;10(21):10399–10420. doi: 10.5194/acp-10-10399-2010. [DOI] [Google Scholar]
- Levy RC, Mattoo S, Munchak LA, Remer LA, Sayer AM, Patadia F, Hsu NC. The Collection 6 MODIS aerosol products over land and ocean. Atmos Meas Tech. 2013;6(11):2989–3034. doi: 10.5194/amt-6-2989-2013. [DOI] [Google Scholar]
- Li J, Carlson BE, Lacis AA. How well do satellite AOD observations represent the spatial and temporal variability of PM2.5 concentration for the United States? Atmos Environ. 2015;102:260–273. doi: 10.1016/j.atmosenv.2014.12.010. [DOI] [Google Scholar]
- Lim SS, Vos T, Flaxman AD, Danaei G, Shibuya K, Adair-Rohani H, Amann M, Anderson HR, Andrews KG, Aryee M, Atkinson C, Bacchus LJ, Bahalim AN, Balakrishnan K, Balmes J, Barker-Collo S, Baxter A, Bell ML, Blore JD, Blyth F, Bonner C, Borges G, Bourne R, Boussinesq M, Brauer M, Brooks P, Bruce NG, Brunekreef B, Bryan-Hancock C, Bucello C, Buchbinder R, Bull F, Burnett RT, Byers TE, Calabria B, Carapetis J, Carnahan E, Chafe Z, Charlson F, Chen H, Chen JS, Cheng ATA, Child JC, Cohen A, Colson KE, Cowie BC, Darby S, Darling S, Davis A, Degenhardt L, Dentener F, Des Jarlais DC, Devries K, Dherani M, Ding EL, Dorsey ER, Driscoll T, Edmond K, Ali SE, Engell RE, Erwin PJ, Fahimi S, Falder G, Farzadfar F, Ferrari A, Finucane MM, Flaxman S, Fowkes FGR, Freedman G, Freeman MK, Gakidou E, Ghosh S, Giovannucci E, Gmel G, Graham K, Grainger R, Grant B, Gunnell D, Gutierrez HR, Hall W, Hoek HW, Hogan A, Hosgood HD, Hoy D, Hu H, Hubbell BJ, Hutchings SJ, Ibeanusi SE, Jacklyn GL, Jasrasaria R, Jonas JB, Kan H, Kanis JA, Kassebaum N, Kawakami N, Khang YH, Khatibzadeh S, Khoo JP, Kok C, et al. A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012;380(9859):2224–2260. doi: 10.1016/S0140-6736(12)61766-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y, Park RJ, Jacob DJ, Li Q, Kilaru V, Sarnat JA. Mapping annual mean ground-level PM2.5 concentrations using Multiangle Imaging Spectroradiometer aerosol optical thickness over the contiguous United States. J Geophys Res Atmospheres. 2004;109(D22):D22206. doi: 10.1029/2004JD005025. [DOI] [Google Scholar]
- Liu Y, Sarnat JA, Kilaru V, Jacob DJ, Koutrakis P. Estimating ground-level PM2.5 in the eastern United States using satellite remote sensing. Environ Sci Technol. 2005;39(9):3269–3278. doi: 10.1021/es049352m. [DOI] [PubMed] [Google Scholar]
- Liu Y, Koutrakis P, Kahn R. Estimating fine particulate matter component concentrations and size distributions using satellite-retrieved fractional aerosol optical depth: part 1–method development. J Air Waste Manag Assoc 1995. 2007;57(11):1351–1359. doi: 10.3155/1047-3289.57.11.1351. [DOI] [PubMed] [Google Scholar]
- Malm WC, Sisler JF, Huffman D, Eldred RA, Cahill TA. Spatial and seasonal trends in particle concentration and optical extinction in the United States. J Geophys Res Atmospheres. 1994;99(D1):1347–1370. doi: 10.1029/93JD02916. [DOI] [Google Scholar]
- Marlier ME, DeFries RS, Voulgarakis A, Kinney PL, Randerson JT, Shindell DT, Chen Y, Faluvegi G. El Nino and health risks from landscape fire emissions in southeast Asia. Nat Clim Change. 2013;3(2):131–136. doi: 10.1038/nclimate1658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park RJ, Jacob DJ, Chin M, Martin RV. Sources of carbonaceous aerosols over the United States and implications for natural visibility. J GEOPHYS RES. 2003;108:2002–3190. [Google Scholar]
- Park RJ, Jacob DJ, Field BD, Yantosca RM, Chin M. Natural and transboundary pollution influences on sulfate-nitrate-ammonium aerosols in the United States: Implications for policy. J Geophys Res Atmospheres. 2004;109(D15) doi: 10.1029/2003JD004473. n/a–n/a. [DOI] [Google Scholar]
- Pope CA, Dockery DW. Air pollution and life expectancy in China and beyond. Proc Natl Acad Sci. 2013;110(32):12861–12862. doi: 10.1073/pnas.1310925110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pope CA, Bates DV, Raizenne ME. Health effects of particulate air pollution: time for reassessment? Environ Health Perspect. 1995;103(5):472–480. doi: 10.1289/ehp.95103472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pope CA, Burnett RT, Turner MC, Cohen A, Krewski D, Jerrett M, Gapstur SM, Thun MJ. Lung Cancer and Cardiovascular Disease Mortality Associated with Ambient Air Pollution and Cigarette Smoke: Shape of the Exposure-Response Relationships. Environ Health Perspect. 2011;119(11):1616–1621. doi: 10.1289/ehp.1103639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pope CA, Cropper M, Coggins J, Cohen A. Health benefits of air pollution abatement policy: Role of the shape of the concentration–response function. J Air Waste Manag Assoc. 2015;65(5):516–522. doi: 10.1080/10962247.2014.993004. [DOI] [PubMed] [Google Scholar]
- Pope CA, 3rd, Burnett RT, Thun MJ, Calle EE, Krewski D, Ito K, Thurston GD. Lung cancer, cardiopulmonary mortality, and long-term exposure to fine particulate air pollution. JAMA J Am Med Assoc. 2002;287(9):1132–1141. doi: 10.1001/jama.287.9.1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pope CA, 3rd, Burnett RT, Krewski D, Jerrett M, Shi Y, Calle EE, Thun MJ. Cardiovascular mortality and exposure to airborne fine particulate matter and cigarette smoke: shape of the exposure-response relationship. Circulation. 2009;120(11):941–948. doi: 10.1161/CIRCULATIONAHA.109.857888. [DOI] [PubMed] [Google Scholar]
- Punger EM, West JJ. The effect of grid resolution on estimates of the burden of ozone and fine particulate matter on premature mortality in the USA, Air Qual. Atmosphere Health. 2013;6(3):563–573. doi: 10.1007/s11869-013-0197-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qi Y, Ge J, Huang J. Spatial and temporal distribution of MODIS and MISR aerosol optical depth over northern China and comparison with AERONET. Chin Sci Bull. 2013;58(20):2497–2506. doi: 10.1007/s11434-013-5678-5. [DOI] [Google Scholar]
- Remer LA, Kaufman YJ, Tanré D, Mattoo S, Chu DA, Martins JV, Li RR, Ichoku C, Levy RC, Kleidman RG, Eck TF, Vermote E, Holben BN. The MODIS Aerosol Algorithm, Products, and Validation. J Atmospheric Sci. 2005;62(4):947–973. doi: 10.1175/JAS3385.1. [DOI] [Google Scholar]
- Remer LA, Kleidman RG, Levy RC, Kaufman YJ, Tanré D, Mattoo S, Martins JV, Ichoku C, Koren I, Yu H, Holben BN. Global aerosol climatology from the MODIS satellite sensors. J Geophys Res Atmospheres. 2008;113(D14):D14S07. doi: 10.1029/2007JD009661. [DOI] [Google Scholar]
- Ridley DA, Heald CL, Ford B. North African dust export and deposition: A satellite and model perspective. J Geophys Res. 2012;117(D2):D02202. doi: 10.1029/2011JD016794. [DOI] [Google Scholar]
- Rohde RA, Muller RA. Air Pollution in China: Mapping of Concentrations and Sources. PLoS ONE. 2015;10(8):e0135749. doi: 10.1371/journal.pone.0135749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schiferl LD, Heald CL, Nowak JB, Holloway JS, Neuman JA, Bahreini R, Pollack IB, Ryerson TB, Wiedinmyer C, Murphy JG. An investigation of ammonia and inorganic particulate matter in California during the CalNex campaign. J Geophys Res Atmospheres. 2014;119(4):2013JD020765. doi: 10.1002/2013JD020765. [DOI] [Google Scholar]
- Shang Y, Sun Z, Cao J, Wang X, Zhong L, Bi X, Li H, Liu W, Zhu T, Huang W. Systematic review of Chinese studies of short-term exposure to air pollution and daily mortality. Environ Int. 2013;54:100–111. doi: 10.1016/j.envint.2013.01.010. [DOI] [PubMed] [Google Scholar]
- Sheppard L, Burnett RT, Szpiro AA, Kim SY, Jerrett M, Pope CA, Brunekreef B. Confounding and exposure measurement error in air pollution epidemiology. Air Qual Atmosphere Health. 2012;5(2):203–216. doi: 10.1007/s11869-011-0140-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi L, Zanobetti A, Kloog I, Coull BA, Koutrakis P, Melly SJ, Schwartz JD. Low-Concentration PM2.5 and Mortality: Estimating Acute and Chronic Effects in a Population-Based Study. Environ Health Perspect. 2015 doi: 10.1289/ehp.1409111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silva RA, West JJ, Zhang Y, Anenberg SC, Lamarque JF, Shindell DT, Collins WJ, Dalsoren S, Faluvegi G, Folberth G, Horowitz LW, Nagashima T, Naik V, Rumbold S, Skeie R, Sudo K, Takemura T, Bergmann D, Cameron-Smith P, Cionni I, Doherty RM, Eyring V, Josse B, MacKenzie IA, Plummer D, Righi M, Stevenson DS, Strode S, Szopa S, Zeng G. Global premature mortality due to anthropogenic outdoor air pollution and the contribution of past climate change. Environ Res Lett. 2013;8(3):034005. doi: 10.1088/1748-9326/8/3/034005. [DOI] [Google Scholar]
- Snider G, Weagle CL, Martin RV, van Donkelaar A, Conrad K, Cunningham D, Gordon C, Zwicker M, Akoshile C, Artaxo P, Anh NX, Brook J, Dong J, Garland RM, Greenwald R, Griffith D, He K, Holben BN, Kahn R, Koren I, Lagrosas N, Lestari P, Ma Z, Vanderlei Martins J, Quel EJ, Rudich Y, Salam A, Tripathi SN, Yu C, Zhang Q, Zhang Y, Brauer M, Cohen A, Gibson MD, Liu Y. SPARTAN: a global network to evaluate and enhance satellite-based estimates of ground-level particulate matter for global health applications. Atmos Meas Tech. 2015;8(1):505–521. doi: 10.5194/amt-8-505-2015. [DOI] [Google Scholar]
- Streets DG, Zhang Q, Wang L, He K, Hao J, Wu Y, Tang Y, Carmichael GR. Revisiting China’s CO emissions after the Transport and Chemical Evolution over the Pacific (TRACE-P) mission: Synthesis of inventories, atmospheric modeling, and observations. J Geophys Res Atmospheres. 2006;111(D14):D14306. doi: 10.1029/2006JD007118. [DOI] [Google Scholar]
- Sun Q, Hong X, Wold LE. Cardiovascular Effects of Ambient Particulate Air Pollution Exposure. Circulation. 2010;121(25):2755–2765. doi: 10.1161/CIRCULATIONAHA.109.893461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thompson TM, Selin NE. Influence of air quality model resolution on uncertainty associated with health impacts. Atmos Chem Phys. 2012;12(20):9753–9762. doi: 10.5194/acp-12-9753-2012. [DOI] [Google Scholar]
- Thompson TM, Saari RK, Selin NE. Air quality resolution for health impact assessment: influence of regional characteristics. Atmos Chem Phys. 2014;14(2):969–978. doi: 10.5194/acp-14-969-2014. [DOI] [Google Scholar]
- Toth TD, Zhang J, Campbell JR, Hyer EJ, Reid JS, Shi Y, Westphal DL. Impact of data quality and surface-to-column representativeness on the PM2.5 / satellite AOD relationship for the contiguous United States. Atmos Chem Phys. 2014;14(12):6049–6062. doi: 10.5194/acp-14-6049-2014. [DOI] [Google Scholar]
- U.S. Environmental Protection Agency. Risk Assessment to Support the Review of the PM Primary National Ambient Air Quality Standards (External Reivew Draft) U.S. EPA; Research Triangle Park, NC: 2009. [Google Scholar]
- U.S. Environmental Protection Agency. Quantitative health risk assessment for particulate matter. U.S. Environmental Protection Agency; Research Triangle Park, NC: 2010. [Google Scholar]
- Villeneuve PJ, Weichenthal SA, Crouse D, Miller AB, To T, Martin RV, van Donkelaar A, Wall C, Burnett RT. Long-term Exposure to Fine Particulate Matter Air Pollution and Mortality Among Canadian Women. Epidemiol Camb Mass. 2015;26(4):536–545. doi: 10.1097/EDE.0000000000000294. [DOI] [PubMed] [Google Scholar]
- Wang X, Huang J, Ji M, Higuchi K. Variability of East Asia dust events and their long-term trend. Atmos Environ. 2008;42(13):3156–3165. doi: 10.1016/j.atmosenv.2007.07.046. [DOI] [Google Scholar]
- Wang Y, Zhang QQ, He K, Zhang Q, Chai L. Sulfate-nitrate-ammonium aerosols over China: response to 2000–2015 emission changes of sulfur dioxide, nitrogen oxides, and ammonia. Atmos Chem Phys. 2013;13(5):2635–2652. doi: 10.5194/acp-13-2635-2013. [DOI] [Google Scholar]
- Wang Y, Zhang Q, Jiang J, Zhou W, Wang B, He K, Duan F, Zhang Q, Philip S, Xie Y. Enhanced sulfate formation during China’s severe winter haze episode in January 2013 missing from current models. J Geophys Res Atmospheres. 2014;119(17):2013JD021426. doi: 10.1002/2013JD021426. [DOI] [Google Scholar]
- van der Werf GR, Randerson JT, Giglio L, Collatz GJ, Kasibhatla PS, Arellano AF., Jr Interannual variability in global biomass burning emissions from 1997 to 2004. Atmos Chem Phys. 2006;6(11):3423–3441. doi: 10.5194/acp-6-3423-2006. [DOI] [Google Scholar]
- World Bank. World Development Indicators 2015. World Bank; Washington, DC: 2015. [Google Scholar]
- Yao J, Brauer M, Henderson SB. Evaluation of a wildfire smoke forecasting system as a tool for public health protection. Environ Health Perspect. 2013;121(10):1142–1147. doi: 10.1289/ehp.1306768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- You W, Zang Z, Pan X, Zhang L, Chen D. Estimating PM2.5 in Xi’an, China using aerosol optical depth: A comparison between the MODIS and MISR retrieval models. Sci Total Environ. 2015;505:1156–1165. doi: 10.1016/j.scitotenv.2014.11.024. [DOI] [PubMed] [Google Scholar]
- Zhang J, Reid JS. MODIS aerosol product analysis for data assimilation: Assessment of over-ocean level 2 aerosol optical thickness retrievals. J Geophys Res Atmospheres. 2006;111(D22):D22207. doi: 10.1029/2005JD006898. [DOI] [Google Scholar]
- Zhang J, Reid JS. A decadal regional and global trend analysis of the aerosol optical depth using a data-assimilation grade over-water MODIS and Level 2 MISR aerosol products. Atmos Chem Phys. 2010;10(22):10949–10963. doi: 10.5194/acp-10-10949-2010. [DOI] [Google Scholar]
- Zhang J, Reid JS, Holben BN. An analysis of potential cloud artifacts in MODIS over ocean aerosol optical thickness products. Geophys Res Lett. 2005;32(15):L15803. doi: 10.1029/2005GL023254. [DOI] [Google Scholar]
