Skip to main content
GeoHealth logoLink to GeoHealth
. 2023 Sep 13;7(9):e2023GH000834. doi: 10.1029/2023GH000834

Data‐Driven Placement of PM2.5 Air Quality Sensors in the United States: An Approach to Target Urban Environmental Injustice

Makoto M Kelp 1,, Timothy C Fargiano 2, Samuel Lin 3, Tianjia Liu 4, Jay R Turner 5, J Nathan Kutz 6, Loretta J Mickley 7
PMCID: PMC10499371  PMID: 37711364

Abstract

In the United States, citizens and policymakers heavily rely upon Environmental Protection Agency mandated regulatory networks to monitor air pollution; increasingly they also depend on low‐cost sensor networks to supplement spatial gaps in regulatory monitor networks coverage. Although these regulatory and low‐cost networks in tandem provide enhanced spatiotemporal coverage in urban areas, low‐cost sensors are located often in higher income, predominantly White areas. Such disparity in coverage may exacerbate existing inequalities and impact the ability of different communities to respond to the threat of air pollution. Here we present a study using cost‐constrained multiresolution dynamic mode decomposition (mrDMDcc) to identify the optimal and equitable placement of fine particulate matter (PM2.5) sensors in four U.S. cities with histories of racial or income segregation: St. Louis, Houston, Boston, and Buffalo. This novel approach incorporates the variation of PM2.5 on timescales ranging from 1 day to over a decade to capture air pollution variability. We also introduce a cost function into the sensor placement optimization that represents the balance between our objectives of capturing PM2.5 extremes and increasing pollution monitoring in low‐income and nonwhite areas. We find that the mrDMDcc algorithm places a greater number of sensors in historically low‐income and nonwhite neighborhoods with known environmental pollution problems compared to networks using PM2.5 information alone. Our work provides a roadmap for the creation of equitable sensor networks in U.S. cities and offers a guide for democratizing air pollution data through increasing spatial coverage of low‐cost sensors in less privileged communities.

Keywords: fine particulate matter (PM2.5), sensor placement, sensor networks, environmental justice, citizen science

Key Points

  • A method using pollution data to diagnose the optimal and equitable placement of PM2.5 sensors is presented

  • By incorporating race and income information, the method shifts the sensor distribution to more nonwhite and low‐income neighborhoods

  • The approach provides a roadmap for urban areas to create low‐cost sensor networks that address America's legacy of environmental racism

1. Introduction

Fine particulate matter (PM2.5) air pollution poses the greatest environmental risk to public health (Fann et al., 2012; Murray et al., 2020). However, the deployment of a distributed sensor network to monitor PM2.5 pollution is financially and resource intensive. Interpreting measurements of PM2.5 is particularly difficult due to its many outdoor sources (e.g., wildfires or vehicular combustion) and its sensitivity to meteorological conditions (Bond et al., 2007; McDuffie et al., 2021; Tai et al., 2012). In addition, the relatively short atmospheric lifetime of PM2.5 leads to large spatial and temporal variation of PM2.5 exposure, even at the neighborhood scale (Clark et al., 2011; Dai et al., 2020; Liu et al., 2009). In the United States, communities of color are disproportionately exposed to higher levels of PM2.5 air pollution at all income levels (Lane et al., 2022; Tessum et al., 2021). Moreover, low‐cost sensor network initiatives have led to sensors being systematically deployed in wealthier, predominantly White neighborhoods (deSouza & Kinney, 2021). Here we provide an intentional sensor network optimization framework that takes into account the multiscale variability of PM2.5 and two socioeconomic metrics (race and income) to determine the optimal and equitable placement of PM2.5 monitors.

In the United States, monitoring of PM2.5 has long relied on networks owned and operated by state, local, and tribal agencies using regulatory grade (Federal Reference Method and Equivalent Method) samplers and monitors. The Environmental Protection Agency (EPA) has promulgated PM2.5 minimum monitoring network requirements, based on Metropolitan Statistical Area (MSA) population, and additional design criteria for selecting monitoring locations (Code of Federal Regulations, 2023). These state or local air monitoring stations, hereafter called the “EPA monitors” are deployed primarily to determine compliance with the National Ambient Air Quality Standard yet have ancillary purposes such as providing data to assess air pollution impacts on public health. These sites were originally intended to monitor high‐emitting point sources (e.g., coal‐fired power plants, on‐road traffic) and air pollution in areas with relatively high population. As a result, EPA monitors are not equally distributed across the United States and are not designed to sample the full range of pollutant concentrations (Di et al., 2019; Kelp et al., 2022; Marlier et al., 2022). In part due to the high cost to purchase and maintain regulatory grade monitoring infrastructure, new monitoring sites are added infrequently (US EPA, 2020), and the EPA is considering incorporating networks of low‐cost and crowdsourced sensors to supplement the current EPA network (Barkjohn et al., 2021).

The rise of low‐cost, crowdsourced sensor networks has greatly increased the spatiotemporal monitoring of PM2.5 monitoring in the United States. Low‐cost sensors can report measurements publicly in real‐time (Snyder et al., 2013), and recent studies have focused on their calibration (Barkjohn et al., 2021; Delp & Singer, 2020; deSouza et al., 2022), ability to capture wildfire smoke in the wildland‐urban interface (Burke et al., 2022; Holder et al., 2020; Kramer et al., 2023), and skill in characterizing indoor PM2.5 from outdoor pollution (Liang et al., 2021; May et al., 2021). While low‐cost sensor networks monitoring PM2.5 in urban areas are available, these networks are largely designed by volunteers or are focused on points of interest (e.g., hospitals, elderly care homes, or bus stops) (Esie et al., 2022; Mousavi et al., 2021; Sun et al., 2019). Such networks do not consider the spatial features or temporal dynamics of air pollution, and only recently have these factors been considered when designing locally‐deployed sensor networks (Frederickson et al., 2022). The PurpleAir network is currently the most extensive crowdsourced PM2.5 sensor network in the world, with over 10,000 devices globally and over 2,000 outdoor sensors in the United States (Barkjohn et al., 2021).

While device manufacturers such as PurpleAir (with its allied data platform) and organizations such as OpenAQ (with its open‐source air quality data platform) promote the democratization of air pollution monitoring data, the selection of these sensor locations suffers from systematic racial and income biases. The crowdsourced networks often rely on volunteers who are responsible for the installation and upkeep of each sensor, resulting in deployment in predominantly White areas characterized by higher incomes and levels of education relative to US census tracts with EPA monitors (deSouza & Kinney, 2021). In addition, areas with a higher density of low‐cost sensors tend to report lower annual‐average PM2.5 concentrations than the EPA monitors in all states except California (deSouza & Kinney, 2021). While well‐intentioned, these citizen science efforts can exacerbate disparities in the spatial coverage of PM2.5 monitors, limit the pursuit of environmental justice, and may further perpetuate inequality in PM2.5 monitoring (Sorensen et al., 2019; Tubridy et al., 2022). Such considerations are especially pressing given the EPA's American Rescue Plan Enhanced Air Quality Monitoring for Communities, an initiative which provides funding to enhance ambient air quality monitoring in and near underserved communities across the United States (US EPA, 2022).

A multitude of studies describe disparities in exposure to air pollution among racial minorities and people of low socioeconomic status (SES) in the United States, but few propose frameworks to start addressing these inequalities (Fann et al., 2011; Gardner‐Frolick et al., 2022; Van Horne et al., 2022; Wang et al., 2022). A recent study highlights the historical discriminatory practice of redlining, in which services such as mortgages, insurance loans, and other financial services are systematically denied to residents of certain areas based on their race or ethnicity, and reveals how this practice continues to shape systemic disparities in air pollution exposure in the United States (Lane et al., 2022). Racial minorities and low SES groups are at a higher risk of death and disease from PM2.5 exposure (Bell & Ebisu, 2012; Jbaily et al., 2022; Liu et al., 2021; Mikati et al., 2018; Miranda et al., 2011). While absolute disparities in air pollution in the US have declined significantly since 2000, relative disparities between White and minority groups persist (Clark et al., 2014; Colmer et al., 2020). Past studies have comprehensively investigated racial/ethnic disparities in air pollution exposure; such studies have relied on ground‐based monitoring data (Clark et al., 2014; Demetillo et al., 20202021; Jbaily et al., 2022; Kerr et al., 2021; Liu et al., 2021; Tessum et al., 2021). Given the heightened risk of racial minorities and low SES groups to PM2.5 pollution in the United States, a PM2.5 sensor network biased toward wealthier or whiter communities may lead to mischaracterizations of exposures and inaccurate assessment of the health impacts of such pollution. Proposing an equitable PM2.5 monitoring network presents a beneficial first step in starting to address this inequality issue.

In this study, we demonstrate a data‐driven approach that determines the optimal and equitable placement of sensors to capture PM2.5 concentrations and variability, while considering socioeconomic metrics (race, income) across four urban areas with histories of segregation in the United States. We use multiresolution dynamic mode decomposition with cost constraints (mrDMDcc), which recursively decomposes a data set into low‐rank spatial modes and their temporal Fourier dynamics, while incorporating a socioeconomic cost function in the optimization (Clark et al., 2019; Kutz et al., 2016; Manohar et al., 2019; Proctor et al., 2014). This algorithm allows for the creation of a library of modes that not only captures PM2.5 concentrations spatially and temporally on short (weekly) and long‐term (years to decade) timescales, but also incorporates cost‐constraining functions that optimize sensor placement based on race and income metrics relevant to environmental justice. Our algorithm can capture a finer level of spatial and temporal variability in a data set that would otherwise be averaged out using traditional mean or maximum PM2.5 metrics. We previously applied a similar algorithm, but without the socioeconomic cost function, to identify the optimal placement of PM2.5 sensors across the contiguous United States (Kelp et al., 2022). Here we extend that work by designing optimal PM2.5 sensor networks that account for race and income in four cities: St. Louis, MO; Houston, TX; Buffalo, NY; and Boston, MA. We identify monitoring locations on the spatial scale of 1 km2, which is in accords with neighborhood scale of representation for PM2.5 monitoring siting. We compare our results to the current networks of EPA and PurpleAir monitors (acknowledging that there may be other low‐cost sensor networks in one or more of these cities), and we contrast the mrDMDcc network designed with only air pollution modal information against those with race and with income included in the optimization.

2. Methods

2.1. PM2.5 Data Set

We use a data set consisting of modeled daily PM2.5 concentrations at 1 km × 1 km for the contiguous United States for the period January 2000 to December 2016. This data set was produced through a data fusion method using ensemble machine learning to combine surface monitoring measurements, satellite aerosol optical depth, land‐use data, and chemical transport model results, among other variables (Di et al., 20192021). We subset the data set to encompass each city described in Section 2.3. While this PM2.5 data set tends to underpredict PM2.5 on high‐pollution wildfire days in the Western United States (Considine et al., 2022), these underestimates are not a concern for our analysis as we focus on urban areas in the central and eastern United States. We remove missing values from the data set by applying a mask to the spatial grid.

2.2. Census Data

For our racial and economic analysis of the urban areas, we obtain sociodemographic features from the 2020 American Community Survey using the R package tidycensus. We use socioeconomic data at the census tract level and interpolate these features to the Di et al. (2021) grid to get census estimates at the centroids of the 1 km × 1 km grid cells. For each grid cell in an urban area, we calculate the proportion of nonwhite individuals in terms of population and annual median income. We define the nonwhite proportion as one minus the non‐Hispanic White proportion.

2.3. Urban Areas

We focus our analysis on four urban areas in the United States with high rates of racial segregation and underserved PM2.5 monitoring networks: St. Louis, MO, Houston, TX; Buffalo, NY; and Boston, MA. Although many cities in the United States are racially segregated, we select these urban areas based on their geographic diversity and long histories of environmental injustice. Figure 1 shows the locations of these urban areas with geographic landmarks. The extent of the urban area domain is specified by the U.S. Census Bureau Shapefile for Urban Areas (https://www.census.gov).

Figure 1.

Figure 1

Spatial extent of St. Louis, Houston, Buffalo, and Boston metropolitan areas. Image source: Google Earth Engine software with data from SIO, NOAA, U.S. Navy, NGA, GEBCO, and images from Landsat/Copernicus, U.S. Geological Survey.

In all urban areas, decadal PM2.5 concentrations are generally higher in more nonwhite and lower income neighborhoods (Figure 2). St. Louis (population: 2.8 million, area: 22,000 km2) displays stark segregation, with large nonwhite populations found north of downtown in majority‐Black neighborhoods such as Jennings and Ferguson in Missouri and East St. Louis in Illinois. These neighborhoods in greater St. Louis have long histories of being targeted by structurally racist polices, which have shaped the city landscape (Johnson, 2020). Houston (population: 7.2 million, area: 26,000 km2) has one of the largest nonwhite Hispanic populations in the United States and is home to nearly a quarter of the nation's chemical refineries. Most of these petrochemical refineries are located near the Houston Ship Channel along the Buffalo Bayou River between Baytown and Downtown Houston, an area containing a disproportionate number of low‐income and nonwhite households (Demetillo et al., 2020; Jang et al., 2021; Sansom et al., 2017). Buffalo (population: 1.1 million, area: 4,000 km2) has large populations of nonwhite residents in the areas north and east of downtown. The Buffalo metropolitan area has a long history of environmentally racist policies, including toxic chemical dumping in predominantly nonwhite neighborhoods such as Niagara Falls and Love Canal in the 1970s (Fletcher, 2021). Nonwhite communities continue to disproportionately experience exposure to air pollution from major roadways and have limited access to environmental amenities like parks and other public spaces (Drake et al., 2022; Krieg, 2005; Phillips et al., 2007). Finally, Boston's (population: 4.9 million, area: 12,000 km2) modern demographics are influenced by its early 20th century redlining policies, with large nonwhite populations found in the residential suburbs of South Boston, Lowell, and Lawrence among others (Harvard Chan‐NIEHS Center for Environmental Health, 2021).

Figure 2.

Figure 2

Maps of decadal PM2.5 concentrations, socioeconomic inequality metrics, and EPA and PurpleAir sensor locations for St. Louis, Houston, Buffalo, and Boston metropolitan areas. The 11‐year mean of annual averages of PM2.5 over 2006–2016 are from estimates from Di et al. (2021). The proportion nonwhite and median annual household income are from the 2020 American Community Survey interpolated onto the centroids of the Di et al. (2021) PM2.5 data set. EPA (locations used in Di et al. (2021)) and PurpleAir (downloaded on 19 July 2021) sensor locations are gridded onto the same 1 km × 1 km Di et al. (2021) grid. White areas of the sensor location maps represent the built environment, while the shades of green represent the natural vegetation colors of the area.

In our selected urban areas, the number and locations of EPA monitors are sparse, and the locations of PurpleAir monitors are generally in whiter and higher‐income census tracts, as first indicated by deSouza et al. (2022) (Figure 2). On average, EPA monitors are in lower income and higher proportion nonwhite areas (St. Louis: 47% nonwhite, $66,000 median income; Houston: 79%, $55,000; Buffalo: 31%, $44,000; Boston: 52%, $83,000) than PurpleAir monitors (St. Louis: 45% nonwhite, $60,000 median income; Houston: 59%, $81,000; Buffalo: 36%, $67,000; Boston: 41%, $98,000). The St. Louis metropolitan area has the most EPA monitors out of the four cities, with 18, but only 12 PurpleAir monitors. Buffalo has the fewest EPA and PurpleAir monitors out of the four cities, with five and four, respectively. Boston has the most PurpleAir monitors out of the four cities, with 82.

2.4. Sensor Placement Using Multiresolution Dynamic Mode Decomposition With Environmental Justice Cost‐Constraint (mrDMDcc)

We use multiresolution dynamic mode decomposition (mrDMD), which recursively decomposes a data set into low‐rank spatial modes and their temporal Fourier dynamics (Kutz et al., 2016; Manohar et al., 2019). mrDMD has been shown to capture PM2.5 concentrations spatially and temporally on short (daily) and long‐term (years to decade) timescales, and to incorporate information from transient phenomena, such as wildfires and temperature inversions, that would otherwise be discarded or averaged out using similar data reduction techniques (Kelp et al., 2022). The algorithm can thus capture a finer level of spatial and temporal variability in a data set that would otherwise be averaged out using traditional mean PM2.5 or maximum PM2.5 metrics. mrDMD is a dimensionality reduction algorithm, similar to principal components analysis (PCA), but mrDMD is more precise in capturing spatiotemporal variability than methods based on singular value decomposition such as PCA (Manohar et al., 2019). A formal expansion of the DMD and mrDMD theory and modeling approach may be found elsewhere (Kelp et al., 2022; Kutz et al., 2016; Manohar et al., 2019).

The mrDMD algorithm operates by decreasing the time domain by a factor of two at each successive decomposition level. We apply mrDMD to training windows starting at 11.4 years (M = 4,096 days), followed by 10 decomposition levels so that the shortest frequency is weekly. This approach thus yields a long‐term mode characterizing the average PM2.5 concentrations over 11.4 years, with potential identification of transient pollution events spanning timescales from 5.7 years (2,048 days) to 8 days. Recent studies have demonstrated the sensitivity of PM2.5 across the United States to large‐scale, multi‐year meteorological patterns such as El Niño or the Atlantic Multidecadal Oscillation (Previdi & Fiore, 2019; Shen et al., 2017). As we shall see, however, most of the events identified by mrDMD occur on timescales lasting days to weeks. Here, we focus our analysis on the last 11.4 years (4,096 days) of our data set: September 2005–December 2016.

We use the matrix libraries containing all mrDMD modes as tailored basis sets ψrRn×r to optimize for sensor placement. We identify the optimal sensor locations by employing QR pivoting to our mrDMD basis sets (Heck et al., 1998; Manohar et al., 2019). QR pivoting is a “greedy” selection algorithm that is computationally efficient for finding near‐optimal sensor locations. Greedy approaches are often favored over other optimization techniques, as the true optimal solution often involves a combinatorially intractable optimization. QR column pivoting identifies rows in the modal library ψ r with the highest 2‐norm, which corresponds to locations with the largest PM2.5 modal frequencies and therefore greatest variability. The reduced matrix QR factorization with column pivoting decomposes a matrix ARm×n into a unitary matrix Q, an upper‐triangular matrix R, and a column permutation matrix C T , such that AC T  = QR. Thus, the QR factorization with column pivoting yields r point sensors (pivots) that best sample the r tailored basis modes ψ r :

ψrTCT=QR (1)

That is, each QR pivot identifies those spatial locations in the modal library that exhibit the most variability and where sensor placement would capture the greatest number of significant pollution episodes above background concentrations. To reiterate, mrDMD intends to capture spatial and temporal variability in pollution without taking into account other considerations.

Here, we present an extension to the mrDMD framework that considers cost‐constraining functions (Clark et al., 2019) to optimize sensor placement based on racial and income metrics related to environmental justice. The mrDMDcc algorithm is based on the column pivoted QR algorithm described above, where the pivot column is now chosen to balance (a) the decrease in accuracy of capturing the largest air pollution modal signals with (b) the increase in capturing pollution exposure in communities with either a high proportion nonwhite or low‐income populations. To incorporate the cost of a sensor, we create a factorization l which satisfies:

l=maxi=1,,nkCk,i2γηji+kk (2)

where k is the number of sensors specified by the mrDMD library, n is the number of columns in the original PM2.5 data matrix, C is the column permutation matrix, η is a vector that contains the socioeconomic cost function, and γ is the cost‐balance coefficient that specifies the degree to which the cost function must be obeyed by applying a penalty term. Once calculated, l is used to permute the indices in the mrDMD library to find the pivots that satisfy the cost constraint.

A formal expansion of the sparse sensor placement approach and cost‐constraint pseudocode may be found elsewhere (Brunton & Kutz, 2019; Clark et al., 2019; Kelp et al., 2022; Manohar et al., 20182019).

The cost function used here is a step function that penalizes placing sensors too far from majority nonwhite or low‐income neighborhoods. For the cost function vector, all socioeconomic data is rescaled between 0 and 1. The proportion of nonwhite people in a grid cell ranges from 0 to 1, with 0 representing an all‐white population and 1 representing an all‐nonwhite population. The annual median income of a grid cell is normalized according to the maximum value in the census data ($250,000 for St. Louis, Houston, and Boston; $150,000 for Buffalo) and inverted such that 0 indicates highest income and 1 indicates lowest income (Figure S1 in Supporting Information S1). Values of γ range from 0 to 0.5 and we select, by inspection, sensor network results that best balance capturing high‐variability PM2.5 locations with more nonwhite or lower income areas. We apply a user‐selected γ value that prevents the cost function from placing sensors in highly dense clusters that do not improve coverage of vulnerable neighborhoods (Figure S2 in Supporting Information S1). With a PM2.5 data set timescale spanning over 11 years to 1 week, the mrDMD modal library typically identifies more than 800 sensors because the more polluting events on shorter timescales that the sensor network attempts to capture, the more sensors it needs to do so (Figure S3 in Supporting Information S1). However, we choose to display the top 250 sensors in St. Louis, Houston, and Boston, and the top 150 sensors in Buffalo, as these numbers of monitors create a realistic distribution of sensors spanning both the urban core and more suburban/rural areas of each city without losing spatial coverage. Generally, we find that under 100 sensors fails to capture many socioeconomic communities in an urban area, while more than 300 sensors lead to overlapping spatial clustering which may be redundant (Figure S4 in Supporting Information S1). However, adding or subtracting 50 sensors from this total does not meaningfully impact the prioritization of the relevant socioeconomic communities. We choose to display 150 sensors in Buffalo as it is the smallest in area (4,000 km2) among all urban cities selected in this study. We find that showing more than 150 sensors becomes less interpretable when comparing differences. Additionally, we selected 250 sensors for St. Louis (22,000 km2), Houston (26,000 km2), and Boston (12,000 km2) as this number is sufficient to cover the domain of the MSA and identify patterns among different mrDMDcc optimizations. We acknowledge that the optimal number of sensors may vary for each city, although we did not optimize that number here.

The mrDMDcc algorithm does not require a new simulation when the target number of sensors is changed. This is in contrast to other algorithms where the network size is specified a priori, such as the location‐allocation approach which is an optimization algorithm built into ArcGIS (Kanaroglou et al., 2005).

3. Results and Discussion

3.1. Optimal and Equitable Sensor Locations

In St. Louis, using race and income metrics in the mrDMDcc optimization leads to a greater number of sensors clustered in historically nonwhite and low‐income neighborhoods (Figure 3, top row). We find that the mrDMD algorithm, which takes into account only air pollution modes, distributes sensors throughout the St. Louis urban airshed, with many monitors located some distance away from downtown and the Black northern residential suburbs. This distribution is due to frequently occurring wintertime nitrate events, during which cold temperatures and high‐pressure systems drive nitrate from agricultural activities in the Midwest into particle phase, forming haze (Lee et al., 2006). These haze events result in large PM2.5 modal signals above background variability that are located away from the urban city center (Figure S5 in Supporting Information S1). In contrast, the mrDMDcc optimized for race (γ = 0.20) shifts more sensors to the historically Black East St. Louis neighborhood. This area not only has the highest proportion of nonwhite residents, but also includes steelworks operations in Granite City, IL, (Lee & Hopke, 2006; Wang et al., 2009) which are a large driver of environmental injustice in the region. The area surrounding Granite City is characterized by high PM2.5 variability in our mrDMD modal library (Figure S5 in Supporting Information S1). Finally, the mrDMDcc optimization for income (γ = 0.17) shifts more sensors to the lower‐income (and historically Black) northern suburbs of Jennings and Ferguson, communities that have faced many environmental justice issues (Marcantonio et al., 2017). If considering only air pollution modes from the mrDMD sensor network, all these communities would be under monitored.

Figure 3.

Figure 3

PM2.5 sensor locations for St. Louis, Houston, Buffalo, and Boston. Distribution of sensor locations identified as optimal by the mrDMD algorithm, and those identified as optimal and equitable by the mrDMDcc using race and income metrics. All sensor locations are gridded onto the same 1 km × 1 km Di et al. (2021) grid. Dots represent sensor locations with the shading representing the proportion of nonwhite (left and center columns) or low‐income households (right column) in that grid box. Dots outlined in red indicate areas with historic environmental justice issues mentioned in the text—for example, Granite City, IL, and East St. Louis, IL, for the race optimized mrDMDcc case.

In Houston, race‐ and income‐optimized mrDMDcc networks shift the distribution of sensors toward the Ship Channel region and majority Black neighborhoods (Figure 3, second row). When considering only air pollution modes, the mrDMD algorithm places sensors throughout the Houston area, with a particular focus around downtown. This distribution is reasonable given that the downtown area typically has the highest concentrations of PM2.5 in the region due to industrial sources and heavy‐duty diesel trucking (Demetillo et al., 2020; Du et al., 2019). Evidence of downtown PM2.5 emission hotspots and arterial traffic can be seen in the PM2.5 modal library (Figure S6 in Supporting Information S1). Here the mrDMDcc optimization that takes into account race information (γ = 0.12) shifts more sensors to neighborhoods along the Ship Channel and in majority Black neighborhoods in Southwest Houston. The Ship Channel is a major polluter of the Houston urban core, with epidemiological studies revealing that children living within two miles of the channel are 56% more likely to develop leukemia than the national average (Linder et al., 2008). The race‐optimized sensor locations in Southwest Houston are in a majority Black neighborhood and show PM2.5 modal signals suggesting a large presence of on‐road emissions from freeways (Figure S6 in Supporting Information S1) and is upwind from the W.A. Parrish Generating Station, a dual‐fired power plant that includes the largest coal‐fired plant in Texas (Demetillo et al., 2020). Finally, the mrDMDcc optimization with income information (γ = 0.13) also shifts sensor placements along the Ship Channel and in the low income (and majority Black) suburbs north of downtown, including Trinity and Houston Gardens. As in St. Louis, we find that mrDMDcc distributes sensors in Houston with a heightened focus on polluted, primarily low‐income, and majority Black neighborhoods that would be relatively under monitored if considering only the air pollution modes.

In Buffalo, all three sensor network optimizations place monitors in the Niagara Falls and Love Canal neighborhoods (Figure 3, third row). Buffalo is the smallest of the urban cities analyzed in this work in terms of both spatial extent and population, and we limit the number of sensors to 150 (Figure S7 in Supporting Information S1). Here we find that the distribution of sensors generated by all three mrDMD approaches yields similar results. The race‐ (γ = 0.34) and income‐ (γ = 0.32) optimized networks do not shift monitors to the north of downtown Buffalo, where large nonwhite and low‐income populations reside. Although the downtown is represented in the PM2.5 modal library (Figure S8 in Supporting Information S1), the largest variability in PM2.5 is caused by lake effect meteorology (Spak & Holloway, 2009) and by pollution in the Niagara Falls neighborhoods; both of these effects shift sensor density away from downtown. Nevertheless, the Niagara Falls and Love Canal neighborhoods have known environmental justice pollution issues that affected large nonwhite and low‐income populations (Fletcher, 20022021; Gibbs, 2011; Newman, 2016) that are captured by all mrDMD sensor networks.

In Boston, the income‐optimized mrDMDcc algorithm captures more nonwhite and low‐income neighborhoods than either the race‐optimized algorithm or the mrDMD taking only air pollution modes into account (Figure 3, bottom row). The mrDMD approach places sensors throughout the Boston metropolitan area, with a particular focus on downtown and the area west of Boston. To be sure, the largest emissions of PM2.5 are in downtown Boston, which dominate the background variability in the PM2.5 modal library and where population density is greatest (Figure S9 in Supporting Information S1). The mrDMDcc race‐optimized algorithm (γ = 0.24) shifts more monitors to South Boston, which has the highest proportion of nonwhite residents in this region. However, the large signal from South Boston dominates the mrDMDcc cost function in this case and shifts sensor placement from other nonwhite neighborhoods such as Lowell, Lawrence, and Brockton, causing these locations to lose sensor density. We attribute this result to the mrDMDcc optimization, which prioritizes the largest values in the cost function vector and does not linearly scale other values in importance. On the other hand, the income‐optimized mrDMDcc algorithm (γ = 0.10) clusters more sensors in nonwhite neighborhoods since the income metric contains fewer extreme values that can overwhelm the optimization cost function. Using the income‐based sensor optimization, we find more sensors placed in the residential suburbs of Lowell, Lawrence, Brockton, Randolph, and Lynn, which have both major nonwhite and low‐income populations. Designing a distributed sensor network for Boston is challenging in part due to the highly variable meteorological patterns of New England that affect PM2.5 concentrations. This variability is caused by New England's physical geography, including its coastal orientation, position within the prevailing westerlies, and presence of mountains, as well as its diverse climate patterns, such as large diurnal changes in temperature, droughts, heavy rainfall, and blizzards (Keim et al., 2005). In addition, the Boston area is predominately White with spatially dispersed nonwhite communities, making it difficult to optimize based on the race metric. Our mrDMD results in Boston thus highlight the importance of considering both race and income in sensor network design due to these computational limitations.

3.2. Cumulative Distributions of Sensors

Cumulative distributions of sensors confirm that mrDMDcc optimizations using environmental justice metrics capture more nonwhite and low‐income neighborhoods than the mrDMD optimization relying on pollution modes alone. Figure 4 shows the cumulative distributions of sensors in St. Louis for proportion of nonwhite residents and median annual income for the three different sensor optimizations methods in this work. The decadal PM2.5 concentrations are similar among the three sensor optimizations, with differences of at most 0.25 μg m−3 for a given cumulative frequency. This indicates that the methods similarly capture the distribution and extent of decadal mean PM2.5 in a metropolitan area, even though daily or weekly PM2.5 concentrations may vary substantially across the city. Comparing mrDMD against the distributions for racial composition (Figure 4, light blue) and income (Figure 4, orange) across all 1 km2 grid cells within the St. Louis city bounds indicates that the mrDMD method is comparable to random sampling for both socioeconomic and racial metrics. In contrast, the cumulative distributions for the proportion of nonwhite residents suggest that the race‐optimized network captures a relatively higher density of the greatly nonwhite locations.

Figure 4.

Figure 4

Cumulative frequency distributions for proportion of nonwhite locations and median annual income for the three different sensor network optimizations for St. Louis. Each point represents one sensor location out of the 250 designed for St. Louis. The mrDMD network is designed with only air pollution modal information, mrDMD‐race includes race information from the United States Census in the sensor network optimization, and mrDMD‐income includes annual income information from the Census in the sensor network optimization. An additional set of points in each plot represents the distribution across racial composition (light blue) and income (orange) for a high‐density, uniformly distributed sensor network across all 1 km2 grid cells within the city bounds. The y‐axis for median annual income has been reversed to make this panel consistent with the other panels, with the neighborhoods of greatest interest in this study plotted at the high end of the distributions.

Figure 4 demonstrates that the mrDMDcc cost function is particularly sensitive to the maximum and minimum values of the environmental justice metrics we supply. The mrDMDcc distributions reflect this sensitivity to the cost function: East St. Louis has the highest values of the proportion nonwhite residents (greater than 95% nonwhite, Figure 2) while Jennings and Ferguson have the lowest annual income values (less than $30,000, Figure 2). As a result, these neighborhoods dominate the race and income optimizations, respectively, and subsequently shift sensor density to these areas (Figure 3). In the middle quartile range of cumulative sensor frequency (e.g., 0.25–0.75), the race‐optimized network captures mostly White neighborhoods. Given the strong racial segregation in St. Louis, relatively few monitors are placed in neighborhoods with 0.3–0.8 fraction of nonwhite populations. At the low end of the race‐optimized distribution are mostly White (and often more rural) communities that have large variability in PM2.5 concentrations due to meteorology and regional‐scale haze events (winter nitrate, summer sulfate) rather than from emissions in the urban core of a city. The income‐optimized network, on the other hand, captures more low‐income neighborhoods compared to both the mrDMD and mrDMDcc race‐optimized networks.

Optimizing by income in Houston, Buffalo, and Boston leads to more low‐income and more nonwhite sensor locations compared to both the mrDMD and mrDMDcc race‐optimized networks (Figures S10–S12 in Supporting Information S1). We generally observe that the income metric landscapes have fewer extreme values, which make the optimization problem more tractable for the mrDMDcc framework. However, the income optimizations in St. Louis, Houston, and Buffalo also place sensors in the wealthiest neighborhoods (top 5% of income distribution). These unexpected sensor configurations are likely due to (a) the mrDMDcc algorithm balancing the accumulated error from selecting low‐income populations, which may not have as much air pollution modal variability, or (b) suburban areas having higher relative PM2.5 variability because they have lower concentrations than the urban core together with local meteorology or regional scale events (wintertime nitrate, summertime sulfate) causing larger relative impacts. Further investigation of these issues lies beyond the scope of this study, but would be important to consider for further implementation of this work.

While race and income are often correlated in urban areas (Bhutta et al., 2020), the mrDMDcc algorithm tends to capture more nonwhite communities when using income in the cost function than when using race, especially in St. Louis and Houston (Figure S13 in Supporting Information S1). We hypothesize that this finding is due to extreme levels of racial segregation in these cities. As described above in the case of St. Louis, race metrics exhibit greater disparity, both spatially and in terms of minimum and maximum values, leading to a more challenging optimization. Additionally, the “shape” of segregation (Chodrow, 2017) has an impact on the optimizations presented in this study. In cases where large portions of a city are segregated, such as St. Louis and Houston, sensor placement is relatively straightforward. However, if segregated areas are scattered throughout the urban area, as in the case of Boston, the sensor placement becomes more complex. Binning the United States Census race and income data (from which the environmental justice metrics are derived) into coarser statistical groups, such as combining the 90th percentile of nonwhite residents with the 99th percentile, may improve the sensor optimization. However, we do not do so in this work as it would likely average out the most socioeconomically disparate communities. In any event, the large disparity in race values encountered in this work underscore the legacy of stark segregation exhibited in most major American cities (Chodrow, 2017).

4. Conclusions

We present a data‐driven approach to identify the optimal and equitable placement of PM2.5 sensors to capture extremes of air pollution in four urban cities (St. Louis, Houston, Buffalo, and Boston), all with legacies of segregation and environmental racism. Previous studies have relied on crowdsourced sensor networks, such as PurpleAir, to fill in the gaps of the EPA monitoring network, but such sensors are mainly concentrated in whiter, higher income neighborhoods. This study is the first to diagnose both the optimal and equitable placement of PM2.5 sensors, capturing both PM2.5 extremes and PM2.5 exposure in majority nonwhite or poor neighborhoods. Our method uses multiresolution dynamic mode decomposition with environmental justice cost constraints (mrDMDcc), an approach that takes into account the variability of PM2.5 on timescales ranging from 8 days to over a decade and incorporates race and income data from the 2020 United States Census into the optimization. All mrDMDcc sensor networks are data‐driven and constructed from modal libraries, which capture both the background variability and reoccurring high pollution episodes in PM2.5 within an urban area. Comparing mrDMD against the distributions for racial composition and income across all 1 km2 grid cells within metropolitan city bounds indicate that the mrDMD method is comparable to random sampling from a dense distribution for both socioeconomic metrics (Figure 4, Figures S10–S12 in Supporting Information S1). Our results show that using mrDMDcc to design an air quality sensor network leads to a higher number of sensors placed in historically low‐income and nonwhite neighborhoods with known environmental pollution problems. For example, the mrDMDcc sensor networks highlight neighborhoods along major polluting areas, such as the Ship Channel in Houston and Granite City, IL, in the St. Louis metropolitan area. Such neighborhoods are considerably under monitored when using an approach without considering race or income in the optimization. In Buffalo, a city with only five EPA monitors and four PurpleAir monitors, the mrDMDcc algorithm creates sensor networks with a heightened focus on the Niagara Falls and Love Canal neighborhoods, both of which have long legacies of environmentally racist policies.

Although the mrDMDcc sensor network shifts more monitors to nonwhite and low‐income locations, we find that the algorithm is sensitive to the range of the socioeconomic data supplied. Even though studies have shown that race is the dominant factor in air pollution exposure disparities (Liu et al., 2021), we find that optimizing sensor location by income more often leads to greater coverage in both low‐income and nonwhite neighborhoods compared to optimizing by race (Figures S10–S12 in Supporting Information S1). The mrDMDcc algorithm tends to capture fewer nonwhite communities when using race in the cost function due to the severe racial segregation, both spatially and in terms of the minimum and maximum values supplied to the cost function. These factors lead to a more challenging optimization (Figure S13 in Supporting Information S1). Binning the United States Census race and income data into coarser statistical groups than those used here could improve the sensor optimization but would likely average out the most socioeconomically disparate communities. The large disparity in race values encountered in this work underscores the legacy of stark segregation that endures in most major American cities.

The mrDMDcc algorithm provides a roadmap for designing optimal and equitable PM2.5 sensor networks for segregated cities and regions in the United States. Our results indicate that considering social inequality metrics in the optimization of air quality sensor networks can lead to a more equitable distribution of monitoring resources and provide a more comprehensive understanding of the distribution of air pollution within a city. We produce sensor networks with 250 monitors for St. Louis, Houston, and Boston, and 150 monitors for Buffalo, which may seem impractical given that these areas typically have fewer than 50 EPA and PurpleAir monitors combined. However, our results are intended as a strategy for designing a comprehensive sensor network for these urban areas. Extensions to the mrDMDcc framework could incorporate sensor measurements with different levels of accuracy—i.e., high‐cost and low‐cost sensors (Clark et al., 2020)—or could consider cost‐constraining functions using other metrics such as population density (Kanaroglou et al., 2005), fraction of children or the elderly (Sun et al., 2019), Normalized Difference Vegetation Index for green spaces in urban areas (Larkin et al., 2017), and other land use information (Lu et al., 2021). Additional postprocessing of sensor locations may be necessary to better capture spatial gradients coming into and out of communities of interest and determine the threshold at which adding more sensors will lead to redundant overlap. Furthermore, we may use a combination of metrics such as the covariance between race and income. In future work, a more systematic approach could be developed to determine an appropriate γ and a sufficient number of sampling sites. The mrDMDcc algorithm can easily include new, emerging patterns in PM2.5 pollution and demographic information for determining future monitor placement. Finally, this framework may be applied to the existing EPA and PurpleAir networks to determine the optimal and equitable placement of new sensor locations.

Conflict of Interest

The authors declare no conflicts of interest relevant to this study.

Supporting information

Supporting Information S1

Acknowledgments

This work was partly funded by the US Environmental Protection Agency (EPA) Grant 83587201. It has not been formally reviewed by the EPA. The views expressed in this document are solely those of the authors and do not necessarily reflect those of the EPA. We thank Angelique Demetillo for valuable discussion on the Houston urban core, and Yurika Harada for GIS advice.

Kelp, M. M. , Fargiano, T. C. , Lin, S. , Liu, T. , Turner, J. R. , Kutz, J. N. , & Mickley, L. J. (2023). Data‐driven placement of PM2.5 air quality sensors in the United States: An approach to target urban environmental injustice. GeoHealth, 7, e2023GH000834. 10.1029/2023GH000834

Data Availability Statement

The mrDMD algorithm and sensor network locations that support the findings of this study are openly available at the following URL/DOI: https://doi.org/10.5281/zenodo.7686016.

References

  1. Barkjohn, K. K. , Gantt, B. , & Clements, A. L. (2021). Development and application of a United States‐wide correction for PM2.5 data collected with the PurpleAir sensor. Atmospheric Measurement Techniques, 14(6), 4617–4637. 10.5194/amt-14-4617-2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bell, M. L. , & Ebisu, K. (2012). Environmental inequality in exposures to airborne particulate matter components in the United States. Environmental Health Perspectives, 120(12), 1699–1704. 10.1289/ehp.1205201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bhutta, N. , Chang, A. C. , Dettling, L. J. , & Hsu, J. W. (2020). Disparities in wealth by race and ethnicity in the 2019 survey of consumer finances. FEDS Notes. Board of Govenors of the Federal Reserve System. [Google Scholar]
  4. Bond, T. C. , Bhardwaj, E. , Dong, R. , Jogani, R. , Jung, S. , Roden, C. , et al. (2007). Historical emissions of black and organic carbon aerosol from energy‐related combustion. Global Biogeochemical Cycles, 21(2), 1850–2000. 10.1029/2006GB002840 [DOI] [Google Scholar]
  5. Brunton, S. L. , & Kutz, J. N. (2019). Data‐driven science and engineering: Machine learning, dynamical systems, and control. Cambridge University Press. 10.1017/9781108380690 [DOI] [Google Scholar]
  6. Burke, M. , Heft‐Neal, S. , Li, J. , Driscoll, A. , Baylis, P. , Stigler, M. , et al. (2022). Exposures and behavioural responses to wildfire smoke. Nature Human Behaviour, 6(10), 1351–1361. 10.1038/s41562-022-01396-6 [DOI] [PubMed] [Google Scholar]
  7. Chodrow, P. S. (2017). Structure and information in spatial segregation. Proceedings of the National Academy of Sciences of the United States of America, 114(44), 11591–11596. 10.1073/pnas.1708201114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Clark, E. , Askham, T. , Brunton, S. L. , & Nathan Kutz, J. (2019). Greedy sensor placement with cost constraints. IEEE Sensors Journal, 19(7), 2642–2656. 10.1109/JSEN.2018.2887044 [DOI] [Google Scholar]
  9. Clark, E. , Brunton, S. L. , & Kutz, J. N. (2020). Multi‐fidelity sensor selection: Greedy algorithms to place cheap and expensive sensors with cost constraints. arXiv:2005.03650 [eess].
  10. Clark, L. P. , Millet, D. B. , & Marshall, J. D. (2011). Air quality and urban form in U.S. urban areas: Evidence from regulatory monitors. Environmental Science & Technology, 45(16), 7028–7035. 10.1021/es2006786 [DOI] [PubMed] [Google Scholar]
  11. Clark, L. P. , Millet, D. B. , & Marshall, J. D. (2014). National patterns in environmental injustice and inequality: Outdoor NO2 air pollution in the United States. PLoS One, 9(4), e94431. 10.1371/journal.pone.0094431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Code of Federal Regulations . (2023). Appendix D to Part 58, title 40—Network design criteria for ambient air quality monitoring [WWW Document]. Retrieved from https://www.ecfr.gov/current/title-40/chapter-I/subchapter-C/part-58/appendix-Appendix%20D%20to%20Part%2058
  13. Colmer, J. , Hardman, I. , Shimshack, J. , & Voorheis, J. (2020). Disparities in PM2.5 air pollution in the United States. Science, 369(6503), 575–578. 10.1126/science.aaz9353 [DOI] [PubMed] [Google Scholar]
  14. Considine, E. M. , Hao, J. , deSouza, P. , Braun, D. , Reid, C. E. , & Nethery, R. C. (2022). Evaluation of model‐based PM$_{2.5}$ estimates for exposure assessment during wildfire smoke episodes in the western U.S. 10.48550/arXiv.2209.01479 [DOI] [PMC free article] [PubMed]
  15. Dai, F. , Chen, M. , & Yang, B. (2020). Spatiotemporal variations of PM2.5 concentration at the neighborhood level in five Chinese megacities. Atmospheric Pollution Research, 11(6), 190–202. 10.1016/j.apr.2020.03.010 [DOI] [Google Scholar]
  16. Delp, W. W. , & Singer, B. C. (2020). Wildfire smoke adjustment factors for low‐cost and professional PM2.5 monitors with optical sensors. Sensors, 20(13), 3683. 10.3390/s20133683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Demetillo, M. A. G. , Harkins, C. , McDonald, B. C. , Chodrow, P. S. , Sun, K. , & Pusede, S. E. (2021). Space‐based observational constraints on NO2 air pollution inequality from diesel traffic in major US cities. Geophysical Research Letters, 48(17), e2021GL094333. 10.1029/2021GL094333 [DOI] [Google Scholar]
  18. Demetillo, M. A. G. , Navarro, A. , Knowles, K. K. , Fields, K. P. , Geddes, J. A. , Nowlan, C. R. , et al. (2020). Observing nitrogen dioxide air pollution inequality using high‐spatial‐resolution remote sensing measurements in Houston, Texas. Environmental Science & Technology, 54(16), 9882–9895. 10.1021/acs.est.0c01864 [DOI] [PubMed] [Google Scholar]
  19. deSouza, P. , Barkjohn, K. , Clements, A. , Lee, J. , Kahn, R. , Crawford, B. , & Kinney, P. (2022). An analysis of degradation in low‐cost particulate matter sensors. 10.48550/arXiv.2210.14759 [DOI] [PMC free article] [PubMed]
  20. deSouza, P. , & Kinney, P. L. (2021). On the distribution of low‐cost PM2.5 sensors in the US: Demographic and air quality associations. Journal of Exposure Science and Environmental Epidemiology, 31(3), 514–524. 10.1038/s41370-021-00328-2 [DOI] [PubMed] [Google Scholar]
  21. Di, Q. , Amini, H. , Shi, L. , Kloog, I. , Silvern, R. , Kelly, J. , et al. (2019). An ensemble‐based model of PM2.5 concentration across the contiguous United States with high spatiotemporal resolution. Environment International, 130, 104909. 10.1016/j.envint.2019.104909 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Di, Q. , Wei, Y. , Shtein, A. , Hultquist, C. , Xing, X. , Amini, H. , et al. (2021). Daily and annual PM2.5 concentrations for the contiguous United States, 1‐km grids, v1 (2000–2016). 10.7927/0RVR-4538 [DOI]
  23. Drake, G. , Wheeler, A. P. , Kim, D.‐Y. , Phillips, S. W. , & Mendolera, K. (2022). The impact of COVID‐19 on the spatial distribution of shooting violence in Buffalo, NY. Journal of Experimental Criminology, 19(2), 513–530. 10.1007/s11292-021-09497-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Du, J. , Qiao, F. , & Yu, L. (2019). Temporal characteristics and forecasting of PM2.5 concentration based on historical data in Houston, USA. Resources, Conservation and Recycling, 147, 145–156. 10.1016/j.resconrec.2019.04.024 [DOI] [Google Scholar]
  25. Esie, P. , Daepp, M. I. G. , Roseway, A. , & Counts, S. (2022). Neighborhood composition and air pollution in Chicago: Monitoring inequities with a dense, low‐cost sensing network, 2021. American Journal of Public Health, 112(12), 1765–1773. 10.2105/AJPH.2022.307068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Fann, N. , Lamson, A. D. , Anenberg, S. C. , Wesson, K. , Risley, D. , & Hubbell, B. J. (2012). Estimating the national public health burden associated with exposure to ambient PM2.5 and ozone. Risk Analysis, 32(1), 81–95. 10.1111/j.1539-6924.2011.01630.x [DOI] [PubMed] [Google Scholar]
  27. Fann, N. , Roman, H. A. , Fulcher, C. M. , Gentile, M. A. , Hubbell, B. J. , Wesson, K. , & Levy, J. I. (2011). Maximizing health benefits and minimizing inequality: Incorporating local‐scale data in the design and evaluation of air quality policies. Risk Analysis, 31(6), 908–922. 10.1111/j.1539-6924.2011.01629.x [DOI] [PubMed] [Google Scholar]
  28. Fletcher, T. (2002). Neighborhood change at Love Canal: Contamination, evacuation and resettlement. Land Use Policy, 19(4), 311–323. 10.1016/S0264-8377(02)00045-5 [DOI] [Google Scholar]
  29. Fletcher, T. (2021). From Love Canal to environmental justice: The politis of harardous waste on the Canada—U.S. Border, from Love Canal to environmental justice. University of Toronto Press. 10.3138/9781442602250 [DOI] [Google Scholar]
  30. Frederickson, L. B. , Sidaraviciute, R. , Schmidt, J. A. , Hertel, O. , & Johnson, M. S. (2022). Are dense networks of low‐cost nodes really useful for monitoring air pollution? A case study in Staffordshire. Atmospheric Chemistry and Physics, 22(21), 13949–13965. 10.5194/acp-22-13949-2022 [DOI] [Google Scholar]
  31. Gardner‐Frolick, R. , Boyd, D. , & Giang, A. (2022). Selecting data analytic and modeling methods to support air pollution and environmental justice investigations: A critical review and guidance framework. Environmental Science & Technology, 56(5), 2843–2860. 10.1021/acs.est.1c01739 [DOI] [PubMed] [Google Scholar]
  32. Gibbs, L. M. (2011). Love Canal: And the birth of the environmental health movement. Island Press. [Google Scholar]
  33. Harvard Chan‐NIEHS Center for Environmental Health . (2021). Environmental racism in greater Boston: an interactive web resource. Harvard Chan‐NIEHS Center for Environmental Health. Retrieved from https://www.hsph.harvard.edu/niehs/environmental-racism-project/ [Google Scholar]
  34. Heck, L. P. , Olkin, J. A. , & Naghshineh, K. (1998). Transducer placement for broadband active vibration control using a novel multidimensional QR factorization. Journal of Vibration and Acoustics, 120(3), 663–670. 10.1115/1.2893881 [DOI] [Google Scholar]
  35. Holder, A. L. , Mebust, A. K. , Maghran, L. A. , McGown, M. R. , Stewart, K. E. , Vallano, D. M. , et al. (2020). Field evaluation of low‐cost particulate matter sensors for measuring wildfire smoke. Sensors, 20(17), 4796. 10.3390/s20174796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jang, S. , McDonald, T. J. , Bhandari, S. , Rusyn, I. , & Chiu, W. A. (2021). Spatial and temporal distribution of surface water contaminants in the Houston Ship Channel after the intercontinental terminal company fire. Journal of Exposure Science and Environmental Epidemiology, 31(5), 887–899. 10.1038/s41370-021-00343-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Jbaily, A. , Zhou, X. , Liu, J. , Lee, T.‐H. , Kamareddine, L. , Verguet, S. , & Dominici, F. (2022). Air pollution exposure disparities across US population and income groups. Nature, 601(7892), 228–233. 10.1038/s41586-021-04190-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Johnson, W. (2020). The broken heart of America: St. Louis and the violent history of the United States. Basic Books. [Google Scholar]
  39. Kanaroglou, P. S. , Jerrett, M. , Morrison, J. , Beckerman, B. , Arain, M. A. , Gilbert, N. L. , & Brook, J. R. (2005). Establishing an air pollution monitoring network for intra‐urban population exposure assessment: A location‐allocation approach. Atmospheric Environment, 39(13), 2399–2409. 10.1016/j.atmosenv.2004.06.049 [DOI] [Google Scholar]
  40. Keim, B. D. , Meeker, L. D. , & Slater, J. F. (2005). Manual synoptic climate classification for the east coast of New England (USA) with an application to PM2.5 concentration. Climate Research, 28, 143–153. 10.3354/cr028143 [DOI] [Google Scholar]
  41. Kelp, M. M. , Lin, S. , Kutz, J. N. , & Mickley, L. J. (2022). A new approach for determining optimal placement of PM2.5 air quality sensors: Case study for the contiguous United States. Environmental Research Letters, 17(3), 034034. 10.1088/1748-9326/ac548f [DOI] [Google Scholar]
  42. Kerr, G. H. , Goldberg, D. L. , & Anenberg, S. C. (2021). COVID‐19 pandemic reveals persistent disparities in nitrogen dioxide pollution. Proceedings of the National Academy of Sciences of the United States of America, 118(30), e2022409118. 10.1073/pnas.2022409118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kramer, A. L. , Liu, J. , Li, L. , Connolly, R. , Barbato, M. , & Zhu, Y. (2023). Environmental justice analysis of wildfire‐related PM2.5 exposure using low‐cost sensors in California. Science of the Total Environment, 856, 159218. 10.1016/j.scitotenv.2022.159218 [DOI] [PubMed] [Google Scholar]
  44. Krieg, E. J. (2005). Race and environmental justice in Buffalo, NY: A ZIP code and historical analysis of ecological hazards. Society & Natural Resources, 18(3), 199–213. 10.1080/08941920590908015 [DOI] [Google Scholar]
  45. Kutz, J. N. , Fu, X. , & Brunton, S. L. (2016). Multiresolution dynamic mode decomposition. SIAM Journal on Applied Dynamical Systems, 15(2), 713–735. 10.1137/15M1023543 [DOI] [Google Scholar]
  46. Lane, H. M. , Morello‐Frosch, R. , Marshall, J. D. , & Apte, J. S. (2022). Historical redlining is associated with present‐day air pollution disparities in U.S. Cities. Environmental Science & Technology Letters, 9(4), 345–350. 10.1021/acs.estlett.1c01012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Larkin, A. , Geddes, J. A. , Martin, R. V. , Xiao, Q. , Liu, Y. , Marshall, J. D. , et al. (2017). A global land use regression model for nitrogen dioxide air pollution. Environmental Science & Technology, 51(12), 6957–6964. 10.1021/acs.est.7b01148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lee, J. H. , & Hopke, P. K. (2006). Apportioning sources of PM2.5 in St. Louis, MO using speciation trends network data. Atmospheric Environment, Particulate Matter Supersites Program and Related Studies, 40, 360–377. 10.1016/j.atmosenv.2005.11.074 [DOI] [Google Scholar]
  49. Lee, J. H. , Hopke, P. K. , & Turner, J. R. (2006). Source identification of airborne PM2.5 at the St. Louis‐Midwest Supersite. Journal of Geophysical Research, 111(D10), D10S10. 10.1029/2005JD006329 [DOI] [Google Scholar]
  50. Liang, Y. , Sengupta, D. , Campmier, M. J. , Lunderberg, D. M. , Apte, J. S. , & Goldstein, A. H. (2021). Wildfire smoke impacts on indoor air quality assessed using crowdsourced data in California. Proceedings of the National Academy of Sciences of the United States of America, 118(36), e2106478118. 10.1073/pnas.2106478118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Linder, S. H. , Marko, D. , & Sexton, K. (2008). Cumulative cancer risk from air pollution in Houston: Disparities in risk burden and social disadvantage. Environmental Science & Technology, 42(12), 4312–4322. 10.1021/es072042u [DOI] [PubMed] [Google Scholar]
  52. Liu, J. , Clark, L. P. , Bechle, M. J. , Hajat, A. , Kim, S.‐Y. , Robinson, A. L. , et al. (2021). Disparities in air pollution exposure in the United States by race/ethnicity and income, 1990–2010. Environmental Health Perspectives, 129(12), 127005. 10.1289/EHP8584 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Liu, Y. , Paciorek, C. J. , & Koutrakis, P. (2009). Estimating regional spatial and temporal variability of PM2.5 concentrations using satellite data, meteorology, and land use information. Environmental Health Perspectives, 117(6), 886–892. 10.1289/ehp.0800123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Lu, T. , Marshall, J. D. , Zhang, W. , Hystad, P. , Kim, S.‐Y. , Bechle, M. J. , et al. (2021). National empirical models of air pollution using microscale measures of the urban environment. Environmental Science & Technology, 55(22), 15519–15530. 10.1021/acs.est.1c04047 [DOI] [PubMed] [Google Scholar]
  55. Manohar, K. , Brunton, B. W. , Kutz, J. N. , & Brunton, S. L. (2018). Data‐driven sparse sensor placement for reconstruction: Demonstrating the benefits of exploiting known patterns. IEEE Control Systems Magazine, 38, 63–86. 10.1109/MCS.2018.2810460 [DOI] [Google Scholar]
  56. Manohar, K. , Kaiser, E. , Brunton, S. L. , & Kutz, J. N. (2019). Optimized sampling for multiscale dynamics. Multiscale Modeling and Simulation, 17(1), 117–136. 10.1137/17M1162366 [DOI] [Google Scholar]
  57. Marcantonio, R. A. , Golub, A. , Karner, A. , & Nelson Dyhle, L. (2017). Confronting inequality in metropolitan regions: Realizing the promise of civil rights and environmental justice in metropolitan transportation planning colloquium: Getting there from here: An exploration of regionalism and transportation in the United States. Fordham Urban Law Journal, 44, 1017–1078. [Google Scholar]
  58. Marlier, M. E. , Brenner, K. I. , Liu, J. C. , Mickley, L. J. , Raby, S. , James, E. , et al. (2022). Exposure of agricultural workers in California to wildfire smoke under past and future climate conditions. Environmental Research Letters, 17(9), 094045. 10.1088/1748-9326/ac8c58 [DOI] [Google Scholar]
  59. May, N. W. , Dixon, C. , & Jaffe, D. A. (2021). Impact of wildfire smoke events on indoor air quality and evaluation of a low‐cost filtration method. Aerosol and Air Quality Research, 21, 210046. 10.4209/aaqr.210046 [DOI] [Google Scholar]
  60. McDuffie, E. E. , Martin, R. V. , Spadaro, J. V. , Burnett, R. , Smith, S. J. , O’Rourke, P. , et al. (2021). Source sector and fuel contributions to ambient PM2.5 and attributable mortality across multiple spatial scales. Nature Communications, 12(1), 3594. 10.1038/s41467-021-23853-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Mikati, I. , Benson, A. F. , Luben, T. J. , Sacks, J. D. , & Richmond‐Bryant, J. (2018). Disparities in distribution of particulate matter emission sources by race and poverty status. American Journal of Public Health, 108(4), 480–485. 10.2105/AJPH.2017.304297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Miranda, M. L. , Edwards, S. E. , Keating, M. H. , & Paul, C. J. (2011). Making the environmental justice grade: The relative burden of air pollution exposure in the United States. International Journal of Environmental Research and Public Health, 8(6), 1755–1771. 10.3390/ijerph8061755 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Mousavi, A. , Yuan, Y. , Masri, S. , Barta, G. , & Wu, J. (2021). Impact of 4th of July fireworks on spatiotemporal PM2.5 concentrations in California based on the PurpleAir sensor network: Implications for policy and environmental justice. International Journal of Environmental Research and Public Health, 18(11), 5735. 10.3390/ijerph18115735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Murray, C. J. L. , Aravkin, A. Y. , Zheng, P. , Abbafati, C. , Abbas, K. M. , Abbasi‐Kangevari, M. , et al. (2020). Global burden of 87 risk factors in 204 countries and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. The Lancet, 396(10258), 1223–1249. 10.1016/S0140-6736(20)30752-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Newman, R. S. (2016). Love Canal: A toxic history from colonial times to the present. Oxford University Press. [Google Scholar]
  66. Phillips, A. , Hung, Y.‐T. , & Bosela, P. (2007). Love Canal tragedy. Journal of Performance of Constructed Facilities, 21(4), 313–319. 10.1061/(ASCE)0887-3828(2007)21:4(313) [DOI] [Google Scholar]
  67. Previdi, M. , & Fiore, A. M. (2019). The importance of sampling variability in assessments of ENSO‐PM2.5 relationships: A case study for the South Central United States. Geophysical Research Letters, 46(12), 6878–6884. 10.1029/2019GL082250 [DOI] [Google Scholar]
  68. Proctor, J. L. , Brunton, S. L. , & Kutz, J. N. (2014). Dynamic mode decomposition with control. 10.48550/arXiv.1409.6358 [DOI]
  69. Sansom, G. , Parras, J. , Parras, A. , Nieto, Y. , Arellano, Y. , Berke, P. , et al. (2017). The impacts of exposure to environmental risk on physical and mental health in a small geographic community in Houston, TX. Journal of Community Health, 42(4), 813–818. 10.1007/s10900-017-0322-y [DOI] [PubMed] [Google Scholar]
  70. Shen, L. , Mickley, L. J. , Leibensperger, E. M. , & Li, M. (2017). Strong dependence of U.S. Summertime air quality on the decadal variability of Atlantic sea surface temperatures. Geophysical Research Letters, 44(24), 12527–12535. 10.1002/2017GL075905 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Snyder, E. G. , Watkins, T. H. , Solomon, P. A. , Thoma, E. D. , Williams, R. W. , Hagler, G. S. W. , et al. (2013). The changing paradigm of air pollution monitoring. Environmental Science & Technology, 47(20), 11369–11377. 10.1021/es4022602 [DOI] [PubMed] [Google Scholar]
  72. Sorensen, A. E. , Jordan, R. C. , LaDeau, S. L. , Biehler, D. , Wilson, S. , Pitas, J.‐H. , & Leisnham, P. T. (2019). Reflecting on efforts to design an inclusive citizen science project in West Baltimore. Citizen Science: Theory and Practice, 4(1), 13. 10.5334/cstp.170 [DOI] [Google Scholar]
  73. Spak, S. N. , & Holloway, T. (2009). Seasonality of speciated aerosol transport over the Great Lakes region. Journal of Geophysical Research, 114(D8), D08302. 10.1029/2008JD010598 [DOI] [Google Scholar]
  74. Sun, C. , Li, V. O. K. , Lam, J. C. K. , & Leslie, I. (2019). Optimal citizen‐centric sensor placement for air quality monitoring: A case study of city of Cambridge, the United Kingdom. IEEE Access, 7, 47390–47400. 10.1109/ACCESS.2019.2909111 [DOI] [Google Scholar]
  75. Tai, A. P. K. , Mickley, L. J. , Jacob, D. J. , Leibensperger, E. M. , Zhang, L. , Fisher, J. A. , & Pye, H. O. T. (2012). Meteorological modes of variability for fine particulate matter (PM2.5) air quality in the United States: Implications for PM2.5 sensitivity to climate change. Atmospheric Chemistry and Physics, 12(6), 3131–3145. 10.5194/acp-12-3131-2012 [DOI] [Google Scholar]
  76. Tessum, C. W. , Paolella, D. A. , Chambliss, S. E. , Apte, J. S. , Hill, J. D. , & Marshall, J. D. (2021). PM2.5 polluters disproportionately and systemically affect people of color in the United States. Science Advances, 7(18), eabf4491. 10.1126/sciadv.abf4491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Tubridy, F. , Mölter, A. , Lennon, M. , & Pilla, F. (2022). Citizen science and environmental justice: Exploring contradictory outcomes through a case study of air quality monitoring in Dublin. Local Environment, 27(5), 622–638. 10.1080/13549839.2022.2068143 [DOI] [Google Scholar]
  78. US EPA . (2020). Ambient air monitoring quality assurance guidance documents. Retrieved from https://www.epa.gov/amtic/ambient-air-monitoring-quality-assurance-guidance-documents
  79. US EPA . (2022). ARP enhanced air quality monitoring for communities—Competitive grant 2022. Retrieved from https://www.epa.gov/arp/arp-enhanced-air-quality-monitoring-communities-competitive-grant
  80. Van Horne, Y. O. , Alcala, C. S. , Peltier, R. E. , Quintana, P. J. E. , Seto, E. , Gonzales, M. , et al. (2022). An applied environmental justice framework for exposure science. Journal of Exposure Science and Environmental Epidemiology, 33, 1–11. 10.1038/s41370-022-00422-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Wang, G. , Hopke, P. , & Fu, G. (2009). Identification of major sources of PM2.5 . Journal of Ocean University of China, 8(2), 101–110. 10.1007/s11802-009-0101-1 [DOI] [Google Scholar]
  82. Wang, Y. , Apte, J. S. , Hill, J. D. , Ivey, C. E. , Patterson, R. F. , Robinson, A. L. , et al. (2022). Location‐specific strategies for eliminating US national racial‐ethnic PM2.5 exposure inequality. Proceedings of the National Academy of Sciences of the United States of America, 119(44), e2205548119. 10.1073/pnas.2205548119 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information S1

Data Availability Statement

The mrDMD algorithm and sensor network locations that support the findings of this study are openly available at the following URL/DOI: https://doi.org/10.5281/zenodo.7686016.


Articles from GeoHealth are provided here courtesy of Wiley

RESOURCES