Abstract
Background:
Crowd-sourced traffic data potentially allow prediction of traffic-related air pollution at high temporal and spatial resolution.
Objectives:
To examine associations (1) of radar-based traffic measurements with congestion colors displayed on crowd-sourced traffic data maps and (2) of black carbon (BC) levels with radar and crowd-sourced traffic data.
Methods:
At an off-ramp of an interstate and a small one-way street in a mixed-use area in New York City, we used radar devices to obtain vehicle speeds and flows (hourly counts) for cars and trucks. At these radar sites and at an additional non-radar equipped site at a 2-way street, we monitored BC levels using aethalometers in the summer and early fall of 2017. At all three sites, free-flow traffic conditions typically did not occur due to the nearby presence of traffic lights and forced turns. We also downloaded real-time traffic maps from a crowd-sourced traffic data provider and assigned an ordinal integer congestion color code CCC to the congestion colors, ranging from 1 (dark red) to 5 (gray).
Results:
CCC increased with vehicle speed. Traffic flow was highest for intermediate speeds and intermediate CCC. Regression analyses showed that BC levels increased with either segregated or total vehicle flows. At the off-ramp, time-dependent BC levels can be inferred from time-dependent CCC and radar-derived mean vehicle flow data. A unit decrease in CCC for a mean traffic flow of 100 vehicles/hr was associated with a mean (95% CI) increase in BC levels of 0.023 (0.028, 0.018) μg/m3. At the small 1-way and the 2-way street, BC levels were also negatively associated with CCC, though at a >0.05 significance level.
Conclusions:
Use of inexpensive crowd-sourced traffic data holds great promise in air pollution modeling and health studies. Time-dependent traffic-related primary air pollution levels may be inferred from radar-calibrated crowd-sourced traffic data, in our case radar-derived mean traffic flow and widely available CCC data. However, at some locations mean traffic flow data may already be available.
1. INTRODUCTION
Combustion-engine powered vehicles significantly contribute to traffic-related air pollution (TRAP).1 TRAP includes a wide variety of toxic chemicals, including black carbon (BC), polycyclic aromatic hydrocarbons (PAHs), carbon monoxide, nitrogen oxides, sulfur oxides, particulate matter (PM), and BTEX (benzene, toluene, ethylbenzene and xylenes) volatile organic compounds (VOCs). TRAP has been linked to multiple adverse health outcomes including respiratory2 and cardiovascular disease,3,4 cancer,5 and is considered an ischemic stroke trigger.6,7
The distance between a location of interest (e.g. home address) and roadways can serve as proxy for air pollution8–10 and has been linked to adverse health outcomes.11,12 Land-use regression models (LURs) have also been used to predict air pollution1,13–17 and associated health outcomes.18 However, neither approach accounts for temporal variations in traffic. Accounting for spatio-temporal variations in air pollution at an hourly resolution may be important to assessing acute health effects due to differences in exposure to TRAP at different times of day and across a city. For instance, a recent study suggested that ischemic stroke onset risk was highest when the exposure to TRAP, in this case via BC (often used as a tracer for diesel truck emissions19), occurred 12 to 14 hours prior to the event.6
Key to predicting temporal variations in TRAP is the measurement of temporal variations in traffic conditions. Traditional technologies for counting vehicles in-situ20 include pneumatic tube counters,19,21,22 piezo-electric sensors, and inductive loops laid on roadways.23,24 More recently, video-camera based systems25,26 and radar-based systems27,28 have become available. These systems can provide information about vehicle speed and “traffic flow” (vehicle counts per unit time29) and have been used to examine the impacts of traffic on local hourly air pollution levels.19,22,30–34
Evaluation of data obtained from GPS-enabled mobile phones collocated in driving vehicles offers the opportunity of inexpensive, real-time traffic monitoring of entire street networks,35 which cannot be done with traditional traffic monitors. Traffic maps based on such crowd-sourced data are nowadays broadcasted in virtually real time via the Internet. There is great potential of using such data in air pollution studies. For instance, Google traffic travel time data along five routes were recently used to examine the effects of high-occupancy vehicle restrictions.36 In this study, our objectives are (1) to examine associations of congestion colors displayed on crowd-sourced traffic data maps with radar-based measurements of traffic, and (2) to develop statistical models based on regression analysis for the mobile-source contributions to air pollution levels that account for measured traffic and meteorological data. Such models could potentially allow estimating the effects of additional traffic or traffic interventions on air pollution. While some of the models we will develop account for radar data (for comparison with other studies19,22,37), it was our primary objective to examine whether mobile-source contributions to air pollution can be inferred from crowd-sourced traffic data and mean traffic flow.
2. METHODS
2.1. Study sites
Data collection occurred in the summer and early fall of 2017 in the Mott Haven/Port Morris neighborhoods in the South Bronx in New York City, NY, where we conducted a pilot study to examine associations between truck traffic and air pollution. Vehicles were counted at two locations impacted by truck traffic. It would have been desirable to count at additional locations, but the project budget did not allow for this. To minimize measurement error, we installed the radar devices at locations with at most one-lane traffic on each of the two sides of the device, thereby avoiding vehicle occlusion.38 The devices were attached to street light poles with approval from the Department of Transportation (DOT). To examine contrasting traffic and pollution patterns, the two locations (see Figure A.1) differed in average daily vehicle counts:
Site A: An exit ramp from US Interstate I-87. The radar device provides the combined flow of one-way traffic on the ramp as it gently slopes uphill (to the North of the counter) and one-way traffic on East 134th Street (to the South). For the most part, East 134th Street appears to serve its relatively few residents (in row homes) living in front of the traffic counting location, while the ramp traffic serves huge industrial hubs including the Harlem River Yard and Hunts Point. Field visits confirm that eastbound traffic on East 134th Street at the radar location is negligible; as such, we interpret the radar counts as only representing the exit ramp. Traffic flow was affected by a downstream traffic light at the intersection between Bruckner Blvd. and Brown Place. A BC Aethalometer (AE21, Magee Scientific) was located in a residential building on the South side of East 134th Street. The air inlet was located 4 m above street level and 6 m from the curb, which is abutted by a lane typically used for residential parking.
-
Site B: A one-way street, which primarily serves vehicles visiting industrial areas in the Harlem River Yards and in Hunts Point. At the end of the block, the street leads to a T-intersection where vehicles need to make either a left or right turn. A BC micro-aethalometer monitor (MA350, AethLabs) was located on the flat roof of a commercially used warehouse on the East side of Willow Ave. inside of an environmental chamber that had Peltier heating and air-conditioning to keep the temperature inside the box at a constant 32.8°C. The air inlet was located 9 m above street level and 8 m from the curb. The roof was enclosed by an about 1-m tall parapet wall.
In addition, BC measurements were performed at a site where no radar was installed.
Site C: A two-way street, which serves vehicles visiting industrial areas in the Harlem River Yards. Traffic flow was affected by traffic lights at the intersection between St. Ann’s Ave. and Bruckner Blvd. and the intersection between St. Ann’s Ave and 134th St. A BC Aethalometer (AE16, Magee Scientific) was located in a residential building on the East side of St. Ann’s Ave. The air inlet was located 8 m above street level and 12 m from the curb. Average daily traffic counts for a location on St. Ann’s Ave to the North of Site C were obtained from DOT: 6,863 vehicles per day.39
All three sites typically did not allow drivers to select their own speed due to the presence of traffic lights or a downstream T-intersection. Therefore, conditions of free-flow traffic were typically not met. Also, both rack-mounted aethalometer and MA350 instruments use the same approach for BC measurements (transmission/adsorption at 880 nm) and collocation experiments have shown that they provide very similar concentrations (as quantified by the slope-intercept line40,41) with high correlation (R2 > 0.95).
The study was approved by the Columbia University Institutional Review Board and informed was consent obtained; participants were identified with the assistance of our community partner.
2.2. Radar-based traffic counting
We used radar devices (Armadillo traffic counter, Houston Radar) to make physical measurements of traffic conditions. The devices measure vehicle speed and traffic flow. As compared to pneumatic counters, radar devices have the advantage that they do not require study personnel to walk onto roads when installing the devices and are not damaged by street cleaners, thus significantly reducing maintenance and risk of injury.
For each detected vehicle, the radar device records the following data: time and date of the event, radar-derived vehicle speed (RS), and vehicle class (small, medium or large vehicle). According to the manufacturer, the accuracy of speed measurements is 0.4%, and up to 97% of vehicles are counted for the local road configurations for typical free flowing traffic, with worse accuracy for stop and go traffic42 as is typical for many other traffic monitoring techniques including pneumatic counters and video-based systems. The manufacturer indicates that vehicles with a length <4 m are counted as small vehicles, between 4 and 7 m as medium vehicle, and >7 m as large vehicles. In this paper, we interpret small- and medium-sized vehicles as “cars” and large vehicles as “trucks.” From the event logs, one can determine total traffic flow Qtot, truck flow Qtr and car flow Qcar.
2.3. Collection of crowd-sourced traffic data
During the course of our field study, we used a Unix “cron” job to automatically download crowd-sourced traffic data maps of the study area from a leading provider (with no service contract) onto a personal computer every minute and store them as digital images. In the maps, road segments are shown in different colors (green, orange, light and dark red) indicating speeds of vehicles and collocated phones. The bounding colors green and dark red indicate relatively free-flowing and congested traffic, respectively. Gray is displayed if not a sufficient number of drivers supply their GPS position to the traffic data provider via their cell phones. For each road segment for which a congestion color had been assigned, we converted the five colors into an ordinal variable, the congestion color code CCC, as follows:
where the color of the square box indicates the congestion color displayed on the traffic maps and the number in the box indicates CCC. Since downloaded traffic maps were “true color” with 16.8 million possible colors, we converted the true-color images into indexed images with a total of 128 colors such that different shades of the five identified colors were reduced to a total of five colors.
Assuming that the fraction of GPS-enabled cell phones in moving traffic is independent of the congestion color, our color scheme assumes gray to be indicative of little traffic, allowing drivers to proceed at highest speeds. Thus CCC should increase monotonically with vehicle speed. By determining CCC for the road segments where the air pollution monitors and/or radar devices were mounted, we obtained CCC time series at a 1-minute resolution. The road segments used to characterize crowd-sourced traffic conditions at Sites A, B and C are indicated in Figure 1a and were 110, 80 and 80 m long, respectively.
2.4. Sampling periods
At Site A, data collection occurred from June 19 to October 11, 2017. Two major gaps occurred in the BC data: (1) between August 8 to 17, because the aethalometer ran out of filter tape; and (2) between September 3 and 27, because the aethalometer pump broke down. One major gap occurred in the radar data: between October 3 and 13 due to memory overflow of the data logger in the radar device. At Site B, data collection occurred from July 31 to October 11, 2017. No major gaps in BC and radar data occurred. At Site C, data collection occurred from June 27 to October 15, 2017. Radar data were not collected, and a major gap in BC data collection occurred from August 22 to September 20 due to issues related to advancing the aethalometer’s filter tape. At Site A, complete data sets (BC, radar and CCC data) were collected for a total of 70 days, and at Site B for 71 days. At Site C, complete data sets (BC and CCC data) were collected for 80 days. Figure A.2 illustrates sampling periods and data gaps.
2.5. Meteorological data
Hourly surface weather data (temperature, wind speed and direction, relative humidity, precipitation depth) were obtained for the nearby LaGuardia airport (~3 miles away) from the US National Climatic Data Center.43 Upper air radiosonde data were obtained from the US National Oceanic and Atmospheric Administration and the US Earth System Research Laboratory44 for the closest out of the three radiosonde station in the state of New York: Brookhaven, NY which is ~50 miles away. While the regression coefficients for mixing height of the models for BC we developed could potentially be biased due to this 50-mile distance, we do not believe that this has a major impact on the regression coefficients for traffic variables that our paper presents. From the surface and upper air data, the mixing layer height was estimated each hour using AERMET, software developed by the U.S. Environmental Protection Agency.45
2.6. Data synthesis and descriptive analyses
Different types of data were collected at different temporal resolutions. We normalized the data to 1-min, 5-min, 15-min and 60-min observational windows22 as explained in Appendix A. To explore diurnal variations in traffic conditions and pollution levels, we plotted for each hour of the day mean hourly BC levels and total traffic flow Qtot averaged over the entire sampling period and stratified by weekdays and weekends. We used box plots to explore two kinds of relationships between pairs of 15-min time-dependent variables: (1) between two out of the five traffic variables (RS, Qtot, Qtr, Qcar, CCC) to examine how radar measurements are related to crowd-sourced traffic data, and (2) between BC levels and one traffic variable to get clues about which traffic conditions are related to low and high BC levels, even though these relationships might be confounded, e.g., by meteorological variables.
Since traffic maps cannot be expected to reflect changes in traffic conditions instantaneously (due to data transfer and analysis) we determined the time lag tlag by calculating the cross-correlation46 between radar speed RS(t) and color code CCC(t) time series for a 1-min observational window, the temporal resolution of the CCC data. The lag tlag is given by the τ value, for which the absolute value of is maximal. Matlab™’s “finddelay” routine was used to determine tlag.
2.7. Statistical analyses
To examine how BC levels are affected by one or more traffic variables, we employed generalized additive models (GAMs)47 using the 15-min time series of the measured traffic, air pollution and meteorological data. In our study, five different time-dependent traffic variables could be used to infer BC levels: segregated traffic flows Qcar(t) and Qtr(t), total traffic flow Qtot(t) including its mean representing the average over the entire sampling period, radar speed RS(t), and color code CCC(t). Not all five traffic variables were used at the same time, because they may contain redundant information (an extreme example for redundancy is caused by the relation Qcar + Qtr = Qtot). Even though we were primarily interested in examining whether BC levels can be inferred from CCC, we fitted the following six regression models to the measured data so that we could compare radar-derived regressions to other published studies:
where the constant BC0 = 1 μg/m3 makes sure the equation is dimensionally correct. The first model (Eq. 1) is the only model that accounts for fleet composition by using all the radar data collected, i.e., segregated flows and radar-derived vehicle speed. The second and third models (Eq. 2 and 3) account only for total flow and one measure for vehicle speed, either RS or CCC. Comparing the two models allows evaluating the potential equivalency of RS and CCC data. The fourth model (Eq. 4) represents the conventional approach for modeling time-dependent air pollution and accounts only for time-dependent vehicle flow data.22 The fifth model (Eq. 5) allows examining our primary conjecture that BC levels can be predicted from crowd-sourced traffic data. The sixth model (Eq. 6) was added to evaluate again the potential equivalency of RS and CCC data.
Even though BC should be expected to scale linearly with traffic flows, we log-transformed the BC data to improve both normality and to resolve heteroscedasticity issues. Thus the interpretation of the α’s became less straightforward. However, in case of the products αQ < 1 (with subscripts car, tr, tot), the regression coefficients αcar, αtr and αtot represent the mobile-source contributions to BC levels, quantifying the increase in BC levels due to an increase in traffic flow, either of all vehicles or segregated (cars vs. trucks). Similarly, the regression coefficients βrad and βCCC can be interpreted as speed slope factors, which describe the change in BC levels due to a change in vehicle speed (if flow remains constant). Both the βrad and βCCC coefficients were multiplied by to improve comparability between sites with different mean flows, where for Sites A and B was derived from our radar measurements while for Site C the DOT estimate was used. Units of βrad and βCCC are chosen such that the numerical value represents the change in BC levels in units of μg/m3 if the vehicle speed changes by 10 mph and CCC changes by 1, respectively, for a traffic flow of 100 vehicles per hour.
The regression models account explicitly for meteorological variables through the Met(t) term, because meteorological conditions could potentially confound the relationship the relationship between BC levels and CCC, which would in turn lead to biased estimates of the traffic-related regression coefficients. Furthermore, even if meteorological variables do not confound the CCC–BC association,48,49 including them in the regression model can help account for some of the error variance, and doing so can increase the precision of the parameter estimates for the traffic-related regression coefficients. Similar to previous studies on TRAP,22,50,51 we assumed that the Met. term could be represented by the following linear model:
(7) |
where H is the height of the mixing layer, WS is wind speed, T is temperature, RH is relative humidity, PD is precipitation depth, and WDi is a categorical wind direction. T and RH are accounted for, because they affect the size distribution of vehicle exhaust particles and hence their subsequent fate in the atmospheric environment.52 Precipitation was accounted for, because it may affect traffic density and BC levels through washout of air particulate matter.53 Wind speed WS was accounted for, because it affects atmospheric dispersion of BC. Wind direction was accounted for because it can affect TRAP.54–56 Continuous meteorological variables were excluded from a regression specified by site ID (A, B, C) and model number (1, 2, …, 6) if their significance levels were >0.05. The categorical predictor variable WD was excluded if an ANOVA test showed that regressions results obtained with and without WD did not differ from one another at a level >=0.05.
Unmeasured confounding factors such as vehicle BC emissions from roads surrounding each site, maritime traffic, domestic and industrial sources, and wider urban BC inputs are accounted for by the penalized spline s(t). We examined the dependence of the regression coefficients on the number of knots in the spline function s(t) used to represent contributions from these additional BC sources. To illustrate the temporal variations of these contributions, we determined for each site “background” BC levels, which we define here as the levels that would occur if traffic at the site was not present or constant. These levels can be predicted with the fitted regression model by setting the flows to zero (Qtot = Qtr = Qcar = 0), vehicle speed variables (RS, CCC) to their medians, and maintaining the values of meteorological prediction variables.
GAM models generally assume that errors are mutually independent, which is not necessarily true. Due to expected autocorrelation in the time series of the model residuals,22 our regressions accounted for autoregressive (AR) errors. The AR(1) model was accounted for by the “bam” regression routine (R’s mgcv library); bam requires specifying a correlation parameter “rho” which we estimated from the lag-1 autocorrelation of the time series of model residuals obtained by first running a regression without accounting for AR errors. Moreover, we performed regressions using mgcv’s “gamm” routine which can account for AR(n) models. Unlike bam, gamm does not require specifying correlation parameters. We performed regressions for lag n up to 3. Since our measured time series were interrupted (e.g., due to data downloads from the monitors or equipment failures), the autocorrelation analyses excluded these data gaps.
We compared the bam regression routine, which accounts for AR(1) errors, to the gamm routine, which can account for AR(n) errors. For the gamm routine, we found that the autocorrelation between model residuals decreased substantially for increasing values of n. However, background BC levels could not be represented by the spline s(t) in a meaningful way. Even though we started the regression with 1 knot per measurement day, the final estimated degree of freedom (EDF) of the spline s(t) was very small, much less than 10, for n >= 2 close to 1. Therefore, the gamm regression could not account for confounding factors. At the same time, the explained variation R2 was very small and decreased with increasing order n, e.g., to 1% for n = 3. In contrast, the bam routine substantially reduced the autocorrelation of model residuals while retaining the capability of modeling confounding background BC levels. Our findings are consistent with the mgcv documentation (page 90 in57) which states that gamm is not quite as numerically robust as gam (bam being a variation of gam). We therefore used the bam routine in the final regressions.
For the spline function s(t), we chose a number of knots for which the regression coefficients for the traffic variables did not markedly change.58 This was fulfilled for 1 spline knot per day of data collection. Thus BC variations due to diurnal variations in traffic and meteorological conditions were not smoothed out. To support our choice, we varied the number of knots for each final regression (with insignificant meteorological regression variables removed) and examined the dependence of regression coefficients on the EDF of s(t).
Analyses were conducted using the R Statistical Software, version 3.4.3 (Foundation for Statistical Computing, Vienna, Austria).
3. RESULTS
3.1. Illustration of traffic data
Figure 1a shows an example of one of the crowd-sourced maps we downloaded every minute. Using these data, time series of the congestion color code CCC can be developed for the entire street network shown in the map. In this work, we only analyzed traffic at the three locations with aethalometers. Figure 1b shows the times series of CCC for Site B collected in July 2017, a subset of all data collected. Diurnal fluctuations in traffic conditions as well as differences between weekdays and weekends can be easily discerned with this type of plot that synthesizes the crowd-sourced traffic data for a single road segment.
3.2. Temporal variations
Figure 2 shows average traffic flow and BC levels versus the hour of the day, stratified by weekdays and weekends. At the two sites where radar measurements were performed (Sites A and B), total traffic flow Qtot peaked at about 6–7 am and 3–4 pm on weekdays, the rush hours. Truck flow Qtr peaked from 4–8 am and from 2–3 pm with truck flow having been higher in the afternoon rush hour. On weekends, traffic was overall less, particularly truck traffic.
BC levels did not entirely follow traffic flow. At all three sites, BC levels peaked in the morning between 6–8 am, the time window in which total vehicle and truck flows peaked at Sites A and B. However, BC levels did not peak during the afternoon rush hour. Instead, BC levels decreased almost monotonically after 8 am, even though traffic picked up at 9 am at Sites A and B. Only after 6 pm did BC levels increase again.
To examine potential correlations between BC peaks and short-term traffic increases, we show in Figure A.3 sample BC levels and vehicle flows versus time. The plots reflect features of the average diurnal data from Figure 2. BC levels rose in the early morning when traffic picked up and then decreased until 6 pm, even though traffic remained about constant until 6 pm. BC peaks were not always accompanied by peaks in traffic flow. For instance at Site A, the maximum BC level was observed at 10 am. At Site B, BC levels rose in the early morning when traffic picked up and then decreased until about 4 pm. At that point in time, BC levels increased about 1.5-fold, and total traffic flow peaked at the same time. However, BC peaks are again not always associated with traffic flow peaks, e.g., at 7 am.
The lag between congestion color code CCC and radar speed RS was tlag = 4 minutes at Site A and tlag = 2 minutes at Site B.
3.3. Relationships between traffic variables
Site A: Figure 3a shows the four box plots for the four distributions of radar speed RS for all those 15-min observational windows when the rounded color code CCC was 1, 2, 3 and 4. Color code CCC = 5 was not observed. The median vehicle speed is a strictly monotonically increasing function of CCC and the IQRs almost non-overlap. Figure 3c illustrates the distributions of vehicle flow for the four rounded congestion color codes CCC. Median traffic flow was maximum for CCC = 3. Traffic flow decreased monotonically when CCC decreased from 3 to 2 and then to 1. Figure 3e shows the distribution of vehicle flow when the radar-derived vehicle speed RS fell into the five equal-width bins between the minimum and maximum observed radar speed. The relationship resembles the one between vehicle flow and CCC (Figure 3c), because RS is a proxy for CCC.
Site B: For the most part, the median radar speed RS increases monotonically with CCC, except from CCC = 1 to 2 (Figure 3b); however, the overlapping notches indicate that the observed decrease is statistically not significant. Moving from CCC = 2 to 4, the rate of increase is quite small. The relationships between measured vehicle speeds, i.e., either radar speed RS or color code CCC, and vehicle flow (Figures 3d and 3f) resemble those of Site A. Traffic flow is highest for intermediate vehicle speeds. However, median vehicles speeds for the five CCC values were much lower than at Site A.
3.4. Relationships between black carbon and single traffic variables
Site A: Figure 4a shows the five boxplots for the distribution of BC levels observed when total vehicle flows Qtot fell into the five vehicle flow bins between the minimum and maximum of Qtot. Median BC levels increase monotonically with traffic flow, i.e., on average more traffic as quantified by traffic flow results in higher TRAP. Figure 4c shows five boxplots for the distribution of BC levels observed when radar speed was divided into five equal-width bins between the minimum and maximum observed speed. Median BC levels decrease monotonically as radar speed RS increases. Figure 4e reflects the findings from Figure 4c if one replaces in the discussion RS by its proxy CCC.
Site B: BC levels tended to increase monotonically with total vehicle flow Qtot, except for the 4th Qtot bin, for which BC levels are smaller than for the 3rd and 5th bin (Figure 4b). Due to the small sample size (n=3), it is not clear whether the 5th bin has higher BC levels than the third bin. Median BC levels tend to increase with radar-derived vehicle speed, even though overlapping notches indicate that the increase may not be statistically significant for the first four radar speed bins (Figures 4d). And BC levels appear to be highest for the highest radar-speed bin (the notches of the 4th and 5th bins almost overlap reducing the statistical significance of the difference). Similarly, BC levels are higher for CCC = 5 than for 3 and 4 (Figures 4f). Interestingly, BC levels are highest for CCC=1 but lowest for the 1st radar-speed bin.
Site C: BC levels reflect features from Site A in that median levels decrease with CCC and that CCC = 5 was not observed (Figure 4g).
3.5. Regression analyses for black carbon
The regressions for Sites A and B that accounted for temporally resolved radar-derived traffic flows (Models 1 to 4) yielded similar results. The regression coefficients for the flows are all highly significant (p < 0.001) and positive, i.e., additional traffic flow causes BC levels to increase. Moreover, for each site the αtot coefficients fitted for Models 2 to 4 are similar in that their confidence intervals overlap. Furthermore, for each model (1 to 4), regression coefficients obtained for each of the two sites are similar as their confidence intervals overlap. At both sites, emission factors are about 3 to 4 times higher for trucks than for cars. For Site A, accounting for segregated traffic flow yields an R2 value that is 0.8% higher as a comparison of Models 1 and 2 shows. For Site B, accounting for segregated flows did not result in an improved model fit. Another important difference between Sites A and B is that for Site A regression coefficients for vehicle speed as quantified either by RS or CCC are all highly significant and negative; i.e., an increase in vehicle speed (with traffic flow being held constant) causes BC levels to drop. For Site B, however, only for Model 3 BC levels depend moderately (p < 0.1) on CCC.
For the regression that used CCC as the sole traffic-related predictor variable (Model 5), the βCCC coefficient for Site A was highly significant (p < 0.001). For Site C, βCCC was marginally significant (p < 0.1), whereas for Site B; βCCC was not significant (p = 0.14). The confidence intervals of the βCCC of Sites A and B overlap. The absolute value of the predicted βCCC coefficient for Site C is about 10 times lower than absolute values for Sites A and B.
The R2 values as well as the variation explained by traffic predictor variables, Rtr2, were generally higher for the regressions that accounted for time-dependent traffic flows (Model 1 to 4) than for those that accounted only for a speed variable (Model 5 and 6). For any regression, only 3 out of the 6 meteorological variables were significant predictor variables (Table A.1). At all sites, relative humidity RH and wind direction WD were significant predictors. The fact that WD is a significant predictor variable is supported by the BC roses for the three sites, as the roses show an uneven distribution of the highest BC levels among the eight wind directions (Figure A.5). At Sites A and C, wind speed WS was the third predictor, while at Site B it was mixing height H. Use of either WS or H can be attributed to co-linearity between the two variables that we observed for the three sites during the measurement campaign. We note that regression coefficients including confidence intervals of the traffic predictor variables changed only marginally when regressions for Site B were ran using WS instead of H as a predictor variable. However, since then Rtr2 decreased, we preferred to determine the meteorological predictor variables by iteratively removing the most insignificant ones.
Background BC levels were similar for the three sites as illustrated for Model 6 (Figure 5). Pearson correlation coefficients between background BC levels for different sites are r = 0.88 between Sites A and B, 0.94 between Sites A and C, 0.91 between Sites B and C.
To better characterize the BC – CCC relationship and assess potential deviations from linearity, we also ran regressions that allowed BC levels to depend on CCC through a smooth function s. To support the choice of model, we compared AIC scores59 of the linear and nonlinear regressions. For all three sites, AIC scores of the linear and nonlinear regressions were almost the same. We therefore found no evidence that these nonlinear terms, the parameters of which are also harder to interpret than the speed slope factor βCCC we suggest, improve the fit of our models.
We also performed sensitivity analyses to assess the usefulness of accounting for meteorological co-variates. For regressions that did not account for meteorological variables, absolute values of traffic-related regression coefficients were in most cases slightly smaller suggesting bias due to confounding. Moreover both R2 and Rtr2 decreased significantly. Thus we only present results of regressions that account for meteorological variables.
4. DISCUSSION
4.1. Associations between two traffic variables
Congestion color code CCC was found to increase with radar-derived vehicle speed RS. For Site A, the IQRs of the four box plots are almost non-overlapping, strongly suggesting that CCC is a proxy for vehicle speed. Complete non-overlap between the IQRs cannot be expected, because the traffic maps use smartphone data along the entire road segment, while the radar device provides more of a point measurement of vehicle speed RS. At Site B, the IQRs of RS overlap significantly for adjacent CCC values; however, a multi-comparison test of radar speeds measured for the different CCC values showed that there significant differences in radar speed between different CCC values (with 4 out of the 10 possible comparisons at significance levels p < 0.05). One possible explanation is again related to the fact that radar speed is measured at only one location contrary to CCC. The road segment at Site B lends itself to particularly non-uniform vehicle speed, because it ends at a “T” intersection, where vehicles need to decelerate to make either a left or right turn.
At Sites A and B, traffic flow Qtot was on average highest for intermediate vehicle speeds, either quantified by RS or CCC. Flow decreased when the highest observed CCC value was approached, likely because smaller flow (indicating a lower traffic density) allows for higher vehicle speed. Maximum flow for intermediate vehicle speeds has also been observed in a study of freeway traffic60 and is consistent with speed-flow curves known from traffic engineering.61 Moreover, the observation that traffic flow decreases monotonically as CCC approaches 1 is consistent with the theoretical argument that in the limit where the vehicle speed is zero, flow is zero. Contrary to Site B, CCC = 5 was not observed for Site A, likely because the ramp feeds vehicles from an interstate carrying traffic around the clock into a highly populated residential area and a large industrial complex at the South Bronx waterfront.
At Site B, most of the lowest radar speeds (defined here as < 10 mph) were measured when CCC = 4 and 5 (and not when CCC = 1 to 3) and are in the box plots represented by outliers. The transition from CCC = 4 to 5 (green to gray) appears to be associated with an apparently discontinuous jump in vehicle velocity. This jump might be due to the absence of crowd-sourced traffic data when CCC = 5.
While general trends in the relationships between CCC, RS and Qtot are similar for Sites A and B, the relationships vary dramatically in magnitude indicating local calibration of crowd-sourced traffic data is needed. At Site B, CCC values are associated with smaller vehicle speeds when compared to Site A, likely because at Site B vehicles slowed down when they approached the T-intersection (Figure A.1). Mapping of vehicle speed on CCC is likely related to the maximum achievable speed.
4.2. Temporal BC variations
Diurnal variations in BC are similar for the three sites, with BC levels being high in morning rush hour and much lower in the afternoon rush hour, even though traffic counts are similar in both rush hours (Figure 2). Therefore, traffic conditions do not explain all the variation in BC levels. Likely, temporal variations in BC levels are affected by diurnal variations in meteorological conditions, e.g., increased wind speed peaking in the late afternoon (Figure A.4a).
BC spikes observed for a 5-min observational window were not always associated with increases in flow (Figure A.3). This could potentially be due to emission-rich vehicles or unresolved changes in meteorological conditions (at time scales < 1 hr), e.g., sudden periods of low wind speed and sudden changes in wind direction.
At the two sites, changes in CCC lagged 2 and 4 minutes behind changes in radar speed RS suggesting that traffic provider’s maps are broadcasted with some delay, perhaps because it takes some time to process the collected GPS data or because their algorithms are based on temporal averages of GPS data. This finding also suggests that associations between CCC and other variables should be examined for observational windows greater than the lag, e.g., 15 min, unless the CCC data is shifted in time.
4.3. Associations between BC levels and single traffic variables
Examination of associations between BC levels and one traffic variable suggests that more traffic as quantified by flow tended to cause an increase in BC levels (Figures 4a/b) and that vehicle speed might have also affected BC levels. BC levels do not approach a value of zero as Qtot approaches zero due to BC background levels, e.g., from adjacent roads and wider urban sources, which can be expected to be confounding factors.
At Site A, median BC levels decreased as radar speed RS increased (Figure 4c) and also as CCC increased (Figure 4e). Based on field observations, we believe that vehicles were most of the time slow not because drivers elected to drive slow but because there was a lot of traffic and/or congestion. This could suggest that small vehicle speeds, being a proxy for congestion, caused higher BC levels due to repeated acceleration-related emissions.
At Site B, BC levels were higher for CCC = 5 than for 3 and 4 (Figure 4f). This seems counterintuitive, because CCC = 5 indicates that not many vehicles were present, which should result in only little BC emissions. One potential explanation is related to the fact that CCC = 5 was typically observed at night (Figure 1b) when the atmosphere tended to be stable causing air pollution to be higher, which is also seen in the diurnal variations in BC levels (Figure 2). I.e., the relationship between BC and vehicle speed is confounded by meteorological conditions. This illustrates the utility of crowd-sourced traffic data in the interpretation of air pollution data. However, other factors could have contributed to the higher night-time BC levels, potentially related to non-traffic BC sources.
At Site B, an apparent inconsistency occurs as BC levels are highest for CCC=1 but lowest for the 1st radar-speed bin. This could be attributed to the small sample sizes (n=7 for CCC and n=19 for RS), which do not support the conclusion that BC was higher for CCC=1 than for CCC=2. Moreover, the entire range of radar speeds associated with CCC=1 (12–24 mph according to Figure 3b) does not correspond to the range of the lowest RS bin (5–12 mph according to Figure 4d). In fact, most of these small vehicle speeds were measured when CCC=4 and 5, represented by the outliers in Figure 3b, i.e., when traffic was flowing freely according to the traffic maps. The smallest vehicle speeds could therefore potentially be attributed to drivers who elected to drive slow, perhaps drivers of very large, emission-rich trucks who needed to slow down before approaching the T-intersection.
4.4. Regression analyses
To explore associations between measured BC levels and combinations of traffic variables, we fitted regression models to the measured BC-traffic data that contrary to the associations discussed above addressed confounding by modeling directly potentially confounding meteorological predictor variables and by including a slowly varying spline to represent vehicle emissions from roads surrounding our sites as well as wider urban BC levels. We note that not accounting for a confounding variable can result in biased estimates of regression coefficients.
The similarity of background BC levels at the three sites (Figure 5) as well as the high correlation coefficients between background levels of different sites provides confidence in our modeling approach of confounding background BC levels. Note that perfect agreement of background BC levels cannot be expected because of the nonuniform distribution of the other BC sources. The impact of immediately adjacent traffic is seen in the spiky measured BC data as also seen in a study of BC emissions from aircraft traffic.55
For both Sites A and B, the regression coefficients for traffic flow predictor variables (αcar, αtr, and αtot) are positive, i.e., traffic flow causes BC levels to rise. Moreover, the values of αcar, αtr, and αtot were similar between the two sites. Note, however, that such agreement cannot be expected, because our statistical model does not account for important site characteristics. E.g., the air pollution monitor at Site B was mounted on a rooftop, while it was mounted through a third-floor double hung window at Site A.
At Site A, the association for BC-levels becomes stronger if segregated traffic counts are accounted for, with the variation explained by traffic predictor variables, Rtr2, being 0.8% higher for Model 1 than Model 2 (Table 1). At Site B, Rtr2 is about the same for Models 1 and 2. This can be explained by the hourly ratio of truck counts vs. total vehicle counts (Qtr / Qtot) which is relatively constant for Site B but not so for Site A (data not shown). In any instance, regression analysis of segregated traffic flows yields the mobile-source contributions for cars and trucks, αcar and αtr, respectively. This information allows policy makers and communities to assess impacts of changes in truck or car traffic on air pollution and community health.
Table 1:
ln(BC)-traffic model | αtot [μg/m3 per 100 vehic/hr] | αtr [μg/m3 per 100 trucks/hr] | αcar [μg/m3 per 100 cars/hr] | βrad [μg/m3 per 10 mph & 100 veh/hr] | βCCC [μg/m3 per CCC &100 veh/hr] | R2 [%] | Rtr2 [%] |
---|---|---|---|---|---|---|---|
Site A | |||||||
– | 0.15*** (0.13, 0.18) | 0. 04*** (0.03, 0.05) | −0.05*** (−0.06, −0.04) | – | 66.0 | 9.2 | |
0.05*** (0.04, 0.06) | – | – | −0.05*** (−0.05, −0.04) | – | 65.2 | 8.4 | |
0.06*** (0.05, 0.07) | – | – | – | −0.031*** (−0.036, −0.025) | 65.5 | 8.7 | |
0.05*** (0.05, 0.06) | – | – | – | – | 62.7 | 5.9 | |
– | – | – | −0.05*** (−0.05, −0.04) | – | 60.1 | 3.3 | |
– | – | – | – | −0.023*** (−0.028, −0.018) | 59.3 | 2.5 | |
Site B | |||||||
– | 0.21*** (0.10, 0.31) | 0.06*** (0.03, 0.08) | 0.00 (−0.04, 0.04) | – | 65.3 | 1.1 | |
0.07*** (0.04, 0.09) | – | – | 0.01 (−0.03, 0.05) | – | 65.3 | 1.1 | |
0.07*** (0.04, 0.09) | – | – | – | −0.025. (−0.052, 0.003) | 65.4 | 1.2 | |
0.07*** (0.04, 0.09) | – | – | – | – | 65.3 | 1.1 | |
– | – | – | 0.01 (−0.03, 0.05) | – | 64.2 | 0.0 | |
– | – | – | – | −0.022 (−0.051, 0.006) | 64.1 | 0.2 | |
Site C | |||||||
– | – | – | – | −0.004. (−0.008, 0.000) | 58.6 | 0.8 |
Significance codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Only at Site A, regression coefficients for vehicle speed (β’s) are highly significant. Moreover, they are negative, i.e., a decrease in vehicle speed causes BC levels to increase if traffic flow is kept constant, presumably due to congestion (higher vehicle density). Site C shows similar trends as Site A, however, with a much lower p-value (< 0.1). Moreover, the absolute value of βCCC is about a factor of 5 smaller at Site C than at Site A. This is potentially caused by the greater distance between the air pollution monitor and the road at Site C, thereby also weakening the correlation between traffic and BC levels.
At Site B, vehicle speed is a much less significant predictor variable. Only for Model 3, which also uses transient traffic flow Qtot as a predictor variable, CCC is of moderate significance (p < 0.1). The other four vehicle-speed dependent regression models do not show a significant dependence of BC levels on vehicle speed (at p < 0.1 level). This may be due to the fact that BC levels at Site B did indeed not depend on vehicle speed. This conjecture is supported by the fact that traffic flow depends much more on radar speed at Site A than at Site B as a comparison between Figures 3c and 3d shows. Thus vehicle speed and CCC were not affected much by congestion but rather by the fact that vehicles needed to decelerate as they approached a T-intersection at the end of the block to then make a left or right turn. It is interesting to note, however, that predicted values for βCCC are very similar for Sites A and B: −0.023 and −0.022 μg/m3 per 100 veh/hr. This is consistent with similar values for αtot for the two sites.
The variations explained by traffic, Rtr2, might have been higher for Models 1 to 4 than those for Models 5 and 6, because time-dependent traffic flows provide a more direct measure for the actual number of emission-emitting vehicles present than a vehicle speed variable, from which one can infer flow only indirectly. The Rtr2 for Model 5 relative to those for Models 1 to 4 was smaller for Site B than for Site A, perhaps because of the larger fraction of time during which no crowd-sourced traffic data was available (CCC = 5) for Site B.
In summary, only at Site A, BC levels can be inferred from the product CCC(t) at a significance level p < 0.05. At Site C, BC levels were moderately correlated with CCC(t), at a much lower significance level though (p < 0.1), potentially because the air pollution monitor at Site C was much further from the road than at Site A. At Site B, BC levels did not depend significantly on CCC(t), but it is interesting that the corresponding regression coefficient βCCC is very similar to the one for Site A, as could be expected because regression coefficients for traffic flows are similar for the two sites. Finally, the negative speed slope factors (β’s) we found appear to consistent with results shown by Ntziachristos et al.62 who found NOx emissions from passenger cars to increase when the mean vehicle speed decreased from about 47 to 6 mph (our vehicle speed regime), even though the comparison is slightly hampered by the fact that their mean vehicle speed differs from ours in that it is defined as the average vehicle speed measured during different driving cycles. We note that our models do not predict an increase of emissions with vehicle speed, which has been observed under free-flow traffic conditions;63 however, these conditions were not prevalent at our monitoring sites.
4.5. Comparison to mobile-source contributions to BC levels from other studies
In 1999, Lena et al.19 performed measurements to examine the impacts of truck traffic on sidewalk level elemental carbon (EC) levels in a transportation hub in New York City. They found 1.69±0.37 μg/m3 elemental carbon (EC) per 100 large trucks/hr, which is larger than our values but measured 18 years earlier when there was a larger fraction of older and emission-richer diesel vehicles within the commercial fleet.. We found αtr = 0.15±0.01 μg/m3 per 100 trucks/hr for Site A and αtr = 0.21±0.05 μg/m3 per 100 trucks/hr for Site B. Furthermore, the difference may be attributed to the fact that measurements by Lena et al. were taken at the elevation of the traffic emissions, whereas ours were taken in the third floor or higher where TRAP levels can be expected to be lower to do dispersion.
Between 2002 and 2003, deCastro et al.22 performed measurements to examine the impacts of traffic on 2nd-floor BC levels along a traffic artery in Baltimore. They found a mobile-source contribution of 66 ng/m3 of BC per 100 vehicles every 5 min (based on the traffic-air pollution model according to Eq. 4), which using our notation and units becomes αtot = 0.006 μg/m3 per 100 vehicles/hr. The deCastro values are an order of magnitude smaller than ours, αtot = 0.05 μg/m3 per 100 vehicles/hr for Site A and αtot = 0.07 μg/m3 per 100 vehicles/hr for Site B. We are not certain why they observed smaller values, even though they sampled outdoor air closer to street level (2nd-floor window) where TRAP should be higher. Seasonal effects (their study ran for one year) could potentially explain the difference.
In 2003 and 2004, Patel et al.37 performed measurements to study the impacts of truck traffic on BC levels at a suburban and an urban school within the NYC metropolitan area, respectively. They found 0.62 μg/m3 per 443 trucks/hr (αtr = 0.14 μg/m3 per 100 trucks/hr) at the urban school and 0.20 μg/m3 per 70.5 trucks/hr (αtr = 0.28 μg/m3 per 100 trucks/hr) at the suburban school. Their values are very similar to ours.
It is encouraging that we obtained order-of-magnitude or better agreement with other studies that also quantified the mobile-source contributions to BC levels. Observed differences can be attributed to factors the regression models do not account for such as distance to roadways and elevation of the BC monitors.
4.6. Limitations
From an air pollution perspective, the study area is complex as road traffic at the three sites is not the only contribution to atmospheric BC at a site. Additional BC sources existed which did not only include traffic at surrounding roads but also domestic and industrial sources, and maritime traffic. However, an advantage of the timing of this study was that it was not performed during the heating season. Hence, little BC emissions occurred due to heating-related combustion of oil and natural gas. We represented the additional BC sources by a site-specific slowly varying function, also to reduce confounding effects of these sources on traffic-related regression coefficients. The background BC levels (predicted by assuming traffic to be constant at a site) for the three sites were similar to each other as could be expected for the relative small study area.
Ideally one would construct a mechanistic model that explicitly accounts for all pollutant (BC in our study) sources, the dispersion of air pollutants, pollutant transformation, atmospheric and pollution level boundary conditions, and topographic features causing, e.g., street canyon effects.64,65 Due to the expense of such modeling approaches and their required inputs, we used a relatively simplistic modeling approach that is based on linear regression. However, concepts introduced in this study can potentially be incorporated into more mechanistic air pollution models based on, e.g., air dispersion modeling66–70 or computational fluid dynamics.71–76 For instance, CCC data that is potentially calibrated to account for road-specific traffic flows could be incorporated into traffic emissions estimation models such as HBEFA.77 Alternatively, one could generalize the statistical model we proposed to account for distance to roadways and elevations of receptors.
5. CONCLUSIONS
Our study suggests that levels of primary air pollutants (BC in our study) can depending on traffic flow patterns and site characteristics be estimated from a model that uses radar-calibrated CCC data, i.e., the product CCC as an input. For Sites A and B, we measured the mean flows with our own radar devices, while for Site C we estimated from published DOT data. Indeed, for many streets can be obtained from departments of transportation, which in the US publish Annual Average Daily Traffic (AADT) data. This opens up the possibility that BC levels can be inferred from time-dependent crowd-sourced traffic data and average vehicle flow data , which in big cities are often available.
We tested our approach at three sites, at all of which free-flow traffic conditions were typically not present due to the nearby presence of traffic lights or forced turns. This explains why the dependence of BC on vehicle speed (and CCC) could be described by a speed slope factor, i.e., a linear model. Future studies should therefore address free-flow traffic conditions, and it appears entirely possible that then the concept of speed slope factor needs to be generalized, because of the non-monotonicity of observed relationships.
Interpretation of our study is hampered by the fact that the strength of the correlation between BC levels and CCC is low for two out of the three sites. This could be due to the higher elevation of the air pollution monitors at these two sites. Future studies should therefore examine better the effects of elevation of the BC measurements. Statistical models for the mobile-source contributions to BC levels could potentially become generalizable if they accounted explicitly for site characteristics such as elevation of BC exposure. Alternatively CCC data could be used as an input into air dispersion and CFD models, which can more directly account for site characteristics and the transformation of different air pollutants.
Use of crowd-sourced traffic data offers great promise in air pollution modeling as it is inexpensive. While we only analyzed air pollution at three road segments, crowd-sourced traffic data together with AADT data yield information about traffic in entire street networks, allowing potential prediction of TRAP at the neighborhood and city scales and perhaps interpretation of air quality indices.
ACKNOWLEDGMENTS
This study was supported by NIEHS grant P30 ES009089. We thank Wade McGillis (Columbia University) for lending us a temperature controlled environmental chamber and Jemal Cole for providing assistance with software engineering.
Appendix
A. Observational Windows
Different types of data were collected at different temporal resolution. Radar events were logged at discrete points in time when vehicles were detected, at a 1-second resolution. Data gaps of several minutes occurred when data were downloaded from the radar devices; however, these data gaps typically lasted only about 5 minutes and occurred about every 4 weeks. There were also two occasions were radar data was missing for longer periods of time, presumably because the internal memory of the radar devices overflew. Congestion color code CCC was obtained every minute. BC aethalometer data were logged every minute, except when the internal filter tape used for BC quantification advanced.
15-minute observational windows were used to examine associations between different time-dependent variables and to perform regression analyses. This period of time was deemed long enough to reduce the role of a possible time lag of the crowd-sourced traffic data and to avoid too many data points, for which no average radar speed data can be determined because of no observed traffic counts. At the same time, a 15-minute window is short enough to capture temporal variations in the data. Data acquired at a less than 15-min resolution (BC, CCC) was averaged while data acquired at a larger resolution (meteorological data) was linearly interpolated. The event-based radar data were integrated over the observational windows in order to determine traffic flows and mean vehicle speed for each window.
Table A.1:
ln(BC)-traffic model | γH | γWS | γT | γRH | γPD | γWD |
---|---|---|---|---|---|---|
Site A | ||||||
- | −9.5*** (−10.8, −8.3) | - | 0.71*** (0.45, 0.87) | - | *** | |
- | −9.3*** (−10.6, −8.0) | - | 0.67*** (0.50, 0.84) | - | *** | |
- | −9.2*** (−10.5, −7.9) | - | 0.69 *** (0.53, 0.86) | - | *** | |
- | −8.9*** (−10.2, −7.5) |
- | 0.66*** (0.48, 0.84) | - | *** | |
- | −8.1*** (−9.6, −6.6) | - | 0.56*** (0.36, 0.76) | - | *** | |
- | −8.1*** (−9.5, −6.6) | - | 0.54*** (0.35, 0.74) | - | *** | |
Site B | ||||||
−0.4*** (−0.4, −0.3) | - | - | 0.99*** (0.79, 1.20) | - | *** | |
−0.4*** (−0.4, −0.3) | - | - | 0.99*** (0.79, 1.20) | - | *** | |
−0.4*** (−0.4, −0.3) | - | - | 1.01*** (0.80, 1.22) | - | *** | |
−0.4*** (−0.4, −0.3) | - | - | 0.99*** (0.79, 1.20) | - | *** | |
−0.3*** (−0.4, −0.3) | - | - | 0.97*** (0.75, 1.19) | - | *** | |
−0.3*** (−0.4, −0.3) | - | - | 1.00*** (0.78, 1.23) | - | *** | |
Site C | ||||||
- | −6.9*** (−8.4, −5.3) | - | 0.98*** (0.77, 1.18) | - | ** |
Significance codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Footnotes
Competing financial interest declaration: Steven Chillrud reports receiving travel support and small honorarium for his wife and himself from Astrazenica. The other authors declare they have no actual or potential competing financial interest.
REFERENCES
- 1.Ito K, Xue N, Thurston G. Spatial variation of PM2.5 chemical species and source-apportioned mass concentrations in New York City. Atmospheric Environment. 2004;38(31):5269–82. doi: 10.1016/j.atmosenv.2004.02.063. [DOI] [Google Scholar]
- 2.Ostro B, Lipsett M, Mann J, Braxton-Owens H, White M. Air pollution and exacerbation of asthma in African-American children in Los Angeles. Epidemiology. 2001;12(2):200–8. [DOI] [PubMed] [Google Scholar]
- 3.Brook RD, Rajagopalan S, Pope CA III, Brook JR, Bhatnagar A, Diez-Roux AV, et al. Particulate Matter Air Pollution and Cardiovascular Disease An Update to the Scientific Statement From the American Heart Association. Circulation. 2010;121(21):2331–78. [DOI] [PubMed] [Google Scholar]
- 4.Le Tertre A, Medina S, Samoli E, Forsberg B, Michelozzi P, Boumghar A, et al. Short-term effects of particulate air pollution on cardiovascular diseases in eight European cities. Journal of Epidemiology and Community Health. 2002;56(10):773–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Nyberg F, Gustavsson P, Jarup L, Bellander T, Berglind N, Jakobsson R, et al. Urban air pollution and lung cancer in Stockholm. Epidemiology. 2000;11(5):487–95. [DOI] [PubMed] [Google Scholar]
- 6.Wellenius GA, Burger MR, Coull BA, Schwartz J, Suh HH, Koutrakis P, et al. Ambient air pollution and the risk of acute ischemic stroke. Archives of Internal Medicine. 2012;172(3):229–34. Epub 2012/02/15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Villeneuve PJ, Chen L, Stieb D, Rowe BH. Associations between outdoor air pollution and emergency department visits for stroke in Edmonton, Canada. European Journal of Epidemiology. 2006;21(9):689–700. Epub 2006/10/19. doi: 10.1007/s10654-006-9050-9. [DOI] [PubMed] [Google Scholar]
- 8.Weaver AM, Wellenius GA, Wu WC, Hickson DA, Kamalesh M, Wang Y. Residential distance to major roadways and cardiac structure in African Americans: cross-sectional results from the Jackson Heart Study. Environmental Health. 2017;16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gilbert NL, Woodhouse S, Stieb DM, Brook JR. Ambient nitrogen dioxide and distance from a major highway. Science of the Total Environment. 2003;312(1–3):43–6. [DOI] [PubMed] [Google Scholar]
- 10.Karner AA, Eisinger DS, Niemeier DA. Near-Roadway Air Quality: Synthesizing the Findings from Real-World Data. Environmental Science & Technology. 2010;44(14):5334–44. [DOI] [PubMed] [Google Scholar]
- 11.Gauderman WJ, Vora H, McConnell R, Berhane K, Gilliland F, Thomas D, et al. Effect of exposure to traffic on lung development from 10 to 18 years of age: a cohort study. Lancet. 2007;369(9561):571–7. [DOI] [PubMed] [Google Scholar]
- 12.Brunekreef B, Janssen NAH, deHartog J, Harssema H, Knape M, vanVliet P. Air pollution from truck traffic and lung function in children living near motorways. Epidemiology. 1997;8(3):298–303. [DOI] [PubMed] [Google Scholar]
- 13.Briggs DJ, de Hoogh C, Guiliver J, Wills J, Elliott P, Kingham S, et al. A regression-based method for mapping traffic-related air pollution: application and testing in four contrasting urban environments. Science of the Total Environment. 2000;253(1–3):151–67. doi: 10.1016/s0048-9697(00)00429-0. [DOI] [PubMed] [Google Scholar]
- 14.Eeftens M, Beelen R, de Hoogh K, Bellander T, Cesaroni G, Cirach M, et al. Development of Land Use Regression Models for PM2.5, PM2.5 Absorbance, PM10 and PMcoarse in 20 European Study Areas; Results of the ESCAPE Project. Environmental Science & Technology. 2012;46(20):11195–205. [DOI] [PubMed] [Google Scholar]
- 15.van Nunen E, Vermeulen R, Tsai MY, Probst-Hensch N, Ineichen A, Davey M, et al. Land Use Regression Models for Ultrafine Particles in Six European Areas. Environmental Science & Technology. 2017;51(6):3336–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Raaschou-Nielsen O, Hertel O, Vignati E, Berkowicz R, Jenson SS, Larsen VB, et al. An air pollution model for use in epidemiological studies: evaluation with measured levels of nitrogen dioxide and benzene. Journal of Exposure Analysis and Environmental Epidemiology. 2000;10(1):4–14. [DOI] [PubMed] [Google Scholar]
- 17.Beelen R, Hoek G, Vienneau D, Eeftens M, Dimakopoulou K, Pedeli X, et al. Development of NO2 and NOx land use regression models for estimating air pollution exposure in 36 study areas in Europe - The ESCAPE project. Atmospheric Environment. 2013;72:10–23. [Google Scholar]
- 18.Clark NA, Demers PA, Karr CJ, Koehoorn M, Lencar C, Tamburic L, et al. Effect of Early Life Exposure to Air Pollution on Development of Childhood Asthma. Environ Health Persp. 2010;118(2):284–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lena TS, Ochieng V, Carter M, Holguin-Veras J, Kinney PL. Elemental carbon and PM2.5 levels in an urban community heavily impacted by truck traffic. Environ Health Persp. 2002;110(10):1009–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gates TJ, Schrock SD, Bonneson JA. Comparison of portable speed measurement devices. Data and Information Technology. 2004(1870):139–46. [Google Scholar]
- 21.Liu BH, Li QB, Chen DY, Sun HL. Pattern Recognition of Vehicle Types and Reliability Analysis of Pneumatic Tube Test Data under Mixed Traffic Condition. In: Wu Y, Li J, editors. 2010. 2nd International Asia Conference on Informatics in Control, Automation and Robotics2010. p. 44–7. [Google Scholar]
- 22.deCastro BR, Wang L, Mihalic JN, Breysse PN, Geyh AS, Buckley TJ. The longitudinal dependence of black carbon concentration on traffic volume in an urban environment. Journal of the Air & Waste Management Association. 2008;58(7):928–39. [DOI] [PubMed] [Google Scholar]
- 23.Golob TF, Recker WW. Relationships among urban freeway accidents, traffic flow, weather, and lighting conditions. Journal of Transportation Engineering-ASCE. 2003;129(4):342–53. doi: 10.1061/(asce)0733-947x(2003)129:4(342). [DOI] [Google Scholar]
- 24.Gajda J, Sroka R, Stencel M, Wajda A, Zeglen T, Ieee, et al. A vehicle classification based on inductive loop detectors. Imtc/2001: Proceedings of the 18th Ieee Instrumentation and Measurement Technology Conference, Vols 1–3: Rediscovering Measurement in the Age of Informatics2001. p. 460–4. [Google Scholar]
- 25.Gupte S, Masoud O, Martin RFK, Papanikolopoulos NP. Detection and classification of vehicles. IEEE Transactions on Intelligent Transportation Systems. 2002;3(1):37–47. [Google Scholar]
- 26.Michalopoulos PG. Vehicle detection video through image-processing - The autoscope system. IEEE Transactions on Vehicular Technology. 1991;40(1):21–9. [Google Scholar]
- 27.Kim S, Coifman B. Assessing the performance of SpeedInfo radar traffic sensors. Journal of Intelligent Transportation Systems. 2017;21(3):179–89. [Google Scholar]
- 28.Misaghi P, Hassan Y. Modeling operating speed and speed differential on two-lane rural roads. Journal of Transportation Engineering-ASCE. 2005;131(6):408–17. [Google Scholar]
- 29.Treiber M, Kesting A. Traffic flow dynamics: data, models and simulation. Heidelberg, New York: Springer; 2013. [Google Scholar]
- 30.Zhang K, Batterrnan S. Near-road air pollutant concentrations of CO and PM2.5: A comparison of MOBILE6.2/CALINE4 and generalized additive models. Atmospheric Environment. 2010;44(14):1740–8. doi: 10.1016/j.atmosenv.2010.02.008. [DOI] [Google Scholar]
- 31.Shorshani MF, Seigneur C, Rehn LP, Chanut H, Pellan Y, Jaffrezo JL, et al. Atmospheric dispersion modeling near a roadway under calm meteorological conditions. Transportation Research Part D-Transport and Environment. 2015;34:137–54. doi: 10.1016/j.trd.2014.10.013. [DOI] [Google Scholar]
- 32.Tsai MY, Chen KS. Measurements and three-dimensional modeling of air pollutant dispersion in an Urban Street Canyon. Atmospheric Environment. 2004;38(35):5911–24. doi: 10.1016/j.atmosenv.2004.07.008. [DOI] [Google Scholar]
- 33.Liu YH, Ma JL, Li L, Lin XF, Xu WJ, Ding H. A high temporal-spatial vehicle emission inventory based on detailed hourly traffic data in a medium-sized city of China. Environmental Pollution. 2018;236:324–33. doi: 10.1016/j.envpol.2018.01.068. [DOI] [PubMed] [Google Scholar]
- 34.Batterman S, Cook R, Justin T. Temporal variation of traffic on highways and the development of accurate temporal allocation factors for air pollution analyses. Atmospheric Environment. 2015;107:351–63. doi: 10.1016/j.atmosenv.2015.02.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Herrera JC, Work DB, Herring R, Ban XG, Jacobson Q, Bayen AM. Evaluation of traffic data obtained via GPS-enabled mobile phones: The Mobile Century field experiment. Transportation Research Part C-Emerging Technologies. 2010;18(4):568–83. [Google Scholar]
- 36.Hanna R, Kreindler G, Olken BA. Citywide effects of high-occupancy vehicle restrictions: Evidence from “three-in-one” in Jakarta. Science (New York, NY). 2017;357(6346):89–93. [DOI] [PubMed] [Google Scholar]
- 37.Patel MM, Chillrud SN, Correa JC, Feinberg M, Hazi Y, Deepti KC, et al. Spatial and temporal variations in traffic-related particulate matter at New York City high schools. Atmospheric Environment. 2009;43(32):4975–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.DOT. Traffic Monitoring Guide. U.S. Department of Transportation. Federal Highway Administration, 2013. [Google Scholar]
- 39.NYS DOT. Traffic Data Viewer: New York State Department of Transportation 2017. Available from: https://www.dot.ny.gov/tdv.
- 40.Hsu YM, Wang X, Chow JC, Watson JG, Percy KE. Collocated comparisons of continuous and filter-based PM2.5 measurements at Fort McMurray, Alberta, Canada. Journal of the Air & Waste Management Association (1995). 2016;66(3):329–39. Epub 2016/01/05. doi: 10.1080/10962247.2015.1136362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Cai J, Yan BZ, Ross J, Zhang DN, Kinney PL, Perzanowski MS, et al. Validation of MicroAeth (R) as a Black Carbon Monitor for Fixed-Site Measurement and Optimization for Personal Exposure Characterization. Aerosol and Air Quality Research. 2014;14(1):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Houston Radar LLC. Armadillo Tracker Data Sheet 2018 [November 15, 2018]. Available from: houston-radar.com/pdf/ArmadilloTracker-shortformdatasheet-rev2.0-release.pdf.
- 43.NCDC. Integrated Surface Data FTP Access Point: US National Climatic Data Center 2018 [January 27, 2018]. Available from:ftp://ftp.ncdc.noaa.gov/pub/data/noaa/2017/.
- 44.NOAA/ESRL. NOAA/ESRL Radiosonde Database: National Oceanic and Atmospheric Administration & Earth System Research Laboratory; 2018 [January 27, 2018]. Available from: https://ruc.noaa.gov/raobs/.
- 45.EPA. User’s Guide for the AERMOD Meteorological Preprocessor (AERMET). U.S. Environmental Protection Agency, 2016. Contract No.: EPA-454/B-16–010. [Google Scholar]
- 46.Papoulis A The Fourier integral and its applications. New York, NY, USA McGraw-Hill; 1962. [Google Scholar]
- 47.Wood SN. Generalized Additive Models: An Introduction with R. 2nd ed: Chapman and Hall/CRC; 2017. [Google Scholar]
- 48.Fontes T, Li PL, Barros N, Zhao PJ. A proposed methodology for impact assessment of air quality traffic-related measures: The case of PM2.5 in Beijing. Environmental Pollution. 2018;239:818–28. doi: 10.1016/j.envpol.2018.04.061. [DOI] [PubMed] [Google Scholar]
- 49.Beloconi A, Chrysoulakis N, Lyapustin A, Utzinger J, Vounatsou P. Bayesian geostatistical modelling of PM10 and PM2.5 surface level concentrations in Europe using high-resolution satellite-derived products. Environment International. 2018;121:57–70. doi: 10.1016/j.envint.2018.08.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Yu CH, Fan ZH, Lioy PJ, Baptista A, Greenberg M, Laumbach RJ. A novel mobile monitoring approach to characterize spatial and temporal variation in traffic-related air pollutants in an urban community. Atmospheric Environment. 2016;141:161–73. [Google Scholar]
- 51.Karatzas K, Katsifarakis N, Orlowski C, Sarzyński A. Revisiting urban air quality forecasting: a regression approach. Vietnam Journal of Computer Science. 2018;5:177–84. [Google Scholar]
- 52.Yao XH, Lau NT, Fang M, Chan CK. Correlations of ambient temperature and relative humidity with submicron particle number concentration size distributions in on-road vehicle plumes. Aerosol Science and Technology. 2007;41(7):692–700. [Google Scholar]
- 53.Tai APK, Mickley LJ, Jacob DJ. Correlations between fine particulate matter (PM2.5) and meteorological variables in the United States: Implications for the sensitivity of PM2.5 to climate change. Atmospheric Environment. 2010;44(32):3976–84. [Google Scholar]
- 54.Molnar P, Janhall S, Hallquist M. Roadside measurements of fine and ultrafine particles at a major road north of Gothenburg. Atmospheric Environment. 2002;36(25):4115–23. [Google Scholar]
- 55.Dodson RE, Houseman EA, Morin B, Levy JI. An analysis of continuous black carbon concentrations in proximity to an airport and major roadways. Atmospheric Environment. 2009;43(24):3764–73. [Google Scholar]
- 56.Venkatram A, Snyder M, Isakov V, Kimbrough S. Impact of wind direction on near-road pollutant concentrations. Atmospheric Environment. 2013;80:248–58. [Google Scholar]
- 57.Package Wood S. ‘mgcv’: Mixed GAM Computation Vehicle with Automatic Smoothness Estimation 2019 [March 18, 2019]. Available from: https://cran.r-project.org/web/packages/mgcv/mgcv.pdf.
- 58.Peng RD, Dominici F. Statistical Methods for Environmental Epidemiology with R: A Case Study in Air Pollution and Health: Springer; 2008. [Google Scholar]
- 59.Akaike H Information Theory and an Extension of the Maximum Likelihood Principle In: Parzen E, Tanabe K, Kitagawa G, editors. Selected Papers of Hirotugu Akaike: Springer, New York, NY; 1998. [Google Scholar]
- 60.Banks JH. Freeway speed-flow-concentration relationships: more evidence and interpretations. Transportation Research Record. 1992:53–60. [Google Scholar]
- 61.Wang C, Quddus MA, Ison SG. The effect of traffic and road characteristics on road safety: A review and future research direction. Safety Science. 2013;57:264–75. doi: 10.1016/j.ssci.2013.02.012. [DOI] [Google Scholar]
- 62.Ntziachristos L, Gkatzoflias D, Kouridis C, Samaras Z. COPERT: A European road transport emission inventory model In: Athanasiadis IN, Rizzoli AE, Mitkas PA, Gómez JM, editors. Information Technologies in Environmental Engineering: Springer, Berlin, Heidelberg; 2009. p. 491–504. [Google Scholar]
- 63.Zheng X, Zhang SJ, Wu Y, Zhang KM, Wu X, Li ZH, et al. Characteristics of black carbon emissions from in-use light-duty passenger vehicles. Environmental Pollution. 2017;231:348–56. [DOI] [PubMed] [Google Scholar]
- 64.Zhong J, Cai XM, Bloss WJ. Coupling dynamics and chemistry in the air pollution modelling of street canyons: A review. Environmental pollution (Barking, Essex : 1987). 2016;214:690–704. Epub 2016/05/06. doi: 10.1016/j.envpol.2016.04.052. [DOI] [PubMed] [Google Scholar]
- 65.Vardoulakis S, Fisher BEA, Pericleous K, Gonzalez-Flesca N. Modelling air quality in street canyons: a review. Atmospheric Environment. 2003;37(2):155–82. doi: 10.1016/s1352-2310(02)00857-9. [DOI] [Google Scholar]
- 66.Nyberg F, Gustavsson P, Jarup L, Bellander T, Berglind N, Jakobsson R, et al. Urban air pollution and lung cancer in Stockholm. Epidemiology. 2000;11(5):487–95. Epub 2000/08/24. [DOI] [PubMed] [Google Scholar]
- 67.Wu J, Ren C, Delfino RJ, Chung J, Wilhelm M, Ritz B. Association between local traffic-generated air pollution and preeclampsia and preterm delivery in the south coast air basin of California. Environ Health Perspect. 2009;117(11):1773–9. Epub 2010/01/06. doi: 10.1289/ehp.0800334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Levitin J, Harkonen J, Kukkonen J, Nikmo J. Evaluation of the CALINE4 and CAR-FMI models against measurements near a major road. Atmospheric Environment. 2005;39(25):4439–52. doi: 10.1016/j.atmosenv.2005.03.046. [DOI] [Google Scholar]
- 69.Giang NTH, Oanh NTK. Roadside levels and traffic emission rates of PM2.5 and BTEX in Ho Chi Minh City, Vietnam. Atmospheric Environment. 2014;94:806–16. doi: 10.1016/j.atmosenv.2014.05.074. [DOI] [Google Scholar]
- 70.Hilpert M, Rule AM, Adria-Mora B, Tiberi T. Vent pipe emissions from storage tanks at gas stations: Implications for setback distances. The Science of the total environment. 2019;650(Pt 2):2239–50. Epub 2018/10/07. doi: 10.1016/j.scitotenv.2018.09.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Zhong J, Cai XM, Bloss WJ. Modelling the dispersion and transport of reactive pollutants in a deep urban street canyon: using large-eddy simulation. Environmental pollution (Barking, Essex : 1987). 2015;200:42–52. Epub 2015/02/24. doi: 10.1016/j.envpol.2015.02.009. [DOI] [PubMed] [Google Scholar]
- 72.Hu LH, Huo R, Yang D. Large eddy simulation of fire-induced buoyancy driven plume dispersion in an urban street canyon under perpendicular wind flow. J Hazard Mater. 2009;166(1):394–406. Epub 2009/01/21. doi: 10.1016/j.jhazmat.2008.11.105. [DOI] [PubMed] [Google Scholar]
- 73.Baker J, Walker HL, Cai XM. A study of the dispersion and transport of reactive pollutants in and above street canyons - a large eddy simulation. Atmospheric Environment. 2004;38(39):6883–92. doi: 10.1016/j.atmosenv.2004.08.051. [DOI] [Google Scholar]
- 74.Tong Z, Baldauf RW, Isakov V, Deshmukh P, Max Zhang K. Roadside vegetation barrier designs to mitigate near-road air pollution impacts. The Science of the total environment. 2016;541:920–7. Epub 2015/10/13. doi: 10.1016/j.scitotenv.2015.09.067. [DOI] [PubMed] [Google Scholar]
- 75.Kim MJ, Park RJ, Kim JJ. Urban air quality modeling with full O-3-NOx-VOC chemistry: Implications for O-3 and PM air quality in a street canyon. Atmospheric Environment. 2012;47:330–40. doi: 10.1016/j.atmosenv.2011.10.059. [DOI] [Google Scholar]
- 76.Tong Z, Wang YJ, Patel M, Kinney P, Chrillrud S, Zhang KM. Modeling spatial variations of black carbon particles in an urban highway-building environment. Environ Sci Technol. 2012;46(1):312–9. Epub 2011/11/17. doi: 10.1021/es201938v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Keller M, Hausberger S, Matzer C, Wüthrich P, Notter B. HBEFA Version 3.3. Background documentation: Infras; 2017.