Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2021 Feb 16;48(4):e2020GL091202. doi: 10.1029/2020GL091202

Assessing the COVID‐19 Impact on Air Quality: A Machine Learning Approach

Yves Rybarczyk 1,2,, Rasa Zalakeviciute 2,3,
PMCID: PMC7995168  PMID: 33785973

Abstract

The worldwide research initiatives on Corona Virus disease 2019 lockdown effect on air quality agree on pollution reduction, but the most reliable method to pollution reduction quantification is still in debate. In this paper, machine learning models based on a Gradient Boosting Machine algorithm are built to assess the outbreak impact on air quality in Quito, Ecuador. First, the precision of the prediction was evaluated by cross‐validation on the four years prelockdown, showing a high accuracy to estimate the real pollution levels. Then, the changes in pollution are quantified. During the full lockdown, air pollution decreased by −53 ± 2%, −45 ± 11%, −30 ± 13%, and −15 ± 9% for NO2, SO2, CO, and PM2.5, respectively. The traffic‐busy districts were the most impacted areas of the city. After the transition to the partial relaxation, the concentrations have nearly returned to the levels as before the pandemic. The quantification of pollution drop is supported by an assessment of the prediction confidence.

Keywords: air pollution, COVID‐19, quarantine measures, urban air quality

Key Points

  • A data driven modeling is applied to quantify the reduction of air pollution during the Corona Virus disease 2019 outbreak in Quito, Ecuador

  • The accuracy of the models is high (mean PCC = 0.78), especially for predicting NO2 (mean PCC = 0.87) and CO (mean PCC = 0.86)

  • The average drop of pollution during the lockdown is: −53% for NO2, −45% for SO2, −30% for CO, and −15% for PM2.5

  • The industrial areas are less impacted by the quarantine than the traffic and residential districts

  • The concentration of pollution tends to return to usual levels, as soon as the relaxed restriction is implemented

1. Introduction

Throughout 2020, human population of the world faced a challenging new pandemic of Corona Virus disease 2019 (COVID‐19), propagated by a severe acute respiratory syndrome corona virus 2 (SARS‐CoV‐2). Originating in China at the end of 2019, this complex‐symptom disease exponentially spread to almost every part of the world (Worldometer, 2020). In order to slow down social interaction, the principal mode of infection, a range of regulatory measures has been implemented across the countries. Anthropogenic activities were limited or even prohibited, which in turn, positively affected the environmental quality, especially, due to the reduction in emissions by transportation and industries (Shi & Brasseur, 2020; Y. Wang et al., 2020). This was a unique global experiment, as the urban, regional and global air quality conditions have been persistently worsening due to a rapid increase in polluting sources by human population (Limb, 2016; UN., 2015; WHO., 2016). Most common atmospheric contaminants, such as particulate matter (PM, especially with aerodynamic diameters ≤2.5 µm, PM2.5), nitrogen oxides (NO and NO2), sulfur dioxide (SO2), and carbon monoxide (CO), compromise respiratory and cardiovascular systems (EPA, 2018; Lelieveld et al., 2015; Pope et al., 2006). Moreover, a number of studies show that population exposed to a long‐term elevated levels of air pollution have higher mortality rates due to COVID‐19 (Coker et al., 2020; Geographic, 2020; Ogen, 2020; Wu et al., 2020).

Undoubtedly, nearly globally implemented social distancing and isolation measures have resulted in a significant reduction of air pollution (Bauwens et al., 2020; Ding et al., 2020; European Space Agency, 2020; NASA, 2020; Navinya et al., 2020; Nikkei Asian Review, 2020; Querol et al., 2007; Shi & Brasseur, 2020; P. Wang et al., 2020). These statistics of air quality improvements are commonly reported by comparing the restricted human activity periods with the “normal conditions,” months or years before the sanitary emergency with no way to estimate the reliability of these results.

Ecuador is among the countries that implemented extreme quarantine measures. The country had a first reported case of COVID‐19 on February 29, 2020, very quickly escalating in numbers, and in two weeks reporting the first deaths (Worldometer, 2020). In order to prevent health care system collapse, extremely strict national preventative rules were communicated on March 15, active from March 17, 2020. The full lockdown only permitted circulation for absolute necessities (e.g., medicine and food) once per week, based on the license plate number, 7:00–14:00 during the workdays (Zalakeviciute et al., 2020a). Gradually, however, more businesses could function every day anytime. While the infection numbers kept growing, the relaxation to these rules was implemented starting June 2, 2020. Partial relaxation allowed private vehicle circulation every second day, excluding Sundays. This rule permitted all bus functioning (public and private lines) at a limited capacity for passengers. A further increase in permits to some offices were issued (shopping malls, governmental offices, private businesses, etc.).

High elevation capital Quito is characterized by the rainy and dry seasons, with a strong influence of El Niño phenomenon, affecting air pollution conditions, usually worsening during the rainier months (Zalakeviciute et al., 2018). In addition, due to technological advances and increasing strictness of traffic regulations, air quality tends to improve over time, regardless of the motorization trend (European Environment Agency, 2018, 2017; Zalakeviciute et al., 2018). As a result, the evaluation of urban pollution reduction during the sanitary quarantine based on pollution levels from previous years or months could introduce some uncertainties. To get a more accurate assessment of the effect of an event on air quality, a variable normalization technique can be applied (Grange et al., 2018; Grange & Carslaw, 2019). A meteorology‐normalized approach was used to analyze the effect of the COVID‐19 lockdown on NO2 concentrations in Spain (Petetin et al., 2020) and in 20 North American cities (Goldberg et al., 2020).

This paper proposes an alternative approach, which consists of training a machine learning (ML) algorithm that will learn the effects of the climatic conditions and time on the atmospheric pollution. ML approach is preferred over chemical transport models, as the latter show a reduced performance in the complex terrain regions (Pani et al., 2020; Žabkar et al., 2015) and require an updated emissions inventory, which Quito lacks. Unlike the previous studies, NO2, SO2, CO, and PM2.5 are analyzed, in the Ecuadorian capital. Since the lockdown affected essentially the human mobility, we investigate the differences in the reduction of pollution in different districts with varied sources of contamination (e.g., traffic vs. industry). One model for each city area and contaminant is built, in order to estimate the concentration of pollutants from meteorological and temporal features without the lockdown. The quantification of the concentration changes due to the reduced human activity, is obtained by comparing the value provided by the model to the real measurements.

2. Materials and Methods

2.1. Study Site

Ecuador, even though one of the smallest countries in South America, occupies a variety of biomes, including Pacific Coast region, Galapagos Islands Archipelago, Andes Cordillera and the Amazonian Rainforest. The capital Quito is set on a slender plateau, stretching on the side of the Pichincha volcano (elev. 4,800 meters above sea level (m.a.s.l.)), forming a part of the Andes Cordillera. Quito is the most elevated constitutional capital of the world (elev. 2,815 m. a.s.l.) (EMASEO., 2011). The quickly spreading and densifying metro area houses a population of 2.2 million people (INEC., 2011). As most of the cities in the developing countries, Quito reports a long‐term air quality problem (Zalakeviciute et al., 2018).

The Secretariat of Environment of Metropolitan District of Quito manages an air quality network recording atmospheric pollutants and meteorological parameters, since 2004. Six sites with more complete data have been chosen for this study: urban traffic‐busy site Belisario (elev. 2,835 m. a.s.l, coord. 78°29′24″W, 0°10′48″S), urban/industrial sites Camal (elev. 2,840 m. a.s.l, coord. 78°30′36″W, 0°15′00″S), and Carapungo (elev. 2,660 m. a.s.l, coord. 78°26′50″W, 0°5′54″S), suburban/residential traffic site Cotocollao (elev. 2,739 m. a.s.l, coord. 78°29′50″W, 0°6′28″S), suburban/agricultural site Guamani (elev. 3,066 m. a.s.l., coord. 78°33′5″W, 0°19′51″S), and suburban/industrial site Chillos (elev. 2,453 m. a.s.l, coord. 78°27′36″W, 0°18′00″S). More details on the study sites can be found in a previous study evaluating pollution reduction in Ecuador during the first month of COVID‐19 quarantine (Zalakeviciute et al., 2020a).

2.2. Quito Air Quality Monitoring

The atmospheric pollution monitoring equipment were set up, following the procedures established by the Environmental Protection Agency of the United States (USEPA). CO concentrations were measured using ThermoFisher Scientific 48i (EPA No. RFCA‐0981‐054). SO2 was measured using ThermoFisher Scientific 43i high level SO2 analyzers (EPA No. EQSA‐0486‐060). NO2 was measured using ThermoFisher Scientific 42i NOx analyzers. PM2.5 was measured using Thermo Scientific FH62C14‐DHS continuous ambient particulate monitors 5014i (EPA No. EQPM‐0609‐183).

To register meteorological parameters, complete automatic weather stations were collocated with the atmospheric pollution equipment at study sites. MetOne instruments were used to record wind speed and direction. Thies Clima were employed to measure relative humidity, temperature, and precipitation. Kipp & Zonen radiometers were used to measure solar radiation. Vaisala equipment were used to measure atmospheric pressure.

2.3. Machine Learning Modeling

ML models are trained to predict the pollutant concentrations from meteorological and time variables. Seven meteorological features are selected: relative humidity, precipitation, temperature, solar radiation, pressure, wind speed, and wind direction. The time attributes are: Julian day, week day, hour, and trend (date index). These temporal variables account for cyclical emission patterns, which are robust drivers of the concentration of several air pollutants (Henneman et al., 2015). Julian day, weekday, and hours inform respectively on the seasonal (wet vs. dry season), weekly (workday vs. weekend), and hourly (rush hours) occurrences of pollution peaks. Trend reflects the pollution reduction in Quito due to the implementation of fuel and traffic regulations (Zalakeviciute et al., 2018). The features to predict are the concentrations of NO2, SO2, CO, and PM2.5 in the six different districts. We work on hourly data from January 1, 2016 to June 30, 2020. The data set is split in three parts. Data from January 1, 2016 to January 15, 2020 are used to train the model, based on a five‐fold cross‐validation. Then, two months before the full lockdown (January 16, to March 15, 2020) are used to test the prediction accuracy. Even though the COVID‐19 quarantine measures were officially applied starting March 17, the city was showing the lockdown signs starting the March 15, the day of national regulation communication. The Root Mean Squared Error (RMSE) and the Pearson Coefficient of Correlation (PCC) are the metrics chosen to assess the model performance. To validate a model, the RMSE must be minimal, and the PCC the closest to 1. Finally, the remaining data consist of the values during the full ( March 16 to June 1, 2020) and relaxed (June 2 to June 30, 2020) lockdown. The modification of the concentration of contaminants is quantified by subtracting the model predicted values from the actual values recorded during the quarantine.

Due to the complex nonlinear relationships between meteorology and air quality, we selected a nonparametric ML method. A Gradient Boosting Machine (GBM) was chosen, because it is a state‐of‐the‐art decision tree‐based ensemble method (Friedman, 2001). GBM builds an ensemble of shallow and weak successive trees with each tree learning and improving the previous one. As any boosting method, this algorithm adds sequentially new models to the ensemble. At each iteration, a new weak base‐learner model is trained from the error of the entire learned ensemble. A model is considered weak if its error rate is just slightly better than random guessing. In boosting, a weak model is built at each iterative sequence, in order to improve the remaining errors. The generalization of the algorithm is a stagewise additive model of n individual regression trees Equation 1.

f(x)=n=1Nfn(x) (1)

GBM offers several advantages over the other machine learning algorithms. First, the data set does not need any specific preprocessing. Second, the algorithm handles missing values, and does not require data imputation. Third, since it is based on a decision tree method, feature selection is automatically embedded in the algorithm. Finally, it is possible to identify the importance of each variable in the resulting model.

3. Data

Four years and six months of atmospheric pollution and meteorological data were processed on the period prior and during the COVID‐19 quarantine. For each contaminant, the concentration change is computed from the mean difference between ML‐based business‐as‐usual and actual value measured during the outbreak. The H2O‐R package was used to implement the GBM algorithm in RStudio. We tuned the parameters of the library so the training (i.e., incrementation of the number of trees) stops after no improvement on the cross‐validated error is provided by 10 consecutive trees. The accuracy of the GBM models is compared to a benchmark, which consists of calculating the average concentration of each pollutant at each site from January 16 to June 30, during 2016–2019. This period is chosen, because the ML models are built on the same years. For illustrative purposes, hourly NO2, SO2, CO, and PM2.5 concentrations were compiled as 24‐h averages starting January 2020. Graphics were done using Microsoft Excel (MS Office) and Igor Pro (Wavemetrics) softwares. The data are available at https://data.mendeley.com/datasets/trs5j932s8/1 (Zalakeviciute, et al., 2020b).

4. Results and Discussion

4.1. Models and Performance

Twenty‐four models (six sites × four contaminants) were built to predict the level of pollution without lockdown (business‐as‐usual). Table 1 indicates the number of trees used to implement this decision tree‐based ensemble method. More than 6,000 trees were necessary to build the models. The smallest and highest number of trees were in Belisario (mean = 7,283) and Cotocollao (mean = 11,943), respectively.

Table 1.

Comparison of the predictive accuracy for the machine learning‐based models (eight column) versus the four previous year average (ninth column).

4.1.

The metrics used to evaluate the performance of the models (mean RMSE and mean PCC) show a high prediction accuracy. Also, the fact that the performance is systematically lower on the testing set (RMSE = 4.5; PCC = 0.78) than the training set (RMSE = 3.4; PCC = 0.89) suggests no overfitting. The best prediction on the testing set is obtained for NO2 (RMSE = 5.8; PCC = 0.87) and CO (RMSE = 0.2; PCC = 0.86). The model accuracy for SO2 is good (RMSE = 3; PCC = 0.73) and fair for PM2.5 (RMSE = 9.1; PCC = 0.65). For these last two pollutants, there are more accuracy differences from one district to another (stddev PCC SO2 = 0.08; stddev PCC PM2.5 = 0.06) than for the best predicted gases (stddev PCC NO2 = 0.03; stddev PCC CO = 0.01). These results show that the reliability of the proposed ML‐based models to predict the concentration of pollutants is higher for NO2 and CO than SO2 and PM2.5. The prediction of the first three contaminants is better than PM2.5. Those are contaminants strictly related to the fossil fuel combustion, and in an urban area they are directly resulting from human activity. On the contrary, PM2.5 while also mostly emitted by mobile sources, it may come from other sources (Karagulian et al., 2015), settle on the surfaces and disperse again in the atmosphere due to wind ventilation.

On the other hand, a benchmark based on the averages of four previous years shows a very low accuracy. Its correlation with the values observed from mid‐January 2020 to mid‐March 2020 does not exceed 0.3 (PCC NO2 = 0.11; PCC CO = 0.17; PCC SO2 = 0.16; PCC PM2.5 = 0.12). This result suggests that applying a simple average of the concentration measured in 2016–2019, for the lockdown period, is not reliable to quantify the impact on air quality. This outcome highlights the benefit of building a model based on an ML approach, particularly because it considers actual meteorological conditions and temporal features.

It is to note that the model accuracy is also dependent on the characteristics of the metropolitan areas. The best prediction is obtained for the districts in which the source of pollution is traffic‐based (PCC for Belisario = 0.8; men PCC for Cotocollao = 0.82). The performance is slightly worse for the industrial (PCC for Carapungo = 0.8; PCC for Camal = 0.77; PCC for Chillos = 0.75) and agricultural areas (PCC for Guamin = 0.73). Whether the district is urban (PCC = 0.79) or suburban (PCC = 0.77) does not influence significantly the quality of the prediction. This is because both areas are circulated by the same poor‐quality fuel‐powered buses and private vehicles. These results can be explained by the fact that the main source of pollution in Quito is traffic‐based, especially peaking during rush hours (Zalakeviciute et al., 2018). These precise patterns can be learned by our data‐driven models. Also, a possible distinction between inner‐city and suburb is not verified, because the COVID‐19 quarantine regulations are applied extremely strictly in all the metropolitan area.

4.2. Variable Importance

GBM algorithm allows us to understand how the features have been handled by the resulting models. The importance of the 11 independent variables for the six districts and four pollutants is represented in Figure 1. For a determined contaminant and site, this importance is expressed in percentage of each feature. If 15% is used as minimal threshold, the five more relevant factors to build the models are: trend, hour, wind speed (WS), relative humidity (RH), and temperature (Temp). The other six features do not exceed this cut‐off whatever the district and the pollutant. The two temporal features, trend and hour, are always among the most important variables for the modeling of any contaminant. For the trend feature, it can be explained by the implementation of the successive regulations, which have progressively improved air quality in Quito over the last decade (Zalakeviciute et al., 2018). As previously mentioned, the daily pollution peaks occur during the morning and evening rush hours systematically making hour a reliable feature to predict the concentration of anthropogenic pollutants.

Figure 1.

Figure 1

Variable importance for the modeling of NO2 (panel (a)), SO2 (panel (b)), CO (panel (c)), and PM2.5 (panel (d)) for the six districts. The broken line is an arbitrary threshold (15% of importance) used to highlight the most relevant features for the prediction of each contaminant. WD, WS, RH, Temp and SR stand for wind direction, wind speed, relative humidity, temperature and solar radiation, respectively.

We note that the three most relevant meteorological features (WS, RH, and Temp) have, on average, a lower percentage of importance than the two temporal factors (Trend and Hour). WS is mostly used for the prediction of PM2.5 and CO. The transport and re‐suspension effect of wind on the traffic‐related contamination can explain the importance of this factor in the models. Similarly, RH seems to influence mainly the concentration of these two pollutants. A previous study by the authors shows that wet conditions increase traffic‐related pollution, because humidity reduces engine efficiency, especially in high‐elevation cities (Zalakeviciute et al., 2018). Among the three relevant meteorological factors, temperature is the least important. Nevertheless, it is significantly used for the modeling of NO2 in Carapungo and SO2 in Camal, which are both urban‐industrial areas. Temperature, which is positively correlated with WS and negatively correlated with RH, is known for causing thermal turbulences that contribute to dilute gas emissions.

In terms of districts, some unique percentages of importance emerge. For Cotocollao, trend constitutes half the features for predicting the NO2 concentration. It means that the consecutive fuel and traffic regulations have largely impacted this area. On the contrary, the level of NO2 and PM2.5 in the suburban agricultural site of Guamani are more influenced by wind velocity, suggesting transport factor. Camal, as an urban‐industrial area, is the only district which exhibits concentration levels of SO2 more affected by temperature than the trend feature, also suggesting increased temperature effect on pollution transport. Considering the dense road network in Belisario, it was expected to get RH with high percentage of importance for modeling PM2.5 and CO in this urban area. For CO, directly linked to combustion sources, hour is systematically the most important feature independently of the district.

4.3. Effect of the Mobility Restrictions

Figure 2 represents the concentration of the four pollutants before (green‐shaded area), during (red‐shaded area) and after the full lockdown (yellow‐shaded area), for two types of districts (traffic vs. industrial). A clear drop of concentrations is observed from the beginning of the lockdown (March 16, 2020), especially for NO2, SO2, and CO (Figure 2, panels a–f). The measurements are a bit noisier for PM2.5, and the reduction is not so obvious in both, traffic (Belisario) and industrial (Carapungo) areas (Figure 2, panels g–h). For instance, high PM2.5 concentrations can be noticed around 20 March. This peak seems to be caused by windy conditions, possibly re‐suspending fine particulate matter into the atmosphere (Csavina et al., 2014). The contrasted and limited effect of the lockdown on the concentration of PM2.5 is also observed in other cities around the world due to regional pollution events or an increase in private vehicles' use (Dhaka et al., 2020; Faridi et al., 2020). Another explanation is the fact that PM are not only emitted by vehicles, but also factories which continued to work almost normally during the quarantine. The relatively low reduction of PM2.5 in the industrial districts of Quito (−20% for Camal, −19% for Carapungo, and +3% for Chillos) supports this hypothesis (Table 2). Once the full lockdown is over (June 1, 2020), the concentrations return progressively to close to the same levels as before the pandemic.

Figure 2.

Figure 2

Real and estimated (ML modeled and four previous year mean) concentration of pollutants in traffic (Belisario) and industrial (Carapungo) areas during the prepandemic (green‐shaded area), full‐lockdown (red‐shaded area), and partial relaxation (yellow‐shaded area). For Belisario, panels (a), (c), (e) and (g) represent the concentrations of NO2, SO2, CO and PM2.5, respectively. For Carapungo, NO2, SO2, CO and PM2.5 are plotted in panels (b), (d), (f), and (h), respectively. ML: Machine Learning.

Table 2.

Effect of the two levels of mobility restriction on the concentration of pollutants: (a) full‐lockdown, and (b) partial relaxation.

4.3.

Note: Color scale from red to green highlights the percentage of pollution reduction. The greener is the cell, the higher is the pollution drop. SD stands for standard deviation.

Moreover, Figure 2 shows the accuracy of the estimation of the pollution levels as if no lockdown was implemented. Before March 16, the overlap between predictive data, from the ML models, and actual data is almost perfect. It suggests that we can reasonably trust these predictive values as very close to the ones that would have been recorded if the human activities were usual. On the contrary, a reference based on the average of four previous years does not correlate at all with the real data. This visualization confirms that the proposed GBM‐based models can provide an accurate benchmark to quantify the pollution drop during the lockdown. The estimation of pollution reduction provided by the ML approach is by far more reliable than a simple statistical averaging of the value recorded during the past years. The robustness of the GBM models is verified by the fact that actual and predictive data re‐overlap progressively after the implementation of the partial relaxation measures.

Summarizing, during the full lockdown, NO2 gets the highest drop, reducing by half its concentrations (overall mean: −53% ± 2%), independently of the district (Table 2a). SO2 is also strongly reduced over the whole city (overall mean: −45% ± 11%), followed by CO (−30% ± 13%). CO decreases more in the traffic and residential areas (around 35%) than in the industrial areas (around 20%), which can be explained by the fact that CO is directly emitted from vehicles. As previously mentioned, the effect of the lockdown on PM2.5 is less important, but more significant for traffic‐busy than industrial districts. For instance, the level of PM2.5 kept constant in Chillos, but more reduced in traffic‐busy areas, because this anthropogenic pollutant is mostly emitted by diesel vehicles. In the case of Chillos, it comes from thermoelectric power plant and factories, which maintained their activity during the pandemic.

When the restrictions on human mobility subsided slightly, the atmospheric pollution returned to almost normal levels (overall mean: −11% ± 7%, during the partial relaxation, against −36% ± 15%, during the full lockdown), especially for PM2.5, CO, and NO2 (Table 2b). Even if the reduction was not comparable with the lockdown period, SO2 levels stayed lower than usual. There are also certain inconsistencies from one site to another (e.g., −40% NO2 in Cotocollao, against +4% in Guamani) and from one pollutant to another (e.g., +11% PM2.5, against −41% SO2 in Guamani). The consequences on air quality are a bit noisier during the partial relaxation due to a more stochastic human activity.

5. Conclusions

This study aimed to estimate the impact of COVID‐19 on NO2, CO, PM2.5, and SO2 in Quito, Ecuador. A GBM algorithm was used to build 24 models to predict the air quality as if COVID‐19 quarantine never existed. Unlike other weather normalization studies focusing on a single pollutant, we addressed the prediction of more pollutants and investigated the importance of more variables. The temporal features were among the most important variables for the modeling of any contaminant, while meteorological parameters are pollutant‐ and site‐dependent.

The accuracy of the models was evaluated through a cross‐validation on several years before the lockdown, starting January 2016. The evaluation metrics show a high predictive performance for NO2 and CO, a good one for SO2 and fair for PM2.5, varying for different districts in the city. The best model accuracy is obtained for the traffic‐busy areas. This study suggests that previous years' average concentrations are not as reliable in estimating the effect of human activities on air quality and highlights the benefit of building a model based on an ML approach, considering actual meteorological conditions and temporal features.

Finally, the lockdown effect on air pollutants was analyzed for traffic and industrial districts. A clear reduction is observed for NO2 (−53% ± 2%), SO2 (−45% ± 11%), and CO (−30% ± 13%) concentrations during the full lockdown. The reduction in PM2.5 concentrations display some variability depending on the type of monitoring site (traffic site: −21% ± 3%, industrial site: −12% ± 11%). Weather events, like increased wind speed, help explain the reduced effect of mobility regulations on PM2.5. Unavoidably, implementing partial relaxation measures, the pollution concentrations progressively return to close to the same levels as before the pandemic.

Months into the pandemic, the world is overwhelmed with the implications to the economic development, public health, mobility, and environmental politics. However, this short break in a production of anthropogenic emissions showed a different reality with a global reduction of pollution. This unexpected scenario was an opportunity to assess the weight of the human activity (especially its mobility) on the atmospheric environment (Turner et al., 2020). The sanitary emergency highlighted the importance of strict traffic policies to improve urban air quality and reduce emission of greenhouse gases. In that sense, future work must be carried out on the impact of human mobility on the atmospheric pollution in the rapidly growing developing countries and an efficient transition to cleaner transportation. Although this study focuses on the COVID‐19 pandemic, the reliability of our predictions supports the idea that an ML‐based modeling is a robust method to quantify the impact of any event on air pollution.

Acknowledgments

Funding for this study is provided by the Universidad de Las Americas, Ecuador as a part of an internal research project AMB.RZ.20.02 and CEPRA‐XV‐2021‐018 project funded by CEDIA (Corporación Ecuatoriana para el Desarrollo de la Investigación y la Academia.

Rybarczyk, Y. , & Zalakeviciute, R. (2021). Assessing the COVID‐19 impact on air quality: A machine learning approach. Geophysical Research Letters, 48, e2020GL091202. 10.1029/2020GL091202

The copyright line for this article was changed on 12 February 2021 after original online publication.

Contributor Information

Yves Rybarczyk, Email: yry@du.se.

Rasa Zalakeviciute, Email: rasa.zalakeviciute@udla.edu.ec.

Data Availability Statement

There are no real or perceived financial conflicts of interests for any author.The data supporting the conclusions can be obtained at Mendeley Data, V1, doi: 10.17632/trs5j932s8.1. (Zalakeviciute, et al., 2020b), “Air Quality in Quito during COVID‐19 outbreak” at https://data.mendeley.com/datasets/trs5j932s8/1

References

  1. Bauwens, M. , Compernolle, S. , Stavrakou, T. , Müller, J.‐F. , van Gent, J. , Eskes, H. , et al. (2020). Impact of Coronavirus Outbreak on NO2 Pollution Assessed Using TROPOMI and OMI Observations. Geophysical Research Letters, 47(11), e2020GL087978. 10.1029/2020GL087978 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Coker, E. S. , Cavalli, L. , Fabrizi, E. , Guastella, G. , Lippo, E. , Parisi, M. L. , et al. (2020). The Effects of Air Pollution on COVID‐19 Related Mortality in Northern Italy. Environmental and Resource Economics, 76, 611–634. 10.1007/s10640-020-00486-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Csavina, J. , Field, J. , Félix, O. , Corral‐Avitia, A. Y. , Sáez, A. E. , & Betterton, E. A. (2014). Effect of wind speed and relative humidity on atmospheric dust concentrations in semi‐arid climates. The Science of the Total Environment, 487, 82–90. 10.1016/j.scitotenv.2014.03.138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Dhaka, S. K. , Chetna, V. , Panwar, V. , Dimri, A. P. , Singh, N. , Patra, P. K. , et al. (2020). PM2.5 diminution and haze events over Delhi during the COVID‐19 lockdown period: An interplay between the baseline pollution and meteorology. Nature Scientific Reports, 10(1), 13442. 10.1038/s41598-020-70179-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ding, J. , van der, A. R. J. , Eskes, H. , Mijling, B. , Stavrakou, T. , van Geffen, J. , & Veefkind, P. 2020. NOx emissions reduction and rebound in China due to the COVID‐19 crisis. Geophysical Research Letters, e2020GL089912. 10.1029/2020GL089912 [DOI] [Google Scholar]
  6. EMASEO (2011). Municipio del distrito metropolitano de Quito: Plan de Desarrollo 2012–2022, (1–166). Quito. http://www.emaseo.gob.ec/documentos/lotaip_2012/s/plan_de_desarrollo_2012_2014.pdf [Google Scholar]
  7. EPA (2018). Health and environmental effects of particulate matter (PM) [WWW Document]. Part. Matter Pollut. URL. https://www.epa.gov/pm-pollution/health-and-environmental-effects-particulate-matter-pm [Google Scholar]
  8. European Environment Agency (2017). Air quality in Europe — 2017 report, (1–80). 10.2800/777411 [DOI] [Google Scholar]
  9. European Environment Agency (2018). Air quality in Europe — 2018 report, (1–88). 10.2800/850018 [DOI] [Google Scholar]
  10. European Space Agency (2020). Coronavirus: nitrogen dioxide emissions drop over Italy. [WWW Document]. Satell. images video. URL. https://youtu.be/ARpxtAKsORw [Google Scholar]
  11. Faridi, S. , Yousefian, F. , Niazi, S. , Ghalhari, M. R. , Hassanvand, M. S. , & Naddafi, K. (2020). Impact of SARS‐CoV‐2 on Ambient Air Particulate Matter in Tehran. Aerosol Air Qual. Res., 20(8), 1805–1811. 10.4209/aaqr.2020.05.0225 [DOI] [Google Scholar]
  12. Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 1189–1232. 10.1214/aos/1013203451 [DOI] [Google Scholar]
  13. Geographic, N. , 2020. Pollution made COVID‐19 worse. Now lockdowns are clearing the air. [WWW Document]. Sci. Coronavirus Cover. Retrieved from. https://www.nationalgeographic.com/science/2020/04/pollution-made-the-pandemic-worse-but-lockdowns-clean-the-sky/ [Google Scholar]
  14. Goldberg, D. L. , Anenberg, S. C. , Griffin, D. , McLinden, C. A. , Lu, Z. , & Streets, D. G. (2020). Disentangling the Impact of the COVID‐19 Lockdowns on Urban NO2 From Natural Variability. Geophysical Research Letters, 47(17), e2020GL089269. 10.1029/2020GL089269 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Grange, S. K. , & Carslaw, D. C. (2019). Using meteorological normalization to detect interventions in air quality time series. The Science of the Total Environment, 653, 578–588. 10.1016/j.scitotenv.2018.10.344 [DOI] [PubMed] [Google Scholar]
  16. Grange, S. K. , Carslaw, D. C. , Lewis, A. C. , Boleti, E. , & Hueglin, C. (2018). Random forest meteorological normalization models for Swiss PM 10 trend analysis. Atmospheric Chemistry and Physics Discussions, 18, 6223–6239. 10.5194/acp-18-6223-2018 [DOI] [Google Scholar]
  17. Henneman, L. R. F. , Holmes, H. A. , Mulholland, J. A. , & Russell, A. G. (2015). Meteorological detrending of primary and secondary pollutant concentrations: Method application and evaluation using long‐term (2000–2012) data in Atlanta. Atmospheric Environment, 119, 201–210. 10.1016/j.atmosenv.2015.08.007 [DOI] [Google Scholar]
  18. INEC (2011). Quito. Poblacion, superficie (km2), densidad poblacional a nivel parroquial. Retrieved from https://www.ecuadorencifras.gob.ec/search/Poblaci%C3%B3n,+superficie+(km2),+densidad+poblacional+a+nivel+parroquial/ [Google Scholar]
  19. Karagulian, F. , Belis, C. A. , Dora, C. F. C. , Prüss‐Ustün, A. M. , Bonjour, S. , Adair‐Rohani, H. , et al. (2015). Contributions to cities’ ambient particulate matter (PM): A systematic review of local source contributions at global level. Atmospheric Environment, 120, 475–483. 10.1016/j.atmosenv.2015.08.087 [DOI] [Google Scholar]
  20. Lelieveld, J. , Evans, J. S. , Fnais, M. , Giannadaki, D. , & Pozzer, A. (2015). The contribution of outdoor air pollution sources to premature mortality on a global scale. Nature, 525, 367–371. 10.1038/nature15371 [DOI] [PubMed] [Google Scholar]
  21. Limb, M. (2016). Half of wealthy and 98% of poorer cities breach air quality guidelines. BMJ, 353. 10.1136/bmj.i2730 [DOI] [PubMed] [Google Scholar]
  22. NASA (2020). Airborne Nitrogen Dioxide Plummets Over China. [WWW Document]. Earth Obs. URL. https://earthobservatory.nasa.gov/images/146362/airborne-nitrogen-dioxide-plummets-over-china [Google Scholar]
  23. Navinya, C. , Patidar, G. , & Phuleria, H. C. (2020). Examining Effects of the COVID‐19 National Lockdown on Ambient Air Quality across Urban India. Aerosol Air Qual. Res., 20(8), 1759–1771. 10.4209/aaqr.2020.05.0256 [DOI] [Google Scholar]
  24. Nikkei Asian Review , 2020. Blue skies return to China as coronavirus cuts coal consumption ‐ Nikkei Asian Review [WWW Document]. Spotlight. URL asia.nikkei.com/Spotlight/Coronavirus/Blue‐skies‐return‐to‐China‐as‐coronavirus‐cuts‐coal‐consumption.
  25. Ogen, Y. (2020). Assessing nitrogen dioxide (NO2) levels as a contributing factor to coronavirus (COVID‐19) fatality. The Science of the Total Environment, 726, 138605. 10.1016/j.scitotenv.2020.138605 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Pani, S. K. , Lin, N.‐H. , & Ravindra Babu, S. (2020). Association of COVID‐19 pandemic with meteorological parameters over Singapore. The Science of the Total Environment, 740, 140112. 10.1016/j.scitotenv.2020.140112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Petetin, H. , Bowdalo, D. , Soret, A. , Guevara, M. , Jorba, O. , Serradell, K. , et al. (2020). Meteorology‐normalized impact of the COVID‐19 lockdown upon NO2 pollution in Spain. Atmospheric Chemistry and Physics, 20, 11119–11141. 10.5194/acp-20-11119-2020 [DOI] [Google Scholar]
  28. Pope, C. A. , Dockery, D. W. , Pope, C. A. , & Dockery, D. W. (2006). In: 2012. Health effects of fine particulate air pollution: Lines that connect health effects of fine particulate air pollution. Journal of the Air & Waste Management Association, 56(6), Li 2247. 10.1080/10473289.2006.10464485 [DOI] [PubMed] [Google Scholar]
  29. Querol, X. , Viana, M. , Alastuey, A. , Amato, F. , Moreno, T. , Castillo, S. , et al. (2007). Source origin of trace elements in PM from regional background. Urban and Industrial Sites of Spain, 41(37), 7219–7231. 10.1016/j.atmosenv.2007.05.022 [DOI] [Google Scholar]
  30. Shi, X. , & Brasseur, G. P. (2020). The Response in Air Quality to the Reduction of Chinese Economic Activities During the COVID‐19 Outbreak. Geophysical Research Letters, 47(11), e2020GL088070. 10.1029/2020GL088070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Turner, A. J. , Kim, J. , Fitzmaurice, H. , Newman, C. , Worthington, K. , Chan, K. , et al. (2020). Observed Impacts of COVID‐19 on Urban CO2 Emissions. Geophysical Research Letters, 47(22), e2020GL090037. 10.1029/2020GL090037 [DOI] [Google Scholar]
  32. UN. (2015). The 2015 revision of world population prospects. Retrieved from https://population.un.org/wpp/Publications/Files/WPP2015_DataBooklet.pdf [Google Scholar]
  33. Wang, P. , Chen, K. , Zhu, S. , Wang, P. , & Zhang, H. (2020). Severe air pollution events not avoided by reduced anthropogenic activities during COVID‐19 outbreak. Resources, Conservation and Recycling, 158, 104814. 10.1016/j.resconrec.2020.104814 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Wang, Y. , Yuan, Y. , Wang, Q. , Liu, C. , Zhi, Q. , & Cao, J. (2020). Changes in air quality related to the control of coronavirus in China: Implications for traffic and industrial emissions. The Science of the Total Environment, 731(20), 139133. 10.1016/j.scitotenv.2020.139133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. WHO . (2016). Air pollution levels rising in many of the world's poorest cities [WWW Document]. WHO. URL. http://www.who.int/mediacentre/news/releases/2016/air-pollution-rising/en/#.WrrLXYrQNwg.mendeley [Google Scholar]
  36. Worldometer (2020). Coronavirus Cases [WWW Document]. COVID‐19 CORONAVIRUS PANDEMIC. 10.1101/2020.01.23.20018549V2 [DOI]
  37. Wu, X. , Nethery, R. C. , Sabath, M. B. , Braun, D. , & Dominici, F. (2020). Air pollution and COVID‐19 mortality in the United States: Strengths and limitations of an ecological regression analysis, 6(45), eabd4049. Retrieved from https://advances.sciencemag.org/content/6/45/eabd4049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Žabkar, R. , Honzak, L. , Skok, G. , Forkel, R. , Rakovec, J. , Ceglar, A. , et al. (2015). Evaluation of the high resolution WRF‐Chem (v3.4.1) air quality forecast and its comparison with statistical ozone predictions. Geoscientific Model Development, 8(7), 2119–2137. 10.5194/gmd-8-2119-2015 [DOI] [Google Scholar]
  39. Zalakeviciute, R. , López‐Villada, J. , & Rybarczyk, Y. (2018. a). Contrasted effects of relative humidity and precipitation on urban PM2.5 pollution in high elevation urban areas. Sustainability, 10(6), 2064. 10.3390/su10062064 [DOI] [Google Scholar]
  40. Zalakeviciute, Rasa Rybarczyk, Y. , & Diaz, V. (2020b). Assessing the COVID‐19 impact on air quality in Quito Ecuador [WWW Document]. Mendeley Data. 10.17632/trs5j932s8.1 [DOI] [Google Scholar]
  41. Zalakeviciute, R. , Rybarczyk, Y. , Lopez Villada, J. , & Diaz Suarez, M. V. (2018. b). Quantifying decade‐long effects of fuel and traffic regulations on urban ambient PM2.5 pollution in a mid‐size South American city. Atmospheric Pollution Research, 9(1), 66–75. 10.1016/j.apr.2017.07.001 [DOI] [Google Scholar]
  42. Zalakeviciute, R. , Vasquez, R. , Bayas, D. , Buenano, A. , Mejia, D. , Zegarra, R. , et al. (2020a). Drastic Improvements in Air Quality in Ecuador during the COVID‐19 Outbreak. Aerosol and Air Quality Research, 20(8), 1783–1792. 10.4209/aaqr.2020.05.0254 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

There are no real or perceived financial conflicts of interests for any author.The data supporting the conclusions can be obtained at Mendeley Data, V1, doi: 10.17632/trs5j932s8.1. (Zalakeviciute, et al., 2020b), “Air Quality in Quito during COVID‐19 outbreak” at https://data.mendeley.com/datasets/trs5j932s8/1


Articles from Geophysical Research Letters are provided here courtesy of Wiley

RESOURCES