Skip to main content
Computational Intelligence and Neuroscience logoLink to Computational Intelligence and Neuroscience
. 2022 Dec 9;2022:9755422. doi: 10.1155/2022/9755422

Air Quality Prediction Using the Fractional Gradient-Based Recurrent Neural Network

Sugandha Arora 1,, Narinderjit Singh Sawaran Singh 2, Divyanshu Singh 1, Rishi Rakesh Shrivastava 1, Trilok Mathur 1, Kamlesh Tiwari 1, Shivi Agarwal 1
PMCID: PMC9757944  PMID: 36531923

Abstract

In this study, the air quality index (AQI) of Indian cities of different tiers is predicted by using the vanilla recurrent neural network (RNN). AQI is used to measure the air quality of any region which is calculated on the basis of the concentration of ground-level ozone, particle pollution, carbon monoxide, and sulphur dioxide in air. Thus, the present air quality of an area is dependent on current weather conditions, vehicle traffic in that area, or anything that increases air pollution. Also, the current air quality is dependent on the climate conditions and industrialization in that area. Thus, the AQI is history-dependent. To capture this dependency, the memory property of fractional derivatives is exploited in this algorithm and the fractional gradient descent algorithm involving Caputo's derivative has been used in the backpropagation algorithm for training of the RNN. Due to the availability of a large amount of data and high computation support, deep neural networks are capable of giving state-of-the-art results in the time series prediction. But, in this study, the basic vanilla RNN has been chosen to check the effectiveness of fractional derivatives. The AQI and gases affecting AQI prediction results for different cities show that the proposed algorithm leads to higher accuracy. It has been observed that the results of the vanilla RNN with fractional derivatives are comparable to long short-term memory (LSTM).

1. Introduction

With the increase in urbanization, industrialization, and traffic in the cities, the air pollutants are increasing and air quality is reducing [1]. To keep a check on the extent of air pollution, the US Environment Protection Agency has introduced a parameter called the air quality index (AQI) which tracks the daily effects of air pollutants [2]. AQI is a numerical value between 0 and 500; when the value of AQI is 0, the air quality is adequate, and if the value of AQI is 500, then the air quality is hazardous. AQI is calculated by considering major air pollutants, such as carbon monoxide (CO), nitrogen dioxide (NO2), ozone (O3), particulate matter (PM10 and PM2.5), and sulphur dioxide (SO2). These pollutants are the residual gases and particles emitted from vehicles, industries, and due to climate change [3].

Biomass and coal-burning highly increase the levels of particulate matter (PM10 and PM2.5) that causes haze in the air. These particles deteriorate the air composition and cause respiratory problems in living beings. Moreover, haze reduces the visibility that further affects the economic sectors such as tourism and agriculture [4]. The combustion of fossil fuels is carried out in several industries which are the main contributors of SO2 and NO2 in air [5, 6]. The motorized vehicles and combustion of fossil fuels also emit CO, which is another major pollutant responsible for worsening the air quality. CO is highly poisonous and can even lead to mortality on long exposure [7]. Another major pollutant is ground-level ozone O3, obtained from the combination of two primary pollutants, nitrogen oxides (NOx) and volatile organic compounds (VOCs). The 95% of these primary pollutants come from oil, coal, and gasoline combustion in vehicles, industries, power plants, and households, upstream gas and oil production, combustion of residual woods, and the evaporated liquid fuels [8]. Exposure to ozone can significantly affect human health, cause asthma, and can lead to premature mortality [9]. In addition, ozone can adversely affect vegetation, damage flowers and shrubs, and reduce crop productivity [10, 11].

Air pollution is not a local phenomenon; the current quality of air is dependent on its history. The industrialization has massively impacted the environment, especially the air quality [12]. The levels of air pollutants like ground-level ozone and particulate matter are also getting influenced by modifying weather patterns that occurred due to climate change [13, 14]. The change in climate affects the temperature, humidity levels, and wind patterns, which in turn influences the air quality. In addition, the naturally occurring emissions, for example, wind-blown dust and wildfires, get provoked by climate-driven changes in meteorology that affect the air quality. The uncontrolled emission of air pollutants is gradually causing air pollution. Continuous exposure to polluted air is severely affecting human health [15] and leading to the development of lung, heart, and skin diseases [16]. Six of the world's top 10 most polluted cities are from India. Air pollution has been observed as the second biggest risk factor which is causing diseases in India and thus affecting its economy. Thus, there is a need to keep a check on air pollution in Indian cities. Each city has its unique features such as population per square km, temperature, humidity level, climate, vehicles, and industries in the region, and thus, it is better to study air pollution region-wise. Generally, the air quality in tier I and urban cities is low and it is required to give more attention in such areas.

To prevent the serious consequences of air pollution, several forecasting techniques for AQI are being developed. Based on target objectives, the techniques and approaches of forecasting are being expanded and improved. Traditional AQI forecasting techniques involve statistical techniques such as autoregressive integrated moving average (ARIMA) [17, 18], principal component regression (PCR) [19], multiple linear regression (MLR) [20], and grey models [21, 22]. These models perform well, but with the high increase in pollution, more accurate methods are required. These models are linear and thus are unable to capture the nonlinear traits [23, 24]. Even with a large amount of data, not much increase is seen in the accuracy of these models. The performances of the statistical techniques have been improved by developing hybrid techniques [25]. Artificial intelligence-based techniques are capable of analyzing the nonlinear data and thus are being recently used in the time series forecasting [26, 27]. With the availability of sufficient amount of data and computational support, AQI forecasting is being done with a deep neural network [28]. But these methods require to learn large number of parameters. Thus, a simpler and accurate method has been developed in this study using a vanilla RNN. The current level of air pollution in any area is also dependent on AQI status in the past. To capture the history dependency, fractional derivatives have been employed in the back propagation algorithm to train the vanilla RNN for the prediction of AQI in Indian cities.

In this study, five cities of different tiers are considered for AQI prediction. Bengaluru, Kolkata, and Hyderabad are tier I cities, while Patna and Talcher are tier II and tier III cities, respectively, as shown in Figure 1. The major air pollutants of Kolkata are also predicted using the proposed approach. The results show that the proposed method achieves minimum error on some fractional orders. Also, the obtained results are comparable to the LSTM. The rest of the study is structured as follows: Section 2 briefly explains the related work. The proposed approach is presented in Section 4. Section 5 discusses the experimental results obtained, followed by conclusion and future scopes in Section 6 and Section 7, respectively.

Figure 1.

Figure 1

Map of India showing the locations of tier (I), tier II, and tier III cities represented by red, orange, and yellow color, respectively.

2. Preliminaries

Factional calculus is a 300-year-old branch of mathematics that deals with derivatives and integrals of noninteger order, i.e., order can be any number, be complex or real. Earlier, this domain was only theoretical involving rigorous calculations, but these derivatives are used in practical applications as well [29, 30]. Several versions of fractional derivatives and integrals have been introduced till now, where each version has unique characteristics. The most widely used versions are described.

  • (i)
    Riemann–Liouville (RL) fractional integral operator: This is the most frequently used version of fractional integral [31]. The α order RL fractional integral is expressed as follows:
    aDtαft=1Γαattηα1fηdη. (1)
  • Here, α > 0, t > a, t, a, the function Γ(·) is the Gamma function, and f is a piecewise continuous function on [0, ) and integrable on any finite subinterval of (0, ).

  • (ii)
    Riemann–Liouville (RL) fractional derivative: This is the natural generalization of integer-order derivative, as this fractional derivative version and ordinary derivative are left inverse of integrals, i.e., aDtαDαf(t) = D0f(t) = f(t). For t > a, α > 0, and n, such that n − 1 < α < n; the derivative of order α is evaluated by differentiating the nν order RL integral of function f(t)n times,
    aDtαft=DnaDtnαft, (2)
  • i.e.,
    aDtαft=1Γnαdndtnatfηtην+1ndη,n1<α<n,dndtnft,α=n. (3)
  • It is the α order RL fractional derivative [31]. But this definition has some disadvantages as well. The most significant disadvantage is that RL derivative of order α, (<1) of a constant is not zero.

  • (iii)
    Caputo's fractional derivative: Fort > a, α > 0, and n, such that n − 1 < α < n, the Caputo's derivative of order α is obtained by evaluating the nα order RL integral of nth order derivative of function f(t), i.e., aCDtαf(t)=aDt−(nα)[Dnf(t)], that gives the following:
    DtαaCft=1Γnαatfnηtηα+1ndη,n1<α<n,dndtnft,α=n. (4)
  • It is called the α order Caputo's fractional derivative. Caputo gave this definition of fractional derivative in 1967 for overcoming the limitations of RL derivative [32]. The Caputo's derivative of order α > 0, n − 1 ≤ α < n for a constant c is zero. This version increased the applicability of fractional derivatives in modelling real world problems. Thus, in this study, Caputo's version of fractional derivatives has been used.

  • (iv)
    Grünwald–Letnikov (GL) fractional derivative: The limit definition of fractional derivative was given by Anton Karl Grünwald and Aleksey Vasilievich Letnikov in 1867 and 1868, respectively [31]. Without any assumptions on differentiability of the function for α > 0, the α-order GL derivative of function f is expressed as follows:
    aDtαft=limh0nh=tahαr=0n1rαrftrh. (5)
  • Here, h is the step size and αr=Γα+1/Γαr+1.Γr+1 with the Gamma function Γ(·).

Clearly, the limit definition of first-order derivative shows that evaluation of derivative involves usage of only two points. But it can be seen from limit definition of fractional derivatives and from the Caputo's definition in equation (4) and equation (5), respectively, that their evaluation involves usage of value of the function at all past points. This makes the fractional derivatives to be a nonlocal operator and incorporate memory to the systems. Due to the memory property and availability of software and other tools, fractional calculus has been used in numerous applications of science and engineering. These have been widely used in viscoelasticity [33], biology [34], signal and image processing [35], stock market [27, 36], economics [37], and in other domains with history dependency [38]. Moreover, the order of differentiation acts as a degree of freedom in the optimization process.

The nonlocality of fractional derivatives has been the major motivation for their application in different domains. Fractional calculus has been successfully used for air quality prediction [3941]. Fractional derivative-based Kalman filter has been introduced to measure the pollutant emission and hence the air quality [39]. Several variants of fractional Kalman filters have been developed using different fractional-order derivatives version for improving the prediction accuracy [4042]. In these air-quality models, fractional calculus is incorporated because of its long-term memory and nonlocal nature. Fractional calculus has been successfully applied in the training of artificial neural networks [4346]. After replacing integer-order derivative by fractional-order derivative in the back propagation of the training algorithm, the update rule gets updated as follows:

Δw=ηδαEδwα, (6)

where η and α are the learning rate and fractional order of differentiation, respectively. Chen [47] employed the fractional derivative in their backpropagation method for feed-forward neural networks (FNNs) in 2013. The simulation results showed that fractional derivative-based FNNs had a substantially better convergence speed than integer-order FNNs. The fractional derivatives have been successfully applied in the backpropagation learning algorithm of the radial basis function network [48], recurrent neural network [46], convolutional neural network [49], and even in deep neural networks and have shown significant improvement in accuracy. In our study, the effect of using fractional derivatives in the learning of the neural network has been analyzed for the prediction of air quality in few Indian cities. Earlier too, the effectiveness of fractional derivatives has been shown for nonlinear system identification, pattern classification, and Mackey–Glass chaotic time series prediction [46].

3. Fractional Gradient-Based Backward Propagation Algorithm

In this section, we introduce the fractional-order truncated backpropagation through the time algorithm on the RNN with 10 neurons in a layer. This backpropagation algorithm considers the truncated depth of the input data and the state of the network, which makes the algorithm computationally efficient. For the implementation of the backpropagation algorithm, the mean squared error at an instant is considered as follows:

Es=12iΩΦuisxis2=12iΩeis2, (7)

where i is the output neuron, Φ(ui(s)) and xi(s) are the actual output and the expected output of the ith neuron at time s, and ui(s) = ∑j∈Ωwijvj(s) at time s, where wij(s) is the weight of a signal from jth neuron to ith and vj(s) is the output of jth neuron at time s; then, the update rule becomes as follows:

wijs+1=wijsηwijαEs, (8)

where η is the learning rate and ∇wijα represents the factional gradient w.r.t wij. Now, ∇wijαE(s) can be evaluated by applying the approximated chain to the error function. The actual chain rule applicable on fractional derivatives is complicated and involves special mathematical functions; thus, several approximated chain rules have been developed for fractional derivatives. [5053] The chain rule given by expression (14) has been obtained by using fractional Taylor's series expansion for differentiable function. Consider a differentiable function, say f then for a small h,

fx+h=fx+hαΓ1+αfαx+h2αΓ2+αf2αx+h3αΓ3+αf3αx+. (9)

Then,

Δfxhα=1Γ1+αfαx+hαΓ2+αf2αx+h2αΓ3+αf3αx+. (10)

Taking limit h⟶0, we get

limh0Δfxhα=1Γ1+αfαx. (11)

Also, limh⟶0Δαf(x)/hα=f(α)(x),

ΔαfxΓ1+αΔfx0<α<1. (12)

From above equation, we can also say dαf(x) ≈ Γ(1 + α)df(x)0 < α < 1. Using this result,

dαfvxdxα=dαfuxdαvdαvdxα=Γα+1dfvxΓα+1dvvxαx=guv.vxα. (13)

Hence,

Dxαfvx=Dufv.Dxαvx. (14)

After using the abovementioned fractional chain rule, we get

wjiαEs=Esejs·ejsvjs·vjsuis·αuiswjiαs. (15)

Suppose ∂E(s)/∂uj(s)=δj(s), then

Esejs·ejsvjs·vjsuis=Esujs·αuiswjiαs=δjs·αuiswjiαs. (16)

Now, as in this study Caputo's version of fractional derivative is being used and 0CDxαxp = Γ(p + 1)xpα/Γ(pα + 1), for p > −1, then the following holds:

αuiswjiαs=wji1αsvisΓ2α. (17)

Thus, from equations (4), (16), and (17), the following final update rule is obtained:

wkjs+1=wkjsηs=nh+1nδkswkj1αsvjsΓ2α. (18)

4. Proposed Approach

In this study, the vanilla RNN has been employed to predict the AQI value of a day based on the previous sequential AQI data of five different cities. RNNs are capable of learning the sequential pattern of historical data. Furthermore, the accuracy of the system has been improved by incorporating memory into the system using the fractional gradient descent algorithm.

4.1. Data Exploration and Processing

In this study, the continuous AQI data have been used to predict future unseen AQI values. For each city, a continuous-time patch of around 1000 data points has been used from the AQI dataset. The sample data can be seen in Table 1. For constructing the training data, a min-max scaler has been used to scale the data values between 0 and 1. The predicted values are also obtained between 0 and 1, which are inverse-transformed later to evaluate the final predicted AQI value.

Table 1.

Data samples of all cities.

Bengaluru
Date 2019-09-01 2019-09-02 2019-09-03 2019-09-04 2019-09-05 2019-19-06 2019-19-07
AQI 61 48 51 51 56 64 63

Kolkata
Date 2018-06-16 2018-06-17 2018-06-18 2018-06-19 2018-06-20 2018-06-21 2018-06-22
AQI 119 113 107 148 94 100 60

Hyderabad
Date 2017-11-01 2017-11-02 2017-11-03 2017-11-04 2017-11-05 2017-11-06 2017-11-07
AQI 150 156 158 114 91 80 94

Patna
Date 2017-11-05 2017-11-06 2017-11-07 2017-11-08 2017-11-09 2017-11-10 2017-11-11
AQI 276 289 286 354 430 439 429

Talcher
Date 2018-02-08 2018-02-09 2018-02-10 2018-02-13 2018-02-14 2018-02-15 2018-02-16
AQI 311 321 343 343 269 243 350

4.2. Neural Network Architecture and Training

The vanilla RNN has been used in the proposed model which has a single-layered architecture with 10 nodes in it. The fractional gradient-based RNN model is built from scratch using the NumPy library, and the Pandas library is used for data preprocessing. Forward propagation and fractional gradient-based back propagation as given by (18) have been used for training the vanilla RNN. On the other hand, for the LSTM, the TensorFlow library is used to produce all the results, and the integer-order gradient descent algorithm is used to train the model. Backpropagation has been used in both the models through 10 days (timestamps) which would predict the AQI data for the next (11th) day. Training and testing sets have 600–800 and 100 data values, respectively. For the initializing of weights for the models, Xavier's initialization has been used with 0.1 learning rate; 80 epochs were used to train the model for each city and each fractional order. Figure 2 shows the architecture of the RNN with fractional gradient-based backpropagation.

Figure 2.

Figure 2

RNN architecture with fractional backpropagation.

4.3. Evaluation Parameter

The standard evaluation metrics for forecasting models viz root mean squared error (RMSE) and mean absolute percentage error (MAPE) have been employed to assess the performance of the proposed model in the prediction of AQI of different Indian cities and the major pollutants in one of those cities. The lesser the value of RMSE and MAPE, the better the performance of the predictor. These errors measure the performance of forecasting, climatology, and regression analysis for verifying the experimental results. The detailed information related to these parameters is provided.

The root mean square error (RMSE) is the square root of the average of the squared difference between the actual and predicted value. RMSE can be expressed by the following expression:

RMSE=t=1Not^ot2N, (19)

where ot^ is the predicted value and ot is the expected output for iteration t, which are observed for N times.

The mean absolute percentage error (MAPE) is the average percentage of the absolute difference between the actual and predicted values divided by the actual value for each time period [54]. MAPE can be expressed by the following expression:

MAPE=100Nt=1Not^otot, (20)

where ot^ is the predicted value and ot is the expected output for iteration t, which are observed for N times. This is a form of percentage error, which has helped in the analysis of the proposed model in different situations.

5. Results and Analysis

This section describes the AQI dataset of five cities chosen, the results obtained by using the proposed approach on the AQI data, and the discussion of comparison between predictions of LSTM and the proposed approach on different fractional orders. The performance of networks is measured using RMSE and MAPE.

5.1. Dataset

The AQI dataset of five cities for 2015–2020 is considered which is publicly available at the official portal of the Central Pollution Control Board, Government of India (http://cpcb.nic.in/). The dataset consists of daily air quality levels at various stations across multiple cities in India which are obtained by averaging out the hourly value of AQI. Indian cities chosen for the analysis includes Kolkata, Hyderabad, Bengaluru, Patna, and Talcher. Basic information related to these cities is as follows:

  1. Kolkata (22°34′03N 88°43′57 E), located in West Bengal, is a tier I and the seventh most populous city of India with third-most populous metropolitan area. The concentration of pollutants such as sulphur dioxide and nitrogen dioxide remains within the limit, but the presence of particulate matter in air is high and is increasing over the years. Due to this, air pollution is severe and is causing respiratory ailments such as lung cancer.

  2. Bengaluru (12°58′44 N 77°35′30 E), located in Karnataka, is also a tier I and the third most populous city of India with fifth most populous metropolitan area. Bengaluru is also considered as “Silicon Valley of India” because it is the nation's top IT exporter. This IT hub region is the most polluted and is causing several environmental issues. Due to the large population, Bengaluru generates tonnes of solid waste which is polluting the environment. Thus, the large population and IT hub of Bengaluru is the major reason for air pollution.

  3. Hyderabad (17°21′42 N 78°28′29 E), located in Telangana, is also a tier I and the fourth most populous city of India with sixth most populous metropolitan area. Again, due to the large population, increased economic activity, and rapid urbanization, tonnes of solid waste are generated, and disposal of such waste becomes hazardous and pollutes the environment. The particulate matter (PM10) dispersed in the atmosphere causes around 2500 deaths each year.

  4. Patna (25°36′0 N 85°6′0 E), located in Bihar, is a tier II city with a high population. Air pollution is a major issue in this city. The situation in winter becomes even worse due to dense smog, leading to an increase in mortality. Patna was declared as the second most air polluted city in India, in the WHO survey of 2014.

  5. Talcher (20°57′0 N 85°13′48 E), located in Angul district of Orissa, is a tier III city. This is a small city with less population, but Talcher has the country's biggest coalfield with the highest coal reserve of around 52 billion tonnes. The presence of these coal mines leads to air pollution.

The cities of different tiers have been chosen where air pollution is a major issue. To summarise, the cities with a large population or with a high emission rate of air pollutants affecting human health are considered. Around 600 normalized data points for each city have been used for the analysis, which is divided into train and test data in the ratio of 4 : 1. The model has been tested on the data for 100 days for each city.

5.2. AQI Prediction Results Using Fractional-Order Gradient Learning

The performance of the vanilla RNN in predicting AQI values of each city using the fractional backpropagation algorithm has been analyzed. To assess the performance of the model, RMSE and MAPE are computed. The prediction performance of the RNN using the fractional gradient descent algorithm with values of fractional orders in the neighborhood of 1 is compared with the performance of the RNN with the traditional integer-order gradient descent algorithm where the order remains 1. The values of fractional order α which are considered are 6/9, 7/9, 8/9, 1, 10/9, and 11/9 [49]. The results obtained at different orders using the proposed approach can be seen in Table 2. The graphs in Figures 37 show the comparison between actual and predicted output for Bengaluru, Kolkata, Hyderabad, Patna, and Talcher, respectively. In all the graphs, the expected output and the actual output are represented by the yellow lines and blue lines, respectively. It can be observed that the least RMSE and MAPE are acquired by the vanilla RNN at some fractional orders, either on 7/9 or 8/9 for all the cities. The model achieved minimum RMSE and MAPE of 13.22 and 06.11%, respectively, at α=8/9 for Bengaluru. Also, the minimum RMSE and MAPE are found to be 19 and 7.02% for Kolkata and 23.85 and 7.43% for Patna at α=8/9. Moreover, the minimum RMSE and MAPE are found to be 7.41 and 3.22% for Hyderabad and 11.40 and 4.15% for Talcher at α=7/9. Thus, it can be concluded from the results that the fractional-order gradient is more accurate than the integer-order gradient algorithm. Moreover, the proposed model performed best for α=0.7 by achieving the least MAPE of 3.22% for Hyderabad among all the cities.

Table 2.

Performance comparison of the fractional gradient-based RNN with the integer gradient-based RNN and LSTM.

Model Bengaluru Kolkata Hyderabad Patna Talcher Bengaluru Kolkata Hyderabad Patna Talcher
RMSE MAPE (%)
α=6/9 21.47 31.35 25.35 44.30 14.25 14.41 18.21 16.67 14.78 06.62
α=7/9 19.36 26.12 07.41 34.08 11.40 11.32 11.58 03.22 10.64 04.15
α=8/9 13.22 19.00 09.75 23.85 29.64 06.11 07.02 05.92 07.43 16.30
α=1 14.78 21.37 08.58 25.44 15.96 07.15 09.31 05.23 08.07 06.11
α=10/9 24.99 28.50 11.31 38.56 21.09 16.60 12.90 05.49 11.51 06.66
α=11/9 35.20 21.85 09.95 29.53 24.51 22.54 10.50 04.99 12.90 07.90
LSTM 10.78 19.00 07.02 24.12 13.11 05.78 06.61 03.45 07.83 04.11

Figure 3.

Figure 3

Predicted vs. actual AQI of Bengaluru city for the period of 100 days from 29-08-2019 to 07-12-2019. (a) α = 6/9. (b) α = 7/9. (c) α = 8/9. (d) α = 1. (e) α = 10/9. (f) α = 11/9.

Figure 4.

Figure 4

Predicted vs. actual AQI of Kolkata city for the period of 100 days from 28-10-2019 to 04-02-2020. (a) α = 6/9. (b) α = 7/9. (c) α = 8/9. (d) α = 1. (e) α = 10/9. (f) α = 11/9.

Figure 5.

Figure 5

Predicted vs. actual AQI of Hyderabad city for the period of 100 days from 29-09-2019 to 06-01-2020. (a) α = 6/9. (b) α = 7/9. (c) α = 8/9. (d) α = 1. (e) α = 10/9. (f) α = 11/9.

Figure 6.

Figure 6

Predicted vs. actual AQI of Patna city for the period of 100 days from 26-03-2019 to 04-07-2019, (a) α = 6/9. (b) α = 7/9. (c) α = 8/9. (d) α = 1. (e) α = 10/9. (f) α = 11/9.

Figure 7.

Figure 7

Predicted vs. actual AQI of Telchar city for the period of 100 days from 01-03-2020 to 08-6-2020. (a) α = 6/9. (b) α = 7/9. (c) α = 8/9. (d) α = 1. (e) α = 10/9. (f) α = 11/9.

5.3. Comparison of Results Obtained by the Proposed Method and LSTM

The prediction of AQI for the same set of datasets of all cities has been done using LSTM as well with the same number of timestamps, nodes, and the same procedure for inputs as done for the fractional RNN. The obtained results are also shown in Table 2. It can be concluded from the table that the performance of the LSTM is comparable to the vanilla RNN with fractional gradient learning. The RMSE values obtained by fractional gradient learning on Vanilla RNN are 23.85 and 11.40 which are lesser than 24.12 and 13.11 RMSE achieved by LSTM, corresponding to Talcher and Patna, respectively. In addition, the RMSE of the fractional-based RNN and LSTM is found to be 19 for Kolkata which is equal. Similarly, the MAPE for Hyderabad and Patna is found to be 03.22% and 7.43%, respectively, by the vanilla RNN with fractional gradient learning, whereas higher MAPE of 3.45% and 7.83% for Hyderabad and Patna, respectively, is achieved by LSTM. For Talcher, MAPE is found to be 4.15% corresponding to the fractional gradient-based RNN and is found to be 4.11% for LSTM which are almost equivalent. It can be seen that the least MAPE of 3.22% is obtained by the fractional gradient-based RNN for Hyderabad as compared to other cities and models. Figure 8 shows the comparison between the expected AQI value and the AQI value predicted for all the cities by LSTM.

Figure 8.

Figure 8

Predicted vs. actual AQI of all cities for 100 days using LSTM, (a) Bengaluru. (b) Kolkata. (c) Hyderabad. (d) Patna. (e) Telchar.

5.4. Prediction Results of Major Pollutants in Kolkata Using Different Fractional-Order Gradient Learning

The proposed approach has been implemented for the prediction of the concentration of major pollutants such as SO2, CO, and PM10. Here, the considered time is also the same as used in the prediction of the AQI of Kolkata. As we have seen in the above section, the performance of the algorithm is found to be better either for α = 7/9 or 8/9. So, we have considered these two fractional values to compare the results with the integer-order-based learning of the vanilla RNN. Figures 911 show the comparison between expected air pollutant concentrations and actual concentrations in Kolkata. It can be observed from Table 3 that minimum RMSE and MAPE for each city are attained at order α = 8/9, thus outperforming the traditional integer-order learning for the vanilla RNN. Moreover, the least MAPE of 4.70% is achieved in the prediction of CO, and thus, the proposed model is better for predicting the concentrations of CO as compared to other pollutants.

Figure 9.

Figure 9

Predicted vs. actual sulphur dioxide (SO2) in air in Kolkata for the period of 100 days from 28-10-2019 to 04-02-2020. (a) α = 7/9. (b) α = 8/9. (c) α = 1.

Figure 10.

Figure 10

Predicted vs. actual carbon monoxide (CO) in air in Kolkata for the period of 100 days from 28-10-2019 to 04-02-2020. (a) α = 7/9. (b) α = 8/9. (c) α = 1.

Figure 11.

Figure 11

Predicted vs. actual particulate matter (PM10) in air in Kolkata for the period of 100 days from 28-10-2019 to 04-02-2020. (a) α = 7/9. (b) α = 8/9. (c) α = 1.

Table 3.

RMSE corresponding to the fractional gradient-based RNN for different orders in predicting concentration of different pollutants responsible for air pollution in Kolkata city.

Pollutant RMSE MAPE (%)
α=7/9 α=8/9 α=1 α=7/9 α=8/9 α=1
S O 2 1.9 1.62 1.77 7.00 5.55 6.83
CO 0.20 0.12 0.15 5.21 4.20 4.89
P M 10 36.45 19.25 33.75 10.95 6.77 11.21

6. Conclusion

In this study, the fractional-order gradient has been used in the backpropagation of the vanilla RNN for the AQI prediction of five Indian cities of all tiers. The proposed approach has been used for the prediction of major air pollutants in tier I Kolkata city. Through the results of prediction of AQI of multiple cities and prediction of air pollutants, it has been observed that the minimum error on predictions is achieved at a fractional order. Most cities achieve better results when the order is equal to 8/9. The architecture of the vanilla RNN is much simpler than the structure/functioning of an LSTM, but the predictions made by RNNs with fractional gradient-based backpropagation are comparable and sometimes even better than LSTM with the integer-order gradient descent algorithm. Achieving lesser RMSE and MAPE with simpler architecture shows the effectiveness of fractional gradient over integer-order gradient descent. The least MAPE value is found for Hyderabad by using the fractional gradient-based RNN as compared to other cities and models. In addition, the least MAPE is achieved during predictions of CO concentrations in Kolkata. Therefore, the proposed model is better in the prediction of AQI values of Hyderabad as compared to other cities and CO concentrations in the air of Kolkata as compared to other air pollutants. From the results, it can be seen that RMSE is more for Kolkata and Patna. Patna is even amongst the world's top 10 most polluted cities, and particulate matter is increasing in Kolkata each year. Hence, the memory property of fractional derivatives can be well exploited with deep neural networks for dealing with more complex and dynamic data.

7. Future Scope

This study can be extended by predicting other air pollutants in the cities, and from there, AQI values can also be predicted. Using this strategy, major air pollutants in a city can be detected and stringent actions can be taken accordingly to prevent further damage. A portfolio of economic activities can be created considering the air quality of the particular city and also detecting the most affecting gases among them in the future. The order of the derivative is chosen manually in this study, due to which results are evaluated only on a few values of order. Hence, there is a need to develop an adaptive method that automatically evaluates the optimal order for a particular city or a set of data. Methods like particle swarm optimization (PSO) and genetic algorithms can be employed for optimizing the order of differentiation. Fractional gradient descent can be used with suitable architectures for different cities. Through the results of predictions of various gases of the city, we can find a better way to develop in a sustainable way.

Acknowledgments

The authors are thankful to Data Science and Information Technology Department, INTI International University, Malaysia, for supporting and providing resources to conduct this research. Sugandha Arora is grateful to University Grants Commission, India, for the financial assistance under JRF Programme with Ref No: 1034/(CSIR-UGC NET JUNE 2017).

Data Availability

The AQI dataset of five cities for 2015–2020 is considered, which is publicly available at https://cpcb.nic.in, the official portal of the Central Pollution Control Board, Government of India.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  • 1.Ma J., Ding Y., Cheng J. C., et al. Identification of high impact factors of air quality on a national scale using big data and machine learning techniques. Journal of Cleaner Production . 2020;244118955 [Google Scholar]
  • 2.Kang G. K., Gao J. Z., Chiao S., Lu S., Xie G. Air quality prediction: big data and machine learning approaches. International Journal of Environment and Sustainable Development . 2018;9(1):8–16. [Google Scholar]
  • 3.Bai L., Wang J., Ma X., Lu H. Air pollution forecasts: an overview. International Journal of Environmental Research and Public Health . 2018;15(4):p. 780. doi: 10.3390/ijerph15040780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Feng W., Yuan H. Haze pollution and economic fluctuations: an empirical analysis of Chinese cities. Cleaner Environmental Systems . 2021;2 doi: 10.1016/j.cesys.2021.100010.100010 [DOI] [Google Scholar]
  • 5.Makgato S., Chirwa E. Recent developments in reduction of sulphur emissions from selected waterberg coal samples used in south african power plants. Journal of Cleaner Production . 2020;276 doi: 10.1016/j.jclepro.2020.123192.123192 [DOI] [Google Scholar]
  • 6.Yu M., Liu Q. Deep learning-based downscaling of tropospheric nitrogen dioxide using ground-level and satellite observations. Science of the Total Environment . 2021;773 doi: 10.1016/j.scitotenv.2021.145145.145145 [DOI] [PubMed] [Google Scholar]
  • 7.Chen K., Breitner S., Wolf K., et al. Ambient carbon monoxide and daily mortality: a global time-series study in 337 cities. The Lancet Planetary Health . 2021;5(4):e191–e199. doi: 10.1016/s2542-5196(21)00026-7. [DOI] [PubMed] [Google Scholar]
  • 8.Lu H., Xie M., Liu X., et al. Adjusting prediction of ozone concentration based on CMAQ model and machine learning methods in Sichuan-Chongqing region, China. Atmospheric Pollution Research . 2021;12(6) doi: 10.1016/j.apr.2021.101066.101066 [DOI] [Google Scholar]
  • 9.Sicard P., Paoletti E., Agathokleous E., et al. Ozone weekend effect in cities: deep insights for urban air pollution control. Environmental Research . 2020;191 doi: 10.1016/j.envres.2020.110193.110193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mukherjee A., Yadav D. S., Agrawal S. B., Agrawal M. Ozone a persistent challenge to food security in India: current status and policy implications. Current Opinion in Environmental Science & Health . 2021;19 doi: 10.1016/j.coesh.2020.10.008.100220 [DOI] [Google Scholar]
  • 11.Fischer T. Wheat yield losses in India due to ozone and aerosol pollution and their alleviation: a critical review. Outlook on Agriculture . 2019;48(3):181–189. doi: 10.1177/0030727019868484. [DOI] [Google Scholar]
  • 12.Power A. L., Tennant R. K., Jones R. T., et al. Monitoring impacts of urbanisation and industrialisation on air quality in the anthropocene using urban pond sediments. Frontiers of Earth Science . 2018;6:p. 131. doi: 10.3389/feart.2018.00131. [DOI] [Google Scholar]
  • 13.Nguyen G. T. H., Shimadera H., Uranishi K., Matsuo T., Kondo A. Numerical assessment of PM2.5 and O3 air quality in Continental Southeast Asia: impacts of potential future climate change2.5 and O3 air quality in continental Southeast Asia: impacts of potential future climate change. Atmospheric Environment . 2019;215 doi: 10.1016/j.atmosenv.2019.116901.116901 [DOI] [Google Scholar]
  • 14.Fiore A. M., Naik V., Leibensperger E. M. Air quality and climate connections. Journal of the Air & Waste Management Association . 2015;65(6):645–685. doi: 10.1080/10962247.2015.1040526. [DOI] [PubMed] [Google Scholar]
  • 15.D’Amato G., Baena-Cagnani C. E., Cecchi L., et al. Climate change, air pollution and extreme events leading to increasing prevalence of allergic respiratory diseases. Multidisciplinary Respiratory Medicine . 2013;8(1):1–9. doi: 10.4081/mrm.2013.481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sivarethinamohan R., Sujatha S., Priya S., Sankaran S., Gafoor A., Rahman Z. Impact of air pollution in health and socio-economic aspects: review on future approach. Materials Today Proceedings . 2021;37:2725–2729. doi: 10.1016/j.matpr.2020.08.540. [DOI] [Google Scholar]
  • 17.García Nieto P., Sánchez Lasheras F., García-Gonzalo E., de Cos Juez F. PM10 concentration forecasting in the metropolitan area of Oviedo (Northern Spain) using models based on SVM, MLP, VARMA and ARIMA: a case study 10 concentration forecasting in the metropolitan area of Oviedo (Northern Spain) using models based on SVM, MLP, VARMA and ARIMA: a case study. Science of the Total Environment . 2018;621:753–761. doi: 10.1016/j.scitotenv.2017.11.291. [DOI] [PubMed] [Google Scholar]
  • 18.Zhang L., Lin J., Qiu R., et al. Trend analysis and forecast of PM2.5 in Fuzhou, China using the ARIMA model2.5 in Fuzhou, China using the ARIMA model. Ecological Indicators . 2018;95:702–710. doi: 10.1016/j.ecolind.2018.08.032. [DOI] [Google Scholar]
  • 19.Kumar A., Goyal P. Forecasting of air quality in Delhi using principal component regression technique. Atmospheric Pollution Research . 2011;2(4):436–444. doi: 10.5094/apr.2011.050. [DOI] [Google Scholar]
  • 20.Lwasa S., Mugagga F., Wahab B., Simon D., Connors J., Griffith C. Urban and peri-urban agriculture and forestry: transcending poverty alleviation to climate change mitigation and adaptation. Urban Climate . 2014;7:92–106. doi: 10.1016/j.uclim.2013.10.007. [DOI] [Google Scholar]
  • 21.Wu L., Gao X., Xiao Y., Liu S., Yang Y. Using Grey Holt-Winters model to predict the air quality index for cities in China. Natural Hazards . 2017;88(2):1003–1012. doi: 10.1007/s11069-017-2901-8. [DOI] [Google Scholar]
  • 22.Wu L., Li N., Yang Y. Prediction of air quality indicators for the Beijing-Tianjin-Hebei region. Journal of Cleaner Production . 2018;196:682–687. doi: 10.1016/j.jclepro.2018.06.068. [DOI] [Google Scholar]
  • 23.Yang W., Wang J., Niu T., Du P. A hybrid forecasting system based on a dual decomposition strategy and multi-objective optimization for electricity price forecasting. Applied Energy . 2019;235:1205–1225. doi: 10.1016/j.apenergy.2018.11.034. [DOI] [Google Scholar]
  • 24.Yang W., Wang J., Lu H., Niu T., Du P. Hybrid wind energy forecasting and analysis system based on divide and conquer scheme: a case study in China. Journal of Cleaner Production . 2019;222:942–959. doi: 10.1016/j.jclepro.2019.03.036. [DOI] [Google Scholar]
  • 25.Wang J., Du P., Hao Y., Ma X., Niu T., Yang W. An innovative hybrid model based on outlier detection and correction algorithm and heuristic intelligent optimization algorithm for daily air quality index forecasting. Journal of Environmental Management . 2020;255 doi: 10.1016/j.jenvman.2019.109855.109855 [DOI] [PubMed] [Google Scholar]
  • 26.Niu W. j., Feng Z. k. Evaluating the performances of several artificial intelligence methods in forecasting daily streamflow time series for sustainable water resources management. Sustainable Cities and Society . 2021;64 doi: 10.1016/j.scs.2020.102562.102562 [DOI] [Google Scholar]
  • 27.Singh N., Sugandha S., Mathur T., Agarwal S., Tiwari K. Journal of Physics: Conference Series . Vol. 1969. IOP Publishing; 2021. Stock price prediction using fractional gradient-based long short term memory.12038 [Google Scholar]
  • 28.Janarthanan R., Partheeban P., Somasundaram K., Navin Elamparithi P. A deep learning approach for prediction of air quality index in a metropolitan city. Sustainable Cities and Society . 2021;67 doi: 10.1016/j.scs.2021.102720.102720 [DOI] [Google Scholar]
  • 29.Pritam K. S., Sugandha S., Mathur T., Agarwal S. Underlying dynamics of crime transmission with memory. Chaos, Solitons & Fractals . 2021;146110838 [Google Scholar]
  • 30.Bansal K., Arora S., Pritam K. S., Mathur T., Agarwal S. Dynamics of Crime Transmission Using Fractional Order Differential Equations. Fractals . 2022;30(1):2250012–1485. [Google Scholar]
  • 31.Podlubny I. Fractional-order systems and fractional-order controllers. Institute of Experimental Physics, Slovak Academy of Sciences, Kosice . 1994;12(3):1–18. [Google Scholar]
  • 32.Caputo M. Linear models of dissipation whose Q is almost frequency independent—II. Geophysical Journal International . 1967;13(5):529–539. doi: 10.1111/j.1365-246x.1967.tb02303.x. [DOI] [Google Scholar]
  • 33.Rossikhin Y. A., Shitikova M. Fractional calculus models in dynamic problems of Viscoelasticity. Handbook of Fractional Calculus with Applications . 2019;7:139–158. [Google Scholar]
  • 34.Valentim C. A., Jr, Oliveira N. A., Rabi J. A., David S. A. Can fractional calculus help improve tumor growth models? Journal of Computational and Applied Mathematics . 2020;379 doi: 10.1016/j.cam.2020.112964.112964 [DOI] [Google Scholar]
  • 35.Arora S., Mathur T., Agarwal S., Tiwari K., Gupta P. Applications of fractional calculus in computer vision: a survey. Neurocomputing . 2022;489:407–428. doi: 10.1016/j.neucom.2021.10.122. [DOI] [Google Scholar]
  • 36.Wang H. Research on application of fractional calculus in signal real-time analysis and processing in stock financial market. Chaos, Solitons & Fractals . 2019;128:92–97. doi: 10.1016/j.chaos.2019.07.021. [DOI] [Google Scholar]
  • 37.Tarasov V. E. On history of mathematical economics: application of fractional calculus. Mathematics . 2019;7(6):p. 509. doi: 10.3390/math7060509. [DOI] [Google Scholar]
  • 38.Sun H., Zhang Y., Baleanu D., Chen W., Chen Y. A new collection of real world applications of fractional calculus in science and engineering. Communications in Nonlinear Science and Numerical Simulation . 2018;64:213–231. doi: 10.1016/j.cnsns.2018.04.019. [DOI] [Google Scholar]
  • 39.Wu L., Liu S., Yao L., Yan S., Liu D. Grey system model with the fractional order accumulation. Communications in Nonlinear Science and Numerical Simulation . 2013;18(7):1775–1785. doi: 10.1016/j.cnsns.2012.11.017. [DOI] [Google Scholar]
  • 40.Metia S., Oduro S. D., Duc H. N., Ha Q. Inverse air-pollutant emission and prediction using extended fractional Kalman filtering. Ieee Journal of Selected Topics in Applied Earth Observations and Remote Sensing . 2016;9(5):2051–2063. doi: 10.1109/jstars.2016.2541958. [DOI] [Google Scholar]
  • 41.Sun Y., Wu X., Cao J., Wei Z., Sun G. Fractional extended Kalman filtering for non-linear fractional system with lévy noises. IET Control Theory & Applications . 2017;11(3):349–358. doi: 10.1049/iet-cta.2016.1041. [DOI] [Google Scholar]
  • 42.Oktaviana Y. V., Apriliani E., Arif D. K. Journal of Physics: Conference Series . Vol. 1008. IOP Publishing; 2018. Fractional Kalman filter to estimate the concentration of air pollution.12008 [Google Scholar]
  • 43.Pu Y. F., Yi Z., Zhou J. L. Fractional hopfield neural networks: fractional dynamic associative recurrent neural networks. IEEE Transactions on Neural Networks and Learning Systems . 2017;28(10):2319–2333. doi: 10.1109/tnnls.2016.2582512. [DOI] [PubMed] [Google Scholar]
  • 44.Wang J., Wen Y., Gou Y., Ye Z., Chen H. Fractional-order gradient descent learning of BP neural networks with Caputo derivative. Neural Networks . 2017;89:19–30. doi: 10.1016/j.neunet.2017.02.007. [DOI] [PubMed] [Google Scholar]
  • 45.Wang H., Yu Y., Wen G., Zhang S., Yu J. Global stability analysis of fractional-order hopfield neural networks with time delay. Neurocomputing . 2015;154:15–23. doi: 10.1016/j.neucom.2014.12.031. [DOI] [PubMed] [Google Scholar]
  • 46.Khan S., Ahmad J., Naseem I., Moinuddin M. A novel fractional gradient-based learning algorithm for recurrent neural networks. Circuits, Systems, and Signal Processing . 2018;37(2):593–612. doi: 10.1007/s00034-017-0572-z. [DOI] [Google Scholar]
  • 47.Chen X. Nanjing, China: Nanjing Forestry University; 2013. Application of Fractional Calculus in Back Propagation Neural Networks. Ph.D Thesis. [Google Scholar]
  • 48.Khan S., Naseem I., Malik M. A., Togneri R., Bennamoun M. A fractional gradient descent-based RBF neural network. Circuits, Systems, and Signal Processing . 2018;37(12):5311–5332. doi: 10.1007/s00034-018-0835-3. [DOI] [Google Scholar]
  • 49.Sheng D., Wei Y., Chen Y., Wang Y. Convolutional neural networks with fractional order gradient method. Neurocomputing . 2020;408:42–50. doi: 10.1016/j.neucom.2019.10.017. [DOI] [Google Scholar]
  • 50.Jumarie G. Modified Riemann-Liouville derivative and fractional Taylor series of nondifferentiable functions further results. Computers & Mathematics with Applications . 2006;51(9-10):1367–1376. doi: 10.1016/j.camwa.2006.02.001. [DOI] [Google Scholar]
  • 51.Jumarie G. Table of some basic fractional calculus formulae derived from a Modified Riemann-Liouville derivative for non-differentiable functions. Applied Mathematics Letters . 2009;22(3):378–385. doi: 10.1016/j.aml.2008.06.003. [DOI] [Google Scholar]
  • 52.Jumarie G. An approach via fractional analysis to non-linearity induced by coarse-graining in space. Nonlinear Analysis: Real World Applications . 2010;11(1):535–546. doi: 10.1016/j.nonrwa.2009.01.003. [DOI] [Google Scholar]
  • 53.Jumarie G. On the derivative chain-rules in fractional calculus via fractional difference and their application to systems modelling. Open Physics . 2013;11(6):617–633. doi: 10.2478/s11534-013-0256-7. [DOI] [Google Scholar]
  • 54.Cruz Y. J., Rivas M., Quiza R., Haber R. E., Castaño F., Villalonga A. A two-step machine learning approach for dynamic model selection: a case study on a micro milling process. Computers in Industry . 2022;143 doi: 10.1016/j.compind.2022.103764.103764 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The AQI dataset of five cities for 2015–2020 is considered, which is publicly available at https://cpcb.nic.in, the official portal of the Central Pollution Control Board, Government of India.


Articles from Computational Intelligence and Neuroscience are provided here courtesy of Wiley

RESOURCES