Spatiotemporal prediction of O3 concentration based on the KNN-Prophet-LSTM model

Biao Zhang; Chao Song; Ying Li; Xuchu Jiang

doi:10.1016/j.heliyon.2022.e11670

. 2022 Nov 21;8(11):e11670. doi: 10.1016/j.heliyon.2022.e11670

Spatiotemporal prediction of O₃ concentration based on the KNN-Prophet-LSTM model

Biao Zhang ^a, Chao Song ^b, Ying Li ^b, Xuchu Jiang ^b,^⁎

PMCID: PMC9712550 PMID: 36468093

Abstract

In this paper, a prediction method based on the KNN-Prophet-LSTM hybrid model is established by using the daily pollutant concentration data of Wuhan from January 1, 2014, to May 3, 2021, and considering the characteristics of time and space. First, the data are divided into trend items, periodic items and error items by the Prophet decomposition method. Considering the advantages of the Prophet and the Long Short-Term Memory (LSTM) models, the trend items and periodic items are predicted by the Prophet model. The LSTM model is used to predict the error terms, and the K-Nearest Neighbor algorithm (KNN) is added to fuse the spatial and temporal information to predict the ozone (O₃) concentration value day by day. To highlight the effectiveness and rationality of the KNN-Prophet-LSTM hybrid model, four groups of comparative experiments are set up to compare it with the single model Autoregressive Integrated Moving Average (ARIMA), Prophet, LSTM and the hybrid model Prophet-LSTM. The experimental results show that, (1) the daily maximum 8-hour average concentration of O₃ in Wuhan has a significant periodic variation. The difference in the surrounding environment will lead to the difference in O₃ concentration change in the region, and the O₃ concentration change of similar stations will have a high similarity. (2) The Prophet decomposition algorithm decomposes the original time series, which can effectively extract the time series information and remove noise. Thus, the prediction accuracy is obviously improved. (3) Considering the spatial information of the surrounding sites by KNN algorithm, the accuracy of the model can be further improved. Compared with the baseline model ARIMA, the accuracy is improved by approximately 49.76% on mean absolute error (MAE) and 46.81% on root mean square error (RMSE) respectively. (4) The prediction effect of the mixed model is generally better than that of the single model and possesses a higher prediction accuracy.

Keywords: O₃ prediction, KNN-Prophet-LSTM model, Spatiotemporal prediction

O₃ prediction; KNN-Prophet-LSTM model; Spatiotemporal prediction

1. Introduction

In recent years, with the improvement of the economic level and the rapid development of cities, the concentration of ozone in the atmosphere has increased. As an oxidant, ozone is also an important factor in photochemical smog pollution. The increase in ozone concentration is harmful to human health, such as eye irritation, poor breathing, severe respiratory diseases and even suffocation. Plants in a high-concentration ozone environment will not perform photosynthesis normally, resulting in plant withering. Therefore, it is of great significance to accurately predict the ozone concentration in advance.

The existing ozone concentration prediction methods can be roughly divided into three directions. One is to optimize traditional machine learning algorithms and use various optimization methods to learn complex patterns and hidden features [1]. There are two main ideas: the Bayesian method and the kernel method. The former mainly learns the implicit patterns in the time series through model integration [2], and the latter is inclined to capture the relationship between hidden variables to improve the prediction accuracy [3], [4]. The second direction is based on the idea of signal decomposition. The development of frequency domain analysis methods is relatively mature. Applying signal decomposition methods, complex time series can be decomposed into several signals of different frequencies to simplify the prediction task, but the workload of such methods is relatively large, which need to be predicted separately and finally aggregated. [5], [6] The third direction is to use deep learning models to accurately analyze and predict ozone concentrations, taking into account spatial and temporal factors. The existing research is mainly divided into the improvement of the existing model and the data-driven prediction framework. Ma W et al. [7] proposed an optimized neural network model based on transfer learning, which first learned the intrinsic patterns of other pollutants (NO₂, PM_2.5, etc., and made accurate predictions of ozone concentration. Convolutional neural networks have great advantages in learning the spatial correlation of changes. Some scholars use convolutional neural networks to capture spatial change information [8], [9], [10] and then use sequence-to-sequence [11], [12] or encoding-decoding methods [13], [14] to simulate combined time series. The above models are all carried out on the premise of a large amount of data; otherwise, they may not converge. Therefore, in the case of small samples, the prediction results are not good. Some scholars have proposed data-driven prediction frameworks to perform feature engineering and model prediction by observing the characteristics of the data [15], [16].

At present, the pollutant concentration prediction methods for air monitoring stations mainly have the following problems: (1) ignoring the temporal correlation of station data: there is no long-term fine-grained prediction, and most of them are predictions for the next hour. (2) A deep learning model provides high accuracy, but its interpretability is poor. (3) Graph convolutional networks cannot directly extract long-term macro information in time series, which limits their application in the field of time series forecasting. In this paper, the KNN-Prophet-LSTM hybrid model is proposed to solve the above problems. The KNN algorithm is used to select the spatial correlation information of the target site, and the idea of time series decomposition is used to combine the traditional time series model with the deep learning model, which improves the prediction accuracy and increases the interpretability of the model from the two dimensions of time and space.

2. Models

2.1. The Prophet model

The Prophet is a time series prediction model which is opened by Facebook in 2017 [17]. The Prophet is popular for its flexibility and usability, which can automatically fill in missing values and has a very good prediction effect. The Prophet uses time series decomposition to predict and model the time series. The Prophet's model constitutes are shown in Eq. (1):

y (t) = g (t) + s (t) + h (t) + ε

(1)

where $g (t)$ is a trend item, $s (t)$ is a periodic term, $h (t)$ is a holiday item, and ε is a stochastic volatility term.

(1) Trend term: the trend term of the Prophet model is based on the improved logistic growth function to fit the aperiodic changes in the time series (Eq. (2)).

g (t) = \frac{C (t)}{1 + e x p (- (k + a {(t)}^{T} δ)) (t - (m + a {(t)}^{T}) γ)} a (t) = {\begin{matrix} 1, & t > s_{j} \\ 0, & otherwise \end{matrix}

(2)

where C denotes the capacity of the model, i.e., the saturation value of the growth, is the time t. A function of $k + a {(t)}^{T} δ$ . Represents the growth rate of the model over time, $(m + a {(t)}^{T}) γ$ represents an offset, $s_{j}$ is a point of discontinuity at which the growth rate changes during a time series change, and δ represents the amount of change in the growth rate at the mutation point.

(2) Periodic term: the Prophet model uses a Fourier series to model the periodicity of the time series (Eq. (3)):

s (t) = \sum_{n = 1}^{N} (a_{n} \sin (\frac{2 π n t}{p}) + b_{n} \cos (\frac{2 π n t}{p}))

(3)

where p represents a fixed period, and N represents the number of periods to be used in the model. $a_{n}$ , $b_{n}$ are the parameters to be estimated.

(3) Holiday item: the Prophet model regards the influence of different holidays in a year on the trend change of time series as an independent model and sets a separate dummy variable for each model (Eq. (4)):

h (t) = Z (t) k = \sum_{i = 1}^{L} k_{i} \times 1_{{t \in D_{i}}}

(4)

where $k_{i}$ represents the effect of a holiday on the forecast value and $D_{i}$ represents a dummy variable.

The main advantages of the Prophet model are that it can flexibly adjust the periodicity and make different assumptions about the trend of time series; there is no need to fill in the missing values, and the model will automatically handle the missing values; the adoption can obtain the result needing to be predicted in a shorter time; and the parameters of the prediction model can be adjusted for different scenarios to improve the model.

2.2. The LSTM model

Long short-term memory is an improved model based on the traditional recurrent neural network, which has good performance in time series prediction, and the LSTM has a more sophisticated information transmission mechanism. It can solve the problems of long-term memory deficiency, gradient disappearance or gradient expansion faced by Recurrent Neural Network (RNN) in the practical application process. Thus, the LSTM can deal with the long-term dependence problem in time series. The LSTM model structure is shown in Fig. 1.

The LSTM realizes the control and protection of the cell state by three structures of the input gate, output gate and forgetting gate. The input gate controls the inflow of information, the output gate controls the outflow of information, and the forgetting gate controls the memory unit to record the intensity of the historical cell state. The main functions of each state gate are as follows:

(1) Input gate: determines what information goes into the cell state and updates the cell state information. The sigmoid function determines the value to be updated, then the tanh function creates a new value vector, and finally, the latest cell state is updated (Eq. (5)).

i_{t} = σ (W_{t} [h_{t - 1}, x_{t}] + b_{t}) C_{t} = f_{t} C_{t - 1} + i_{t} \times \tanh (W_{c} [h_{t - 1}, x_{t}]) + b_{c}

(5)

where $W_{t}$ , $W_{c}$ are weight vectors, and $b_{t}$ , $b_{c}$ are the deviation vectors.

(2) Forget gate: by selectively processing the historical information, it is possible to determine which information in the cell state needs to be lost and which information needs to be retained. Enter as $h_{t - 1}$ and $x_{t}$ , the forgetting gate is calculated by the sigmoid function (Eq. (6)).

f_{t} = σ (W_{f} [h_{t - 1}, x_{t}] + b_{f}

(6)

where $W_{f}$ is a weight vector and $b_{f}$ is a deviation vector.

(3) Output gate: determine the information to be output. First, the sigmoid function is used to convert the output value into 0 and 1, where 1 represents output and 0 represents no output. Multiply the cell state with the obtained value to output the final information (Eq. (7)).

o_{t} = σ (W_{o} [h_{t - 1}, x_{t}] + b_{o}) h_{t} = o_{t} + \tanh ([C_{c}])

(7)

where $W_{o}$ is a weight vector and $b_{o}$ is a deviation vector.

2.3. The KNN algorithm

The KNN algorithm is a supervised learning classification algorithm that is simple to implement and fast to train. It measures the similarity of two points in space by their distance. The smaller the distance is, the higher the similarity is. The test points are classified by the main category to which the nearest K points belong. The common distance measures are Euclidean distance, Mahalanobis distance, Manhattan distance and so on. The Euclidean distance is calculated in Eq. (8).

d_{i j} = {‖ X_{i} - X_{j} ‖}_{2} = \sqrt{\sum_{m = 1}^{k} {(x_{i} (m) - x_{j} (m))}^{2}}

(8)

where K denotes the dimension of the feature vector, $x_{i} (m)$ and $x_{j} (m)$ denote the value of the mth dimension, and m ranges from $1, 2, 3, \dots, k$ .

2.4. The KNN-Prophet-LSTM prediction model

In the Prophet model, the time series fitting effect of historical data with a strong seasonal effect and several seasons is good, and it has strong robustness to missing data and trend changes. It can usually deal with outliers well, but the expression ability of the model is simple, so it is often unable to learn complex patterns when training the model. When the time series is not decomposed completely, it will lead to chaos in the remainder, and the Prophet model cannot fit the remainder well, which will reduce the prediction accuracy. As a deep learning model, the LSTM model can learn the potential relationship in a time series to fully extract effective information. If only the above model is used, the problem of not fully utilizing the spatial correlation characteristics is solved.

Therefore, the KNN algorithm is used to screen the adjacent spatial factors as an additional input to construct the KNN-Prophet-LSTM model to achieve a more accurate prediction of the O₃ concentration. The specific prediction process is shown in Fig. 2.

(1) Data preprocessing: before using the original data, we need to deal with the real samples and outliers. After statistics, there are no missing data; because the collection of data has gone through a series of strict audit processes, it is considered that the data are true and effective, and the data are not denoised. Because each pollution factor of a detection station has different dimensions and magnitudes, to reduce the error and speed up the model training, max-min normalization is applied to the data.

x^{j} = \frac{x^{i} - x_{m i n}}{x_{m a x} - x_{m i n}}

(9)

where $x_{i}$ is the original data, $x_{j}$ is the normalized data, and $x_{m a x}, x_{m i n}$ are the maximum and minimum values in the data respectively.

(2) The KNN algorithm is used to extract the spatial correlation characteristics of the target station. Let $K = 1$ . In this paper, the Euclidean distance is used to measure the degree of correlation between the target station and the nearby stations. The smaller the distance is, the stronger the correlation. The KNN algorithm selection process is as follows.

1) According to the time step in the Prophet-LSTM model for a single site $s_{t}$ . Build $O_{3}$ state matrix, there are m stations in the region. The state quantity of station I at time t in time step $X_{t}^{i}$ and state matrix $S_{t}$ in time t is shown in (Eq. (10)-Eq. (11)).

X_{t}^{i} = {(\begin{matrix} x_{t - s t}^{i} & x_{t - s t + 1}^{i} & \dots & x_{t - 2}^{i} & x_{t - 1}^{i} \end{matrix})}^{T}

(10)

S_{t} = (\begin{matrix} x_{t - s t}^{1} & x_{t - s t}^{2} & x_{t - s t}^{3} & \dots & x_{t - s t}^{m - 1} & x_{t - s t}^{m} \\ x_{t - s t + 1}^{1} & x_{t - s t + 1}^{2} & x_{t - s t + 1}^{3} & \dots & x_{t - s t + 1}^{m - 1} & x_{t - s t + 1}^{m} \\ x_{t - s t + 2}^{1} & x_{t - s t + 2}^{2} & x_{t - s t + 2}^{3} & \dots & x_{t - s t + 2}^{m - 1} & x_{t - s t + 2}^{m} \\ \dots & \dots & \dots & \dots & \dots & \dots \\ x_{t - 1}^{1} & x_{t - 1}^{2} & x_{t - 1}^{3} & \dots & x_{t - 1}^{m - 1} & x_{t - 1}^{m} \\ x_{t}^{1} & x_{t}^{2} & x_{t}^{3} & \dots & x_{t}^{m - 1} & x_{t}^{m} \end{matrix})

(11)

where $x_{n}^{u}$ is the $O_{3}$ concentration value at site u at time n.

2) By calculating the Euclidean distances between the state quantity of the time step at moment t of the target station and other adjacent stations (m-1 in total), sorting, and selecting the data of the front K corresponding to the monitoring points as the space-related factors of the target station at moment t, $X_{s p}$ :

X_{sp} = (\begin{matrix} x_{s p}^{1} & x_{s p}^{2} & \dots & x_{s p}^{k - 1} & x_{s p}^{k} \end{matrix})

(12)

where $x_{s p}^{i}$ is the site concentration value associated with the ith site of the target site at time t.

(3) According to the historical pollution data of the target site, the Prophet-LSTM network is used to extract the temporal features. Enter as Target Site $s_{t}$ . Time step of the six pollutant concentrations of the verse, t time input data $i n p u t_{t}$ . As shown in Eq. (13).

i n p u t_{t} = (\begin{matrix} i_{t - s t}^{co} & i_{t - s t}^{{no}_{2}} & i_{t - s t}^{{so}_{2}} & i_{t - s t}^{o_{3}} & i_{t - s t}^{{pm}_{10}} & i_{t - s t}^{{pm}_{2.5}} \\ i_{t - s t + 1}^{co} & i_{t - s t + 1}^{{no}_{2}} & i_{t - s t + 1}^{{so}_{2}} & i_{t - s t + 1}^{o_{3}} & i_{t - s t}^{{pm}_{10}} & i_{t - s t + 1}^{{pm}_{2.5}} \\ i_{t - s t + 2}^{co} & i_{t - s t + 2}^{{no}_{2}} & i_{t - s t + 2}^{s_{2}} & i_{t - s t + 2}^{o_{3}} & i_{t - s t}^{{pm}_{10}} & i_{t - s t + 2}^{{pm}_{2.5}} \\ \dots & \dots & \dots & \dots & \dots & \dots \\ i_{t - 2}^{co} & i_{t - 2}^{{no}_{2}} & i_{t - 2}^{{so}_{2}} & i_{t - 2}^{o_{3}} & i_{t - s t}^{{pm}_{10}} & i_{t - 2}^{{pm}_{2.5}} \\ i_{t - 1}^{c c} & i_{t - 1}^{{no}_{2}} & i_{t - 1}^{{so}_{2}} & i_{t - 1}^{o_{3}} & i_{t - s t}^{{pm}_{10}} & i_{t - 1}^{{pm}_{2.5}} \end{matrix})

(13)

where $i_{j}^{co}, i_{j}^{n o_{2}}, i_{j}^{s o_{2}}, i_{j}^{p m_{10}}, i_{j}^{p m_{2.5}}$ is the concentration value of CO, NO₂, SO₂, O₃, PM₁₀ and PM_2.5 at time J.

(4) The spatial correlation features extracted by KNN $x_{s p}^{i}$ . As a second input, it is spliced with the output of the Prophet-LSTM.

(5) Let $K = K + 1$ , repeat steps (3-4) until $K > M$ sites, where M is the number of neighboring sites.

(6) Determine the optimal K according to the evaluation index (selecting the RMSE).

3. Results and discussion

3.1. Monitoring station and data display

The data of this study come from the China Environmental Monitoring Station (http://www.cnemc.cn/), and the daily data of pollutant concentrations in Wuhan city from January 1, 2014, to May 3, 2021, are selected. The total amount of data is 2678, and there are no missing values. The historical data from January 1, 2014, to December 30, 2020, are used as the training set, and the data from January 1, 2021, to May 3, 2021, are used as the test set. A training set is adopted to fit the model parameters. The test set evaluates the predictive ability of the model. Site names and distribution are shown in Table 1 and Fig. 3.

Table 1.

Station information.

Station ID	Classification	Latitude	Longitude
Qingshan Ganghua Station	industrial	114.36	30.62
Wuchang Ziyang Station	urban	114.30	30.52
Hanyang Moon Lake Station	suburban	114.25	30.55
182 National Avenue Station	urban	114.40	30.46
Zhuankou New District Station	urban	114.16	30.48
HanKou Flower Bridge Station	urban	114.37	30.55
East Lake Pear Garden Station	suburban	114.30	30.59
Hankou River Beach Station	industrial	114.28	30.61
Wujiashan Station	suburban	114.12	30.63

Open in a new tab

According to the correlation of the O₃ time series of each station, the correlation coefficient between Wujiashan station and Zhuankou station and other stations is not more than 0.6, while the correlation coefficient between any two other stations is more than 0.7. Therefore, the above two stations are ignored in the selection of spatial stations. The data distributions are shown in Fig. 4.

3.2. Selection of evaluation criteria

Mean absolute error (MAE), root mean square error (RMSE) and mean square error (MSE) are selected to measure the prediction accuracy of different models. MSE squares the error (let e = true value - predicted value), so if e > 1, MSE will further increase the error. If there are outliers in the data, the value of e will be large, and e will be much larger than |e|. Therefore, the model using MSE assigns more weight to outliers than using MAE to calculate the loss. However, this reduces the overall performance of the model. MAE loss works better if the training data are polluted by outliers (e.g., there are many false negative and positive labels in the training data but not in the test set). $y_{i}$ is a true value, $\hat{y_{i}}$ is an estimate, $i = 1, 2, \dots, n$ . Among n is the sample size, and the expression of the above indicator is shown in Eq. (14)-Eq. (16)).

M A E = \frac{1}{n} \sum_{i = 1}^{n} | \hat{y_{i}} - y_{i} |

(14)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y})}^{2}}

(15)

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y})}^{2}

(16)

According to the expression of the above evaluation index, the smaller the values of the three are, the smaller the prediction error of the model is.

3.3. Experimental environment and model parameter settings

The experimental environment and computer configuration are as follows: the programming language is Python 3.8, the development environment is Visual Studio Code editor, and the operating system is Windows $10 \times 64$ .

The time step of the LSTM layer is 12, and the K value is selected as 3. Specific network parameter settings are shown in Table 2.

Table 2.

Network Parameter Settings.

Layer	Output shape	Param
InputLayer	(12,6)	0
Lstm_output	64	18176
Aux_input	2	0
Concatenate	66	0
Dense_1	6	402
Output	1	7

Open in a new tab

3.4. Experimental results and analysis

3.4.1. Experimental results

According to Prophet's decomposition principle, the periodicity is decomposed out. As shown in Fig. 5, from the annual trend, the concentration of O₃ in Wuhan in winter and spring is higher than that in summer and autumn, with the lowest concentration in July and the highest concentration in February. The rationality and accuracy of the model are further verified by the analysis of the decomposition results.

Schematic diagram of the trend item and periodicity.

This experiment has been predicted for nearly three months, and the fitting effect of each model is shown in Fig. 6. The traditional ARIMA model and the linear The Prophet model are not sensitive to the extreme values of the data, and their fitting curves are relatively smooth. However, the prediction of LSTM is unstable, and its volatility is too large, which leads to poor model fitting. Compared with the single model, the hybrid model Prophet-LSTM. The variance of the prediction results is reduced, and the accuracy is also improved because after the time series data are decomposed, the LSTM only predicts the error term, and the seasonal term and the trend term are linearly fitted by The Prophet, thus significantly improving the prediction effect of the model. The target model KNN-Prophet-LSTM selects the spatial relevant information of the target site through the KNN algorithm and utilizes the characteristic that LSTM has time memory. The prediction accuracy of the model is further improved by considering the two dimensions of time and space.

3.4.2. Model comparison and analysis

In the experiment, we compare the performance of five different models and choose the ARIMA as the baseline model. Among the selected methods, ARIMA is representative of statistical learning methods, and Prophet combines statistical learning methods and deep learning methods. The neural network represented by LSTM is the most used deep learning method for solving time series forecasting problems. Therefore, selecting these methods for comparison can fully verify the effectiveness of the prediction effect of KNN-Prophet-LSTM. In the experiment, the hyperparameters of all the comparison methods are adjusted to the optimal values according to the suggestions of the original paper. The comparison results are shown in Table 3. Under the two evaluation indexes, the target model KNN-Prophet-LSTM shows the best performance, in which the MAE and RMSE are 10. 9009 and 14. 7334, respectively, which are 11. 183 and 12. 8043 lower than the second-best model (the Prophet-LSTM), respectively. Compared with the baseline model ARIMA, the accuracy is improved by approximately 49.76% (MAE) and 46.81% (RMSE), respectively. The reason for the analysis is mainly because the data set selected in this paper comes from air quality monitoring data in the environmental field, and the nature of pollutants often shows strong daily, monthly and annual cycles, which is more in line with the assumption of KNN-Prophet-LSTM. Using the model in this scenario can better capture the periodicity of the time series to obtain a more ideal forecasting effect. It is worth noting that although the LSTM has good accuracy in dealing with various types of time series, its effect is even worse than that of the traditional ARIMA model in predicting air pollutant concentration data. In addition, through further comparison, it is found that the hybrid model has better performance than the single model, and the improvement of prediction accuracy is more obvious.

Table 3.

Model comparison.

Model	MAE	RMSE
ARIMA (2,0,2)	21.7003	27.6976
LSTM	25.8970	32.4554
Prophet	22.9098	28.4266
Prophet-LSTM	21.7179	27.5377
KNN-Prophet-LSTM	10.9009	14.7334

Open in a new tab

4. Conclusion

In this paper, a prediction method based on the KNN-Prophet-LSTM combination model is established by using the pollutant concentration data of Wuhan city and considering the characteristics of time and space. The method comprises the following steps: dividing data into a trend item, a period item and an error item through the Prophet decomposition method, comprehensively considering the advantages of the Prophet model and the LSTM model, predicting the trend item and the period item by using the Prophet model, and predicting the error item by using the LSTM model. The KNN algorithm is added to fuse the spatiotemporal information and predict the O₃ concentration value day by day. To highlight the effectiveness and rationality of the KNN-Prophet-LSTM hybrid model, four groups of comparative experiments are set up to compare it with the single model ARIMA, Prophet, LSTM and the hybrid model Prophet-LSTM. The results show that (1) the daily maximum 8-hour average concentration of O₃ in Wuhan has a significant periodic change. The difference in the surrounding environment will lead to the difference in O₃ concentration change in the region, and the O₃ concentration change of similar stations will have a high similarity. (2) The Prophet decomposition algorithm decomposes the original time series, which can effectively extract the time series information and remove the noise so that the prediction accuracy is significantly improved. (3) Through a KNN algorithm, considering the spatial information of the surrounding sites, the accuracy of the model can be further improved. Compared with the baseline model ARIMA, the accuracy is improved by approximately 49.76% (MAE) and 46.81% (RMSE), respectively. (4) The prediction effect of the mixed model is generally better than that of the single model, and the prediction accuracy is higher.

Although the KNN-Prophet-LSTM proposed in this paper has high accuracy for the concentration prediction of O₃ time series data, it does not extract too much for the error term, resulting in more factors included; thus, the interpretation of the error term is poor. If the error term can be further decomposed, the prediction accuracy may be further improved.

Declarations

Author contributions statement

Biao Zhang: Conceived and designed the experiments. Chao Song: Performed the experiments; Wrote the paper. Ying Li: Analyzed and interpreted the data. Xuchu Jiang: Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

Biao Zhang was supported by Natural Science Foundation of Shandong Province [ZR2021QF036], “Guangyue Young Scholar Innovation Team” of Liaocheng University [LCUGYTD2022-03].

Data availability statement

Data included in article /supp.material/ referenced in article.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

References

1.Yafouz A., AlDahoul N., Birima A.H., et al. Comprehensive comparison of various machine learning algorithms for short-term ozone concentration prediction. Alex. Eng. J. 2022;61(6):4607–4622. [Google Scholar]
2.Ren X., Mi Z., Cai T., et al. Flexible Bayesian ensemble machine learning framework for predicting local ozone concentrations. Environ. Sci. Technol. 2022;56(7):3871–3883. doi: 10.1021/acs.est.1c04076. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Kurniawan R., Setiawan I.N., Caraka R.E., et al. Using Harris hawk optimization towards support vector regression to ozone prediction. Stoch. Environ. Res. Risk Assess. 2022;36(2):429–449. doi: 10.1007/s00477-022-02178-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Li D., Ren X. Prediction of ozone hourly concentrations based on machine learning technology. Sustainability. 2022;14(10):5964. [Google Scholar]
5.Zhou J., Xu Z., Wang S. A novel dual-scale ensemble learning paradigm with error correction for predicting daily ozone concentration based on multi-decomposition process and intelligent algorithm optimization, and its application in heavily polluted regions of China. Atmos. Pollut. Res. 2022;13(2) [Google Scholar]
6.Qu H., Zhang R. 2022 IEEE International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA) IEEE; 2022. Short-term mathematical prediction model of air quality based on CEEMD-ELM-PSO; pp. 227–232. [Google Scholar]
7.Ma W., Yuan Z., Lau A.K.H., et al. Optimized neural network for daily-scale ozone prediction based on transfer learning. Sci. Total Environ. 2022;827 doi: 10.1016/j.scitotenv.2022.154279. [DOI] [PubMed] [Google Scholar]
8.Mao W., Jiao L., Wang W. Long time series ozone prediction in China: a novel dynamic spatiotemporal deep learning approach. Build. Environ. 2022;218 [Google Scholar]
9.Khan M.A., Kim H., Park H. Exploiting neural network for temporal multi-variate air quality and pollutant prediction. J. Korea Multimed. Soc. 2022;25(2):440–449. [Google Scholar]
10.Muthukumar P., Nagrecha K., Cocom E., et al. AGU Fall Meeting Abstracts. 2021. Predicting PM2. 5 air pollution using deep learning with multisource satellite and ground-based observations and meteorological and wildfire big data. GC45B-0842. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Chen Y., Chen X., Xu A., et al. A hybrid CNN-transformer model for ozone concentration prediction. Air Qual. Atmos. Health. 2022:1–14. [Google Scholar]
12.Ma D., Ma S., Chen Q., et al. 2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD) IEEE; 2022. Temperature prediction algorithm based on spatio-temporal prediction; pp. 151–157. [Google Scholar]
13.Tsokov S., Lazarova M., Aleksieva-Petrova A. A hybrid spatiotemporal deep model based on CNN and LSTM for air pollution prediction. Sustainability. 2022;14(9):5104. [Google Scholar]
14.Suresh C., Kiranmayee B.V., Sneha B. Proceedings of Second International Conference on Advances in Computer Engineering and Communication Systems. Springer; Singapore: 2022. Analysis and prediction of air pollutant using machine learning; pp. 335–345. [Google Scholar]
15.Marvin D., Nespoli L., Strepparava D., et al. A data-driven approach to forecasting ground-level ozone concentration. Int. J. Forecast. 2022;38(3):970–987. [Google Scholar]
16.Gu K., Liu H., Liu J., et al. Air pollution prediction in mass rallies with a new temporally-weighted sample-based multitask learner. IEEE Trans. Instrum. Meas. 2022;71:1–15. [Google Scholar]
17.Taylor S.J., Letham B. Forecasting at scale. 2017. https://doi.org/10.7287/peerj.preprints.3190v2 PeerJ Preprints 5:e3190v2.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data included in article /supp.material/ referenced in article.

[br0010] 1.Yafouz A., AlDahoul N., Birima A.H., et al. Comprehensive comparison of various machine learning algorithms for short-term ozone concentration prediction. Alex. Eng. J. 2022;61(6):4607–4622. [Google Scholar]

[br0020] 2.Ren X., Mi Z., Cai T., et al. Flexible Bayesian ensemble machine learning framework for predicting local ozone concentrations. Environ. Sci. Technol. 2022;56(7):3871–3883. doi: 10.1021/acs.est.1c04076. [DOI] [PMC free article] [PubMed] [Google Scholar]

[br0030] 3.Kurniawan R., Setiawan I.N., Caraka R.E., et al. Using Harris hawk optimization towards support vector regression to ozone prediction. Stoch. Environ. Res. Risk Assess. 2022;36(2):429–449. doi: 10.1007/s00477-022-02178-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[br0040] 4.Li D., Ren X. Prediction of ozone hourly concentrations based on machine learning technology. Sustainability. 2022;14(10):5964. [Google Scholar]

[br0050] 5.Zhou J., Xu Z., Wang S. A novel dual-scale ensemble learning paradigm with error correction for predicting daily ozone concentration based on multi-decomposition process and intelligent algorithm optimization, and its application in heavily polluted regions of China. Atmos. Pollut. Res. 2022;13(2) [Google Scholar]

[br0060] 6.Qu H., Zhang R. 2022 IEEE International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA) IEEE; 2022. Short-term mathematical prediction model of air quality based on CEEMD-ELM-PSO; pp. 227–232. [Google Scholar]

[br0070] 7.Ma W., Yuan Z., Lau A.K.H., et al. Optimized neural network for daily-scale ozone prediction based on transfer learning. Sci. Total Environ. 2022;827 doi: 10.1016/j.scitotenv.2022.154279. [DOI] [PubMed] [Google Scholar]

[br0080] 8.Mao W., Jiao L., Wang W. Long time series ozone prediction in China: a novel dynamic spatiotemporal deep learning approach. Build. Environ. 2022;218 [Google Scholar]

[br0090] 9.Khan M.A., Kim H., Park H. Exploiting neural network for temporal multi-variate air quality and pollutant prediction. J. Korea Multimed. Soc. 2022;25(2):440–449. [Google Scholar]

[br0100] 10.Muthukumar P., Nagrecha K., Cocom E., et al. AGU Fall Meeting Abstracts. 2021. Predicting PM2. 5 air pollution using deep learning with multisource satellite and ground-based observations and meteorological and wildfire big data. GC45B-0842. [DOI] [PMC free article] [PubMed] [Google Scholar]

[br0110] 11.Chen Y., Chen X., Xu A., et al. A hybrid CNN-transformer model for ozone concentration prediction. Air Qual. Atmos. Health. 2022:1–14. [Google Scholar]

[br0120] 12.Ma D., Ma S., Chen Q., et al. 2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD) IEEE; 2022. Temperature prediction algorithm based on spatio-temporal prediction; pp. 151–157. [Google Scholar]

[br0130] 13.Tsokov S., Lazarova M., Aleksieva-Petrova A. A hybrid spatiotemporal deep model based on CNN and LSTM for air pollution prediction. Sustainability. 2022;14(9):5104. [Google Scholar]

[br0140] 14.Suresh C., Kiranmayee B.V., Sneha B. Proceedings of Second International Conference on Advances in Computer Engineering and Communication Systems. Springer; Singapore: 2022. Analysis and prediction of air pollutant using machine learning; pp. 335–345. [Google Scholar]

[br0150] 15.Marvin D., Nespoli L., Strepparava D., et al. A data-driven approach to forecasting ground-level ozone concentration. Int. J. Forecast. 2022;38(3):970–987. [Google Scholar]

[br0160] 16.Gu K., Liu H., Liu J., et al. Air pollution prediction in mass rallies with a new temporally-weighted sample-based multitask learner. IEEE Trans. Instrum. Meas. 2022;71:1–15. [Google Scholar]

[br0170] 17.Taylor S.J., Letham B. Forecasting at scale. 2017. https://doi.org/10.7287/peerj.preprints.3190v2 PeerJ Preprints 5:e3190v2.

PERMALINK

Spatiotemporal prediction of O3 concentration based on the KNN-Prophet-LSTM model

Biao Zhang

Chao Song

Ying Li

Xuchu Jiang

Abstract

1. Introduction

2. Models

2.1. The Prophet model

2.2. The LSTM model

Figure 1.

2.3. The KNN algorithm

2.4. The KNN-Prophet-LSTM prediction model

Figure 2.

3. Results and discussion

3.1. Monitoring station and data display

Table 1.

Figure 3.

Figure 4.

3.2. Selection of evaluation criteria

3.3. Experimental environment and model parameter settings

Table 2.

3.4. Experimental results and analysis

3.4.1. Experimental results

Figure 5.

Figure 6.

3.4.2. Model comparison and analysis

Table 3.

4. Conclusion

Declarations

Author contributions statement

Funding statement

Data availability statement

Declaration of interests statement

Additional information

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Spatiotemporal prediction of O₃ concentration based on the KNN-Prophet-LSTM model