Abstract
Improving the temperature prediction accuracy for subgrades in seasonally frozen regions will greatly help improve the understanding of subgrades’ thermal states. Due to the nonlinearity and non-stationarity of the temperature time series of subgrades, it is difficult for a single general neural network to accurately capture these two characteristics. Many hybrid models have been proposed to more accurately forecast the temperature time series. Among these hybrid models, the CEEMDAN-LSTM model is promising, thanks to the advantages of the long short-term memory (LSTM) artificial neural network, which is good at handling complex time series data, and its combination with the broad applicability of the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) in the field of signal decomposition. In this study, by performing empirical mode decomposition (EMD), ensemble empirical mode decomposition (EEMD), and CEEMDAN on temperature time series, respectively, a hybrid dataset is formed with the corresponding time series of volumetric water content and frost heave, and finally, the CEEMDAN-LSTM model is created for prediction purposes. The results of the performance comparisons between multiple models show that the CEEMDAN-LSTM model has the best prediction performance compared to other decomposed LSTM models because the composition of the hybrid dataset improves predictive ability, and thus, it can better handle the nonlinearity and non-stationarity of the temperature time series data.
Keywords: seasonal frozen subgrade, temperature prediction, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), long short-term memory (LSTM)
1. Introduction
Frozen ground occurs when the ground contains water, and the temperature of the ground is at or below 0 °C. Frozen ground can be divided into permafrost, seasonal frozen ground, and intermittently frozen ground. Permafrost usually remains at or below 0 °C for at least two years, while a layer of soil that freezes for more than 15 days per year is defined as seasonally frozen ground, and a layer of soil that freezes between one and 15 days a year is defined as intermittently frozen ground. China has the third largest permafrost area in the world, with seasonal frozen ground and permafrost accounting for 53.5% and 21.5% of China’s land area, respectively [1]. Seasonal frozen ground in China is distributed in the regions to the west of the Helan Mountain–Ailaoshan Mountain line, and the regions to the east of this line and to the north of the Qinling Mountains–Huaihe River line [2].
China’s One Belt One Road initiative has increased infrastructure development in the seasonal frozen region of Qinghai-Tibet, where several highway projects are planned [3,4,5]. The main hazard for subgrades in a seasonal frozen region is freeze-thaw damage, which is directly affected by the soil’s internal temperature. Therefore, the prediction of seasonal frozen soil temperatures has become a key issue and has received continuous attention from the academic community. Essentially, the temperature prediction problem belongs to time series forecasting. A time series is a data sequence arranged according to the chronological order of its occurrence. The main purpose of time series analysis is to forecast the future based on existing historical data. There are many classical time series analysis models, such as the autoregressive (AR) model, autoregressive moving average (ARMA) model, autoregressive conditional heteroskedasticity (ARCH) model, and the generalized autoregressive conditional heteroskedasticity (GARCH) model, and some improved algorithms have been widely used for prediction purposes [6,7,8,9,10]. However, these methods are often limited to linear and stationary time series forecasting. With the rapid development of information technology and the huge improvement of computer performance in the past two decades, many machine learning methods have been applied to the field of time series analysis, such as random forests (RF) [11], support vector machines (SVM) [12], Bayesian [13], extreme learning machines (ELM) [14], and Elman neural networks (ENN) [15]. Lin et al. [16] proposed a random forest-based model for extreme learning machine integration. Santamaría-Bonfil et al. [17] proposed a hybrid method for wind speed time series prediction based on support vector regression (SVR). Chen et al. [18] proposed a time series forecasting method based on a weighed least-squares support vector machine (LSSVM). Ticknor et al. [19] proposed a new Bayesian-regularized artificial neural network (BR-ANN) prediction method. Khellal et al. [20] proposed a method used to learn neural network features based entirely on an extreme learning machine (ELM). These aforementioned methods perform an essential role in time series forecasting with small and uniform data. However, when the data are large, nonstationary, or nonlinear, the prediction results are far from satisfactory [21].
In recent years, the research of artificial neural networks has made breakthrough progress. Deep learning models can handle many of the practical problems that are not easily solved by traditional methods. It has been shown that deep learning models can approximate nonlinear functions with arbitrary precision, as long as enough neurons are available. The recurrent neural network (RNN) model is one of the deep learning models with time series processing capability, which is often used for predictions of various time series. However, when using RNN models to solve long sequence problems, the issues of gradient disappearance and gradient explosion are prone to occur. To address this problem, Hochreiter and Schmidhuber [22] proposed long short-term memory (LSTM) as a variant of RNN. Since LSTM can learn the long-term dependence of data, it is widely used in natural language processing [23], image recognition [24], speech recognition [25], time series prediction [26], and other fields. Ma et al. [27] proposed a short-time traffic prediction model that effectively captures nonlinear traffic dynamics through an improved LSTM model. Hao et al. [28] improved the LSTM model for predicting the trajectory of walkers. Li et al. [29] proposed an air pollutant concentration prediction model that inherently considers spatiotemporal correlation using an improved LSTM model. However, there are few reports on the temperature time series prediction of seasonal frozen subgrades.
Due to the complex nonlinearity and non-stationarity of time series, a single prediction model sometimes gets stuck in local minima, resulting in suboptimal results. Therefore, more and more hybrid models have been proposed to obtain more accurate time series forecasting results. To maximize the use of the information contained in the history of time series, hybrid models are emerging, which generally combine two or more methods (modular unit). The mainstream hybrid models focus on the combination of data decomposition methods and forecasting models, where decomposition methods are an important preprocessing step in building these hybrid models. Empirical mode decomposition (EMD) is an adaptive decomposition algorithm for nonlinear nonstationary time series [30]. However, EMD cannot effectively decompose a nonstationary time series without sufficient extreme value points. In order to solve the mode mixing problem of EMD, Wu and Huang [31] proposed a white-noise-assisted data analysis method called ensemble empirical pattern decomposition (EEMD). Compared to EMD, EEMD eliminates the effect of mode mixing but still retains some noise in the intrinsic mode function (IMF). Therefore, a complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm is proposed by adding positive and negative auxiliary white noise pairs to the original time series data and introducing adaptive noise components, which not only maintains the ability to eliminate mode mixing and residual noise, but also has higher convergence performance with lower iterative cost [32].
Sha et al. [33] demonstrated that the input data pre-processed by CEEMDAN method can effectively improve the prediction performance of deep learning models. Hybrid models that combine these data decomposition methods with prediction models are widely used for time series analysis, such as the EEMD-ENN hybrid model used to predict annual runoff time series proposed by Zhang et al. [34]. Zheng et al. [35] proposed a hybrid SD-EMD-LSTM model for forecasting electrical loads. Lei et al. [36] proposed a hybrid model of EMD-SVR for predicting liquid level. Zhang et al. [37] proposed and validated the EEMD-LSTM hybrid model as a suitable model for temperature forecasting. Jiang et al. [38] proposed a CEEMDAN-FE-BILSTM hybrid model to predict PM2.5 concentration. Lin et al. [39] proposed a hybrid method combining CEEMDAN and ML-GRU (multi-layer gated recurrent unit) to accurately predict crude oil prices. Zhou et al. [40] proposed a hybrid model based on CEEMDAN, DL (deep learning), and ARIMA to predict short-term building energy consumption. Lin et al. [41] proposed a CEEMDAN-MLSTM hybrid model for reducing exchange rate risk in international trade.
This paper aims to more accurately predict the temperature of seasonal frozen stratification through the proposed CEEMDAN-LSTM model with a hybrid dataset. The prediction performance of the proposed model is compared with that of the other models through experimental results. The results show that the CEEMDAN decomposition method can better handle the nonlinearity and non-stationarity of temperature time series data, thus improving the prediction performance of neural network models. The models using hybrid datasets significantly improve the prediction performance compared to the models using single datasets, and the proposed CEEMDAN-LSTM model using hybrid datasets in all models has the best prediction performance. More details about the proposed method will be presented in the following sections.
2. Material and Methods
2.1. Case Study Area and Data Acquisition
The subgrade section to be researched in this paper was taken from the Golmud-Naqu highway, which is a part of China National Highway G109 connecting Beijing and Lhasa. It is located in Xiangmao Township (31°02′ N, 91°68′ E), Seni District, Naqu City, pile No. K3588 + 100, and the subgrade soil is mainly sandy soil. The location of section K3588 + 100 is shown in Figure 1. The area belongs to the plateau sub-cold semi-arid monsoon climate zone, with high terrain in the west and low terrain in the east. The elevation is between 3800–4500 m, with an average of about 4100 m. As a result of its high altitude, it suffers from a lack of heat and a harsh, arid climate. The annual average temperature is −2.2 °C, the annual relative humidity is 49%, the annual precipitation is 380 mm, and the annual sunshine hours are more than 2852 h.
The time series data were acquired through temperature sensors, volumetric water content sensors, and frost heave sensors deployed in the study area, the detailed study section layout is shown in Figure 2. In Figure 2, ①–⑤ are shown as the left foot of slope, left shoulder, roadbed median, right shoulder, and right foot of slope of the study section, respectively. The data duration is from 1 January 2020 to 31 December 2021, and the variation of temperature time-history at different depths of the right shoulder of section K3588 + 100 is shown in Figure 3.
2.2. Methods
2.2.1. EMD and EEMD
Empirical mode decomposition (EMD) was proposed by Huang et al. in 1998 [30], Its main purpose is to decompose complex time series into a high-frequency part (i.e., the intrinsic mode function (IMF)) and a low-frequency part (i.e., the residual (R)). The decomposed IMF contains the features of the original time series at different time scales, and any time series can be decomposed into a finite number of IMF components with the residual.
The screening process for EMD is as follows:
Step 1: Find all local maxima and local minima of the original time series S(t), and then fit the upper envelope U(t) and lower envelope L(t) of the S(t) with the cubic spline interpolation function. The average of its upper and lower envelopes is M(t), as shown in Equation (1).
(1) |
Step 2: Then, a new sequence H(t) is obtained by subtracting M(t) from the original time series S(t), as shown in Equation (2).
(2) |
Step 3: If M(t) and H(t) satisfy termination criteria, then the first IMF as , and the first residual R as are obtained. The termination criteria are: (i) M(t) approaches zero, (ii) the number of extreme points and zero crossing points in the H(t) differs by no more than 1.
Step 4: If the termination criteria are not satisfied, repeat steps 1 to 3 above for until and are obtained.
Step 5: Repeat steps 1 to 4 above for until all IMFs and the residual are obtained. Thus, the original time series S(t) are decomposed as Equation (3):
(3) |
One of the shortcomings of EMD is mode mixing. When mode mixing occurs, a single IMF consists of different features of a time series signal, or features of a similar time series signal mixed in different IMFs. To alleviate this shortcoming, Huang et al. [31] proposed an improved algorithm called ensemble empirical modal decomposition (EEMD). It solves the mode-mixing problem in EMD by adding Gaussian white noise to the original time series signal. The specific steps are shown below:
Step 1: Add Gaussian white noise to the original time series, then obtain a new time series.
Step 2: Decompose the new time series to obtain each IMF component.
Step 3: Repeat steps 1 and 2 continuously, but each time using a different Gaussian white noise.
Step 4: Take the ensemble means of corresponding IMFs of the decompositions as the final result.
2.2.2. CEEMDAN
Although EEMD overcomes the problem of mode mixing, the Gaussian white noise added to EEMD needs to be repeatedly averaged before it can be eliminated, which will take a large amount of computing time. Moreover, the reconstruction data still contains residual noise, and different realization of noise added to the time series could yield different numbers of modes. To solve the shortcomings of EEMD, Torres et al. [32] proposed a new decomposition method for time series signals, namely, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN). By adding positive and negative auxiliary white noise pairs to the original time series data and introducing adaptive noise components, the CEEMDAN algorithm not only maintains the capability of eliminating mode mixing and residual noise, but also has a lower cost for iterations and higher convergence performance. The specific decomposition algorithm is described as follows:
S(t) is the original time series, is the kth IMF obtained by CEEMDAN decomposition, represents the jth IMF obtained by EMD, is a scalar coefficient to set the signal-to-noise ratio at each stage, and thus, to determine the standard deviation of the Gaussian white noise, is the Gaussian white noise that meets the standard normal distribution. In this part of the calculation, , , , and represent a long vector of time series.
Step 1: By adding a white noise with signal-to-noise ratio to the original time series , the S(t) obtained is used for the first decomposition, as shown in Equation (4). Where t represents the different time points, i represents the ith addition of white noise, and n represents the total number of added white noise.
(4) |
Step 2: Use EMD to decompose n times, and then obtain . The average value is then calculated according to Equation (5) to obtain the first IMF of CEEMDAN, the first residual is obtained using Equation (6), and represents the first IMF obtained through EMD. In theory, since the average value of white noise is zero, the effect of white noise can be eliminated by calculating the average value.
(5) |
(6) |
Step 3: The adaptive noise term is the first IMF obtained by EMD with the addition of white noise with signal-to-noise ratio . Then the adaptive noise term is added to the first residual , obtaining a new time series. Subsequently, a new time series is decomposed to obtain the second IMF of CEEMDAN using Equation (7), the second residual is obtained according Equation (8).
(7) |
(8) |
Step 4: Repeat Step 3, the new time series is obtained by adding the new adaptive noise term to the residual term. Then decompose it to obtain the kth IMF of CEEMDAN, where ,. The specific Equations (9) and (10) are as follows:
(9) |
(10) |
Step 5: Finally, the CEEMDAN algorithm terminates when the residual term cannot continue the decomposition as it does not exceed two extreme points. At that time, the final residual is a distinct trend term. The full IMF and obtained are related to the original time series by the following Equation (11).
(11) |
2.2.3. LSTM
Long short-term memory (LSTM) network is a special variant of recurrent neural networks (RNN) [22]. In traditional feedforward neural networks, information can only flow from the input layer to the hidden layer, and finally from one direction to the output layer. The main distinction between RNNs and feedforward neural networks is that RNNs have a recurrent cell to store the historical state of all past elements in the sequence [42,43]. However, when training an RNN model with the gradient descent method (usually used to train feedforward neural networks), the gradient may increase or decrease exponentially, which can cause the gradient to vanish or explode. If the gradient vanishes during the training process, the weights cannot be updated, which eventually leads to training failure. On the contrary, exploding gradients that are too large will drastically update network parameters, and, in extreme cases, can lead to erroneous results [44]. LSTM improved RNN by introducing the concepts of gates, and of applying memory cells to filter and process historical states and information instead of the recurrent cells of RNN. The basic structure of an unfolded single memory cell of an LSTM is shown in Figure 4. Each memory cell contains an input gate (), a forget gate (), and an output gate () to control the flow of information.
The input gate () determines how much input data needs to be stored in the cell state at the current moment and the intermediate value () is used to update the cell state in the process of Equations (12) and (13).
(12) |
(13) |
The forget gate () determines how many cell states need to be retained from the previous moment to the current moment , as shown in Equation (14).
(14) |
The cell state is updated from to by removing some of the old information and adding the filtered intermediate value , as shown in Equation (15).
(15) |
The output gate () controls how much of the current cell state needs to be output to the new hidden state, as shown in Equations (16) and (17).
(16) |
(17) |
where in Equations (12)–(17), is the input at time ; and are the model output states at time and , respectively; and are the outputs of the hidden layer at time and , respectively; is the cell input state at time . , and are the outputs of the forget gate, input gate, and output gate at time , respectively; , , , and are the weights connecting and to the forget gate, input gate, output gate, and cell input, respectively; , , , and are their corresponding bias terms.
In this study, the Adam optimization algorithm, which is a gradient descent method, was used. Because it has the good property of adjusting the learning rate adaptively, it is often used to calculate a weight matrix [45]. Adam combines the advantages of AdaGrad and RMSProp optimization algorithms with simple implementation, high computational efficiency, and less computing resources. Since the update of parameters in Adam is not affected by the gradient transformation, it is suitable for unstable or sparse gradients in very noisy datasets. Overfitting is a common phenomenon in LSTM, which results in a trained model with high accuracy on the training set, but low accuracy on the test set. Therefore, it is essential to prevent overfitting during training. Srivastava et al. [46] proposed a method, namely dropout, to prevent overfitting by dropping some random neurons from the network with a certain probability in each training process.
As shown in Figure 5, Figure 5a shows the fully connected network, and Figure 5b shows the network after applying the dropout method. Dropout can solve the overfitting problem by ignoring feature detectors to reduce the complex relationships between neurons and force the neural network model to learn better features.
3. Construction of CEEMDAN-LSTM Hybrid Prediction Model
3.1. Process of Constructing the Hybrid Prediction Model
Taking the temperature time series at 0.5 m below the subgrade surface of the right shoulder of section K3588 + 100 as an example, the IMFs and R (residual) obtained by various decomposing methods of EMD, EEND, and CEEMDAN are shown in Figure 6, respectively.
Previously, the process of CEEMDAN-LSTM for general prediction purposes could be divided into three steps. First, decompose the time series to be predicted into IMFs and a residual using CEEMDAN. Second, put each decomposed IMF as a single input vector into the LSTM neural network, and then obtain each corresponding IMF prediction. Third, add all the obtained IMF predictions to the residual term to obtain the final prediction value.
Considering that the temperature of the subgrade in the seasonal frozen region is mainly influenced by the freeze-thaw cycle of the subgrade soils, a new temperature strategy is proposed in this study. Both the measured volumetric water content and frost heave at the corresponding location are time series. Combine the three types of values (i.e., volumetric water content, frost heave, and IMFs) as a vector, and then construct a composite time series dataset. This dataset is used to train and test the LSTM model, and the specific steps are shown in Figure 7.
As shown in Figure 7, the EMD, EEMD, and CEEMDAN methods were used to decompose the original temperature time series data, respectively, and then the obtained IMF is combined with the volumetric water content time series and the frost heave time series to compose a hybrid dataset, respectively. 80% of the hybrid dataset is used as the train set and 20% as the test set. The train set is put into the lstm model to train to get the optimal prediction model, and then the test set is put into the optimal prediction model to get the IMFs predictions. Finally, all the IMF predictions and the residual term are added together to get the final prediction.
3.2. LSTM Neural Network Parameter Setting
As shown in Figure 8, ten continuous time series data are used in this study to predict the next time step data in the future, this means that it is a step ahead of the forecast, with the length of each time step being 30 min. Other hyperparameters are set as follows: LSTM with 3 hidden layers, 36 hidden layer neurons, 3 input features, Dropout regularization of 0.1, learning rate of 0.001, the final output layer is a linear layer, and the number of training epochs for each model is 200. The specific dimensional transformation from the data input to the predicted output is shown in Figure 8.
3.3. Model Evaluation Metrics
In this study, three evaluation metrics, namely root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2), were used to evaluate the relevant models. RMSE (for which a smaller value is better) is considered the most widely used error assessment method in point forecasting, and is generally more sensitive to large deviations between measured (actual) and predicted values. MAE (for which a smaller value is better) avoids the problem of errors canceling each other out, thus accurately reflecting the actual magnitude of forecast errors, while R2 (for which a bigger value is better) is used to estimate the degree of conformity between predicted and actual values. The specific formulas are as shown in Equations (18)–(20).
(18) |
(19) |
(20) |
where is the predicted value, is the actual value, represents the average of the actual value, and n is the total number of time series samples.
4. Results
4.1. Performance Comparison of Models Predictions before and after Decomposition
This study compares the prediction performance of the models before and after temperature time series decomposition to verify whether the effort of decomposition has a practical improvement on the prediction performance of the model. The EMD, EEMD, and CEEMDAN models are used as pre-processing to decompose the temperature time series, respectively, combined with an LSTM neural network to form three hybrid models (i.e., EMD-LSTM, EEMD-LSTM, and CEEMDAN-LSTM). The reconstructed hybrid datasets after the decomposition of temperature time series outperformed the undecomposed LSTM neural network model in prediction performance. The comparison of prediction performance of different models is shown in Table 1. The prediction accuracy of the EMD-LSTM, EEMD-LSTM, and CEEMDAN-LSTM models compared to the LSTM model improved by 57%, 9%, and 80% on RMSE, 59%, 18%, and 84% on MAE, and 8%, 4%, and 11% on R2, respectively. Compared to the LSTM, EMD-LSTM, and EEMD-LSTM models, the prediction accuracy of the CEEMDAN-LSTM model is 80%, 54%, and 78% improved over RMSE, 84%, 60%, and 80% improved over MAE, and 10%, 2%, and 7% improved over R2, respectively. These results show that the prediction performance after any decomposition by EMD, EEMD, or CEEMDAN is higher than that of the LSTM model without decomposition, indicating that the temperature time series is better predicted after decomposition. Among these models, the CEEMDAN-LSTM model obtains the highest prediction accuracy and performed significantly better than other models. A comparison of the prediction results of different models for 0.5 m under the right shoulder of the subgrade is shown in Figure 9.
Table 1.
Predictive Performance |
LSTM | EMD-LSTM | EMD-LSTM | CEEMDAN-LSTM |
---|---|---|---|---|
RMSE (smaller is better) | 0.210335 | 0.090355 | 0.191001 | 0.041966 |
MAE (smaller is better) | 0.192146 | 0.079020 | 0.156670 | 0.031439 |
R2 (bigger is better) | 0.893138 | 0.966392 | 0.925871 | 0.989248 |
4.2. Comparison of Prediction Results of LSTM Models for Single and Hybrid Datasets
In this study, the ability of the hybrid dataset to improve the model prediction performance was validated by comparing the prediction results of the single dataset and the hybrid dataset. The single dataset of the ith IMF obtained by EMD, EEMD, and CEEMDAN, respectively, was reconstructed into a hybrid dataset with volumetric water content time series (H) and frost heave time series (D) at corresponding depth. Subsequently, the hybrid dataset and single dataset were respectively input into the LSTM model for prediction. The prediction performances were compared in Table 2. To distinguish the names of each model, the following decomposed single-variable LSTM models are named d-EMD-LSTM, d-EEMD-LSTM, and d-CEEMDAN-LSTM. It can be seen that the LSTM prediction models that use the hybrid dataset are better than those LSTM prediction models that use the single variable dataset for all metrics. For example, the EMD-LSTM model improves the prediction accuracy of RMSE, MAE, and R2 by 65%, 65%, and 10%, compared to the d-EMD-LSTM model; the EEMD-LSTM model improves the prediction accuracy of RMSE, MAE, and R2 by 30%, 32%, and 3%, compared to the d-EEMD-LSTM model; and the CEEMDAN-LSTM improves the prediction accuracy of RMSE, MAE, and R2 by 75%, 77%, and 4%, compared to the d-CEEMDAN-LSTM model, respectively. Moreover, when the different decomposition methods are compared on different datasets, it can be seen that the prediction accuracy is the highest after CEEMDAN processing, and the CEEMDAN-LSTM model outperforms all the other models in terms of prediction accuracy. A Comparison of the prediction results of models with a single dataset and with a hybrid dataset is shown in Figure 10.
Table 2.
Predictive Performance |
d-EMD-LSTM | d-EEMD-LSTM | d-CEEMDAN-LSTM | EMD-LSTM | EEMD-LSTM | CEEMDAN-LSTM |
---|---|---|---|---|---|---|
RMSE (smaller is better) | 0.286758 | 0.274303 | 0.168228 | 0.090355 | 0.191001 | 0.041966 |
MAE (smaller is better) | 0.226918 | 0.230845 | 0.134943 | 0.079020 | 0.156670 | 0.031439 |
R2 (bigger is better) | 0.876900 | 0.900513 | 0.955218 | 0.966392 | 0.925871 | 0.989248 |
4.3. Comparison of Different Depth Prediction Models
The data previously used in Section 4.1 and Section 4.2 is at a subgrade depth of 0.5 m. In order to verify the predictive performance of the CEEMDAN-LSTM model at different subgrade depths, and then to determine whether it can be applied to different depths of the subgrade, the monitored temperature time series at different depths were decomposed into EMD, EEMD, and CEEMDAN, then reconstructed into a hybrid dataset with the volumetric water content time series (H) and frost heave time series (D) at their corresponding depths, then put into the LSTM model for training and finally for prediction. Due to paper space limitations, only the figures of CEEMDAN decomposition are shown in Figure 11.
The temperature time series were processed by EMD, EEMD, and CEEMDAN, respectively, then reconstructed into a hybrid dataset that contains the volumetric water content time series (H) and frost heave time series (D) of the corresponding depths. After that, the hybrid dataset was put into the LSTM model training, to get the optimal model, and to then make predictions. The specific prediction results are shown in Figure 12.
As shown in Figure 12, it can be seen that the prediction performance of the CEEMDAN-LSTM model is higher than that of the EMD-LSTM and EEMD-LSTM models at different subgrade depths, and the CEEMDAN-LSTM model does not decrease its prediction performance with the change of depth. However, as the depth of the stratum continues to increase, the temperature no longer changes significantly, and the temperature time series curve trend is nearly a smooth curve at this time. Then, the decomposition by EMD, EEMD, and CEEMDAN will no longer yield significant accuracy gains. This confirms that EMD-like methods are more suitable for time series with complex changes to obtain better decomposition. The detailed different performances are shown in Table 3.
Table 3.
Depth under Subgrade | Predictive Performance |
EMD-LSTM | EEMD-LSTM | CEEMDAN-LSTM |
---|---|---|---|---|
0.5 m | RMSE | 0.090355 | 0.191001 | 0.041966 |
MAE | 0.079020 | 0.156670 | 0.031439 | |
R 2 | 0.966392 | 0.925871 | 0.989248 | |
1.5 m | RMSE | 0.245341 | 0.274970 | 0.086217 |
MAE | 0.229630 | 0.233995 | 0.064258 | |
R 2 | 0.947074 | 0.934600 | 0.964736 | |
2.9 m | RMSE | 0.102413 | 0.139584 | 0.047771 |
MAE | 0.092721 | 0.127279 | 0.038470 | |
R 2 | 0.925463 | 0.936328 | 0.960552 |
As shown in Table 3, the prediction accuracy of the CEEMDAN-LSTM model at subgrade depth of 0.5 m is improved by 54% and 78% in RMSE, 60% and 80% in MAE, and 2% and 7% in R2, respectively, compared to the EMD-LSTM and EEMD-LSTM models. The prediction accuracy of the CEEMDAN-LSTM model at subgrade depth of 1.5 m is improved by 65% and 69% in RMSE, 72% and 73% in MAE, and 2% and 3% in R2, respectively, compared to the EMD-LSTM and EEMD-LSTM models. The prediction accuracy of the CEEMDAN-LSTM model at subgrade depth of 2.9 m is improved by 54% and 66% in RMSE, 59% and 70% in MAE, and 4% and 2% in R2, respectively, compared to the EMD-LSTM and EEMD-LSTM models.
5. Conclusions
Subgrade temperature prediction is of great significance for analyzing the thermal state of subgrades in seasonal frozen regions. In order to more accurately predict the subgrade temperature changes, a CEEMDAN-LSTM hybrid prediction model suitable for hybrid datasets is proposed. For various model comparison purposes, the original temperature time series were processed by EMD, EEMD, and CEEMDAN, respectively, and each IMF component was reconstructed into a hybrid dataset with the corresponding volumetric water content time series (H) and frost heave time series (D), and then the hybrid data were put into the LSTM neural network for training and prediction. Finally, the prediction results of different models were compared and analyzed, and the specific conclusions are as follows:
-
(1)
By comparing the prediction performance of the original temperature time series model without decomposition and the models with different decompositions, it was found that the CEEMDAN-LSTM model among the hybrid models (i.e., EMD-LSTM, EEMD-LSTM, and CEEMDAN-LSTM) had the best prediction performance. Specifically, the prediction accuracy of the CEEMDAN-LSTM model was improved by 80%, 54%, and 78% in RMSE compared with the LSTM, EMD-LSTM, and EEMD-LSTM models, respectively. This means that the CEEMDAN decomposition method can better handle the nonlinearity and non-stationarity of the temperature time series data under the subgrade of seasonal frozen regions.
-
(2)
Hybrid datasets significantly improved the prediction performance over single datasets, which is attributed to the strong extraction ability of LSTM neural networks for multidimensional features. It was found that models using hybrid datasets all outperformed models using single datasets, among which the CEEMDAN-LSTM model using hybrid datasets had the best prediction performance.
-
(3)
In order to evaluate the prediction performance of models for different depths of the subgrade, predictions were applied to locations at subgrade depths of 0.5 m, 1.5 m, and 2.9 m. It was found that the CEEDMAN-LSTM model had the best prediction performance at all subgrade depths, and the performance did not decrease with depth. This means that the model can accurately predict the temperature inside the subgrade in seasonal frozen regions, which can provide reference and guidance for related research.
Acknowledgments
This paper was significantly improved with the aid of anonymous reviewers. All support is gratefully acknowledged.
Author Contributions
Conceptualization, X.L.; methodology, L.C. and X.L.; software, L.C.; validation, L.C. and F.C.; formal analysis, X.L. and C.Z.; investigation, L.C., C.Z., X.H., F.C. and B.Z.; resources, C.Z. and X.H.; data curation, L.C., X.H., F.C. and B.Z.; writing—original draft preparation, L.C. and X.L.; writing—review and editing, X.L.; visualization, L.C. and B.Z.; supervision, C.Z. and X.H.; project administration, X.L. and X.H.; funding acquisition, X.L., C.Z. and X.H. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by the National Natural Science Foundation of China (NSFC), grant numbers 42072314, 42177147 and 41807271; China Postdoctoral Science Foundation, grant numbers 2012M521500 and 2014T70758; the National Key R&D Program of China, grant number 2017YFC1501304; China Communications Construction Company Second Highway Consultant Co., Ltd., grant number KJFZ-2018-049-001. The APC was funded by China Communications Construction Company Second Highway Consultant Co., Ltd., grant number KJFZ-2018-049.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Xu X.Z., Wang J.C., Zhang L.X. Physics of Frozen Soils. Science Press; Beijing, China: 2001. [Google Scholar]
- 2.Lai Y., Xu X., Dong Y., Li S. Present situation and prospect of mechanical research on frozen soils in China. Cold Reg. Sci. Technol. 2013;87:6–18. doi: 10.1016/j.coldregions.2012.12.001. [DOI] [Google Scholar]
- 3.Deng Q., Liu X., Zeng C., He X., Chen F., Zhang S. A Freezing-Thawing Damage Characterization Method for Highway Subgrade in Seasonally Frozen Regions Based on Thermal-Hydraulic-Mechanical Coupling Model. Sensors. 2021;21:6251. doi: 10.3390/s21186251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Liu Y., Li D., Chen L., Ming F. Study on the Mechanical Criterion of Ice Lens Formation Based on Pore Size Distribution. Appl. Sci. 2020;10:8981. doi: 10.3390/app10248981. [DOI] [Google Scholar]
- 5.Wang P., Liu E., Song B., Liu X., Zhang G., Zhang D. Binary medium creep constitutive model for frozen soils based on homogenization theory. Cold Reg. Sci. Technol. 2019;162:35–42. doi: 10.1016/j.coldregions.2019.03.019. [DOI] [Google Scholar]
- 6.Boyd G., Na D., Li Z., Snowling S., Zhang Q., Zhou P. Influent Forecasting for Wastewater Treatment Plants in North America. Sustainability. 2019;11:1764. doi: 10.3390/su11061764. [DOI] [Google Scholar]
- 7.Davidson J., Li X. Strict stationarity, persistence and volatility forecasting in ARCH (∞) processes. J. Empir. Financ. 2016;38:534–547. doi: 10.1016/j.jempfin.2015.08.010. [DOI] [Google Scholar]
- 8.Trombe P.-J., Pinson P., Madsen H. A General Probabilistic Forecasting Framework for Offshore Wind Power Fluctuations. Energies. 2012;5:621–657. doi: 10.3390/en5030621. [DOI] [Google Scholar]
- 9.Xin J., Zhou J., Yang S.X., Li X., Wang Y. Bridge Structure Deformation Prediction Based on GNSS Data Using Kalman-ARIMA-GARCH Model. Sensors. 2018;18:298. doi: 10.3390/s18010298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang X., Ma J.-J., Wang S., Bi D.-W. Time series forecasting for energy-efficient organization of wireless sensor networks. Sensors. 2007;7:1766–1792. doi: 10.3390/s7091766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yu R., Yang Y., Yang L., Han G., Move O.A. RAQ-A Random Forest Approach for Predicting Air Quality in Urban Sensing Systems. Sensors. 2016;16:86. doi: 10.3390/s16010086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mohandes M.A., Halawani T.O., Rehman S., Hussain A.A. Support vector machines for wind speed prediction. Renew. Energy. 2004;29:939–947. doi: 10.1016/j.renene.2003.11.009. [DOI] [Google Scholar]
- 13.Li J., Dai B., Li X., Xu X., Liu D. A Dynamic Bayesian Network for Vehicle Maneuver Prediction in Highway Driving Scenarios: Framework and Verification. Electronics. 2019;8:40. doi: 10.3390/electronics8010040. [DOI] [Google Scholar]
- 14.Amrouche B., Le Pivert X. Artificial neural network based daily local forecasting for global solar radiation. Appl. Energy. 2014;130:333–341. doi: 10.1016/j.apenergy.2014.05.055. [DOI] [Google Scholar]
- 15.Fan J., Liu C., Lv Y., Han J., Wang J. A Short-Term Forecast Model of foF2 Based on Elman Neural Network. Appl. Sci. 2019;9:2782. doi: 10.3390/app9142782. [DOI] [Google Scholar]
- 16.Lin L., Wang F., Xie X., Zhong S. Random forests-based extreme learning machine ensemble for multi-regime time series prediction. Expert Syst. Appl. 2017;83:164–176. doi: 10.1016/j.eswa.2017.04.013. [DOI] [Google Scholar]
- 17.Santamaría-Bonfil G., Reyes-Ballesteros A., Gershenson C. Wind speed forecasting for wind farms: A method based on support vector regression. Renew. Energy. 2016;85:790–809. doi: 10.1016/j.renene.2015.07.004. [DOI] [Google Scholar]
- 18.Chen T.-T., Lee S.-J. A weighted LS-SVM based learning system for time series forecasting. Inform. Sci. 2015;299:99–116. doi: 10.1016/j.ins.2014.12.031. [DOI] [Google Scholar]
- 19.Ticknor J.L., Hsu-Kim H., Deshusses M.A. A robust framework to predict mercury speciation in combustion flue gases. J. Hazard. Mater. 2014;264:380–385. doi: 10.1016/j.jhazmat.2013.10.052. [DOI] [PubMed] [Google Scholar]
- 20.Khellal A., Ma H., Fei Q. Convolutional Neural Network Based on Extreme Learning Machine for Maritime Ships Recognition in Infrared Images. Sensors. 2018;18:1490. doi: 10.3390/s18051490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Rather A.M., Agarwal A., Sastry V.N. Recurrent neural network and a hybrid model for prediction of stock returns. Expert Syst. Appl. 2015;42:3234–3241. doi: 10.1016/j.eswa.2014.12.003. [DOI] [Google Scholar]
- 22.Hochreiter S., Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
- 23.Li Y., Zhu Z., Kong D., Han H., Zhao Y. EA-LSTM: Evolutionary attention-based LSTM for time series prediction. Knowl.-Based Syst. 2019;181:104785. doi: 10.1016/j.knosys.2019.05.028. [DOI] [Google Scholar]
- 24.Su B., Lu S. Accurate recognition of words in scenes without character segmentation using recurrent neural network. Pattern Recognit. 2017;63:397–405. doi: 10.1016/j.patcog.2016.10.016. [DOI] [Google Scholar]
- 25.Weninger F., Geiger J., Wöllmer M., Schuller B., Rigoll G. Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments. Comput. Speech Lang. 2014;28:888–902. doi: 10.1016/j.csl.2014.01.001. [DOI] [Google Scholar]
- 26.Zhang J., Zhu Y., Zhang X., Ye M., Yang J. Developing a Long Short-Term Memory (LSTM) based model for predicting water table depth in agricultural areas. J. Hydrol. 2018;561:918–929. doi: 10.1016/j.jhydrol.2018.04.065. [DOI] [Google Scholar]
- 27.Ma X., Tao Z., Wang Y., Yu H., Wang Y. Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp. Res. Part C Emerg. Technol. 2015;54:187–197. doi: 10.1016/j.trc.2015.03.014. [DOI] [Google Scholar]
- 28.Xue H., Huynh D.Q., Reynolds M. SS-LSTM: A Hierarchical LSTM Model for Pedestrian Trajectory Prediction; Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV); Lake Tahoe, NV, USA. 12–15 March 2018; pp. 1186–1194. [Google Scholar]
- 29.Rao G., Huang W., Feng Z., Cong Q. LSTM with sentence representations for document-level sentiment classification. Neurocomputing. 2018;308:49–57. doi: 10.1016/j.neucom.2018.04.045. [DOI] [Google Scholar]
- 30.Huang N.E., Shen Z., Long S.R., Wu M.C., Shih H.H., Zheng Q., Yen N.-C., Tung C.C., Liu H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. London. Ser. A Math. Phys. Eng. Sci. 1998;454:903–995. doi: 10.1098/rspa.1998.0193. [DOI] [Google Scholar]
- 31.Wu Z., Huang N.E. Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal. 2009;1:1–41. doi: 10.1142/S1793536909000047. [DOI] [Google Scholar]
- 32.Torres M.E., Colominas M.A., Schlotthauer G., Flandrin P. A complete ensemble empirical mode decomposition with adaptive noise; Proceedings of the 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP); Prague, Czech Republic. 22–27 May 2011; pp. 4144–4147. [Google Scholar]
- 33.Sha J., Li X., Zhang M., Wang Z.-L. Comparison of Forecasting Models for Real-Time Monitoring of Water Quality Parameters Based on Hybrid Deep Learning Neural Networks. Water. 2021;13:1547. doi: 10.3390/w13111547. [DOI] [Google Scholar]
- 34.Zhang X., Zhang Q., Zhang G., Nie Z., Gui Z. A Hybrid Model for Annual Runoff Time Series Forecasting Using Elman Neural Network with Ensemble Empirical Mode Decomposition. Water. 2018;10:416. doi: 10.3390/w10040416. [DOI] [Google Scholar]
- 35.Zheng H., Yuan J., Chen L. Short-Term Load Forecasting Using EMD-LSTM Neural Networks with a Xgboost Algorithm for Feature Importance Evaluation. Energies. 2017;10:1168. doi: 10.3390/en10081168. [DOI] [Google Scholar]
- 36.Lei Z., Su W. Mold Level Predict of Continuous Casting Using Hybrid EMD-SVR-GA Algorithm. Processes. 2019;7:177. doi: 10.3390/pr7030177. [DOI] [Google Scholar]
- 37.Zhang X., Zhang Q., Zhang G., Nie Z., Gui Z., Que H. A Novel Hybrid Data-Driven Model for Daily Land Surface Temperature Forecasting Using Long Short-Term Memory Neural Network Based on Ensemble Empirical Mode Decomposition. Int. J. Environ. Res. Public Health. 2018;15:1032. doi: 10.3390/ijerph15051032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jiang X., Wei P., Luo Y., Li Y. Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model. Atmosphere. 2021;12:1452. doi: 10.3390/atmos12111452. [DOI] [Google Scholar]
- 39.Lin H., Sun Q. Crude Oil Prices Forecasting: An Approach of Using CEEMDAN-Based Multi-Layer Gated Recurrent Unit Networks. Energies. 2020;13:1543. doi: 10.3390/en13071543. [DOI] [Google Scholar]
- 40.Zhou Y., Wang L., Qian J. Application of Combined Models Based on Empirical Mode Decomposition, Deep Learning, and Autoregressive Integrated Moving Average Model for Short-Term Heating Load Predictions. Sustainability. 2022;14:7349. doi: 10.3390/su14127349. [DOI] [Google Scholar]
- 41.Lin H., Sun Q., Chen S.-Q. Reducing Exchange Rate Risks in International Trade: A Hybrid Forecasting Approach of CEEMDAN and Multilayer LSTM. Sustainability. 2020;12:2451. doi: 10.3390/su12062451. [DOI] [Google Scholar]
- 42.LeCun Y., Bengio Y., Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
- 43.Wang K., Qi X., Liu H. A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl. Energy. 2019;251:113315. doi: 10.1016/j.apenergy.2019.113315. [DOI] [Google Scholar]
- 44.Niu H., Xu K., Wang W. A hybrid stock price index forecasting model based on variational mode decomposition and LSTM network. Appl. Intell. 2020;50:4296–4309. doi: 10.1007/s10489-020-01814-0. [DOI] [PubMed] [Google Scholar]
- 45.Kingma D.P., Ba J. Adam: A method for stochastic optimization. arXiv. 20141412.6980 [Google Scholar]
- 46.Srivastava N., Hinton G., Krizhevsky A., Sutskever I., Salakhutdinov R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn Res. 2014;15:1929–1958. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Not applicable.