Hybrid the long short-term memory with whale optimization algorithm and variational mode decomposition for monthly evapotranspiration estimation

Tonglin Fu; Xinrong Li

doi:10.1038/s41598-022-25208-z

. 2022 Dec 1;12:20717. doi: 10.1038/s41598-022-25208-z

Hybrid the long short-term memory with whale optimization algorithm and variational mode decomposition for monthly evapotranspiration estimation

Tonglin Fu ^1,^✉, Xinrong Li ²

PMCID: PMC9715655 PMID: 36456679

Abstract

The sustainability of artificial sand-binding vegetation is determined by the water balance between evapotranspiration (ET) and precipitation in desert regions. Consequently, accurately estimating ET is a critical prerequisite for determing the types and spatial distribution of artificial vegetation in different sandy areas. For this purpose, a novel hybrid estimation model was proposed to estimate monthly ET by coupling the deep learning long short term memory (LSTM) with variational mode decomposition (VMD) and whale optimization algorithm (WOA) (i.e., VMD-WOA-LSTM) to estimate the monthly ET in the southeast margins of Tengger Desert. The superiority of LSTM was selected due to its capability of automatically extracting the nonlinear and nonstationary features from sequential data, WOA was employed to optimize the hyperparameters of LSTM, and VMD was used to extract the intrinsic traits of ET time series. The estimating results of VMD-WOA-LSTM has been compared with actual ET and estimation of other hybrid models in terms of standard performance metrics. The results reveale that VMD-WOA-LSTM provide more accurate and reliable estimating results than that of LSTM, the support vector machine (SVM), and the variants of those models. Therefore, VMD-WOA-LSTM could be recommended as an essential auxiliary method to estimate ET in desert regions.

Subject terms: Environmental sciences, Hydrology

Introduction

Evapotranspiration (ET) is a highly nonlinear physical and biological process, which connects the ecological and hydrological processes by the water balance^1,2. It is the central component of regional water and energy balance, and serves as a significant linkage in the soil–plant-atmosphere (SPA) system³. Accurately estimating ET is a critical prerequisite in environmental management^4–6, especially in desert regions with large areas of artificial sand-binding vegetation, where the sustainability of artificial sand-binding vegetation is determined by the water balance between ET and precipitation^5,7 . In addition, climate change, especially changes in warming and precipitation patterns, will inevitably have a profound impact on the sustainability of artificial vegetation^7,8. Different from the natural vegetation, artificial sand-binding vegetation is established with speciall purpose and function, the accurate estimation of ET can provide a reference for understanding the water balance and determining the composition, structure, spatial distribution, and scale of artificial sand-binding vegetation in desert regions^9,10. However, the application of physically-based methods (e.g., Priestley-Taylor method, Hargreaves method, the corrected FAO-24 Penman method, FAO-56 Penman–Monteith method, etc.) is severely limited due to the lack of required meteorological parameters (e.g. the latent heat of vaporization, solar radiation, relative humidity, air temperature, etc.) in desert regions^4,6,2–12. Therefore, constructing the other types of data-driven models to obtain accurate estimating results is highly desirable.

Recently, the machine learning (ML) models, including back-propagation neural networks (BPNN)¹³, multi-layer perceptron (MLP)², Multilayer artificial neural networks (MLNN)⁶, support vector machine (SVM)^7,12 , extreme learning machine (ELM) ⁶, Model tree (MT)^14,15, random forest (RF)⁶, wavelet neural networks (WNN)¹⁶, radial basis function (RBF)¹⁷, etc., have been dramatically employed to estimate evaporation or ET due to its capability of automatically learning features and not requiring any assumptions. As ML models have the defects that the hyperparameters are difficult to adjust by themselves, which significantly decrease the computing accuracy. To overcome the drawbacks of ML models, meta-heuristic algorithms such as flower pollination algorithm (FPA)⁶, firefly algorithm (FFA)¹¹, intelligent water drops (IWD) algorithm¹² , whale optimization algorithm (WOA)¹⁸, grey wolf optimizer algorithm (GWO)^19,20 etc., were employed to determine the optimal hyperparameters of ML models. Studies have shown that ML models coupled with meta-heuristic algorithms have higher computing performance than that of single ML models and physically-based methods^{12,16,18,21,22}.

As ET is closely affected by the meteorological parameters, soil moisture, and vegetation traits¹², the measured ET time series taking on many sharp and fluctuating points, which significantly decreased the estimating accuracy¹². To obtain more credible estimating results, data pre-processing techniques, including Discrete wavelet transform (DWT)²³, ensemble empirical mode decomposition (EEMD)^14,15, and variational mode decomposition (VMD)^7,24 etc., were employed to decompose ET time series frequency into various components and obtain the required information at multiple levels^7,14,23,24. Literature review shows that data pre-processing techniques hybridized with ML models can significantly improve the model performance^16,25 . In this regard, Gocić et al.²²coupled SVM with DWT and firefly algorithm (FFA) to estimate reference ET in Serbia, where FFA was employed to determine the hyperparameters of SVM. The results show that DWT-FFA-SVM is the best estimating method for reference ET estimation. Pammar and Deka^[24]proposed a hybrid DWT-SVM to estimate the daily pan evaporation in Karnataka, India. The results also confirm that SVM combined with DWT can improve the estimation accuracy. Rezaie-Balf et al.¹⁵ integrated EEMD with SVM and M5 model tree (M5T) separately to estimate the monthly pan evaporation models of Siirt station and Diyarbakir station in Turkish, and the proposed models presented much higher accuracy. Fu et al.⁷ proposed hybrid models by combining the DWT, EEMD, and VMD with SVM and GWO-SVM separately to estimate the monthly ET. The results indicated that VMD and DWT exhibited better pre-processing performance than that of EEMD, and the estimating accuracy of VMD-GWO-SVM was higher than that of DWT-GWO-SVM and EEMD-GWO-SVM.

The previous works are mainly focused on using shallow ML models to estimate ET^{2,4,6,7,2–18,21–23}. It is well known that the shallow ML models have drawbacks that cannot sufficiently extract the hidden nonlinear and non-static features from the ET time series²⁵. Thus, long short-term memory (LSTM)^3,26, deep neural network (DNN)²⁷, temporal convolution neural network(TCN)²⁷, recurrent neural network (RNN)²⁸ have been employed to estimate ET or evaporation based on limited meteorological data. E.g., Majhi et al.³ used LSTM, MLNN, Hargreaves formula, and Blaney-Criddle formula to estimate daily pan evaporation of Chhattisgarh state in India. The results indicate that LSTM can achieve accurate estimation of evapotranspiration, and has better estimation than other models. Chen et al.²⁷estimated the daily reference ET in the Northeast plain of China by using LSTM, DNN, TCN, SVM, RF, Hargreaves model, Ritchie method, Priestley-Talor model, Makkink formula, Romanenko model, and Schendel formula, respectively. The results show that the LSTM, TCN and DNN have better estimation performance than that of the shallow ML models and empirical models in the absence of meteorological parameters. Granata and Di Nunno²⁸ used LSTM and NARX to estimate ET of Cypress Swamp and Kobeh Valley in the USA. The results show that deep learning models have higher precision than the shallow ML models due to the high hierarchical structure.

In fact, the hyperparameters of ML models directly determine the computing accuracy, but most data driven models are unable to search the optimal hyperparameters by themselves, and LSTM is no exception. The hyperparameters of LSTM, including the number of hidden layers (HL), number of hidden units (HU), epochs, and learning rate (LR)²⁶, significantly affect the estimating performance of LSTM. However, to the best knowledge of the authors, the application of LSTM coupled with meta-heuristic algorithms to estimate evaporation or ET has been very minimal.

Study area and data

The research was conducted in the southeast margins of Tengger Desert (37°32'N, 105°02'E). The primary landscape type is densely distributed trellis dunes^9,10. To prevent the harm of sandstorms to the Baotou-Lanzhou railway, the Chinese academy of sciences and relevant units of railway have established artificially re-vegetated belt in 1956a and extended in 1964a, 1981a, and 1987a. Mechanical sand barrier perpendicular to the main wind direction was installed on the mobile dune, straw checkerboards (spacing 1 m × 1 m) were set behind the mechanical sand barrier, and two-year xeristic shrub seedlings were planted in the same configuration in a banded way with plant spacing and row spacing of 1 m × 2 m or 2 m × 3 m under the condition of no irrigation. After more than half a century of succession, the number of natural plant species has increased from 25 to 453, and the vegetation coverage has increased from less than 1% to 42.3%. A biological windbreak sand fixation zone with a length of 16 km and a width of 200-1000 m has been gradually formed. The artificial sand-binding vegetation established in different years (1956a, 1964a, 1981a and 1987a) are distributed on both sides of the railway in parallel, which successfully prevented the damage of wind-blown-sand damage to railway traffic and significantly improved the ecological environment of the study area. As the stability and sustainability of the revegetation depend on the water balance between ET and precipitation^7,9,10, it is of great theoretical and practical significance to accurately estimate ET for protecting and utilizing artificial sand-binding vegetation^7,10.

In this study, the monthly ET data measured from January 1991 to December 2018, the data from January 1991 to December 2010 were regarded as the training set, and the rest was used as the testing set. Table 1 shows the main statistical metrics of the monthly ET time series in the study area.

Table 1.

The statistical characters of the monthly ET time series.

	Data set	Mean	Std.	Skewness	Kurtosis	Minimum	Maximum
ET	Training	221.2187	126.7168	0.246	− 1.233	29.3	488.40
	Testing	182.0594	107.8447	0.381	− 1.064	32.8	424.80
	Total	210.0304	122.7508	0.321	− 1.134	29.30	488.40

Open in a new tab

Methodology

The framework of the proposed models

LSTM is a new time cycle neural network that can overcome the gradient vanishing problem in RNN by adding a chain form of repeating neural network modules to store relevant information^25,26. It uses the working principle of “two in and two out” to solve the problem of long-order dependency²⁶. In this study, LSTM was selected as main modular to estimate the monthly ET duo to the LSTM has the excellent capability of tackling nonlinear patterns among the time series^27,28. In adition, SVM was also employed to estimate ET since SVM has better adaptability to solve a broader class of nonlinear fitting problems (e.g., estimate ET) than that of other shallow ML models (e.g., BPNN, WNN, ELM, MT, and MLP)²⁹ .

As DWT is sensitive to the wavelet basis and the threshold, EMD suffers from an intrinsic drawback of mode mixing^7,24, and EEMD exists endpoint effects⁷. VMD is a more robust and adaptive data pre-processing method than DWT, EMD, and EEMD²⁴, it effectively overcomes the shortcomings of those methods and extract the main variation features from the irregularity and non-stationarity time series²⁴. In this study, VMD was employed to extract the main variation features from the ET data set. To compare the denoising performance of VMD, DWT was selected as the reference. The output of DWT, VMD and the collected ET time series were divided into training set and testing set, and fed to ML models to estimate ET, respectively.

To improve the training efficiency of ML models, the min–max normalization method^7,22 was used to normalize the input and output data sets. The input and output sets of the SVM and LSTM were determined by using the longitudinal selection (LS) method⁷, namely, the group of training samples is represented by a vector from $i$ to $k - q$ , the training output is denoted as a vector from $i + d - 1$ to $k - q$ , the testing output vector is represented by a vector from $k - q + 1$ to $k$ , where $i$ , $d$ , $k$ and $q$ denote the starting point, the input dimension, the size of data set, and the length of the testing output, respectively. Figure 1 shows the schematic diagram of the LS method with $i = 1$ , $d = 3$ and $k = 336$ .

WOA is a new efficient and stable meta-heuristic algorithms¹⁸, which was widely used to solve the nonlinear optimization problems, including optimizing the hyperparameters of shallow ML models for estimation of evaporation or ET. However, no researcher has proposed a hybrid model by coupling LSTM with meta-heuristic algorithms to estimate ET. Thus, the WOA was employed to optimize the hyperparameters of LSTM to obtain more accurately estimating results. Figure 2 shows the flowcharts of WOA-LSTM. The WOA-LSTM coupled with VMD is denoted as VMD-WOA-LSTM.

The superiority of VMD-WOA-LSTM was tested by comparing the estimating performance with LSTM, SVM, DWT-LSTM, DWT-SVM, VMD-LSTM, VMD-SVM, WOA-LSTM, GWO-SVM, DWT-WOA-LSTM, DWT-GWO-SVM, and VMD-GWO-SVM, where GWO-SVM indicates that the best C and G of SVM were determined by using GWO algorithm, Fig. 3 show the flowcharts of GWO-SVM. In the numerical simulation processes, the mean absolute percentage error (MAPE) was employed as the fitness function to determine the optimal hyperparameters of ML models. It should be noted that the mathematical principles of the methods used in this paper, including DWT, VMD, SVM, LSTM, GWO and WOA, can be found in relevant literatures, so the mathematical description of these methods is omitted.

Metrics for performance evaluation

The error analysis is performed by using the evaluation metrics, including the mean absolute error (MAE), MAPE, normalized mean squared error (NMSE), root mean squared error (RMSE), and Nash–Sutcliffe coefficient of efficiency (NSCE). The definitions of those evaluation metrics are as follows:

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}|

M A P E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{y_{i} - {\hat{y}}_{i}}{y_{i}}| \times 100 %

N M S E = \frac{1}{n} \sum_{i = 1}^{n} {(\frac{y_{i} - {\hat{y}}_{i}}{y_{i}})}^{2}

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

N S C E = 1 - \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2} / \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}

where $y_{i}$ and ${\hat{y}}_{i}$ denote the desired and estimated result.

The MAE, MAPE, NMSE, and RMSE are generally used to measure the deviation between the desired and estimated result, the estimating performance is better when the values of these evaluation metrics are smaller. As MAPE is relatively more stable among these criterions, and NSCE is generally used to determine model efficiency in the hydrological field (NSCE close to 1 indicate the proposed model has good fitness). Therefore, MAPE and NSCE were selected as the main benchmarks to evaluating the model performance of the proposed models.

Parameter settings

The estimating performance of different models is primarily determined by the parameters selection. Table 2 shows the key parameters of the DWT and VMD. The number of search agents and the maximum iterations of WOA and GWO algorithms were 5 and 100, respectively. The mini-batch size of all the LSTM-based models was 128. The range of HL, HU, Epochs and LR of WOA-LSTM were [1, 200], [1, 200], [10, 100], and [0.001, 0.01], respectively. As for GWO-SVM, the range of C and G were [0.01, 100].

Table 2.

The experiment parameters and denoising performance of DWT and VMD.

DWT	db4 with	VMD
Threshold method	Rigorous sure	The balance parameter parameterata fidelity constraint	$α = 0.05$
Threshold rule	Soft	The step of the dual ascent	$τ = 0$
Noise structure	Unscaled wite noise	The number of modes	K = 5
SNR	40.8201	SNR	42.6451
RMSE	2.2127	RMSE	1.7934

Open in a new tab

Results

Figures 4 and 5 show the results of DWT and VMD. As shown in Figs. 4 and 5, the collected monthly ET time series have many sharp and fluctuating points, which will affect the estimating performance. To resolve these problematic patterns in ET estimating processes, the db4 with level 1 and VMD with K = 5 were finalized and employed to denoise the steady ET time series. The residual time series shows the low amplitude and high-frequency fluctuation, suggesting that the white noise was eliminated from the original ET data set. Table 2 shows that the denoising performance of VMD (with SNR = 42.6451 and RMSE = 1.7934) is more efficient than that of DWT (with SNR = 40.8201 and RMSE = 2.2127). In fact, DWT has the the disadvantages that the wavelet basis should be selected in advance, and it is also sensitive to the selection of threshold. Therefore, VMD has better denoising performance than that of DWT in this study.

As mentioned, LSTM and SVM were used to estimate the monthly ET, respectively. For each fixed input dimension d in d = 2, 3, …, 16, the minimum MAPE was selected from 5 replications, and the optimal hyperparameters of the WOA-LSTM and GWO-SVM were recorded according to the minimum MAPE. Table 3 shows the optimal input features and hyperparameters of proposed models. As shown in Table 3, the MAPE of each LSTM-based model is less than that of SVM-based, suggesting that the deep learning models significantly outperformed the shallow ML models; The MAPE of single ML model was decreased by coupling ML model with the data pre-processing techniques, suggesting that the performance of ML model can be improved by using the data pre-processing; The MAPE of the hybrid models with optimal parameters are all smaller than that of the ML models with default parameters, indicating that ML models integrated with meta-heuristic algorithms have moderately higher computing performance than that of the ML models with default parameters; The MAPE of the hybrid DWT-GWO-SVM, DWT-WOA-LSTM, VMD-GWO-SVM, and VMD-WOA-LSTM models are smaller than that of LSTM, SVM, DWT-LSTM, DWT-SVM, VMD-LSTM, VMD-SVM, WOA-LSTM, and GWO-SVM, suggesting that the performance of ML model can be significantly improved by using the data pre-processing and meta-heuristic algorithm.

Table 3.

The experiment parameters of the proposed model.

Parameters	LSTM	DWT-LSTM	VMD-LSTM	WOA-LSTM	DWT-WOA-LSTM	VMD-WOA-LSTM
Input features	10	12	12	12	12	12
No. of HL	200	200	200	90	159	70
No. of HU	200	200	200	140	144	79
Epochs	100	100	100	47	51	64
LR	0.005	0.005	0.005	0.0098	0.0085	0.0068
Batch siz	128	128	128	128	128	128
MAPE	28.18%	25.55%	27.17%	18.97%	19.72%	18.90%

Parameters	SVM	DWT-SVM	VMD-SVM	GWO-SVM	DWT-GWO-SVM	VMD-GWO-SVM
Input features	3	3	3	4	4	3
C	1	1	1	55.4135	68.9681	86.0928
G	0.3333	0.3333	0.3333	0.7248	1.2431	0.7648
MAPE	28.42%	27.94%	27.46%	24.14%	23.69%	23.21%

Open in a new tab

The estimating results of the ML models with the minimum MAPE and the optimal hyperparameters are presented in Fig. 6. Figure 6 shows that the results of all the proposed models are consistent with most of the points in the middle range, but the starting point and extreme values are overestimated; The monthly ET time series have a period of 12 months, the optimal input dimensions of LSTM, DWT-LSTM, VMD-LSTM, WOA-LSTM, DWT-WOA-LSTM,and VMD-WOA-LSTM are approximately equal to the period of the ET time series (Table 3), suggesting that LSTM can take full advantage of historical data patterns and effectively overcome the drawbacks of shallow ML models. Overall, the hybrid DWT-GWO-SVM, DWT-WOA-LSTM, VMD-GWO-SVM, and VMD-WOA-LSTM models have better estimating performance than that of other proposed models.

ET estimation using (A) LSTM and SVM, (B) DWT-LSTM and DWT-SVM, (C) VMD-LSTM and VMD-SVM, (D) WOA-LSTM and GWO-SVM, (E) DWT-WOA-LSTM and DWT-GWO-SVM, (F) VMD-WOA-LSTM and VMD-GWO-SVM.

Table 4 shows the evaluation metrics of each model in the training and testing stages, where the evaluation metrics average of the ML models optimized by meta-heuristic algorithms are marked in bold. As shown in Table 4, the model performance of DWT-LSTM, DWT-SVM, VMD-LSTM, and VMD-SVM models is better than that of LSTM and SVM in testing stage (Table 4), suggesting that the data pre-processing is an aid to enhance the estimating performance of ML models. As shown in Table 4, the evaluation metrics including MAE, MAPE, NMSE, and RMSE of WOA-LSTM and GWO-SVM are all smaller than that of LSTM, SVM, DWT-LSTM, DWT-SVM, VMD-LSTM and VMD-SVM, and NSCE increased to varying degrees, indicating that the estimating performance of LSTM and SVM are significantly improved when the hyperparameters of LSTM and SVM are optimized by using meta-heuristic algorithms.

Table 4.

The evaluation metrics in the training and testing stages.

Model	Training					Testing
Model	MAE	MAPE (%)	NMSE	RMSE	NSCE	MAE	MAPE (%)	NMSE	RMSE	NSCE
LSTM	14.4930	8.97	0.0140	33.0085	0.9767	32.9698	28.18	0.3047	41.0264	0.8738
SVM	33.9835	18.89	0.0909	45.5818	0.8700	33.4904	28.42	0.1740	40.3111	0.8588
DWT-LSTM	12.1676	7.01	0.0167	31.3245	0.9545	32.5677	25.55	0.1187	36.4157	0.8682
DWT-SVM	32.8495	18.23	0.0850	44.2475	0.8775	32.9807	27.94	0.1665	39.6668	0.8633
VMD-LSTM	12.3580	6.99	0.0120	31.4618	0.9817	35.7024	27.17	0.1261	39.8266	0.8843
VMD-SVM	34.0165	19.10	0.0937	45.4876	0.8705	33.4178	27.46	0.1760	40.2667	0.8591
WOA-LSTM	12.8634	5.74	0.0162	29.7665	0.9845	26.7628	18.98	0.1020	40.6856	0.8851
GWO-SVM	25.3157	15.47	0.0474	34.5420	0.9254	32.7588	24.56	0.1303	40.4901	0.8575
DWT-WOA-LSTM	10.3077	5.43	0.0104	28.5546	0.9966	27.0411	18.90	0.1257	35.3072	0.8578
DWT-GWO-SVM	22.3948	13.76	0.0346	28.8268	0.9479	32.6383	23.77	0.1021	40.2469	0.8593
VMD-WOA-LSTM	11.3376	5.46	0.0102	28.7892	0.9805	26.8135	18.72	0.1048	37.4979	0.8917
VMD-GWO-SVM	29.7499	16.98	0.0544	41.4806	0.8923	25.2081	23.22	0.1122	37.8756	0.8754

Open in a new tab

The evaluation metrics of SVM integrated with GWO are all smaller than that of the LSTM coupled with data pre-processing techniques; The MAPE of hybrid VMD-GWO-SVM is smaller than that of other SVM-based models and LSTM, DWT-LSTM, VMD-LSTM models (Table 4), suggesting that shallow ML models integrated with data pre-processing techniques and meta-heuristic algorithms can be employed to estimate ET. Whereas, the model performance of shallow ML models still needs to be improved. As shown in Table 4, the deep learning models significantly outperformed the shallow ML models, and the model performance of hybrid WOA-LSTM, DWT-WOA-LSTM, and VMD-WOA-LSTM are better than that of the SVM-based hybrid models. Compared with VMD-GWO-SVM, the the MAPE of DWT-WOA-LSTM was decreased from 23.22% to 18.90%, and NSCE was improved from 0.8754 to 0.8578; The MAPE of VMD-WOA-LSTM was decreased from 23.22% to 18.72%, and NSCE was improved from 0.8754 to 0.8917. Those results indicating that the hybrid DWT-WOA-LSTM and VMD-WOA-LSTM models have excellent capability in estimating the monthly ET over other proposed methods. The MAPE of the hybrid VMD-WOA-LSTM models is the smallest among the proposed models, indicating that VMD-WOA-LSTM is the most precise model to estimate the monthly ET among the proposed models. Therefore, VMD-WOA-LSTM could be recommended as an essential auxiliary method to estimate ET in desert regions.

Discussion

As mentioned, the hybrid models based on ML models, data pre-processing techniques and meta-heuristic algorithms were proposed to estimate ET in the study area. Generally, constructing the hybrid ML models is a complex and time-consuming procedure^6,14–18. However, the hybrid ML models have higher accuracy than the physically-based methods^20–23. Therefore, using the hybrid ML models to precisely estimate the ET is a core issue in hydrology and ecology.

The deep learning models generally outperformed the shallow ML models³. However, only using ML models to estimate ET may lead to large errors when the data pre-processing is not performed^7,14–16. As show in Figs. 4 and 5, the collected ET time series exhibit the irregularity and non-stationarity since ET is determined by the meteorological factors and closely related to the soil moisture, geomorphological and vegetation traits in desert regions. Thus, data preprocessing technique should be used to to extract the main variation features from the collected ET time series for obtaining more credible estimating results. The data preprocessing methods such as DWT and VMD can be applied to eliminate the noise from the non-linear and non-stationary time series. Whereas, DWT is sensitive to the selection of threshold, VMD is a more robust and effective data preprocessing method than the empirical mode decomposition (EMD), and EEMD, which can be used to remove the noise from time series by decomposing the nonlinear and nonstationary signal into several band-limited intrinsic mode functions (IMFs)^7,24. Therefore, the denoising performance of DWT is worse than that of VMD (Table 2).

Literature review shows that the ML models integrated with data pre-processing technique is an effective strategy for obtaining more credible estimating results^7,21–23, the results in our research also confirm thi s conclusion. As shown in Tables 3 and 4, the the evaluation metrics average of the ML models integrated with DWT or VMD are all smaller than that of of single SVM and LSTM models. Therefore, extracting useful features by using data preprocessing technique is necessary for more credible estimating results (Supplementary File 1).

The estimating performance of the ML models can also be significantly improved by coupling ML models with meta-heuristic algorithms. To a certain extent, it is more important to obtain the optimal hyperparameters of a ML model than to select the appropriate data preprocessing technique. As shown in Tables 3 and 4, the estimating performance of ML models hybridized with data preprocessing technique were significantly improved when the hyperparameters of LSTM and SVM were optimized. Compared with DWT-SVM, VMD-SVM, DWT-LSTM, and VMD-LSTM, the positive evaluation metrics of DWT-GW0-SVM, VMD-GWO-SVM, DWT-WOA-LSTM and VMD-WOA-LSTM decreased, and the negative indicators increased to varying degrees. Thus, the selection of hyperparameters for ML models was crucially important to improve the estimating performance^6,7,21–29 .

Furthermore, as concluded, the main objectives of this study was to propose a novel hybrid model to estimate ET by integrating the LSTM with WOA and data preprocessing techniques, including DWT and VMD. The estimating performance of VMD-WOA-LSTM was compared with other hybrid models in terms of standard performance metrics. The performance comparison of the proposed models demonstrated that the hybrid VMD-WOA-LSTM model performed better than other ML-based models to estimate the monthly ET in the southeast margins of Tengger Desert.

Conclusions

In this study, the hybrid models based on the VMD, WOA, and LSTM were proposed to estimate ET using the time series analysis methods, where VMD was used to extract the intrinsic traits of ET time series, and WOA was employed to optimize the hyperparameters of LSTM. The model performance was compared with actual ET and estimation of other hybrid models in terms of MAE, MAPE, NMSE, RMSE, and NSCE. The results indicate that VMD-WOA-LSTM has more accurate estimating results than other ML-based models, which can be recommended as an essential auxiliary method to estimate ET in desert regions.

Supplementary Information

Supplementary Information.^{(307.5KB, xlsx)}

Acknowledgements

The research was supported by the Creative Research Groups of China under Grant No. 41621001.

Author contributions

T.F. wrote the main manuscript text, X.L. reviewed and checked themanuscript.All authors reviewed the manuscript.

Data availability

All data analyzed or generated during this study are included in the Supplementary Information, and are available from the corresponding authors upon reasonable request.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-25208-z.

References

1.Keshtegar B, Piri J, Kisi O. A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method. Comput. Electron. Agric. 2016;127:120–130. doi: 10.1016/j.compag.2016.05.018. [DOI] [Google Scholar]
2.Hashemi M, Sepaskhah AR. Evaluation of artificial neural network and Penman-Monteith equation for the prediction of barley standard evapotranspiration in a semi-arid region. Theor. Appl. Climatol. 2020;139:275–285. doi: 10.1007/s00704-019-02966-x. [DOI] [Google Scholar]
3.Majhi B, Naidu D, Mishra AP, et al. Improved prediction of daily pan evaporation using Deep-LSTM model. Neural Comput. Appl. 2019;32:7823. doi: 10.1007/s00521-019-04127-7. [DOI] [Google Scholar]
4.Wen X, Si J, He Z, et al. Support-vector-machine-based models for modeling daily reference evapotranspiration with limited climatic data in extreme arid regions. Water Resour. Manag. 2015;29:3195–3209. doi: 10.1007/s11269-015-0990-2. [DOI] [Google Scholar]
5.Li XR, Zhang ZS, Liu YB, et al. Fundamental Ecohydrology of Ecological Restoration and Recovery in Sand Desert Regions of China. Science Press; 2016. [Google Scholar]
6.Wu LF, Huang GM, Fan JL, et al. Hybrid extreme learning machine with meta-heuristic algorithms for monthly pan evaporation prediction. Comput. Electron. Agr. 2020;168:105–115. doi: 10.1016/j.compag.2019.105115. [DOI] [Google Scholar]
7.Fu TL, Li XR, Jia RL, et al. A novel integrated method based on a machine learning model for estimating evapotranspiration in dryland. J. Hydrol. 2021;603:126881. doi: 10.1016/j.jhydrol.2021.126881. [DOI] [Google Scholar]
8.Zhang DH, Li XR, Zhang F. Effects of rainfall intensity and intermittency on woody vegetation cover and deep soil moisture in dryland ecosystems. J. Hydrol. 2016;543:270–282. doi: 10.1016/j.jhydrol.2016.10.003. [DOI] [Google Scholar]
9.Li XR, Zhang DH, Zhang F, et al. The eco-hydrological threshold for evaluating the stability of sand-binding vegetation in different climatic zones. Ecol. Ind. 2017;83:404–415. doi: 10.1016/j.ecolind.2017.08.005. [DOI] [Google Scholar]
10.Li XR, Jia RL, Zhang ZS, et al. Hydrological response of biological soil crusts to global warming: A ten-year simulative study. Global Change Biol. 2018;24:4960–4971. doi: 10.1111/gcb.14378. [DOI] [PubMed] [Google Scholar]
11.Moazenzadeh R, Mohammadi B, Shamshirband S, et al. Coupling a firefly algorithm with support vector regression to predict evaporation in northern Iran. Eng. Appl. Comput. Fluid Mech. 2018;12(1):584–597. [Google Scholar]
12.Farshad A, Saeid M, Babak M, et al. Application of an artificial intelligence technique enhanced with intelligent water drops for monthly reference evapotranspiration estimation. Agr. Water Manage. 2021;244:106622. doi: 10.1016/j.agwat.2020.106622. [DOI] [Google Scholar]
13.Tikhamarine Y, Malik A, Souag-Gamane D, et al. Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration. Environ. Sci. Pollut. Res. 2020;27:30001–30019. doi: 10.1007/s11356-020-08792-3. [DOI] [PubMed] [Google Scholar]
14.Rezaie-Balf M, Naganna SR, Ghaemi A, et al. Wavelet coupled MARS and M5 model tree approaches for groundwater level forecasting. J. Hydrol. 2017;553:356–373. doi: 10.1016/j.jhydrol.2017.08.006. [DOI] [Google Scholar]
15.Rezaie-Balf M, Kisi O, Chua LH. Application of ensemble empirical mode decomposition based on machine learning methodologies in forecasting monthly pan evaporation. Hydrol. Res. 2019;50(2):498–516. doi: 10.2166/nh.2018.050. [DOI] [Google Scholar]
16.Qasem SN, Samadianfard S, Kheshtgar S, et al. Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates. Eng. Appl. Comput. Fluid Mech. 2019;13(1):177–187. [Google Scholar]
17.Shi L, Feng P, Wang B, et al. Projecting potential evapotranspiration change and quantifying its uncertainty under future climate scenarios: A case study in southeastern Australia. J. Hydrol. 2020;584:124756. doi: 10.1016/j.jhydrol.2020.124756. [DOI] [Google Scholar]
18.Mohammadi B, Mehdizadeh S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 2020;237:106145. doi: 10.1016/j.agwat.2020.106145. [DOI] [Google Scholar]
19.Mirjalili S, Mirjalili SM, Lewis A. Grey wolf optimizer. Adv. Eng. Softw. 2014;69:46–61. doi: 10.1016/j.advengsoft.2013.12.007. [DOI] [Google Scholar]
20.Pan JX, Jing B, Jiao XX, et al. Analysis and application of grey wolf optimizer-long short-term memory. IEEE Access. 2020;8:121460–121468. doi: 10.1109/ACCESS.2020.3006499. [DOI] [Google Scholar]
21.Abdullah SS, Malek MA, Abdullah NS, et al. Extreme learning machines: A new approach for prediction of reference evapotranspiration. J. Hydrol. 2015;527:184–195. doi: 10.1016/j.jhydrol.2015.04.073. [DOI] [Google Scholar]
22.Gocić M, Motamedi S, Shamshirband S, et al. Soft computing approaches for forecasting reference evapotranspiration. Comput. Electron. Agric. 2015;113:164–173. doi: 10.1016/j.compag.2015.02.010. [DOI] [Google Scholar]
23.Pammar L, Deka PC. Daily pan evaporation modeling in climatically contrasting zones with hybridization of wavelet transform and support vector machines. Paddy Water Environ. 2017;15(4):711–722. doi: 10.1007/s10333-016-0571-x. [DOI] [Google Scholar]
24.Dragomiretskiy K, Zosso D. Variational Mode decomposition. IEEE Trans. Signal Proces. 2014;62(3):531–544. doi: 10.1109/TSP.2013.2288675. [DOI] [Google Scholar]
25.Zuo GG, Luo J, Wang N, et al. Decomposition ensemble model based on variational mode decomposition and long short-term memory for streamflow forecasting. J. Hydrol. 2020;585:124776. doi: 10.1016/j.jhydrol.2020.124776. [DOI] [Google Scholar]
26.Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
27.Chen ZJ, Zhu ZC, Jiang H, et al. Estimating daily reference evapotranspiration based on limited meteorological data using deep learning and classical machine learning methods. J. Hydrol. 2020;591:125286. doi: 10.1016/j.jhydrol.2020.125286. [DOI] [Google Scholar]
28.Granata F, Di Nunno F. Forecasting evapotranspiration in different climates using ensembles of recurrent neural networks. Agr. Water Manage. 2021;255:107040. doi: 10.1016/j.agwat.2021.107040. [DOI] [Google Scholar]
29.Shrestha NK, Shukla S. Support vector machine based modeling of evapotranspiration using hydro-climatic variables in a sub-tropical environment. Agric. For. Meteorol. 2015;200:172–184. doi: 10.1016/j.agrformet.2014.09.025. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information.^{(307.5KB, xlsx)}

Data Availability Statement

All data analyzed or generated during this study are included in the Supplementary Information, and are available from the corresponding authors upon reasonable request.

[CR1] 1.Keshtegar B, Piri J, Kisi O. A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method. Comput. Electron. Agric. 2016;127:120–130. doi: 10.1016/j.compag.2016.05.018. [DOI] [Google Scholar]

[CR2] 2.Hashemi M, Sepaskhah AR. Evaluation of artificial neural network and Penman-Monteith equation for the prediction of barley standard evapotranspiration in a semi-arid region. Theor. Appl. Climatol. 2020;139:275–285. doi: 10.1007/s00704-019-02966-x. [DOI] [Google Scholar]

[CR3] 3.Majhi B, Naidu D, Mishra AP, et al. Improved prediction of daily pan evaporation using Deep-LSTM model. Neural Comput. Appl. 2019;32:7823. doi: 10.1007/s00521-019-04127-7. [DOI] [Google Scholar]

[CR4] 4.Wen X, Si J, He Z, et al. Support-vector-machine-based models for modeling daily reference evapotranspiration with limited climatic data in extreme arid regions. Water Resour. Manag. 2015;29:3195–3209. doi: 10.1007/s11269-015-0990-2. [DOI] [Google Scholar]

[CR5] 5.Li XR, Zhang ZS, Liu YB, et al. Fundamental Ecohydrology of Ecological Restoration and Recovery in Sand Desert Regions of China. Science Press; 2016. [Google Scholar]

[CR6] 6.Wu LF, Huang GM, Fan JL, et al. Hybrid extreme learning machine with meta-heuristic algorithms for monthly pan evaporation prediction. Comput. Electron. Agr. 2020;168:105–115. doi: 10.1016/j.compag.2019.105115. [DOI] [Google Scholar]

[CR7] 7.Fu TL, Li XR, Jia RL, et al. A novel integrated method based on a machine learning model for estimating evapotranspiration in dryland. J. Hydrol. 2021;603:126881. doi: 10.1016/j.jhydrol.2021.126881. [DOI] [Google Scholar]

[CR8] 8.Zhang DH, Li XR, Zhang F. Effects of rainfall intensity and intermittency on woody vegetation cover and deep soil moisture in dryland ecosystems. J. Hydrol. 2016;543:270–282. doi: 10.1016/j.jhydrol.2016.10.003. [DOI] [Google Scholar]

[CR9] 9.Li XR, Zhang DH, Zhang F, et al. The eco-hydrological threshold for evaluating the stability of sand-binding vegetation in different climatic zones. Ecol. Ind. 2017;83:404–415. doi: 10.1016/j.ecolind.2017.08.005. [DOI] [Google Scholar]

[CR10] 10.Li XR, Jia RL, Zhang ZS, et al. Hydrological response of biological soil crusts to global warming: A ten-year simulative study. Global Change Biol. 2018;24:4960–4971. doi: 10.1111/gcb.14378. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Moazenzadeh R, Mohammadi B, Shamshirband S, et al. Coupling a firefly algorithm with support vector regression to predict evaporation in northern Iran. Eng. Appl. Comput. Fluid Mech. 2018;12(1):584–597. [Google Scholar]

[CR12] 12.Farshad A, Saeid M, Babak M, et al. Application of an artificial intelligence technique enhanced with intelligent water drops for monthly reference evapotranspiration estimation. Agr. Water Manage. 2021;244:106622. doi: 10.1016/j.agwat.2020.106622. [DOI] [Google Scholar]

[CR13] 13.Tikhamarine Y, Malik A, Souag-Gamane D, et al. Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration. Environ. Sci. Pollut. Res. 2020;27:30001–30019. doi: 10.1007/s11356-020-08792-3. [DOI] [PubMed] [Google Scholar]

[CR14] 14.Rezaie-Balf M, Naganna SR, Ghaemi A, et al. Wavelet coupled MARS and M5 model tree approaches for groundwater level forecasting. J. Hydrol. 2017;553:356–373. doi: 10.1016/j.jhydrol.2017.08.006. [DOI] [Google Scholar]

[CR15] 15.Rezaie-Balf M, Kisi O, Chua LH. Application of ensemble empirical mode decomposition based on machine learning methodologies in forecasting monthly pan evaporation. Hydrol. Res. 2019;50(2):498–516. doi: 10.2166/nh.2018.050. [DOI] [Google Scholar]

[CR16] 16.Qasem SN, Samadianfard S, Kheshtgar S, et al. Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates. Eng. Appl. Comput. Fluid Mech. 2019;13(1):177–187. [Google Scholar]

[CR17] 17.Shi L, Feng P, Wang B, et al. Projecting potential evapotranspiration change and quantifying its uncertainty under future climate scenarios: A case study in southeastern Australia. J. Hydrol. 2020;584:124756. doi: 10.1016/j.jhydrol.2020.124756. [DOI] [Google Scholar]

[CR18] 18.Mohammadi B, Mehdizadeh S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 2020;237:106145. doi: 10.1016/j.agwat.2020.106145. [DOI] [Google Scholar]

[CR19] 19.Mirjalili S, Mirjalili SM, Lewis A. Grey wolf optimizer. Adv. Eng. Softw. 2014;69:46–61. doi: 10.1016/j.advengsoft.2013.12.007. [DOI] [Google Scholar]

[CR20] 20.Pan JX, Jing B, Jiao XX, et al. Analysis and application of grey wolf optimizer-long short-term memory. IEEE Access. 2020;8:121460–121468. doi: 10.1109/ACCESS.2020.3006499. [DOI] [Google Scholar]

[CR21] 21.Abdullah SS, Malek MA, Abdullah NS, et al. Extreme learning machines: A new approach for prediction of reference evapotranspiration. J. Hydrol. 2015;527:184–195. doi: 10.1016/j.jhydrol.2015.04.073. [DOI] [Google Scholar]

[CR22] 22.Gocić M, Motamedi S, Shamshirband S, et al. Soft computing approaches for forecasting reference evapotranspiration. Comput. Electron. Agric. 2015;113:164–173. doi: 10.1016/j.compag.2015.02.010. [DOI] [Google Scholar]

[CR23] 23.Pammar L, Deka PC. Daily pan evaporation modeling in climatically contrasting zones with hybridization of wavelet transform and support vector machines. Paddy Water Environ. 2017;15(4):711–722. doi: 10.1007/s10333-016-0571-x. [DOI] [Google Scholar]

[CR24] 24.Dragomiretskiy K, Zosso D. Variational Mode decomposition. IEEE Trans. Signal Proces. 2014;62(3):531–544. doi: 10.1109/TSP.2013.2288675. [DOI] [Google Scholar]

[CR25] 25.Zuo GG, Luo J, Wang N, et al. Decomposition ensemble model based on variational mode decomposition and long short-term memory for streamflow forecasting. J. Hydrol. 2020;585:124776. doi: 10.1016/j.jhydrol.2020.124776. [DOI] [Google Scholar]

[CR26] 26.Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Chen ZJ, Zhu ZC, Jiang H, et al. Estimating daily reference evapotranspiration based on limited meteorological data using deep learning and classical machine learning methods. J. Hydrol. 2020;591:125286. doi: 10.1016/j.jhydrol.2020.125286. [DOI] [Google Scholar]

[CR28] 28.Granata F, Di Nunno F. Forecasting evapotranspiration in different climates using ensembles of recurrent neural networks. Agr. Water Manage. 2021;255:107040. doi: 10.1016/j.agwat.2021.107040. [DOI] [Google Scholar]

[CR29] 29.Shrestha NK, Shukla S. Support vector machine based modeling of evapotranspiration using hydro-climatic variables in a sub-tropical environment. Agric. For. Meteorol. 2015;200:172–184. doi: 10.1016/j.agrformet.2014.09.025. [DOI] [Google Scholar]

PERMALINK

Hybrid the long short-term memory with whale optimization algorithm and variational mode decomposition for monthly evapotranspiration estimation

Tonglin Fu

Xinrong Li

Abstract

Introduction