Branch error reduction criterion-based signal recursive decomposition and its application to wind power generation forecasting

Fen Xiao; Siyu Yang; Xiao Li; Junhong Ni

doi:10.1371/journal.pone.0299955

. 2024 Mar 22;19(3):e0299955. doi: 10.1371/journal.pone.0299955

Branch error reduction criterion-based signal recursive decomposition and its application to wind power generation forecasting

Fen Xiao ¹, Siyu Yang ¹, Xiao Li ^2,^*, Junhong Ni ²

Editor: Samuel Asante Gyamerah³

PMCID: PMC10959340 PMID: 38517881

Abstract

Due to the ability of sidestepping mode aliasing and endpoint effects, variational mode decomposition (VMD) is usually used as the forecasting module of a hybrid model in time-series forecasting. However, the forecast accuracy of the hybrid model is sensitive to the manually set mode number of VMD; neither underdecomposition (the mode number is too small) nor over-decomposition (the mode number is too large) improves forecasting accuracy. To address this issue, a branch error reduction (BER) criterion is proposed in this study that is based on which a mode number adaptive VMD-based recursive decomposition method is used. This decomposition method is combined with commonly used single forecasting models and applied to the wind power generation forecasting task. Experimental results validate the effectiveness of the proposed combination.

Introduction

The increasing frequency of human activities and rapid development of the social economy increase electricity demand, which drives the growth of the global power generation industry. In order to meet future energy and power demand, adapt to changes in energy supply and demand and environmental situation, and realize long-term sustainable development, it is urgent to strengthen the development and utilization of clean and renewable energy [1]. The utilization and exploration of renewable energy power generation will be one of the important issues in the power industry in the future [2]. Because new energy generation is strongly affected by environmental factors, the time series of power generation is intermittent and volatile, which is not conducive to the stable operation and rational planning of the power system [3]. Therefore, it is critical to develop an effective time series model for power generation forecasting.

Currently, time series models can be roughly divided into three categories: classical statistical models, machine learning models, and hybrid models. The ARIMA model is one of the typical representatives of classical statistical models and has been widely used in load forecasting [4]. Commonly used machine learning models include the back propagation (BP) neural network [5], long short-term memory (LSTM) [6], and support vector network (SVM) [7].

Compared to single models, the hybrid model has better performance when solving complex time series forecasting issues. Hybrid models can be further classified into two subtypes. The first subtype integrates the forecasting results of several single models. Zhang et al. [8] used a genetic algorithm (GA) to optimize the parameters of support vector regression (SVR) in the time series forecasting task. Choi et al. [9] combined CNN and BiLSTM together to handle the strong long memory serial dependence feature of the dataset. The second subtype decomposes the time series into subsignals and sums the forecasting results of the subsignals. Bai et al. [10] decomposed the time series using the wavelet transform (WT) [11] and forecasted future air pollutant concentration measurements with a BP neural network. Zheng et al. [12] and Chen et al. [13] used empirical mode decomposition (EMD) [14] to decompose the electric load and combined LSTM and extreme learning machine (ELM). Qin et al. [15] combined ensemble EMD (EEMD) [16] with local polynomial prediction (LPP) as the final model for the forecasting task. Lv et al. [17] decomposed time series using VMD and used LSTM to forecast power load. Cai et al. [18] proposed a combination of VMD, gated recurrent unit (GRU) and time convolutional network (TCN) to achieve a satisfactory power load forecasting result.

The time series of electricity generation is generally a broadband signal, and its future trend is not stable. Therefore, it is difficult to approximate the relationship between historical measurements and its future changes. The future trend of a narrowband signal is normally considered to be more stable. Therefore, the second type of hybrid model is used to decompose the time series of power generation into narrowband modes, and the final forecasted results are obtained by summarizing the forecasted results of each mode. Among the decomposition methods, WT is nonadaptive, and the selection of the optimal wavelet basis strongly affects the decomposition results. EMD suffers from mode aliasing and endpoint effects. EEMD can overcome the shortcomings of EMD to a certain extent but still requires complex calculations and incompletely neutralizes white noise. VMD [19] is a nonrecursive and robust signal decomposition method and has a solid theoretical basis and circumvents the disadvantages of similar methods.

Therefore, VMD is the best choice for the signal decomposition module in hybrid models. However, VMD is sensitive to the mode number, which must be set manually. A mode number that is too small leads to underdecomposition of the signal, while a mode number that is too large results in overdecomposition. Both underdecomposition and overdecomposition decrease the forecast accuracy of the hybrid model. Therefore, it is important to adaptively determine the optimal mode number for VMD. In many studies, the mode number was adaptively aligned with the number determined by EMD [20–22], but this method cannot mitigate the negative impact of modal aliasing on forecast accuracy. Huang et al. [23] used a genetic algorithm to optimize VMD parameters to reduce the decomposition loss, but the addition of a new algorithm makes the problem complex and inefficient.

To address this issue, we design a branch error reduction criterion, upon which a VMD-based recursive decomposition method with adaptive mode number is proposed.

The primary contributions of this study are as follows:

The BER criterion is designed, and we show that a subsignal that is further decomposed leads to better forecasting accuracy if this subsignal satisfies the BER criterion.
A mode number adaptive VMD-based recursive decomposition method is proposed. The hybrid model that combines the proposed decomposition method and commonly used forecasting single model is used to fulfill the wind power generation forecasting task.
Experimental results validate that the proposed VMD-based recursive decomposition method can effectively extract the fluctuation patterns of wind power and improve the forecast accuracy.

The remainder of this paper is organized as follows: Principles of VMD and permutation entropy; Derivation of the branch error reduction criterion and the VMD-based recursive decomposition method; Experiments that verified the effectiveness of the proposed method; Summary of the paper.

VMD and permutation entropy

VMD

VMD assumes that all components are narrowband signals concentrated around their respective center frequencies; thus, VMD constructs constrained optimization problems based on narrowband conditions of components to estimate the center frequency of subsignals and reconstruct corresponding components [12].

Under the assumption that the original signal f(t) is decomposed into K subsignals and the decomposition sequence is guaranteed to be a mode component with a finite bandwidth around a central frequency, the sum of the estimated bandwidth of each mode is minimized, and the constraint is that the sum of all modes is equal to the original signal. Then, the variational model can be formulated as:

\begin{matrix} {\begin{matrix} min_{{u_{k}}, {ω_{k}}} {\sum_{k} | | \partial_{t} [(δ (t) + \frac{j}{π t}) * u_{k} (t)] e^{- j ω_{k} t} {| |}_{2}^{2}} \\ s . t . \sum_{k} u_{k} (t) = f (t) \end{matrix} \end{matrix}

(1)

where u_k(t) is the mode function; ω_k is the mode center frequency; K is the number of modes; δ is the Dirac function; * is the convolution calculator; and f(t) is the input signal.

The Lagrange multiplier λ(t) and the quadratic penalty factor α are introduced to transform the constrained algorithm into an unconstrained variational problem:

\begin{matrix} \begin{matrix} L ({u_{k} (t)}, {ω_{k}}, λ (t)) = α \sum_{k} | | \partial_{t} [(δ (t) + \frac{j}{π t}) * u_{k} (t)] e^{- j ω_{k} t} {| |}_{2}^{2} \\ + | | g (t) - \sum_{k} u_{k} (t) {| |}_{2}^{2} + ⟨ λ (t), g (t) - \sum_{k} u_{k} (t) ⟩ \end{matrix} \end{matrix}

(2)

The optimal solution of the variational problem is obtained by iteratively updating $u_{k}^{n + 1} (t)$ , $ω_{k}^{n + 1} (t)$ and $λ_{k}^{n + 1} (t)$ using the alternating direction method of the multipliers. In this study, the iterative process of the Fourier transform of u_k(t), ω_k and λ(t) can be expressed as:

\begin{matrix} {\hat{u}}_{k}^{n + 1} (ω) = \frac{\hat{f} (ω) - \sum_{i \neq k} \hat{u_{i}} (ω) + \frac{\hat{λ} (ω)}{2}}{1 + 2 α {(ω - ω_{k})}^{2}} \end{matrix}

(3)

\begin{matrix} ω_{k}^{n + 1} (ω) = \frac{\int_{0}^{\infty} ω | {\hat{u}}_{k} (ω) |^{2} d ω}{\int_{0}^{\infty} | {\hat{u}}_{k} (ω) |^{2} d ω} \end{matrix}

(4)

\begin{matrix} {\hat{λ}}^{n + 1} (ω) \leftarrow {\hat{λ}}^{n} (ω) + η [\hat{f} (ω) - \sum_{k} {\hat{u}}_{k}^{n + 1} (ω)] \end{matrix}

(5)

where η is the noise tolerance of the signal.

As a decomposition algorithm widely used in signal processing, VMD effectively overcomes the problems of mode aliasing and endpoint effects; thus, it is often combined with forecasting models to form hybrid models for time series forecasting. The number of decomposition modes is a key parameter that must be set manually when using VMD, which is critical to the forecasting results. For example, daily power generation is the result of multiple factors, and its time series is a combination of several modes with different vibration frequencies. Underdecomposition of daily power generation series cannot accurately separate each mode, resulting in the overlap of modes with different fluctuation patterns, which affects the accuracy of the final results, while in the case of overdecomposition, the increase in the mode number corresponds to the growth of computation and training time, which damages or even cancels the advantage of the stable future trend of some subsignals. Therefore, the effective determination of the optimal mode number of VMD becomes a critical problem to solve. Table 1 shows the final forecasting errors obtained by decomposing the daily power generation data in 2020 according to different mode numbers under the three single forecasting models.

Table 1. Decomposition results of daily power generation data in 2020 under different mode numbers.

Model	6	7	8	9	10	11	12	13
ELM	26906.86	17269.87	9553.03	12591.53	37514.39	77360.43	-	-
LSTM	-	-	23629.83	1300.68	809.20	490.02	1439.95	1803.80
LSSVM	-	370.82	237.62	208.83	195.28	176.50	153.20	159.09

Open in a new tab

Table 1 shows that the forecasting error varies strongly with an increase in the number of modes, which highlights the importance of the choice of the mode number for accurate forecasting. The trend of the variation in the normalization error corresponding to the three hybrid models is shown more intuitively in Fig 1, which means that the optimal mode number differs for different forecasting models, and this parameter cannot be derived empirically or by simple data processing. Therefore, a method that can effectively determine the optimal mode number corresponding to different forecasting models is important to develop.

Permutation entropy

During the process of VMD signal decomposition, a residual fraction (RF) will be generated that contains more random noise but may also have part of the information of the original series; thus, the permutation entropy (PE) criterion is considered as the basis for filtering the residual component in this paper. PE was proposed by Bandt et al. [24] in 2002 to detect the randomness of time series and is suitable to analyze nonstationary signals with good robustness. The entropy of a signal determines its random degree: the larger the entropy, the more random the signal is. Therefore, RF can be detected by PE. The calculation steps are as follows.

We denote a time series as S = {s(1), s(2), ⋯, s(n)} and obtain the matrix after space reconstruction:

\begin{matrix} S^{'} = [\begin{matrix} s (1) & s (1 + τ) & \dots & s (1 + (m - 1) τ) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ s (i) & s (i + τ) & \dots & s (i + (m - 1) τ) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ s (n - (m - 1) τ) & s (n - (m - 1) τ) & \dots & s (n) \end{matrix}] \end{matrix}

(6)

where τ is the delay time and m is the embedding dimension. Rearranging the reconstructed matrix in ascending order yields:

\begin{matrix} s (i + (j_{1} - 1) τ) \leq s (i + (j_{2} - 1) τ) \leq \dots \leq s (i + (j_{m} - 1) τ) \end{matrix}

(7)

where j₁, j₂, ⋯, j_m are the index values of elements in the reconstructed component.

For any segment s_i, a set of symbol sequences {j₁, j₂, ⋯, j_m} can be obtained; thus, there are different symbol sequences mapped from dimensional space. Calculating the probability of each symbol sequence, PE can be defined as:

\begin{matrix} H_{P} (m) = - \sum_{j = 1}^{N - (m - 1) τ} P_{j} ln P_{j} \end{matrix}

(8)

where PE reaches its maximum ln(m!) when P_j = 1/m!. In the real process, normalization is usually performed:

\begin{matrix} 0 \leq H_{P} = H_{P} / ln (m!) \leq 1 \end{matrix}

(9)

PE thus describes the randomness of the time series. By calculating the entropy of RF, the components with a larger proportion of noise are eliminated to reduce random noise.

VMD-based signal recursive decomposition

Based on the shortcomings of VMD, we propose a mode number adaptive VMD-based recursive decomposition method, which is expected to automatically calculate the corresponding optimal mode number when combining different forecasting models.

Branch error reduction criterion

BER uses the mean absolute error (MAE) as an expression of test error and determines whether further decomposition is required. If the sum of the subsignals’ MAE is less than the MAE before decomposition, the decomposition is meaningful.

The criterion is based on the following theorem:

Theorem 1. The total testing error decreases when the sum of the errors of the child branches is smaller than the error of the parent branch.

Proof:

We assume that V = [V₁, V₂, ⋯, V_N] is testing data, where V^(k) is the kth subsignal of V and its MAE is e^(k); V^(k,q) is the qth subsignal of V^(k) and its MAE is e^{(k, q)}. If the testing error before and after decomposition satisfies:

\begin{matrix} \sum_{q} e^{(k_{0}, q)} < e^{(k_{0})} \end{matrix}

(10)

then it can also be expressed as:

\begin{matrix} \frac{1}{N} \sum_{q, t} | {\hat{V}}_{t}^{(k_{0}, q)} - V_{t}^{(k_{0}, q)} | < \frac{1}{N} \sum_{t} | {\hat{V}}_{t}^{(k_{0})} - V_{t}^{(k_{0})} | \end{matrix}

(11)

We thus extend Eq (11) as:

\begin{matrix} \begin{matrix} \frac{1}{N} \sum_{q, t} | {\hat{x}}_{t}^{(k_{0}, q)} - x_{t}^{(k_{0}, q)} | + \frac{1}{N} \sum_{k \neq k_{0}, t} | {\hat{V}}_{t}^{(k)} - V_{t}^{(k)} | \\ < \frac{1}{N} \sum_{t} | {\hat{V}}_{t}^{(k_{0})} - V_{t}^{(k_{0})} | + \frac{1}{N} \sum_{k \neq k_{0}, t} | {\hat{V}}_{t}^{(k)} - V_{t}^{(k)} | \end{matrix} \end{matrix}

(12)

Expressing Eq (12) in another form:

\begin{matrix} e = \sum_{q} e^{(k_{0}, q)} + \sum_{k \neq k_{0}} e^{(k)} < e^{(k_{0})} + \sum_{k \neq k_{0}} e^{(k)} = \sum_{k} e^{(k)} \end{matrix}

(13)

where ${\hat{V}}_{t}^{(k)}$ and ${\hat{V}}_{t}^{(k_{0}, q)}$ are the forecasting values of and, respectively.

This derivation shows that if Theorem 1 is satisfied, the total testing error after decomposition is reduced, which proves that the decomposition is meaningful.

BER-based decomposition

Based on Theorem 1, we propose a recursive signal decomposition method based on BER. The specific decomposition process of this method is as follows.

Step 1: The original time series is decomposed with VMD at the first level, the number of decompositions is set to, and the subsignals are input into each forecasting model to obtain the corresponding testing error.
Step 2: Decompose and forecast the subsignals in the next level and set the number of decompositions to.
Step 3: Check if the error before and after decomposition satisfies the BER criterion. If the error decreases, the decomposition is retained, and the operation is repeated in Step 2; otherwise, the decomposition is invalid and terminates.

A flow diagram of the decomposition method is shown in Fig 2.

Wind power generation data are decomposed for the first time by VMD, and the mode number K₀ is obtained via a simple calculation and is a small value in the range of the optimal mode number of different forecasting models. K′ is selected to ensure the initial decomposition of the data without overdecomposition. In this study, is set to 2. In addition, the subsignals of the final output are through different levels of decomposition in most cases.

The hybrid model used in this study is shown in Fig 3. In this model, the BER-based decomposition method is used as the signal decomposition module to first decompose the historical wind power generation data into subsignals adaptively. Then, these subsignals are fed into the different models to realize the forecasting task, and finally, the forecast results of the subsignals are superimposed to obtain the predicted power generation. In the hybrid models used in this study, the forecasting models can use statistical, machine learning, or deep learning models such as LSSVM, ELM, DBN, LSTM, etc. In addition, VMD decomposes the signal into multiple narrowband components and a residual component, which may contain the high-frequency component of the original signal; thus, directly discarding the residual component may cause the loss of the high-frequency component and affect the forecast accuracy. Thus, we choose PE as a measure of signal randomness and set a threshold θ = 1 to filter the residual components of each decomposition.

Data experiments

Power generation datasets

In this example, the historical data of wind power generation in Fujian Province from 2020 to 2021 are selected for the experiment. The sampling interval of this dataset is 1 h, with a total of 731 wind power generation datasets, as shown in Fig 4. The first ten months are selected as the training set, and the last two months are selected as the testing set.

As shown in Fig 4, the overall wind power generation in Fujian Province increases from 2020 to 2021 and shows a marked seasonal fluctuation. Correspondingly, the stationarity of the series is poor, which can also be seen from the Augmented Dickey-Fuller (ADF) Test. The ADF test results are all greater than critical values, which proves that the series is non-stationary and must be further processed. The statistics of the wind power generation by season are shown in Table 2 and Fig 5, where January-March for spring, April-June for summer, July September for autumn, and October-December for winter.

Table 2. Analysis of the daily power generation datasets.

Year	Season	Maximum	Minimum	Average	Standard deviation	Kurtosis	Skewness	ADF
2020	Ayear	94722.37	37482.15	70535.15	13445.52	-0.60017	-0.07689	-1.53987
	Spring	71441.16	37482.15	53559.38	10556.19	0.016509	-1.50993	-2.61722
	Summer	87595.2	53401.02	70531.12	7395.71	0.582071	0.463894	-1.84112
	Autumn	94722.37	52945.79	80081.74	9622.918	-0.52217	-0.7652	2.74015
	Winter	82782.82	70575.13	75600.89	2788.615	0.576969	0.12333	-1.64434
2021	Ayear	101772.5	41920.3	79122.76	11324.93	-0.56403	0.816906	-2.09905
	Spring	83659.18	41920.3	68567.24	11489.91	-0.81146	-0.51751	-2.49696
	Summer	98046.13	62004.29	80890.61	6974.329	0.09976	0.783972	-3.22889
	Autumn	101772.5	69698.15	87854.12	8768.691	-0.72712	-0.66269	-1.82974
	Winter	82359.6	70116.29	74472.17	2405.469	1.02896	1.781289	-2.29268

Open in a new tab

The wind power generation datasets in 2021 show an overall increase compared to 2020 and are primarily reflected in the three seasons of spring, summer, and autumn; the standard deviation in 2021 is relatively small and stable overall. In addition, the seasonal differences within a year are strong, showing high power generation in summer and autumn, and low power generation in spring and autumn; the standard deviation in winter is maintained at a low level, while the variation in spring is more intense and random.

Evaluation indicators

Error indicators

In this study, the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) are selected as evaluation indicators:

\begin{matrix} M A E = \frac{1}{N} \sum_{t = 1}^{N} | V_{t}^{a} - V_{t}^{f} | \end{matrix}

(14)

\begin{matrix} R M S E = \sqrt{\frac{1}{N} \sum_{t = 1}^{N} {(V_{t}^{a} - V_{t}^{f})}^{2}} \end{matrix}

(15)

\begin{matrix} M A P E = \frac{1}{N} \sum_{t = 1}^{N} | \frac{V_{t}^{a} - V_{t}^{f}}{V_{t}^{f}} | \times 100 % \end{matrix}

(16)

where N is the length of the series, $V_{t}^{a}$ is the real series, and $V_{t}^{f}$ is the forecasting series.

Improvement indicators

To evaluate the improvement of the proposed decomposition method compared with other methods, the following formula is used as improvement indicators based on the above error indicators:

\begin{matrix} P = \frac{e_{1} - e_{0}}{e_{1}} \times 100 % \end{matrix}

(17)

where P is the percentage of error reduction, e₁ and e₀ represent the error of the proposed method and comparative experiment, respectively.

Comparative experiments

Decomposition according to the center frequency

There is still a lack of general guidelines for the selection of the mode number [25]. Among the traditional methods of determining the mode number, the more intuitive and simple method is to observe whether the center frequency is aliased [26]. The size of K is increased from K = 2 to observe the distribution of the center frequency. The center frequencies for different values of K are shown in Table 3.

Table 3. Central frequency of the first decomposition.

K	Center Frequency
3	0.000022	0.0135	0.128	-	-	-	-
4	0.000019	0.0112	0.0429	0.1352	-	-	-
5	0.000018	0.0112	0.0429	0.1351	0.4217	-	-
6	0.000018	0.0111	0.0417	0.1224	0.1828	0.4236	-
7	0.000018	0.0108	0.0374	0.0784	0.1403	0.2747	0.427

Open in a new tab

Table 3 shows that when the mode number is above 5, the center frequency of the last mode component always remains relatively stable. If K is continuously increased, the more layers of decompostion there are, the smaller the interval of the center frequencies of each component will be, and the more likely it is to generate additional noise components because the center frequency of the last layer remains unchanged. Thus, the optimal value of the mode number for the first decomposition is 5. Fig 6 shows the decomposed signal curves of the daily power generation data in 2020 and 2021, where IMF1-IMF5 are the narrowband components and RF is the residual component. From top to bottom, the curve vibration frequency becomes increasingly intense and irregular.

Similarly, a second decomposition is performed for all components according to the central frequencies to obtain the central frequencies at K = 2, 3, 4. The central frequencies obtained by performing this operation for the power generation datasets in 2020 and 2021 are shown in Table 4. The values of K for the second decomposition are 3, 3, 3, 3, 1, and 3 in 2020, and 3, 2, 3, 2, 1, and 3 in 2021. The central frequencies of the fifth component are already aliased at K = 2; thus, the second decomposition is not performed.

Table 4. Central frequencies of the second decomposition in 2020 and 2021.

Year	Subsignal	K = 2	K = 3	K = 4
2020	1	1.82E-05; 0.0319	3.02E-06; 0.0025; 0.0338	2.79E-06; 0.0307; 0.0489; 0.0024
	2	0.0092; 0.0301	0.0081; 0.0120; 0.0323	0.0105; 0.0295; 0.0452; 0.0066
	3	0.1019; 0.1213	0.0751; 0.1055; 0.1239	0.0750; 0.1051; 0.1214; 0.1394
	4	0.1551; 0.1823	0.1517; 0.1753; 0.1944	0.1512; 0.1738; 0.1983; 0.1854
	5	0.4105; 0.4298	0.3945; 0.4308; 0.4180	-
2021	1	9.40E-07; 0.0030	8.67E-07; 0.0404; 0.0030	5.77E-07; 0.0028; 0.0076; 0.0411
	2	0.014; 0.034	0.010; 0.037; 0.018	-
	3	0.114; 0.140	0.113; 0.138; 0.161	0.095; 0.139; 0.165; 0.119
	4	0.258; 0.287	0.251; 0.270; 0.289	0.230; 0.271; 0.255; 0.289
	5	0.432; 0.409	-	-

Open in a new tab

Decomposition according to BER criterion

In a deeper decomposition of the original wind power generation series based on BER, the final decomposition results differ markedly from the decomposition according to the central frequency, and different forecasting models correspond to different optimal decompositions. Taking LSSVM as the forecasting model as an example, the decomposition process of the power generation series in 2020 is shown in Fig 7.

With LSSVM as the forecasting model, four components can be decomposed for the second time, and eleven subsignals can be decomposed for the third time. According to the same decomposition process, the optimal decomposition is performed under the forecasting models of ELM, LSTM, and DBN, and the percentage decrease of the error sum corresponding to the latter two decompositions is shown in Table 5. With the gradual depth of decomposition, the error sum decreases, satisfying the BER criterion.

Table 5. Percentage decrease in the error sum.

year	decomposition level	LSTM	ELM	LSSVM	DBN
2020	2	-5.37%	-31.90%	-29.22%	-33.62%
2020	3	-7.57%	-47.41%	-54.91%	-44.49%
2021	2	-28.61%	-35.06%	-32.29%	-11.71%
2021	3	-41.67%	-41.55%	-43.87%	-17.93%

Open in a new tab

From this data, RF is shown to produce the largest prediction variations and testing errors. RF contains more noise but also preserves some information about the original series; thus, we measure whether the RF and the subsignals obtained from the second decomposition of RF should be retained by PE. Table 6 shows the PE of the above components and compares it with the entropy of the five narrowband components acquired from the first decomposition, which is referred to as avg. IMF.

Table 6. PE of each component.

Year	avg. IMF	first decomposition	second decomposition
Year	avg. IMF	RF	RF1	RF2	RF3
2020	0.4479	0.6085	0.4235	0.5337	0.6179
2021	0.4451	0.6149	0.4478	0.5421	0.6206

Open in a new tab

Table 6 shows that although most of the values of several residual components are greater than the avg. IMF, they do not exceed the threshold; thus, no component must be removed. Also, after the second decomposition of the residual components, the PE of the first two secondary components of the decomposition is significantly reduced, tending to the range of the narrowband components, while the PE of the secondary residual components has increased to a certain extent, indicating that the noise components are more obviously separated after the second decomposition. In the experiments, the secondary residual component with the largest value of PE is removed as an independent comparison experiment.

Thirteen comparison experiments of different decomposition patterns are conducted for each forecasting model separately, all of which divided the original series into five narrowband components and one residual component at the first level of decomposition. The experiments are divided into five groups, and the variables between groups are the number of decomposition layers and basis; the variables within groups are the treatment methods of the residual components. In the last group of experiments, a comparison experiment of removing secondary residual components is conducted. Therefore, when comparisons are made between groups, the best-performing type of experimental data within the group is selected in all cases:

A. Direct secondary decomposition
- Decomposing the six components according to K = 2.
- Decomposing the six components according to K = 3.
B. Secondary decomposition according to the central frequency
- Decomposing five narrowband components according to the center frequency, the residual components are not subjected to the second level of decomposition.
- Decomposing five narrowband components according to the center frequency, the residual components are decomposed according to K = 2.
- Decomposing five narrowband components according to the center frequency, the residual components are decomposed according to K = 3.
C. Secondary decomposition according to the BER criterion
- Five narrowband components are decomposed for a second time according to the BER criterion, and the residual components are not subjected to the second level of decomposition.
- Five narrowband components are decomposed for a second time according to the BER criterion, and the residual components are decomposed according to K = 2.
- Five narrowband components are decomposed for a second time according to the BER criterion, and the residual components are decomposed according to K = 3.
D. Third decomposition according to the BER criterion
- Five narrowband components are decomposed three times according to the BER criterion, and the residual components are not subjected to the second level of decomposition.
- Five narrowband components are decomposed three times according to the BER criterion, and the residual components are decomposed according to K = 2.
- Five narrowband components are decomposed three times according to the BER criterion, and the residual components are decomposed according to K = 3.
- Decomposing all components three times according to the BER criterion.
E. Third decomposition and removal of secondary residual components
- Based on the optimal decomposition, the secondary residual components are removed.

Experimental results and analysis

1. Comparison Experiment I

Group A and Group B are measured by three error indicators and compared with the first level of decomposition. The forecasting results of each model are shown in Table 7.

Table 7. Forecasting results of Group A and Group B.

Indicator	Model	2020			2021
Indicator	Model	First level	Group A	Group B	First level	Group A	Group B
MAE	LSSVM	573.1176	197.3233	190.7865	489.8285	320.3984	320.3929
	ELM	1958.7979	32688.73	4056138	1300.4613	587.2526	1465.8817
	DBN	499.3744	244.252	202.6702	431.1148	225.0884	213.6698
	LSTM	3916.722	3912.2559	1023.958	1774.359	992.1692	625.4075
RMSE	LSSVM	706.9316	255.4071	247.16437	631.1439	403.0863	404.7089
	ELM	2576.0908	41155.958	5326.8536	1561.6901	881.4671	1882.439
	DBN	623.8845	314.3993	279.9758	548.7777	300.3513	293.8501
	LSTM	4243.249	3998.325	1304.2325	2190.5804	1168.814	768.2987
MAPE(%)	LSSVM	0.7596	0.2613	0.2523	0.6549	0.4267	0.4276
	ELM	2.5915	43.4915	1465.8817	1.7603	0.7773	1.9743
	DBN	0.619	0.3591	0.2651	0.5756	0.2997	0.2839
	LSTM	5.2461	5.2043	1.3706	2.344	1.3429	0.8397

Open in a new tab

The results in Table 7 show the following:

The experimental errors of direct secondary decomposition and secondary decomposition according to the central frequency are better than those of first-level decomposition in most instances, but there are also cases where the accuracy is extremely poor. These results indicate that deeper levels of decomposition do not equate to more accurate results.
In the experiments conducted on both the 2020 and 2021 datasets, the mode number of Group B is more than that in Group A. In the experiments based on LSTM, the error of Group B has markedly decreased compared with that of Group A. However, in the experiments based on other models, the advantage of Group B is not large, and there is even one large error. Thus, more modes do not equate to less error with the same decomposition levels. In addition, both decomposition methods have certain drawbacks and great limitations in reducing the forecast accuracy.

2. Comparison experiment II

In the experiments based on the BER criterion, only some subsignals satisfy the condition of the third decomposition. Therefore, in addition to the experimental results of Group C and Group D, another comparison experiment is set up for all the subsignals obtained by the second decomposition to be decomposed for the third time to verify the accuracy and superior performance of the recursive decomposition method based on the BER criterion. Experimental results are shown in Table 8.

Table 8. Experimental error results for the third and fourth groups.

Indicator	Model	2020			2021
Indicator	Model	Group C	Group D	Comparative data	Group C	Group D	Comparative Data
MAE	LSSVM	204.9825	159.527	161.651	315.0771	251.2095	253.226
	ELM	1709.5477	1595.1924	1951437	644.626	558.1538	1235.4259
	DBN	286.868	229.1196	278.1702	290.0355	225.9414	259.521
	LSTM	3627.924	3592.7705	75903	978.1682	882.723	1697.3942
RMSE	LSSVM	263.7048	193.667	197.4948	396.1654	321.8135	323.3518
	ELM	2245.8487	1969.1107	2200894	789.1289	686.8216	1491.9478
	DBN	332.7216	267.1993	356.7122	359.461	288.6117	326.7534
	LSTM	3705.5479	3676.57	75908	1142.9699	1100.5448	1974.204
MAPE(%)	LSSVM	0.2718	0.2122	0.2153	0.4197	0.3352	0.338
	ELM	2.2741	2.0964	2584.696	0.87	0.7536	1.6566
	DBN	0.3796	0.3037	0.3504	0.3853	0.2996	0.3444
	LSTM	4.8301	4.7808	100.702	1.328	1.1768	2.3134

Open in a new tab

The errors of Group D and the comparison data in Table 8 show that a deeper decomposition without satisfying the BER criterion is likely to cause an error explosion. The error comparing Group C and Group D following the BER criterion with the first-level decomposition is shown in Table 9 and Fig 8. Experimental error decreases to some extent for each level of decomposition for all forecasting models. Combined with the previous conclusion that the number of decomposition levels and modes is not proportional to the reduction in forecasting error, the superiority of the BER criterion in decomposing time series is further shown.

Table 9. Percentage reduction for secondary and tertiary decomposition.

Year	Decomposition Level	LSSVM	ELM	DBN	LSTM
2020	2	-64.23%	-12.72%	-42.55%	-7.37%
2020	3	-72.17%	-18.56%	-54.12%	-8.27%
2021	2	-35.68%	-50.43%	-32.72%	-44.87%
2021	3	-48.71%	-57.08%	-47.59%	-50.25%

Open in a new tab

3. Comparison experiment III

To describe the influence of the residual components on the forecast accuracy, the secondary residual component with the largest PE is chosen to be discarded as a comparison experiment. The relative percentage decrease in the error for Group E compared with Group D is shown in Table 10.

Table 10. Percentage decrease in the error of discarding secondary residual components.

Models	2021			2021
Models	MAE	RMSE	MAPE	MAE	RMSE	MAPE
LSSVM	-52.19%	-48.64%	-52.32%	-75.06%	-76.11%	-74.98%
ELM	-6.10%	-1.05%	-6.08%	-18.66%	-20.72%	-18.60%
DBN	-44.11%	-42.68%	-43.91%	1.92%	-12.37%	3.65%
LSTM	-0.29%	-0.86%	-0.51%	-10.08%	-11.97%	-9.95%

Open in a new tab

As shown in Table 10, the errors of the four groups of experiments decreased to a certain extent after the removal of the secondary residual components. Fig 9 shows the average percentage of MAE deletion. The experiments with LSSVM as the forecasting model show larger decreases after removing the residual components, indicating that the decomposition of wind power generation series is more accurate in this experiment, and the separation of noise in the series is more successful.

Summary

These comparative experiments and data results show the following:

To achieve better separation for higher forecasting accuracy, blind decomposition is undesirable. Neither deep decomposition levels nor a large number of modes is equivalent to a small error.
Traditional decomposition methods are more random in terms of validity and effectiveness, which is inappropriate as a criterion for judging the mode number.
Recursive decomposition based on BER has a complete mathematical derivation process and shows stability in the real forecasting process, which is more objective than other decomposition methods.

Conclusions

In this paper, we propose a recursive decomposition method based on the branch error reduction criterion to decompose wind power generation into more regular and easily trained multiple modes. Four forecasting models, LSSVM, ELM, DBN, and LSTM, are used for forecasting, and the superior performance of the proposed decomposition method is primarily shown in the following results:

Taking full advantage of VMD, the proposed method can decompose the original time series into subcomponents with more easily captured features.
The branch error reduction criterion is supported by mathematical theory, which improves the reliability and robustness of the overall model.
The ambiguous judgment method is abandoned, and a mathematical guideline is used to facilitate program integration and modular design of signal decomposition.

Because the trend of power generation and the influencing factors change over time, the distribution of the dataset used for training is not consistent with the new data, resulting in the previous model not being able to forecast the present data at a high level of accuracy, which means there is a distribution drift phenomenon. To improve model generalizability and make the distribution of training and testing data as consistent as possible, the distribution drift will be improved in the future based on the existing research using the sample weighting strategy to improve prediction accuracy.

Supporting information

S1 Data

(CSV)

pone.0299955.s001.csv^{(14.2KB, csv)}

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

Funding information: State Grid Fujian Electric Power Co. Ltd.: SGTYHT/20-JS-223(SGFJJY00GHJS2200054) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Wu CY, Wang JZ, Hao Y. Deterministic and uncertainty crude oil price forecasting based on outlier detection and modified multi-objective optimization algorithm. Resources Policy. 2022. Aug;102780. doi: 10.1016/j.resourpol.2022.102780 [DOI] [Google Scholar]
2. Şahin Utkucan. Projections of Turkey’s electricity generation and installed capacity from total renewable and hydro energy using fractional nonlinear grey Bernoulli model and its reduced forms. Sustainable Production and Consumption. 2020. Jul;23:52–62. [Google Scholar]
3.Shi XW, Shi XF, Dong WQ, Zang P, Jia HY, Wu JF, et al. Research on Energy Storage Configuration Method Based on Wind and Solar Volatility. 2020 10th International Conference on Power and Energy Systems (ICPES). 2020 Dec;464–468.
4.Elsaraiti M, Ali G, Musbah H, Merabet A, Little T. Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model. 2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021. 2021 Jun;259-262.
5. Yi M, Xie W, Mo L. Short-Term Electricity Price Forecasting Based on BP Neural Network Optimized by SAPSO. Energies. 2021. Oct;14(20):6514. doi: 10.3390/en14206514 [DOI] [Google Scholar]
6. Kong WC, Dong ZY, Hill DJ, Luo FJ, Xu Y. Short-term residential load forecasting based on resident behavior learning. IEEE Transactions on Power Systems. 2018. Jan;33(1):1087–1088. doi: 10.1109/TPWRS.2017.2688178 [DOI] [Google Scholar]
7. Zhang XB, Wang JZ, Zhang KQ. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Electric Power Systems Research. 2017. May;146:270–285. doi: 10.1016/j.epsr.2017.01.035 [DOI] [Google Scholar]
8. Zhang GQ, Guo JF. A novel method for hourly electricity demand forecasting. IEEE Transactions on Power Systems. 2019. Mar;35(2):1351–1363. doi: 10.1109/TPWRS.2021.3098960 [DOI] [Google Scholar]
9. Choi JE, Dong WS. Parallel architecture of CNN-bidirectional LSTMs for implied volatility forecast. Journal of Forecasting. 2021. Feb;41(6):1087–1098. doi: 10.1002/for.2844 [DOI] [Google Scholar]
10. Bai Y, Li Y, Wang XX, Xie JJ, Li C. Air Pollutants Concentrations Forecasting Using Back Propagation Neural Network Based on Wavelet Decomposition with Meteorological Conditions. Atmospheric Pollution Research. 2016. May;7(3):557–566. doi: 10.1016/j.apr.2016.01.004 [DOI] [Google Scholar]
11. Mallat SG. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1989. Jul.;11(7):674–693. doi: 10.1109/34.192463 [DOI] [Google Scholar]
12. Zheng HT, Yuan JB, Chen L. Short-Term load forecasting using EMD-LSTM neural networks with a Xgboost algorithm for feature importance evaluation. Energies. 2017. Aug;10(8):1168. doi: 10.3390/en10081168 [DOI] [Google Scholar]
13. Chen YH, Kloft M, Yang Y, Li CH, Li L. Mixed kernel based extreme learning machine for electric load forecasting. Neurocomputing. 2018. Oct;312:90–106. doi: 10.1016/j.neucom.2018.05.068 [DOI] [Google Scholar]
14. Huang ME, Shen Z, Long SR, Wu MLC, Shih HH, Zheng QN, et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society A-Mathematical Physical and Engineering Sciences. 2020. Mar;454:903–995. doi: 10.1098/rspa.1998.0193 [DOI] [Google Scholar]
15. Qin QD, He HD, Li L, He LY. A Novel Decomposition-Ensemble Based Carbon Price Forecasting Model Integrated with Local Polynomial Prediction. Computational Economics. 2020. Apr;55(4):1249–1273. doi: 10.1007/s10614-018-9862-1 [DOI] [Google Scholar]
16. Wu ZH, Huang NE. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Advances in adaptive data analysis. 2009. Jan;1(1):1–41. doi: 10.1142/S1793536909000047 [DOI] [Google Scholar]
17. Lv LL, Wu ZY, Zhang JH, Zhang L, Tan ZY, Tian ZH. A VMD and LSTM Based Hybrid Model of Load Forecasting for Power Grid Security. IEEE Transactions on Industrial Informatics. 2022. Sep;18(9):6474–6482. doi: 10.1109/TII.2021.3130237 [DOI] [Google Scholar]
18. Cai C, Li Y, Su ZH, Zhu TQ, He YY. Short-Term Electrical Load Forecasting Based on VMD and GRU-TCN Hybrid Network. Applied Sciences. 2022. Jul;12(13):6647. doi: 10.3390/app12136647 [DOI] [Google Scholar]
19. Dragomiretskiy K, Zosso D. Variational Mode Decomposition. IEEE Transactions on Signal Processing. 2014. Feb;62(3):531–543. doi: 10.1109/TSP.2013.2288675 [DOI] [Google Scholar]
20. Liu W, Cao SY, Wang ZM, Kong XZ, Chen YK. Spectral decomposition for hydrocarbon detection based on VMD and teager-kaiser energy. Geoscience and Remote Sensing Letters. 2017. Apr;14(4):539–543. doi: 10.1109/LGRS.2017.2656158 [DOI] [Google Scholar]
21. Lahmiri S. Comparing variational and empirical mode decomposition in forecasting day-ahead energy prices. Systems Journal. 2017. Sep;11(3):1907–1910. [Google Scholar]
22. Yang WX, Peng ZK, Wei KX, Shi P, and Tian WY. Superiorities of variational mode decomposition over empirical mode decomposition particularly in time-frequency feature extraction and wind turbine condition monitoring. Renewable Power Generation. 2017. Mar;11(4):443–452. doi: 10.1049/iet-rpg.2016.0088 [DOI] [Google Scholar]
23. Huang YS, Gao YL, Yan Y, Ye M. A new financial data forecasting model using genetic algorithm and long short-term memory network. Neurocomputing. 2021. Feb;425:207–218. doi: 10.1016/j.neucom.2020.04.086 [DOI] [Google Scholar]
24. Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Physical review letters. 2002. Apr;88(17). doi: 10.1103/PhysRevLett.88.174102 [DOI] [PubMed] [Google Scholar]
25. Ranjeeta B, Dash PK, Parida AK. Hybrid Variational Mode Decomposition and evolutionary robust kernel extreme learning machine for stock price and movement prediction on daily basis. Applied Soft Computing. 2019. Jan;74:652–678. doi: 10.1016/j.asoc.2018.11.008 [DOI] [Google Scholar]
26. Guo W, Liu QF, Luo ZD, Tse YM. Forecasts for international financial series with VMD algorithms. Journal of Asian Economics. 2022. Jan;80. doi: 10.1016/j.asieco.2022.101458 [DOI] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0299955.r001

Decision Letter 0

Samuel Asante Gyamerah

9 Jan 2024

PONE-D-23-26209Branch Error Reduction Criterion-Based Signal Recursive Decomposition and Its Application to Wind Power Generation ForecastingPLOS ONE

Dear Dr. li,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 14 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Samuel Asante Gyamerah, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for stating the following in the Financial Disclosure section:

"Funding information:

State Grid Fujian Electric Power Co. Ltd.: SGTYHT/20-JS-223(SGFJJY00GHJS2200054)

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

We note that one or more of the authors have an affiliation to the commercial funders of this research study : State Grid Fujian Electric Power Co. Ltd

(1) Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

(2) Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: ""This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these.

Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

4. Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This research was fully funded by the scientific and technological project of State Grid

Fujian Electric Power Co. Ltd. (SGTYHT/20-JS-223(SGFJJY00GHJS2200054)).The

authors would like to express their gratitude to AJE for the expert linguistic services

provided."

Funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"Funding information:

State Grid Fujian Electric Power Co. Ltd.: SGTYHT/20-JS-223(SGFJJY00GHJS2200054)

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

6. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a very good study which is novel. Hybrid methods are known to perform better than the underlying or single methods to analyse data. The authors also presented the same work with some level of great detail into the suggested method. However, there are few corrections that may need to be done. In line 18 its written In Ref. [6] which i presume was supposed to write the author's name like what the authors did in line 19. This needs correction. Similar mistake is made in line 24 and 43.

Decomposition has been used as a preprocessing method in many studies as also cited in this paper. I suggest adding one or 2 sentences after paragraph 1 in Introduction justifying why the wind generation data needs to be decomposed. In Line 192 I suggest carrying out stationarity Tests such as the Augmented Dickey-Fuller (ADF) test and the Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test and then make a conclusion that the data is non-stationary.

It would be interesting to add few more years instead of using only 2 years to carry out a study.

Reviewer #2: The motivation for the study has been well explained. Choice of Branch Error reduction over VMD has been duly justified. Recommendation for practice and further studies have been given. Some statements in the introductory section need to be cited

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: EMMANUEL NUMAPAU GYAMFI

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Mar 22;19(3):e0299955. doi: 10.1371/journal.pone.0299955.r002

Author response to Decision Letter 0

25 Jan 2024

To Academic Editor

Thank you so much for your professional and valuable comments on our manuscripts, we have made changes and improvements accordingly.

Comments:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming.

Reply: Thank you so much for the kind suggestion. We have carefully checked our manuscript to ensure that this manuscript meets PLOS ONE's style requirements.

Reply: Thank you very much for your reminder. The code has been packaged for easy running.

3. The Funding Statement and Competing Interests Statement need to be updated.

Reply: Thank you for your kind suggestion. The Funding Statement and Competing Interests Statement are updated.

4. Funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

Reply: Sorry for the oversight, the funding information has been placed in Funding Statement section of the online submission form.

The updated Funding Statement is as the following:

Funding information: State Grid Fujian Electric Power Co. Ltd.: SGTYHT/20-JS-223(SGFJJY00GHJS2200054). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The funder provided support in the form of salaries for authors Fen Xiao and Siyu Yang, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

5. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found.

Reply: Thank you for your suggestion. The minimal data set used in the manuscript has been uploaded as Supporting Information.

6. Please review your reference list to ensure that it is complete and correct.

Reply: Thank you for the kind suggestion. All references have been checked carefully. Four new references have been supplemented.

To Reviewers #1

We really appreciate the professional and valuable comments on our manuscript. We have made the revisions and improvements accordingly.

1. In line 18 its written In Ref. [6] which i presume was supposed to write the author's name like what the authors did in line 19. This needs correction. Similar mistake is made in line 24 and 43.

Reply: Thank you for the kind suggestion. The citations formats have been corrected accordingly.

2. Decomposition has been used as a preprocessing method in many studies as also cited in this paper. I suggest adding one or 2 sentences after paragraph 1 in Introduction justifying why the wind generation data needs to be decomposed.

Reply: Thank you for the professional comment. The reason for the decomposition of the generation time series is supplemented at the beginning of the 4th paragraph in the introduction section: “The time series of electricity generation is generally a broadband signal, and its future trend is not stable. Therefore, it is difficult to approximate the relationship between historical measurements and its future changes. The future trend of a narrowband signal is normally considered to be more stable. Therefore, the second type of hybrid model is used to decompose the time series of power generation into narrowband modes, and the final forecasted results are obtained by summarizing the forecasted results of each mode” (Line 37-43).

3. In Line 192 I suggest carrying out stationarity Tests such as the Augmented Dickey-Fuller (ADF) test and the Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test and then make a conclusion that the data is non-stationary.

Reply: Thank you for the enlightening comment.

The ADF test results of partial and total generation time series of 2020 and 2021 are supplemented to Table 2 in the revised manuscript. Detailed statistics of the ADF test are also presented in Table R.1, which indicates the non-stationarity of the time series data. Due to limited space, can we just not show Table R.1 in the revised manuscript, thank you.

Table R.1. ADF test results and critical values (in 'Response to Reviewers.docx')

4. It would be interesting to add few more years instead of using only 2 years to carry out a study.

Reply: Thank you for the practically significant suggestion.

The historical data of power generation in Fujian Province from Jan 1 2012 to Jun 30 2022 is shown in Fig. R.1, with a total of 3834 daily power generation data. A total of 3653 power generation data in the decade 2012- 2021 is selected as the training set, and a total of 181 power generation data in the first six months of 2022 is selected as the testing set. The same comparison experiments are conducted against the manuscript and the results are shown in Table R.2.

Fig. R.1 Power generation trend of Fujian Province from Jan o1 2012 to Jun 30 2019 (in 'Response to Reviewers.docx')

Table R.2. Comparative experiments (in 'Response to Reviewers.docx')

According to Table R. 2, the same conclusions can be drawn:

a. More decomposition layers or more decomposition numbers cannot be equated with better decomposition effect;

b. There are randomness and limitations in the final effect of direct decomposition and decomposition according to whether the center

frequency is aliased;

c. Recursive decomposition based on BER performs more consistently and efficiently in the experiment.

Although the same conclusions can be drawn, the electricity generation data have several characteristics: they are highly influenced by policy, data distribution evolves over time. Compared with one year's data, too much experimental data seems to be less suitable for comparison and presentation to some extent. Therefore, would it be more succinct to use only single year data in the experiment? We look forward to your valuable suggestions.

To Reviewers #2

The authors would like to thank you for taking the time to read our manuscript. We have modified our manuscript according to the suggestions and comments.

1. Some statements in the introductory section need to be cited.

Reply: Thank you for the kind suggestion. In the introduction section, three references are supplemented, i.e., Ref [1], Ref [2] and Ref [4]. In the comparative experiments section, Ref [25] is supplemented.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0299955.s002.docx^{(82.8KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0299955.r003

Decision Letter 1

Samuel Asante Gyamerah

20 Feb 2024

Branch Error Reduction Criterion-Based Signal Recursive Decomposition and Its Application to Wind Power Generation Forecasting

PONE-D-23-26209R1

Dear Dr. li,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Samuel Asante Gyamerah, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

**********

6. Review Comments to the Author

Reviewer #1: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Willard Zvarevashe

**********

PLoS One. doi: 10.1371/journal.pone.0299955.r004

Acceptance letter

Samuel Asante Gyamerah

12 Mar 2024

PONE-D-23-26209R1

PLOS ONE

Dear Dr. Li,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Samuel Asante Gyamerah

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Data

(CSV)

pone.0299955.s001.csv^{(14.2KB, csv)}

Attachment

Submitted filename: Response to Reviewers.docx

pone.0299955.s002.docx^{(82.8KB, docx)}

Data Availability Statement

All relevant data are within the manuscript and its Supporting information files.

[pone.0299955.ref001] 1. Wu CY, Wang JZ, Hao Y. Deterministic and uncertainty crude oil price forecasting based on outlier detection and modified multi-objective optimization algorithm. Resources Policy. 2022. Aug;102780. doi: 10.1016/j.resourpol.2022.102780 [DOI] [Google Scholar]

[pone.0299955.ref002] 2. Şahin Utkucan. Projections of Turkey’s electricity generation and installed capacity from total renewable and hydro energy using fractional nonlinear grey Bernoulli model and its reduced forms. Sustainable Production and Consumption. 2020. Jul;23:52–62. [Google Scholar]

[pone.0299955.ref003] 3.Shi XW, Shi XF, Dong WQ, Zang P, Jia HY, Wu JF, et al. Research on Energy Storage Configuration Method Based on Wind and Solar Volatility. 2020 10th International Conference on Power and Energy Systems (ICPES). 2020 Dec;464–468.

[pone.0299955.ref004] 4.Elsaraiti M, Ali G, Musbah H, Merabet A, Little T. Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model. 2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021. 2021 Jun;259-262.

[pone.0299955.ref005] 5. Yi M, Xie W, Mo L. Short-Term Electricity Price Forecasting Based on BP Neural Network Optimized by SAPSO. Energies. 2021. Oct;14(20):6514. doi: 10.3390/en14206514 [DOI] [Google Scholar]

[pone.0299955.ref006] 6. Kong WC, Dong ZY, Hill DJ, Luo FJ, Xu Y. Short-term residential load forecasting based on resident behavior learning. IEEE Transactions on Power Systems. 2018. Jan;33(1):1087–1088. doi: 10.1109/TPWRS.2017.2688178 [DOI] [Google Scholar]

[pone.0299955.ref007] 7. Zhang XB, Wang JZ, Zhang KQ. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Electric Power Systems Research. 2017. May;146:270–285. doi: 10.1016/j.epsr.2017.01.035 [DOI] [Google Scholar]

[pone.0299955.ref008] 8. Zhang GQ, Guo JF. A novel method for hourly electricity demand forecasting. IEEE Transactions on Power Systems. 2019. Mar;35(2):1351–1363. doi: 10.1109/TPWRS.2021.3098960 [DOI] [Google Scholar]

[pone.0299955.ref009] 9. Choi JE, Dong WS. Parallel architecture of CNN-bidirectional LSTMs for implied volatility forecast. Journal of Forecasting. 2021. Feb;41(6):1087–1098. doi: 10.1002/for.2844 [DOI] [Google Scholar]

[pone.0299955.ref010] 10. Bai Y, Li Y, Wang XX, Xie JJ, Li C. Air Pollutants Concentrations Forecasting Using Back Propagation Neural Network Based on Wavelet Decomposition with Meteorological Conditions. Atmospheric Pollution Research. 2016. May;7(3):557–566. doi: 10.1016/j.apr.2016.01.004 [DOI] [Google Scholar]

[pone.0299955.ref011] 11. Mallat SG. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1989. Jul.;11(7):674–693. doi: 10.1109/34.192463 [DOI] [Google Scholar]

[pone.0299955.ref012] 12. Zheng HT, Yuan JB, Chen L. Short-Term load forecasting using EMD-LSTM neural networks with a Xgboost algorithm for feature importance evaluation. Energies. 2017. Aug;10(8):1168. doi: 10.3390/en10081168 [DOI] [Google Scholar]

[pone.0299955.ref013] 13. Chen YH, Kloft M, Yang Y, Li CH, Li L. Mixed kernel based extreme learning machine for electric load forecasting. Neurocomputing. 2018. Oct;312:90–106. doi: 10.1016/j.neucom.2018.05.068 [DOI] [Google Scholar]

[pone.0299955.ref014] 14. Huang ME, Shen Z, Long SR, Wu MLC, Shih HH, Zheng QN, et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society A-Mathematical Physical and Engineering Sciences. 2020. Mar;454:903–995. doi: 10.1098/rspa.1998.0193 [DOI] [Google Scholar]

[pone.0299955.ref015] 15. Qin QD, He HD, Li L, He LY. A Novel Decomposition-Ensemble Based Carbon Price Forecasting Model Integrated with Local Polynomial Prediction. Computational Economics. 2020. Apr;55(4):1249–1273. doi: 10.1007/s10614-018-9862-1 [DOI] [Google Scholar]

[pone.0299955.ref016] 16. Wu ZH, Huang NE. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Advances in adaptive data analysis. 2009. Jan;1(1):1–41. doi: 10.1142/S1793536909000047 [DOI] [Google Scholar]

[pone.0299955.ref017] 17. Lv LL, Wu ZY, Zhang JH, Zhang L, Tan ZY, Tian ZH. A VMD and LSTM Based Hybrid Model of Load Forecasting for Power Grid Security. IEEE Transactions on Industrial Informatics. 2022. Sep;18(9):6474–6482. doi: 10.1109/TII.2021.3130237 [DOI] [Google Scholar]

[pone.0299955.ref018] 18. Cai C, Li Y, Su ZH, Zhu TQ, He YY. Short-Term Electrical Load Forecasting Based on VMD and GRU-TCN Hybrid Network. Applied Sciences. 2022. Jul;12(13):6647. doi: 10.3390/app12136647 [DOI] [Google Scholar]

[pone.0299955.ref019] 19. Dragomiretskiy K, Zosso D. Variational Mode Decomposition. IEEE Transactions on Signal Processing. 2014. Feb;62(3):531–543. doi: 10.1109/TSP.2013.2288675 [DOI] [Google Scholar]

[pone.0299955.ref020] 20. Liu W, Cao SY, Wang ZM, Kong XZ, Chen YK. Spectral decomposition for hydrocarbon detection based on VMD and teager-kaiser energy. Geoscience and Remote Sensing Letters. 2017. Apr;14(4):539–543. doi: 10.1109/LGRS.2017.2656158 [DOI] [Google Scholar]

[pone.0299955.ref021] 21. Lahmiri S. Comparing variational and empirical mode decomposition in forecasting day-ahead energy prices. Systems Journal. 2017. Sep;11(3):1907–1910. [Google Scholar]

[pone.0299955.ref022] 22. Yang WX, Peng ZK, Wei KX, Shi P, and Tian WY. Superiorities of variational mode decomposition over empirical mode decomposition particularly in time-frequency feature extraction and wind turbine condition monitoring. Renewable Power Generation. 2017. Mar;11(4):443–452. doi: 10.1049/iet-rpg.2016.0088 [DOI] [Google Scholar]

[pone.0299955.ref023] 23. Huang YS, Gao YL, Yan Y, Ye M. A new financial data forecasting model using genetic algorithm and long short-term memory network. Neurocomputing. 2021. Feb;425:207–218. doi: 10.1016/j.neucom.2020.04.086 [DOI] [Google Scholar]

[pone.0299955.ref024] 24. Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Physical review letters. 2002. Apr;88(17). doi: 10.1103/PhysRevLett.88.174102 [DOI] [PubMed] [Google Scholar]

[pone.0299955.ref025] 25. Ranjeeta B, Dash PK, Parida AK. Hybrid Variational Mode Decomposition and evolutionary robust kernel extreme learning machine for stock price and movement prediction on daily basis. Applied Soft Computing. 2019. Jan;74:652–678. doi: 10.1016/j.asoc.2018.11.008 [DOI] [Google Scholar]

[pone.0299955.ref026] 26. Guo W, Liu QF, Luo ZD, Tse YM. Forecasts for international financial series with VMD algorithms. Journal of Asian Economics. 2022. Jan;80. doi: 10.1016/j.asieco.2022.101458 [DOI] [Google Scholar]

PERMALINK

Branch error reduction criterion-based signal recursive decomposition and its application to wind power generation forecasting

Fen Xiao

Siyu Yang

Xiao Li

Junhong Ni

Roles

Abstract

Introduction

VMD and permutation entropy

VMD

Table 1. Decomposition results of daily power generation data in 2020 under different mode numbers.

Fig 1. Normalization error trends at different mode numbers.

Permutation entropy

VMD-based signal recursive decomposition

Branch error reduction criterion

BER-based decomposition

Fig 2. Flow chart of BER to determine the mode number.

Fig 3. Flow chart of the proposed method.

Data experiments

Power generation datasets

Fig 4. Trend of wind power generation in 2020 and 2021.

Table 2. Analysis of the daily power generation datasets.

Fig 5. Statistics of wind power generation datasets.

Evaluation indicators

Error indicators

Improvement indicators

Comparative experiments

Decomposition according to the center frequency

Table 3. Central frequency of the first decomposition.

Fig 6. Normalization error trends at different mode numbers.

Table 4. Central frequencies of the second decomposition in 2020 and 2021.

Decomposition according to BER criterion

Fig 7. LSSVM decomposition process.

Table 5. Percentage decrease in the error sum.

Table 6. PE of each component.

Experimental results and analysis

Table 7. Forecasting results of Group A and Group B.

Table 8. Experimental error results for the third and fourth groups.

Table 9. Percentage reduction for secondary and tertiary decomposition.

Fig 8. Comparison of experimental errors in three levels.

Table 10. Percentage decrease in the error of discarding secondary residual components.

Fig 9. Average percentage of MAE after removing the secondary residual components.

Summary

Conclusions

Supporting information

Data Availability

Funding Statement

References

Decision Letter 0

Samuel Asante Gyamerah

Roles

Author response to Decision Letter 0

Decision Letter 1

Samuel Asante Gyamerah

Roles

Acceptance letter

Samuel Asante Gyamerah

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases