Abstract
Worldwide lockdown policies against COVID-19 have affected electricity consumption. Many countries have pointed out that a secure electricity supply during the epidemic is critical to people’s livelihoods, so accurate prediction of electricity demand plays an increasingly important role in energy security. Although many studies have addressed electricity forecasting, they did not consider the pandemic, and many considered only prediction accuracy while ignoring stability. It is therefore necessary to develop an electricity consumption prediction model that performs well during the pandemic. In this work, a hybrid prediction system is proposed that combines data processing, modelling, and optimization. An improved complete ensemble empirical mode decomposition with adaptive noise is used for data preprocessing, which overcomes shortcomings of the original method; a multi-objective optimizer is adopted to ensure both accuracy and stability; and a support vector machine is used as the prediction model. Taking the daily electricity demand of the US as an example, the results show that the proposed hybrid model is superior to the benchmark models in both prediction accuracy and stability. Moreover, the selection of input parameters is discussed; the results indicate that the model considering daily infections has the highest prediction accuracy and stability, which suggests that the proposed model has considerable potential in real-world applications.
Keywords: Electricity demand, Multi-objective optimizer, Prediction, Support vector machine, Denoising, COVID-19
Abbreviations
- AFD
adaptive fourier decomposition
- ANN
artificial neural network
- AR
autoregressive
- ARIMA
autoregressive integrated moving average model
- ARMA
autoregressive moving average
- BA
bat algorithm
- BP
back propagation
- CB
cyclic behaviour
- CEEMDAN
complete ensemble empirical mode decomposition with adaptive noise
- CNN
convolutional neural network
- COVID-19
coronavirus disease 2019
- CS
cuckoo search
- DBN
deep belief network
- DD
daily deaths
- DE
differential evolution
- DI
daily infections
- DM
Diebold-Mariano
- ED
electricity demand
- EIA
U.S. Energy Information Administration
- ELM
extreme learning machine
- EMD
empirical mode decomposition
- EEMD
ensemble empirical mode decomposition
- FAR
functional autoregressive
- FFT
fast fourier transform
- FNN
feed-forward neural network
- GRSI
Government Response Stringency Index
- GWO
grey wolf optimizer
- HHO
Harris hawks optimization
- HVAC
heating, ventilation, and air conditioning
- ICEEMDAN
improved complete ensemble empirical mode decomposition with adaptive noise
- IEA
International Energy Agency
- IMF
intrinsic mode functions
- KCC
Kendall correlation coefficient
- KM
kernel machine
- KMM
kernel machine with memory
- LightGBM
light gradient boosting machine
- LR
linear regression
- LSSVM
least-squares support-vector machine
- LSTM
long short-term memory network
- MA
moving average
- MAE
mean absolute error
- MAPE
mean absolute percentage error
- MARS
multivariate adaptive regression spline
- MIDAS
mixed data sampling
- MLP
multi-layer perceptron
- MOGWO
multi-objective grey wolf optimizer
- MOMVO
multi-objective multi-verse optimizer
- NN
neural network
- NSGAII
non-dominated sorting genetic algorithm II
- PAA
piecewise aggregated approximation
- PCC
Pearson correlation coefficient
- PI
piecewise interpolation
- PSO
particle swarm optimization
- RBFNN
radial basis function neural network
- RLF
residential load factor
- RMSE
root mean square error
- RNN
recurrent neural network
- RWF
random walk forecasting
- SAR
seasonal autoregressive
- SARS-Cov-2
severe acute respiratory syndrome coronavirus 2
- SCA
sine cosine optimization
- SCC
Spearman correlation coefficient
- SPA
seasonal patterns adjustment
- SSA
salp swarm algorithm
- STDRE
standard deviation of relative error
- SVM
support vector machine
- SWPT
stationary wavelet packet transform
- TLM
two-reservoir model with linear memory
- TNM
two-reservoir model with nonlinear memory
- TRM
two-reservoir model
- VMD
variational mode decomposition
- WHO
World Health Organization
- WOA
whale optimization algorithm
1. Introduction
1.1. Background
On December 31, 2019, a cluster of pneumonia cases was reported in Wuhan, China. The pathogen was confirmed as SARS-CoV-2, and the disease was officially named COVID-19 by the WHO [1]. With the spread of COVID-19, the global situation became severe. On March 11, 2020, the WHO declared COVID-19 a pandemic. In the first quarter of 2020, the epidemic affected almost everyone in the world. According to data from Johns Hopkins University, as of May 29, 2020, nearly 6 million people had been infected and approximately 365,000 people had died worldwide (see Fig. 1). In this context, many countries have introduced lockdown policies to limit contact between people and thus control the spread of the epidemic [2]. At the same time, the global energy sector has been profoundly affected by the epidemic. According to IEA statistics, apart from a slight increase in the demand for renewable energy, demand for all other energy sources declined to varying degrees in the first quarter; oil fell the most, by 9%, and electricity demand fell by 2.5% [3].
Fig. 1.
Distribution of COVID-19 infections worldwide.
Some scholars have suggested that energy and medical care are equally important during the epidemic. Among these energy sources, electricity may be most relevant to people’s lives. Many countries have issued policies requiring the power supply department to provide uninterrupted power supplies and allow users to delay payment [4]. In this case, the requirements for the power supply department to accurately allocate power resources are higher than usual. Thus, in electricity management, the importance of accurate prediction of electricity demand is self-evident.
1.2. Related works and problem statement
In recent years, numerous studies on energy demand forecasting have emerged. Table 1 lists the relevant works from the past two years and gives information on the models used. It shows that energy demand prediction methods can be roughly divided into data-driven models and physical simulation models. Data-driven models are used more often because physical simulation models need to consider too many external factors, for which relevant data are difficult to collect. Even so, some data-driven regression models still require data on a large number of related factors. Most data-driven models use machine learning or deep learning algorithms, and in recent years many scholars have adopted hybrid models because single models have some drawbacks. Although these models have obtained more accurate prediction results in some cases, electricity demand prediction still faces the following problems:
-
1)
according to the literature review, most scholars only considered the accuracy of the prediction model, not the stability;
-
2)
electricity demand prediction studies did not consider major global events such as COVID-19; weather and other factors considered may not be essential factors in this particular period. In other words, these models may lack applicability in the context of major global events.
Table 1.
Studies related to energy demand prediction in the past two years.
| Reference | Prediction target | Model | Factors considered | Models for comparison |
|---|---|---|---|---|
| [5] | Energy demand in Iran | A hybrid model combines scenario analysis and Bayesian approach | Historical energy demand, primary energy production, population, GDP, natural gas price, gasoline price | – |
| [6] | Energy demand in Ireland | Covariance matrix adaptation evolutionary strategy | Historical energy demand | PSO, DE, BP, MA, RWF, LR |
| [7] | Electricity demand in India | LSTM | Historical electricity demand considering cluster analysis | ANN, RNN, SVM |
| [8] | Electricity demand in New South Wales and Singapore | VMD-SSA-SVM | Historical electricity demand | SVM, SSA-SVM, SSA-LSSVM, ARIMA |
| [9] | Natural gas demand in Germany | FAR-CNN | Historical natural gas demand | FAR-LSTM, FAR, CNN, LSTM, MLP, AR, SAR, LightGBM |
| [10] | Energy demand in China | ADL-MIDAS | Historical energy demand | – |
| [11] | Residential natural gas demand | LR, KM, KMM, TRM, TLM, TNM | Historical maximum daily demand, weather | – |
| [12] | Energy demand in Basilicata and Italy | Regression analysis | End user-related factors | – |
| [13] | Load demand | SWPT–HHO–FNN | Historical load demand, date attribute, weather | PSO-ANN, PSO-LSSVM, BP |
| [14] | Energy demand | ARIMA-ANN-PSO-SVM | Historical energy demand | ARIMA, ANN, PSO-SVM |
| [15] | Building energy demand | Engineering simulation | Factors related to building energy | – |
| [16] | Heating demand | ANN with an online learning method | Historical heating demand, air temperature | Thermal model, LR, SVM, Huber regressor, orthogonal matching pursuit, SGD regressor, decision tree regression, random forest |
| [17] | Electricity demand | ANN, MARS, MLR, ARIMA | Historical electricity demand, climate | – |
| [18] | Energy load | VMD-LSTM | Historical energy load | SVM, RNN, DBN, EMD-LSTM |
| [19] | Natural gas demand | ARIMA, MLP, ANN, ELM | Historical natural gas demand, weather, biogas production, date attribute, electricity price, gas price, solar radiation | – |
| [20] | Electricity demand | AFD-FFT-SPA-SCA-SVM | Historical electricity demand | RWF, ARMA, SVM, SCA-SVM, AFD-SCA-SVM, SPA-SCA-SVM, BP, ELM |
| [21] | HVAC system energy demand | Takagi-Sugeno fuzzy-NN | Historical HVAC energy demand, weather | RLF |
| [22] | Electricity demand of fans | ANN | Historical electricity demand | SVM |
| [23] | Electricity demand | CB-PAA-PI | Historical electricity demand | Holt-Winters seasonal model, seasonal naïve model |
1.3. Motivations, contributions, and article organization
Driven by the problems described in Section 1.2, the purpose of this paper is to develop a model that can be better applied to the prediction of electricity demand during COVID-19. In this work, COVID-19-related factors are considered in the model design, and the applicability of factors as model inputs is discussed. In the model design, ICEEMDAN is utilized as a data preprocessing tool, MOGWO is used to optimize the SVM, and the accuracy and stability are considered. Thus, the work is innovative in that it discusses the adaptability of various factors related to COVID-19 in the model application. Besides, the proposed prediction model takes into account both accuracy and stability. The main contributions of this paper are as follows:
-
(1)
A hybrid model is proposed to predict the daily electricity demand during the COVID-19 pandemic.
-
(2)
The proposed model is compared with benchmark models regarding prediction accuracy and stability.
-
(3)
The influences of the denoising method and optimizer on prediction are discussed.
-
(4)
The applicability of factors related to COVID-19 as inputs to the prediction model is discussed.
-
(5)
The results of one-step ahead, two-step ahead, and three-step ahead predictions are compared.
The rest of this paper is organized as follows. Section 2 introduces the relevant theories and implementation of the proposed model. Section 3 describes the collected data and prediction steps. Section 4 gives the prediction results. Section 5 discusses four critical issues related to this work. Finally, the primary conclusions and future works are summarized in Section 6.
2. Methods
The model proposed in this paper, ICEEMDAN-MOGWO-SVM, is a hybrid model with the structure “data cleaning method, optimizer, basic prediction model”. The theories behind each of these methods are introduced in this section.
2.1. Improved complete ensemble empirical mode decomposition with adaptive noise
Data decomposition breaks the raw data down into multiple sub-series without distorting the original data. The decomposed data is usually smoother, which makes prediction easier. ICEEMDAN was proposed in 2014 [24], and its predecessors include EMD, EEMD, and CEEMDAN [25]. EMD is an adaptive time-frequency signal processing method suitable for nonlinear signals. It decomposes a complex signal into a finite number of IMFs, each of which contains local characteristics of the original signal at a different time scale. EEMD was developed from EMD to overcome the mode mixing problem. CEEMDAN eliminates the noise involved in the reconstructed signal by adding white noise, and improves the efficiency of EEMD. ICEEMDAN is a further improvement on CEEMDAN; it is efficient and can avoid the generation of spurious modes. Its implementation process is as follows:
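-
(1)
Perform $I$ times EMD decomposition on the original signal:
$$x^{(i)} = x + \beta_0 E_1\left(w^{(i)}\right), \quad i = 1, \dots, I \tag{1}$$
where $x$ is the original signal; $E_k(\cdot)$ is the k-th mode component generated by EMD; $w^{(i)}$ is Gaussian noise; $x^{(i)}$ is the noise-added signal; $\beta_0$ is the noise amplitude.
-
(2)
Calculate the first residue and the first mode:
$$r_1 = \left\langle M\left(x^{(i)}\right) \right\rangle \tag{2}$$
$$d_1 = x - r_1 \tag{3}$$
where $r_k$ is the k-th residue; $d_k$ is the k-th mode; $M(\cdot)$ is the local average of the signal; $\langle \cdot \rangle$ denotes averaging over the $I$ realizations.
-
(3)
Calculate the second residue and the second mode:
$$r_2 = \left\langle M\left(r_1 + \beta_1 E_2\left(w^{(i)}\right)\right) \right\rangle \tag{4}$$
$$d_2 = r_1 - r_2 \tag{5}$$
-
(4)
Calculate the k-th residue and the k-th mode:
$$r_k = \left\langle M\left(r_{k-1} + \beta_{k-1} E_k\left(w^{(i)}\right)\right) \right\rangle \tag{6}$$
$$d_k = r_{k-1} - r_k \tag{7}$$
-
(5)
Repeat step (4) until the termination condition of decomposition is satisfied.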
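The residue-and-mode recursion in the steps above can be illustrated with a deliberately simplified sketch. It is not ICEEMDAN: the noise realizations and the ensemble averaging are omitted, and a centered moving average (`local_mean`, a hypothetical stand-in) replaces the envelope-based local mean of EMD. The sketch only shows that each mode is the difference between two successive residues, so the modes plus the final residue add back to the original signal.

```python
import math

def local_mean(signal, half_width=2):
    """Centered moving average; a crude stand-in for the EMD local mean."""
    n = len(signal)
    out = []
    for i in range(n):
        lo, hi = max(0, i - half_width), min(n, i + half_width + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out

def decompose(signal, n_modes=3):
    """Return (modes, residue) with mode_k = residue_{k-1} - residue_k."""
    modes, residue = [], list(signal)
    for _ in range(n_modes):
        new_residue = local_mean(residue)
        modes.append([r - s for r, s in zip(residue, new_residue)])
        residue = new_residue
    return modes, residue

# Made-up test signal: a slow oscillation plus a fast one.
signal = [math.sin(0.3 * t) + 0.2 * math.sin(2.5 * t) for t in range(60)]
modes, residue = decompose(signal)

# Summing all modes and the final residue recovers the input.
rebuilt = [sum(m[t] for m in modes) + residue[t] for t in range(len(signal))]
max_gap = max(abs(a - b) for a, b in zip(signal, rebuilt))
```

This additive property is what later allows the per-IMF predictions to be summed into a final forecast.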
2.2. Multi-objective grey wolf optimizer
MOGWO is developed based on the grey wolf optimizer (GWO) [26]. GWO is a meta-heuristic algorithm inspired by the hunting behavior of wolves [27]. Each wolf in the population can be regarded as a solution to the problem. The best, second-best, and third-best solutions and the remaining solutions correspond to the four levels of the wolf hierarchy. When wolves find their prey, they approach it. The position equations are:
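$$\vec{D} = \left| \vec{C} \cdot \vec{X}_p(t) - \vec{X}(t) \right|, \quad \vec{X}(t+1) = \vec{X}_p(t) - \vec{A} \cdot \vec{D} \tag{8}$$
where $\vec{D}$ is the distance between the wolf and prey; $\vec{A}$ and $\vec{C}$ are coefficient vectors; $\vec{X}_p$ and $\vec{X}$ are the position vectors of the prey and grey wolf, respectively; $t$ is the current iteration.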
GWO keeps the best three solutions, and continuously updates the position of the grey wolf by the following formula to find the best solution:
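$$\vec{D}_\alpha = \left| \vec{C}_1 \cdot \vec{X}_\alpha - \vec{X} \right|, \quad \vec{D}_\beta = \left| \vec{C}_2 \cdot \vec{X}_\beta - \vec{X} \right|, \quad \vec{D}_\delta = \left| \vec{C}_3 \cdot \vec{X}_\delta - \vec{X} \right| \tag{9}$$
$$\vec{X}_1 = \vec{X}_\alpha - \vec{A}_1 \cdot \vec{D}_\alpha, \quad \vec{X}_2 = \vec{X}_\beta - \vec{A}_2 \cdot \vec{D}_\beta, \quad \vec{X}_3 = \vec{X}_\delta - \vec{A}_3 \cdot \vec{D}_\delta, \quad \vec{X}(t+1) = \frac{\vec{X}_1 + \vec{X}_2 + \vec{X}_3}{3} \tag{10}$$
where $\alpha$, $\beta$ and $\delta$ are grey wolves of different levels.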
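As a minimal sketch of this update, assuming the standard formulation from the GWO literature [27], one wolf can be moved toward the three leaders as follows. The function name `gwo_step` and all numeric values are illustrative. In the original GWO the control parameter `a` is decreased linearly from 2 to 0 over the iterations, and the coefficients are drawn as `A = 2*a*r1 - a` and `C = 2*r2` with `r1`, `r2` uniform in [0, 1].

```python
import random

def gwo_step(position, leaders, a, rng):
    """One position update toward the three leaders (alpha, beta, delta)."""
    new_position = []
    for d, x in enumerate(position):
        candidates = []
        for leader in leaders:
            A = 2 * a * rng.random() - a      # coefficient in [-a, a]
            C = 2 * rng.random()              # coefficient in [0, 2]
            distance = abs(C * leader[d] - x)
            candidates.append(leader[d] - A * distance)
        # The new coordinate is the average of the three leader-guided moves.
        new_position.append(sum(candidates) / len(candidates))
    return new_position

rng = random.Random(0)
leaders = [[1.0, 2.0], [1.5, 1.5], [0.5, 2.5]]   # alpha, beta, delta (made up)
wolf = [4.0, -3.0]

# With a = 0 the random term vanishes and the wolf lands on the leaders' mean.
moved = gwo_step(wolf, leaders, a=0.0, rng=rng)
explored = gwo_step(wolf, leaders, a=2.0, rng=rng)
```

Large values of `a` allow steps that overshoot the leaders (exploration), while small values pull the wolf toward their average position (exploitation).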
MOGWO differs from GWO in two ways [26]. First, the update method is changed, and an archive is introduced to store the current best (non-dominated) individuals. After each iteration, each newly generated individual is compared with the individuals in the archive. In addition, to avoid too many similar individuals, all individuals are grouped into hypercubes according to the distance between their objective function values. Second, the selection mechanism of the leader wolves is changed: the leaders are selected directly from the archive by roulette wheel, which addresses the difficulty of directly determining three non-dominated solutions through the Pareto method. The probability of each hypercube can be calculated by Eq. (11). More information can be found in the literature [26].
$$P_i = \frac{c}{N_i} \tag{11}$$
where $c$ is a constant; $N_i$ is the number of Pareto optimal solutions in the i-th hypercube; $P_i$ is the probability of the hypercube.
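The leader selection can be sketched as a roulette wheel over the occupied hypercubes, assuming the form used in the MOGWO literature [26], in which the selection weight of a hypercube is a constant divided by the number of archive solutions it contains. The normalization step, the value `c = 2.0`, and the counts below are assumptions made for illustration.

```python
import random

def hypercube_probabilities(counts, c=2.0):
    """Weight c / N_i for each occupied hypercube, normalised to sum to 1."""
    raw = [c / n for n in counts]
    total = sum(raw)
    return [p / total for p in raw]

def pick_hypercube(counts, rng, c=2.0):
    """Roulette-wheel choice; less crowded hypercubes are more likely."""
    probs = hypercube_probabilities(counts, c)
    return rng.choices(range(len(counts)), weights=probs, k=1)[0]

counts = [1, 4, 5]            # archive members per occupied hypercube (made up)
probs = hypercube_probabilities(counts)
chosen = pick_hypercube(counts, random.Random(1))
```

Because the weight is inversely proportional to the crowding, leaders tend to be drawn from sparsely populated regions of the Pareto front, which encourages a well-spread set of solutions.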
2.3. Support vector machine
SVM is one of the most popular machine learning models. It has a strong statistical foundation and is well suited to small samples. The related theory can be found in the literature [28]. SVM has a wide range of applications in energy [29], environment [30], hydrology [31], and economy [32]. It is used not only as a target model for research, but also as a benchmark model. In regression problems, the training set can be defined as [33]:
$$T = \left\{ (x_i, y_i) \right\}_{i=1}^{n} \tag{12}$$
where $x_i$ and $y_i$ are input and output, respectively.
The specific form of the SVM model is:
$$f(x) = w^{\mathsf{T}} \varphi(x) + b \tag{13}$$
where $w$ is the weight vector; $\varphi(\cdot)$ is the nonlinear mapping function; $b$ is the bias term.
In the SVM model, the penalty factor and the kernel width are two hyperparameters that affect the prediction performance. Many scholars use optimizers to tune these hyperparameters. For example, Fan et al. [34] utilized WOA, BA, and PSO to optimize SVM to predict solar radiation; Zhang et al. [35] employed CS to optimize SVM to predict short-term electricity load; Li et al. [36] used MOMVO to optimize LSSVM to predict an air quality indicator.
3. Empirical analysis
In this section, the validity of the proposed model is verified through a case study. Considering that the United States is the second-largest energy-consuming country in the world, and is the most affected in this pandemic (as of May 29, 2020, the number of infected people accounts for about 30% of the world), the case study is set up for the daily electricity demand of the United States.
3.1. Data collection and description
In this work, the daily electricity demand data of the United States come from the EIA (https://www.eia.gov/). Since the proposed model considers the impact of COVID-19, data on the number of daily infections, the number of daily deaths, and the GRSI are also collected. The data for these three factors are taken from Our World in Data (https://ourworldindata.org/). The GRSI is an indicator of the degree of lockdown proposed by Oxford University after the outbreak [37]. It is a composite indicator of nine factors, as shown in Fig. 2. Its maximum score is 100, and the higher the score, the stricter the lockdown. All four datasets are daily and cover January 19 to May 15, 2020 (see Fig. 3). Their statistical description is shown in Table 2.
Fig. 2.
Nine factors considered by GRSI.
Fig. 3.
Datasets of electricity demand and three COVID-19-related factors.
Table 2.
The statistical description of the four datasets.
| Dataset | Unit | Data amount | Maximum | Minimum | Mean | Standard deviation |
|---|---|---|---|---|---|---|
| ED | MWh | 118 | 12,283,918 | 8,518,041 | 9,893,992.51 | 843,837.09 |
| DI | – | 118 | 48,529 | 0 | 12,016.01 | 13,542.94 |
| DD | – | 118 | 4,928 | 0 | 728.02 | 991.41 |
| GRSI | – | 118 | 73.57 | 0 | 40.53 | 31.36 |
3.2. Prediction system
3.2.1. Overall system
As shown in Fig. 4, the prediction system includes three parts: (1) data collection, (2) data preprocessing, and (3) optimization and prediction. The details of data collection are described in Section 3.1. The remaining parts are described in Section 3.2.2.
Fig. 4.
Overall prediction system.
3.2.2. Prediction steps
-
(1)
Data decomposition
ICEEMDAN is used to decompose the raw data into multiple IMFs, so that the decomposed data fall within smaller fluctuation ranges, as shown in Fig. 4 and Table 3. According to the theory of ICEEMDAN, the decomposition terminates when the last IMF has fewer than three local extrema. However, for some data this condition may never be met, so the maximum number of decompositions is set to 5000; if the condition is still not met after 5000 decompositions, the decomposition is terminated. In the end, the raw dataset is broken down into six IMFs.
Table 3.
| Dataset | Raw data | IMF1 | IMF2 | IMF3 | IMF4 | IMF5 | IMF6 |
|---|---|---|---|---|---|---|---|
| Maximum | 12,283,918 | 498,434.6 | 529,416.8 | 261,793.4 | 176,162.7 | 739,302.2 | 10,188,100 |
| Minimum | 8,518,041 | −487,111 | −487,698 | −214,320 | −170,305 | −225,773 | 9,110,830 |
-
(2)
Data normalization
Because the dimensions of datasets may be different, to eliminate the influence of dimensions and improve the accuracy and speed of prediction, normalization is executed using the following equation:
$$x' = \frac{x - x_{\min}}{x_{\max} - x_{\min}} \tag{14}$$
where $x'$ is the normalized data in the interval [0,1]; $x$ is the raw data; $x_{\min}$ and $x_{\max}$ are the minimum and maximum of the raw data, respectively.
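A minimal sketch of min-max normalization to [0, 1] is given below. The three input values are the minimum, mean, and maximum of ED from Table 2; the function name `min_max_normalize` is illustrative, and in practice each IMF would be scaled with its own minimum and maximum.

```python
def min_max_normalize(values):
    """Scale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("a constant series cannot be min-max scaled")
    return [(v - lo) / (hi - lo) for v in values], lo, hi

# Minimum, mean, and maximum of ED from Table 2, in MWh.
demand = [8_518_041, 9_893_992.51, 12_283_918]
scaled, lo, hi = min_max_normalize(demand)
```

The minimum and maximum are returned alongside the scaled values because they are needed later to convert predictions back to MWh.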
-
(3)
Optimization and prediction
After ICEEMDAN completes the decomposition and normalization, the decomposed data is input into the model for optimization and training operations. These two operations are performed in the training set and are synchronized. That is, the training of SVM and the optimization of SVM are carried out at the same time. When the optimization is completed, the training of SVM is also terminated, as shown in Fig. 5 . In the conventional optimization (single-objective optimization) problem, scholars usually establish one objective function to minimize the prediction error in the training set. Because the multi-objective optimization adopted in this paper considers both prediction accuracy and stability, two objective functions are established:
| (15) |
where and are the objective functions for prediction accuracy and stability, respectively; is the MAPE in the training set; is the sample size of the training set; and are the actual and prediction values at time k; is the population standard deviation.
Fig. 5.
The optimization and training processes of MOGWO-SVM.
The SVM optimized by MOGWO is output from the training set and imported into the test set for prediction. Therefore, before optimization and prediction, the data set needs to be segmented. In this work, the ratio of the training set to test set is 7:3. Besides, the one-day ahead prediction is performed in this case study (see Fig. 6 ), and the relevant theories are shown in the literature [38].
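The data segmentation can be sketched as follows, assuming that the inputs are a sliding window of past values and that the 7:3 split is chronological. The window length `n_lags = 3` and the rounding of the split point are arbitrary choices for illustration; they are not stated in the text.

```python
def make_windows(series, n_lags):
    """Pair each block of n_lags past values with the next value."""
    inputs = [series[i:i + n_lags] for i in range(len(series) - n_lags)]
    targets = [series[i + n_lags] for i in range(len(series) - n_lags)]
    return inputs, targets

def chronological_split(inputs, targets, train_fraction=0.7):
    """Keep the time order: earliest samples train, latest samples test."""
    cut = int(len(inputs) * train_fraction)
    return (inputs[:cut], targets[:cut]), (inputs[cut:], targets[cut:])

series = list(range(118))                 # placeholder for 118 daily values
X, y = make_windows(series, n_lags=3)     # one-day ahead targets
(train_X, train_y), (test_X, test_y) = chronological_split(X, y)
```

Splitting by time rather than at random keeps the test set strictly after the training set, so no future information leaks into training.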
Fig. 6.
-
(4)
Denormalization and addition
Since the prediction is performed in the normalized datasets, denormalization processing is required to convert them into real values after the prediction is completed. The equation for denormalization is [39]:
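$$\hat{y} = \hat{y}' \left( x_{\max} - x_{\min} \right) + x_{\min} \tag{16}$$
where $\hat{y}$ is the real prediction value; $\hat{y}'$ is the normalized prediction value; $x_{\min}$ and $x_{\max}$ are the minimum and maximum of the raw data, respectively.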
Because the prediction is made in each IMF after being decomposed, according to the principle of ICEEMDAN, the final prediction result is the sum of the prediction results in each IMF:
$$\hat{Y} = \sum_{i=1}^{n} \hat{y}_i \tag{17}$$
where $\hat{Y}$ is the final prediction result; $\hat{y}_i$ is the prediction result in the i-th IMF; $n$ is the number of IMFs decomposed from the raw data.
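The two post-processing operations can be sketched together. The forecasts and the per-IMF ranges below are made-up numbers, and the function names are illustrative.

```python
def denormalize(scaled, lo, hi):
    """Invert min-max scaling: map [0, 1] back to [lo, hi]."""
    return [s * (hi - lo) + lo for s in scaled]

def combine_imf_forecasts(imf_forecasts):
    """Add the per-IMF forecasts time step by time step."""
    return [sum(step) for step in zip(*imf_forecasts)]

# Made-up normalized forecasts for two IMFs over three days,
# with a made-up (min, max) range for each IMF.
scaled_forecasts = [[0.0, 0.5, 1.0], [0.25, 0.25, 0.75]]
ranges = [(-100.0, 100.0), (9_000.0, 10_000.0)]

forecasts = [denormalize(s, lo, hi)
             for s, (lo, hi) in zip(scaled_forecasts, ranges)]
total = combine_imf_forecasts(forecasts)
```

Each IMF is denormalized with its own range before the sum is taken, since adding values that are still on different [0, 1] scales would not be meaningful.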
-
(5)
Error analysis
Three commonly used error metrics are used to measure the prediction performance: MAE, RMSE, and MAPE. Their expressions are as follows:
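$$\mathrm{MAE} = \frac{1}{N} \sum_{t=1}^{N} \left| y_t - \hat{y}_t \right| \tag{18}$$
$$\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{t=1}^{N} \left( y_t - \hat{y}_t \right)^2} \tag{19}$$
$$\mathrm{MAPE} = \frac{1}{N} \sum_{t=1}^{N} \left| \frac{y_t - \hat{y}_t}{y_t} \right| \times 100\% \tag{20}$$
where $y_t$ and $\hat{y}_t$ are the real and prediction values at time $t$, respectively; $N$ is the sample size.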
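The three metrics follow their usual definitions and can be computed as in the sketch below. The input values are made up for illustration.

```python
import math

def mae(actual, predicted):
    """Mean absolute error, in the unit of the data."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean square error, in the unit of the data."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))

def mape(actual, predicted):
    """Mean absolute percentage error, in percent; actual values must be non-zero."""
    ratios = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100 * ratios / len(actual)

actual = [100.0, 200.0, 400.0]        # made-up values
predicted = [110.0, 190.0, 400.0]
scores = (mae(actual, predicted), rmse(actual, predicted), mape(actual, predicted))
```

MAE and RMSE are expressed in MWh in this study, whereas MAPE is scale-free, which makes it the easiest of the three to compare across datasets.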
3.3. Benchmark models
To highlight the advantages of the proposed model, this paper defines NSGAII-SVM, WOA-SVM, PSO-SVM, SVM, and RBFNN as the benchmark models for comparison. Of the five models, four are based on SVM, and RBFNN is a classic neural network model. The theory and applications of these models and the reasons for choosing them are shown in Table 4.
Table 4.
Reasons for choosing benchmark models and their theories and applications.
| Model | Reason for being selected | Theories | Applications |
|---|---|---|---|
| NSGAII-SVM | NSGA-II is a classic multi-objective optimizer. It adopts fast non-dominated sorting and elite strategy. | [40] | [41] |
| WOA-SVM | WOA is one of the most popular meta-heuristic optimizers that has appeared in recent years. | [42] | [43] |
| PSO-SVM | PSO is a classic meta-heuristic optimizer. | [44] | [45] |
| SVM | The most primitive SVM model. | [28] | [33] |
| RBFNN | One of the most popular neural network models. | [46] | [47] |
4. Prediction results
4.1. Prediction accuracy
Fig. 7 shows the time series of the prediction results of each model in the test set. The prediction results of the proposed model are essentially consistent with the actual values, while those of WOA-SVM and PSO-SVM follow the actual values only in their overall trend. The prediction results of the other three models deviate greatly from the actual values. The same conclusions can be drawn from Table 5. The MAE, RMSE, and MAPE of the proposed model are 45134.7 MWh, 54865.1 MWh, and 0.49%, respectively, which are lower than those of all benchmark models. The MAPEs of WOA-SVM and PSO-SVM are 2.24% and 2.99%, respectively. The MAPEs of NSGAII-SVM, SVM, and RBFNN are 7.28%, 5.62%, and 5.26%, respectively. It can be concluded that the proposed model has the highest prediction accuracy among the evaluated models.
Fig. 7.
The prediction results of each model in the test set.
Table 5.
The error of each model.
| Model | MAE (MWh) | RMSE (MWh) | MAPE (%) |
|---|---|---|---|
| ICEEMDAN-MOGWO-SVM | **45134.7** | **54865.1** | **0.49** |
| NSGAII-SVM | 657551.0 | 713818.0 | 7.28 |
| WOA-SVM | 203117.0 | 261994.0 | 2.24 |
| PSO-SVM | 273117.0 | 355819.0 | 2.99 |
| SVM | 507161.0 | 574946.0 | 5.62 |
| RBFNN | 481283.0 | 577849.0 | 5.26 |
Note: Bold denotes the data with best performance in the current dataset.
4.2. Prediction stability
Fig. 8 shows the relative error at every point in the test set for the six models. The relative errors of the proposed model are tightly clustered, with a maximum of 1.06% and a minimum of −0.21%. Compared with the proposed model, the relative errors of the benchmark models are more scattered, and their ranges are larger. STDRE is employed to evaluate the stability of the prediction comprehensively. Fig. 9 shows that the STDRE of the proposed model is 0.389%, which is much lower than that of the other models. This indicates that the prediction stability of the proposed model is the best among the evaluated models.
Fig. 8.
The relative error of the prediction. (a) ICEEMDAN-MOGWO-SVM; (b) NSGAII-SVM; (c) WOA-SVM; (d) PSO-SVM; (e) SVM; (f) RBFNN.
Fig. 9.
STDRE of each model in the test set.
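STDRE can be sketched as the standard deviation of the pointwise relative errors. The use of the population standard deviation and the sign convention (predicted minus actual) are assumptions here, and the data are made up.

```python
import statistics

def relative_errors(actual, predicted):
    """Signed relative error of each prediction, in percent."""
    return [100 * (p - a) / a for a, p in zip(actual, predicted)]

def stdre(actual, predicted):
    """Population standard deviation of the relative errors, in percent."""
    return statistics.pstdev(relative_errors(actual, predicted))

actual = [100.0, 100.0, 100.0, 100.0]     # made-up values
biased = [105.0, 105.0, 105.0, 105.0]     # always 5% too high
erratic = [105.0, 95.0, 105.0, 95.0]      # alternates between +5% and -5%

stdre_biased = stdre(actual, biased)
stdre_erratic = stdre(actual, erratic)
```

Both made-up forecasts have a MAPE of 5%, but the first has an STDRE of zero and the second an STDRE of 5%. This is why stability is reported alongside accuracy: STDRE measures how much the error fluctuates, not how large it is on average.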
5. Discussions
5.1. DM test
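Although error indicators can reflect differences in prediction accuracy between models, the results may be misleading because part of the difference in accuracy may be caused by features of the data. Therefore, a DM test is used to further measure the difference in accuracy between the models [48]. Suppose the two competing models are $M_1$ and $M_2$, respectively, and the true series is $y_t$. The prediction result of the first model is $\hat{y}_{1,t}$, and the prediction result of the second model is $\hat{y}_{2,t}$; then their prediction errors $e_{1,t}$ and $e_{2,t}$ are:
$$e_{1,t} = y_t - \hat{y}_{1,t}, \quad e_{2,t} = y_t - \hat{y}_{2,t} \tag{21}$$
The null hypothesis and the alternative hypothesis are:
$$H_0: E\left[ L(e_{1,t}) \right] = E\left[ L(e_{2,t}) \right] \tag{22}$$
$$H_1: E\left[ L(e_{1,t}) \right] \neq E\left[ L(e_{2,t}) \right] \tag{23}$$
where $L$ is the loss function of the square error.
DM test statistics are calculated according to Eq. (24):
$$\mathrm{DM} = \frac{\bar{d}}{\sqrt{S^2 / n}}, \quad \bar{d} = \frac{1}{n} \sum_{t=1}^{n} \left[ L(e_{1,t}) - L(e_{2,t}) \right] \tag{24}$$
where $S^2$ is an estimation of the variance of $d_t = L(e_{1,t}) - L(e_{2,t})$.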
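A simplified version of the DM statistic can be sketched as below. It assumes a squared-error loss, estimates the variance of the loss differential with the sample variance, and applies no autocorrelation correction, which is a common simplification for one-step-ahead forecasts; the implementation used in the study may differ. The error series are made up.

```python
import math

def dm_statistic(errors_1, errors_2):
    """DM statistic with squared-error loss and no autocorrelation correction."""
    d = [e1 ** 2 - e2 ** 2 for e1, e2 in zip(errors_1, errors_2)]
    n = len(d)
    d_bar = sum(d) / n
    variance = sum((x - d_bar) ** 2 for x in d) / (n - 1)
    return d_bar / math.sqrt(variance / n)

benchmark_errors = [3.0, -2.5, 4.0, -3.5, 2.0, -4.5]   # made-up values
proposed_errors = [0.5, -0.2, 0.4, -0.6, 0.1, -0.3]

# A positive value means the first model has the larger average loss.
dm = dm_statistic(benchmark_errors, proposed_errors)
significant = abs(dm) > 1.96       # two-sided 5% normal critical value
```

Under the null hypothesis the statistic is asymptotically standard normal, so an absolute value above 1.96 rejects equal accuracy at the 5% level.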
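Table 6 shows that the DM statistic of every benchmark model is significant at the 5% level; that is, the accuracy of the proposed model differs significantly from that of each benchmark model. This further supports the conclusion that the proposed model is superior to the benchmark models in prediction accuracy.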
Table 6.
The DM test statistics of each benchmark model.
| Model | DM test |
|---|---|
| NSGAII-SVM | 15.2315∗ |
| WOA-SVM | 3.9445∗ |
| PSO-SVM | 4.6559∗ |
| SVM | 11.6551∗ |
| RBFNN | 4.7936∗ |
Note: ∗ denotes significance at the 5% level.
5.2. The impact of the denoising method and optimizer
The model proposed in this paper is developed from SVM by introducing a denoising method and an optimizer. In this section, the influences of the denoising method and the optimizer on the original model are further discussed. To this end, two additional models are considered: MOGWO-SVM and ICEEMDAN-SVM. Table 7 shows that the MAPE of SVM is reduced by about 9.4% when SVM is combined with ICEEMDAN, and by about 72.2% when it is combined with MOGWO. Similar patterns can be found in STDRE. These results indicate that the multi-objective optimizer improves the prediction performance of the original SVM more than the noise reduction method does. The same conclusion can be obtained by comparing ICEEMDAN-SVM, MOGWO-SVM, and ICEEMDAN-MOGWO-SVM. Nevertheless, the denoising method is still vital in problems that require high accuracy and stability. As shown in Fig. 10, after ICEEMDAN is introduced into MOGWO-SVM, the prediction accuracy and stability are greatly improved.
Table 7.
Accuracy and stability metrics of four models.
| Case | MAE (MWh) | RMSE (MWh) | MAPE (%) | STDRE (%) |
|---|---|---|---|---|
| ICEEMDAN-MOGWO-SVM | **45134.7** | **54865.1** | **0.49** | **0.389** |
| MOGWO-SVM | 141955.0 | 172917.0 | 1.56 | 1.616 |
| ICEEMDAN-SVM | 458448.0 | 529195.5 | 5.09 | 3.152 |
| SVM | 507161.0 | 574946.0 | 5.62 | 3.256 |
Note: Bold denotes the data with best performance in the current dataset.
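The percentage reductions quoted for Table 7 can be checked with a few lines of arithmetic. The MAPE values are copied from the table; the helper name `relative_reduction` is illustrative.

```python
def relative_reduction(baseline, improved):
    """Percentage by which `improved` is lower than `baseline`."""
    return 100 * (baseline - improved) / baseline

# MAPE values (%) copied from Table 7.
mape = {"SVM": 5.62, "ICEEMDAN-SVM": 5.09, "MOGWO-SVM": 1.56,
        "ICEEMDAN-MOGWO-SVM": 0.49}

by_denoising = relative_reduction(mape["SVM"], mape["ICEEMDAN-SVM"])
by_optimizer = relative_reduction(mape["SVM"], mape["MOGWO-SVM"])
both_vs_optimizer = relative_reduction(mape["MOGWO-SVM"],
                                       mape["ICEEMDAN-MOGWO-SVM"])
```

The first two results round to 9.4% and 72.2%. The third shows that adding ICEEMDAN to MOGWO-SVM lowers the MAPE by a further 68.6% or so, even though denoising alone gives only a modest gain on the plain SVM.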
Fig. 10.
The influence of ICEEMDAN on the prediction accuracy and stability of MOGWO-SVM. (a) Accuracy; (b) Stability.
5.3. COVID-19-related input variables
In the case study, ED is the prediction target, and the three factors DI, DD, and GRSI are considered. Correlation analysis shows that these three factors are indeed closely related to ED, as shown in Table 8. To improve the reliability of the results, three correlation coefficients are used, as given in Eqs. (25), (26), (27).
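$$\mathrm{PCC} = \frac{\operatorname{cov}(X, Y)}{\sigma_X \sigma_Y} \tag{25}$$
$$\mathrm{SCC} = 1 - \frac{6 \sum_{i=1}^{n} \left( R_i - Q_i \right)^2}{n \left( n^2 - 1 \right)} \tag{26}$$
$$\mathrm{KCC} = \frac{n_c - n_d}{n (n - 1) / 2} \tag{27}$$
where $\operatorname{cov}(X, Y)$ is the covariance between X and Y; $\sigma_X$ and $\sigma_Y$ are the standard deviations of X and Y, respectively; $n$ is the number of samples; $R_i$ and $Q_i$ are the rankings of $X_i$ and $Y_i$ in their respective column vectors; $n_c$ and $n_d$ are the numbers of concordant pairs and discordant pairs, respectively.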
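The three coefficients can be sketched in plain Python using their textbook definitions. The rank-based formulas below assume there are no tied values, which does not hold for the real data (for example, the early days with zero infections), so a tie-aware implementation would be needed in practice. The sample values are made up.

```python
import math

def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(values):
    """Rank from 1 (smallest) to n; assumes no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def spearman(x, y):
    n = len(x)
    d2 = sum((r - q) ** 2 for r, q in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

def kendall(x, y):
    n, concordant, discordant = len(x), 0, 0
    for i in range(n):
        for j in range(i + 1, n):
            sign = (x[i] - x[j]) * (y[i] - y[j])
            concordant += sign > 0
            discordant += sign < 0
    return (concordant - discordant) / (n * (n - 1) / 2)

infections = [0, 10, 50, 200, 1000]        # made-up values
demand = [100, 98, 95, 94, 90]
coefficients = (pearson(infections, demand),
                spearman(infections, demand),
                kendall(infections, demand))
```

In this made-up example the relationship is monotonically decreasing but not linear, so the two rank-based coefficients equal −1 while the Pearson coefficient is weaker. Reporting all three guards against a conclusion that depends on linearity.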
Table 8.
Correlation between three COVID-19-related factors and ED.
| Variable | PCC | SCC | KCC |
|---|---|---|---|
| DI | −0.7861 | −0.8847 | −0.7041 |
| DD | −0.6561 | −0.8637 | −0.6754 |
| GRSI | −0.8410 | −0.8668 | −0.7074 |
Selecting input variables is critical in prediction. Redundant or missing factors may make the prediction model perform poorly. In this section, six more cases are set up to explore the influence of the COVID-19-related factors on the prediction results, as shown in Table 9. Fig. 11 and Table 10 show that the prediction accuracy and stability are highest when ED and DI are used as the model inputs. However, according to the correlation analysis, GRSI has the strongest linear correlation (PCC) with ED, which indicates that using the most strongly correlated factor as a model input does not necessarily give the best prediction results. For the prediction of electricity demand in the United States during the COVID-19 pandemic, the prediction accuracy ranking of models considering different factors is shown in Fig. 12.
Table 9.
Model inputs corresponding to seven cases.
| Case | Input(s) |
|---|---|
| Case 1 (original) | ED, DI, DD, GRSI |
| Case 2 | ED, DI |
| Case 3 | ED, DD |
| Case 4 | ED, GRSI |
| Case 5 | ED, DI, DD |
| Case 6 | ED, DI, GRSI |
| Case 7 | ED, DD, GRSI |
Fig. 11.
The prediction results of seven cases.
Table 10.
Accuracy and stability metrics for seven cases.
| Case | MAE (MWh) | RMSE (MWh) | MAPE (%) | STDRE (%) |
|---|---|---|---|---|
| Case 1 (original) | 45134.7 | 54865.1 | 0.49 | 0.389 |
| Case 2 | **38937.1** | **41227.6** | **0.42** | **0.143** |
| Case 3 | 73448.3 | 76074.6 | 0.80 | 0.220 |
| Case 4 | 41361.7 | 44057.3 | 0.45 | 0.160 |
| Case 5 | 56840.1 | 72454.7 | 0.62 | 0.591 |
| Case 6 | 50032.1 | 58502.2 | 0.54 | 0.355 |
| Case 7 | 115952.0 | 126308.0 | 1.26 | 0.540 |
Note: Bold denotes the data with best performance in the current dataset.
Fig. 12.
The prediction accuracy ranking of models considering different factors.
5.4. One-step ahead vs. multi-step ahead prediction
In the practical application of daily electricity demand prediction, the further ahead managers can predict, the greater the benefit for management [49]. Thus, this paper additionally examines the performance of the proposed model in two-day ahead and three-day ahead predictions. As shown in Fig. 13, the MAPE for the two-day ahead prediction is 2.06%, and the MAPE for the three-day ahead prediction is 1.86%. Although these results are not as good as the one-day ahead prediction, the MAPE is still about 2%, indicating that the proposed model not only has high accuracy in one-step prediction, but also has considerable application potential in multi-step prediction. In practical applications, single-step and multi-step prediction results can be combined to estimate short-term future electricity consumption. If the forecast is higher than the planned consumption, a policy to restrict electricity use can be introduced.
Fig. 13.
One-day ahead, two-day ahead, three-day ahead prediction results.
5.5. Considerations of real-world applications
Tests on real-world data indicate that the model proposed in this work can be used to predict daily electricity consumption in a pandemic. In real-world applications, real-time prediction can be carried out by establishing a prediction system consisting of three modules: an input module, a model training module, and a prediction module. Note that the prediction assumes that there is no significant change in energy policy.
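The three-module structure can be sketched as follows. This is an illustrative skeleton only: the function names are hypothetical, and a simple placeholder (last value plus the average one-day change) stands in for the ICEEMDAN-MOGWO-SVM model, which is not reproduced here.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Sample = Tuple[List[float], float]


def input_module(demand: Sequence[float], n_lags: int) -> List[Sample]:
    """Input module: turn the raw daily series into (lag window, next value) samples."""
    return [
        (list(demand[i:i + n_lags]), demand[i + n_lags])
        for i in range(len(demand) - n_lags)
    ]


@dataclass
class MeanIncrementModel:
    """Placeholder model: last observed value plus the average one-day change."""
    mean_step: float = 0.0

    def predict(self, window: Sequence[float]) -> float:
        return window[-1] + self.mean_step


def training_module(samples: List[Sample]) -> MeanIncrementModel:
    """Model training module: fit the placeholder on the prepared samples."""
    steps = [target - window[-1] for window, target in samples]
    return MeanIncrementModel(mean_step=sum(steps) / len(steps))


def prediction_module(model: MeanIncrementModel, latest_window: Sequence[float]) -> float:
    """Prediction module: produce the next-day forecast from the newest window."""
    return model.predict(latest_window)


history = [100.0, 102.0, 104.0, 106.0, 108.0]  # made-up demand values
samples = input_module(history, n_lags=2)
model = training_module(samples)
forecast = prediction_module(model, history[-2:])
print(forecast)
```

Keeping the modules separate means the training module can be re-run as new daily data arrive without changing how inputs are prepared or how forecasts are served.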
The model proposed in this paper can be used as a power system management tool, which has the following practical or energy policy-oriented functions:
- (1) It can predict the electricity demand during the pandemic, and the power sector can reasonably allocate power resources according to the prediction results (such as one-step ahead, two-step ahead, and three-step ahead), so as to ensure the security of power supply during the pandemic;
- (2) The supply and demand of electricity determine the price, and accurate forecasting of electricity consumption can make prices more reasonable. In turn, price setting can balance the relationship between supply and demand and can also help the government to better formulate policies;
- (3) During the pandemic, renewable power generation has performed better and more flexibly, and accurate electricity consumption forecasts are conducive to the integration of renewable energy into the power system.
6. Conclusions and future works
In this work, a hybrid model combining ICEEMDAN, MOGWO, and SVM is presented to predict daily electricity demand during the epidemic. Taking the daily electricity demand in the United States as a case study, the analysis results indicate that the proposed model has higher prediction accuracy and stability than the five benchmark models. The DM test further confirms the superiority of the proposed model. In addition, this paper discusses several key issues and draws some valuable conclusions:
- (1) In the prediction scenario of electricity demand in the United States, the multi-objective optimizer improves the prediction performance of SVM most significantly.
- (2) When DI is the external factor considered by the model, the accuracy and stability of the model are the highest; however, DI is not the factor most correlated with ED, indicating that using the most correlated factor as model input does not guarantee the best prediction accuracy.
- (3) The proposed model not only performs well in one-day ahead prediction but also retains high accuracy in two-day ahead and three-day ahead predictions. Therefore, the proposed model has considerable potential in multi-step prediction, although its performance there is not as good as in one-step prediction.
The model proposed in this paper aims to accurately predict electricity demand during the COVID-19 pandemic or other major global events. Although the case study shows that the proposed model can already obtain high prediction accuracy and stability, some aspects are still worthy of further study. Future works are summarized as follows:
- (a) The prediction is made for each IMF, and the final result is obtained by summing all the results. However, direct addition may not be the best way. Therefore, follow-up research may consider other result processing methods or new denoising methods.
- (b) In future work, more external factors can be considered as model inputs and tested for their rationality.
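To make item (a) concrete, the sketch below contrasts direct addition of per-IMF forecasts with one possible alternative, a least-squares weighted combination. The weighted scheme is an illustrative assumption rather than a method proposed in this paper, and the function names and numbers are made up.

```python
import numpy as np


def direct_sum(component_forecasts):
    """Reconstruct the forecast by adding the per-IMF forecasts (rows = IMFs)."""
    return np.sum(component_forecasts, axis=0)


def fit_combination_weights(component_forecasts, observed):
    """Least-squares weights w minimising ||observed - w @ component_forecasts||."""
    design = np.asarray(component_forecasts, dtype=float).T  # shape (days, n_imfs)
    weights, *_ = np.linalg.lstsq(design, np.asarray(observed, dtype=float), rcond=None)
    return weights


# Made-up forecasts for two components over four days.
imf_forecasts = np.array([
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 0.0, 1.0, 3.0],
])
# Made-up observations, built as 1.0 * first + 0.5 * second component.
observed = 1.0 * imf_forecasts[0] + 0.5 * imf_forecasts[1]

summed = direct_sum(imf_forecasts)
weights = fit_combination_weights(imf_forecasts, observed)
weighted = weights @ imf_forecasts
print(summed, weights, weighted)
```

Direct addition corresponds to fixing every weight at 1. If weights are fitted instead, they should be estimated on a validation period and not on the test period, to avoid overstating accuracy.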
CRediT author statement
Hongfang Lu: Conceptualization, Methodology, Data curation, Writing – original draft. Xin Ma: Investigation, Methodology, Writing – Reviewing and Editing. Minda Ma: Writing – Reviewing and Editing.
Declaration of competing interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Acknowledgments
This article is funded by the National Natural Science Foundation of China (71901184).
References
- 1. Van Doremalen N., Bushmaker T., Morris D.H., Holbrook M.G., Gamble A., Williamson B.N., Lloyd-Smith J.O. Aerosol and surface stability of SARS-CoV-2 as compared with SARS-CoV-1. N Engl J Med. 2020;382(16):1564–1567. doi: 10.1056/NEJMc2004973.
- 2. Otmani A., Benchrif A., Tahri M., Bounakhla M., El Bouch M., Krombi M.H. Impact of covid-19 lockdown on PM10, SO2 and NO2 concentrations in Salé city (Morocco). Sci Total Environ. 2020:139541. doi: 10.1016/j.scitotenv.2020.139541.
- 3. International Energy Agency. Global energy review 2020. 2020.
- 4. Broto V.C., Kirshner J. Energy access is needed to maintain health during pandemics. Nature Energy. 2020:1–3.
- 5. Ahmadi S., hossien Fakehi A., Haddadi M., Iranmanesh S.H. A hybrid stochastic model based Bayesian approach for long term energy demand managements. Energy Strategy Reviews. 2020;28:100462.
- 6. Mason K., Duggan J., Howley E. Forecasting energy demand, wind generation and carbon dioxide emissions in Ireland using evolutionary neural networks. Energy. 2018;155:705–720.
- 7. Bedi J., Toshniwal D. Deep learning framework to forecast electricity demand. Appl Energy. 2019;238:1312–1326.
- 8. Jiang P., Li R., Liu N., Gao Y. A novel composite electricity demand forecasting framework by data processing and optimized support vector machine. Appl Energy. 2020;260:114243.
- 9. Chen Y., Xu X., Koch T. Day-ahead high-resolution forecasting of natural gas demand and supply in Germany with a hybrid model. Appl Energy. 2020;262:114486.
- 10. He Y., Lin B. Forecasting China’s total energy demand and its structure using ADL-MIDAS model. Energy. 2018;151:420–429.
- 11. Potočnik P., Šilc J., Papa G. A comparison of models for forecasting the residential natural gas demand of an urban area. Energy. 2019;167:511–522.
- 12. Di Leo S., Caramuta P., Curci P., Cosmi C. Regression analysis for energy demand projection: an application to TIMES-Basilicata and TIMES-Italy energy models. Energy. 2020;196:117058.
- 13. Tayab U.B., Zia A., Yang F., Lu J., Kashif M. Short-term load forecasting for microgrid energy management system using hybrid HHO-FNN model with best-basis stationary wavelet packet transform. Energy. 2020:117857.
- 14. Kazemzadeh M.R., Amjadian A., Amraee T. A hybrid data mining driven algorithm for long term electric peak load and energy demand forecasting. Energy. 2020:117948.
- 15. Yu D. A two-step approach to forecasting city-wide building energy demand. Energy Build. 2018;160:1–9.
- 16. Bünning F., Heer P., Smith R.S., Lygeros J. Improved day ahead heating demand forecasting by online correction methods. Energy Build. 2020;211:109821.
- 17. Al-Musaylh M.S., Deo R.C., Adamowski J.F., Li Y. Short-term electricity demand forecasting using machine learning methods enriched with ground-based climate and ECMWF Reanalysis atmospheric predictors in southeast Queensland, Australia. Renew Sustain Energy Rev. 2019;113:109293.
- 18. Bedi J., Toshniwal D. Energy load time-series forecast using decomposition and autoencoder integrated memory network. Appl Soft Comput. 2020:106390.
- 19. Karabiber O.A., Xydis G. Forecasting day-ahead natural gas demand in Denmark. J Nat Gas Sci Eng. 2020;76:103193.
- 20. Li R., Jiang P., Yang H., Li C. A novel hybrid forecasting scheme for electricity demand time series. Sustainable Cities and Society. 2020:102036.
- 21. Homod R.Z., Togun H., Abd H.J., Sahari K.S. A novel hybrid modelling structure fabricated by using Takagi-Sugeno fuzzy to forecast HVAC systems energy demand in real-time for Basra city. Sustainable Cities and Society. 2020;56:102091.
- 22. Runge J., Zmeureanu R., Le Cam M. Hybrid short-term forecasting of the electric demand of supply fans using machine learning. Journal of Building Engineering. 2020;29:101144.
- 23. Williams S., Short M. Electricity demand forecasting for decentralised energy management. Energy and Built Environment. 2020;1(2):178–186.
- 24. Colominas M.A., Schlotthauer G., Torres M.E. Improved complete ensemble EMD: a suitable tool for biomedical signal processing. Biomed Signal Process Contr. 2014;14:19–29.
- 25. Torres M.E., Colominas M.A., Schlotthauer G., Flandrin P. A complete ensemble empirical mode decomposition with adaptive noise. 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE; 2011 May. pp. 4144–4147.
- 26. Mirjalili S., Saremi S., Mirjalili S.M., Coelho L.D.S. Multi-objective grey wolf optimizer: a novel algorithm for multi-criterion optimization. Expert Syst Appl. 2016;47:106–119.
- 27. Mirjalili S., Mirjalili S.M., Lewis A. Grey wolf optimizer. Adv Eng Software. 2014;69:46–61.
- 28. Suykens J.A., Vandewalle J. Least squares support vector machine classifiers. Neural Process Lett. 1999;9(3):293–300.
- 29. Liu M., Cao Z., Zhang J., Wang L., Huang C., Luo X. Short-term wind speed forecasting based on the Jaya-SVM model. Int J Electr Power Energy Syst. 2020;121:106056.
- 30. Yan H., Zhang J., Rahman S.S., Zhou N., Suo Y. Predicting permeability changes with injecting CO2 in coal seams during CO2 geological sequestration: a comparative study among six SVM-based hybrid models. Sci Total Environ. 2020;705:135941. doi: 10.1016/j.scitotenv.2019.135941.
- 31. Sadeghi R., Zarkami R., Sabetraftar K., Van Damme P. Use of support vector machines (SVMs) to predict distribution of an invasive water fern Azolla filiculoides (Lam.) in Anzali wetland, southern Caspian Sea, Iran. Ecol Model. 2012;244:117–126.
- 32. Zhao L.T., Zeng G.R. Analysis of timeliness of oil price news information based on SVM. Energy Procedia. 2019;158:4123–4128.
- 33. Lu H., Azimi M., Iseley T. Short-term load forecasting of urban gas using a hybrid model based on improved fruit fly optimization algorithm and support vector machine. Energy Rep. 2019;5:666–677.
- 34. Fan J., Wu L., Ma X., Zhou H., Zhang F. Hybrid support vector machines with heuristic algorithms for prediction of daily diffuse solar radiation in air-polluted regions. Renew Energy. 2020;145:2034–2045.
- 35. Zhang X., Wang J., Zhang K. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Elec Power Syst Res. 2017;146:270–285.
- 36. Li H., Wang J., Li R., Lu H. Novel analysis–forecast system based on multi-objective optimization for air quality index. J Clean Prod. 2019;208:1365–1383.
- 37. Zaremba A., Kizys R., Aharon D.Y., Demir E. Infected markets: novel coronavirus, government interventions, and stock return volatility around the globe. Finance Res Lett. 2020:101597. doi: 10.1016/j.frl.2020.101597.
- 38. Lu H., Ma X., Huang K., Azimi M. Prediction of offshore wind farm power using a novel two-stage model combining kernel-based nonlinear extension of the Arps decline model with a multi-objective grey wolf optimizer. Renew Sustain Energy Rev. 2020;127:109856.
- 39. Lu H., Ma X., Huang K., Azimi M. Carbon trading volume and price forecasting in China using multiple machine learning models. J Clean Prod. 2020;249:119386.
- 40. Deb K., Pratap A., Agarwal S., Meyarivan T.A.M.T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput. 2002;6(2):182–197.
- 41. Liu Y., Shen W., Man Y., Liu Z., Seferlis P. Optimal scheduling ratio of recycling waste paper with NSGAII based on deinked-pulp properties prediction. Comput Ind Eng. 2019;132:74–83.
- 42. Mirjalili S., Lewis A. The whale optimization algorithm. Adv Eng Software. 2016;95:51–67.
- 43. Mafarja M.M., Mirjalili S. Hybrid whale optimization algorithm with simulated annealing for feature selection. Neurocomputing. 2017;260:302–312.
- 44. Kennedy J., Eberhart R. Particle swarm optimization. Proceedings of ICNN’95-international conference on neural networks. vol. 4. IEEE; 1995 November. pp. 1942–1948.
- 45. Noushabadi A.S., Dashti A., Raji M., Zarei A., Mohammadi A.H. Estimation of cetane numbers of biodiesel and diesel oils using regression and PSO-ANFIS models. Renewable Energy. 2020.
- 46. Chen S., Cowan C.F., Grant P.M. Orthogonal least squares learning algorithm for radial basis function networks. IEEE Trans Neural Network. 1991;2(2):302–309. doi: 10.1109/72.80341.
- 47. Lu H., Cheng F., Ma X., Hu G. Short-term prediction of building energy consumption employing an improved extreme gradient boosting model: a case study of an intake tower. Energy. 2020:117756.
- 48. Harvey D., Leybourne S., Newbold P. Testing the equality of prediction mean squared errors. Int J Forecast. 1997;13(2):281–291.
- 49. Yang W., Wang J., Niu T., Du P. A novel system for multi-step electricity price forecasting for electricity market management. Appl Soft Comput. 2020;88:106029.