Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Dec 11;219:119568. doi: 10.1016/j.energy.2020.119568

A hybrid multi-objective optimizer-based model for daily electricity demand prediction considering COVID-19

Hongfang Lu a, Xin Ma b,, Minda Ma c
PMCID: PMC7728554  PMID: 33324028

Abstract

Electricity consumption has been affected due to worldwide lockdown policies against COVID-19. Many countries have pointed out that electricity supply security during the epidemic is critical to ensuring people’s livelihood. Accurate prediction of electricity demand would act a more important role in ensuring energy security for all the countries. Although there have been many studies on electricity forecasting, they did not consider the pandemic, and many works only considered the prediction accuracy and ignored the stability. Driven by the above reasons, it is necessary to develop an electricity consumption prediction model that can be well applied in the pandemic. In this work, a hybrid prediction system is proposed with data processing, modelling, and optimization. An improved complete ensemble empirical mode decomposition with adaptive noise is used for data preprocessing, which overcomes the shortcomings of the original method; a multi-objective optimizer is adopted for ensuring the accuracy and stability; support vector machine is used as the prediction model. Taking daily electricity demand of US as an example, the results prove that the proposed hybrid models are superior to benchmark models in both prediction accuracy and stability. Moreover, selection of input parameters is discussed, and the results indicate that the model considering the daily infections has the highest prediction accuracy and stability, and it is proved that the proposed model has great potential in real-world applications.

Keywords: Electricity demand, Multi-objective optimizer, Prediction, Support vector machine, Denoising, COVID-19

Abbreviations

AFD

adaptive fourier decomposition

ANN

artificial neural network

AR

autoregressive

ARIMA

autoregressive integrated moving average model

ARMA

autoregressive moving average

BA

bat algorithm

BP

back propagation

CB

cyclic behaviour

CEEMDAN

complete ensemble empirical mode decomposition with adaptive noise

CNN

convolutional neural network

COVID-19

coronavirus disease 2019

CS

cuckoo search

DBN

deep belief network

DD

daily deaths

DE

differential evolution

DI

daily infections

DM

Diebold-Mariano

ED

electricity demand

EIA

U.S. Energy Information Administration

ELM

extreme learning machine

EMD

empirical mode decomposition

EEMD

ensemble empirical mode decomposition

FAR

functional autoregressive

FFT

fast fourier transform

FNN

feed-forward neural network

GRSI

Government Response Stringency Index

GWO

grey wolf optimizer

HHO

Harris hawks optimization

HVAC

heating, ventilation, and air conditioning

ICEEMDAN

improved complete ensemble empirical mode decomposition with adaptive noise

IEA

International Energy Agency

IMF

intrinsic mode functions

KCC

Kendall correlation coefficient

KM

kernel machine

KMM

kernel machine with memory

LightGBM

light gradient boosting machine

LR

linear regression

LSSVM

least-squares support-vector machine

LSTM

long short-term memory network

MA

moving average

MAE

mean absolute error

MAPE

mean absolute percentage error

MARS

multivariate adaptive regression spline

MIDAS

mixed data sampling

MLP

multi-layer perceptron

MOGWO

multi-objective grey wolf optimizer

MOMVO

multi-objective multi-verse optimizer

NN

neural network

NSGAII

non-dominated sorting genetic algorithm II

PAA

piecewise aggregated approximation

PCC

Pearson correlation coefficient

PI

piecewise interpolation

PSO

particle swarm optimization

RBFNN

radial basis function neural network

RLF

residential load factor

RMSE

root mean square error

RNN

recurrent neural network

RWF

random walk forecasting

SAR

seasonal autoregressive

SARS-Cov-2

severe acute respiratory syndrome coronavirus 2

SCA

sine cosine optimization

SCC

Spearman correlation coefficient

SPA

seasonal patterns adjustment

SSA

salp swarm algorithm

STDRE

standard deviation of relative error

SVM

support vector machine

SWPT

stationary wavelet packet transform

TLM

two-reservoir model with linear memory

TNM

two-reservoir model with nonlinear memory

TRM

two-reservoir model

VMD

variational mode decomposition

WHO

World Health Organization

WOA

whale optimization algorithm

1. Introduction

1.1. Background

On December 31, 2019, a group of pneumonia cases was reported in Wuhan, China. It was confirmed as SARS-Cov-2 and was officially named COVID-19 by the WHO [1]. With the popularity of COVID-19, the global situation has become severe. On March 11, 2020, WHO announced COVID-19 as a pandemic. In the first quarter of 2020, the epidemic affected almost everyone in the world. According to data from Johns Hopkins University, as of May 29, nearly 6 million people have been infected and approximately 365,000 people have died worldwide (see Fig. 1 ). In this context, many countries have introduced lockdown policies to prevent people from contact and thus control the spread of the epidemic [2]. At the same time, the global energy sector is also profoundly affected by the epidemic. According to the statistics of IEA, except for the slight increase in the demand for renewable energy in the first quarter, the rest of the energy has declined to vary degrees, of which oil has fallen the most, reaching 9%. Moreover, electricity demand fell by 2.5% [3].

Fig. 1.

Fig. 1

Distribution of COVID-19 infections worldwide.

Some scholars have suggested that energy and medical care are equally important during the epidemic. Among these energy sources, electricity may be most relevant to people’s lives. Many countries have issued policies requiring the power supply department to provide uninterrupted power supplies and allow users to delay payment [4]. In this case, the requirements for the power supply department to accurately allocate power resources are higher than usual. Thus, in electricity management, the importance of accurate prediction of electricity demand is self-evident.

1.2. Related works and problem statement

In recent years, numerous studies on energy demand forecasting have emerged. Table 1 lists the relevant works in recent two years and gives information for the utilized models. It reveals that energy demand prediction methods can be roughly divided into data-driven models and physical simulation models. The data-driven model is used more because the physical simulation model needs to consider too many external factors, and it is difficult to collect relevant data. Moreover, some data-driven regression models still need to collect data on massive related factors. On the other hand, most data-driven models use machine learning or deep learning algorithms, and in recent years many scholars have adopted hybrid models to make predictions because single models have some drawbacks. Although these models have obtained more accurate prediction results in some cases, the electricity demand prediction is still facing the following problems:

  • 1)

    according to the literature review, most scholars only considered the accuracy of the prediction model, not the stability;

  • 2)

    electricity demand prediction studies did not consider major global events such as COVID-19; weather and other factors considered may not be essential factors in this particular period. In other words, these models may lack applicability in the context of major global events.

Table 1.

Studies related to energy demand prediction in the past two years.

Reference Prediction target Model Factors considered Models for comparison
[5] Energy demand in Iran A hybrid model combines scenario analysis and Bayesian approach Historical energy demand, primary energy production, population, GDP, natural gas price, gasoline price
[6] Energy demand in Ireland Covariance matrix adaptation evolutionary strategy Historical energy demand PSO, DE, BP, MA, RWF, LR
[7] Electricity demand in India LSTM Historical electricity demand considering cluster analysis ANN, RNN, SVM
[8] Electricity demand in New South Wales and Singapore VMD-SSA-SVM Historical electricity demand SVM, SSA-SVM, SSA-LSSVM, ARIMA
[9] Natural gas demand in Germany FAR-CNN Historical natural gas demand FAR-LSTM, FAR, CNN, LSTM, MLP, AR, SAR, LightGBM
[10] Energy demand in China ADL-MIDAS Historical energy demand
[11] Residential natural gas demand LR, KM, KMM, TRM, TLM, TNM Historical maximum daily demand, weather
[12] Energy demand in Basilicata and Italy Regression analysis End user-related factors
[13] Load demand SWPT–HHO–FNN Historical load demand, date attribute, weather PSO-ANN, PSO-LSSVM, BP
[14] Energy demand ARIMA-ANN-PSO-SVM Historical energy demand ARIMA, ANN, PSO-SVM
[15] Building energy demand Engineering simulation Factors related to building energy
Reference Prediction target Model Factors considered Models for comparison
[16] Heating demand ANN with an online learning method Historical heating demand, air temperature Thermal model, LR, SVM, Huber regressor, orthogonal matching pursuit, SGD regressor, decision tree regression, random forest
[17] Electricity demand ANN, MARS, MLR, ARIMA Historical electricity demand, climate
[18] Energy load VMD-LSTM Historical energy load SVM, RNN, DBN, EMD-LSTM
[19] Natural gas demand ARIMA, MLP, ANN, ELM Historical natural gas demand, weather, biogas production, date attribute, electricity price, gas price, solar radiation
[20] Electricity demand AFD-FFT-SPA-SCA-SVM Historical electricity demand RWF, ARMA, SVM, SCA-SVM, AFD-SCA-SVM, SPA-SCA-SVM, BP, ELM
[21] HVAC system energy demand Takagi-Sugeno fuzzy-NN Historical HVAC energy demand, weather RLF
[22] Electricity demand of fans ANN Historical electricity demand SVM
[23] Electricity demand CB-PAA-PI Historical electricity demand Holt-Winters seasonal model, seasonal naïve model

1.3. Motivations, contributions, and article organization

Driven by the problems described in Section 1.2, the purpose of this paper is to develop a model that can be better applied to the prediction of electricity demand during COVID-19. In this work, COVID-19-related factors are considered in the model design, and the applicability of factors as model inputs is discussed. In the model design, ICEEMDAN is utilized as a data preprocessing tool, MOGWO is used to optimize the SVM, and the accuracy and stability are considered. Thus, the work is innovative in that it discusses the adaptability of various factors related to COVID-19 in the model application. Besides, the proposed prediction model takes into account both accuracy and stability. The main contributions of this paper are as follows:

  • (1)

    A hybrid model is proposed to predict the daily electricity demand during the COVID-19 pandemic.

  • (2)

    The proposed model is compared with benchmark models regarding prediction accuracy and stability.

  • (3)

    The influences of the denoising method and optimizer on prediction are discussed.

  • (4)

    The applicability of factors related to COVID-19 as inputs to the prediction model is discussed.

  • (5)

    The results of one-step ahead, two-step ahead, and three-step ahead predictions are compared.

The rest of this paper is organized as follows. Section 2 introduces the relevant theories and implementation of the proposed model. Section 3 describes the collected data and prediction steps. Section 4 gives the prediction results. Section 5 discusses four critical issues related to this work. Finally, the primary conclusions and future works are summarized in Section 6.

2. Methods

The proposed in this paper, ICEEMDAN-MOGWO-SVM, is a hybrid model with the structure of “data cleaning method-optimizer-basic prediction model”. The relevant theories associated with the different methods are introduced in this section.

2.1. Improved complete ensemble empirical mode decomposition with adaptive noise

Data decomposition breaks down the raw data into multiple datasets but does not distort the original data. The decomposed data is usually smoother, which is helpful for the execution of prediction. ICEEMDAN is a method that appeared in 2014 [24], and its predecessors include EMD, EEMD, CEEMDAN, and so on [25]. EMD is an adaptive signal time-frequency processing method suitable for nonlinear signals. It can decompose complex signals into a finite number of IMFs, and each IMF contains local characteristic signals of different time scales of the original signal; EEMD is developed based on EMD to overcome the mode mixing problem; CEEMDAN eliminates the noise involved in the reconstructed signal by adding white noise, and improves the efficiency of EEMD; ICEEMDAN is another innovation based on CEEMDAN, it has high efficiency and can avoid the generation of spurious modes. Its implementation process is as follows:

  • (1)

    Perform I times EMD decomposition on the original signal:

s(i)=s+λ0E1[w(i)] (1)

where s is the original signal; Ek() is the k-th mode component generated by EMD; w(i) is Gaussian noise; s(i) is noise added signal; λ is noise amplitude.

  • (2)

    Calculate the first residue and the first mode:

r1=M[s(i)] (2)
IMF1¯=sr1 (3)

where rk is the k-th residue; IMFk¯ is the k-th mode; M() is local average of signal.

  • (3)

    Calculate the second residue and the second mode:

r2=M{r1+λ1E2[w(i)]} (4)
IMF2¯=r1M{r1+λ1E2[w(i)]} (5)
  • (4)

    Calculate the k-th residue and the k-th mode:

rk=M{rk1+λk1Ek[w(i)]} (6)
IMFk¯=rk1rk (7)
  • (5)

    Repeat step (4) until the termination condition of decomposition is satisfied.

2.2. Multi-objective grey wolf optimizer

MOGWO is developed based on the grey wolf optimizer (GWO) [26]. GWO is a meta-heuristic algorithm inspired by the hunting behavior of wolves [27]. Each wolf in the population can be regarded as a solution to the problem. The optimal solution, optimal solution, suboptimal solution, and other solutions correspond to the wolf swarm’s four levels. When wolves find their prey, they approach them. Its position equations are:

{J=|MLp(p)Lw(p)|Lw(p+1)=Lp(p)NJ (8)

where J is the distance between the wolf and prey; M and N are coefficient vectors; Lp and Lw are the position vectors of the prey and grey wolf, respectively; p is the current iteration.

GWO keeps the best three solutions, and continuously updates the position of the grey wolf by the following formula to find the best solution:

{Jα=|M1Lα(p)Lw(p)|Jβ=|M2Lβ(p)Lw(p)|Jγ=|M3Lγ(p)Lw(p)|L1=Lα(p)N1JαL2=Lβ(p)N2JβL3=Lγ(p)N3Jγ (9)
Lp(p+1)=13(L1+L2+L3) (10)

where α, β and γ are grey wolves of different levels.

MOGWO has two changes compared to GWO [26]. First, the update method has changed, and an archive is introduced to store the current best individual. After each iteration, the new individual generated is compared with the individual in the archive. In addition, to avoid too many similar individuals, all individuals are grouped according to the distance of the objective function value. Secondly, the selection mechanism of the leader wolf has changed. That is, using roulette to directly select the leader wolf in the archive, solving the problem that it is difficult to directly determine three non-dominant solutions through Pareto method. The probability of each hypercube can be calculated by Eq. (11). More information can be found in the literature [26].

Pi=Lic (11)

where c is a constant; Li is the number of Pareto optimal solutions; Pi is the probability of the hypercube.

2.3. Support vector machine

SVM is one of the most popular machine learning models. It has a strong statistical foundation and is very suitable for small samples. Related theories can refer to the literature [28]. SVM has a wide range of applications in energy [29], environment [30], hydrology [31], and economy [32]. It is not only used as a target model for research, but also as a benchmark model. In regression problems, the training set can be defined as [33]:

{(xj,yj)|xj,yjRn,j=1,2,n} (12)

where xj and yj are input and output, respectively.

The specific form of the SVM model is:

f(x)=ϖTϕ(x)+c (13)

where ϖ is weighted vector; ϕ(x) is nonlinear mapping function; c is deviator.

In the SVM model, the penalty factor and the kernel width are two hyperparameters that affect the prediction performance. Many scholars use optimizers to optimize the original SVM model. For example, Fan et al. [34] utilized WOA, BA, and PSO to optimize SVM to predict solar radiation; Zhang et al. [35] employed CS to optimized SVM to predict short-term electricity load; Li et al. [36] used MOMVO to optimize LSSVM to predict air quality indicator.

3. Empirical analysis

In this section, the validity of the proposed model is verified through a case study. Considering that the United States is the second-largest energy-consuming country in the world, and is the most affected in this pandemic (as of May 29, 2020, the number of infected people accounts for about 30% of the world), the case study is set up for the daily electricity demand of the United States.

3.1. Data collection and description

In this work, the daily electricity demand data of the United States come from EIA (https://www.eia.gov/). Since the proposed model considers the impact of COVID-19, data on the number of daily infections, the number of daily deaths, and GRSI are collected. The data for these three factors are derived from Our World In Data (https://ourworldindata.org/). It is worth noting that GRSI is an indicator of the degree of lockdown proposed by Oxford University after the outbreak [37]. It is a comprehensive indicator of nine factors, as shown in Fig. 2 . Its total score is 100, and the higher the score, the stricter the lockdown. The horizons of the four types of data are daily, from January 19 to May 15, 2020 (see Fig. 3 ). Their statistical description is shown in Table 2 .

Fig. 2.

Fig. 2

Nine factors considered by GRSI.

Fig. 3.

Fig. 3

Datasets of electricity demand and three COVID-19-related factors.

Table 2.

The statistical description of the four datasets.

Dataset Unit Data amount Maximum Minimum Mean Standard deviation
ED MWh 118 12,283,918 8,518,041 9893992.51 843837.09
DI 118 48,529 0 12016.01 13542.94
DD 118 4928 0 728.02 991.41
GRSI 118 73.57 0 40.53 31.36

3.2. Prediction system

3.2.1. Overall system

As shown in Fig. 4 , the prediction system includes three parts: (1) data collection, (2) data preprocessing, and (3) optimization and prediction. The details of data collection are described in Section 3.1. The rest parts are described in Section 3.2.2.

Fig. 4.

Fig. 4

Overall prediction system.

3.2.2. Prediction steps

  • (1)

    Data decomposition

ICEEDMDAN is used to decompose the raw data into multiple IMFs, so that the decomposed data can be in smaller ranges (smaller fluctuation ranges), as shown in Fig. 4 and Table 3 . According to the related theory of ICEEMDAN, the termination of decomposition needs to satisfy the condition that the last IMF has less than three local extrema. However, for some data, the termination condition may not be met, so the maximum number of decompositions is set to 5000. If 5000 decomposition times still do not meet the condition, the decomposition is terminated. In the end, the raw dataset is broken down into six IMFs.

Table 3.
Ranges of the raw dataset and decomposed datasets.
  • (2)
    Data normalization
Dataset Raw data IMF1 IMF2 IMF3 IMF4 IMF5 IMF6
Maximum 12,283,918 498434.6 529416.8 261793.4 176162.7 739302.2 10,188,100
Minimum 8,518,041 −487,111 −487,698 −214,320 −170,305 −225,773 9,110,830

Because the dimensions of datasets may be different, to eliminate the influence of dimensions and improve the accuracy and speed of prediction, normalization is executed using the following equation:

dn=drdmindmaxdmin (14)

where dn is normalized data at interval [0,1]; dr is raw data, respectively; dmin and dmax are minimum and maximum of the raw data, respectively.

  • (3)

    Optimization and prediction

After ICEEMDAN completes the decomposition and normalization, the decomposed data is input into the model for optimization and training operations. These two operations are performed in the training set and are synchronized. That is, the training of SVM and the optimization of SVM are carried out at the same time. When the optimization is completed, the training of SVM is also terminated, as shown in Fig. 5 . In the conventional optimization (single-objective optimization) problem, scholars usually establish one objective function to minimize the prediction error in the training set. Because the multi-objective optimization adopted in this paper considers both prediction accuracy and stability, two objective functions are established:

min={ObjAcc=MAPEtraining=1Stk=1St|AkPkAk|ObjSt=std(AkPk) (15)

where ObjAcc and ObjSt are the objective functions for prediction accuracy and stability, respectively; MAPEtraining is the MAPE in the training set; St is the sample size of the training set; Ak and Pk are the actual and prediction values at time k; std is the population standard deviation.

Fig. 5.

Fig. 5

The optimization and training processes of MOGWO-SVM.

The SVM optimized by MOGWO is output from the training set and imported into the test set for prediction. Therefore, before optimization and prediction, the data set needs to be segmented. In this work, the ratio of the training set to test set is 7:3. Besides, the one-day ahead prediction is performed in this case study (see Fig. 6 ), and the relevant theories are shown in the literature [38].

Fig. 6.

Fig. 6

The principle of one-day ahead prediction.
  • (4)
    Denormalization and addition

Since the prediction is performed in the normalized datasets, denormalization processing is required to convert them into real values after the prediction is completed. The equation for denormalization is [39]:

pi=(1pn)dmin+pndmax (16)

where pi is real prediction value; pn is normalized prediction value.

Because the prediction is made in each IMF after being decomposed, according to the principle of ICEEMDAN, the final prediction result is the sum of the prediction results in each IMF:

Pf=m=1cpIMFm (17)

where Pf is the final prediction result; pIMFm is prediction results in each IMF; c is the number of IMFs decomposed from the raw data.

  • (5)

    Error analysis

Three commonly used error metrics are used to measure the prediction performance: MAE, RMSE, and MAPE. Their expressions are as follows:

MAE=1zt=1z|RtPt| (18)
RMSE=1zt=1z(RtPt)2 (19)
MAPE=1zt=1z|RtPtRt|×100% (20)

where Rt and Pt are the real and prediction values at time t, respectively; z is the sample size.

3.3. Benchmark models

To highlight the advantages of the proposed model, this paper defines NSGAII-SVM, WOA-SVM, PSO-SVM, SVM, and RBFNN as the benchmark model for comparison. Of the five models, four are based on SVM, and RBFNN is a classic neural network model. The theory and application of these models and the reasons for choosing them are shown in Table 4 .

Table 4.

Reasons for choosing benchmark models and their theories and applications.

Model Reason for being selected Theories Applications
NSGAII-SVM NSGA-II is a classic multi-objective optimizer. It adopts fast non-dominated sorting and elite strategy. [40] [41]
WOA-SVM WOA is one of the most popular meta-heuristic optimizers that has appeared in recent years. [42] [43]
PSO-SVM PSO is a classic meta-heuristic optimizer. [44] [45]
SVM The most primitive SVM model. [28] [33]
RBFNN One of the most popular neural network models. [46] [47]

4. Prediction results

4.1. Prediction accuracy

Fig. 7 shows the prediction results of each model in the test set from the time series. It reveals that the prediction results of the proposed model are basically consistent with the actual values, while the prediction results of WOA-SVM and PSO-SVM are consistent with the actual values in the overall trend. The prediction results of the other three models deviate greatly from the actual values. Such conclusions can also be obtained from Table 5 . MAE, RMSE, and MAPE for the proposed model are 45134.7 MWh, 54865.1 MWh, and 0.49%, respectively, which are lower than other benchmark models. MAPEs for WOA-SVM and PSO-SVM are 2.24% and 2.99, respectively. MAPEs for NSGAII-SVM, SVM, and RBFNN are 7.28%, 5.62%, and 5.26%, respectively. It can be concluded that the proposed model has the highest prediction accuracy among the evaluated models.

Fig. 7.

Fig. 7

The prediction results of each model in the test set.

Table 5.

The error of each model.

Model MAE (MWh) RMSE (MWh) MAPE (%)
ICEEMDAN-MOGWO-SVM 45134.7 54865.1 0.49
NSGAII-SVM 657551.0 713818.0 7.28
WOA-SVM 203117.0 261994.0 2.24
PSO-SVM 273117.0 355819.0 2.99
SVM 507161.0 574946.0 5.62
RBFNN 481283.0 577849.0 5.26

Note: Bold denotes the data with best performance in the current dataset.

4.2. Prediction stability

Fig. 8 shows the relative errors of every point for six models in the test set. It indicates that the relative errors of the proposed models are all around Y=0, the maximum value is 1.06%, and the minimum value is −0.21%. Compared with the proposed model, the distributions of relative error points for benchmark models are more chaotic, and relative error the ranges are larger. STDRE is employed to evaluate the stability of the prediction comprehensively, it can be implied from Fig. 9 that the STDRE of the proposed model is 0.389%, which is much lower than other models. It indicates that the prediction stability of the proposed model is the best among the evaluated models.

Fig. 8.

Fig. 8

The relative error of the prediction. (a) ICEEMDAN-MOGWO-SVM; (b) NSGAII-SVM; (c) WOA-SVM; (d) PSO-SVM; (e) SVM; (f) RBFNN.

Fig. 9.

Fig. 9

STDRE of each model in the test set.

5. Discussions

5.1. DM test

Although some error indicators can reflect the difference in prediction accuracy of the models, the results obtained may be misleading because some of the difference in accuracy is caused by the data’s feature. Therefore, using a DM test can further measure the difference in accuracy between the models [48]. Suppose the two competing models are M1 and M2, respectively, and the true series is yt. The prediction result of the first model is ytM1, and the prediction result of the second model is ytM2, then their prediction errors et1 and et2 are:

{et1=ytytM1et2=ytytM2 (21)

The null hypothesis H0and the alternative hypothesis H1 are:

H0:E[SL(et1)]=E[SL(et2)] (22)
H1:E[SL(et1)]E[SL(et2)] (23)

where SL is loss function of the square error.

DM test statistics are calculated according to Eq. (24):

DM=t=1q[SL(et1)SL(et2)]/qv2/qv2 (24)

where v2 is an estimation of the variance of [SL(et1)SL(et2)].

Table 6 shows that the proposed model’s accuracy level is very different from the benchmark model, so it further proves that the proposed model is far superior to the benchmark model in prediction accuracy.

Table 6.

The DM test statistics of each benchmark model.

Model DM test
NSGAII-SVM 15.2315∗
WOA-SVM 3.9445∗
PSO-SVM 4.6559∗
SVM 11.6551∗
RBFNN 4.7936∗

Note: ∗ is 5% significance level.

5.2. The impact of the denoising method and optimizer

The model proposed in this paper is developed based on SVM by introducing a denoising method and optimizer. In this section, the influences of the denoising method and the optimizer on the original model are further discussed. Thus, two other models are considered: MOGWO-SVM and ICEEMDAN-SVM. Table 7 implies that MAPE of SVM can be reduced by about 9.4% when SVM is combined with ICEEMDAN, and 72.2% when MOGWO is combined with SVM. Similar rules can be found in STDRE. They indicate that the multi-objective optimizer is better than the noise reduction method in improving the prediction performance of the original SVM. The same conclusion can be obtained by comparing ICEEMDAN-SVM, MOGWO-SVM, and ICEEMDAN-MOGWO-SVM. Nevertheless, the denoising method is still vital in some problems that require high accuracy and stability. As shown in Fig. 10 , after the introduction of ICEEMDAN in MOGWO-SVM, the prediction accuracy and stability are greatly improved on the original basis.

Table 7.

Accuracy and stability metrics of four models.

Case MAE (MWh) RMSE (MWh) MAPE (%) STDRE (%)
ICEEMDAN-MOGWO-SVM 45134.7 54865.1 0.49 0.389
MOGWO-SVM 141955.0 172917.0 1.56 1.616
ICEEMDAN-SVM 458448.0 529195.5 5.09 3.152
SVM 507161.0 574946.0 5.62 3.256

Note: Bold denotes the data with best performance in the current dataset.

Fig. 10.

Fig. 10

The influence of ICEEMDAN on the prediction accuracy and stability of MOGWO-SVM. (a) Accuracy; (b) Stability.

5.3. COVID-19-related input variables

In the case study, ED is the predicted target, and the three factors of DI, DD, and PRSI are considered. Correlation analysis (to improve the reliability of the results, three correlation coefficients are used, as shown in Eqs. (25), (26), (27)) proves that these three factors are indeed closely related to ED, as shown in Table 8 .

PCC=cov(X,Y)σXσY (25)
SCC=16×i=1N|R(Xi)R(Yi)|2N3N (26)
KCC=2(ncnd)N×(N1) (27)

where cov(X,Y) is covariance between X and Y; σX and σY are the standard deviation of X and Y, respectively; N is number of samples; R(Xi) and R(Yi) are the ranking of Xi and Yi in their respective column vectors; nc and nd are the number of concordant pairs and discordant pairs, respectively.

Table 8.

Correlation between three COVID-19-related factors and ED.

Variable Correlation coefficient
PCC SCC KCC
DI −0.7861 −0.8847 −0.7041
DD −0.6561 −0.8637 −0.6754
GRSI −0.8410 −0.8668 −0.7074

Filtering input variables is critical in prediction. Excess or missing factors may make the prediction model perform poorly. In this section, six more cases are set up to explore the influence of the COVID-related factors on the prediction results, as shown in Table 9 . Fig. 11 and Table 10 imply that when ED and DI are considered in the model input, its prediction accuracy and stability are the highest. However, according to the correlation analysis, the correlation between GRSI and ED is the strongest, which indicates that the factors with strong correlation as the input of the model do not mean the best prediction results. For the prediction of electricity demand in the United States during the COVID-19 pandemic, the prediction accuracy ranking of models considering different factors is shown in Fig. 12 .

Table 9.

Model inputs corresponding to seven cases.

Case Input(s)
Case 1 (original) ED, DI, DD, GRSI
Case 2 ED, DI
Case 3 ED, DD
Case 4 ED, GRSI
Case 5 ED, DI, DD
Case 6 ED, DI, GRSI
Case 7 ED, DD, GRSI

Fig. 11.

Fig. 11

The prediction results of seven cases.

Table 10.

Accuracy and stability metrics for seven cases.

Case MAE (MWh) RMSE (MWh) MAPE (%) STDRE (%)
Case 1 (original) 45134.7 54865.1 0.49 0.389
Case 2 38937.1 41227.6 0.42 0.143
Case 3 73448.3 76074.6 0.80 0.220
Case 4 41361.7 44057.3 0.45 0.160
Case 5 56840.1 72454.7 0.62 0.591
Case 6 50032.1 58502.2 0.54 0.355
Case 7 115,952 126308.0 1.26 0.540

Note: Bold denotes the data with best performance in the current dataset.

Fig. 12.

Fig. 12

The prediction accuracy ranking of models considering different factors.

5.4. One-step ahead vs. multi-step ahead prediction

In the practical application of daily electricity demand prediction, if managers can predict more days, the benefits for management are more significant [49]. Thus, this paper additionally examines the performance of the proposed model in two-day ahead and three-day ahead predictions. As shown in Fig. 13 , the MAPE for the two-day ahead prediction is 2.06%, and the MAPE for the three-day ahead prediction is 1.86%. Although their performance is not as good as one-day ahead prediction, the prediction accuracy is still about 2%, indicating that the proposed model not only has high accuracy in one-step prediction, but also has great application potential in multi-step prediction. In practical applications, single-step prediction results and multi-step prediction results can be combined to measure the future short-term electricity consumption. If the forecast result is higher than the planned consumption, a policy to restrict electricity use can be introduced.

Fig. 13.

Fig. 13

One-day ahead, two-day ahead, three-day ahead prediction results.

5.5. Considerations of real-world applications

The test of real-world data indicates that the model proposed in this work can be used to predict the daily electricity consumption in a pandemic. In the real-world applications, the real-time prediction can be carried out by establishing a prediction system. The system includes three modules: input module, model training module, and prediction module. Note that the prediction assumes that there is no significant change in energy policy.

The model proposed in this paper can be used as a power system management tool, which has the following practical or energy policy-oriented functions:

  • (1)

    It can predict the electricity demand during the pandemic, and the power sector can reasonably allocate the power resources according to the prediction results (such as one-step ahead, two-step ahead, and three-step ahead), so as to ensure the security of power supply during the pandemic;

  • (2)

    The supply and demand of electricity determine the price, and accurate forecasting of electricity consumption can make prices more reasonable. On the other hand, price setting can balance the relationship between supply and demand, and can also help the government to better formulate policies;

  • (3)

    During the pandemic, the performance of renewable energy power generation is more outstanding and more flexible, and accurate electricity consumption forecasts are conducive to the integration of renewable energy and the power system.

6. Conclusions and future works

In this work, a hybrid model combines ICEEMDAN, MOGWO, and SVM is presented to predict daily electricity demand during the epidemic. Taking the daily electricity demand in the United States as a case study, the analysis results indicate that the proposed model has higher prediction accuracy and stability than the other five benchmark models. DM test further proved the superiority of the proposed model. In addition, this paper discusses several key issues and draws some valuable conclusions:

  • (1)

    In the prediction scenario of electricity demand in the United States, the multi-objective optimizer improves the prediction performance of SVM most significantly.

  • (2)

    When the external factor considered by the model is DI, the accuracy and stability of the model are the highest; however, DI is not the most correlated factor with ED, indicating that the most correlated factor as the input of the model does not mean the best prediction accuracy.

  • (3)

    The proposed model not only performs well in one-day ahead prediction, but also has high accuracy in two-day ahead and three-day ahead predictions. Therefore, the proposed model has a higher potential in multi-step prediction although the prediction performance is not as good as one-step prediction.

The model proposed in this paper aims to be able to accurately predict electricity demand during the COVID-19 pandemic or major global events. Although it has been proved by practice that the proposed model can already obtain high prediction accuracy and stability, there are still some aspects worthy of further study. Thus, future works are summarized as follows:

  • (a)

    The prediction is made in each IMF, and the final result is obtained by summing up all the results. However, direct addition may not be the best way. Therefore, follow-up research may consider using other result processing methods or developing new denoising methods.

  • (b)

    In future work, more external factors can be considered as input to the model and test their rationality.

CRediT author statement

Hongfang Lu: Conceptualization, Methodology, Data curation, Writing – original draft. Xin Ma: Investigation, Methodology, Writing- Reviewing and Editing. Minda Ma: Writing- Reviewing and Editing

Declaration of competing interest

Authors declare that there is no conflict of interest due to the publication of this paper.

Acknowledgments

This article is funded by the National Natural Science Foundation of China (71901184).

References

  • 1.Van Doremalen N., Bushmaker T., Morris D.H., Holbrook M.G., Gamble A., Williamson B.N., Lloyd-Smith J.O. Aerosol and surface stability of SARS-CoV-2 as compared with SARS-CoV-1. N Engl J Med. 2020;382(16):1564–1567. doi: 10.1056/NEJMc2004973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Otmani A., Benchrif A., Tahri M., Bounakhla M., El Bouch M., Krombi M.H. Impact of covid-19 lockdown on PM10, SO2 and NO2 concentrations in Salé city (Morocco) Sci Total Environ. 2020:139541. doi: 10.1016/j.scitotenv.2020.139541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.International Energy Agency . 2020. Global energy review 2020. [Google Scholar]
  • 4.Broto V.C., Kirshner J. Energy access is needed to maintain health during pandemics. Nature Energy. 2020:1–3. [Google Scholar]
  • 5.Ahmadi S., hossien Fakehi A., Haddadi M., Iranmanesh S.H. A hybrid stochastic model based Bayesian approach for long term energy demand managements. Energy Strategy Reviews. 2020;28:100462. [Google Scholar]
  • 6.Mason K., Duggan J., Howley E. Forecasting energy demand, wind generation and carbon dioxide emissions in Ireland using evolutionary neural networks. Energy. 2018;155:705–720. [Google Scholar]
  • 7.Bedi J., Toshniwal D. Deep learning framework to forecast electricity demand. Appl Energy. 2019;238:1312–1326. [Google Scholar]
  • 8.Jiang P., Li R., Liu N., Gao Y. A novel composite electricity demand forecasting framework by data processing and optimized support vector machine. Appl Energy. 2020;260:114243. [Google Scholar]
  • 9.Chen Y., Xu X., Koch T. Day-ahead high-resolution forecasting of natural gas demand and supply in Germany with a hybrid model. Appl Energy. 2020;262:114486. [Google Scholar]
  • 10.He Y., Lin B. Forecasting China’s total energy demand and its structure using ADL-MIDAS model. Energy. 2018;151:420–429. [Google Scholar]
  • 11.Potočnik P., Šilc J., Papa G. A comparison of models for forecasting the residential natural gas demand of an urban area. Energy. 2019;167:511–522. [Google Scholar]
  • 12.Di Leo S., Caramuta P., Curci P., Cosmi C. Regression analysis for energy demand projection: an application to TIMES-Basilicata and TIMES-Italy energy models. Energy. 2020;196:117058. [Google Scholar]
  • 13.Tayab U.B., Zia A., Yang F., Lu J., Kashif M. Short-term load forecasting for microgrid energy management system using hybrid HHO-FNN model with best-basis stationary wavelet packet transform. Energy. 2020:117857. [Google Scholar]
  • 14.Kazemzadeh M.R., Amjadian A., Amraee T. Energy; 2020. A hybrid data mining driven algorithm for long term electric peak load and energy demand forecasting; p. 117948. [Google Scholar]
  • 15.Yu D. A two-step approach to forecasting city-wide building energy demand. Energy Build. 2018;160:1–9. [Google Scholar]
  • 16.Bünning F., Heer P., Smith R.S., Lygeros J. Improved day ahead heating demand forecasting by online correction methods. Energy Build. 2020;211:109821. [Google Scholar]
  • 17.Al-Musaylh M.S., Deo R.C., Adamowski J.F., Li Y. Short-term electricity demand forecasting using machine learning methods enriched with ground-based climate and ECMWF Reanalysis atmospheric predictors in southeast Queensland, Australia. Renew Sustain Energy Rev. 2019;113:109293. [Google Scholar]
  • 18.Bedi J., Toshniwal D. Energy load time-series forecast using decomposition and autoencoder integrated memory network. Appl Soft Comput. 2020:106390. [Google Scholar]
  • 19.Karabiber O.A., Xydis G. Forecasting day-ahead natural gas demand in Denmark. J Nat Gas Sci Eng. 2020;76:103193. [Google Scholar]
  • 20.Li R., Jiang P., Yang H., Li C. Sustainable Cities and Society; 2020. A novel hybrid forecasting scheme for electricity demand time series; p. 102036. [Google Scholar]
  • 21.Homod R.Z., Togun H., Abd H.J., Sahari K.S. A novel hybrid modelling structure fabricated by using Takagi-Sugeno fuzzy to forecast HVAC systems energy demand in real-time for Basra city. Sustainable Cities and Society. 2020;56:102091. [Google Scholar]
  • 22.Runge J., Zmeureanu R., Le Cam M. Hybrid short-term forecasting of the electric demand of supply fans using machine learning. Journal of Building Engineering. 2020;29:101144. [Google Scholar]
  • 23.Williams S., Short M. Electricity demand forecasting for decentralised energy management. Energy and Built Environment. 2020;1(2):178–186. [Google Scholar]
  • 24.Colominas M.A., Schlotthauer G., Torres M.E. Improved complete ensemble EMD: a suitable tool for biomedical signal processing. Biomed Signal Process Contr. 2014;14:19–29. [Google Scholar]
  • 25.Torres M.E., Colominas M.A., Schlotthauer G., Flandrin P. 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP) IEEE; 2011. May). A complete ensemble empirical mode decomposition with adaptive noise; pp. 4144–4147. [Google Scholar]
  • 26.Mirjalili S., Saremi S., Mirjalili S.M., Coelho L.D.S. Multi-objective grey wolf optimizer: a novel algorithm for multi-criterion optimization. Expert Syst Appl. 2016;47:106–119. [Google Scholar]
  • 27.Mirjalili S., Mirjalili S.M., Lewis A. Grey wolf optimizer. Adv Eng Software. 2014;69:46–61. [Google Scholar]
  • 28.Suykens J.A., Vandewalle J. Least squares support vector machine classifiers. Neural Process Lett. 1999;9(3):293–300. [Google Scholar]
  • 29.Liu M., Cao Z., Zhang J., Wang L., Huang C., Luo X. Short-term wind speed forecasting based on the Jaya-SVM model. Int J Electr Power Energy Syst. 2020;121:106056. [Google Scholar]
  • 30.Yan H., Zhang J., Rahman S.S., Zhou N., Suo Y. Predicting permeability changes with injecting CO2 in coal seams during CO2 geological sequestration: a comparative study among six SVM-based hybrid models. Sci Total Environ. 2020;705:135941. doi: 10.1016/j.scitotenv.2019.135941. [DOI] [PubMed] [Google Scholar]
  • 31.Sadeghi R., Zarkami R., Sabetraftar K., Van Damme P. Use of support vector machines (SVMs) to predict distribution of an invasive water fern Azolla filiculoides (Lam.) in Anzali wetland, southern Caspian Sea, Iran. Ecol Model. 2012;244:117–126. [Google Scholar]
  • 32.Zhao L.T., Zeng G.R. Analysis of timeliness of oil price news information based on SVM. Energy Procedia. 2019;158:4123–4128. [Google Scholar]
  • 33.Lu H., Azimi M., Iseley T. Short-term load forecasting of urban gas using a hybrid model based on improved fruit fly optimization algorithm and support vector machine. Energy Rep. 2019;5:666–677. [Google Scholar]
  • 34.Fan J., Wu L., Ma X., Zhou H., Zhang F. Hybrid support vector machines with heuristic algorithms for prediction of daily diffuse solar radiation in air-polluted regions. Renew Energy. 2020;145:2034–2045. [Google Scholar]
  • 35.Zhang X., Wang J., Zhang K. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Elec Power Syst Res. 2017;146:270–285. [Google Scholar]
  • 36.Li H., Wang J., Li R., Lu H. Novel analysis–forecast system based on multi-objective optimization for air quality index. J Clean Prod. 2019;208:1365–1383. [Google Scholar]
  • 37.Zaremba A., Kizys R., Aharon D.Y., Demir E. Infected markets: novel coronavirus, government interventions, and stock return volatility around the globe. Finance Res Lett. 2020:101597. doi: 10.1016/j.frl.2020.101597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lu H., Ma X., Huang K., Azimi M. Prediction of offshore wind farm power using a novel two-stage model combining kernel-based nonlinear extension of the Arps decline model with a multi-objective grey wolf optimizer. Renew Sustain Energy Rev. 2020;127:109856. [Google Scholar]
  • 39.Lu H., Ma X., Huang K., Azimi M. Carbon trading volume and price forecasting in China using multiple machine learning models. J Clean Prod. 2020;249:119386. [Google Scholar]
  • 40.Deb K., Pratap A., Agarwal S., Meyarivan T.A.M.T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput. 2002;6(2):182–197. [Google Scholar]
  • 41.Liu Y., Shen W., Man Y., Liu Z., Seferlis P. Optimal scheduling ratio of recycling waste paper with NSGAII based on deinked-pulp properties prediction. Comput Ind Eng. 2019;132:74–83. [Google Scholar]
  • 42.Mirjalili S., Lewis A. The whale optimization algorithm. Adv Eng Software. 2016;95:51–67. [Google Scholar]
  • 43.Mafarja M.M., Mirjalili S. Hybrid whale optimization algorithm with simulated annealing for feature selection. Neurocomputing. 2017;260:302–312. [Google Scholar]
  • 44.Kennedy J., Eberhart R. vol. 4. IEEE; 1995. November). Particle swarm optimization; pp. 1942–1948. (Proceedings of ICNN’95-international conference on neural networks). [Google Scholar]
  • 45.Noushabadi A.S., Dashti A., Raji M., Zarei A., Mohammadi A.H. Renewable Energy; 2020. Estimation of cetane numbers of biodiesel and diesel oils using regression and PSO-ANFIS models. [Google Scholar]
  • 46.Chen S., Cowan C.F., Grant P.M. Orthogonal least squares learning algorithm for radial basis function networks. IEEE Trans Neural Network. 1991;2(2):302–309. doi: 10.1109/72.80341. [DOI] [PubMed] [Google Scholar]
  • 47.Lu H., Cheng F., Ma X., Hu G. Short-term prediction of building energy consumption employing an improved extreme gradient boosting model: a case study of an intake tower. Energy. 2020:117756. [Google Scholar]
  • 48.Harvey D., Leybourne S., Newbold P. Testing the equality of prediction mean squared errors. Int J Forecast. 1997;13(2):281–291. [Google Scholar]
  • 49.Yang W., Wang J., Niu T., Du P. A novel system for multi-step electricity price forecasting for electricity market management. Appl Soft Comput. 2020;88:106029. [Google Scholar]

Articles from Energy (Oxford, England) are provided here courtesy of Elsevier

RESOURCES