Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2022 Oct 5:1–18. Online ahead of print. doi: 10.1007/s10479-022-05004-8

Forecasting oil commodity spot price in a data-rich environment

Sabri Boubaker 1,2,3, Zhenya Liu 4,5,6,, Yifan Zhang 4
PMCID: PMC9534472  PMID: 36217322

Abstract

Statistical properties that vary with time represent a challenge for time series forecasting. This paper proposes a change point-adaptive-RNN (CP-ADARNN) framework to predict crude oil prices with high-dimensional monthly variables. We first detect the structural breaks in predictors using the change point technique, and subsequently train a prediction model based on ADARNN. Using 310 economic series as exogenous factors from 1993 to 2021 to predict the monthly return on the WTI crude oil real price, CP-ADARNN outperforms competing benchmarks by 12.5% in terms of the root mean square error and achieves a correlation of 0.706 between predicted and actual returns. Furthermore, the superiority of CP-ADARNN is robust for Brent oil price as well as during the COVID-19 pandemic. The findings of this paper provide new insights for investors and researchers in the oil market.

Keywords: Change point detection, Recursive neural network, Oil price prediction, COVID-19

Introduction

Crude oil is the world’s most actively traded commodity for its strategic importance to national economies. Fluctuations in crude prices have been shown to significantly impact the global economy in various respects, such as currencies of crude-related countries (Kayalar et al. 2017) and stock return dynamics (Chang and Yu 2013). Similar to other commodities, the spot market price of crude oil is determined, to some extent, by supply and demand. However, additional factors such as financial markets, economic growth, geopolitical uncertainty, and occasional incidents also impact crude oil price formation (Conrad et al. 2014; Baumeister and Kilian 2016; Miao et al. 2017).

Since the oil crisis initiated by the OPEC embargo in 1973, several oil shocks have resulted in the rapid rise in the price of crude oil. For example, the crude oil price rose drastically from $13 per barrel to $34 between 1979 and 1980 due to geopolitical conflict in the Middle East. The oil commodity price is sensitive to the production and supply, as well macroeconomic and financial factors (Ameur et al. 2021). However, the dynamics of time series of these factors are not easily predictable. They are typically considered to be non-stationary, which makes accurately predicting crude oil prices a challenging task.

To address this problem, we propose a deep framework of change point-adaptive recursive neural network (CP-ADARNN) to build a comprehensive forecasting model. The non-stationarity of time series dictates that the data distribution may shift over time, which could explain the inferior performance of conventional prediction models. For example, in the case of crude price, the influential determinants may change with the movement of economic conditions while the “price formulation rules” remain relatively stable. Our objective is to capture the variations within exogenous variables while keeping the conditional distribution of the dependent variable constant. Du et al. (2021) propose the ADARNN model to address this temporal covariate shift problem. To expand upon that model, we introduce the change point detection method as the first module to split the original data into periods with different distributions.

In this paper, we collect 310 monthly variables, including the macroeconomic factors of major economies, demand and supply factors of crude oil, proxy variables measuring economic uncertainty, and financial factors. The sample period covers more than 25 years, from March 1993 to December 2021. We then construct a balanced panel using these selected variables as predictors of crude oil prices. Given the high-dimensional data, we split the first 80% sample as a training set and detect three structure breaks located in April 2000, October 2007, and December 2009. Using the second module of CP-ADARNN, we train a model for one-step-ahead forecasting and the out-performance of proposed model provides new insights for participators in the oil market.

Our empirical results show the superiority of CP-ADARNN for the West Texas Intermediate (WTI) oil price prediction in terms of evaluation metrics including the mean absolute error (MAE), the root mean square error (RMSE), the out-of-sample R squared, and the information coefficient (IC). Specifically, CP-ADARNN outperforms the second-best model by 12.5% in terms of RMSE and achieves an out-of-sample R squared of 0.496 and an IC of 0.706. To check the robustness of the proposed model, we conduct the investigation similarly with the Brent spot crude oil price and design an empirical study to take account of the impact of COVID-19 pandemic. All the results confirm the better performance of CP-ADARNN compared with candidate models.

Our study contributes to the existing literature in several ways. First, we introduce the ADARNN approach in oil price prediction. This allows us to account for the non-stationarity of time series, which dictates that the distribution of data may change over time. Second, building on the original research of Du et al. (2021), we embed the change point detection into the framework of ADARNN to address the problem of distribution distance measuring. The technique is used to detect the changes in means for the high-dimensional panel with a structure of relatively strong cross-sectional dependence. Third, the CP-ADARNN outperforms the conventional time series and deep learning benchmarks on the prediction. Fourth, the proposed model performs well when taking into account the impacts of the COVID-19 pandemic.

The remainder of this paper is organized as follows. In Sect. 2, we review the literature for oil price forecasts and time-variant models. In Sect. 3, we propose the CP-ADARNN framework. In Sect. 4, we present the prediction performance on the WTI oil price against other models. In Sect. 5, we check the robustness of the framework for the Brent oil price and considering the impacts of COVID-19. The conclusion of this paper is provided in Sect. 6.

Literature review

Crude oil price prediction has drawn significant interest for decades. The first strand of literature on the topic is based on the traditional time series models, such as linear regression, cointegration analysis, and GARCH models (e.g., Ye et al. 2006; He et al. 2010; Mohammadi and Su 2010). Coppola (2018) employs a vector error correction model (VECM) to forecast oil price movements, defining the equilibrium error as the long-run spread between spot price and future price. Kilian and Murphy (2014) propose a structural vector autoregressive (SVAR) model of the global market for crude oil price forecast which includes shocks from speculative demand and flows of supply and demand. The second strand of literature is based on machine learning and continues to grow; it includes, for example, support vector machine (SVM, Xie et al. 2006), tree-based methods (Gumus and Kiran 2017), and ensemble learning (Dbouk and Jamali 2018). Neural networks (NN) are particularly popular for their ability to model complex features. Tang and Zhang (2012) build a multiple wavelet recurrent neural network (MWRNN) model to consider the effects of both trend and random components of the crude oil price. Li et al. (2019) extract hidden characteristics in news media content using a convolutional neural network (CNN) to obtain more accurate forecasts of oil prices.

However, few models above considered the time-variant dynamics in time series, particularly the distributional features. Structural breaks and regime switching approaches have received attention in studies on crude oil over the last decade (e.g., Zhang and Zhang 2015; Kuck and Schweikert 2017; Liu et al. 2020). In this paper, we introduce a change point detection technique to comprehensively capture the distribution changes in economic variables. Change point approaches have been widely used in economic fields, including the stock market (Horváth et al. 2021a), real estate (Horváth et al. 2021b), and macroeconomics (Barigozzi et al. 2018). To forecast oil prices, panel data, particularly high-dimensional panel data, are typically utilized. Common shocks like policy changes, market crashes, and other systemic events have the potential to affect all cross-sections simultaneously. Bai (2010) develops a least-squares type change point test for high-dimensional panel data while assuming the independence of time series observation on cross-section. Horváth and Hušková (2012) propose CUSUM-type tests to detect breaks in high dimensional factor models in terms of means. However, the strong dependence may negate the effectiveness of their methods. We follow Horváth et al. (2021c) in which they provide a valid finite-sample method for high-dimensional panel data with potentially strong cross-sectional dependence.

Methodology

In this section, we construct a time series (TS) prediction model based on the change point (CP) detection technique and adaptive recursive neural network (ADARNN) approach. Our proposed method CP-ADARNN depicts a distribution-based forecasting framework with labeled TS data. We begin with a formal formulation of the problem we aim to solve.

Time series prediction under distribution shift

Given a labeled time series of T cross-sectional observations D={xt,yt}t=1T, where xt=(xt1,,xtp)Rp is the tth p-dimensional observation, and yt is the corresponding label. A time series prediction problem is to construct a model M:xtyt on the sequentially arriving r-step observations {xt}t=T+1T+r generated by the same time series. In most time-invariant prediction approaches, the entire time series is assumed to follow the homogeneous distribution, that is, {xt}t=1T+r is from an identical data generating process. Consequently, it is reasonably to expect that the trained model M that performs well on the labeled training set is efficient on the unlabeled testing set. However, such assumption is too strong to hold in real data. For example, in the scenario of oil price forecasting, the observed oil-related predictors labeled with the realized price or return exhibit, at least between different periods, the heterogeneity in distributions. To address this issue, we need to relax the time-invariant assumption in a more practical way.

In this paper, we follow the state-of-the-art ADARNN framework of Du et al. (2021) in a specific way to cooperate with the oil price prediction problem. The approach allows changes in distribution between the training and testing sets and also over the training segments themselves, which is refereed as Temporal Covariate Shift (TCS). We specify the r-step ahead time series prediction model as follows:

Definition 1

(r-step ahead TS prediction model under TCS) For a time series with labels D={xt,yt}t=1T, where yt is the scalar label of the tth cross-sectional observation xt, and xtRp is a p-dimensional features. D is supposed to be split into K intervals, that is, D={Dk}k=1K, where Dk={xt,yt}t=nk+1nk+1, n1=0 and nK+1=T. Such partition satisfies the TCS if observations in the same interval follow the identical distribution PDk(x,y), while for different intervals that 1ijK, PDi(x)PDj(x) and PDi(yx)=PDj(yx). Given the labeled training set, our aim is to learn an r-step time series prediction model M:xtyt to forecast with the future r observations, Dtest={xt}t=T+1T+r. We assume a shift in distribution occurs between training and testing sets, i.e., PDtest(x)PDk(x) and PDtest(yx)=PDk(yx) for any 1kK.

A prediction framework under TCS is intuitive in finance. It is commonly accepted that the financial factors could vary from time to time with the fluctuation of the market, which means the data distributions of x (P(x)) are different across periods. While the economic laws or patterns P(yx) may maintain consistent. One problem remaining is the determination of the unknown number of periods K. In the original design of ADARNN, the authors use the method known as Temporal Distribution Characterization (TDC) to capture the diversity among the K periods by maximizing the average period-wise distributional distances. However, in such way, the important parameter K is required to be given exogenously.

Change point detection

We embed the CP detection technique into the framework of ADARNN to automatically determine the number of periods with different distributions. The predictor {xt}t=1T under a single-valued prediction model could be considered as a balanced panel, xi,t,1iN,1tT which may have the certain breaks in distribution across time. We define the break as the change in mean of the panel following Horváth et al. (2021c) in which the cross-sectional dependence is allowed.

Definition 2

(Change in mean of cross-dependent panel) The Temporal Covariate Shift (TCS) panel xi,t follows the basic factor model,

xi,t=μi,k+λift+ei,t,ck-1<tck,1kK, 1

where 1iN, the panel xi,t is split into K sub-periods through K-1 change points, 1<c1<<cK-1<T. μi,1,μi,2,,μi,K-1 are the means of the observations within each sub-period. Given d common factors ftRd,1tT, and the corresponding loadings λiRd, the λift presents the cross-sectional dependence. {ei,t,1tT} are the error terms.

We are interested in detecting the change points described above, equivalently, testing the null hypothesis, there is no change in means:

H0:μi,1=μi,2==μi,K-1,1iN

against the alternative,

HA:μi,jμi,j+1,j{1,,K-1},for somei{1,,N}.

We apply the well-established statistic in Horváth et al. (2021c) to detect the potential changes in means of dependent panel. Modifying the aggregated CUSUM process from Horváth and Hušková (2012), the authors propose a less restrictive form of underlying functional of

VN,T(u)=i=1NSi2(u)-uT(T-uT)τT(T-τT)1{uT1}

where Si(u)=t=1uT(xi,t-T-1t=1Txi,t) is the CUSUM process of the cross-sectional unit.

Based on their asymptotic results, we conclude the following theorem,

Theorem 1

If some necessary assumptions1 hold, then we have that

1Tω¯N,T1/2VN,T(u)D[0,1]Δ(u) 2

under H0 and

1Tω¯N,T1/2sup0u1|VN,T(u)|P 3

under HA. The detector statistic is hence considered as

vN,T=1Tω¯N,T1/2sup0u1|VN,T(u)| 4

consequently the change point location is given as

c^V=Tθ^N,T,andθ^N,T=min{u:|VN,T(u)|=sup0v1|VN,T(v)|} 5

where ω¯N,T is the adjusted estimator of the normalizing sequence.2

Based on the technique, we could detect the multiple panel change points in the predictors {xt}t=1T through the standard binary segmentation (see Csörgö and Horváth 1997).

Time series prediction with CP-ADARNN

Given the sub-period partition using the CP technique, we cooperate with the second module of ADARNN, Temporal Distribution Matching (TDM, Du et al. 2021) to learn the common mechanism within different periods by matching the distributions. Ideally, the learned prediction model M:xtyt should be more adaptive for the unobserved test data than alternative approaches that take only statistical information into consideration. We begin with formulating the loss of prediction for the parameterized model, Lpre, in a conventional way,

Lpre(θ)=1Kk=1K1|Dk|tDkloss(yt,M(xt;θ)) 6

where |Dk| is the number of observations in the k th split period, loss(·,·) is a loss function, for example, the MSE loss. M(xt;θ) denotes the parameterized prediction model.

However, such loss fails to assess the common predictive mechanism shared by different periods. We follow the framework of ADARNN in which the distributions of each pair of periods (e.g., Di and Dj) are matching with methods commonly performed in researches on domain adaptation. We denote the V hidden states RNN with q-dimensional features with H={ht}t=1VRV×q. To adaptively match the distributions between cells of RNN, the TDM is formally introduced as,

Definition 3

(Temporal distribution matching) For a pair of periods (Di,Dj), the loss function of TDM is given as:

LTDM(Di,Dj;θ)=t=1Vαi,jtd(hit,hjt;θ) 7

where αi,j represents the relative importance of the V hidden states of RNN between the periods of Di and Dj. By combining Eqs. (6) and (7), the terminal objective function of TDM with one-layer RNN is:

L(θ,α)=Lpre(θ)+λ2K(K-1)ijLTDM(Di,Dj;θ,α) 8

where λ is the parameter to trade off. α is learned using the boosting-based information evaluation (see Du et al. 2021).

We propose the change point-ADARNN (CP-ADARNN) framework to learn a comprehensive TS prediction model. Figure 1 presents the overview of CP-ADARNN.

Fig. 1.

Fig. 1

The framework of CP-ADARNN. This framework contains two modules shown as the change point detection in Panel (a) and temporal distribution matching in Panel (b)

To sum up, the proposed framework is consists of two main parts. Given the training data, CP-ADARNN first uses the CP technique to split the sample into periods with different distributions. One may note that the original ADARNN applies a TDC module as the first step to capture the distribution switching within time series, but suffering the exogenous choice of one vital parameter, K, which can be automatically determined in CP. Then the TDM module performs the distribution matching to establish a prediction model M with which we can forecast the time series with new data.

Empirical results

In this section, we investigate the prediction performance of the CP-ADARNN on crude oil real price and make a comparison to alternative candidate models. Utilizing real world data, we forecast the WTI crude oil monthly return on real spot price using numerous influential macroeconomic and financial factors. Using our proposed method, we first detect three structural changes in the cross-sectional means of predictors. We then conduct a forecast embedding our breaks with ADARNN. Overall, we find that our CP-ADARNN outperforms benchmark models in crude oil price prediction.

Data description

It is well-known that there are many assets concerning crude oil around the world. In this empirical study, we choose the WTI spot price as a global benchmark for crude oil. As one of the largest trading crude oil commodities, and with numerous derivatives targeting, the WTI is considered to be a well-priced and widely-investigated financial instrument (e.g., Chai et al. 2018; Ewald et al. 2019; Boubaker et al. 2021). The monthly WTI spot crude oil price sequence is obtained from the U.S. Energy Information Administration (EIA). The nominal price data are deflated using the U.S. Producer Price Index (PPI) for all commodities, which is obtained from the FRED database of the St. Louis FED.

A diverse set of macroeconomic and financial variables is used in the oil price prediction. To construct the data set of comprehensive exogenous factors, we start with the high-dimensional macroeconomic panel data, FRED-MD (McCracken 2021), which consists of 127 time series. The FRED-MD data set covers eight categories in the US, including output and income, labor market, housing, consumption, orders and inventories, money and credit, interest and exchange rates, prices, and stock market. We select 93 stock and flow time series based on previous studies (Zagaglia 2010; Naser 2016; Zhao et al. 2017). They consist of the crude oil production and stocks, oil products consumption, imports/exports, and other key information on rigs for major countries provided by the EIA. To account for global and regional risks, we include the Economic Policy Uncertainty (EPU, Baker et al. 2016) indexes for 8 countries, Geopolitical Risk (GPR, Caldara and Iacoviello 2021) indexes for 39 countries, and 2 aggregated indexes for geopolitical threats and acts. Forty-one series of financial variables containing market and commodities indexes and crude oil future prices are also included. Finally, our pool of predictors contains 310 time series that include variables concerning fundamentals, demand and supply of crude oil, macroeconomics, and financial markets. We process the 310 series based on the nature of data as input variables.3 Details of the selected variables in the empirical research are provided in the online materials including the data sources and results of the ADF tests.

There are two reasons for choosing these variables to forecast the WTI spot crude oil price. First, they are either directly related to the mechanism of price discovery for crude oil, or they have indirect impacts on the market conditions. Second, the proposed CP-ADARNN framework as a deep learning approach, inherits the powerful ability to model high-dimensional data. The various factors, despite the presence of noise, contain relatively complete information on oil price dynamics.

Our final sample period spans more than 25 years, from March 1993 to December 2021 (T=346 monthly observations in total). For the purpose of modeling and forecasting, we split the data into two parts. The training samples consist of the first 80% of observations (from March 1993 to February 2016 with T1=276 months) of predictors and WTI oil price and the remaining (T2=70 months) data are used for testing based on Yu et al. (2014). Our proposed CP-ADARNN model as well as competing models are recursively estimated, that is, after forecasting we add an observation at the end of the training sample. To distinguish the differences between the rolling window and anchored window, we refer to Morales-Arias and Moura (2013).

Performance evaluation criterion and benchmarks

For clarification purposes, the forecasting variable in this paper is the log-return (yt=log(Yt/Yt-1)) of the real oil price (Yt). The price series is often observed with the properties of non-stationarity and persistence; hence, directly predicting prices is much easier than the returns, which may distort the results of the comparison. We focus on the short-term accuracy of prediction; therefore, the lag order of the predictors r is set to one.

Performance evaluation metrics

In order to evaluate the one-step prediction performance of the models from comprehensive respects, we select two conventional performance metrics, the mean absolute error (MAE) and the root mean square error (RMSE) which have been frequently used in previous studies (e.g., Tang et al. 2015; Lerner and Seru 2021), formulated as:

MAE=1T2-1t=T1+2T1+T2|yt-y^t|RMSE=1T2-1t=T1+2T1+T2(yt-y^t)2

where T1 and T2 are the length of the training and testing set respectively, and yt and y^t are the actual and predicted returns on real oil price. Obviously, smaller MAE and RMSE indicate a better prediction model.

In addition, we use another popular evaluation metric, the out-of-sample R-squared, i.e., ROOS2, which compares different forecast approaches with a benchmark model. In this paper, we use the random walk model as benchmark, which is solidly based on the Efficiency Market Hypothesis (EMH), and consequently, the ROOS2 statistic is defined as:

ROOS2=100×1-t=T1+2T1+T2(yt-y^t)2t=T1+2T1+T2(yt-y^tRW)2

where the y^tRW is the random walk predictions. The ROOS2 measures the performance of candidate prediction approach relative to the trivial prediction, hence a higher and positive ROOS2 indicates a better accuracy of forecasting, compared with the benchmark as ROOS2=0. For the evaluation of return prediction, the information coefficient (IC) has been widely adopted in prior studies, see for example Guerard et al. (2021). The IC describes the correlation between the predicted and realized asset returns as:

IC=corr(yt,y^t)

where y^t is the return forecast and yt is the actual return. The IC is calculated on the testing set. Undoubtedly, higher ICs indicate a better prediction.

Candidate models

We use some popular and comparative time series forecasting models in our empirical study to compare with the prediction performance of the proposed framework. They are namely (1) univariant time series models, including the random walk (RW) along with random walk with drift (RWd), and the Autoregressive Integrated Moving Average (ARIMA); (2) regularization models, including the LASSO, the Elastic net (ENet, Zou and Hastie 2005), and the Ridge regression; (3) regression tree model as Random Forest (RF); (4) neural network models, including RNN variants like the long short-term memory (LSTM), the gated recurrent unit (GRU).

Specifically, we clarify the setting of candidate models in WTI crude oil return forecasting as follows.

Model 1–2: For the RW models, we assume that log real price of oil follows a random walk process (or with drift), that is, the one-step ahead oil real price log-change is a martingale difference sequence with E(yt|Ft)=0 (or E(yt|Ft)=drift×1).

Model 3: For the univariate ARIMA model, we assume the log returns follows an ARIMA(pdq) process and determine the optimal lag orders pdq according to the Bayesian Information Criterion (BIC). For all cases of modeling log returns, the order of integrated term d equals zero which admits an ARMA(pq) model.

Model 4–6: For the regularization models, the LASSO, ENet and the Ridge regression use parameter shrinkage and variable selection to limit the dimension of regressors which solves the OLS inefficiency problem. For a shrinkage parameter λ0 and a positive combination parameter α between 0 (Ridge) and 1 (LASSO), the optimization problem for the regularization model is given as:

β^=argmin{βi}i=1k1Tt=1Tyt+1-j=1kxj,tβj2+λPα(β) 9

and the penalty function Pα(β) is:

Pα(β)=j=1kα|βj|+(1-α)2βj2 10

where βj is the jth linear coefficients for the predictor xj,t. The optimal shrinkage parameter λ can be determined by cross-validation (CV). In this study, we use the 10-fold CV to compute the prediction error and compare different models using training set data. Note that higher λ shrinks the coefficients more intensely and λ=0 results in an OLS regression. For the weighting parameter α, we choose 0, 0.25 and 1, representing the Ridge, ENet and LASSO regressions, respectively.

Model 7: For the regression tree model, we consider the RF model that constructs multiple decision trees within training set with bootstrap aggregation (bagging). We consider the number of leaves as {5,10,20,50,100} and choose the optimal one based on the mean squared error (for regression trees) for out-of-bag observations. The number of trees is set to be 100.

Model 8–10: For the neural network (NN) models, we consider two widespread used RNN variants and one state-of-the-art ADARNN model. The standard GRU/LSTM is specified with two GRU/LSTM layers and two fully connected (FC) layers and with the dimension of the hidden state to be 64 for each. ADARNN could involve with RNN structure like GRU. We follow Du et al. (2021) and set the network consisting of two-layer GRUs and the dimension of the hidden state to be 64. The GRU layers are followed by two FC layers with output to be r=1. We use the Adam optimizer with learning rate of 0.005 and set the batch size to be 24. To measure the difference of distributions, we exogenously set the number of domain in training set to be 4 and use the cosine distance in Du et al. (2021).

Model 11: Our proposed CP-ADARNN splits the training set into different sub-periods based on the changes in cross-sectional means of predictors, which is the first step of ADARNN. To demonstrate the superiority of the framework, we keep all the settings same with ADARNN but the way to identify the different distributional periods.

The CP-ADARNN allows the distribution shifting between periods split by the change points detected within predictor panel. To extend this assumption to benchmarks, we also consider the CP- versions of those non-adaptive models by replacing the entire training samples with the sub-training-sample which is split by change point and closest to the testing samples.4

We standardize the predictors using the min-max normalization. All models are performed using MATLAB R2020a software and the ADA models are trained using Python 3.8.5, torch 1.11.0 and CUDA 11.3 on a server with a Ryzen5 2600X CPU, an NVIDIA 1080 GPU and RAM size of 16 GB.

Change points detected within the predictor panel

We perform the first module of the proposed CP-ADARNN framework to detect the common breaks in the mean of the predictors (non-normalized) following Horváth et al. (2021c). As discussed in Subsect. 3.2, we apply the change-in-mean test based on the detection statistic vN,T for the high-dimensional cross-dependent panel. To accommodate potential multiple changes, a binary segmentation method is used in the fixed sample of the training set. Accordingly, we find the first and largest change in the predictors structure as October 2007 which divides the whole training sample into two parts—from March 1993 to October 2007 (176 monthly observations) and from November 2007 to February 2016 (100 monthly observations). In addition, two sub-period changes are detected in April 2000 and December 2009.

Our predictor panel covers fundamentals for oil price determination. Within the context of temporal covariate shift, structure breaks could occur in the common factors. With the relatively stable economic mechanism, measured by the conditional distribution P(yx), the marginal distribution of oil price may change before and after the breaks. As shown in Table 1, the first detected change point is October 2007. While it is beyond the scope of this paper to investigate the cause of breaks, that point marks the early phase of 2008 financial crisis during which the values of the macroeconomic variables vary and even reverse. Not surprisingly, the second break point found in the predictor panel is December 2009. These two change points cut the process of crisis out of the whole training sample. The third detected break locates in April 2000 which coincides with the beginning of a prolonged period of oil price increases in the early 2000s.

Table 1.

Tests for common breaks in predictors of the training sample (1993/03–2016/02)

Periods Location of breaks Relevant events
Mar 1993–Feb 2016 Oct 2007 (0.00) Early phase of financial crisis
Mar 1993–Oct 2007 Apr 2000 (0.00) Beginning of oil price rising
Nov 2007–Feb 2016 Dec 2009 (0.00) Ending phase of financial crisis

Note: This table presents the detected change points in a pool of predictors. The critical values of each test are calculated following the bootstrap procedures provided by Horváth et al. (2021c). The corresponding p-values are shown in parentheses

The superiority of CP-ADARNN

In this section, we perform the one-step out-of-sample predictions for the proposed model as well as ten candidate models. For each prediction, we calculate four evaluation metrics for the purpose of comparison. The actual value of the WTI spot crude oil real price and the one-step forecasts of each model are shown in Fig. 2. We compute the real price forecast for the month t+1 as the observed price in month t multiplied by the predicted return. There are two results to be noted in Fig. 2: first, most prediction models provide reasonable forecasts in the testing sample; second, the forecasts are more accurate for milder market period than during intense rises and falls.

Fig. 2.

Fig. 2

The WTI spot crude oil real price and one-step-ahead forecasts. The figure plots the monthly forecasts of CP-ADARNN and the alternative models. The shadowed areas are sub-periods split by the detected change points

Table 2 presents the prediction accuracy metrics of the different models for one-step ahead forecasting. The full-training panel contains models estimated on the full training set. For all but the CP-ADARNN model, the CP- version panel contains models estimated on the last sub-period segmented by change points.

Table 2.

Prediction accuracy of oil real return forecasting models

Models Full-training CP- version
MAE RMSE ROOS2 IC MAE RMSE ROOS2 IC
RW 0.092 0.149 0.092 0.149
RWd 0.106 0.182 -0.496 0.253 0.105 0.149 -0.005 0.273
ARIMA 0.085 0.145 0.043 0.218 0.088 0.145 0.044 0.245
LASSO 0.075 0.132 0.211 0.555 0.091 0.148 0.005 0.031
ENet 0.074 0.126 0.278 0.574 0.091 0.148 0.005 0.016
Ridge 0.070 0.120 0.345 0.597 0.091 0.148 0.005 0.029
RF 0.075 0.126 0.280 0.543 0.090 0.148 0.008 0.052
LSTM 0.108 0.188 -0.607 0.327 0.132 0.192 -0.664 -0.063
GRU 0.118 0.202 -0.856 0.300 0.113 0.165 -0.235 0.042
ADARNN 0.087 0.155 -0.085 0.458 0.072 0.105 0.496 0.706

Note:Other than the proposed CP-ADARNN framework, models with “CP-” mean that their training samples are replaced with the last sub-period split by the change points. The bold values represent the best performance among the 20 models in terms of MAE, RMSE, ROOS2 and IC

Firstly, comparing four types of competing models—univariant time series models, regularization models, regression tree, and neutral networks—it is evident that the regularization models like LASSO, ENet, and Ridge regressions are capable of forecasting individual time series in a data-rich environment. Secondly, we split the training set into sub-periods by applying the change point detection technique, assuming that the factors of oil prices are with distribution shifting. However, the time-variant distributions do not enhance the prediction performance of the models, even for the regularization models, and the CP- versions fail due to fewer observations. Thirdly, the proposed CP-ADARNN model outperforms candidate models and achieves the lowest RMSE, highest ROOS2 and IC. The possible reason for this is that the CP-ADARNN model captures the stable mechanism of crude oil price formation as the determinants change. Unlike other CP- models based on consistent information, we assume that the distributions of factors could be different from all the periods in the CP-ADARNN framework. As shown in Table 2, the original ADARNN model exhibits lower prediction accuracy, which indicates the change points detected in cross-sectional means provide more appropriate partitions than the distribution distance measures in ADARNN.

Additionally, the RW model is typically used as a benchmark to be beaten in price forecasting. The EMH argues that the markets are efficient, and that asset prices reflect all available information. Therefore, the best predictor of future price is the present price, which is equivalent to a zero-return forecast. We compute the ROOS2 based on the RW forecasts. Surprisingly, all the neutral network models have negative ROOS2. We conjecture that the monthly data frequency and the invariant distribution assumption over such a long horizon weaken the predictability of NN models. Specifically, the CP-ADARNN model provides a much more precise forecast which generates the lowest RMSE of 0.105 (12.5% improvement from the second-best model Ridge and 32.3% improvement from ADARNN) the highest ROOS2 of 0.496 and IC of 0.706.

The superiority of the CP-ADARNN model suggests the potential occasional distribution shifting in macroeconomic variables; it also indicates the importance of considering time-variant data properties for prediction. In addition, the previous study of ADARNN raised the interest in prediction under TCS, and the empirical results show that a more appropriate method to distinguish the distributional difference may enhance the capacity of the model. Our proposed framework for forecasting oil commodity price may provide helpful insights to investors and industry stakeholders.

Robustness test

In this section, we perform two empirical studies to verify the robustness of the CP-ADARNN model for oil price prediction. The results of the Brent oil forecast and also the COVID sample reaffirm the superiority of the proposed framework.

Forecasting the Brent spot crude oil price

To determine if the prediction performance of the CP-ADARNN framework presented in the previous section is sensitive to the choice of crude oil underlying, we replicate the procedures with the WTI spot crude oil price with the Brent spot price. As the most prevalent global benchmarks for crude oil with a similar quality, the two spot prices are highly correlated with a correlation of 0.9919 from March 1993 to December 2021.5 However, the WTI and Brent spot oil prices began to decouple in 2011, with the spread between the real prices reaching 30.58 dollar per barrel in September 2011, as shown in Fig. 3.

Fig. 3.

Fig. 3

The WTI and Brent spot crude oil real prices. The figure plots the monthly spot real prices and spread. The sample period covers from January 1993 to December 2021

As argued by Geyer-Klingeberg and Rathgeber (2021), regarding the Brent-WTI spread, the influence of paper market trading on the physical spot market heavily increased after 2010. It is natural to consider the effect of the gap between underlyings. We keep the predictors constant, therefore, the change points detected in the panel remain the same. The CP-ADARNN and alternative forecasting models are performed on the Brent monthly crude oil price. The resulting evaluation metrics are presented in Table 3. It can be seen that the proposed prediction model with the lowest RMSE, highest ROOS2 and IC, exhibits higher forecasting accuracy than the other candidate models

Table 3.

Prediction accuracy of Brent oil real return forecasting models

Models Full-training CP- version
MAE RMSE ROOS2 IC MAE RMSE ROOS2 IC
RW 0.092 0.146 0.092 0.146
RWd 0.105 0.172 -0.404 0.296 0.116 0.158 -0.173 0.200
ARIMA 0.084 0.137 0.118 0.348 0.091 0.146 -0.011 0.041
LASSO 0.071 0.113 0.394 0.648 0.090 0.145 0.009 0.061
ENet 0.073 0.117 0.354 0.621 0.091 0.145 0.005 0.010
Ridge 0.069 0.111 0.415 0.659 0.091 0.145 0.009 0.080
RF 0.078 0.125 0.268 0.526 0.090 0.144 0.019 0.147
LSTM 0.145 0.208 -1.046 0.516 0.122 0.171 -0.382 0.085
GRU 0.102 0.169 -0.353 0.382 0.104 0.148 -0.027 0.301
ADARNN 0.090 0.123 0.288 0.676 0.072 0.106 0.472 0.698

Note:Other than the proposed CP-ADARNN framework, models with “CP-” mean that their training samples are replaced with the last sub-period split by the change points. The bold values represent the best performance among the 20 models in terms of MAE, RMSE, ROOS2 and IC

The results of the Brent oil prediction not only illustrate the robustness of the CP-ADARNN model, they also demonstrate the economic implications of the temporal covariate shift assumption. By controlling for the predictors, the distribution shifting in macroeconomic variables remains the same for the baseline results in Sect. 4. As expected, even when replacing the target crude oil, the economic rule of oil price formation is much more stable despite the variations in market conditions over time.

Forecasting crude oil prices during COVID-19

In the baseline empirical studies, we exogenously split the whole sample into 80% for training and 20% for testing. Under TCS, it is assumed that the unconditional distributions of predictors differ for the testing set and all the sub-periods of the training set. The COVID-19 pandemic has had a significant impact on the price of oil by affecting importing and exporting (Khalfaoui et al. 2022), risk exposure (Akhtaruzzaman et al. 2021), and other factors. Accordingly, we observe a bounce in oil prices during the COVID-19 pandemic. Hence, the training set for this robustness test is extended to December 2019. We choose January 2020 as the starting point of the testing set because the first officially confirmed case was reported in the end of 2019 and the Chinese government imposed a lock-down on Wuhan in January 2020.

Table 4.

Tests for common breaks in predictors of the pre-COVID sample (1993/03—2019/12)

Periods Location of breaks Relevant events
Mar 1993–Dec 2019 Oct 2007 (0.00) Early phase of financial crisis
Mar 1993–Oct 2007 Apr 2000 (0.00) Beginning of oil price rising
Nov 2007–Dec 2019 Jun 2017 (0.00) Rising of geopolitical conflicts

Note:This table presents the detected change points in a pool of predictors. The critical values of each test are calculated following the bootstrap procedures provided by Horváth et al. (2021c). The corresponding p-values are shown in parentheses

We re-detect the change points in variables with a potential impact on oil prices as the additional observations included. The first two breaks are exactly the same as those shown in Sect. 4.3. The profound impact of the global financial crisis in 2008 results in a considerable structure break in oil price fundamentals from its early phase. The third break located in Jun 2017 marks the rise of geopolitical conflicts, such as the economic conflict between China and the United States. We present the results of the forecast evaluation in Table 5. Regardless of the WTI or Brent crude oil price, the CP-ADARNN model generally outperforms the other models on one-step-ahead return forecasts.

Table 5.

Prediction accuracy of oil price forecasting models during the COVID-19 pandemic

Models Full-training CP- version
MAE RMSE ROOS2 IC MAE RMSE ROOS2 IC
Panel A: WTI spot crude oil price
RW 0.154 0.234 0.154 0.234
RWd 0.153 0.281 -0.432 0.277 0.159 0.230 0.035 0.199
ARIMA 0.136 0.229 0.047 0.231 0.158 0.236 -0.010 0.059
LASSO 0.129 0.209 0.208 0.583 0.131 0.201 0.267 0.515
ENet 0.120 0.201 0.265 0.574 0.138 0.199 0.277 0.759
Ridge 0.115 0.190 0.344 0.610 0.142 0.213 0.173 0.491
RF 0.131 0.212 0.185 0.454 0.155 0.232 0.020 0.305
LSTM 0.148 0.222 0.105 0.523 0.179 0.249 -0.128 0.011
GRU 0.108 0.149 0.597 0.821 0.153 0.209 0.207 0.500
ADARNN 0.109 0.166 0.499 0.713 0.094 0.153 0.573 0.765
Panel B: Brent spot crude oil price
RW 0.152 0.227 0.152 0.227
RWd 0.145 0.261 -0.321 0.335 0.157 0.226 0.010 0.161
ARIMA 0.132 0.211 0.141 0.398 0.153 0.231 -0.029 0.025
LASSO 0.105 0.169 0.444 0.683 0.130 0.189 0.305 0.642
ENet 0.107 0.176 0.402 0.658 0.130 0.193 0.278 0.548
Ridge 0.105 0.168 0.452 0.708 0.140 0.211 0.140 0.392
RF 0.114 0.182 0.356 0.666 0.150 0.224 0.025 0.344
LSTM 0.146 0.228 -0.006 0.519 0.156 0.197 0.249 0.675
GRU 0.136 0.221 0.051 0.435 0.157 0.226 0.015 0.194
ADARNN 0.155 0.223 0.033 0.513 0.117 0.160 0.502 0.713

Note:Other than the proposed CP-ADARNN framework, models with “CP-” mean that their training samples are replaced with the last sub-period split by the change points. The bold values represent the best performance among the 20 models in terms of MAE, RMSE, ROOS2 and IC

The exogenous shock of the COVID-19 outbreak has taken a toll on the financial markets and on macroeconomic conditions. As a result, such a shock provides a powerful structure break of oil predictors. As shown in Table 5, for regression models like LASSO, ENet, and Ridge to provide accurate forecasts, a necessary assumption is that the distribution of regressors remain constant between the training and testing sets. Hence, from the full-training to the CP- version of the regression models, the differences of predictor distribution are much more significant, resulting in a worse performance. However, the CP-ADARNN framework adaptively learns the distribution shifting and allows the constant conditional distribution of oil returns given predictors, allowing for more precise forecasts on oil real returns.

Conclusion

This paper proposes a deep learning framework to train an adaptive prediction model for time series forecasting in the crude oil markets. CP-ADARNN consists of two modules: change point detection to characterize the distributional information in TS and temporal distribution matching to build a generalized RNN model. We apply the framework to the WTI spot oil price market from March 1993 to December 2021 and utilize the first 80% of the data as the training set. Our forecasting results on the testing sample show that the proposed CP-ADARNN model outperforms benchmarks including conventional time series models (i.e., RW, RWd, and ARIMA), regularization models (i.e., LASSO, ENet, and Ridge regressions), regression tree model like RF, and neural network models (i.e., GRU, LSTM, ADARNN) in terms of MAE, RMSE, ROOS2, and IC. We also confirm the prediction accuracy of CP-ADARNN on the Brent oil price. In addition, for the time period inclusive of COVID-19, our model exhibits better performance when accounting for the distribution shifting of factors after the outbreak of the pandemic.

Our finding that distribution shifting should be addressed in time series analysis has important implications for investors, scholars, and other stakeholders. In the future, this research can be extended to include the various definitions of distributional change points. The CP-ADARNN framework provides a novel view of an important modeling problem for financial time series, that is, the consistent assumption on the process. This study can be used for both academic research and market participants for oil price forecasts, and it can be easily extended to other macroeconomic forecasting fields.

Funding

Yifan Zhang was supported by the Outstanding Innovative Talents Cultivation Funded Programs 2021 of Renmin University of China.

Footnotes

1

We refer to Horváth et al. (2021c) for the assumptions 2.1–2.8.

2

For the purpose of saving space, we leave the details of proof, see Horváth et al. (2021c).

3

More specifically, the FRED-MD series are processed following McCracken (2021) in which the author provides corresponding Tcodes, and the other predictors are processed based on the nature of data. We provide the Tcodes for the entire pool of predictors in online materials.

4

For example, the RF model is trained with observations indexed with t=1,,T1 and forecasts on t=T1+1,,T. Suppose the last change point is detected as t=Tc, then the CP-RF model refers to the RF model trained with t=Tc+1,,T1 and forecasts on t=T1+1,,T.

5

The correlation between WTI and Brent spot crude oil real prices is 0.9990 from March 1993 to December 2010.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history

10/18/2022

A Correction to this paper has been published: 10.1007/s10479-022-05022-6

Contributor Information

Zhenya Liu, Email: zhenya.liu@ruc.edu.cn.

Yifan Zhang, Email: zhangyifan@ruc.edu.cn.

References

  1. Akhtaruzzaman M, Boubaker S, Chiah M, Zhong A. Covid- 19 and oil price risk exposure. Finance Research Letters. 2021;42:101882. doi: 10.1016/j.frl.2020.101882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ameur, H.B., Ftiti, Z., and Louhichi, W., (2021). Revisiting the relationship between spot and futures markets: Evidence from commodity markets and nardl framework. Annals of Operations Research: 1–19 . [DOI] [PMC free article] [PubMed]
  3. Bai J. Common breaks in means and variances for panel data. Journal of Econometrics. 2010;157(1):78–92. doi: 10.1016/j.jeconom.2009.10.020. [DOI] [Google Scholar]
  4. Baker SR, Bloom N, Davis SJ. Measuring economic policy uncertainty. The Quarterly Journal of Economics. 2016;131(4):1593–1636. doi: 10.1093/qje/qjw024. [DOI] [Google Scholar]
  5. Barigozzi M, Cho H, Fryzlewicz P. Simultaneous multiple change-point and factor analysis for high-dimensional time series. Journal of Econometrics. 2018;206(1):187–225. doi: 10.1016/j.jeconom.2018.05.003. [DOI] [Google Scholar]
  6. Baumeister C, Kilian L. Forty years of oil price fluctuations: Why the price of oil may still surprise us. Journal of Economic Perspectives. 2016;30(1):139–60. doi: 10.1257/jep.30.1.139. [DOI] [Google Scholar]
  7. Boubaker, S., Liu, Z., and Zhan, Y., (2021). Risk management for crude oil futures: An optimal stopping-timing approach. Annals of Operations Research: 1–19 .
  8. Caldara, D. and Iacoviello, M., (2021) Measuring geopolitical risk. Board of Governors of the Federal Reserve .
  9. Chai J, Xing LM, Zhou XY, Zhang ZG, Li JX. Forecasting the wti crude oil price by a hybrid-refined method. Energy Economics. 2018;71:114–127. doi: 10.1016/j.eneco.2018.02.004. [DOI] [Google Scholar]
  10. Chang KL, Yu ST. Does crude oil price play an important role in explaining stock return behavior? Energy Economics. 2013;39:159–168. doi: 10.1016/j.eneco.2013.05.008. [DOI] [Google Scholar]
  11. Conrad C, Loch K, Rittler D. On the macroeconomic determinants of long-term volatilities and correlations in us stock and crude oil markets. Journal of Empirical Finance. 2014;29:26–40. doi: 10.1016/j.jempfin.2014.03.009. [DOI] [Google Scholar]
  12. Coppola, A. (2018). Forecasting oil price movements: Exploiting the information in the futures market. Journal of Futures Markets,28(1), 34–56.
  13. Csörgö, M. and Horváth, L., (1997). Limit theorems in change-point analysis.
  14. Dbouk W, Jamali I. Predicting daily oil prices: Linear and non-linear models. Research in International Business and Finance. 2018;46:149–165. doi: 10.1016/j.ribaf.2018.01.003. [DOI] [Google Scholar]
  15. Du, Y., Wang, J., Feng, W., Pan, S., Qin, T., Xu, R., and Wang, C., (2021). Adarnn: Adaptive learning and forecasting of time series. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pp. 402–411.
  16. Ewald CO, Zhang A, Zong Z. On the calibration of the Schwartz two-factor model to wti crude oil options and the extended Kalman filter. Annals of Operations Research. 2019;282(1):119–130. doi: 10.1007/s10479-018-2770-x. [DOI] [Google Scholar]
  17. Geyer-Klingeberg J, Rathgeber AW. Determinants of the wti-brent price spread revisited. Journal of Futures Markets. 2021;41(5):736–757. doi: 10.1002/fut.22184. [DOI] [Google Scholar]
  18. Guerard JB, Xu G, Markowitz H. A further analysis of robust regression modeling and data mining corrections testing in global stocks. Annals of Operations Research. 2021;303(1):175–195. doi: 10.1007/s10479-020-03521-y. [DOI] [Google Scholar]
  19. Gumus, M. and Kiran, M.S., (2017). Crude oil price forecasting using xgboost. In 2017 International Conference on Computer Science and Engineering (UBMK), pp. 1100–1103. IEEE.
  20. He Y, Wang S, Lai KK. Global economic activity and crude oil prices: A cointegration analysis. Energy Economics. 2010;32(4):868–876. doi: 10.1016/j.eneco.2009.12.005. [DOI] [Google Scholar]
  21. Horváth L, Hušková M. Change-point detection in panel data. Journal of Time Series Analysis. 2012;33(4):631–648. doi: 10.1111/j.1467-9892.2012.00796.x. [DOI] [Google Scholar]
  22. Horváth, L., Li, H., and Liu, Z., (2021). How to identify the different phases of stock market bubbles statistically? Finance Research Letters: 102366.
  23. Horváth, L., Liu, Z., and Lu, S., (2021). Sequential monitoring of changes in dynamic linear models, applied to the us housing market. Econometric Theory: 1–64.
  24. Horváth, L., Liu, Z., Rice, G., and Zhao, Y., (2021). Detecting common breaks in the means of high dimensional cross-dependent panels. The Econometrics Journal .
  25. Kayalar DE, Küçüközmen CC, Selcuk-Kestel AS. The impact of crude oil prices on financial market indicators: Copula approach. Energy Economics. 2017;61:162–173. doi: 10.1016/j.eneco.2016.11.016. [DOI] [Google Scholar]
  26. Khalfaoui, R., Solarin, S.A., Al-Qadasi, A., and Ben Jabeur, S., (2022). Dynamic causality interplay from covid-19 pandemic to oil price, stock market, and economic policy uncertainty: Evidence from oil-importing and oil-exporting countries. Annals of Operations Research: 1–39. [DOI] [PMC free article] [PubMed]
  27. Kilian L, Murphy DP. The role of inventories and speculative trading in the global market for crude oil. Journal of Applied Econometrics. 2014;29(3):454–478. doi: 10.1002/jae.2322. [DOI] [Google Scholar]
  28. Kuck K, Schweikert K. A Markov regime-switching model of crude oil market integration. Journal of Commodity Markets. 2017;6:16–31. doi: 10.1016/j.jcomm.2017.03.001. [DOI] [Google Scholar]
  29. Lerner, J. and Seru, A., (2021). The use and misuse of patent data: Issues for finance and beyond. The Review of Financial Studies .
  30. Li X, Shang W, Wang S. Text-based crude oil price forecasting: A deep learning approach. International Journal of Forecasting. 2019;35(4):1548–1560. doi: 10.1016/j.ijforecast.2018.07.006. [DOI] [Google Scholar]
  31. Liu Y, Sun H, Zhang J, Taghizadeh-Hesary F. Detection of volatility regime-switching for crude oil price modeling and forecasting. Resources Policy. 2020;69:101669. doi: 10.1016/j.resourpol.2020.101669. [DOI] [Google Scholar]
  32. McCracken, M. (2021). Fred-md and fred-qd: monthly and quarterly databases for macroeconomic research.
  33. Miao H, Ramchander S, Wang T, Yang D. Influential factors in crude oil price forecasting. Energy Economics. 2017;68:77–88. doi: 10.1016/j.eneco.2017.09.010. [DOI] [Google Scholar]
  34. Mohammadi H, Su L. International evidence on crude oil price dynamics: Applications of Arima-Garch models. Energy Economics. 2010;32(5):1001–1008. doi: 10.1016/j.eneco.2010.04.009. [DOI] [Google Scholar]
  35. Morales-Arias L, Moura GV. Adaptive forecasting of exchange rates with panel data. International Journal of Forecasting. 2013;29(3):493–509. doi: 10.1016/j.ijforecast.2012.10.007. [DOI] [Google Scholar]
  36. Naser H. Estimating and forecasting the real prices of crude oil: A data rich model using a dynamic model averaging (dma) approach. Energy Economics. 2016;56:75–87. doi: 10.1016/j.eneco.2016.02.017. [DOI] [Google Scholar]
  37. Tang, M., & Zhang, J (2012). A multiple adaptive wavelet recurrent neural network model to analyze crude oil prices. Journal of Economics and Business,64(4), 275–286.
  38. Tang L, Wang S, He K, Wang S. A novel mode-characteristic-based decomposition ensemble model for nuclear energy consumption forecasting. Annals of Operations Research. 2015;234(1):111–132. doi: 10.1007/s10479-014-1595-5. [DOI] [Google Scholar]
  39. Xie, W., Yu, L., Xu, S., and Wang, S., (2006) A new method for crude oil price forecasting based on support vector machines. In International Conference on Computational Science, pp. 444–451. Springer.
  40. Ye M, Zyren J, Shore J. Forecasting short-run crude oil price using high-and low-inventory variables. Energy Policy. 2006;34(17):2736–2743. doi: 10.1016/j.enpol.2005.03.017. [DOI] [Google Scholar]
  41. Yu L, Zhao Y, Tang L. A compressed sensing based ai learning paradigm for crude oil price forecasting. Energy Economics. 2014;46:236–245. doi: 10.1016/j.eneco.2014.09.019. [DOI] [Google Scholar]
  42. Zagaglia P. Macroeconomic factors and oil futures prices: A data-rich model. Energy Economics. 2010;32(2):409–417. doi: 10.1016/j.eneco.2009.11.003. [DOI] [Google Scholar]
  43. Zhang YJ, Zhang L. Interpreting the crude oil price movements: Evidence from the Markov regime switching model. Applied Energy. 2015;143:96–109. doi: 10.1016/j.apenergy.2015.01.005. [DOI] [Google Scholar]
  44. Zhao Y, Li J, Yu L. A deep learning ensemble approach for crude oil price forecasting. Energy Economics. 2017;66:9–16. doi: 10.1016/j.eneco.2017.05.023. [DOI] [Google Scholar]
  45. Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2005;67(2):301–320. doi: 10.1111/j.1467-9868.2005.00503.x. [DOI] [Google Scholar]

Articles from Annals of Operations Research are provided here courtesy of Nature Publishing Group

RESOURCES