A novel hybrid walk-forward ensemble optimization for time series cryptocurrency prediction

David Opeoluwa Oyewola; Emmanuel Gbenga Dada; Juliana Ngozi Ndunagu

doi:10.1016/j.heliyon.2022.e11862

. 2022 Nov 23;8(11):e11862. doi: 10.1016/j.heliyon.2022.e11862

A novel hybrid walk-forward ensemble optimization for time series cryptocurrency prediction

David Opeoluwa Oyewola ^a,^∗, Emmanuel Gbenga Dada ^b, Juliana Ngozi Ndunagu ^c

PMCID: PMC9706710 PMID: 36458312

Abstract

Cryptocurrency is an advanced digital currency that is secured by encryption, making it nearly impossible to forge or duplicate. Many cryptocurrencies are blockchain-based with decentralized networks. The prediction of cryptocurrency prices is a very difficult task because of the absence of an appropriate analytical basis to substantiate their claims. Cryptocurrencies are also dependent on several variables, such as technical advancement, internal competition, market pressure, economic concerns, security, and political considerations. This paper proposed the hybrid walk-forward ensemble optimization technique and applied it to predict the daily prices of fifteen cryptocurrencies, such as Cardano (ADA-USD), Bitcoin (BTC-USD), Dogecoin (DOGE-USD), Ethereum Classic (ETC-USD), Chainlink (LINK-USD), Litecoin (LTC-USD), NEO (NEO-USD), Tron (TRX-USD), Tether (USDT-USD), NEM (XEM-USD), Stellar (XLM-USD), Ripple (XRP-USD), and Tezos (XTZ-USD). A performance comparison of these cryptocurrencies was done using classical statistical models, machine learning algorithms, and deep learning algorithms on different cryptocurrency time series. Simulation results show that our proposed model performed better in terms of cryptocurrency prediction accuracy compared to the classical statistical model and machine and deep learning algorithms used in this paper.

Keywords: Cryptocurrency, BlockChain, Gated recurrent unit, Walk-forward, Optimization

Cryptocurrency; BlockChain, Gated recurrent unit; Walk-forward; Optimization.

1. Introduction

Machine learning (ML) and deep learning (DL) have their foundations in artificial intelligence (AI). ML is a branch of artificial intelligence, while DL is a subclass of ML. Deep learning is critical to the growth and development of AI in several ways. Using ML and DL techniques for daily data prediction yields better outcomes and aids in the understanding of some unnoticed characteristics of the dataset. The field of cryptocurrency has greatly developed to the point that it is estimated to be worth a billion dollars. Comprehending such a massive digital currency can be tough, and estimating the change in trend is critical since a change in trend might result in gains or losses for any cryptocurrency. Moreover, the number of cryptocurrencies has increased over the last few years as additional currencies have been introduced. The emergence of digital currency has signaled growth in the global market. However, due to the lack of understanding of the nitty-gritty of such currencies, it is difficult to predict changes in price; this is where deep learning can assist [1].

Machine learning and deep learning are not only used to predict the reversal of the trend of cryptocurrency prices on a daily, weekly, and monthly basis, depending on the available datasets [2]. Various digital currencies have been analyzed using time series. Time-series prediction has also been used to obtain the currency's daily report. The values of these currencies vary depending on the acceptance of each cryptocurrency. Moreover, the valuation of these currencies is subject to change from time to time [3]. The emergence of digital currencies in 2008, as well as the quick rise in Bitcoin values in 2017, sparked widespread condemnation in global financial and economic circles. Investors in digital currencies enjoyed tremendous profits during this period [4].

Nakamoto [2] opines that Bitcoin is the most popular cryptocurrency. Bitcoin was invented by an anonymous individual or group of individuals using the nickname, whose network of connections was inaugurated in 2009. Bitcoin is a newcomer to the currency markets, though it is officially listed as a means of exchange instead of a currency, and its price behaviour is still unclear. This presents new opportunities for investigators and financial experts to identify commonalities and contrasts with traditional financial currencies. This is when we particularly consider its very different nature in comparison to more conventional currencies or monetary systems. According to [5], which is one of the most famous sites delivering almost real-time statistics on the many cryptocurrencies listed on global exchanges. Bitcoin's market capitalization is predicted to be approximately $201 trillion in April 2020. About 4,000 cryptocurrencies have been in operation since January 2021 [6]. According to market capitalization, Bitcoin, Ethereum, XRP, Tether, and Bitcoin Cash are among the top five cryptocurrencies [7].

The effectiveness of ML approaches for stock market prediction has been studied in [8, 9, 10, 11, 12, 13, 14]. Findings from these studies indicate that these techniques might become useful for predicting cryptocurrency prices as well. Nevertheless, the use of machine learning algorithms in digital currency has so far been confined to the evaluation of Bitcoin prices, which has been done using random forests [15], Bayesian neural networks [16], long-short-term memory neural networks [17], data mining, and neural networks [8, 18]. These researchers were able to predict, to varying degrees, the price variations of Bitcoin and found that neural network-based algorithms produced the best results. Deep reinforcement learning has proved effective in predicting the prices of 12 cryptocurrencies [19, 20].

The digital currency market has grown into an international trend. It is especially renowned for its unpredictability and heterogeneity, garnering the curiosity of both new and experienced investors [21]. Forecasting financial time series is difficult because these series are characterized by temporary, hetero-multicollinearity problems, interruptions, aberrations, and rising multi-polynomial elements, making market movement prediction extremely difficult [22]. The complex properties of financial time series, as well as the massive amounts of data that must be analyzed in order to correctly predict financial time series, have prompted the development of more advanced methods, algorithms, and models. Recently, ML and data mining techniques, which are frequently used in financial market forecasting, have produced better results compared to simple technical or fundamental research methodologies. Machine learning approaches are capable of identifying patterns and predicting market opportunities [23].

The major contribution of this research is the application of a hybrid walk forward ensemble optimization technique for cryptocurrency prediction. The proposed ensemble technique uses advanced machine learning models as component learners, which are centred on combinations of autoregressive integrated moving average (ARIMA), holt winter's exponential smoothing (HWES), decision tree (BAG), stochastic gradient boosting (SGB), random forest (RF), long short term memory (LSTM), gated recurrent unit (GRU), and recurrent neural network (RNN). A wide-ranging simulation analysis was carried out to evaluate the performance of the proposed model. Furthermore, the effectiveness of the predictions of each forecasting model is evaluated using mean, standard deviation, minimum, and maximum, which represents an important test of reliability for each of the models. The contributions of this work are outlined below:

1.
This paper proposed the hybrid walk forward ensemble optimization technique for cryptocurrency prediction and analysis of Cardano (ADA-USD), BitcoinCash (BCH-USD), Dogecoin (DOGE-USD), Ethereum Classic (ETC-USD), Chainlink (LINK-USD), Litecoin (LTC-USD), NEO (NEO-USD), Tron (TRX-USD), Tether (USDT-USD), NEM (XEM-USD), Stellar (XLM-USD), and Ripple (XRP-USD).
2.
The performance of the cryptocurrencies mentioned in (1) was compared using a classical statistical model, machine learning, and deep learning on cryptocurrency time series.
3.
A hybrid walk-forward ensemble optimization algorithm that can accurately predict cryptocurrencies to generate a high and considerable financial reward for investors was presented.
4.
The effectiveness of the proposed model was analyzed using different performance metrics.

2. Related works

Several studies have been done in the area of predicting time series for cryptocurrencies. For example, Mudassir et al. [24] presented a time-series machine learning system for forecasting Bitcoin prices. The system uses regression models based on the learning process to forecast short- and medium-term Bitcoin price movements and pricing. With classification models that score up to 65% for the next day's forecast and 62%–64% accuracy in the seventh–ninetieth–day forecast, these proposed models are very effective. The inaccuracy rate for daily price forecasts was 1.44%, but it ranges from 2.88% to 4.10% for seven to ninety days. The proposed models outperform the existing models in the literature. Kyriaziset et al. [25] applied GARCH models to estimate the unpredictability of cryptocurrencies during bearish markets. The authors investigated the volatility of specific cryptocurrencies and their influence on three of the most popular digital currencies, namely Bitcoin, Ethereum, and Ripple. The effect of the decreases in these three cryptocurrencies, as well as that of the DCC-GARCH on the returns of other virtual currencies, was considered using the ARCH and GARCH models. The data used for the study was between January 1 and September 16, 2018. The findings show that the major digital currencies are also adversely affected in difficult times.

Gatabazi, Mba, and Pindza [26] used the fractional lotka-volterra model (FGLVM) to model the transaction counts of Bitcoin, Ripple, and Bitcoin. Findings show that the proposed system is both chaotic and dynamic. Moreover, despite the disorder shown by exposure to lyapunov, the three-dimensional lotka-volterra system showed parabolic patterns. The performance of the proposed model was good. Zbikowski in [27] applied support vector machines (SVM) with box theory and volume weighting to predict price direction in the Bitcoin market. The intention was to generate trading strategies utilizing some set of technical indicators computed from Bitcoin's historic data as input. A simple B&H strategy was employed as a base learner, which yielded an ROI of 4.86% and was exceeded by the BOX-SVM, which produced an ROI of 10.6%, while the VW-SVM generated an ROI of 33.5%. The simulation results showed that the proposed system performed better than the other models compared in the study.

A similar effort was made by Mallqui and Fernandes [28] to predict the daily Bitcoin price direction. Aside from the OHLC values and volume, the researchers conducted some tests by inserting other blockchain indicators and a few other "external" indicators. Many feature extraction strategies were employed, with the OHLC values and volume always being the most important attributes. The authors tested different ensemble and individual learning models. However, the SVM and an ensemble of recurrent neural networks and decision tree classifiers produced the best performance.

Akyildirim et al. [29] applied SVM, linear regression (LR), RF, and artificial neural network (ANN), as well as historical prices and technical indicators, to predict some of the most popular cryptocurrencies using data with a sampling frequency ranging from daily to minute. The goal is to forecast the price direction of the next time step in binary form. The proposed system has an accuracy of less than 55%. It was observed that the use of ANNs did not result in any substantial improvement in prediction accuracy, even though the size of the dataset used was small.

Oyewola et al. [30] presented a nature-inspired method called the auditory algorithm (AA) for stock market prediction. The technique mimics the auditory system in a way similar to that of the human ear. The performance of AA was evaluated using machine learning and continuous-time stochastic process techniques. The continuous-time models used include stochastic differential equations (SDE) and geometric brownian motion (GBM). The findings demonstrated that the overall performance of AA is better than that of other models studied since it dramatically decreased forecast error to the smallest possible level.

Liu et al. [41] proposed deep reinforcement learning and proximal policy optimization (PPO) models for automatic Bitcoin trading. It draws a comparison among high-performing machine learning-based models for static price predictions such as SVM, multi-layer perceptron (MLP), LSTM, temporal convolutional network (TCN), and transformer. Simulation results indicated that LSTM performs better than all the other ML models compared in the work. The authors created an autonomous trading scheme using PPO and LSTM based on the policy. The superiority of the proposed model over other customary trading approaches was validated by experimental results. The technique can trade Bitcoin in a virtual environment with symmetric data and achieve a 31.67 percent higher yield than the optimum benchmark, outperforming it by 12.75 percent. The proposed model can generate higher returns during both periods of price fluctuations and sharp rises, which paves the way for research into developing a single deep learning-based cryptocurrency trading tactic. Envisioning the trading process shows how the model manages and controls increased transactions, providing stimulus and demonstrating that it can be extended to other credit derivatives. Livieris et al. [42] proposed ensemble learning models for cryptocurrency forecasting using hourly prices. In the proposed model, deep learning was combined with ensemble-averaging, bagging, and stacking. The authors combined ensemble models with deep learning models such as LSTM, bi-directional LSTM, and convolutional layers. The ensemble models' performances were evaluated, and experimental analysis showed that ensemble learning and deep learning can be mutually important for creating powerful, steady, and dependable forecasting models. The summary of related works is presented in Table 1.

Table 1.

Summary of related work on cryptocurrency.

Reference	Dataset	Method	Results
Mudassir et al. [24]	Bitcoin	SVM, ANN, SANN, LSTM	SVM resulted in MAPE of 30%.
			ANN in MAPE of 22%.
			SANN in MAPE 17%
			LSTM in MAPE 41%
Gatabazi, Mba, and Pindza [26]	Bitcoin and Ripple	Fractional Lotka-Volterra model (FGLVM)	MAPE for the 2-dimensional (MAPE = 16).
			MAPE for 2-dimensional FGLVM (MAPE = 25)
			2-dimensional GLVM (MAPE = 22)
Zbikowski in [27]	Bitcoin	SVM with Box Theory and Volume Weighted	BOX-SVM = 10.6%
Zbikowski in [27]	Bitcoin	SVM with Box Theory and Volume Weighted	VW-SVM = 33.5%
Akyildirim et al. [29]		SVM, LR, RF and ANN	ANN = 55%
			LR = 55%
			SVM = 56%
			RF = 59%
Liu et al. [41]	Bitcoin	Deep reinforcement learning and proximal policy optimization, SVM, Multi-layer Perceptron, LSTM, Temporal Convolutional Network	SVM = 0.0084
			MLP = 0.0251
			LSTM = 0.00015
			TCN = 0.01327
			Transformer = 0.0044

Open in a new tab

Below is a list of abbreviations/notations and their meanings in Table 2.

Table 2.

Abbreviations/notations and meanings.

Abbreviations/Notation	Meaning
ARIMA	Autoregressive integrated moving average
BAG	Decision Bagging tree
DL	Deep learning
ML	Machine learning
SARIMA	Seasonal autoregressive integrated moving-average
GRU	Gated recurrent unit
HWES	Holt winter's exponential smoothing
LSTM	Long short term memory
MAE	Mean absolute error
MSE	Mean square error
RF	Random forest
RNN	Recurrent neural network
RMSE	Root mean square error
SGB	Stochastic gradient boosting
a	Autoregressive order
i	Differencing order
v	Moving average order
P	Seasonal autoregressive order
D	Seasonal difference order
Q m	Seasonal moving average order Number of time-steps of a seasonal period
n	Trading days
r_t	Regressed at time t
γ	Coefficients
θ	Weighted moving average
l_t	Level at time t,
b_t	Trend at time t,
s_t	Seasonal component at time t
x_i	Input variable
k	Number of K-trees
e_t	Memory at time t,
$\bar{e}$	New memory at time t
o_t	Output gate at time t
f_t	Forget gate at time t
h_t	Activation function
u_t	Update gate
c	Active reset gate

Open in a new tab

3. Methodology

This section discusses the dataset used for our study and the different techniques used for the predictions of cryptocurrencies under study.

3.1. Description of dataset

The paper explores hybrid walk-forward optimization of cryptocurrencies using classical statistical, machine learning, and deep learning models. The cryptocurrencies used in the analysis are Cardano (ADA-USD), BitcoinCash (BCH-USD), BinanceCoin (BNB-USD), Bitcoin (BTC-USD), Dogecoin (DOGE-USD), Ethereum Classic (ETC-USD), Chainlink (LINK-USD), Litecoin (LTC-USD), NEO (NEO-USD), Tron (TRX-USD), Tether (USDT-USD), NEM (XEM-USD), Stellar (XLM-USD), Ripple (XRP-USD) and Tezos (XTZ-USD). The data on cryptocurrencies was collected from yahoofinance.com [31], from January 01, 2018 to June 30, 2021, daily. The cryptocurrency data accounts for 1277 entries for each of the currencies, with a total of 19,155 observations for all cryptocurrencies.

3.2. Classical statistical model, machine and deep learning techniques

3.2.1. Auto regressive integrated moving average (ARIMA)

An autoregressive integrated moving average (ARIMA) is a statistical analysis model that forecasts future trends based on historical data [32]. ARIMA smoothes time series data using lagged moving averages and is composed of three components: autoregressive (AR), integrated (I), and moving average (MA). Autoregressive (AR) models depict a dynamic variable that regresses on its own lags or previous values, whereas integrated (I) models depict the difference between raw observations to allow the time series to become stable. The moving average (MA) takes into account the relationship between an observation and the residual error from a moving average model applied to delayed observations. ARIMA requires three hyper-parameters for the trend, which are (a = autoregressive order) (i = differencing order), and (v = moving average order). ARIMA models can be represented mathematically as depicted in Eq. (1):

Equation 1.

(1)

Where r_t is regressed at time t, γ are the coefficients, a is the autoregressive order, v is the moving average order, θ is the weighted moving average, and e_it is the error at time t.

3.2.2. Seasonal autoregressive integrated moving-average (SARIMA)

The seasonal autoregressive integrated moving average (SARIMA) is an extension of the autoregressive integrated moving average (ARIMA) that specifically accepts single-time series data with a seasonal component [33]. In the seasonal component of the series, SARIMA adds three new hyper-parameters for auto-regression (AR), differencing (I), and a moving average (MA), and an additional seasonal parameter such as (P = seasonal autoregressive order) (D = seasonal difference order) (Q = seasonal moving average order), and m is the number of time-steps of a seasonal period, respectively. The SARIMA mathematical equation is represented in (2):

Equation 2.

(2)

Where α is the Box-cox power transformation, s is the number of seasons per year, B^s is the backward shift operator, D is the times to produce a series, $φ (B^{s})$ is the seasonal autoregressive (AR) of order P, $ϑ (B^{s})$ is the seasonal moving average (MA) of order Q, d is the order of the non-seasonal differencing parameter, a_t is the identically independently distributed (IID) with a mean of zero and variance of σ_a².

3.2.3. Holt winter's exponential smoothing (HWES)

Holt winter exponential smoothing (HWES) is employed in [34] for predicting time series data that shows both trends and variations in seasons. HWES models are also known as "triple exponential smoothing technique" models because they take trends and seasonality into account as an exponentially weighted linear function of data from previous phases. The mathematical equations are as shown in (3), (4), 5 and (6):

Equation 3.

(3)

Equation 4.

(4)

Equation 5.

(5)

Equation 6.

(6)

Where l_t is the level at time t, b_t is the trend at time t, s_t is the seasonal component at time t with corresponding smoothing parameters α,β, and γ, m is the daily frequency of the seasonality.

3.2.4. Decision tree (BAG)

A decision bagging tree (BAG) is a statistical model for covariate-based outcome prediction. The model suggests a prediction rule that defines unwanted sub-sets of data, for example, population sub-sets that are hierarchically constructed by a series of binary data divisions. A tree can be used to represent the hierarchical binary partition set. In each subgroup, the projected result is determined by the average of the individual results within the subset. The goal is to create a prediction rule that minimizes loss functions and also quantifies the difference between predicted and actual values [35].

3.2.5. Stochastic gradient boosting (SGB)

Stochastic gradient boosting (SGB) is an ensemble learning method that combines boosting with decision-making, such as a decision tree, and predicts by weighing together all the trees. The SGB is created along the direction of gradient descent from the previous tree loss function. SGB's main objective is to minimize this loss function between the regression function and the actual function by training the regression function [36]. SGB mathematical equations are shown in (7(7) and (8)(8):

Equation 7.

(7)

Equation 8.

(8)

Where x_i is the input variable, k is the number of K-trees each with the terminal nodes at iteration m, andR is the regression function.

3.2.6. Random forest (RF)

Random forest (RF) is an example of an ensemble machine learning technique. RF builds several distinct decision trees during training. Predictions from the entire tree are combined to attain the ultimate prediction. RF works by randomly picking features that increase prediction accuracy and result in better efficiency. The RF does not only retain the advantages of the trees, but it generally produces better results than a decision tree [37]. For high-dimensional data modeling, the RF can effectively manage missing values and handle continuous, categorical, and binary data. The mathematical equation for RF is given in Eq. (9):

Equation 9.

(9)

Where x_p is the feature vector of input values, p is the dimension property of the available vector for the base learners, b_i is the base learners at iteration i.

3.2.7. Long short term memory (LSTM)

The LSTM is a type of recurrent neural network (RNN) that has the ability to manage long-term dependencies. This enhances the ability of the LSTM to learn from experience. The effectiveness of LSTM becomes more pronounced when there are very lengthy and unspecified delays between data [38]. An LSTM network is comprised of three gates, which are the input gate, the output gate, and the forget gate. These gates help the network arbitrarily retain a value for a lengthy period. One of the key benefits of LSTM networks is their ability to solve the vanishing gradient problem, which makes network training problematic for lengthy strings of words or integers. Gradients are utilized to update RNN parameters and to represent long word or integer sequences; however, as the gradients get reduced, network training becomes practically impossible. This drawback is solved by LSTM networks, which also enable the detection of long-term connections between words or numbers in sequences with great spatial separation. The mathematical equations are expressed in (10), (11), (12), 13) and (14):

Equation 10.

(10)

Equation 11.

(11)

Equation 12.

(12)

Equation 13.

(13)

Equation 14.

(14)

Where e_t is the memory at time t, $\bar{e}$ is the new memory at time t, o_t output gate at time t, f_t is the forget gate at time t and h_t is the activation function.

3.2.8. Gated recurrent unit (GRU)

The gated recurrent unit (GRU) is a significantly simpler variant of the LSTM. The forget and input gates merge into one called the update gate and include an extra gate termed the reset gate. The final model is simpler and has become more popular than the basic LSTM versions. However, a gated recurrent unit such as the LSTM modulates data inside the unit without a distinct memory cell [39]. The GRU activation function at t is a linear interpolation between the prior activation function and the activation function of the candidate. The mathematical equations for GRU are presented in (15), (16), (17), and (18):

Equation 15.

(15)

Equation 16.

(16)

Equation 17.

(17)

Equation 18.

(18)

Where u_t is the update gate, c is the active reset gate, h_t is the activation function and $\bar{h_{t}}$ is the candidate activation function.

3.2.9. Recurrent neural network (RNN)

The RNN is an artificial neural network that employs sequential data or time series data. RNNs use training data to learn, just like feedforward and convolutional neural networks (CNNs) do. They stand out due to their "memory," which allows them to affect the present input and output by using data from previous inputs. Unlike traditional deep neural networks, which assume that inputs and outputs are unconnected, the outputs of recurrent neural networks are dependent on the previous components in the sequence. The RNN uses a prior step that might influence the choice at the present moment. RNN has two sources of input, such as the current one and the recent past, that are combined with the determination of a reaction to a new input [40]. The mathematical equation for RNN is presented in (19):

Equation 19.

(19)

Where d_t is the hidden state, d_t−1 is the previous hidden state and x_t is the input variable.

3.2.10. Performance evaluation

The accuracy test of the fifteen selected cryptocurrencies is evaluated using:

3.2.10.1. Mean absolute error (MAE)

Consider a set of real-time closing values R_pand the predicted values $\hat{R_{p}}$ . MAE is given as shown in (20):

Equation 20.

(20)

3.2.10.2. Root mean square error (RMSE)

RMSE is given in Eq. (21):

Equation 21.

(21)

3.2.10.3. Mean square error (MSE)

MSE is given in (22)

Equation 22.

(22)

Where n is the trading days.

The description of the different cryptocurrencies used in this paper is in Table 3.

Table 3.

Description of cryptocurrency.

Name	Symbol	Mean	Standard Deviation	Min	Max
Cardano	ADA-USD	0.2606	0.4135	0.0239	2.3091
BitcoinCash	BCH-USD	492.6346	413.7250	77.3657	2895.3798
BinanceCoin	BNB-USD	57.4393	119.9112	4.5286	675.6840
Bitcoin	BTC-USD	13982.1795	13935.3095	3236.7617	63503.4570
Dogecoin	DOGE-USD	0.0279	0.0884	0.0015	0.6847
Ethereum Classic	ETC-USD	12.8142	15.2534	3.4723	134.1017
Chainlink	LINK-USD	6.7238	10.1126	0.1662	52.1986
Litecoin	LTC-USD	94.6488	63.5910	23.4643	386.4507
NEO	NEO-USD	28.4590	31.1678	5.3772	187.4049
Tron	TRX-USD	0.0345	0.0274	0.0087	0.2205
Tether	USDT-USD	1.0018	0.0060	0.9666	1.0535
NEM	XEM-USD	0.1705	0.2204	0.0310	1.8427
Stellar	XLM-USD	0.1885	0.1507	0.0334	0.8962
Ripple	XRP-USD	0.4623	0.3676	0.1396	3.3778
Tezos	XTZ-USD	2.3110	1.4170	0.3492	7.5360

Open in a new tab

Table 4 presents the results of stationarity test using augmented dickey-fuller (ADF) of cryptocurrencies.

Table 4.

Stationarity test using augmented dickey-fuller (ADF) of cryptocurrency.

Cryptocurrency	Before Differencing				After Differencing
Cryptocurrency	Test Statistics	1%	5%	10%	Test Statistics	1%	5%	10%
ADA-USD	0.1670	-3.4355	-2.8638	-2.5679	-6.7328	-3.4355	-2.8638	-2.5680
BCH-USD	-3.8040	-3.4355	-2.8638	-2.5679	-8.5869	-3.4355	-2.8638	-2.5680
BNB-USD	-2.1779	-3.4355	-2.8638	-2.5679	-4.8371	-3.4355	-2.8638	-2.5679
BTC-USD	-0.6058	-3.4355	-2.8638	-2.5679	-7.3864	-3.4355	-2.8638	-2.5680
DOGE-USD	-1.4619	-3.4355	-2.8638	-2.5679	-7.6762	-3.4355	-2.8638	-2.5679
ETC-USD	-3.1509	-3.4355	-2.8638	-2.5679	-6.4884	-3.4355	-2.8638	-2.5679
LINK-USD	-1.6450	-3.4355	-2.8638	-2.5679	-6.9834	-3.4355	-2.8638	-2.5679
LTC-USD	-2.6738	-3.4355	-2.8638	-2.5679	-10.8121	-3.4355	-2.8638	-2.5679
NEO-USD	-3.5019	-3.4355	-2.8638	-2.5679	-8.4854	-3.4355	-2.8638	-2.5679
TRX-USD	-2.7559	-3.4355	-2.8638	-2.5679	-9.2668	-3.4355	-2.8638	-2.5679
USDT-USD	-5.3742	-3.4355	-2.8638	-2.5679	-17.9564	-3.4355	-2.8638	-2.5679
XEM-USD	-4.3761	-3.4355	-2.8638	-2.5679	-6.7352	-3.4355	-2.8638	-2.5679
XLM-USD	-3.4126	-3.4355	-2.8638	-2.5679	-11.3272	-3.4355	-2.8638	-2.5679
XRP-USD	-3.3232	-3.4355	-2.8638	-2.5679	-8.0108	-3.4355	-2.8638	-2.5679
XTZ-USD	-2.6005	-3.4355	-2.8638	-2.5679	-7.9972	-3.4355	-2.8638	-2.5679

Open in a new tab

Depicted in Table 5 is the Optimum automated ARIMA fitting for classical statistical time series.

Table 5.

Optimum automated ARIMA fitting for classical statistical time series.

Cryptocurrency	P	D	Q	a	V	m
ADA-USD	2	1	0	0	2	7
BCH-USD	2	1	1	0	2	7
BNB-USD	3	1	0	1	1	7
BTC-USD	0	1	1	1	1	7
DOGE-USD	3	1	0	0	0	7
ETC-USD	2	1	0	0	0	7
LINK-USD	4	1	0	1	2	7
LTC-USD	2	1	0	0	0	7
NEO-USD	2	1	0	1	1	7
TRX-USD	1	1	0	1	0	7
USDT-USD	2	1	2	2	0	7
XEM-USD	2	1	0	1	1	7
XLM-USD	3	1	0	0	0	7
XRP-USD	3	1	0	1	0	7
XTZ-USD	0	1	0	0	0	7

Open in a new tab

Table 6 depicts the experimental results of classical linear statistical cryptocurrency time series.

Table 6.

Classical linear statistical cryptocurrency time series result.

Cryptocurrency	Classical	MSE	RMSE	MAE
ADA-USD	ARIMA	1.2471	1.1167	0.9986
	SARIMA	1.1231	1.0597	0.9476
	HWES	1.1248	1.0605	0.9484
BCH-USD	ARIMA	326341.4374	571.2630	484.6991
	SARIMA	152255.8426	390.1997	304.1406
	HWES	155190.1468	393.9418	308.8400
BNB-USD	ARIMA	98626.6265	314.0487	255.3433
	SARIMA	100358.0665	316.7934	258.0425
	HWES	100442.3864	316.9264	258.2038
BTC-USD	ARIMA	335565603.0602	18318.4497	15345.9441
	SARIMA	383177290.0518	19574.9148	16828.9033
	HWES	376262825.9250	19397.4953	19397.4953
DOGE-USD	ARIMA	0.0589	0.2427	0.1725
	SARIMA	0.0587	0.2423	0.1722
	HWES	0.0588	0.2425	0.1724
ETC-USD	ARIMA	1774.7889	42.1282	29.0576
	SARIMA	1579.4856	39.7427	26.6421
	HWES	1579.4129	39.7418	26.6408
LINK-USD	ARIMA	318.2500	17.8395	15.9089
	SARIMA	356.5472	18.8824	16.9785
	HWES	345.8754	18.5977	16.6703
LTC-USD	ARIMA	10008.0707	100.0403	80.9501
	SARIMA	8475.8677	92.0644	72.4423
	HWES	8528.0345	92.3473	72.7775
NEO-USD	ARIMA	2568.1506	50.6769	42.0725
	SARIMA	2033.2276	45.0913	36.5014
	HWES	2027.8875	45.0320	36.4276
TRX-USD	ARIMA	0.0036	0.0602	0.0472
	SARIMA	0.0033	0.0582	0.0452
	HWES	0.0033	0.0581	0.0450
USDT-USD	ARIMA	2.1531	0.0014	0.0009
	SARIMA	1.3849	0.0011	0.0006
	HWES	1.2133	0.0010	0.0005
XEM-USD	ARIMA	0.0524	0.2290	0.1821
	SARIMA	0.0339	0.1843	0.1330
	HWES	0.0318	0.1785	0.1289
XLM-USD	ARIMA	0.1145	0.3384	0.3134
	SARIMA	0.0921	0.3035	0.2780
	HWES	0.0919	0.3031	0.2777
XRP-USD	ARIMA	0.7534	0.8679	0.7110
	SARIMA	0.4637	0.6809	0.5312
	HWES	0.4582	0.6769	0.5262
XTZ-USD	ARIMA	7.1761	2.6788	2.3113
	SARIMA	6.1564	2.4812	2.1000
	HWES	6.1563	2.4812	2.0500

Open in a new tab

Presented in Table 7 is the experimental results of machine learning cryptocurrency time series.

Table 7.

Machine learning cryptocurrency time series result.

Cryptocurrency	Machine Learning	MSE	RMSE	MAE
ADA-USD	BAG	0.0119	0.1091	0.0765
	SGB	0.0104	0.1021	0.0683
	RF	0.0125	0.1118	0.0793
BCH-USD	BAG	5649.986	75.1663	44.3070
	SGB	5587.885	74.7521	42.3190
	RF	6350.812	79.6919	46.8026
BNB-USD	BAG	1475.144	38.4076	26.4980
	SGB	1313.134	36.2371	22.6948
	RF	1716.968	41.4363	28.9523
BTC-USD	BAG	4967302	2228.744	1705.363
	SGB	4888106	2210.906	1687.474
	RF	8006971	2829.659	2317.399
DOGE-USD	BAG	0.0013	0.0372	0.0213
	SGB	0.0013	0.0365	0.0198
	RF	0.0016	0.0405	0.0266
ETC-USD	BAG	51.5559	7.1802	3.2532
	SGB	49.9267	7.0658	3.1399
	RF	58.1059	7.6227	3.6339
LINK-USD	BAG	50.7834	7.1262	3.2061
	SGB	49.9084	7.0645	3.1373
	RF	57.7937	7.6022	3.6136
LTC-USD	BAG	301.6502	17.3680	11.7568
	SGB	276.487	16.6279	10.7740
	RF	307.6487	17.5399	11.7791
NEO-USD	BAG	34.9480	5.9116	3.7892
	SGB	32.9089	5.7366	3.5558
	RF	37.3880	6.1145	4.0093
TRX-USD	BAG	0.0007	0.0084	0.0052
	SGB	0.0006	0.0081	0.0048
	RF	0.0009	0.0096	0.0061
USDT-USD	BAG	0.00001	0.0013	0.0006
	SGB	0.00001	0.0013	0.0006
	RF	0.00001	0.0012	0.0005
XEM-USD	BAG	0.0012	0.0351	0.0213
	SGB	0.0012	0.0347	0.0215
	RF	0.0015	0.0393	0.0252
XLM-USD	BAG	0.0014	0.0380	0.0258
	SGB	0.0014	0.0381	0.0256
	RF	0.0019	0.0440	0.0303
XRP-USD	BAG	0.0086	0.0932	0.0573
	SGB	0.0086	0.0932	0.0563
	RF	0.0096	0.0981	0.0626
XTZ-USD	BAG	0.1706	0.4131	0.2859
	SGB	0.1454	0.3813	0.2659
	RF	0.1999	0.4471	0.3052

Open in a new tab

Depicted in Table 8 is the simulation results of deep learning cryptocurrency time series.

Table 8.

Deep learning cryptocurrency time series result.

Cryptocurrency	Deep Learning	MSE	RMSE	MAE
ADA-USD	LSTM	0.0073	0.0858	0.0616
	GRU	0.0061	0.0784	0.0544
	RNN	0.0086	0.0927	0.0708
BCH-USD	LSTU	10603.7818	102.9746	87.5668
	GRU	4097.1115	64.0086	37.1982
	RNN	9320.0343	96.5403	79.5367
BNB-USD	LSTM	919.5173	30.3235	16.4831
	GRU	919.4131	30.3218	16.3405
	RNN	1048.8674	32.3862	19.4459
BTC-USD	LSTM	4472103.4464	2114.7348	1641.8902
	GRU	3318722.8301	1821.7362	1375.9186
	RNN	4640174.4043	2154.1064	1694.4975
DOGE-USD	LSTM	0.0015	0.0398	0.0177
	GRU	0.0015	0.0398	0.0176
	RNN	0.0017	0.0421	0.0257
ETC-USD	LSTM	73.2288	8.5573	3.4730
	GRU	72.2789	8.5017	3.1744
	RNN	72.9977	8.5438	4.2903
LINK-USD	LSTM	4.9287	2.2200	1.5356
	GRU	4.87267	2.2074	1.4996
	RNN	6.9642	2.6389	1.9743
LTC-USD	LSTM	227.3473	15.0780	10.3130
	GRU	208.3591	14.4346	10.0422
	RNN	241.7402	15.5479	11.1693
NEO-USD	LSTM	34.5280	5.8760	4.3669
	GRU	23.0671	4.8028	3.0195
	RNN	92.8825	9.6375	8.2834
TRX-USD	LSTM	6.6599	0.0081	0.0060
	GRU	5.0644	0.0071	0.0043
	RNN	7.0004	0.0213	0.0196
USDT-USD	LSTM	3.4750	0.0058	0.0054
	GRU	0.00008	0.0029	0.0011
	RNN	0.0003	0.0176	0.0173
XEM-USD	LSTM	0.0023	0.0485	0.0400
	GRU	0.0012	0.0348	0.0204
	RNN	0.0021	0.0467	0.0347
XLM-USD	LSTM	0.0034	0.0586	0.0283
	GRU	0.0034	0.0584	0.0280
	RNN	0.0055	0.0683	0.0350
XRP-USD	LSTM	0.0257	0.1604	0.1454
	GRU	0.0073	0.0859	0.0501
	RNN	0.0831	0.2883	0.2697
XTZ-USD	LSTM	0.1081	0.3289	0.2207
	GRU	0.1061	0.3257	0.2180
	RNN	0.1376	0.3709	0.2576

Open in a new tab

Table 9 is the statistical results of hybrid walk-forward ensemble optimization cryptocurrency time series.

Table 9.

Hybrid walk-forward ensemble optimization cryptocurrency time series result.

Cryptocurrency	Optimization	MSE	RMSE	MAE
ADA-USD	WHWES	0.0234	0.1529	0.1211
	WGRU	0.000	0.0002	0.0001
	WSGB	0.0111	0.1055	0.0732
BCH-USD	WHWES	17234.0000	131.2783	105.3121
	WGRU	0.1037	0.3221	0.2194
	WSGB	11754.6827	108.4190	60.2092
BNB-USD	WHWES	420.6524	20.5098	15.6534
	WGRU	0.0096	0.0982	0.0605
	WSGB	358.7192	18.9398	13.7866
BTC-USD	WHWES	47231.0000	217.5339	198.3421
	WGRU	235.3799	15.3420	11.9582
	WSGB	4557040.3331	2134.7225	1725.3847
DOGE-USD	WHWES	0.0523	0.2286	0.1234
	WGRU	0.0000	0.0001	0.0000
	WSGB	0.0001	0.0128	0.0075
ETC-USD	WHWES	20.4587	4.5231	2.8654
	WGRU	0.0000	0.0093	0.0044
	WSGB	18.4354	4.2936	2.5435
LINK-USD	WHWES	5.2478	2.2908	1.9765
	WGRU	0.0000	0.0043	0.0032
	WSGB	4.9016	2.2139	1.7813
LTC-USD	WHWES	340.5643	18.4543	14.6543
	WGRU	0.0000	0.0047	0.0035
	WSGB	278.5838	16.6908	12.2733
NEO-USD	WHWES	39.8765	6.3147	3.7845
	WGRU	0.0005	0.0242	0.0157
	WSGB	39.3107	6.2698	3.7355
TRX-USD	WHWES	0.0043	0.0655	0.0422
	WGRU	0.0000	0.0000	0.0000
	WSGB	0.0000	0.0081	0.0056
USDT-USD	WHWES	0.0008	0.0282	0.0122
	WGRU	0.0000	0.0000	0.000
	WSGB	0.0000	0.0007	0.0004
XEM-USD	WHWES	0.0021	0.0458	0.0256
	WGRU	0.0000	0.0000	0.0000
	WSGB	0.0002	0.0150	0.0104
XLM-USD	WHWES	0.0078	0.0883	0.0543
	WGRU	0.0000	0.0000	0.0000
	WSGB	0.0033	0.0575	0.0351
XRP-USD	WHWES	0.0234	0.1529	0.1122
	WGRU	0.0000	0.0004	0.0002
	WSGB	0.0122	0.1105	0.0703
XTZ-USD	WHWES	0.2345	0.4842	0.2435
	WGRU	0.0000	0.0010	0.0007
	WSGB	0.1539	0.3923	0.2975

Open in a new tab

3.3. Proposed hybrid walk-forward ensemble optimization

Analysis and prediction of time series are frequently considered to be among the hardest and most demanding tasks in machine learning. This research presents a new system that is an improvement on the classical statistical model, machine learning, and deep learning algorithms. The proposed method makes use of walk-forward ensemble optimization for time-series cryptocurrency prediction. The proposed system offers solutions to the problems that characterized the original low-quality time series data, thereby generating high-quality time series data to effectively train and fit classic deep learning and machine learning models. This analysis is carried out in four stages:

•
Data visualization of cryptocurrency.
•
Dividing the dataset into training and test sets
•
The optimal model in each classical statistical model, machine learning technique, and deep learning technique is determined using performance measures such as Root Mean Square (RMSE), Mean Square Error (MSE), and Mean Absolute Error (MAE).
•
Application of walk-forward ensemble optimization on the prediction results.

The forecasting of cryptocurrency prices is a very difficult task because of the absence of an satisfactory analytical proof to substantiate their claims. Cryptocurrencies are also dependent on some variables, such as technical advancement, internal competition, market pressure, economic concerns, security, and political considerations. The suggested improvement of the walk-forward ensemble would help to overcome the major problem in cryptocurrency. The algorithm can properly forecast cryptocurrency prices to produce considerable financial benefit for investors, as explained in section 3.4.

3.4. Block diagram of proposed hybrid walk-forward ensemble optimization

Figure 1 shows the overall block diagram of the proposed hybrid walk-forward ensemble optimization. The overall framework includes four steps:

The first stage is a display of data with repeated, missing, and numerous irrelevant rows. If this information is fed into a model, it creates inaccurate forecasts. An essential issue to address is the existence of missing values in statistical survey data. Cryptocurrency data generally has missing numbers due to various causes. These include equipment failures, location monitor changes, periodic maintenance, and human mistakes.

Uncompleted data sets generally produce distortions because of variations across observational and non-observational data. It is also important that the examined data be of good quality. The heat map was taken into account for visualization. The visualization of data via a heat map is a way of expressing cryptocurrency graphically, which represents numerical data. Also, colours are represented as the value of each data point. Interpolation methods were used to replace missing values in the dataset. Box plots and distribution plots will help you choose the techniques to use. In time-series analysis, stationarity is both a key characteristic and a problematic one. However, many time series are non-stationary, suggesting significant fluctuations in mean, variance, and kurtosis. The augmented dickey-fuller (ADF) test will be used to determine the stationarity and non-stationarity of cryptocurrency datasets.

In the second stage, the dataset is divided into two sets, called training and testing sets. The training set contains 85% of the data from January 1, 2018 to December 31, 2020, with the remaining 15% for the test set from January 1, 2021 to June 30, 2021. The training is used to estimate the model parameters during the test set to validate the model and learn how the model performs on a new dataset. Three different models will be fitted to the data: a classical statistical model, a machine learning model, and a deep learning model. The first is a standard classical linear statistical model, such as the autoregressive integrated moving average (ARIMA), the Holt-Winters exponential smoothing (HWES), and the seasonal autoregressive moving average (SARIMA). Bagging (BAG), stochastic gradient boosting (SGB), and random forest (RF) are the second class of machine learning models. These are the second machine learning models, while long short-term memory (LSTM), recurrent neural networks (RNN), and gated recurrent units (GRU) are the deep learning models. In the classical linear model, the study considered auto-arima to determine the optimal a (autoregressive order), i (differencing order), v (moving average order), P (seasonal autoregressive order), D (seasonal differencing order), and Q (seasonal moving average order), where m is the number of time steps for a single seasonal period, specified as 7 for daily real-time cryptocurrency data. In machine learning, we determined the optimal hyper-parameters of each machine learning method used in this research, such as decision tree bagging, stochastic gradient boosting, and random forest, using grid search with a 5-fold cross-validation in a Python environment.

Also, in deep learning, in all the networks, the dropout function is used among layers, which is a technique to prevent network overfitting. In GRU, the optimizer consists of the learning rate, decay, and momentum, and Nesterov is set to false. Also, in both LSTM and RNN, the optimizer used was rmsprop. During training, the optimizer handles the computations required to adjust the network weight and bias variables. These computations trigger the calculation of gradients, which shows the direction in which the weights and biases must be adjusted during training to optimize the network's cost function. The third stage is the determination of the optimal model in each of the classical statistical models, machine learning techniques, and deep learning techniques using performance measures such as root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE). An ensemble technique will be performed on the best-selected model, such as classical statistical models, machine learning algorithms, and deep learning algorithms, which will help to minimize the dispersion of a predictive model and improve the average prediction performance over any given member in the ensemble. The stacking ensemble method was utilized in this study.

It is an ensemble approach that uses a meta-regression model to integrate several regression models. The basis models utilized in this study are the best classical statistical models, machine learning models, and deep learning models produced from the dataset, and the meta-model is trained on features returned (as output) by the base models. The meta-models under consideration are the same optimal classical statistical models, machine learning models, and deep learning models that are interchanged regularly. To obtain the greatest accuracy, the meta-model aids in the discovery of features in base models. The final stage is the application of walk-forward optimization to the prediction results obtained from stage three. Then, using walk-forward optimization, each training–testing set is moved forward through the time series by specific data patterns. The comprehensive numerical experiments and statistical analysis will improve the predictive performance of the model. The experiment was conducted using a miniconda installation and all the necessary libraries such as python 3.7, pandas, numpy, scipy, sklearn, seaborn, pmdarima, keras, sklearn, and statsmodels.

4. Result and discussion

Figure 2 is the heatmap visualization of the fifteen cryptocurrencies used in this study. Heatmaps are colour-coded diagrams to visualize data. In this study, heatmaps were utilized to cross-examine cryptocurrency data in a tabular format by placing variables in the rows and columns and color-coding the cells. The x-axis represents the rows of the real-time data, while the y-axis represents the columns of the real-time cryptocurrencies. The location of the missing values in Figure 2 is in rows 840 and 1020. This shows the presence of missing values in all the fifteen cryptocurrencies' real-time data. Fifteen cryptocurrencies considered in this study were split into training and test sets. The training set is from 1st January, 2018–31st December 2020, consisting of 85% of the data, while the remaining 15% is for the test set from 1st January, 2021–30th June 2021, as shown in Figures 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, and 17.

Visualization of real-time cryptocurrency.

Train (blue) and Test (red) set of Real-Time ADA-USD.

Train (blue) and Test (red) set of Real-Time BCH-USD.

Train (blue) and Test (red) set of Real-Time BNB-USD.

Train (blue) and Test (red) set of Real-Time BTC-USD.

Train (blue) and Test (red) set of Real-Time DOGE-USD.

Train (blue) and Test (red) set of Real-Time ETC-USD.

Train (blue) and Test (red) set of Real-Time LINK-USD.

Train (blue) and Test (red) set of Real-Time LTC-USD.

Train (blue) and Test (red) set of Real-Time NEO-USD.

Train (blue) and Test (red) set of Real-Time TRX-USD.

Train (blue) and Test (red) set of Real-Time USDT-USD.

Train (blue) and Test (red) set of Real-Time XEM-USD.

Train (blue) and Test (red) set of Real-Time XLM-USD.

Train (blue) and Test (red) set of Real-Time XRP-USD.

Train (blue) and Test (red) set of Real-Time XTZ-USD.

Table 3 shows the summary statistics of each of the fifteen selected cryptocurrencies from January 2018 to June 2021. The daily mean, standard deviation, minimum, and maximum are shown. The daily mean of ADA-USD, DOGE-USD, TRX-USD, USDT-USD, XEM-USD, XLM-USD, XRP-USD, and XTZ-USD is small compared to other selected cryptocurrencies. ADA-USD, DOGE-USD, TRX-USD, USDT-USD, XEM-USD, XLM-USD, XRP-USD, and XTZ-USD also have small volatility, which is within the range of 0–2 compared with other stocks, which indicates that the cryptocurrency prices fluctuate slowly and tend to be more stable. BTC-USD has a maximum price of $6353.45 due to high supply and demand.

Table 4 shows the stationarity test results before and after differencing using the Augmented Dickey-Fuller (ADF) test of all the selected cryptocurrencies. The ADF test consists of test statistics and critical values at 1%, 5%, and 10% confidence intervals. Stationarity means that the statistical properties of a cryptocurrency, such as its mean, variance, and covariance, do not change over time. Before differencing columns, the ADF test is higher than any of the critical values, which shows the presence of non-stationary in twelve, except in BCH-USD, USDT-USD, and XEM-USD. After differencing, ADF tests are applied to detrended values, and they all show the presence of stationarity.

Automated ARIMA fitting takes into account the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) values to determine optimal parameters. The lower these values, the better the model. Table 5 shows the optimal automated ARIMA fitting for all the selected classical statistical time-series, such as the autoregressive integrated moving average (ARIMA) and the seasonal autoregressive integrated moving average (SARIMA). Where P (seasonal autoregressive order), D (seasonal differencing order), Q (seasonal moving average order), and m are the frequencies of the daily cryptocurrency time series. The Automated ARIMA uses the AIC and BIC values generated by experimenting with different combinations of variables (a, i, v, P, D, Q, and m) to fit the model into the chosen cryptocurrency.

Table 6 shows the classical linear statistical cryptography time series of ARIMA, SARIMA, and HWES. The classical linear technique was used to determine the best algorithms among the three algorithms, ARIMA, SARIMA, and HWES, which can effectively predict cryptocurrency datasets, while Table 10 consists of all the parameters considered in all the techniques to obtain the best accuracy result. The overall performance of every one of the cryptocurrencies utilized in this study was reported by MASE, RAE, and MSLE. In eight out of fifteen selected cryptocurrencies, HWES performs excellently better than both SARIMA and ARIMA. This shows that Auto ARIMA is unable to select the best trends and seasonal parameters when predicting cryptocurrencies' time series. The HWES technique considers the average as well as trends and seasonality.

Table 10.

Computational time of classical statistical model, machine learning, deep learning, and hybrid walk-forward ensemble optimization.

Algorithms	Mean (sec/loop)	Standard Deviation (sec/loop)
ARIMA	2.72	0.975
SARIMA	2.72	0.975
HWES	2.49	0.428
BAG	2.35	0.374
SGB	2.57	0.475
RF	2.42	0.359
LSTM	6.00	3.03
GRU	5.34	2.67
RNN	2.25	0.234
WHWES	2.51	0.434
WGRU	5.45	3.14
WSGB	2.71	0.621

Open in a new tab

Table 7 reports the machine learning cryptocurrency time series of decision tree bagging (BAG), stochastic gradient boosting (SGB), and random forest (RF). The three machine learning techniques were utilized to determine the best algorithms among the three algorithms, such as BAG, SGB, and RF. In all the fifteen selected cryptocurrencies, SGB performs excellently, better than both BAG and RF. The deep learning results of the performance measure are also shown in Table 8. The deep learning models chosen are: long short-term memory (LSTM), gated recurrent unit (GRU), and recurrent neural network (RNN). GRU performs excellently in all fifteen selected cryptocurrencies.

Figure 18 shows the comparison of the classical statistical model, machine learning, and deep learning algorithms used in this study by summing up each selected cryptocurrency in the classical statistical model, machine learning model, and deep learning model. Deep learning has the fewest errors, followed by machine learning. Classical linear models perform woefully, and it shows they are not suitable for predicting cryptocurrency. Presented in Table 9 is the result of the Hybrid Walk-Forward Ensemble Optimization Cryptocurrency Time Series. In all the fifteen selected cryptocurrencies, WGRU performs excellently, but WHWES and WSGB performed woefully. This shows that walk-forward with the ensemble can help GRU perform excellently when predicting cryptocurrency.

Comparison of classical statistical model, machine and deep learning models.

The computational time of classical statistical models, machine learning, deep learning, and hybrid walk-forward ensemble optimization is shown in Table 10. The first column consists of all the algorithms used in this research, while the second column is the mean in sec/loop and the third column is the standard deviation. In two of the three classical statistical learning methods, such as ARIMA and SARIMA, the length of time required to perform a computational process is very high. It may be due to the automated ARIMA fitting of both the ARIMA and SARIMA models. In machine learning algorithms, the length of time it takes for SGB to perform a computational process is much higher than that of BAG and RF. Moreover, in deep learning, LSTM and GRU have much higher computational processes than RNN. The computational process of WHWES, WGRU, and WSGB is slightly increased, but it is very minimal compared to LSTM and GRU. The parameters used for the classical statistical models, machine learning models, and deep learning models considered in this research are shown in Table 11.

Table 11.

Parameters of classical statistical model, machine and deep learning models.

Algorithms	Hyperparameter	Values
ARIMA	Seasonal_periods	7
	Initialization_method	‘Known’
	Initial_level	“estimated”
	Trend	“add”
	Seasonal	“add”
	Smoothing_level	0.4
	Smoothing_shape	0.2
	Smoothing_seasonal	0.01
	Order	auto_arima
SARIMA	Seasonal_periods	7
	Initialization_method	‘Known’
	Initial_level	“estimated”
	Trend	“add”
	Seasonal	“add”
	Smoothing_level	0.4
	Smoothing_shape	0.2
	Smoothing_seasonal	0.01
	Order	auto_arima
HWES	Seasonal_periods	7
	Initialization_method	‘Known’
	Initial_level	“estimated”
	Trend	“add”
	Seasonal	“add”
	Smoothing_level	0.4
	Smoothing_shape	0.2
	Smoothing_seasonal	0.01
BAG	max_depth	4
	min_impurity_split	1e-07
	min_samples_leaf	1
	min_samples_split	2
	min_weight_fraction_leaf	0.0
	Presort	False
	random_state	None
	Splitter	Best
SGB	max_depth	4
	min_impurity_split	1e-07
	min_samples_leaf	1
	min_samples_split	2
	min_weight_fraction_leaf	0.0
	Presort	False
	random_state	None
	splitter	Best
RF	max_depth	4
	min_impurity_split	1e-07
	min_samples_leaf	1
	min_samples_split	2
	min_weight_fraction_leaf	0.0
	presort	False
	random_state	None
	splitter	Best
LSTM	Units	50
	return_sequences	True
	Dropout	0.2
	optimizer	Rmsprop
	loss	Mean_squared_error
	epochs	30
	batch_size	150
GRU	Units	50
	return_sequences	True
	Dropout	0.2
	optimizer	SGD
	lr	0.01
	decay	1e-7
	momentum	0.9
	nesterov	False
	loss	Mean_squared_error
	epochs	30
	batch_size	150
RNN	Units	50
	return_sequences	True
	Dropout	0.2
	optimizer	Rmsprop
	loss	Mean_squared_error
	epochs	30
	batch_size	150

Open in a new tab

5. Conclusion

One of the foundational tools of data science is time series forecasting. It is one of the most extensively utilized analytic tools in businesses and organizations. All businesses want to plan for the future. As a result, time series forecasting serves as a lynchpin for looking into the most likely future and making appropriate plans. Time-series forecasting, like any other data science approach, is comprised of a variety of techniques and methods. The hybrid walk-forward ensemble optimization model for time series forecasting is proposed in this study. The proposed technique takes care of missing values in cryptocurrency data, which are caused by a variety of reasons, including equipment failures, changes in monitor placement, periodic maintenance, and human mistakes. It also solved the problem of bias, which is often caused by disparities between observed and unobserved data in an incomplete dataset. The performance of our model was encouraging, and to the best of our knowledge, no research work has been published in the literature on the regression of real-time cryptocurrency using hybrid walk-forward ensemble optimization with distinct phases. The techniques investigated in this paper are automated ARIMA, the Augmented Dickey-Fuller test, classical statistical model, and machine and deep learning algorithms. The proposed method outperforms all the aforementioned techniques. One of the limitations of this work is the inability to obtain a large dataset for cryptocurrency. In the future, experiments will be conducted using other machine learning models, such as the Gaussian process and cubist, to confirm the strength of hybrid walk-forward ensemble optimization. Moreover, this model will be integrated into the stock market, cryptocurrency, or any time series to be used for real-time monitoring and forecasting.

Declarations

Author contribution statement

David Opeoluwa Oyewola: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Emmanuel Gbenga Dada: Conceived and designed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Juliana Ngozi Ndunagu: Performed the experiments; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data will be made available on request.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

References

1.Indulkar Y. 2021 International Conference on Emerging Smart Computing And Informatics (ESCI) IEEE; 2021, March. Time series analysis of cryptocurrencies using deep learning & fbprophet; pp. 306–311. [Google Scholar]
2.Catania L., Grassi S. 2017. Modelling crypto-currencies financial time-series. Available at SSRN 3028486. [Google Scholar]
3.Catania L., Grassi S., Ravazzolo F. Mathematical and Statistical Methods for Actuarial Sciences and Finance. Springer; Cham: 2018. Predicting the volatility of cryptocurrency time-series; pp. 203–207. [Google Scholar]
4.Ma Y., Ahmad F., Liu M., Wang Z. Portfolio optimization in the era of digital financialization using cryptocurrencies. Technol. Forecast. Soc. Change. 2020;161 doi: 10.1016/j.techfore.2020.120265. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Nakamoto S. Bitcoin: a peer-to-peer electronic cash system. Decentraliz. Bus. Rev. 2008 [Google Scholar]
6.Bagshaw R. 2021. Top 10 cryptocurrencies by market capitalization.https:www.yahoo.com/now/top-10-cryptocurrencies-market-capitalisation-160046487.html Available at. [Google Scholar]
7.Conway L. 2021. The 10 Most Important Cryptocurrencies Other than Bitcoin.https://www.investopedia.com/tech/most-important-cryptocurrencies-other-than-bitcoin/ Retrieved from. [Google Scholar]
8.Enke D., Thawornwong S. The use of data mining and neural networks for forecasting stock market returns. Expert Syst. Appl. 2005;29(4):927–940. [Google Scholar]
9.Huang W., Nakamori Y., Wang S.Y. Forecasting stock market movement direction with support vector machine. Comput. Oper. Res. 2005;32(10):2513–2522. [Google Scholar]
10.Ou P., Wang H. Prediction of stock market index movement by ten data mining techniques. Mod. Appl. Sci. 2009;3(12):28–42. [Google Scholar]
11.Gavrilov M., Anguelov D., Indyk P., Motwani R. Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2000, August. Mining the stock market (extended abstract) which measure is best? pp. 487–496. [Google Scholar]
12.Kannan K.S., Sekar P.S., Sathik M.M., Arumugam P. Proceedings of the International Multiconference of Engineers and computer scientists. Vol. 1. 2010. Financial stock market forecast using data mining techniques; p. 4. March. [Google Scholar]
13.Sheta A.F., Ahmed S.E.M., Faris H. A comparison between regression, artificial neural networks and support vector machines for predicting stock market index. Soft Comput. 2015;7(8):2. [Google Scholar]
14.Chang P.C., Liu C.H., Fan C.Y., Lin J.L., Lai C.M. International Conference on Intelligent Computing. Springer; Berlin, Heidelberg: 2009, September. An ensemble of neural networks for stock trading decision making; pp. 1–10. [Google Scholar]
15.Madan I., Saluja S., Zhao A. 2015. Automated bitcoin trading via Machine Learning algorithms. URL20. [Google Scholar]
16.Jang H., Lee J. An empirical study on modelling and prediction of bitcoin prices with bayesian neural networks based on blockchain information. IEEE Access. 2017;6:5427–5437. [Google Scholar]
17.McNally S., Roche J., Caton S. 2018 26th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) IEEE; 2018, March. Predicting the price of bitcoin using Machine Learning; pp. 339–343. [Google Scholar]
18.Hegazy K., Mumford S. CS229 Project; 2016. Comparative Automated Bitcoin Trading Strategies; p. 27. [Google Scholar]
19.Shilling A.G. Market timing: better than a buy-and-hold strategy. Financ. Anal. J. 1992;48(2):46–50. [Google Scholar]
20.Jiang Z., Liang J. 2017 Intelligent Systems Conference (IntelliSys) IEEE; 2017, September. Cryptocurrency portfolio management with deep reinforcement learning; pp. 905–913. [Google Scholar]
21.Hileman G., Rauchs M. Global cryptocurrency benchmarking study. Cambridge Centre Altern. Finan. 2017;33:33–113. [Google Scholar]
22.Hadavandi E., Shavandi H., Ghanbari A. Integration of genetic fuzzy systems and artificial neural networks for stock price forecasting. Knowl. Base Syst. 2010;23(8):800–808. [Google Scholar]
23.Nadkarni J., Neves R.F. Combining NeuroEvolution and principal component analysis to trade in the financial markets. Expert Syst. Appl. 2018;103:184–195. [Google Scholar]
24.Mudassir M., Bennbaia S., Unal D., Hammoudeh M. Time-series forecasting of Bitcoin prices using high-dimensional features: a Machine Learning approach. Neural Comput. Appl. 2020:1–15. doi: 10.1007/s00521-020-05129-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Kyriazis N.A. A survey on empirical findings about spillovers in cryptocurrency markets. J. Risk Financ. Manag. 2019;12(4):170. [Google Scholar]
26.Gatabazi P., Mba J.C., Pindza E. Fractional grey Lotka-Volterra models with application to cryptocurrencies adoption. Chaos: Interdisc. J. Nonlin. Sci. 2019;29(7) doi: 10.1063/1.5096836. [DOI] [PubMed] [Google Scholar]
27.Żbikowski K. Using volume weighted support vector machines with walk forward testing and feature selection for the purpose of creating stock trading strategy. Expert Syst. Appl. 2015;42(4):1797–1805. [Google Scholar]
28.Mallqui D.C., Fernandes R.A. Predicting the direction, maximum, minimum and closing prices of daily Bitcoin exchange rate using Machine Learning techniques. Appl. Soft Comput. 2019;75:596–606. [Google Scholar]
29.Akyildirim E., Goncu A., Sensoy A. Prediction of cryptocurrency returns using Machine Learning. Ann. Oper. Res. 2021;297(1):3–36. [Google Scholar]
30.Oyewola D.O., Ibrahim A., Kwanamu J.A., Dada E.G. A new auditory algorithm in stock market prediction on oil and gas sector in Nigerian stock exchange. Soft Comput. Lett. 2021;3 [Google Scholar]
31.Matching cryptocurrencies. 2022. https://finance.yahoo.com/cryptocurrencies Available at. [Google Scholar]
32.Torbat Sheida, Khashei Mehdi, Bijari Mehdi. A hybrid probabilistic fuzzy ARIMA model for consumption forecasting in commodity markets. Econ. Anal. Pol. 2018;58:22–31. [Google Scholar]
33.Arunraj, Sivanandam Nari, Ahrens Diane. A hybrid seasonal autoregressive integrated moving average and quantile regression for daily food sales forecasting. Int. J. Prod. Econ. 2015;170:321–335. Elsevier. [Google Scholar]
34.Jiang Weiheng, Wu Xiaogang, Gong Yi, Yu Wanxin, Zhong Xinhui. Holt–Winters smoothing enhanced by fruit fly optimization algorithm to forecast monthly electricity consumption. Energy. 2020;193 2020. [Google Scholar]
35.Venkatasubramaniam Ashwini, Wolfson Julian, Mitchell Nathan, Barnes Timothy, JaKa Meghan, French Simone. Decision trees in epidemiological research. Emerg. Themes Epidemiol. 2017;14:11. doi: 10.1186/s12982-017-0064-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Oyewola David .O., Emmanuel Gbenga Dada, Oluwatosin Temidayo Omotehinwa, Ibrahim Isa.A. Comparative analysis of linear, non-linear and ensemble machine learning algorithms for credit worthiness of consumers. Comput. Intell. Wireless Netw. 2019;1(1):1–11. [Google Scholar]
37.Emmanuel Gbenga Dada, David Opeoluwa Oyewola, Stephen Bassi Joseph, Ali Baba Dauda Ensemble machine learning model for software defect prediction. Adv. Mach. Learn. Art. Intel. 2021;2(1):11–21. [Google Scholar]
38.Oyewola David O., Bernard Alechenu, Al-Mustapha Kuluwa A., Oyewande Oluwatoyosi V. Classification of dementia diseases using Deep Learning techniques. FUDMA J. Sci. (FJS) 2020;4(2):371–379. [Google Scholar]
39.Le N.Q.K., Yapp E.K.Y., Yeh H.Y. ET-GRU: using multi-layer gated recurrent units to identify electron transport proteins. BMC Bioinf. 2019;20(1):377. doi: 10.1186/s12859-019-2972-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Oyewola D.O., Al-Mustapha K.A., Dada E.G., Kennedy O.A. Stock market movement direction with ensemble Deep Learning Network. J. Niger. Assoc. Math. Phys. 2019;53(2019):103–116. [Google Scholar]
41.Liu F.R., Ren M.Y., Zhai J.D., Sui G.Q., Zhang X.Y., Bing X.Y., Liu Y.L. 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) IEEE; 2021. Bitcoin transaction strategy construction based on deep reinforcement learning; pp. 180–183. [Google Scholar]
42.Livieris I.E., Pintelas E., Stavroyiannis S., Pintelas P. Ensemble Deep Learning models for forecasting cryptocurrency time-series. Algorithms. 2020;13(5):121. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data will be made available on request.

[bib1] 1.Indulkar Y. 2021 International Conference on Emerging Smart Computing And Informatics (ESCI) IEEE; 2021, March. Time series analysis of cryptocurrencies using deep learning & fbprophet; pp. 306–311. [Google Scholar]

[bib2] 2.Catania L., Grassi S. 2017. Modelling crypto-currencies financial time-series. Available at SSRN 3028486. [Google Scholar]

[bib3] 3.Catania L., Grassi S., Ravazzolo F. Mathematical and Statistical Methods for Actuarial Sciences and Finance. Springer; Cham: 2018. Predicting the volatility of cryptocurrency time-series; pp. 203–207. [Google Scholar]

[bib4] 4.Ma Y., Ahmad F., Liu M., Wang Z. Portfolio optimization in the era of digital financialization using cryptocurrencies. Technol. Forecast. Soc. Change. 2020;161 doi: 10.1016/j.techfore.2020.120265. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] 5.Nakamoto S. Bitcoin: a peer-to-peer electronic cash system. Decentraliz. Bus. Rev. 2008 [Google Scholar]

[bib6] 6.Bagshaw R. 2021. Top 10 cryptocurrencies by market capitalization.https:www.yahoo.com/now/top-10-cryptocurrencies-market-capitalisation-160046487.html Available at. [Google Scholar]

[bib7] 7.Conway L. 2021. The 10 Most Important Cryptocurrencies Other than Bitcoin.https://www.investopedia.com/tech/most-important-cryptocurrencies-other-than-bitcoin/ Retrieved from. [Google Scholar]

[bib8] 8.Enke D., Thawornwong S. The use of data mining and neural networks for forecasting stock market returns. Expert Syst. Appl. 2005;29(4):927–940. [Google Scholar]

[bib9] 9.Huang W., Nakamori Y., Wang S.Y. Forecasting stock market movement direction with support vector machine. Comput. Oper. Res. 2005;32(10):2513–2522. [Google Scholar]

[bib10] 10.Ou P., Wang H. Prediction of stock market index movement by ten data mining techniques. Mod. Appl. Sci. 2009;3(12):28–42. [Google Scholar]

[bib11] 11.Gavrilov M., Anguelov D., Indyk P., Motwani R. Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2000, August. Mining the stock market (extended abstract) which measure is best? pp. 487–496. [Google Scholar]

[bib12] 12.Kannan K.S., Sekar P.S., Sathik M.M., Arumugam P. Proceedings of the International Multiconference of Engineers and computer scientists. Vol. 1. 2010. Financial stock market forecast using data mining techniques; p. 4. March. [Google Scholar]

[bib13] 13.Sheta A.F., Ahmed S.E.M., Faris H. A comparison between regression, artificial neural networks and support vector machines for predicting stock market index. Soft Comput. 2015;7(8):2. [Google Scholar]

[bib14] 14.Chang P.C., Liu C.H., Fan C.Y., Lin J.L., Lai C.M. International Conference on Intelligent Computing. Springer; Berlin, Heidelberg: 2009, September. An ensemble of neural networks for stock trading decision making; pp. 1–10. [Google Scholar]

[bib15] 15.Madan I., Saluja S., Zhao A. 2015. Automated bitcoin trading via Machine Learning algorithms. URL20. [Google Scholar]

[bib16] 16.Jang H., Lee J. An empirical study on modelling and prediction of bitcoin prices with bayesian neural networks based on blockchain information. IEEE Access. 2017;6:5427–5437. [Google Scholar]

[bib17] 17.McNally S., Roche J., Caton S. 2018 26th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) IEEE; 2018, March. Predicting the price of bitcoin using Machine Learning; pp. 339–343. [Google Scholar]

[bib18] 18.Hegazy K., Mumford S. CS229 Project; 2016. Comparative Automated Bitcoin Trading Strategies; p. 27. [Google Scholar]

[bib19] 19.Shilling A.G. Market timing: better than a buy-and-hold strategy. Financ. Anal. J. 1992;48(2):46–50. [Google Scholar]

[bib20] 20.Jiang Z., Liang J. 2017 Intelligent Systems Conference (IntelliSys) IEEE; 2017, September. Cryptocurrency portfolio management with deep reinforcement learning; pp. 905–913. [Google Scholar]

[bib21] 21.Hileman G., Rauchs M. Global cryptocurrency benchmarking study. Cambridge Centre Altern. Finan. 2017;33:33–113. [Google Scholar]

[bib22] 22.Hadavandi E., Shavandi H., Ghanbari A. Integration of genetic fuzzy systems and artificial neural networks for stock price forecasting. Knowl. Base Syst. 2010;23(8):800–808. [Google Scholar]

[bib23] 23.Nadkarni J., Neves R.F. Combining NeuroEvolution and principal component analysis to trade in the financial markets. Expert Syst. Appl. 2018;103:184–195. [Google Scholar]

[bib24] 24.Mudassir M., Bennbaia S., Unal D., Hammoudeh M. Time-series forecasting of Bitcoin prices using high-dimensional features: a Machine Learning approach. Neural Comput. Appl. 2020:1–15. doi: 10.1007/s00521-020-05129-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] 25.Kyriazis N.A. A survey on empirical findings about spillovers in cryptocurrency markets. J. Risk Financ. Manag. 2019;12(4):170. [Google Scholar]

[bib26] 26.Gatabazi P., Mba J.C., Pindza E. Fractional grey Lotka-Volterra models with application to cryptocurrencies adoption. Chaos: Interdisc. J. Nonlin. Sci. 2019;29(7) doi: 10.1063/1.5096836. [DOI] [PubMed] [Google Scholar]

[bib27] 27.Żbikowski K. Using volume weighted support vector machines with walk forward testing and feature selection for the purpose of creating stock trading strategy. Expert Syst. Appl. 2015;42(4):1797–1805. [Google Scholar]

[bib28] 28.Mallqui D.C., Fernandes R.A. Predicting the direction, maximum, minimum and closing prices of daily Bitcoin exchange rate using Machine Learning techniques. Appl. Soft Comput. 2019;75:596–606. [Google Scholar]

[bib29] 29.Akyildirim E., Goncu A., Sensoy A. Prediction of cryptocurrency returns using Machine Learning. Ann. Oper. Res. 2021;297(1):3–36. [Google Scholar]

[bib30] 30.Oyewola D.O., Ibrahim A., Kwanamu J.A., Dada E.G. A new auditory algorithm in stock market prediction on oil and gas sector in Nigerian stock exchange. Soft Comput. Lett. 2021;3 [Google Scholar]

[bib31] 31.Matching cryptocurrencies. 2022. https://finance.yahoo.com/cryptocurrencies Available at. [Google Scholar]

[bib32] 32.Torbat Sheida, Khashei Mehdi, Bijari Mehdi. A hybrid probabilistic fuzzy ARIMA model for consumption forecasting in commodity markets. Econ. Anal. Pol. 2018;58:22–31. [Google Scholar]

[bib33] 33.Arunraj, Sivanandam Nari, Ahrens Diane. A hybrid seasonal autoregressive integrated moving average and quantile regression for daily food sales forecasting. Int. J. Prod. Econ. 2015;170:321–335. Elsevier. [Google Scholar]

[bib34] 34.Jiang Weiheng, Wu Xiaogang, Gong Yi, Yu Wanxin, Zhong Xinhui. Holt–Winters smoothing enhanced by fruit fly optimization algorithm to forecast monthly electricity consumption. Energy. 2020;193 2020. [Google Scholar]

[bib35] 35.Venkatasubramaniam Ashwini, Wolfson Julian, Mitchell Nathan, Barnes Timothy, JaKa Meghan, French Simone. Decision trees in epidemiological research. Emerg. Themes Epidemiol. 2017;14:11. doi: 10.1186/s12982-017-0064-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] 36.Oyewola David .O., Emmanuel Gbenga Dada, Oluwatosin Temidayo Omotehinwa, Ibrahim Isa.A. Comparative analysis of linear, non-linear and ensemble machine learning algorithms for credit worthiness of consumers. Comput. Intell. Wireless Netw. 2019;1(1):1–11. [Google Scholar]

[bib37] 37.Emmanuel Gbenga Dada, David Opeoluwa Oyewola, Stephen Bassi Joseph, Ali Baba Dauda Ensemble machine learning model for software defect prediction. Adv. Mach. Learn. Art. Intel. 2021;2(1):11–21. [Google Scholar]

[bib38] 38.Oyewola David O., Bernard Alechenu, Al-Mustapha Kuluwa A., Oyewande Oluwatoyosi V. Classification of dementia diseases using Deep Learning techniques. FUDMA J. Sci. (FJS) 2020;4(2):371–379. [Google Scholar]

[bib39] 39.Le N.Q.K., Yapp E.K.Y., Yeh H.Y. ET-GRU: using multi-layer gated recurrent units to identify electron transport proteins. BMC Bioinf. 2019;20(1):377. doi: 10.1186/s12859-019-2972-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] 40.Oyewola D.O., Al-Mustapha K.A., Dada E.G., Kennedy O.A. Stock market movement direction with ensemble Deep Learning Network. J. Niger. Assoc. Math. Phys. 2019;53(2019):103–116. [Google Scholar]

[bib41] 41.Liu F.R., Ren M.Y., Zhai J.D., Sui G.Q., Zhang X.Y., Bing X.Y., Liu Y.L. 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) IEEE; 2021. Bitcoin transaction strategy construction based on deep reinforcement learning; pp. 180–183. [Google Scholar]

[bib42] 42.Livieris I.E., Pintelas E., Stavroyiannis S., Pintelas P. Ensemble Deep Learning models for forecasting cryptocurrency time-series. Algorithms. 2020;13(5):121. [Google Scholar]

PERMALINK

A novel hybrid walk-forward ensemble optimization for time series cryptocurrency prediction

David Opeoluwa Oyewola

Emmanuel Gbenga Dada

Juliana Ngozi Ndunagu

Abstract

1. Introduction

2. Related works

Table 1.

Table 2.

3. Methodology

3.1. Description of dataset

3.2. Classical statistical model, machine and deep learning techniques

3.2.1. Auto regressive integrated moving average (ARIMA)

3.2.2. Seasonal autoregressive integrated moving-average (SARIMA)

3.2.3. Holt winter's exponential smoothing (HWES)

3.2.4. Decision tree (BAG)

3.2.5. Stochastic gradient boosting (SGB)

3.2.6. Random forest (RF)

3.2.7. Long short term memory (LSTM)

3.2.8. Gated recurrent unit (GRU)

3.2.9. Recurrent neural network (RNN)

3.2.10. Performance evaluation

3.2.10.1. Mean absolute error (MAE)

3.2.10.2. Root mean square error (RMSE)

3.2.10.3. Mean square error (MSE)

Table 3.

Table 4.

Table 5.

Table 6.

Table 7.

Table 8.

Table 9.

3.3. Proposed hybrid walk-forward ensemble optimization

3.4. Block diagram of proposed hybrid walk-forward ensemble optimization

Figure 1.

4. Result and discussion

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Figure 9.

Figure 10.

Figure 11.

Figure 12.

Figure 13.

Figure 14.

Figure 15.

Figure 16.

Figure 17.

Table 10.

Figure 18.

Table 11.

5. Conclusion

Declarations

Author contribution statement

Funding statement

Data availability statement

Declaration of interests statement

Additional information

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases