A bibliometric literature review of stock price forecasting: From statistical model to deep learning approach

Pham Hoang Vuong; Lam Hung Phu; Tran Hong Van Nguyen; Le Nhat Duy; Pham The Bao; Tan Dat Trinh

doi:10.1177/00368504241236557

. 2024 Mar 15;107(1):00368504241236557. doi: 10.1177/00368504241236557

A bibliometric literature review of stock price forecasting: From statistical model to deep learning approach

Pham Hoang Vuong ^1,², Lam Hung Phu ¹, Tran Hong Van Nguyen ³, Le Nhat Duy ², Pham The Bao ^1,^✉, Tan Dat Trinh ^1,^✉

PMCID: PMC10943735 PMID: 38490223

Abstract

We introduce a comprehensive analysis of several approaches used in stock price forecasting, including statistical, machine learning, and deep learning models. The advantages and limitations of these models are discussed to provide an insight into stock price forecasting. Traditional statistical methods, such as the autoregressive integrated moving average and its variants, are recognized for their efficiency, but they also have some limitations in addressing non-linear problems and providing long-term forecasts. Machine learning approaches, including algorithms such as artificial neural networks and random forests, are praised for their ability to grasp non-linear information without depending on stochastic data or economic theory. Moreover, deep learning approaches, such as convolutional neural networks and recurrent neural networks, can deal with complex patterns in stock prices. Additionally, this study further investigates hybrid models, combining various approaches to explore their strengths and counterbalance individual weaknesses, thereby enhancing predictive accuracy. By presenting a detailed review of various studies and methods, this study illuminates the direction of stock price forecasting and highlights potential approaches for further studies refining the stock price forecasting models.

Keywords: Autoregressive integrated moving average, machine learning, deep learning, hybrid model, stock price forecasting

Introduction

The problem of stock price forecasting (SPF) has always been one of the most widely studied issues, involving a comprehensive approach that focuses on the analysis of historical prices, price movements, or trends to forecast future prices.¹ Numerous models and predictions of stock prices have been proposed.² Because stock prices move in a random walk,³ researchers claim that the financial information of the company will be systematically reflected in the current price. According to the Efficient Market Hypothesis (EMH), an efficient market is one where prices always reflect all available information,⁴ and it is categorized into three forms of market efficiency: Weak form, semi-strong form, and strong form. In practice, investors and financial practitioners have commonly employed technical analysis and fundamental analysis for SPF or trading decision-making.⁵ According to the research in the work of Nti et al.,⁶ fundamental analysis is the study of factors influencing supply and demand. Important data used for fundamental analysis include company data such as financial reports, annual company reports, and balance sheets.

A widely used method is time series analysis, which involves techniques for analyzing time series data to extract meaningful statistical attributes and characteristics of the data. The initial approach is decomposing the series, and commonly used methods are the Holt-Winters method⁷ or the Census II X-11 method.⁸ The autoregressive integrated moving average (ARIMA) approach is a widely used statistical method for analyzing and forecasting time series data.^9,10 While ARIMA has demonstrated its utility in capturing short- to medium-term price trends, it can be difficult to handle the complex dynamics and non-linear patterns often observed in stock markets. To address the shortcomings of conventional SPF systems based on ARIMA approaches, a learning-based approach using machine learning (ML)^11–13 and deep learning (DL) techniques was introduced.^14,15 The ML approaches have shown significant promise in understanding the complexities of financial markets, characterized by dynamic interactions among various elements that influence stock prices. During the 2000s, in comparison to conventional probabilistic or ML approaches, Zuo and Kita¹⁶ employed a Bayesian network (BN) to predict the price-earnings ratio (P/E ratio). A study by Adebiyi et al. investigated the performance of both ARIMA and artificial neural network (ANN) using stock data from the New York Stock Exchange.¹⁷

Recently, DL approaches were applied to predicting stock prices. DL models can capture complex temporal dependencies and non-linear patterns that are prevalent in stock price movements. Modern models like the convolutional neural network (CNN), the long short-term memory (LSTM) network, and the bidirectional LSTM (BiLSTM) network utilize the approximation of a continuous function and adapt data with fewer assumptions, thereby achieving higher accuracy and efficiency in solving nonlinear issues. Furthermore, a hybrid model for SPF typically integrates multiple predictive modeling techniques to enhance the accuracy and reliability of stock price predictions.^18–20

Performing a literature review on SPF is a necessary preliminary step before conducting the study or making decisions within this field. This review investigates the development of SPF techniques, ranging from traditional ARIMA methods to advanced DL methods. When starting with a literature review about SPF, we first take advantage of the capabilities of well-known platforms such as Scopus and Google Scholar. Our initial step involves entering targeted keywords into the Scopus database and combining various search terms. These include “stock price forecasting,” “stock price prediction,” “stock price forecasting using ARIMA,” “machine learning in stock prediction,” “deep learning model in stock forecasting,” “CNN, LSTM in stock price forecasting,” “GAN in stock price forecasting,” “transformer for stock price forecasting,” “graph-CNN in stock price forecasting,” and “hybrid models in stock price forecasting.” Through the application of specific filters, the search is fine-tuned to align with preferred publication dates, reputable journals, and subject areas. Essential keywords are used to ensure that the reviewed papers are relevant to the topic. Employing a parallel approach with Google Scholar, the same terms are entered, with particular emphasis on the “cited by” feature. This process leads to subsequent research papers that cite foundational works. The focus of our review was on specific forecasting models such as ARIMA, traditional ML, DL, and hybrid models for historical data or a particular stock market. We give priority to selecting papers for review that are primarily from peer-reviewed, reputable journals and conferences. At the same time, we exclude articles published in workshop publications or technical reports. Additionally, the keywords we select for the subject area are mainly related to computer science, engineering, economics, econometrics and finance, business, management and accounting, and decision sciences. Consequently, a total of 110 studies (only English-language papers) were identified, and those published in conferences or book chapters were removed by subject area, or they would be published in articles (n = 15).

During the process of accessing these resources, summaries, key findings, methodologies, and significant conclusions are systematically extracted. Our literature review is based on the use of these carefully chosen findings, ensuring it is both comprehensive and based on the most recent developments in SPF. Furthermore, it is important to mention that we exclusively assessed papers introducing new models for time series data forecasting, with a specific focus on forecasting methods for traditional financial data, such as historical data and indicators. We also exclude methods applied to non-traditional sources, such as social media trends, news updates, or news sentiment. The concept of sentiment analysis and the implementation of existing models were not considered in this review. After reading the abstract (and, as needed, other sections), the implementation of inclusion and exclusion criteria yielded 73 papers. Figure 1 shows the search strategy in our study.

Figure 1. — Search strategy for the selection method of the relevant studies.

Related works

In this section, we present a literature review on several common approaches that have been applied for SPF. Figure 2 shows a flowchart illustrating the common approaches to stock price forecasting outlined in the study.

ARIMA approaches

The well-known traditional statistics time series forecasting methods, such as ARIMA and its variants^17,21–29 are still used a lot because of their efficiency level. Table 1 presents a summary of ARIMA-based approaches for SPF. For the articles reviewed, we summarize the methods used, comparison methods, datasets, target outputs, input features, metrics evaluations, and briefly discuss performance results. Low and Sakk²² examined the performance of two forecasting models, ARIMA and LSTM, for predicting stock prices. ARIMA is combined with LSTM to determine which is superior in terms of forecasting accuracy. The models were applied to data from ten different stock tickers, specifically exchange-traded funds from various market sectors. The results suggest that ARIMA shows comparable accuracy to LSTM's long-term prediction capabilities. Wahyudi²³ employed the ARIMA model to predict the volatility of Indonesian stock prices. The best ARIMA model is determined using the Akaike information criterion (AIC) criteria. The results indicate that the ARIMA model can compete well with existing techniques for stock price prediction, especially in the short term. Pulungan et al.²⁴ applied a combination of autoregressive (AR) and moving average (MA) methods. The data needs to be stationary for ARIMA to be applied efficiently. The most fitting ARIMA model for this data was identified as ARIMA (31,1).

Table 1.

Summary of the existing ARIMA-based stock price forecasting approaches.

Reference No.	Method	Comparisons	Dataset	Targets	Input features	Metrics	Results
²¹	ARIMA	LSTM, SARIMAX	Stock exchange market data from Yahoo Finance	Closing values of stock prices	Index, Open, Close, Adj Close, High, Low, Volume and Close USD	MAE	Error for ARIMA is less as compared to SARIMAX
²²	ARIMA	LSTM	Ten different stock tickers comprising exchange-traded funds	Stock price prediction	Closing price	MSE	ARIMA was found to be more accurate in making point predictions
²³	ARIMA	ARIMA (0, 1, 1) ARIMA (1, 1, 0) ARIMA (1, 1, 1)	Daily Indonesia CSPI	Daily movement of stock prices	Closing price	MAPE	ARIMA (01,1): 0.8431
²⁴	ARIMA	ARIMA (31,1)	Socially Responsible Investment Index (SRI-KEHATI) on the Indonesia Stock Exchange	Daily closure of the SRI-KEHATI Index data	Closing price	Ljung-Box Q statistical test	ARIMA (31,1) had a significant effect on the SRI-KEHATI Index

Open in a new tab

ARIMA: autoregressive integrated moving average; MAPE: mean absolute percent error; MSE: mean squared error; MAE: mean absolute error; LSTM: long short-term memory; CSPI: composite stock price index.

Machine learning approaches

ML techniques can capture nonlinear information in time series data without relying on stochastic data or economic knowledge. Thus, ML approaches can be used to build high-performance SPF systems without expert knowledge. The traditional ML algorithms, such as ANNs,^11,30,31 k-nearest neighbors (KNN),^32,33 support vector machine (SVM),^34–40 ensemble models,^41–47 and BN,^48,49 have been successfully and widely used in SPF systems. Table 2 presents articles on SPF based on ML approaches.

Table 2.

Summary of the existing ML-based stock price forecasting approaches.

Reference No.	Method	Comparisons	Dataset	Targets	Input features	Metrics	Results
³⁰	ANN	SPSS statistics tool	Bombay Stock Exchange Limited	Future direction of the stock price movements	Opening price, high price, low price, and closing price	AAE, MAE, RMSE	ANNs provide higher accuracy
³¹	ANN	ANN_SCG, ANN_LM, ANN_BR	Reliance Private Limite from Thomson Reuter Eikon	Stock prices and movements	Tick Data, and 15-min Data	MAPE, MSE	ANN_SCG obtained best performance
³²	KNN	Baseline KNN, regression prediction	Historical data of stock Neimengyiji	History	High, low, open, and close	Standard error	Improved KNN yielded the best result
³³	EEMD–MKNN–TSPI	EEMD–MKNN, MKNN–TSPI	NAS, DJI, S&P 500, Russell 2000; and stock data from 04 regions	Opening and closing price	Opening and closing prices	MAPE, MASE, NMSE	EEMD–MKNN–TSPI model outperforms the EEMD–MKNN and MKNN–TSPI models
³⁴	SSA–SVM	ANFIS, SVM, EEMD–ANFIS, EEMD–SVM, and SSA–ANFIS	Shanghai Stock Exchange Composite Index	Daily closing price	Closing price	MSE, MAPE, DS, R²	SSA–SVM model exhibiting the best prediction performance
³⁵	SVM	LR, ANN, RF	Kuala Lumpur Composite Index, Kuala Lumpur Stock Exchange Industrial, Kuala Lumpur Stock Exchange Technology	Next day movement	Stock returns, technical indicators, connected components, Holes	Average of the prediction performances	Support vector machine with persistent homology generates the best outcome
⁴²	Random Forest	LR, LDA, NB, KNN, K*, C4.5, CART, ANN, SVM	Indonesia Stock Exchange	Prediction of the LQ45 index	15 variables (volum, value, …)	Accuracy, recall, precision	RF had the best performance
⁴³	Random Forest	XGBoost, Bagging Classifier, AdaBoost, Extra Trees Classifier, Voting Classifier	NYSE, NASDAQ, NSE	Direction of stock price movement	40 technical indicators and the OHLCV variables	Accuracy, precision, f1-score, specificity, and AUC	Extra Trees classifier outperformed the other models
⁴⁸	Bayesian neural network	FNN-Adam, FNN-SGD	3 M Company, China Spacesat Company Limited, Commonwealth Bank of Australia, Daimler AG	Future trends of stock price	Adjusted closing price	RMSE	Bayesian neural network provided one of the best performances
⁴⁹	Bayesian neural network	ANN, SVM, KNN, decision tree, NB, …	12 indices: Nasdaq Composite, NYSE Composite, Dow Jones, …	Next day closing direction	Closing direction	Accuracy	The mean accuracy was around 71%

Open in a new tab

ANN: artificial neural network; KNN: k-nearest neighbor; SVM: support vector machine; SSA: singular spectrum analysis; RF: random forest; NB: Naïve Bayes; LDA: linear discriminant analysis; MAPE: mean absolute percent error; RMSE: root mean squared error; MSE: mean squared error; MAE: mean absolute error; AUC: area under the receiver operating characteristic curve; SPSS: statistical product and services solutions.

Sigo³⁰ explored the nonlinear movement patterns of three leading stocks on the Bombay Stock Exchange (BSE) in India. ANN is employed to analyze data spanning from 2008 to 2017. The results of the study aim to guide investors in making informed investment decisions and maximizing their returns by focusing on the most valuable stocks. Selvamuthu et al.³¹ addressed the challenge of predicting stock prices in the Indian stock market. Recognizing that stock price data is inherently difficult to predict due to its dynamic nature, the authors explored the efficiency of ANN.

When applied to SPF, KNN is used to predict a stock's future price based on its past values. Yunneng³² presented an enhanced version of the KNN algorithm for stock price predictions. This improvement aims to provide more accurate predictions of stock prices. Lin et al.³³ presented a novel method for improving the accuracy of stock time series forecasting using a multidimensional KNN algorithm. The results showed that the proposed method outperformed the other models in predicting stock prices, proving to be a more reliable and effective forecasting system.

SVMs have been primarily developed for classification problems, but their application has been extended to regression problems known as support vector regression (SVR). SVR can be applied to SPF. Xiao et al.³⁴ introduced a novel methodology for stock price analysis and forecasting, combining singular spectrum analysis (SSA) and SVM. Ismail et al.³⁵ aimed to predict the direction of stock price movement. The study introduced a hybrid method that combines various ML techniques—namely logistic regression (LR), ANN, SVM, and random forest (RF)—to enhance prediction accuracy.

Developing an ensemble model for SPF involves aggregating the predictions from multiple models to improve the accuracy and robustness of the predictions. Syukur and Istiawan⁴² investigated the prediction of the LQ45 index on the Indonesia Stock Exchange (ISX) using various ML techniques. RF was found to obtain the best performance in predicting the LQ45 index compared to C4.5, SVM, LR, Naïve Bayes (NB), and linear discriminant analysis (LDA).

In the context of SPF, Bayesian neural networks (BNNs) enable the prediction of the likelihood of various stock prices, based on given evidence or observed variables. Chandra and He⁴⁸ explored the utilization of BNNs for forecasting stock prices. Malagrino et al.⁴⁹ explored the potential of BNNs to understand the influence of global stock market indices on iBOVESPA, the primary index of the São Paulo Stock Exchange in Brazil. The objective is to forecast the closing direction of iBOVESPA the next day. The BNN models were able to achieve a mean accuracy of around 71%, with a peak accuracy of nearly 78%, in predicting the daily closing direction of iBOVESPA.

Deep learning approaches

A DL model can effectively outperform traditional SPF systems in terms of accuracy. CNNs^50–54 and recurrent neural network (RNN) such as LSTM or gated recurrent unit (GRU),^55–60 and BiLSTM^61–64 are extensively employed for SPF systems. Table 3 presents a summary of DL-based approaches for SPF.

Table 3.

Summary of the existing DL-based stock price forecasting approaches.

Reference No.	Method	Comparisons	Dataset	Targets	Input features	Metrics	Results
⁵⁰	CNN	LR, CNN-Rand, CNN-Corr, LR With FS	BIST 100 Index	Hourly stock price direction	25 technical indicators with different time lags	Macro-Averaged F-Measure	CNN-Corr classifier yielded the best performance
⁵¹	CNN + frequent patterns	ARIMA, Wavelet + ARIMA, HMM, LSTM, SFM	S&P 500 and 07 individual stocks	Trend of stock price	Closed value	Accuracy, recall, precision, f1-score	Proposed method outperformed the others with a 4%–7% accuracy improvement
⁵⁵	LSTM	Random Forest	S&P 500	Directional movements of stock price	Adjusted closing prices and opening prices	Various metric (mean, std error, sharpe ratio, …)	LSTM outperforms random forests
⁵⁶	LSTM	LASSO-LSTM, PCA-LSTM, LASSO-GRU, PCA-GRU	Shanghai Composite Index	Stock price trend	Open, high, low, trading volume, and other technical indicators	RMSE, MAE	LSTM and GRU with LASSO yielded better accuracy than models with PCA
⁶²	BiLSTM	WAE-BLSTM, W-BLSTM, W-LSTM, BLSTM, LSTM	S&P500	Next day closing price	Open, high, low, close (OHLC), 08 technical indicators	MAE, RMSE, R²	WAE-BLSTM model outperformed the other models. MAE (0.0211), RMSE (0.0272), and R² (0.8934)
⁶⁴	AE-BiLSTM-ECA	CNN, LSTM, BiLSTM, CNN-LSTM, AE-LSTM, CNN-BiLSTM, AE-BiLSTM, BiLSTM-ECA, AE-LSTM-ECA, CNN-LSTM-ECA	Shanghai Stock Composite Index (SSCI) and CSI 300	Closing price	Seven characteristics such as closing, high, open, low, previous day's closing price, up or down amount and and up or down rate	MSE, RMSE, MAE, MAPE	AE-BiLSTM-ECA obtain the best accuracy. CSI 300 stock data: MSE: 3158.452 RMSE: 56.200 MAE: 36.681 MAPE: 1.020 SSCI stock data MSE: 1935.398 RMSE: 43.993 MAE: 28.940 MAPE: 1.019

Open in a new tab

ARIMA: autoregressive integrated moving average; BiLSTM: bidirectional long short-term memory; LR: logistic regression; GRU: gated recurrent unit; PCA: principal component analysis; AE: auto-encoder; ECA: efficient channel attention; MAPE: mean absolute percent error; MSE: mean squared error; RMSE: root mean squared error; MAE: mean absolute error; DL: deep learning.

CNNs are commonly used for image and video processing; however, their proficiency in identifying hierarchical patterns can extend to time series forecasting as well. Gunduz et al.⁵⁰ applied a CNN model that utilized specially ordered features derived from various indicators, prices, and temporal information appropriate to stocks in the Borsa Istanbul 100. Wen et al.⁵¹ presented an approach to forecasting stock market trends utilizing financial time series data, exemplified by the S&P 500. CNN explored to distinguish the spatial structure inherent in the time series.

RNNs have been proposed for SPF systems. Results have demonstrated that methodologies based on RNN can outperform classic ML techniques. RNNs can handle sequences of variable length, offering flexibility in managing time series data of diverse lengths. To overcome the vanishing gradient problem intrinsic to RNNs, LSTM networks, GRU, and their variants were developed. The utilization of LSTMs has been substantiated as effective in accurately forecasting stock prices. Ghosh et al.⁵⁵ demonstrated the efficacy of employing both LSTM networks and RFs to forecast directional movements of stock prices from the S&P 500 index for intraday trading. Authors⁵⁶ proposed an optimized approach for predicting stock prices using advanced DL techniques, such as LSTM and GRU models. The authors employed DL LASSO and principal component analysis (PCA) for dimensionality reduction, focusing on various factors influencing stock prices.

BiLSTM is often used for sequence-to-sequence learning tasks, like SPF. The BiLSTM allows the model to capture both past and future information around a specific time step, potentially enhancing the model's ability to understand the underlying patterns in the sequence. Xu et al.⁶² focused on utilizing a stacked DL structure for stock market predictions, specifically aiming to predict the stock price of the subsequent day. This model employs historical stock price data sourced from Yahoo Finance and integrates several methodologies, including the wavelet transform technique, stacked autoencoder, RNN, and BiLSTM. Liu et al.⁶⁴ employed an auto-encoder (AE) technique to extract stock price series data, recognizing its proficiency in managing the non-smooth and non-linear characteristics inherent in the data. The core structure of the AE incorporates a BiLSTM module, which allows the model to efficiently extract substantial historical and prospective information from stock price series data.

Hybrid approaches

In SPF, hybrid models refer to combinations of different models aiming to leverage the strengths and reduce the drawbacks of individual methods. Hybrid models can achieve higher predictive accuracy than single models. However, they also come with challenges, such as increased model complexity, potential difficulties in model interpretation, and the requirement for extensive tuning and validation. These hybrid models can generally be categorized into two main types: hybrid traditional approaches and hybrid DL approaches. Table 4 presents a summary of hybrid-based approaches for SPF.

Table 4.

Summary of the existing hybrid-based stock price forecasting approaches.

Reference No.	Method	Comparisons	Dataset	Targets	Input features	Metrics	Results
Hybrid traditional approaches
⁶⁵	SVM-KNN	CEFLANN, FLIT2FNS	Bombay Stock Exchange (BSE Sensex) and CNX Nifty	Trends, volatility, and momentum of stock indices	Open, low, high, closing, and technical indicators	MAPE, MSFE, RMSFE	SVM-KNN has better performance than CEFLANN and FLIT2FNS
⁶⁶	SVR-TLBO	OFS-SVR-TLBO, KPCA-SVR-TLBO	Tata Steel from Bombay Stock Exchange	Closing price	07 features	MAE, RMSE, MAPE	KPCA-SVR-TLBO performed better than OFS-SVR-TLBO
Hyrid deep learning with traditional approaches
⁷¹	TI-CNN	CNN-TA, 1D CNN, CNN-LSTM	NASDAQ, NYSE	Stock movement, Buying and selling points	10 technical indicators	Accuracy, f1-score	TI-CNN achieves high prediction accuracy
⁷²	CNN + optimizing algorithm	CNN, RS-CNN, FF-CNN, PSO-CNN	Tata Motors from Yahoo Finance	Closing price	date, open, low, close, high, volume, and adjacent close	MSE, MAE, RMSE, MAPE	FF-CNN outperformced the others
Hybrid deep learning approaches
⁷⁶	CNN-LSTM	MLP, CNN, RNN, LSTM, CNN-RNN	Shanghai Composite Index	Next day closing price	Open, high, low, closing price, volume, turnover, ups and downs, and change of the stock data	MAE, RMSE, R²	CNN-LSTM obtained the best performance MAE(27.564), RMSE(39.688), R2(0.9646)
⁷⁷	SACLSTM	SVM, CNN-cor, CNNpred, ANN	10 stocks from American market and Taiwan	Direction of the stock market (rise and fall)	Historical data, futures, and options	Accuracy	SACLSTM performs relatively well compared to the others
⁷⁸	BiSLSTM	MLP, RNN, LSTM, BiLSTM, CNN-LSTM, CNN-BiLSTM	Shenzhen Component Index	Closing price	Open, high, low, closing, volume, turnover, ups and downs, and change	MAE, RMSE, R²	CNN-BiSLSTM has optimal values for MAE (113.47137), RMSE (162.53164), R2 (0.98634)
⁸¹	CNN-BiLSTM-AM	MLP, CNN, RNN, LSTM, BiLSTM, CNN-LSTM, CNN- BiLSTM, BiLSTM-AM	Shanghai Composite Index	Next day stock closing price	Open, high, low, closing, volume, turnover, ups and downs, and change	MAE, RMSE, R²	CNN-BiLSTM-AM yielded the best results MAE(21.952), RMSE(31.694), R2(0.9804)
⁸²	CNN-BiLSTM-ECA	CNN, LSTM, BiLSTM, CNN-LSTM, CNN-BiLSTM, BiLSTM-ECA, CNN-LSTM-ECA, CNN-BiLSTM-ECA	Shanghai Composite Index, China Unicom, CSI 300	Next day closing price	Closing, high, low, open, previous day's closing price, change, ups and downs, and other time series data	MSE, RMSE, MAE	CNN-BiLSTM-ECA obtained the best performance
⁸⁵	BiLSTM-MTRAN-CNN	BiLSTM-SA-TCN, CNN-BiLSTM, CNN-BiLSTM-AM, BiLSTM	A-share Index, Shanghai Composite Index, Shenzhen Component Index, CSI 300 and Growth Enterprise Board Index	Next day closing price	Trading data and technical indexes data	MAE, MSE, RMSE, R²	BiLSTM-MTRAN-TCN outperforms the other methods
⁸⁶	FDG-Trans	DeepLOB, DeepAtt, MHF	Limit order book (LOB), CSI-300	Price movements.	LOB information	R², MSE, MAE	The FDG-Trans has less error compared to the other models
⁸⁷	WGAN-S	H-LSTM, GAN, GAN-S, LSGAN, LSGAN-S, WGAN	Taiwan Stock Exchange Capitalization Weighted Stock Index	Three trading actions: buying, selling, and holding	Opening, closing, highest, lowest, trading volume, and technical indices	Cumulative return on investment, the Sharpe ratio, and winning percentage.	GAN outperform LSTM
⁸⁸	DCGAN	ARIMAX-SVR, RF regressor, LSTM, GAN	FTSE MIB Index	Closing price	Technical indicators	RMSE, MAE, MAPE	DCGAN obtained the best performance
⁸⁹	Improve GCNN	MOM, MR, LSTM, DARNN, SFM, GCN, TGC, HATS, STHGCN	A-share market stock prices in China	Trend of stock price	Open, high, low, close, and trading volume	Accuracy, recall, precision, f1-score, AUC	Proposed model achieves the best accuracy,
⁹⁰	CT-GCNN	GNN, LSTM, RNN, CNN, BP	A-share market stock prices in China	Stock price movement	Open, close, exchange rate, high, low, trading volume	MSE, MAPE	CT-GCNN model demonstrated stability and superiority

Open in a new tab

CT-GCNN: conceptual-temporal graph convolutional neural network; BiLSTM: bidirectional long short-term memory; KNN: k-nearest neighbor; SVM: support vector machine; SVR: support vector regression; RF: random forest; RNN: recurrent neural network; KPCA: Kernel principal component analysis; OFS: orthogonal forward selection; teaching-learning-based optimization; FF: firefly algorithm; PSO: particle swarm optimization; RS: random search; ECA: efficient channel attention; DCGAN: deep convolutional generative adversarial network; GAN: generative adversarial network; MAPE: mean absolute percent error; MSE: mean squared error; RMSE: root mean square error; MAE: mean absolute error; AUC: area under the receiver operating characteristic curve

Hybrid traditional approaches^65–70 typically combine traditional statistical methods with ML techniques, or they combine various ML approaches with each other. Nayak et al.⁶⁵ introduced a hybrid model that integrates both the SVM and KNN techniques for predicting Indian stock market indices. The model's performance was evaluated using the mean squared error (MSE), and it was found that the SVM-KNN model outperformed several baseline models. Siddique and Panda⁶⁶ compared various hybrid ML models for prediction. These models utilized dimension reduction techniques such as orthogonal forward selection (OFS) and kernel PCA (KPCA). They were combined with SVR and teaching-learning-based optimization (TLBO). The study concluded that the model incorporating KPCA (KPCA-SVR-TLBO) outperformed and was more feasible than the model employing OFS (OFS-SVR-TLBO).

Hybrid DL approaches frequently combine DL techniques with traditional methods^71–75 or DL architectures with each other, such as CNN-LSTM, LSTM or BiLSTM with attention mechanisms (AMs), transformer models, and graph convolutional neural network (GraphCNN).^76–90 These hybrid DL models prove to be efficient in identifying complex patterns and relationships in data due to the high capacity and adaptability of DL architectures, especially in applications like SPF. Chandar⁷¹ proposed a new method for stock trading by combining technical indicators and CNNs, termed TI-CNN. The model uses ten technical indicators derived from historical stock data, converts them into an image using gramian angular field, and then inputs this into the CNN. Korade and Zuber⁷² explored the usage of CNN for SPF and aim at optimizing the CNN hyperparameters using different optimization techniques. The authors employ the firefly algorithm (FF), particle swarm optimization, and random search for optimizing the hyperparameters, comparing their performance based on different evaluation metrics applied to training and testing datasets.

The study by Lu et al.⁷⁶ proposed a method for forecasting stock prices utilizing a hybrid CNN-LSTM model. This model utilizes CNN for efficient feature extraction from historical data and LSTM to analyze relationships in time-series data, subsequently predicting stock prices. Wang et al.⁷⁸ aimed to predict the closing price of stocks using a composite model called CNN-BiSLSTM. Here, the BiSLSTM represents bidirectional special LSTM.

The integration of AMs with LSTMs in SPF models presents the possibility of improved prediction accuracy and reliability. An attention-LSTM model can analyze historical stock prices and potentially other relevant information to predict future stock prices. Lu et al.⁸¹ discussed a combined approach using CNN, BiLSTM, and attention mechanism for predicting stock prices. The results showed that the CNN-BiLSTM-AM method outperforms seven other methods in accuracy. The study referenced by Chen et al.⁸² introduced a novel model for predicting stock prices, utilizing a CNN, a BiLSTM, and an efficient channel attention (ECA) module.

Transformers were developed to reduce the limitations inherent to AMs and recurrent models like RNNs. Specifically, they address the challenges brought about by the inherent sequential processing of RNNs and the high computational demands of AMs, allowing for more efficient and scalable modeling of sequential data. When employed for SPF, transformer models are good at identifying complex patterns within time series data and understanding the long-term dependencies existing between various time steps. Wang⁸⁵ introduced a novel method named BiLSTM-MTRAN-TCN for predicting stock prices. This method used BiLSTM, an improved transformer model (MTRAN-TCN), and TCN (temporal convolutional network), aiming to explore the individual benefits of each model. Li and Qian⁸⁶ introduced a novel hybrid neural network—the FDG-transformer—specifically developed for predicting stock prices.

Recently, generative adversarial networks (GANs) and GraphCNN have been applied for SPF, often achieving high accuracy. GANs can be used to generate synthetic time-series data that mimics real stock price movements. This synthetic data can help augment the training data, allowing models to generalize better to unseen data and potentially leading to more accurate forecasts. The Wu et al.⁸⁷ introduced a novel framework that combines GAN with piecewise linear representation for predicting stock market trading actions such as buying, selling, and holding. Staffini⁸⁸ proposed a novel approach to predicting stock prices using a deep convolutional GAN (DCGAN). The generator model of the GAN learns to generate data like real stock prices. The discriminator model learns to distinguish between real and generated stock prices. The results show that the proposed DCGAN model outperformed standard, widely used tools for forecasting stock prices.

GraphCNNs extend convolutional operations from regular grids to irregular graphs. This advancement allows models to effectively capture the relational structures and dependencies between different entities. GraphCNNs are particularly applicable to SPF, where they can model the relationships between different stocks or between different features of a single stock. The work of Wang et al.⁸⁹ proposed a new model for stock price prediction, integrating a knowledge graph, GraphCNN, and community detection. This model aims to overcome the limitations of existing models, which often neglect deeper influencing factors and rely on small-scale stock datasets. Fuping⁹⁰ concentrated on predicting stock price movements using a novel method called the conceptual-temporal graph CNN (CT-GCNN) model. This model explores stock price movements in both time and concept dimensions, accounting for the linkage effect of price movement among stocks within the same conceptual segment.

Dataset and metric evaluation

Statistics of selected papers

This subsection investigates a statistical analysis of 73 papers, specifically selected for this review on SPF, utilizing analyze tools from the Scopus platform. Figure 3 illustrates the annual publication count, revealing a conspicuous upward trend in papers on SPF from 2014 to 2023. The number of papers increased from a single publication in 2014 to 18 in 2023.

Figure 3. — Number of published papers by year.

Figure 4 illustrates the distribution of published papers, categorized by source (compare the document counts for the top 10 sources). Research on SPF has been published in various journals and conferences. Particularly notable is the number of studies in the ‘Expert Systems with Applications’ journal. The variety of journals, spanning specialized and multidisciplinary fields such as computing, economics, and finance, underscores the interdisciplinary essence of the research.

Research on SPF is primarily distributed through articles (61), indicating a robust presence in academic journals, followed by conference papers (11), which suggest active discussions and explorations in conference settings, and is minimally represented in book chapters (1), as shown in Figure 5. The dominance of articles points towards depth and precision in the field, while conference papers highlight ongoing, dynamic discussions and potential collaborations among researchers.

Figure 5. — Number of published papers by type.

Figure 6 illustrates the number of published papers by authors from various countries or territories (compare the document counts for the top 15 countries or territories). This shows a geographical diversity in the research within this field. China, with 24 publications, and India, with 17, lead in stock market research, reflecting their strong economic interests and growth. Other countries also show global interest in SPF. This figure shows that this research topic has widespread relevance and interest across different economic contexts.

Figure 7 illustrates the number of published papers by authors, categorized by affiliation (compare the document counts for the top 15 affiliations), with leading contributions from Capital University of EcoNomics and Business and Hebei University of Science and Technology at three publications each. Several other universities from around the world have also made notable contributions. This figure implies a robust and diverse global effort in the study of SPF, reflecting both academic and industry interests in the area.

Dataset

Many studies have used stock market data for their analyses, as shown in Tables 1–4. This research examines a wide range of datasets sourced from various stock indices and companies. These datasets have been used in multiple studies that focus on SPF. They include noticeable indices such as the FTSE MIB Index and A-share market stock prices in China, as well as specific companies like Amazon, Apple, Google, Tesla, IBM, and Oracle. The data for these studies is often sourced from platforms like Yahoo Finance.

Earlier studies have utilized a diverse set of data types. These range from closing prices, opening prices, highs, lows, and trading volumes to other technical indicators. Some even incorporate historical and tick data. Data from major global stock exchanges, including the Shanghai Stock Exchange, Kuala Lumpur Stock Exchange, and BSE Limited, plays an important role in these studies. Some studies specifically aim to predict the direction or price of the next day's close. Table 5 presents some stock datasets and their source links.

Table 5.

Some stock datasets and their source links.

Dataset/Market capitalization	Source	Dataset/Market capitalization	Source
Stock exchange data	https://www.kaggle.com/datasets/mattiuzc/stock-exchange-data https://finance.yahoo.com/	3M Company	https://finance.yahoo.com/quote/MMM/
Exchange Traded Funds	https://www.sectorspdrs.com/	China Spacesat Co., Ltd (600118.SS)	https://finance.yahoo.com/quote/600118.SS/
Indonesia Stock Market (JCI), Composite Stock Price Index (CSPI)	https://tradingeconomics.com/indonesia/stock-market	BIST 100 (XU100.IS)	https://finance.yahoo.com/quote/XU100.IS/
SRI-KEHATI Index	https://www.bloomberg.com/quote/BNPSRIK:IJ	CSI 300 Index	https://www.bloomberg.com/quote/SHSZ300:IND
Bombay Stock Exchange (BSE Sensex)	https://www.bseindia.com/	NIFTY 50	https://g.co/finance/NIFTY_50:INDEXNSE
Thomson Reuter Eikon	https://eikon.refinitiv.com/	Tata Motors Limited	https://finance.yahoo.com/quote/TATAMOTORS.BO/
Norwegian Air Shuttle ASA (NAS)	https://finance.yahoo.com/quote/NAS.OL/	China Unicom Hong Kong Ltd	https://www.bloomberg.com/quote/762:HK
Dow Jones Industrial Average (DJIA)	https://finance.yahoo.com/quote/%5EDJI/	Shanghai SE A Share	https://www.investing.com/indices/shanghai-se-a-share
Standard & Poor's 500 Stock Index (S&P 500)	https://www.marketwatch.com/investing/index/spx	Growth Enterprise Market	http://www.aastocks.com/en/stocks/market/index/hk-index-con.aspx?index = GEM
Russell 2000 Index	https://www.cnbc.com/quotes/.RUT	Taiwan Stock Exchange Weighted Index	https://www.bloomberg.com/quote/TWSE:IND
Shanghai Stock Exchange (SSE) Composite Index	https://www.bloomberg.com/quote/SHCOMP:IND	FTSE MIB Index	https://finance.yahoo.com/quote/FTSEMIB.MI/
Kuala Lumpur Composite Index (KLCI)	https://www.bloomberg.com/quote/FBMKLCI:IND	Borsa Istanbul	https://borsaistanbul.com/en
LQ45 Index	https://finance.yahoo.com/quote/%5EJKLQ45/	Nikkei 225	https://finance.yahoo.com/quote/%5EN225/
National Stock Exchange of India Ltd (NSE)	https://www.nseindia.com/	DAX	https://g.co/finance/DAX:INDEXDB
National Association of Securities Dealers Automated Quotations System (NASDAQ)	https://www.nasdaq.com/	KOSPI 200 Index	https://finance.yahoo.com/quote/%5EKS200/
New York Stock Exchange (NYSE)	https://www.nyse.com/index	Mercedes-Benz Group AG	https://finance.yahoo.com/quote/MBG.DE/
Commonwealth Bank of Australia (CBA.AX)	https://finance.yahoo.com/quote/CBA.AX/	iBOVESPA	https://finance.yahoo.com/quote/%5EBVSP/

Open in a new tab

Metric evaluation

From Tables 1–4 in the “Related works” section, we realize that various metrics are utilized to evaluate the performance of different models in SPF. These include mean absolute percent error (MAPE), mean squared error (MSE), root mean square error (RMSE), mean absolute error (MAE), accuracy, R² (coefficient of determination), mean bias error (MBE), precision, recall, F1-score, and area under the receiver operating characteristic curve (ROC-AUC), among others.

Metrics for evaluating prediction accuracy: MAPE, MSE, RMSE, and MAE are widely employed across various models and approaches to assess the accuracy and performance of stock price predictions.
Metrics for classification models: Accuracy, precision, recall, F1-score, and ROC-AUC are typically used to evaluate models where classification, such as predicting upward or downward trends, is involved.
Understanding model predictions and biases: R² is utilized to comprehend how well the model's predictions align with actual outcomes, providing insights into the explanatory power of the model by understanding the proportion of variability in the dependent variable that is predictable from the independent variable(s).

Discussion and future directions