Skip to main content
PLOS One logoLink to PLOS One
. 2022 May 2;17(5):e0266600. doi: 10.1371/journal.pone.0266600

Study of Asian indexes by a newly derived dynamic model

Tsung-Jui Chiang-Lin 1, Yong-Shiuan Lee 2,*, Tzong-Hann Shieh 3, Chien-Chang Yen 4, Shang-Yueh Tsai 5
Editor: Junhuan Zhang6
PMCID: PMC9060367  PMID: 35499989

Abstract

We take the stock prices as a dynamic system and characterize its movements by a newly derived dynamic model, called the new Price Reversion Model (nPRM), for which the solution is derived and carefully analyzed under different circumstances. We also develop a procedure of applying the nPRM to real daily closing prices of a stock index. This proposed procedure brings a different perspective to the study of stock prices based on thermodynamics, and the time varying coefficients in the nPRM offer economic meanings of the stock movements. More specifically, the average of smoothed historical data A in the nPRM, analogous to the environment temperature in the Newton’s law of cooling, represent an implied equilibrium price. The heat transfer coefficient κ is adapted to be either negative or positive, which illustrates the speed of convergence or divergence of stock prices, respectively. The empirical study of ten Asian stock indexes shows that the nPRM accurately characterizes and forecasts the market values.

Introduction

The stock index, a key variable reflecting the economic status in an area, is a popular research topic in macroeconomics. The market prices of an individual stock reflect the company’s existing value or future profit growth. A stock index, measured by the weighted mean prices of selected stocks in a stock market, represents the stock market’s performance and potential competitive ability. Investors who have different motives for participating in the market concern about the changes of a stock or a stock index’s market prices of different scales, intraday high-frequency data, daily closing prices, weekly or monthly summary, and etc. Accurately forecasting the stock movements could help the investors with decisions about investment strategy.

Whether the stock prices can be forecasted has been repeatedly discussed in finance [1, 2]. The efficient market hypothesis [3] forms a foundation of the theory that under what circumstances the stock prices is predictable. The efficient market hypothesis states that three types of markets can be distinguished as the weak form, the semi-strong form, and the strong form depending on the level of market efficiency. The stock prices fully reflect all available market information in a strong efficient market, thus they are not predictable. The martingale depicts the stock movements in the strong efficient markets by the theory of random walks [4]. In contrast, many real markets are considered as the weak or the semi-strong efficient markets [2, 5]. Consequently, to forecast the prices in these markets is possible.

The analysis of the stock movements is based on either the stock prices or the stock returns. The choice between the stock prices and the returns mainly depends on the approach adopted. From the financial accounting perspective, fundamental analysis [6, 7] and technical analysis [8] are the two main intellectual traditions. The fundamental analysis evaluates the stocks by measuring their intrinsic values from overall economic and industrial conditions. The fundamental analysis assumes that the stock prices are linearly correlated to the predictors in the financial statement. As a result, the linear regression models are usually applied to forecast the stock prices. On the contrary, the technical analysis assumes that all known fundamentals are factored into prices. Hence, charting past fluctuations in the stock indexes allows the analyst to identify the patterns and trends. Technical analysis forecasts the market movements through popular techniques like chart analysis, cycle analysis and computerized technical trading systems [8].

Various statistical methods are prevalent in the history of financial academic researches. In econometric modeling, applying the autoregressive integrated moving average (ARIMA) models and vector autoregression (VAR) models is common. When using ARIMA and VAR models, the stationarity of the time series is required. Therefore, the stock returns instead of the stock prices are the primary research subject [911]. Financial ratios like the log dividend-price ratio [12, 13], the dividends [9], the price–earnings ratio [10, 13], the dividend yield [10, 14], the consumption-wealth ratio [15], the book-to-market ratio [10] serve as important predictors for asset returns in regression models, particularly VAR models. Although some good predictors in financial literature have restricted predictive ability for the equity premium due to model uncertainty and instability [11], combining individual forecasts produce out-of-sample gains [16]. Modeling stock returns reduces low-frequency information, the trend for example, from the perspective of signal processing. Therefore, forecasting the stock prices based on models constructed from detrended data may be less accurate in this sense.

The stock prices can be treated as signals. From the signal processing perspective, the forecasts can be constructed by decomposing the signals into features and putting them into artificial intelligence models. The artificial intelligence models applied to the stock prices in existing studies include support vector regression (SVR) [17], neural networks [1822], tree-based models [23], etc. To applying the artificial intelligence models, acquiring features is an important step. The empirical mode decomposition (EMD) [17, 21], the ensemble empirical mode decomposition (EEMD) [24], and the complementary ensemble empirical mode decomposition (CEEMD) [22] are widely employed methods to decompose the stock prices, which are regarded as signals. The empirical results of US and UK stock indexes show that the artificial intelligence models are better than the ARIMA and GARCH models in terms of forecasting directional symmetry [21]. Furthermore, the CEEMD-PCA-LSTM model [22], which integrate the CEEMD, principal component analysis (PCA), and long short-term memory (LSTM) networks, outperforms naive recurrent neural networks (RNN) or LSTM networks in terms of forecasting accuracy for six selected stock indexes. Still, in a review study of the returns from financial perspective [25], the empirical evidences show that no single model out of statistical, artificial intelligence, frequency domain, and hybrid models under investigation could be applied uniformly to all markets. On the other hand, the direction prediction problem is another task, which can be tackled by the tree-based models [23] to predict the stock price direction.

This paper considers a physical-mathematical method, the approach of the differential equations and the dynamic systems. From thermodynamics, we adopt the Newton’s law of cooling to describe the mean reverting of the stock index prices. In recent years, modeling the stock index from the physical perspective by treating it as a dynamic system is emerging [2629]. To the best of our knowledge, the first attempt to study the stock index, the Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX), as well as the options through dynamic models [26], which are derived based on parabola approximation [30]. Another study [24] also considers the stock markets as complex dynamical systems, but their interactions are analyzed by methods of signal processing, the EMD and the EEMD along with detrended cross-correlation analysis (DCCA). Existing studies recognize the nonlinear behaviors of the stock movements [3134]. However, the models fits [33, 34] are obtained via statistical method during a period of up trend or down trend.

Based on the related work [27], we consider to apply the Price reversion model (PRM), which has a form of the Newton’s law of cooling, to the stock markets. Yet, the empirical evidence in [27] shows that the PRM tends to underestimate the forecasts. Therefore, we identify the reasons of inaccurate forecasts by the PRM and propose to model the stock prices by a newly derived model, i.e. the new Price Reversion Model (nPRM). For the same dynamic model described using the differential equation in the PRM, we derive its solutions rigorously so that the solutions of the nPRM can better fit the realistic stock prices of the markets than the PRM. Two major contributions of the nPRM are as follows. Firstly, the nPRM seeks for the solutions that will not only achieve higher prediction accuracy but also provide useful economic meanings (refer to Figs 1 and 2 below). The coefficients in the nPRM, A and κ, represent the implied equilibrium price, and the convergence speed to the implied equilibrium price, respectively. Secondly, for many predictive models involved in the time series analysis, it is necessary to transform the stock prices to other economic variables, usually the returns or the price increments. The nPRM can make the predictions directly for the stock prices, which implies that the model considers all daily information in the analysis. Furthermore, the conduction of the proposed procedure is simple and convenient.

Fig 1. The phase plot of the nPRM for κ > 0.

Fig 1

Fig 2. The phase plot of the nPRM for κ < 0.

Fig 2

Methods

We treat the stock index, denoted as S(t), as a dynamic system, which is a differentiable function with respect to time t. We model the subject by the nPRM, represented by a differential equation as follows.

{dS(t)dt=f(S(t))=κ[S(t)A],d2S(t)dt2=κdS(t)dt,S(t0)=S0,S(t1)=S1, (1)

where S(t) is the value of the subject at time t, A and κ are known constants for t ∈ [t0, t1]. We further denote v(t)=dS(t)dt and a(t)=d2S(t)dt2, which represent the velocity and acceleration of the dynamic system. The system along with its velocity and acceleration at specified time points t0 and t1 are denoted as (S(t0), v(t0), a(t0)) = (S0, v0, a0) and (S(t1), v(t1), a(t1)) = (S1, v1, a1) respectively for simplicity.

The Eq (1) is adapted from the Newton’s law of cooling [35]. In the Newton’s law of cooling, κ is always negative so that the temperature of the subject converges to the environment/surrounding temperature. Additionally, when the difference between the subject temperature and the environment temperature is large, the absolute value of κ is large and the speed of convergence is high. When the subject temperature is closer to the environment temperature, the absolute value of κ is small, which results in a slower convergence.

Consider the differential Eq (1), κ and A are constant during the period t ∈ [t0, t1]. Generally they are time related in a dynamic system for a longer time period. Furthermore, the time-related coefficients are often very dynamic in real-world data. For applications, the coefficients are assumed to be constant in a relatively small interval with given values between time t0 and t1.

For the consistence of this paper, let us first review the PRM in the following subsection.

The Price Reversion Model (PRM)

In the existing study [27], the PRM assumes that the external force equals zero at A. The second derivative is ignored in the Eq (1). Depending on the values of S(t), S0, and A, four cases are considered and the solutions are listed as follows.

  • Case A.

    If S(t) > S0 and (S(t) − A) ⋅ (S0A) > 0,
    S(t)=A+(S0A)·eκ(tt0). (2)
  • Case B.

    If S(t) > S0 and (S(t) − A) ⋅ (S0A) < 0,
    S(t)=A(S0A)·eκ(tt0). (3)
  • Case C.

    If S(t) < S0 and (S(t) − A) ⋅ (S0A) > 0,
    S(t)=A+(S0A)·e|κ|(tt0). (4)
  • Case D.

    If S(t) < S0 and (S(t) − A) ⋅ (S0A) < 0,
    S(t)=A(S0A)·e|κ|(tt0). (5)

However, essentially the situations represented by the Eqs (3) and (5) does not exist since the external force equals zero at A. It motivates us to re-derive the nPRM in the next subsection.

Derivation of solutions to the new Price Reversion Model (nPRM)

In Eq (1), A is an equilibrium price of a stock index for a specific market, which is analogous to the environment temperature, and κ is a growth coefficient. The force on the stock price draws it near the equilibrium if κ < 0 or pushes it away from the equilibrium if κ > 0. The nPRM is a more generalized dynamic system than that described by the Newton’s law of cooling. Therefore, in the nPRM κ can be either positive or negative. When κ < 0, the nPRM is similar to the thermodynamic system; when κ > 0, the behavior of the dynamic system is a different story.

If κ > 0 in (1), A is a repeller. Fig 1 is the phase plot of nPRM when κ > 0. The first zone demonstrates the situation that the force pushes the price S(t) away from A if the initial price S0 is smaller than A. In the second zone, the force repels the price S(t) from the equilibrium price A if the initial price is larger than A. On the contrary, if κ < 0 in (1), A is an attractor. Fig 2 is the phase plot of nPRM when κ < 0. When the initial price is higher than A, the force draws the price to the equilibrium price A as the figure demonstrates in the third zone. In the fourth zone, if the initial price is lower than the equilibrium A, the price converges to A.

We solve the Eq (1) of the nPRM piecewisely as follows. We divide S(t) into intervals by the critical point, A, where dS(t)dt|S=A=0. As a result, S(t) is strictly monotonic in each of the intervals. From Eq (1), after the change of variables we have

S(t0)S(t)dr(rA)=t0tκdr.

Therefore,

S0S(t)dr(rA)=ln|rA||S0S(t)=ln[S(t)A]ln(S0A)=κ(tt0).

Then, we have

S(t)=A+(S0A)·eκ(tt0),t[t0,t1]. (6)
  • Case 1.

    Given S0 < A and dS(t)dt<0 for t ∈ [t0, t1] as in the first zone in Fig 1, we obtain κ > 0. Therefore, the theoretical value S(t) is strictly decreasing in t, and S(t) < A for t ∈ [t0, t1]. The solution blows up since
    limtS(t)=limt[A+(S0A)·eκ(tt0)]=.
  • Case 2.

    Given S0 > A and dS(t)dt>0 for t ∈ [t0, t1] as in the second zone in Fig 1, we obtain κ > 0. Therefore, the theoretical value S(t) is strictly increasing in t, and S(t) > A for t ∈ [t0, t1]. The solution blows up since
    limtS(t)=limt[A+(S0A)·eκ(tt0)]=.
  • Case 3.

    Given S0 < A and dS(t)dt>0 for t ∈ [t0, t1] as in the third zone in Fig 2, we obtain κ < 0. Therefore, the theoretical value S(t) is strictly increasing in t, and S(t) < A for t ∈ [t0, t1]. The solution converges to A since
    limtS(t)=limt[A+(S0A)·eκ(tt0)]=A.
  • Case 4.

    Given S0 > A and dS(t)dt<0 for t ∈ [t0, t1] as in the fourth zone in Fig 2, we obtain κ < 0. Therefore, the theoretical value S(t) is strictly decreasing in t, and S(t) > A for t ∈ [t0, t1]. The solution converges to A since
    limtS(t)=limt[A+(S0A)·eκ(tt0)]=A.

A comparison of the cases on different conditions between PRM and nPRM is given by Table 1. As mentioned before, cases B and D in the PRM do not exist due to zero external force at A. By examining the phase plots of Figs 1 and 2, the case A of the PRM can be further divided into cases 2 and 4 of the nPRM. The case C of the PRM can be further divided into cases 1 and 3 of the nPRM. We derive solutions for the cases 3 and 4 of the nPRM, which can not be obtained identically from the solutions of the PRM. As a result, the theoretical values obtained via the PRM are not so accurate as those from the nPRM.

Table 1. Comparisons between solutions of PRM and nPRM under different conditions.

Model Case S0A S(t) − A S(t) − S0 dS(t)dt κ Solution Phase plot
PRM A + + + + + Eq (2) second zone in Fig 1 or fourth zone in Fig 2
B - + + + + Eq (3) no existence
C - - - - + Eq (4) first zone in Fig 1 or third zone in Fig 2
D - + - - - Eq (5) no existence
nPRM 1 + - - - + Eq (6) first zone in Fig 1
2 + + + + + Eq (6) second zone in Fig 1
3 - - + + - Eq (6) third zone in Fig 2
4 + + - - - Eq (6) fourth zone in Fig 2

Implementation details of the nPRM

The differential Eq (1) characterizes a dynamic system, whose movement is continuous and at least second-order differentiable. However, the real data of daily stock prices consist of two parts, the trajectory of the dynamic system and the noise. The trajectory of the dynamic system is the functional part while the noise is random. Modeling the noisy data, the stock index for instance, by a differential equation is not appropriate. The reasonable approach is to model the functional movement that can be characterized by the differential equation after filtering out the random noise. We establish a procedure of 4 main steps for applying the nPRM to daily stock prices as follows.

Step 1—Smoothing

Moving average is a simple and commonly used filtering method, and hence we implement this method on stock data. We filter out the noise of the data, and define the smoothed stock prices as

S˜(t)=i=0M1S(ti)M. (7)

Afterwards, we apply model (1) to the smoothed data and obtain the theoretical values. In order to obtain accurate calculation of κ, the adequate smoothing period M has to be specified. For this purpose, we estimate the smoothing period M in the training set, which we will further explain later.

This smoothing step is not involved in the procedure of PRM [27]. Nevertheless, directly modeling the market prices of a stock index suffers from severe fluctuations in coefficient values resulted from random part of the price movements, the Brownian motion. Characterizing the smoothed stock prices by the differential Eq (1) is more reasonable so that this step is crucial in our proposed method nPRM.

Step 2—Calculating A

To solve (1) we have to know the fixed value of A for a given time period [t0, t1] in advance. However, in reality the equilibrium price A is unknown so that the estimation of A is required. A is calculated using smoothed historical data before the period [t0, t1]. More specifically, we calculate A as

A=i=0N1S˜(t0i)N, (8)

that is the average of stock prices during a period earlier than t0. The length of the time interval [t0N + 1, t0] should be carefully chosen. We also determine the time period N based on the training set.

Step 3—Calculating and controlling κ value

During the solving of model (1), the value of κ is first calculated from

dS˜(t)dt=κ[S˜0A], (9)

by substituting S˜(t) for S(t) in (1). The value of κ is constant during a short period [t0, t1]. However, it changes over time, and thus we further denote it as κt, t = t0, ⋯, tτ for different time intervals. When κt > 0 and has large value, the theoretical value at time t will explode. Nevertheless, this situation does not correspond with the changes of stock prices in the real market. To avoid generating unreasonable theoretical values, we restrict the values of κt lower than an upper bound.

A gradient control method [36] is proposed to control the values of the gradient in a reasonable range. The κt is a one-dimensional gradient in the nPRM, and we apply this method to κt and restrict its value as

κt*={κt,Q3,ifκt>κt,Q3,κt,otherwise, (10)

where κt,Q3 is the third quantile from previous values of κi, for i = tj, ⋯, t − 1 and j ≥ 100 in the series we obtained. The pool of κi’s comprises at least 100 values from historical data so that we can obtain stable values of κt*.

Step 4—Making τ-step forecasts

The theoretical values of the stock prices during time [t0, t1] are obtained by the solution (6), in which S0 are replaced by S˜0. Assuming the coefficients unchanged for a longer period [t0, tτ], we can obtain theoretical values before time tτ. For discrete time representation, we obtain theoretical daily closing prices at t = t0, t1, ⋯, tτ as the τ-step forecasts.

Model evaluation

The forecasting error is measured by three deviation-type measurements, the Mean Absolute Percentage Error (MAPE), the Mean Absolute Error (MAE), and the Root Mean Square Error (RMSE), along with three trend-type measurements, directional Symmetry (DS), correct up-trend (CP) and correct down-trend (CD). The definitions are as follows.

MAPE=1Ti=t1T|S^(i)S(i)S(i)|·100,
MAE=1Ti=t1T|S^(i)S(i)|,
RMSE=1Ti=t1T(S^(i)S(i))2,
DS=1Ti=t1Tai·100,ai={1,if(S^(i)S^(i1))(S(i)S(i1))0,0,otherwise,
CP=1T1i=t1T1bi·100,wherebi={1,if(S^(i)S^(i1))>0and(S^(i)S^(i1))(S(i)S(i1))0,0,otherwise,
CD=1T2i=t1T2ci·100,whereci={1,if(S^(i)S^(i1))<0and(S^(i)S^(i1))(S(i)S(i1))0,0,otherwise,

where S^(i) is the theoretical value of the model at time i, S(i) is the market value of the data at time i, T is the length of the forecasts, T1 is the number of data points belonging to up trend, and T2 is the number of data points belonging to down trend.

A smaller value of the MAPE indicates that the model better describes the movements of the stock prices. The MAE and the RMSE show similar tendency with the MAPE. DS measures the frequency of correct directional forecasts. CP and CD measure the frequency of correct directional forecasts when real market values rise and fall, respectively. The larger values of DS, CP, and CD suggest that the model obtains more correct directional forecasts in general, for uptrend, and for downtrend, respectively.

Results

For application to real data, we split the data into a training set and a testing set, consisting of the earlier 80% and the latest 20% of the data, respectively. We use the training set to specify adequate smoothing period and the time interval for estimation of the equilibrium, that is, the appropriate choices of M and N in (7) and (8), respectively.

Applied data

The empirical study was conducted based on information from real daily closing price data of ten stock indexes from 2009 to 2019. The indexes include Taiwan TAIEX, Japan Nikkei 225, Korea KOSPI, Hong Kong Hang Seng Index, the Philippines PSEi Index, Thailand SET Index, India S&P BSE SENSEX, Singapore Straits Times Index (STI), Indonesia JKSE, and Malaysia KLCI. The data are collected from Refinitiv Datastream. Each stock index is viewed as a dynamic system and characterized by the nPRM separately.

We implemented the nPRM with a few choices of M and N to obtain τ-step forecasts for the training dataset of each stock index. The values of M range from 2 to 10 while the choice of N includes 30, 60, 90, 120, and 180. We set the number of forecasting steps τ = 5. We assess all combinations of M and N by the criterion of the grand mean of MAPEs across 1-step to 5-step forecasts in the training set. Afterwards, we select the combination of the best performance as the coefficient setting for prediction in the testing set. Table 2 shows that the best combination of M and N with the smallest grand mean of MAPEs in the training set for all the ten indexes. In the following, the forecasts are conducted with M and N specified for each stock index in Table 2.

Table 2. The best combination of M and N in the training set.

M N M N
Hang Seng 4 180 SET 6 180
Nikkei 225 5 180 BSE SENSEX 5 180
KOSPI 4 180 STI 4 180
TAIEX 4 180 JKSE 8 180
PSEi 7 180 KLCI 4 180

We conduct all the analyses on a computer with an Intel(R) Core(TM) i7–8565U Processor at 1.80GHz with 24.0 GB RAM running Windows 10. All computations are performed using R version 4.0.3 [37] and the R package Metrics.

Empirical results

The main reason of underestimation of the PRM in the existing study [27] is the difficulty in determining the sign of the growth coefficient, κ, during the integration. In the proposed nPRM procedure, we derive the solution to the differential equation of the nPRM and propose adequate economic explanations of κ. Additionally, when κ > 0 we control its value by a gradient control method [36], and the nPRM is therefore more suited to real markets.

Fig 3 displays the 1-step forecasts of the nPRM for Hong Kong Hang Seng Index along with the market values. Fig 3 suggests that the nPRM possesses accurate forecasting ability. The patterns of the forecasts from 2-step to 5-step by the nPRM are all similar to Fig 3, and thus we omit the demonstration here. Figs 412 illustrate the 1-step forecasts of the nPRM for other Asian stock indexes individually, which also indicate accurate forecasts by the nPRM.

Fig 3. 1-step forecasts of the nPRM and market values for Hong Kong Hang Seng Index.

Fig 3

Fig 4. 1-step forecasts of the nPRM and market values for Japan Nikkei 225 Index.

Fig 4

Fig 12. 1-step forecasts of the nPRM and market values for Malaysia KLCI.

Fig 12

Fig 5. 1-step forecasts of the nPRM and market values for Korea KOSPI.

Fig 5

Fig 6. 1-step forecasts of the nPRM and market values for Taiwan TAIEX.

Fig 6

Fig 7. 1-step forecasts of the nPRM and market values for the Philippines PSEi Index.

Fig 7

Fig 8. 1-step forecasts of the nPRM and market values for Thailand SET Index.

Fig 8

Fig 9. 1-step forecasts of the nPRM and market values for India S&P BSE SENSEX.

Fig 9

Fig 10. 1-step forecasts of the nPRM and market values for Singapore Straits Times Index (STI).

Fig 10

Fig 11. 1-step forecasts of the nPRM and market values for Indonesia JKSE.

Fig 11

Table 3 displays the MAPEs of applying the investigated methods to all of the ten Asian Indexes from 1-step forecasts to 5-step forecasts. Since the performances of three deviation-type errors (MAPE, MAE, and RMSE) are similar, we simply report and discuss the results of the MAPEs. From Table 3, the martingale (see details in S1 Appendix) has the smallest errors followed by those from the nPRM, and the curve fitting (see details in S2 Appendix) has the worst performance.

Table 3. MAPE (%) for the testing set.

τ = 1 τ = 2 τ = 3 τ = 4 τ = 5
Nikkei 225 nPRM 1.04 1.27 1.53 1.78 2.05
Martingale 0.72 1.05 1.28 1.52 1.74
Curve fitting 3.05 3.32 3.60 3.87 4.15
Hang Seng nPRM 1.12 1.44 1.77 2.11 2.43
Martingale 0.81 1.21 1.53 1.79 2.00
Curve fitting 2.33 2.52 2.71 2.89 3.06
TAIEX nPRM 0.76 1.00 1.21 1.43 1.65
Martingale 0.57 0.85 1.03 1.22 1.38
Curve fitting 1.96 2.14 2.32 2.49 2.67
KOPSI nPRM 0.86 1.10 1.38 1.62 1.84
Martingale 0.61 0.92 1.17 1.39 1.56
Curve fitting 2.54 2.76 2.98 3.20 3.42
PSEi nPRM 1.46 1.71 1.93 2.16 2.35
Martingale 0.75 1.02 1.22 1.42 1.58
Curve fitting 3.09 3.36 3.65 3.93 4.22
SET nPRM 1.18 1.47 1.70 1.98 2.20
Martingale 0.46 0.69 0.86 1.01 1.13
Curve fitting 2.08 2.25 2.43 2.60 2.78
BSE SENSEX nPRM 1.31 1.62 1.88 2.20 2.42
Martingale 0.53 0.78 0.99 1.17 1.33
Curve fitting 2.29 2.51 2.72 2.94 3.17
STI nPRM 1.19 1.52 1.77 2.09 2.32
Martingale 0.54 0.76 0.91 1.06 1.19
Curve fitting 1.76 1.91 2.05 2.18 2.31
JKSE nPRM 1.27 1.42 1.57 1.73 1.84
Martingale 0.60 0.85 1.02 1.14 1.27
Curve fitting 2.39 2.62 2.85 3.08 3.31
KLCI nPRM 0.98 1.30 1.58 1.89 2.07
Martingale 0.49 0.72 0.90 1.05 1.18
Curve fitting 1.76 1.92 2.07 2.22 2.38

Table 4 displays the directional symmetry (SD) of the nPRM and the curve fitting from 1-step forecasts to 5-step forecasts for ten Asian stock indexes. The trend-type errors cannot be applied to the forecasts of the martingale since the martingale forecasts the price of the next day by the present price. Therefore, we only compare the nPRM and the curve fitting here. Table 4 shows that nPRM and the curve fitting have similar percentage of correct trend forecasts. The differences between up-trend forecasts and down-trend forecasts of the nPRM and the curve fitting are also not significant (see details in S1 and S2 Tables).

Table 4. Directional Symmetry (DS) (%) for the testing set.

τ = 1 τ = 2 τ = 3 τ = 4 τ = 5
Nikkei 225 nPRM 48.97 50.00 50.17 51.03 50.51
Curve fitting 47.11 46.60 45.92 46.77 47.45
Hang Seng nPRM 49.06 47.19 48.38 46.85 46.17
Curve fitting 48.81 48.47 48.81 49.66 51.69
TAIEX nPRM 49.23 47.69 50.77 51.62 52.48
Curve fitting 50.51 53.4 52.04 53.91 53.40
KOPSI nPRM 47.10 48.98 49.49 49.15 52.90
Curve fitting 49.92 48.05 49.92 47.71 46.35
PSEi nPRM 50.19 51.15 48.47 49.24 48.09
Curve fitting 47.04 47.58 45.16 45.43 45.97
SET nPRM 48.84 49.03 48.65 50.97 49.03
Curve fitting 49.57 47.84 49.86 50.72 51.01
BSE SENSEX nPRM 44.84 45.03 46.90 51.41 51.41
Curve fitting 48.28 48.01 49.07 51.19 54.11
STI nPRM 46.93 48.98 49.53 51.77 52.33
Curve fitting 50.92 51.18 49.34 51.44 52.76
JKSE nPRM 49.43 51.72 51.72 50.19 50.77
Curve fitting 47.03 46.76 44.86 44.86 42.97
KLCI nPRM 48.94 46.82 46.24 47.78 50.67
Curve fitting 49.72 47.84 44.84 46.53 43.71

Apart from the martingale, we can investigate the level of market efficiency by sample entropy [38, 39]. Sample entropy measures complexity. When we have a time series of length n as {S(1), S(2), ⋯, S(n)}, a template vector of length m is defined as Sm(i) = {S(i), S(i + 1), ⋯, S(i + m − 1)} and the distance function d[Sm(i), Sm(j)], ij is defined as the Chebyshev distance. Then, we define the sample entropy of the series, SampeEn, as

SampEn=logBm+1Bm, (11)

where Bm is number of template vector pairs having d[Sm(i), Sm(j)] ≤ r, and r is the tolerance. The value of sample entropy is nonnegative. The value 0 of sample entropy indicates that the series is perfectly regular without noise while a high value reflects randomness and unpredictability. If we examine the market efficiency from the perspective of sample entropy, a high value of sample entropy indicates a market with high level of efficiency. To show how large the possible value of sample entropy from a highly random series can be, we simulate a series of length 180 from a normal distribution with zero mean and unit variance for 1000 times. The average value of sample entropy is 2.21.

Table 5 shows the mean value of sample entropy calculated from all series used in estimation of A and κ to make forecasts dynamically in the testing set for ten indexes in our empirical study. The mean values range between 0.5952 (Korea KOSPI) and 0.7833 (the Philippines PSEi), which represent similar complexity among these markets. We suspect that the forecasting errors should be larger for markets with higher level of efficiency since it is harder to predict the series with random noise. By the order of the mean sample entropy values and the MAPEs for the ten stock indexes, the nPRM shows a tendency toward better forecasts for series with lower market efficiency. That is, the forecasting ability of the nPRM usually increases as the mean sample entropy decreases. However, there is an exception of Taiwan TAIEX. The mean sample entropy obtained from the TAIEX series is the second highest, so that we won’t expect a smaller prediction error among the ten indexes. However, the MAPEs of the nPRM for 1-step to 5-step forecasts are relatively low. The changes of TAIEX are similar to the movement which is characterized by the nPRM among the ten markets.

Table 5. Mean sample entropy for data series of length 180 used in calculation of A and κ.

Nikkei 225 Hang Seng TAIEX KOSPI PSEi SET BSE SENSEX STI JKSE KLCI
0.6880 0.6094 0.7052 0.5952 0.7833 0.6130 0.6258 0.6593 0.6901 0.6088

We summarize the performances of the methods as follows. Considering the deviation-type forecasting errors, the martingale seems to perform the best among the methods. According to the empirical evidence [2], the test based on the martingale can be used to examine the market efficiency of the stock market. Despite the absence of the test in this study, the low errors of the martingale for these ten Asian stock indexes imply that the markets tend to be efficient during the period of the data. The martingale suggests that no information we can obtain from the past to forecast the future stock prices. However, our proposed method nPRM still captures the trend of the stock movements quite well, and provides meaningful coefficients, A and κ, as well under the circumstances. If applying the nPRM to the markets that are less efficient, the model coefficients may acquire more information with economic meanings and obtain more accurate forecasts than the martingale.

The average computational times of all the investigated methods are shown in Table 6. The martingale uses the least computational time while the curve fitting spends the most. The nPRM is computationally efficient in view of the relationship between the forecasting accuracy and the computational time.

Table 6. Average computational time (unit: Second).

Nikkei 225 Hang Seng TAIEX KOSPI PSEi SET BSE SENSEX STI JKSE KLCI
nPRM 0.9715 0.8825 0.9584 0.9583 0.8191 0.8461 0.8695 1.0289 0.9276 0.9896
Martingale 0.0259 0.0230 0.0229 0.0269 0.0259 0.0229 0.0289 0.0239 0.0209 0.0189
Curve fitting 1.8553 1.7147 1.6480 1.7692 1.0994 1.0313 1.1555 1.0787 1.0677 1.5757

Discussion

In this study, we propose a novel procedure adapted from the thermodynamic system to model the stock prices by a newly derived model, the new Price Reversion Model (nPRM), which is a modification of PRM. From the viewpoint of signal processing, the proposed procedure of nPRM models the stock prices with all available information. Therefore, the coefficients in the nPRM preserve and reveal more information about the stock movements economically. They also may be treated as important features for forecasting by advanced models such as deep learning models.

We evaluate the forecasting accuracy of the nPRM along with the martingale, and the cubic curve fitting by ten Asian stock index examples. By investigating the daily closing prices, the nPRM accurately characterizes and forecasts the large-scale motions in these ten stock markets. Although the forecasting accuracy declines over time, which is the same phenomenon for all the methods, the deviation-type errors of the nPRM are the smallest except for the martingale. However, the martingale can not be employed to forecast the trend of the stock even though it offers an eligible prediction about the prices if the market is efficient.

The trend-type errors of the nPRM and the curve fitting are close. Furthermore, the trend-type errors of the two methods do not significantly deviate from 0.5 from 1-step to 5-step forecasts. This indicates that the nPRM and the curve fitting does not possess good trend forecasting ability. In order to improve the trend forecasting in the future, we will develop a more sophisticated method to calculate the κ values without the gradient control.

Supporting information

S1 Appendix. The martingale.

(PDF)

S2 Appendix. The curve fitting.

(PDF)

S1 Table. Correct up-trend (CP) (%) for the testing set.

(PDF)

S2 Table. Correct down-trend (CD) (%) for the testing set.

(PDF)

Acknowledgments

This research would not have been possible without the conceptual and theoretical development from Prof. Meng-Rong Li, the retired professor from Department of Mathematical Sciences, National Chengchi University. The authors are inspired by his works and have unreserved support from him.

Data Availability

The data underlying the results presented in the study are available from Refinitiv Datastream. The Refinitiv Datastream is purchased by National Chengchi University, and all faculty and students of NCCU have access to this database. Those who are interested in getting access to the data can purchase or request for the product from https://www.refinitiv.com/en or https://reurl.cc/xEg67E.

Funding Statement

The authors received no specific funding for this work.

References

  • 1. Cowles A 3rd. Can stock market forecasters forecast? Econometrica: Journal of the Econometric Society. 1933; p. 309–324. doi: 10.2307/1907042 [DOI] [Google Scholar]
  • 2. Lim KP, Luo W. The weak-form efficiency of Asian stock markets: New evidence from generalized spectral martingale test. Applied Economics Letters. 2012;19(10):905–908. doi: 10.1080/13504851.2011.607123 [DOI] [Google Scholar]
  • 3. Malkiel BG, Fama EF. Efficient capital markets: A review of theory and empirical work. The Journal of Finance. 1970;25(2):383–417. doi: 10.1111/j.1540-6261.1970.tb00518.x [DOI] [Google Scholar]
  • 4. Fama EF. The behavior of stock-market prices. The Journal of Business. 1965;38(1):34–105. doi: 10.1086/294743 [DOI] [Google Scholar]
  • 5. Cochrane JH. Presidential address: Discount rates. The Journal of Finance. 2011;66(4):1047–1108. doi: 10.1111/j.1540-6261.2011.01671.x [DOI] [Google Scholar]
  • 6. Abarbanell JS, Bushee BJ. Fundamental analysis, future earnings, and stock prices. Journal of accounting research. 1997;35(1):1–24. doi: 10.2307/2491464 [DOI] [Google Scholar]
  • 7. Abarbanell JS, Bushee BJ. Abnormal returns to a fundamental analysis strategy. Accounting Review. 1998; p. 19–45. [Google Scholar]
  • 8. Park CH, Irwin SH. What do we know about the profitability of technical analysis? Journal of Economic surveys. 2007;21(4):786–826. doi: 10.1111/j.1467-6419.2007.00519.x [DOI] [Google Scholar]
  • 9. Campbell JY, Shiller RJ. Stock prices, earnings, and expected dividends. the Journal of Finance. 1988;43(3):661–676. doi: 10.1111/j.1540-6261.1988.tb04598.x [DOI] [Google Scholar]
  • 10. Lewellen J. Predicting returns with financial ratios. Journal of Financial Economics. 2004;74(2):209–235. doi: 10.1016/j.jfineco.2002.11.002 [DOI] [Google Scholar]
  • 11. Welch I, Goyal A. A comprehensive look at the empirical performance of equity premium prediction. The Review of Financial Studies. 2008;21(4):1455–1508. doi: 10.1093/rfs/hhm014 [DOI] [Google Scholar]
  • 12. Campbell JY, Shiller RJ. The dividend-price ratio and expectations of future dividends and discount factors. The Review of Financial Studies. 1988;1(3):195–228. doi: 10.1093/rfs/1.3.195 [DOI] [Google Scholar]
  • 13. Campbell JY, Shiller RJ. Valuation ratios and the long-run stock market outlook: An update. Advances in Behavioral Finance. 2005;2:173–201. [Google Scholar]
  • 14. Cochrane JH. The dog that did not bark: A defense of return predictability. The Review of Financial Studies. 2008;21(4):1533–1575. doi: 10.1093/rfs/hhm046 [DOI] [Google Scholar]
  • 15. Lettau M, Ludvigson S. Consumption, aggregate wealth, and expected stock returns. the Journal of Finance. 2001;56(3):815–849. doi: 10.1111/0022-1082.00347 [DOI] [Google Scholar]
  • 16. Rapach DE, Strauss JK, Zhou G. Out-of-sample equity premium prediction: Combination forecasts and links to the real economy. The Review of Financial Studies. 2010;23(2):821–862. doi: 10.1093/rfs/hhp063 [DOI] [Google Scholar]
  • 17. Cheng CH, Wei LY. A novel time-series model based on empirical mode decomposition for forecasting TAIEX. Economic Modelling. 2014;36:136–141. doi: 10.1016/j.econmod.2013.09.033 [DOI] [Google Scholar]
  • 18. Atsalakis GS, Valavanis KM. Surveying stock market forecasting techniques—Part II: Soft computing methods. Expert Systems with Applications. 2009;36(3):5932–5941. doi: 10.1016/j.eswa.2008.07.006 [DOI] [Google Scholar]
  • 19. Guresen E, Kayakutlu G, Daim TU. Using artificial neural network models in stock market index prediction. Expert Systems with Applications. 2011;38(8):10389–10397. doi: 10.1016/j.eswa.2011.02.068 [DOI] [Google Scholar]
  • 20. Wang J, Wang J. Forecasting stock market indexes using principle component analysis and stochastic time effective neural networks. Neurocomputing. 2015;156:68–78. doi: 10.1016/j.neucom.2014.12.084 [DOI] [Google Scholar]
  • 21.Zhang C, Pan H. A novel hybrid model based on EMD-BPNN for forecasting US and UK stock indices. In: 2015 IEEE International Conference on Progress in Informatics and Computing (PIC). IEEE; 2015. p. 113–117.
  • 22. Yan B, Aasma M, Zhang Y. A novel deep learning framework: Prediction and analysis of financial time series using CEEMD and LSTM. Expert Systems with Applications. 2020;159:113609. doi: 10.1016/j.eswa.2020.113609 [DOI] [Google Scholar]
  • 23. Basak S, Kar S, Saha S, Khaidem L, Dey SR. Predicting the direction of stock market prices using tree-based classifiers. The North American Journal of Economics and Finance. 2019;47:552–567. doi: 10.1016/j.najef.2018.06.013 [DOI] [Google Scholar]
  • 24. Xu M, Shang P, Lin A. Cross-correlation analysis of stock markets using EMD and EEMD. Physica A: Statistical Mechanics and its Applications. 2016;442:82–90. doi: 10.1016/j.physa.2015.08.063 [DOI] [Google Scholar]
  • 25. Mallikarjuna M, Rao RP. Evaluation of forecasting methods from selected stock market returns. Financial Innovation. 2019;5(40):1–16. 10.1186/s40854-019-0157-x. [DOI] [Google Scholar]
  • 26. Chiang-Lin TJ, Li MR, Lee YS. TAIEX index option model by using nonlinear differential equation. Mathematical and Computational Applications. 2014;19(1):78–92. doi: 10.3390/mca19010078 [DOI] [Google Scholar]
  • 27. Chen NP, Li MR, Chiang-Lin TJ, Lee YS, Miao DWC. Applications of linear ordinary differential equations and dynamic system to economics—An example of Taiwan stock index TAIEX. International Journal of Dynamical Systems and Differential Equations. 2017;7(2):95–111. doi: 10.1504/IJDSDE.2017.10006751 [DOI] [Google Scholar]
  • 28. Li MR, Chiang-Lin TJ, Lee YS. Applications of parameterized nonlinear ordinary differential equations and dynamic systems—An example of the Taiwan stock index. International Journal of Differential Equations. 2018; p. Article ID 4762485. doi: 10.1155/2018/4762485 [DOI] [Google Scholar]
  • 29. Li MR, Miao DWC, Chiang-Lin TJ, Lee YS. Modelling DAX by applying parabola approximation method. International Journal of Computing Science and Mathematics. 2019;10(6):566–579. doi: 10.1504/IJCSM.2019.10025668 [DOI] [Google Scholar]
  • 30. Li MR, Shieh TH, Yue CJ, Lee P, Li YT. Parabola method in ordinary differential equation. Taiwanese Journal of Mathematics. 2011;15(4):1841–1857. doi: 10.11650/twjm/1500406383 [DOI] [Google Scholar]
  • 31. Abhyankar A, Copeland LS, Wong W. Nonlinear dynamics in real-time equity market indices: Evidence from the United Kingdom. The Economic Journal. 1995;105(431):864–880. doi: 10.2307/2235155 [DOI] [Google Scholar]
  • 32. Abhyankar A, Copeland LS, Wong W. Uncovering nonlinear structure in real-time stock-market indexes: The S&P 500, the DAX, the Nikkei 225, and the FTSE-100. Journal of Business and Economic Statistics. 1997;15(1):1–14. doi: 10.1080/07350015.1997.10524681 [DOI] [Google Scholar]
  • 33. Gkranas A, Rendoumis VL, Polatoglou HM. Athens and Lisbon stock markets—A thermodynamic approach. WSEAS Transactions on Business and Economics. 2004;1(1):95–100. [Google Scholar]
  • 34. Zarikas V, Christopoulos AG, Rendoumis VL. A thermodynamic description of the time evolution of a stock market index. European Journal of Economics, Finance and Administrative Sciences. 2009;16:73–83. [Google Scholar]
  • 35. Besson U. The history of the cooling law: When the search for simplicity can be an obstacle. Science & Education. 2012;21(8):1085–1110. doi: 10.1007/s11191-010-9324-1 [DOI] [Google Scholar]
  • 36. Shieh TH, Li MR, Li YT. Study of new accurate supplementary energy modeling with multi-components based on newly developed high resolution scheme. International Communications in Heat and Mass Transfer. 2019;107:121–140. doi: 10.1016/j.icheatmasstransfer.2019.04.003 [DOI] [Google Scholar]
  • 37. R Core Team. R: A Language and Environment for Statistical Computing; 2020. Available from: https://www.R-project.org/. [Google Scholar]
  • 38. Yentes JM, Hunt N, Schmid KK, Kaipust JP, McGrath D, Stergiou N. The appropriate use of approximate entropy and sample entropy with short data sets. Annals of biomedical engineering. 2013;41(2):349–365. doi: 10.1007/s10439-012-0668-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Richman JS, Moorman JR. Physiological time-series analysis using approximate entropy and sample entropy. American Journal of Physiology-Heart and Circulatory Physiology. 2000;278:H2039–H2049. doi: 10.1152/ajpheart.2000.278.6.H2039 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Junhuan Zhang

10 May 2021

PONE-D-21-10180

Study of Asian Indexes by a Newly Derived Dynamic Model

PLOS ONE

Dear Dr. Shieh,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jun 24 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Junhuan Zhang, PhD

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Methods section and Data availability statement, please clarify how the data used in this study were obtained, and how others can access these, such as by providing references, links or contact details to the original data source.

3. Thank you for stating the following financial disclosure:

"No"

At this time, please address the following queries:

  1. Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution.

  2. State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

  3. If any authors received a salary from any of your funders, please state which authors and which funders.

  4. If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Stockmarket index forecasting is a widely studied subject and will continue as long as the stock markets exist. This research has developed a new method combining different optimisation methods: Random Walk, a stochastic method with Dynamic programming to be applied on time series with uncertainties. It is an appreciated approach. But the article has to be improved by applying the following:

1. Format of the paper is weak, please correct.

2. Include the contribution of the paper in the abstract, Introduction and Conclusion

3. The is no background used at all. A method focused paper is not interesting for the investors. Please add a background on the stock market index forecasting and take them out of the Introduction

4. Please clearly indicate why you needed the PRN method instead of any other forecasting method like Gradient Boosting

5. Nunber of references have to be at least 30

Reviewer #2: The paper analyses the explanatory and predictive power of thermodynamic models in describing and forecasting the motion of broad-based stock market indexes. In particular, the authors suggest a new price reversion model (referred to as nPRM) aimed to overcome some limitations of classic price reversion models (PRM). In order to study the empirical performance of the nPRM, the authors use market data of four Asian stock indexes to calibrate the model and analyze forecasting errors as well as trend accuracy. The authors show that, while the PRM provides smaller errors for the 1-step forecast than the nPRM, the increase in the number of forecasting steps allows the nPRM to provide smaller errors than the PRM. On the other hand, in general, the PRM and nPRM have similar percentage of correct trend forecasts. Furthermore, the authors show that the nPRM exhibits a higher predictive power in forecasting the down-trends of market indexes.

In general, the paper is interesting and is well written. Moreover, the methodology is clearly described and the validation procedure is easily understandable. In any case, I believe that the authors should introduce some changes in the manuscript in order to allow the reader to better place their contributions in the current body of knowledge. Specifically:

1. In the recent decades, the literature on asset pricing has been prolific in providing us with new predictors that have increased substantially the predictive power of return forecasting models, especially in the case of stock market indexes. Thus, the classic view of financial market, where returns were almost unforecastable –consistent with the efficient-market hypothesis (Fama, 1970)–, has been clearly overcome by different models that exploit the strong predictive power of slow-moving predictors, such as the dividend yield, the price-to-earnings ratio, or the consumption-wealth ratio. Although this paper adopts a completely different perspective to analyze the stochastic behavior of stock indexes, I believe that the authors should better place their contribution in the current body of work, referring to important papers, such as Campbell and Shiller (1988ab, 2005), Lettau and Ludvigson (2001), Lewellen (2004) or Cochrane (2008). Cochrane (2011) provides a superb revision on the topic.

2. I believe that the main contribution of the paper and the usefulness of the proposed model should be better explained in the Introduction Section. Although the paper is clear and concise, in my opinion these aspects have not been sufficiently addressed.

3. In line with the previous comment, I believe that the explanations provided in the Results Section are excessively synthetical. The authors should make an effort to better describe the results and their implications, which is somewhat applicable to the Conclusions Section.

4. Although the manuscript is well-written, there are some typos that should be corrected (e.g., “the primary goals in this study is to identify” in line 36, etc.).

5. TAIEX is first mentioned in the paper in line 16, while the acronym is defined in 21. It is important that the authors correct these minor issues throughout the document.

REFERENCES:

Campbell, John Y., and Robert J. Shiller, 1988a, “The Dividend-Price Ratio and Expectations of Future Dividends and Discount Factors,” Review of Financial Studies 1, 195–228.

Campbell, John Y., and Robert J. Shiller, 1988b, “Stock Prices, Earnings, and Expected Dividends,” Journal of Finance 43, 661–676.

Cochrane, John H., 2008, “The Dog That Did Not Bark: A Defense of Return Predictability,” Review of Financial Studies 21, 1533–1575.

Cochrane, John H., 2011, “Presidential Address: Discount Rates,” Journal of Finance 66, 1047–1108.

Fama, Eugene F., 1970, “Efficient Capital Markets: A Review of Theory and EmpiricalWork,” Journal of Finance 25, 383–417.

Lettau, Martin, and Sydney C. Ludvigson, 2001a, “Consumption, Aggregate Wealth, and Expected Stock Returns,” Journal of Finance 56, 815–849.

Lewellen, Jonathan, 2004, “Predicting Returns with Financial Ratios,” Journal of Financial Economics 74, 209–235.

Reviewer #3: A new dynamic model for stock price index is proposed in this study and its validity is discussed in four Asian indexes. The manuscript is be accepted if it is revised well according to the comments.

1. Newly derived Price reversion model (nPRM) may be a revised version of original Price reversion model (PRM). As the background of this study, PRM should be introduced and nPRM is compared with PRM in order to reveal the their difference.

2. Martingale and Curve fitting are also described briefly because nPRM is compared with them.

3. Numerical examples reveal that PRM is better than nPRM at small value of ¥tau and that Martingale seems to be best among them. The validity of nPRM is not obvious. Authors should discuss these points.

4. Computational cost is also important point of view. The computational costs of four algorithms, nPRM, PRM, Martingale and Curve fitting, should be compared.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 May 2;17(5):e0266600. doi: 10.1371/journal.pone.0266600.r002

Author response to Decision Letter 0


15 Sep 2021

Submission No: PONE-D-21-10180

Dear Editor,

We appreciate the reviewers’ valuable comments and suggestions. Meanwhile, we have addressed those comments one by one in the following responses. In the following, the original comments from the reviewer are in black and our responses are in blue.

With Regards,

Yong Shiuan, Lee

=======================================================

Response to Comments from Reviewer #1

1. Format of the paper is weak, please correct.

We rearrange the outline of this article. We follow the submission guidelines of the “Manuscript Organization” of PLOS ONE. We present the sections/subsections as follows.

�Abstract

�Introduction

�Methods

� The Price Reversion Model (PRM)

� Derivation of solutions to the new Price Reversion Model (nPRM)

� Implementation details of the nPRM

� Model evaluation

�Results

� Simulation design

� Applied data

� Simulation results

�Discussion

�Acknowledgments

�References

�Supporting information captions

� S1 Appendix - The martingale approach

� S2 Appendix - The curve fitting approach

� S3 Fig.

� S4 Fig.

� S5 Fig.

� S6 Fig.

2. Include the contribution of the paper in the abstract, Introduction and Conclusion.

We rewrite the Abstract, the Introduction Section, and the Conclusion Section. The distinguishing features and the contribution of the proposed model are presented in the end of the Introduction Section. The contribution of our work is included in the Abstract, the Introduction Section, and the Conclusion Section.

In details:

Abstract (the last two sentences):

“This proposed procedure brought a different perspective to the study of stock prices. The time varying coefficients in the nPRM offer economic meanings of the stock movements.”

Introduction Section (line 90-97):

“The contributions of this paper are in the following.

The proposed procedure of nPRM directly analyzes raw data of stock prices, which retains all available information up to date to avoid reducing low-frequency information by taking returns or the logarithm of returns as well as making assumptions about the model. Moreover, the coefficients in the nPRM, namely the equilibrium price and the gradient, have their very own economic meanings. The gradient value reflects the convergence speed to the equilibrium price. We may further utilize the series of the gradient values as a feature for other models.”

Discussion Section (line 382-386):

“From the viewpoint of signal processing, the proposed procedure of nPRM models the stock prices with all available information. Therefore, the coefficients in the nPRM preserve and reveal more information about the stock movements economically. They also may be treated as important features for forecasting by advanced models such as deep learning models.”

3. The is no background used at all. A method focused paper is not interesting for the investors. Please add a background on the stock market index forecasting and take them out of the Introduction.

We add some background of stock market. We also rewrite the Introduction Section to review approaches of stock forecasting from different perspectives, such as financial accounting, econometrics, signal processing plus artificial intelligence models, differential equations and dynamic systems.

In details:

Introduction Section (line 3-68):

“The market prices of an individual stock reflect the company's existing value or future profit growth. A stock index, measured by the weighted mean prices of selected stocks in a stock market, represents the stock market's performance and potential competitive ability. Investors who have different motives for participating in the market concern about the changes of a stock or a stock index's market prices of different scales, intraday high-frequency data, daily closing prices, weekly or monthly summary, and etc. Accurately forecasting the stock movements could help the investors with decisions about investment strategy.

Whether the stock prices can be forecasted has been repeatedly discussed in finance. The efficient market hypothesis forms a foundation of the theory that under what circumstances the stock prices is predictable. The efficient market hypothesis states that three types of markets can be distinguished as the weak form, the semi-strong form, and the strong form depending on the level of market efficiency. The stock prices fully reflect all available market information in a strong efficient market, thus they are not predictable. The martingale depicts the stock movements in the strong efficient markets by the theory of random walks. In contrast, many real markets are considered as the weak or the semi-strong efficient markets. Consequently, to forecast the prices in these markets is possible.

The analysis of the stock movements is based on either the stock prices or the stock returns. The choice between the stock prices and the returns mainly depends on the approach adopted. …”

4. Please clearly indicate why you needed the PRM method instead of any other forecasting method like Gradient Boosting.

The gradient boosting is an ensemble method which combines weak prediction models (classification, regression, etc.) to form a predictive model. This is a well known and commonly used technique in machine learning. We are really grateful for this valuable comment although we did not involve this in the previous manuscript. On the contrary, our proposed model takes a different perspective by using the differential equations and the dynamic systems. We choose the martingale as a baseline/benchmark to compare with the nPRM if the market is efficient. Additionally, we choose the curve fitting to compare with the nPRM since in our proposed procedure the coefficients are determined piecewisely (constant in each window). The coefficients change with time (window by window). Therefore, the dynamics of the coefficients characterizes the stock movements. This procedure is similar to the approach of curve fitting, which solves the parameters by OLS piewisely. Those parameters also change in different intervals. Since the PRM and the nPRM are deterministic models, which solve the differential equations in an interval [t0, t1], it is hard for us to apply the gradient boosting during our proposed procedure or compare the nPRM with it. However, we think that the gradient boosting may be implemented to find the best values of the coefficients (κ and A) in the nPRM in the future. Comparing the results of the nPRM and a boosted predictive model such as combining SVRs (support vector regression) may also be the future work.

5. Number of references has to be at least 30.

We add some references as follows. Cowles (1933), Abarbanell and Bushee (1997, 1998), Park and Irwin (2007), Malkiel and Fama (1970), Campbell and Shiller (1988 [9]; 1988 [12]; 2005), Lettau and Ludvigson (2001), Lewellen (2004), Cochrane (2008; 2011), Welch and Goyal (2008), Rapach et al. (2010), Cheng and Wei (2014), Lim and Luo (2012), Wang and Wang (2015), Zhang and Pan (2015), Yan et al. (2020), Basak et al. (2019), Xu et al. (2016), Mallikarjuna and Rao (2019). We also brief important references in the Related Work Section.

In details:

The number of references is 38 now.

=======================================================

Response to Comments from Reviewer #2

1. In the recent decades, the literature on asset pricing has been prolific in providing us with new predictors that have increased substantially the predictive power of return forecasting models, especially in the case of stock market indexes. Thus, the classic view of financial market, where returns were almost unforecastable –consistent with the efficient-market hypothesis (Fama, 1970)–, has been clearly overcome by different models that exploit the strong predictive power of slow-moving predictors, such as the dividend yield, the price-to-earnings ratio, or the consumption-wealth ratio. Although this paper adopts a completely different perspective to analyze the stochastic behavior of stock indexes, I believe that the authors should better place their contribution in the current body of work, referring to important papers, such as Campbell and Shiller (1988ab, 2005), Lettau and Ludvigson (2001), Lewellen (2004) or Cochrane (2008). Cochrane (2011) provides a superb revision on the topic.

We add some references as follows. Cowles (1933), Abarbanell and Bushee (1997, 1998), Park and Irwin (2007), Malkiel and Fama (1970), Campbell and Shiller (1988 [9]; 1988 [12]; 2005), Lettau and Ludvigson (2001), Lewellen (2004), Cochrane (2008; 2011), Welch and Goyal (2008), Rapach et al. (2010), Cheng and Wei (2014), Lim and Luo (2012), Wang and Wang (2015), Zhang and Pan (2015), Yan et al. (2020), Basak et al. (2019), Xu et al. (2016), Mallikarjuna and Rao (2019). We also brief important references in the Introduction Section.

In details:

Introduction Section (line 11-80) (partly):

“Whether the stock prices can be forecasted has been repeatedly discussed in finance (Cowles1933, Lim2012). The efficient market hypothesis (Malkiel1970) forms a foundation of the theory that under what circumstances the stock prices is predictable.”,

“In contrast, many real markets are considered as the weak or the semi-strong efficient markets (Cochrane2011, Lim2012).”,

“The analysis of the stock movements is based on either the stock prices or the stock returns. The choice between the stock prices and the returns mainly depends on the approach adopted. From the financial accounting perspective, fundamental analysis (Abarbanell1997, Abarbanell1998) and technical analysis (Park2007) are the two main intellectual traditions.”,

“When using ARIMA and VAR models, the stationarity of the time series is required. Therefore, the stock returns instead of the stock prices are the primary research subject (Campbell1988b, Lewellen2004, Welch2008). Financial ratios like the log dividend-price ratio (Campbell1988[9], Campbell2005), the dividends (Campbell1988[12]), the price–earnings ratio (Campbell2005, Lewellen2004), the dividend yield (Cochrane2008, Lewellen2004), the consumption-wealth ratio (Lettau2001), the book-to-market ratio (Lewellen2004) serve as important predictors for asset returns in regression models, particularly VAR models. Although some good predictors in financial literature have restricted predictive ability for the equity premium due to model uncertainty and instability (Welch2008), combining individual forecasts produce out-of-sample gains (Rapach2010). Modeling stock returns reduces low-frequency information, the trend for example, from the perspective of signal processing. Therefore, forecasting the stock prices based on models constructed from detrended data may be less accurate in this sense.”,

“The artificial intelligence models applied to the stock prices in existing studies include support vector regression (SVR) (Cheng2014), neural networks (Atsalakis2009, Guresen2011, Wang2015, Zhang2015, Yan2020), tree-based models (Basak2019), etc. To applying the artificial intelligence models, acquiring features is an important step. The empirical mode decomposition (EMD) (Cheng2014, Zhang2015), the ensemble empirical mode decomposition (EEMD) (Xu2016), and the complementary ensemble empirical mode decomposition (CEEMD) (Yan2020) are widely employed methods to decompose the stock prices, which are regarded as signals. The empirical results of US and UK stock indexes show that the artificial intelligence models are better than the ARIMA and GARCH models in terms of forecasting directional symmetry (Zhang2015). Furthermore, the CEEMD-PCA-LSTM model (Yan2020), which integrate the CEEMD, principal component analysis (PCA), and long short-term memory (LSTM) networks, outperforms naive recurrent neural networks (RNN) or LSTM networks in terms of forecasting accuracy for six selected stock indexes. Still, in a review study of the returns from financial perspective (Mallikarjuna2019), the empirical evidences show that no single model out of statistical, artificial intelligence, frequency domain, and hybrid models under investigation could be applied uniformly to all markets. On the other hand, the direction prediction problem is another task, which can be tackled by the tree-based models (Basak2019) to predict the stock price direction.”,

“Another study (Xu2016) also considers the stock markets as complex dynamical systems, but their interactions are analyzed by methods of signal processing, the EMD and the EEMD along with detrended cross-correlation analysis (DCCA).”,

2. I believe that the main contribution of the paper and the usefulness of the proposed model should be better explained in the Introduction Section. Although the paper is clear and concise, in my opinion these aspects have not been sufficiently addressed.

We rewrite the Introduction Section. The distinguishing features and the contribution of the proposed model are presented in the end of the Introduction Section.

In details:

Introduction Section (line 90-97):

“The contributions of this paper are in the following.

The proposed procedure of nPRM directly analyzes raw data of stock prices, which retains all available information up to date to avoid reducing low-frequency information by taking returns or the logarithm of returns as well as making assumptions about the model. Moreover, the coefficients in the nPRM, namely the equilibrium price and the gradient, have their very own economic meanings. The gradient value reflects the convergence speed to the equilibrium price. We may further utilize the series of the gradient values as a feature for other models.”

3. In line with the previous comment, I believe that the explanations provided in the Results Section are excessively synthetical. The authors should make an effort to better describe the results and their implications, which is somewhat applicable to the Conclusions Section.

We amend the Results Section with some more discussions. We also rewrite the Discussion Section (Conclusion Section in previous version of manuscript) to highlight the contribution of this article and point out possible future work.

In details:

Result Section (line 308-372) (important part):

“From the deviation-type errors, the martingale (see details in S1 Appendix) has the smallest errors followed by those from the nPRM, the PRM and the curve fitting (see details in S2 Appendix) in the ascending order. Comparing the nPRM with the PRM, although the PRM obtains smaller 1-step forecast errors, the PRM obtains larger errors than the nPRM with the increase of the number of forecasting steps. From the overall performance, the nPRM is more accurate than the PRM. The PRM seems to perform better at the 1-step forecast since the miscalculations for cases 3 and 4 of the nPRM in Table 1 do not lead to large errors. However, with the increases of the number of forecasting steps, those miscalculations of theoretical values accumulate errors.”

“The trend-type errors cannot be applied to the forecasts of the martingale since the martingale forecasts the price of the next day by the present price. Therefore, we compare the trend-type errors between the other three methods. The PRM, nPRM, and the curve fitting have similar percentage of correct trend forecasts. Although the differences between the three methods are not significant, the PRM produces slightly higher trend accuracy among these three methods.”

“We summarize the performances of the methods as follows. Considering the deviation-type forecasting errors, the martingale seems to perform the best among the methods. According to the empirical evidence (Lim2012), the test based on the martingale can be used to examine the market efficiency of the stock market. Despite the absence of the test in this study, the low errors of the martingale for the four Asian stock indexes imply that the markets tend to be efficient during the period of the data. The martingale suggests that no information we can obtain from the past to forecast the future stock prices. Under the circumstances, our proposed method nPRM captures the trend of the stock movements quite well, and provides meaningful coefficients, A and κ, as well. If applying the nPRM to the markets that are less efficient, the model coefficients may acquire more information with economic meanings and obtain more accurate forecasts than the martingale.

The PRM has some flaw in the integration process and does not require smoothing for the stock prices. These disadvantages cause errors in forecasts. The errors are small when the time interval of forecast is short, but they grow with the increase of the time length. The newly proposed method nPRM improves these weaknesses. Therefore, the forecasting errors are notably smaller than the PRM for multiple step forecasts.

By the deviation-type measurements, the PRM or nPRM outperforms the curve fitting while the martingale cannot deliver a forecast of trend.

In spite of small differences, the PRM has the highest forecasting accuracy in all of the four Asian markets, and produces better directional forecasts for the up trend with higher CP values when the nPRM has higher CD values. In brief, the trend forecasting ability of the three methods are close.”

Discussion Section (line 387-399):

“We evaluate the forecasting accuracy of the nPRM along with the PRM, the martingale, and the cubic curve fitting by four Asian stock index examples. By investigating the daily closing prices, the nPRM accurately characterizes and forecasts the large-scale motions in these four stock markets. Although the forecasting accuracy declines over time, which is the same phenomenon for all the methods, the deviation-type errors of the nPRM are the smallest except for the martingale. However, the martingale can not be employed to forecast the trend of the stock even though it offers an eligible prediction about the prices if the market is efficient.

The trend-type errors of the nPRM, the PRM, and the curve fitting are close. This indicates that the trend forecasting ability of the nPRM does not outperform the other methods as its overall forecasting ability. In order to improve the trend forecasting in the future, we will develop a more sophisticated method to calculate the κ values without the gradient control.”

4. Although the manuscript is well-written, there are some typos that should be corrected (e.g., “the primary goals in this study is to identify” in line 36, etc.).

We corrected the typo in line 86 (line 36 in the previous version).

�We changed the “is” to “are”.

All the expressions, “stock movement”, are changed as “stock movements” for consistency throughout the paper (three times in the Abstract and in line 9, 17, 21, 31, 80, 205, 248, 358, 385).

All the expressions, “stock price”, are changed as “stock prices” for consistency throughout the paper.

We corrected the typos for the solutions of the PRM in line 126-133.

�Cases A, B, C, D and equations (2), (3), (4), (5).

We corrected the typos for the solutions of the nPRM.

� S0 < A (line 162), S0 > A (line 166), S0 < A (line 170), increasing (line 171), S(t) < A (line 171), S0 > A (line 174), decreasing (line 175), S(t) > A (line 176).

5. TAIEX is first mentioned in the paper in line 16, while the acronym is defined in 21. It is important that the authors correct these minor issues throughout the document.

The acronym is defined when first mentioning TAIEX in line 74 (in the revised version).

=======================================================

Response to Comments from Reviewer #3

1. Newly derived Price reversion model (nPRM) may be a revised version of original Price reversion model (PRM). As the background of this study, PRM should be introduced and nPRM is compared with PRM in order to reveal their difference.

We add a subsection to describe the original Price Reversion Model (PRM) in the Methods Section. We also add Table 1 to compare the solutions of the PRM with the solutions of the nPRM.

In details:

Methods Section – The Price Reversion Model (PRM) subsection (line 122-136)

Methods Section - Derivation of solutions to the new Price Reversion Model (nPRM) (line 177-184)

“A comparison of the cases on different conditions between PRM and nPRM is given by Table 1. As mentioned before, cases B and D in the PRM do not exist due to zero external force at A. By examining the phase plots of Figure 1 and Figure 2, the case A of the PRM can be further divided into cases 2 and 4 of the nPRM. The case C of the PRM can be further divided into cases 1 and 3 of the nPRM. We derive solutions for the cases 3 and 4 of the nPRM, which can not be obtained identically from the solutions of the PRM. As a result, the theoretical values obtained via the PRM are not so accurate as those from the nPRM.”

2. Martingale and Curve fitting are also described briefly because nPRM is compared with them.

We add Appendix S1 and S2 to describe the Martingale and the Curve fitting.

In details:

We mention S1 Appendix and S2 Appendix in line 316 and 317.

3. Numerical examples reveal that PRM is better than nPRM at small value of tau and that Martingale seems to be best among them. The validity of nPRM is not obvious. Authors should discuss these points.

We compare the solutions of the PRM with the solutions of the nPRM by Table 1. We also add some paragraphs of discussion in the last part of the Results Section.

In details:

Derivation of solutions to the new Price Reversion Model (nPRM) Section (line 177-184):

“A comparison of the cases on different conditions between PRM and nPRM is given by Table 1. As mentioned before, cases B and D in the PRM do not exist due to zero external force at $A$. By examining the phase plots of Figure 1 and Figure 2, the case A of the PRM can be further divided into cases 2 and 4 of the nPRM. The case C of the PRM can be further divided into cases 1 and 3 of the nPRM. We derive solutions for the cases 3 and 4 of the nPRM, which can not be obtained identically from the solutions of the PRM. As a result, the theoretical values obtained via the PRM are not so accurate as those from the nPRM.”

Results Section (line 317-324):

“Comparing the nPRM with the PRM, although the PRM obtains smaller 1-step forecast errors, the PRM obtains larger errors than the nPRM with the increase of the number of forecasting steps. From the overall performance, the nPRM is more accurate than the PRM. The PRM seems to perform better at the 1-step forecast since the miscalculations for cases 3 and 4 of the nPRM in Table 1 do not lead to large errors. However, with the increases of the number of forecasting steps, those miscalculations of theoretical values accumulate errors.”

4. Computational cost is also important point of view. The computational costs of four algorithms, nPRM, PRM, Martingale and Curve fitting, should be compared.

We add Table 7 to present average computational time of the four methods in the empirical study. We discuss the comparison in the last paragraph in the Results Section.

In details:

Results Section (line 373-378):

“The average computational times of all the four methods are investigated through 100 simulations. The average CPU times are shown in Table 7. The martingale uses the least computational time while the curve fitting spends the most. The nPRM and the PRM take nearly the same time, which is approximately half of the time used by the curve fitting. Even so, the nPRM is computationally efficient in view of the relationship between the forecasting accuracy and the computational time.”

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Junhuan Zhang

9 Nov 2021

PONE-D-21-10180R1Study of Asian indexes by a newly derived dynamic modelPLOS ONE

Dear Dr. Lee,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 24 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Junhuan Zhang, PhD

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #3: All comments have been addressed

Reviewer #4: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #3: Partly

Reviewer #4: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: I Don't Know

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #3: The manuscript is revised according to the reviewer's comment.

However, the section titles should be numbered adequately for readers' convenience.

Reviewer #4: The authors propose a new Price Reversion Model (nPRM), which is adapted from the Newton’s law of cooling. The solution of the model is derived and is discussed under different circumstances. Then nPRM is applied to forecast market values of four Asian stock indexes. The results do not seem so convincing. The work is meaningful to the research of stock market, but there are some modifications.

Major:

(1) The abstract is too cumbersome and a bit long. Especially the first four sentences are somewhat redundant.

(2) In the introduction, the authors introduced a variety of methods for predicting stock prices, and commented that there is no single method that can be used in all markets. I think this evaluation is not objective. No method is universally applicable to all markets, including the method proposed in this article, which has poor predictive effect in efficient markets. Therefore, what are the differences between the nPRM and the existing methods, or what are the advantages of the nPRM in predicting prices? Please elaborate on the importance of proposing this method.

(3) The authors put forward two primary goals for PRM in the introduction. However, the contribution of PRM to price prediction does not seem to be expressed. Does PRM have a large impact on the academic field or is it worthy of being an improvement target? The second goal does not seem to have been achieved.

(4) The first contribution cannot be considered a contribution. If you don't care about the prediction accuracy, all models can input raw data. But whether this is right is worth discussing.

(5) In the model part, what is the economic implications or intuition of nPRM? Why is the change in price assumed to be such a differentiable equation?

(6) The authors explain that nPRM is better than PRM because the former has two more cases than the latter. How do the authors identity it? Why is it not because smooth historical data is used when calculating A? Why is it not caused by controlling K?

(7) In nPRM, A and K are the most critical parameters, and their estimation method will directly determine the predicted performance. Authors should focus on how to estimate A and K. How to determine the optimal A and K is the key. From the prediction results, it is unreasonable to assume that the index obeys geometric Brownian motion.

(8) In the results discussion, authors say that if applying the nPRM to the markets that are less efficient, the model coefficients may acquire more information with economic meanings and obtain more accurate forecasts than the martingale. What is the reason for this explanation? What is the definition of market efficiency? How is the author identified? If nPRM performs better in an inefficient market, why don't the authors consider using other indexes? If nPRM can perform better in other samples, I will think that this method is meaningful and contributing.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: No

Reviewer #4: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 May 2;17(5):e0266600. doi: 10.1371/journal.pone.0266600.r004

Author response to Decision Letter 1


20 Dec 2021

Submission No: PONE-D-21-10180

Dear Editor,

We appreciate the reviewers’ valuable comments and suggestions. Meanwhile, we have addressed those comments one by one in the following responses. In the following, the original comments from the reviewer are in black and our responses are in blue.

With Regards,

Yong Shiuan, Lee

===========================================================

Response to Comments from Reviewer #3

1. The manuscript is revised according to the reviewer's comment.

However, the section titles should be numbered adequately for readers' convenience.

We agree with this comment that numbered sections/subsections are more readable. As a matter of fact, we follow the submission guidelines of the “Manuscript Organization” of PLOS ONE, and organize the sections by using the downloaded sample file. The format of the template for LaTeX submission does not number the sections and subsections. Therefore, we cannot present the manuscript with numbered section titles.

We present the revised manuscript as follows.

�Abstract

�Introduction

�Methods

� The Price Reversion Model (PRM)

� Derivation of solutions to the new Price Reversion Model (nPRM)

� Implementation details of the nPRM

� Model evaluation

�Results

� Applied data

� Empirical results

�Discussion

�Acknowledgments

�References

�Supporting information captions

� S1 Appendix - The martingale approach

� S2 Appendix - The curve fitting approach

� S3 Table Correct up-trend (CP) (%) for the testing set.

� S4 Table Correct down-trend (CD) (%) for the testing set.

===========================================================

Response to Comments from Reviewer #4

1. The abstract is too cumbersome and a bit long. Especially the first four sentences are somewhat redundant.

We exclude the first four sentences in the abstract. We also amend and rearrange the sentences in the abstract since we add more empirical examples.

In details:

Abstract (from the 5th line in the abstract):

“…This proposed procedure brought a different perspective to the study of stock prices based on thermodynamics, and the time varying coefficients in the nPRM offer economic meanings of the stock movements. More specifically, the average of smoothed historical data A in the nPRM, analogous to the environment temperature in the Newton's law of cooling, represent an implied equilibrium price. The heat transfer coefficient k is adapted to be either negative or positive, which illustrates the speed of convergence or divergence of stock prices, respectively. The empirical study of ten Asian stock indexes shows that the nPRM accurately characterizes and forecasts the market values.”

2. In the introduction, the authors introduced a variety of methods for predicting stock prices, and commented that there is no single method that can be used in all markets. I think this evaluation is not objective. No method is universally applicable to all markets, including the method proposed in this article, which has poor predictive effect in efficient markets. Therefore, what are the differences between the nPRM and the existing methods, or what are the advantages of the nPRM in predicting prices? Please elaborate on the importance of proposing this method.

In the previous edition of our manuscript (line 63-66), we summarize research results from reference [25] and state that “Still, in a review study of the returns from financial perspective [25], the empirical evidences show that no single model out of statistical, artificial intelligence, frequency domain, and hybrid models under investigation could be applied uniformly to all markets.” We would like to point out that disadvantage of modeling returns instead of stock prices is a loss of low-frequency information by the signal processing perspective. That’s why we choose to model daily closing prices in this work.

The proposed method nPRM produces predictions with relatively low Mean Absolute Percentage Errors (MAPEs) in our empirical study compared to other methods regardless of the market efficiency. More importantly, the difference in MAPE between nPRM and the martingale is little.

Indeed, “all models are wrong, but some are useful”, said by George Box. This quotation refers to not only statistical models but also other scientific models. However, we are dedicated to construct models that 1. have logical representation and economically reasonable explanations, and 2. produce predictions as accurately as possible.

The main contributions of this proposed method include describing the stock price movement accurately and provide economically meaningful coefficients at the mean time.

3. The authors put forward two primary goals for PRM in the introduction. However, the contribution of PRM to price prediction does not seem to be expressed. Does PRM have a large impact on the academic field or is it worthy of being an improvement target? The second goal does not seem to have been achieved.

The PRM is developed upon the Newton’s law of cooling, which is a well known thermodynamic model and introduced into finance in some researches, for example,

“Gkranas, A., Rendoumis, V.L. and Polatoglou, H.M. (2004). Athens and Lisbon stock markets – a thermodynamic approach, WSEAS Transactions on Business and Economics, Vol. 1, No. 1, pp.95–100”,

“Zarikas, V., Christopoulos, A.G. and Rendoumis, V.L. (2009). A thermodynamic description of the time evolution of a stock market index, European Journal of Economics, Finance and Administrative Sciences, Vol. 16, pp.73–83”,

“Todorović, J. Đ., Tomić, Z., Denić, N., Petković, D., Kojić, N., Petrović, J., & Petković, B. (2018). Applicability of Newton’s law of cooling in monetary economics. Physica A - Statistical Mechanics and its Applications, 494, 209-217”,

and our reference [27] among others.

Although PRM might not have brought great impact to the academic field of finance, the concept of introducing the thermodynamics into stock index dynamic modeling is new and draws some attention. This approach models the stock prices from a different perspective without loss of information due to transformation or model assumptions.

We slightly amend the introduction, especially the paragraph detailing the primary goals and contributions of this work (line 85-90).

In details:

Abstract (line 5-12):

“This proposed procedure brought a different perspective to the study of stock prices based on thermodynamics, and the time varying coefficients in the nPRM offer economic meanings of the stock movements. More specifically, the average of smoothed historical data A in the nPRM, analogous to the environment temperature in the Newton's law of cooling, represent an implied equilibrium price. The heat transfer coefficient k is adapted to be either negative or positive, which illustrates the speed of convergence or divergence of stock prices, respectively. The empirical study of ten examples shows that the nPRM accurately characterizes and forecasts the market values for Asian stock indexes.”

Introduction (line 85-94):

“Therefore, we identify the reasons of inaccurate forecasts by the PRM and propose to model the stock prices by a newly derived model, i.e. the new Price Reversion Model (nPRM). The contributions of this paper are in the following. The proposed procedure of nPRM directly analyzes raw data of stock prices, which retains all available information up to date to avoid reducing low-frequency information by taking returns or the logarithm of returns as well as making assumptions about the model. Moreover, the coefficients in the nPRM, namely the equilibrium price and the gradient, have their very own economic meanings. The gradient value reflects the convergence speed to the equilibrium price. We may further utilize the series of the gradient values as a feature for other models.”

4. The first contribution cannot be considered a contribution. If you don't care about the prediction accuracy, all models can input raw data. But whether this is right is worth discussing.

The first contribution is “directly analyzes raw data of stock prices, which retains all available information up to date to avoid reducing low-frequency information …” (line 87-90). Since a great part of recent researches of stock in academic field of finance is based on stock returns, we try to describe the stock price movement directly without loss of information from the signal processing perspective. We try to construct a reasonable model, which characterizes our subject properly and creates precise predictions.

We do care about the prediction accuracy, which is mainly measured by the Mean

Absolute Percentage Error (MAPE) in this article. Empirical study shows that the prediction accuracy of nPRM is really close to the martingale.

In the previous edition of our manuscript, the measurement trend accuracy (Acc) is used to determine whether the direction of the prediction is the same as real market value. This measurement along with CD and CP are useful in practice if we focus on the direction which stock prices go. For a martingale, either direction happens with a 50/50 chance. We replace Acc by another terminology, Directional Symmetry (DS), in our revised manuscript to avoid misunderstanding. Larger values of DS, CD, and CP represent more price predictions going in the same direction with real market.

In details:

Methods-Model evaluation (line 236-237, 244-249):

“directional Symmetry (DS)”,

“A smaller value of the MAPE indicates that the model better describes the movements of the stock prices. The MAE and the RMSE show similar tendency with the MAPE. DS measures the frequency of correct directional forecasts. CP and CD measure the frequency of correct directional forecasts when real market values rise and fall, respectively. The larger values of DS, CP, and CD suggest that the model obtains more correct directional forecasts in general, for uptrend, and for downtrend, respectively.”

5. In the model part, what is the economic implications or intuition of nPRM? Why is the change in price assumed to be such a differentiable equation?

The economic implication of nPRM is briefly described in the Section Derivation of solutions to the new Price Reversion Model (nPRM) (line 136-150). We refer A to the market equilibrium price. We refer k to the speed of convergence or divergence of stock prices, which is determined by the force on stock.

We assume that the dynamics of the price movement is differentiable (at least twice differentiable) to time t. However, in reality we observe the price at discrete timepoints. In this study, the observations are made once a day (daily closing prices). Since we cannot observe the price at an infinity of possible argument time t, we need smoothing technique to obtain differentiable curves that capture important trend and features of our data (step 1 in the implementation of nPRM). This smoothing step is neglected in PRM so that this modification in nPRM makes it more theoretically reasonable.

6. The authors explain that nPRM is better than PRM because the former has two more cases than the latter. How do the authors identity it? Why is it not because smooth historical data is used when calculating A? Why is it not caused by controlling K?

The solutions of nPRM are theoretically reasonable since they depict the behaviors of the dynamic system on 4 conditions separately. The old PRM solutions are not so clear, moreover, two cases (B and D) do not exist. The nPRM is better than PRM not only because it describes all the cases more clearly but also because the whole procedure applying nPRM is more sophisticated. For example, we add a smoothing step to eliminate the discrepancy between differentiable assumption and discrete observations. Also, the calculation of k is more polished than the original PRM.

We identify these solutions from the phase plane plots in Figures 1 and 2, which is commonly used when describing the behaviors of differential equations. These solutions are obtained theoretically by solving equation (1). During the solving process, the A and k are constant in the short time period t in [t0, t1]. Consequently, calculating A and controlling k are not supposed to affect the solutions in each period. However, when we dynamically model the stock prices with changing time periods, we need to consider the changes of A and k.

We further reply to the next comment (comment 7) for more details in estimating A and k.

7. In nPRM, A and K are the most critical parameters, and their estimation method will directly determine the predicted performance. Authors should focus on how to estimate A and K. How to determine the optimal A and K is the key. From the prediction results, it is unreasonable to assume that the index obeys geometric Brownian motion.

We think the assumption that stock index follows the geometric Brownian motion is reasonable. This is why that in the previous edition of manuscript we simulate sample paths of a stock index prices from the geometric Brownian in our Simulation Design Section (line 260-262 in the previous edition).

We re-assess the estimations of the coefficients, M (smoothing period) and A (equilibrium price), in this revision. In this revision, we determine the coefficients in the training dataset of 80% historical daily closing prices. As a result, we revise the Result Section and delete the Simulation design subsection.

In details:

Results (line 250-254):

“For application to real data, we split the data into a training set and a testing set, consisting of the earlier 80% and the latest 20% of the data, respectively. We use the training set to specify adequate smoothing period and the time interval for estimation of the equilibrium A, that is, the appropriate choices of M and N in (7) and (8), respectively.”

Applied data (line 263-272):

“We implemented the nPRM with a few choices of M and N to obtain τ-step forecasts for the training dataset of each stock index. The values of M range from 2 to 10 while the choice of N includes 30, 60, 90, 120, and 180. We set the number of forecasting steps τ = 5. We assess all combinations of M and N by the criterion of the grand mean of MAPEs across 1-step to 5-step forecasts in the training set. Afterwards, we select the combination of the best performance as the coefficient setting for prediction in the testing set. Table 2 shows that the best combination of M and N with the smallest grand mean of MAPEs in the training set for all the ten indexes. In the following, the forecasts are conducted with M and N specified for each stock index in Table 2.”

8. In the results discussion, authors say that if applying the nPRM to the markets that are less efficient, the model coefficients may acquire more information with economic meanings and obtain more accurate forecasts than the martingale. What is the reason for this explanation? What is the definition of market efficiency? How is the author identified? If nPRM performs better in an inefficient market, why don't the authors consider using other indexes? If nPRM can perform better in other samples, I will think that this method is meaningful and contributing.

We do not focus on the market efficiency in this work. The main contribution of this work is to present a model to characterize the stock movement with economically meaningful coefficients.

This discussion (in line 325-327,

“Despite the absence of the test in this study, the low errors of the martingale for the four Asian stock indexes imply that the markets tend to be efficient during the period of the data.”)

is a conjecture based on reference [2], which we do not present more tests (the generalized spectral martingale (EV-MDS) test or the wild bootstrap Automatic Variance Ratio (AVR) test in reference [2], for example) and empirical results in this article.

The statement (in line 331-333,

“If applying the nPRM to the markets that are less efficient, the model coefficients may acquire more information with economic meanings and obtain more accurate forecasts than the martingale.”)

could be a future work if we further research into the performance of the nPRM in markets with different levels of efficiency.

In the Introduction Section (line 17-20), we summarize research results from reference [2, 4, 5] and state

“The martingale depicts the stock movements in the strong efficient markets by the theory of random walks [4]. In contrast, many real markets are considered as the weak or the semi-strong efficient markets [2, 5]. Consequently, to forecast the prices in these markets is possible.”

Additionally, the reference [2] concludes in the article that “The EV-MDS test result shows that all the 14 return series under study are not MDS, suggesting the potential of predictability using either linear or non-linear function of past values. … Although all markets are reported earlier to be non-MDS, the AVR test shows the return series of Bangladesh, China, Hong Kong, Malaysia and Taiwan are white noise.”

We add some more examples of Asian stock indexes in this revision including the Philippines PSEi Index, Thailand SET Index, India S&P BSE SENSEX, Singapore Straits Times Index (STI), Indonesia JKSE, and Malaysia KLCI. No matter how efficient an Asian stock market in our investigation is, our proposed method nPRM can predict the stock prices with relatively low MAPE.

In details:

Applied data (line 256-260):

“The empirical study was conducted based on information from real daily closing price data of ten stock indexes from 2009 to 2019. The indexes include Taiwan TAIEX, Japan Nikkei 225, Korea KOSPI, Hong Kong Hang Seng Index, the Philippines PSEi Index, Thailand SET Index, India S&P BSE SENSEX, Singapore Straits Times Index (STI), Indonesia JKSE, and Malaysia KLCI.”

Empirical results (line 283-313)

“Figure 3 displays the 1-step forecasts of the nPRM for Hong Kong Hang Seng Index along with the market values. Figure 3 suggests that the nPRM possesses accurate forecasting ability. The patterns of the forecasts from 2-step to 5-step by the nPRM are all similar to Figure 3, and thus we omit the demonstration here. Figures 4-12 illustrate the 1-step forecasts of the nPRM for other Asian stock indexes individually, which also indicate accurate forecasts by the nPRM.”

“Fig 7.- Fig 12.”

“Table 3 displays the MAPEs of applying the investigated methods to all of the ten Asian Indexes from 1-step forecasts to 5-step forecasts. Since the performances of three deviation-type errors (MAPE, MAE, and RMSE) are similar, we simply report and discuss the results of the MAPEs. From Table 3, the martingale (see details in S1 Appendix) has the smallest errors followed by those from the nPRM, and the curve fitting (see details in S2 Appendix) has the worst performance.”

Empirical results (line 325-331)

“Despite the absence of the test in this study, the low errors of the martingale for these ten Asian stock indexes imply that the markets tend to be efficient during the period of the data. The martingale suggests that no information we can obtain from the past to forecast the future stock prices. However, our proposed method nPRM still captures the trend of the stock movements quite well, and provides meaningful coefficients, A and k, as well under the circumstances.”

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 2

Junhuan Zhang

7 Feb 2022

PONE-D-21-10180R2Study of Asian indexes by a newly derived dynamic modelPLOS ONE

Dear Dr. Lee,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Mar 24 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Junhuan Zhang, PhD

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #4: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #4: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #4: The authors made corresponding revisions to the article according to the revision comments, but some comments were not resolved.

(1) At the end of the introduction, the authors list two contributions. But they are not concrete contributions from the point of view of expression. Contributions should include the innovations of this paper, please refer to expressions in other literature.

(2) In the model section, the authors still haven't told me what the economic intuition of nPRM is. The authors only responded with an explanation of the variables in the model, but did not point out that it is reasonable for prices to change according to nPRM or is in line with economic meanings.

(3) In the results section, the authors selected ten stock index samples to test the explanatory power of the model. Although the ten stock markets were all market efficient during this period, the degree of market efficiency certainly varied. Therefore, the authors can compare the prediction accuracy of nPRM in terms of the degree of market efficiency. Empirical research needs a good result. If your model's prediction performance is not as good as existing methods, then you need to improve the model or limit the scope of use of the model.

(4) The authors proposed a newly PRM, but the comparison with the PRM was removed from the comparison of the prediction results. What is the reason? Is the comparison unsatisfactory or impossible to compare? I need a detailed comparison process.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #4: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 May 2;17(5):e0266600. doi: 10.1371/journal.pone.0266600.r006

Author response to Decision Letter 2


22 Feb 2022

Submission No: PONE-D-21-10180R2

Dear Editor,

We greatly appreciate the reviewers’ time and effort. Your valuable comments and suggestions help us improve our paper. After careful consideration of these comments, we have addressed those comments one by one in the following responses. The original comments from the reviewer(s) are in Black and our responses are in Blue.

With Regards,

Yong Shiuan, Lee

===========================================================

Response to Comments from Reviewer #4

1. At the end of the introduction, the authors list two contributions. But they are not concrete contributions from the point of view of expression. Contributions should include the innovations of this paper, please refer to expressions in other literature.

We appreciate the reviewer’s comment. In the nPRM, the solutions of the equation were derived rigorously so that the solutions of this model can better fit the realistic stock prices of the markets than the PRM (please refer to line 135-137). A procedure is developed to process the raw stock prices without loss of low frequency of data. In summary, under the same dynamic model we proposed before, the nPRM seeks for the solutions that will not only achieve higher prediction accuracy but also provide useful economic meanings for possible applications in general practice. In addition, the nPRM can make the predictions using only raw stock prices without the need to transform them into usual economic variables such as returns or price increments. This is one of the reasons why we compared our results with two commonly used methods dealing with raw stock prices, the curve fitting and the martingale. To emphasize the contributions of this work, we revised the introduction section (please refer to line 87-98).

In detail:

Introduction (line 87-98):

“For the same dynamic model described using the differential equation in the PRM, we derive its solutions rigorously so that the solutions of the nPRM can better fit the realistic stock prices of the markets than the PRM. Two major contributions of the nPRM are as follows. Firstly, the nPRM seeks for the solutions that will not only achieve higher prediction accuracy but also provide useful economic meanings (refer to Figures 1 and 2 below). The coefficients in the nPRM, A and κ, represent the implied equilibrium price, and the convergence speed to the implied equilibrium price, respectively. Secondly, for many predictive models involved in the time series analysis, it is necessary to transform the stock prices to other economic variables, usually the returns or the price increments. The nPRM can make the predictions directly for the stock prices, which implies that the model considers all daily information in the analysis. Furthermore, the conduction of the proposed procedure is simple and convenient.”

===========================================================

2. In the model section, the authors still haven't told me what the economic intuition of nPRM is. The authors only responded with an explanation of the variables in the model, but did not point out that it is reasonable for prices to change according to nPRM or is in line with economic meanings.

We are grateful for the reviewer’s comment. The economic intuition is indeed important for the readers to understand the nPRM. Here, we tried our best to properly address the economic intuition of the nPRM by the phase plots shown in Figures 1 and 2. These two figures are used to demonstrate that it is reasonable for the prices to change by the nPRM for a high degree of consensus among most of the market participants. The economic meanings for the four cases of the nPRM can be explained as below. For case 1 (the 1st zone in Figure 1), most market participants agree that the current price could be overestimated. Therefore, most participants expect that the future price would decrease dramatically for case 1. For case 2 (the 2nd zone in Figure 1), most participants agree that the current price could be underestimated and hence expect a sharp increase in the future price. On the condition (in both case 1 and case 2), participants have high level of consensus for the price. However, the expected equilibrium price is still unknown. The coefficient κ is the gradient value, which represents the blow-up speed of the differential equation. It is positive for both cases 1 and 2, which indicates that the future change of the price will be fierce.

For cases 3 and 4 (the 3rd and 4th zone in Figure 2), most participants think that A is the reasonable expected price. Hence, future price will converge to A. This also explains why we state that the expected price A is the implied equilibrium price in the nPRM. The gradient value, κ, is negative, and its absolute value represents the speed of convergence to the expected price A. The existence of the implied equilibrium price also means that the market is highly efficient. The empirical study shows that the prediction error (MAPE) of the nPRM can be close to the MAPEs of the martingale, and the difference between them is less than 1%.

===========================================================

3. In the results section, the authors selected ten stock index samples to test the explanatory power of the model. Although the ten stock markets were all market efficient during this period, the degree of market efficiency certainly varied. Therefore, the authors can compare the prediction accuracy of nPRM in terms of the degree of market efficiency. Empirical research needs a good result. If your model's prediction performance is not as good as existing methods, then you need to improve the model or limit the scope of use of the model.

Although we do not test the market efficiency in this article (eg. the generalized spectral martingale (EV-MDS) test in reference [2]), we try to provide a glimpse of the market efficiency from sample entropy, a measure of disorder. Sample entropy quantifies the amount of regularity and the unpredictability of fluctuations in a time series. Hence, a low value of sample entropy indicates that the series is deterministic while a high value reflects randomness. We provide an extreme example in our revised manuscript to show a possible range of sample entropy. The example is a simulated series of length 180 from the standard normal distribution, namely N(0, 1). By repeating 1000 times, we obtain the average sample entropy as 2.21. On the other hand, the value of sample entropy is 0, which corresponds to a perfectly regular time series.

We add Table 5 to present the mean value of sample entropy calculated from all series used in estimation of A and κ to make forecasts dynamically in the testing set. Basically, all the mean values are close (nearly between 0.6 and 0.8), which indicates that these ten markets exhibit market efficiency of similar levels.

From the order of the mean sample entropy values, the prediction accuracy of the nPRM mostly agrees with the degree of market efficiency excep the Taiwan stock index TAIEX. TAIEX has a mean sample entropy value of 0.7052 (the second highest). The value suggests that the fluctuations of TAIEX is more unpredictable. However, the nPRM performs well in predicting TAIEX with MAPEs as 0.76, 1.00, 1.21, 1.43, 1.65 for 1-step to 5-step forecasts respectively. This means that the changes of TAIEX are still consistent with the movement that the nPRM characterizes.

The differences between the martingale and the nPRM are close, and the difference between them is less than 1%. Additionally, the coefficients in the nPRM has their own economic meanings. We will integrate the nPRM with other models (machine learning models, for example) to see if we can reduce the prediction error in the future.

In detail:

Results – Empirical results (line 325-353):

“Apart from the martingale, we can investigate the level of market efficiency by sample entropy [38, 39]. Sample entropy measures complexity. When we have a time series of length n as {S(1), S(2),…, S(n)} , a template vector of length m is defined as

, a template vector of length m is defined as Sm(i) = {S(i), S(i + 1) ,…,S(i+m-1)} and the distance function d[Sm(i), Sm(j)], i ≠ j is defined as the Chebyshev distance. Then, we define the sample entropy of the series, SampeEn, as

SampEn = − log\\frac{Bm+1}{Bm},

where Bm is number of template vector pairs having d[Sm(i), Sm(j)]≤r, and r is the tolerance.

The value of sample entropy is nonnegative. The value 0 of sample entropy indicates that the series is perfectly regular without noise while a high value reflects randomness and unpredictability. If we examine the market efficiency from the perspective of sample entropy, a high value of sample entropy indicates a market with high level of efficiency. To show how large the possible value of sample entropy from a highly random series can be, we simulate a series of length 180 from a normal distribution with zero mean and unit variance for 1000 times. The average value of sample entropy is 2.21.”

“Table 5 shows the mean value of sample entropy calculated from all series used in estimation of A and κ to make forecasts dynamically in the testing set for ten indexes in our empirical study. The mean values range between 0.5952 (Korea KOSPI) and 0.7833 (the Philippines PSEi), which represent similar complexity among these markets. We suspect that the forecasting errors should be larger for markets with higher level of efficiency since it is harder to predict the series with random noise. By the order of the mean sample entropy values and the MAPEs for the ten stock indexes, the nPRM shows a tendency toward better forecasts for series with lower market efficiency. That is, the forecasting ability of the nPRM usually increases as the mean sample entropy decreases. However, there is an exception of Taiwan TAIEX. The mean sample entropy obtained from the TAIEX series is the second highest, so that we won't expect a smaller prediction error among the ten indexes. However, the MAPEs of the nPRM for 1-step to 5-step forecasts are relatively low. The changes of TAIEX are similar to the movement which is characterized by the nPRM among the ten markets.”

===========================================================

4. The authors proposed a newly PRM, but the comparison with the PRM was removed from the comparison of the prediction results. What is the reason? Is the comparison unsatisfactory or impossible to compare? I need a detailed comparison process.

The details of comparison between the original PRM and the nPRM are listed below. We present the MAPE (%) of the original PRM and the nPRM for the testing set in the table, where the numbers of the nPRM are exactly the same as those in Table 3 in our manuscript (page 10).

τ = 1 τ = 2 τ = 3 τ = 4 τ = 5

Nikkei 225 nPRM 1.04 1.27 1.53 1.78 2.05

PRM 0.98 1.57 2.16 2.82 3.58

Hang Seng nPRM 1.12 1.44 1.77 2.11 2.43

PRM 1.09 1.68 2.21 2.76 3.22

TAIEX nPRM 0.76 1.00 1.21 1.43 1.65

PRM 0.79 1.28 1.75 2.20 2.68

KOPSI nPRM 0.86 1.10 1.38 1.62 1.84

PRM 0.81 1.26 1.70 2.15 2.63

PSEi nPRM 1.46 1.71 1.93 2.16 2.35

PRM 1.55 2.17 2.63 3.19 3.70

SET nPRM 1.18 1.47 1.70 1.98 2.20

PRM 1.16 1.59 1.98 2.35 2.58

BSE SENSEX nPRM 1.31 1.62 1.88 2.20 2.42

PRM 1.32 1.88 2.34 2.84 3.28

STI nPRM 1.19 1.52 1.77 2.09 2.32

PRM 1.36 1.81 2.20 2.67 2.99

JKSE nPRM 1.27 1.42 1.57 1.73 1.84

PRM 1.27 1.73 2.14 2.49 2.80

KLCI nPRM 0.98 1.30 1.58 1.89 2.07

PRM 1.07 1.50 1.93 2.44 2.80

This table shows a similar phenomenon that the PRM has a tendency toward higher prediction error for multiple step forecasts. Although some (4 out of 10) of the MAPEs of 1-step forecast by the PRM are smaller than those by the nPRM, it simply happens by chance when the cumulated error is still small. The results of the above Table are not significant on the conclusions of this study.

The comparison has been made in the manuscript of our previous version. Here, we quote some sentences as follows.

“Comparing the nPRM with the original PRM, although the PRM obtains smaller 1-step forecast errors, the PRM obtains larger errors than the nPRM with the increase of the number of forecasting steps.”

“The original PRM has some flaw in the integration process and does not require smoothing for the stock price. These disadvantages cause errors in forecasts. The errors are small when the time interval of forecast is short, but they grow with the increase of the time length. The newly proposed method nPRM improves these weaknesses. Therefore, the forecasting errors are notably smaller than the PRM for multiple step forecasts.”

We removed this comparison due to limited space. It’s fair to compare the proposed method with other commonly used models instead of a method we proposed previously, which is the more general way to demonstrate the empirical study.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 3

Junhuan Zhang

24 Mar 2022

Study of Asian indexes by a newly derived dynamic model

PONE-D-21-10180R3

Dear Dr. Lee,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Junhuan Zhang, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #4: The authors have made improvements to all the issues mentioned in the revision comments. The article basically meets the publication standards.

Minors:

The tense of verbs: This proposed procedure brought (abstract), “PRM tend (line 84)”.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #4: No

Acceptance letter

Junhuan Zhang

21 Apr 2022

PONE-D-21-10180R3

Study of Asian Indexes by a Newly Derived Dynamic Model

Dear Dr. Lee:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Junhuan Zhang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. The martingale.

    (PDF)

    S2 Appendix. The curve fitting.

    (PDF)

    S1 Table. Correct up-trend (CP) (%) for the testing set.

    (PDF)

    S2 Table. Correct down-trend (CD) (%) for the testing set.

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    The data underlying the results presented in the study are available from Refinitiv Datastream. The Refinitiv Datastream is purchased by National Chengchi University, and all faculty and students of NCCU have access to this database. Those who are interested in getting access to the data can purchase or request for the product from https://www.refinitiv.com/en or https://reurl.cc/xEg67E.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES