Abstract
In setting up a stochastic description of the time evolution of a financial index, the challenge consists in devising a model compatible with all stylized facts emerging from the analysis of financial time series and providing a reliable basis for simulating such series. Based on constraints imposed by market efficiency and on an inhomogeneous-time generalization of standard simple scaling, we propose an analytical model which accounts simultaneously for empirical results like the linear decorrelation of successive returns, the power law dependence on time of the volatility autocorrelation function, and the multiscaling associated to this dependence. In addition, our approach gives a justification and a quantitative assessment of the irreversible character of the index dynamics. This irreversibility enters as a key ingredient in a novel simulation strategy of index evolution which demonstrates the predictive potential of the model.
Keywords: complex systems, finance, stochastic processes
For over a century, it has been recognized (1) that the unpredictable time evolution of a financial index is inherently a stochastic process. However, despite many efforts (2–11), a unified framework for simultaneously understanding empirical facts (12–19), such as the non-Gaussian form and multiscaling in time of the distribution of returns, the linear decorrelation of successive returns, and volatility clustering, has been elusive. This situation occurs in many natural phenomena, when strong correlations determine various forms of anomalous scaling (20–27). Here, by employing mathematical tools at the basis of a generalization of the central limit theorem to strongly correlated variables (28), we propose a model of index evolution and a corresponding simulation strategy which account for all robust features revealed by the empirical analysis.
Let S(t) be the value of a given asset at time t. The logarithmic return over the interval [t, t + T] is defined as r(t, T) ≡ ln S(t + T) − ln S(t), where t = 0, 1, … and T = 1, 2, …, in some unit (e.g., day). From a sufficiently long historical series, one can sample the empirical probability density function (PDF) of r over a time T, p̄T(r), and the joint PDF of two successive returns r1 ≡ r(t, T) and r2 ≡ r(t + T, T), denoted by p̄2T(2)(r1, r2). This joint PDF contains the information on the correlation between r1 and r2 in the sampling. A well established property (13–16) is that, if T is longer than tens of minutes, the linear correlation vanishes: ∫p̄2T(2)(r1, r2)r1r2 dr1dr2 ≡ 〈r1r2〉p̄2T(2) = 0. This is a consequence of the efficiency of the market (3), which quickly suppresses any arbitrage opportunity. Another remarkable feature is that, within specific T ranges, p̄T approximately assumes a simple scaling form
![]() |
where ḡ and D̄ are the scaling function and exponent, respectively. Eq. 1 manifests self-similarity, a symmetry often met in natural phenomena (20–23, 27): plots of TD̄p̄T vs. r/TD̄ for different T values collapse onto the same curve representing ḡ. We verify the scaling ansatz in Eq. 1 for the Dow Jones Industrial (DJI) index using a dataset of more than one century (1900–2005) of daily closures. This index is paradigmatic of market behavior and the considerable number of data reduces sampling fluctuations substantially. In Fig. 1 the collapse of the empty symbols is rather satisfactory (the explanation of the meaning of the full symbols in Fig. 1 is given below). The scaling function in Eq. 1 is non-Gaussian (2, 12–17). Although linear correlations vanish, in the T range considered ḡ is determined by the strong nonlinear correlations of the returns. Only for T > τc (with τc of the order of the year) successive index returns become independent and p̄T turns Gaussian in force of the central limit theorem (29).
Fig. 1.
Data collapse for p̄T(r) (T measured in days) sampled from a record of ≈2.7 × 104 DJI daily closures (open symbols). The average daily trend of the order of 10−4 has been subtracted. The collapse analysis furnishes the scaling function ḡ reported as the full line and the scaling exponent D̄ ≃ 1/2 (see also Fig. 3 and SI Text). Filled symbols report the data collapse for the results of a single simulation of the DJI history.
Results and Discussion
Our first goal is to establish up to what extent the assumption of simple scaling in Eq. 1 does constrain the structure of the joint PDF p̄2T(2). One must of course have
![]() |
Indeed, the first line of Eqs. 2 follows from r(t, 2T) = r1 + r2. Furthermore, because the joint PDF p̄2T(2) is sampled from a sequence of time-translated intervals of duration 2T along the historical series, both the first and the second halves of all such intervals provide an adequate sampling basis for p̄T. This justifies the second and third lines of Eqs. 2. At this point, we notice that the property 〈r1r2〉p̄2T(2) = 0 implies that 〈r2〉p̄2T = 〈(r1 + r2)2〉p̄2T(2) = 2〈r2〉p̄T. In force of Eq. 1, 〈r2〉p̄T ∼ T2D̄. Hence, we obtain 2T2D̄ = (2T)2D̄, i.e., D̄ = 1/2. Remarkably, for all developed market indices, 〈r2〉p̄T is found to scale consistently with a D̄ pretty close to 1/2 (30).
By switching to Fourier space in Eq. 2, the notion of a generalized product operation allows to identify a solution for p̄2T(2) in terms of p̄T alone. Although the ordinary multiplication of characteristic functions (i.e., Fourier transforms) of Gaussian PDFs would yield trivially the correct p̄2T(2) in the case of independent successive returns (29), the generalized product is used here to take into account strong nonlinear correlations consistently with the anomalous scaling they determine [see supporting information (SI) Text] and can be seen to be at the basis of a central limit theorem (28). Our solution is strongly supported by the remarkable consistency with the numerical results and by the analogy with the independent case. In Fig. 2 we compare the PDF of the return r2 conditioned to a given absolute value of the return r1, as obtained through our solution (continuous lines), with the empirically sampled one (symbols). The agreement does not involve fitting parameters, because D̄ and those entering the assumed analytical form of ḡ are already fixed in Fig. 1.
Fig. 2.
Conditional probabilities of daily returns r2 for different values of r1. Empty symbols refer to the DJI data. The continuous curves are the predictions of our theory for p̄2T(2)(r2‖r1|) ≡ [p̄2T(2)(r1,r2) + p̄2T(2) (−r1,r2)]/∫[p̄2T(2)(r1,r2)+ p̄2T(2)(−r1,r2)]dr2. The absolute value of r1 is introduced for reducing sample fluctuations.
At this point, we must take into account that the simple scaling ansatz in Eq. 1 is only approximately valid (7). Indeed, a consequence of Eq. 1 is 〈|r|q〉p̄T ∼ TqD̄, which we exploited above for q = 2. However, a careful analysis reveals that the qth moment exponent deviates from the linear behavior qD̄ ≃ q/2 for q ≳ 3 (open circles in Fig. 3). Like the linear behavior with slope 1/2 observed for low-order moments, this multiscaling effect is common to most indices (30). To explain this feature, we have to investigate the relation between the empirical p̄T and the stochastic process generating the time series. If PDFs like p̄T and p̄2T(2) were directly describing such a process, this would be with stationary increments. This assumption is legitimate only for sufficiently long times, larger than τc. Below, we identify in the interplay between scaling and nonstationarity a precise mechanism accounting for the robust features of p̄T detected for T ≪ τc, including its multiscaling.
Fig. 3.
Scaling exponent of the qth moment of p̄T. Open (filled) circles refer to the DJI data (simulation) of Fig. 1. The dashed line is q/2. Multiscaling is due to the deviation of D̄(q) from a constant value. The full line reports the time-averaged asymptotic (τc ≫ 1) theoretical prediction based on Eq. 5.
Let us indicate by pt,T and pt,2T(2) the ensemble PDF's corresponding to p̄T and p̄2T(2), respectively. The additional dependence on t, the initial time of the interval [t, t + T], shows that we do not assume stationarity for these PDF's. We postulate that, within specific T ranges (e.g., the one in Fig. 1), p0,T obeys a simple scaling like that in Eq. 1, but possibly with a D and a g different from D̄ and ḡ, respectively. One then realizes that this scaling and the linear decorrelation of returns impose on pt,2T(2) constraints analogous to those for p̄2T(2) in Eq. 2, except for the third one, which now reads
![]() |
This last condition tells us that, as a consequence of the nonlinear correlations, the effective time span of the marginal PDF obtained by integrating p0,2T(2) in r1 must be renormalized by a factor a. This factor is determined again by consistency of the second moments scaling properties, as above. Since now 〈|r|2〉p0,T ∼ T2D, one gets from Eq. 3 a = (22D − 1)1/2D. So, D ≠ 1/2 implies a ≠ 1, and thus nonstationarity and irreversibility of the process. Similar functional relations hold for the PDFs of the magnetization of critical spin models upon doubling the system size and can be explained in that context by the renormalization group theory (27). Our generalized multiplication of characteristic functions allows us to express pt,2T(2) in terms of pt,T and to establish the time-inhomogeneous scaling property
![]() |
It remains now to make explicit the link between the p values and the sampled p̄, and to determine D. By construction, p̄T is a t average of pt,T. Because the time inhomogeneity of pt,T must cross over into homogeneity for t exceeding τc, we expect the following approximation
![]() |
to hold. Indeed, the history over which p̄T is sampled is much longer than τc and allows in principle also an indirect sampling of pt,T if we simply assume pt+τc,T(r) = pt,T(r).
Despite the fact that Eq. 4 implies a simple scaling exponent D for p0,T, Eq. 5 leads to the remarkable property that, independently of D, the low-q moments of p̄T approximately scale with exponent q/2 as soon as τc ≫ 1. Moreover, if D < 1/2, p̄T displays a multiscaling of the same type as that found empirically: D̄(q) < 1/2 for the high-order moments. The matching of the theoretical predictions for the multiscaling of p̄T on the basis of Eq. 5 with the empirical results is a first way of identifying D. For the DJI, in Fig. 3 we show that with D = 0.24 this matching is very satisfactory. The scaling functions of p0,T and p̄T can also be shown to be simply related, once D is known. We notice that the observed multiscaling features of financial indices, which inspired multiplicative cascade models (8, 19) in analogy with turbulence (20, 21), are explained here in terms of an additive process possessing the time-inhomogeneous scaling (Eq. 4).
The introduction of autoregressive schemes like ARCH (5) marked an advance in econometrics and financial analysis (6, 9), and, more generally, in the theory of stochastic processes. In an autoregressive simulation, a number of parameters weighting the influence of the past history on the PDF of the following return must be fixed through some optimization procedure. By our approach, a generalization of pt,2T(2) to the case of n consecutive intervals can be fully expressed just in terms of pt,T and D. This is obtained by taking the inverse Fourier transform of our solution for the characteristic function of the joint PDF (see SI Text). In this way, we can precisely calculate the PDF that rules the extraction of the ith return, ri, giving us as conditioning inputs the previous m ones, ri−m, …, ri−1. Consistently with our schematization in Eq. 5, the existence of exogenous factors acting on the market can be taken into account by resetting the width of the marginal PDFs with an (average) periodicity equal to τc (see SI Text). The results for a single simulation with m = 100, τc = 500, and D = 0.24 are illustrated by the filled symbols in Figs. 1 and 3. The coincidence of the scaling properties observed for the DJI with those of our simulation furnish a second strong indication of the validity of our approach and of the estimation of D.
The correctness of the value of D can be further checked by considering the volatility autocorrelation function at time separation τ (Fig. 4). A well established fact (12–16, 19) is its power law decay c(τ) ≈ τ−β for τ < τc, with β ≃ 0.2 for the DJI. This behavior is not reproduced by routine simulation methods in quantitative finance like GARCH (6) and requires the introduction of more sophisticated, fractional integration techniques (9). The full characterization of the joint PDF of n-consecutive returns allows us to obtain a model expression for c(τ), which again takes into account the nonstationarity of the process (see SI Text). Such an expression behaves asymptotically as a power of τ with an exponent depending on D [c(τ) is constant for D = 1/2 and decays for D < 1/2]. In particular, with D = 0.24 both the model asymptotic expression and the results of our simulation procedure furnish a nice agreement with the exponent β ≃ 0.2 observed for the DJI index (Fig. 4). Thus, the algebraic volatility autocorrelation function decay is reproduced by our scheme and provides a second criterion to fix consistently the anomalous scaling exponent D.
Fig. 4.
Volatility autocorrelation at time separation τ (in days), c(τ) ≡ (Σt=0tmax |r(t, 1)‖r(t + τ, 1)|− Σt=0tmax |r(t, 1)| Σt=0tmax |r(t + τ, 1)|/tmax)/Σt=0tmax |r(t, 1)|2−[Σt=0tmax |r(t, 1)|]2/tmax where tmax + τ − 1 is the total length of the time series. Symbols are as in Fig. 3. The simulation here is precisely the same we refer to in Figs. 1 and 3. The full line gives the slope of the time-averaged asymptotic (τc ≫ 1, τ ≲ τc) model prediction for the volatility autocorrelation, superimposed to the data (see SI Text).
Our approach is based on two postulates: inhomogeneous-time scaling and the vanishing of linear return correlations. These symmetries lead, in an unambiguous, deductive manner, to a model for the underlying stochastic process determining market evolution. Of course, the results follow only when the postulates are valid and we have shown that within specific time-ranges the consequences of these postulates are in remarkable agreement with the data. Major advances in understanding critical phenomena worked in a similar vein decades ago (20), when the scaling assumptions allowed to establish links between seemingly disparate phenomena and put the basis for the development of renormalization group theory (27). So far, the coexistence of anomalous scaling with the requirement of absence of linear correlation imposed by economic principles has been regarded as an outstanding open problem in the theory of stochastic processes. We believe that our solution could be relevant for developments in this field, as well as for describing scaling behaviors of other complex systems (20–26).
Supplementary Material
Acknowledgments
We thank J. R. Banavar for useful suggestions and encouragement.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/cgi/content/full/0706046104/DC1.
References
- 1.Bachelier L. Ann Sci Ecole Norm Sup. 1900;17:21–86. [Google Scholar]
- 2.Mandelbrot BB. J Business. 1963;36:394–419. [Google Scholar]
- 3.Fama EF. Journal of Finance. 1970;25:383–417. [Google Scholar]
- 4.Black F, Scholes M. J Polit Econ. 1973;81:637–654. [Google Scholar]
- 5.Engle R. J Money, Credit and Banking. 1983;15:286–301. [Google Scholar]
- 6.Bollerslev T. J Econometrics. 1986;31:307–327. [Google Scholar]
- 7.Vassilicos JC, Demos A, Tata F. In: Applications of Fractals and Chaos. Crilly AJ, Earnshaw RA, Jones H, editors. Berlin: Springer; 1993. pp. 249–265. [Google Scholar]
- 8.Mandelbrot BB, Fisher AJ, Calvet LE. Cowles Foundation Discussion Paper. 1997:1164. [Google Scholar]
- 9.Andersen TG, Bollerslev T. J Empirical Finance. 1997;4:115–158. [Google Scholar]
- 10.Lux T, Marchesi M. Nature. 1999;397:498–500. [Google Scholar]
- 11.LeBaron B. Proc Natl Acad Sci USA. 2002;99:7201–7206. doi: 10.1073/pnas.072079699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mantegna RN, Stanley HE. Nature. 1995;376:46–49. [Google Scholar]
- 13.Mantegna RN, Stanley HE. An Introduction to Econophysics. Cambridge, UK: Cambridge Univ Press; 2000. [Google Scholar]
- 14.Bouchaud J-P, Potters M. Theory of Financial Risks. Cambridge, UK: Cambridge Univ Press; 2000. [Google Scholar]
- 15.Cont R. Quant Finance. 2001;1:223–236. [Google Scholar]
- 16.Cont R. In: Fractals in Engineering. Lutton E, Levy Véhel J, editors. New York: Springer–Verlag; 2005. [Google Scholar]
- 17.Stanley HE, Amaral LAN, Buldyrev SV, Gopikrishnan P, Plerou V, Salinger MA. Proc Natl Acad Sci USA. 2002;99:2561–2565. doi: 10.1073/pnas.022582899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Yamasaki K, Muchnik L, Havlin S, Bunde A, Stanley HE. Proc Natl Acad Sci USA. 2005;102:9424–9428. doi: 10.1073/pnas.0502613102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lux T. In: Power Laws in the Social Sciences. Cioffi-Revilla C, editor. Cambridge, UK: Cambridge Univ Press; 2006. [Google Scholar]
- 20.Kadanoff LP. Statistical Physics: Statics, Dynamics and Renormalization. Singapore: World Scientific; 2005. [Google Scholar]
- 21.Kadanoff LP. Physics Today. 2001;54:34–39. [Google Scholar]
- 22.Sethna JP, Dahmen KA, Myers CR. Nature. 2001;410:242–250. doi: 10.1038/35065675. [DOI] [PubMed] [Google Scholar]
- 23.Bouchaud JP, Georges A. Phys Rep. 1990;195:127. [Google Scholar]
- 24.Lu ET, Hamilton RJ, McTiernan JM, Bromond KR. Astrophys J. 1993;412:841–852. [Google Scholar]
- 25.Scholz CH. The Mechanics of Earthquakes and Faulting. New York: Cambridge Univ Press; 2002. [Google Scholar]
- 26.Kiyono K, Struzik ZR, Aoyagi N, Togo F, Yamamoto Y. Phys Rev Lett. 2005;95:058101. doi: 10.1103/PhysRevLett.95.058101. [DOI] [PubMed] [Google Scholar]
- 27.Jona-Lasinio G. Phys Rep. 2001;352:439–458. [Google Scholar]
- 28.Baldovin F, Stella AL. Phys Rev E. 2007;75:020101(R)-1–020101(R)-4. doi: 10.1103/PhysRevE.75.020101. [DOI] [PubMed] [Google Scholar]
- 29.Gnedenko BV, Kolmogorov AN. Limit Distributions for Sums of Independent Random Variables. Reading, MA: Addison Wesley; 1954. [Google Scholar]
- 30.Di Matteo T, Aste T, Dacorogna MM. J Bank Fin. 2005;29:827–851. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.









