Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Jan 12.
Published in final edited form as: Stat Interface. 2017;10:529–541. doi: 10.4310/SII.2017.v10.n4.a1

Bayesian analysis of stochastic volatility-in-mean model with leverage and asymmetrically heavy-tailed error using generalized hyperbolic skew Student’s t-distribution*

William L Leão 1, Carlos A Abanto-Valle 2,, Ming-Hui Chen 3
PMCID: PMC5766051  NIHMSID: NIHMS917109  PMID: 29333210

Abstract

A stochastic volatility-in-mean model with correlated errors using the generalized hyperbolic skew Student-t (GHST) distribution provides a robust alternative to the parameter estimation for daily stock returns in the absence of normality. An efficient Markov chain Monte Carlo (MCMC) sampling algorithm is developed for parameter estimation. The deviance information, the Bayesian predictive information and the log-predictive score criterion are used to assess the fit of the proposed model. The proposed method is applied to an analysis of the daily stock return data from the Standard & Poor’s 500 index (S&P 500). The empirical results reveal that the stochastic volatility-in-mean model with correlated errors and GH-ST distribution leads to a significant improvement in the goodness-of-fit for the S&P 500 index returns dataset over the usual normal model.

Keywords and phrases: Feedback and leverage effect, GH skew Student-t distribution, Markov chain Monte Carlo, Non-Gaussian and nonlinear state space models, Stochastic volatility-in-mean

1. INTRODUCTION

Stochastic volatility (SV) models were introduced in the financial literature to describe time-varying volatilities [41, 42, 21]. Although the basic SV model offers a great flexibility in modeling data with time-varying variances, it can suffer from a lack of robustness in the presence of extreme outlying observations [see, e.g., 27, 22, 1, among others] or skewness of the returns. To deal with this problem, Abanto-Valle et al. [5] and Abanto-Valle et al. [2] proposed new stochastic volatility models based on the generalized skew-Student-t and the skew-Student-t distributions for stock returns, which allow a parsimonious, flexible treatment of the skewness and heavy tails in the conditional distribution of the returns.

However, the volatility of daily stock returns has been estimated with SV models, but the results have relied on an extensive pre-modeling of these series to avoid the problem of simultaneous estimation of the mean and variance. To remedy this problem, [26] introduced the SV in mean (SVM) by incorporating the unobserved volatility as an explanatory variable in the mean equation of the returns and provided an empirical justification that the volatility coefficient in the mean equation is related to the feedback effect, which implies that an increase in the current level of volatility causes agents to increase their forecasts of future volatility and therefore to raise their future required returns. Recently Abanto-Valle et al. [4] extended this class of models by using the scale mixture of normal distributions. It has also long been recognized in stock markets that there is a negative correlation between today’s return and tomorrow’s volatility. This phenomenon is called “leverage effect” or “asymmetry” [45]. The asymmetric stochastic volatility model is well known to describe these phenomena for stock returns. Markov chain Monte Carlo (MCMC) methods have been used for parameter estimation of SV models with leverage effect. For example, [30] and Omori and Watanabe [31] used an efficient mixture sampler and a block sampler for correlated errors, respectively.

In this article, we propose to enhance the robustness of the specification of the innovation returns in SVM models by introducing scale Generalized Hyperbolic skew Student-t distribution with correlated mean and variance errors. The resulting class of models takes into account the asymmetric effect, heavy-tailedness, the feedback, and leverage effects. We refer to this generalization as the SVML-GH-ST model. The flexibility of the SVML-GH-ST model can also capture time varying features in the mean of the returns and heavy tails simultaneously. The estimation of such intricate models is not straightforward, since volatility now appears in both mean and variance equations with correlated innovation errors. Hence, intensive computational methods are needed. Inference for this new SVML-GH-ST model is performed under the Bayesian paradigm via MCMC methods, which permits obtaining the posterior distribution of parameters via simulation, starting from reasonable prior assumptions on the parameters. We simulate the log-volatilities and the shape parameters by using the block sampler for correlated errors (31, 3, 29) and Metropolis-Hastings algorithms, respectively.

The rest of the paper is organized as follows. Section 2 outlines the SVML-GH-ST model as well as the Bayesian estimation procedure using MCMC methods. Section 3 illustrates our proposed method using simulated data. In Section 4, the proposed class of models is applied to the S&P 500 daily returns and model comparison is provided among the competing SVML models. Finally, we conclude the paper with some concluding remarks and suggestions for future developments in Section 5.

2. THE ASYMMETRIC HEAVY-TAILED STOCHASTIC VOLATILITY-IN-MEAN MODEL WITH LEVERAGE EFFECT

2.1 The SVML-GH-ST model

The basic SV in mean model with leverage effect is defined by

yt=β0+β1yt-1+β2eht+eht2εt, (1a)
ht+1=α+ϕht+σηηt, (1b)
(εtηt)~N2[(00),(1ρρ1)], (1c)

where yt and ht are, respectively, the compounded return and the log-volatility at time t. We assume that |ϕ| < 1, i.e., the log-volatility process is stationary, and the initial value h1~N(α1-ϕ,(1-ρ2)ση21-ϕ2). The parameter ρ measures the correlation between εt and ηt. A negative value of ρ (ρ < 0) indicates the so-called leverage effect, i.e., a drop in the return followed by an increase in the volatility. Empirical evidence of the leverage effect can be found in Ghysels et al. [17], Harvey and Shephard [20], Bollerslev and Zhou [8], Omori et al. [30], and Nakajima and Omori [29].

For a joint model of the asymmetric heavy-tailedness and the “feedback” and leverage effects, we replace the normal random variable εt in (1a) by a random variable from the GH skew Student’s t-distribution, denoted by ωt, which can be written in the form of the normal variance-mean mixture as

ωt=μω+δzt+ztεt, (2)

where εt ~ 𝒩 (0, 1), zt~IG(ν2,ν2), and ℐ𝒢 (., .) denotes the inverse gamma distribution, respectively. We assume that μω = −δμz, where μz = E(zt) = ν/(ν − 2), to ensure E(ωt) = 0 and ν > 4 for the finite variance of ωt.

Using the variance-mean mixture representation of the GH skew Student’s t-distribution defined by equation (2), the stochastic volatility-in-mean model with asymmetric heavy-tailedness and leverage effect can be written hierarchically as

yt=β0+β1yt-1+β2eht+eht2{δ(zt-μz)+ztεt}, (3a)
ht+1=α+ϕht+σηηt, (3b)
(εtηt)~N2[(00),(1ρρ1)], (3c)
zt~IG(ν2,ν2). (3d)

The model defined by equations (3a)(3d) will be denoted by SVML-GH-ST. In this setup, using equations (3a), (3b), and (3c) with δ = 0 and zt = 1 ∀t = 1,, T, we obtain the SVM model with leverage effect and normal distribution (SVMLN). Equations (3a)(3d) with δ = 0 define the SVM model with leverage effect and Student-t distribution (SVML-T) [see 3, for details].

Equations (3a)(3d) can be written in an alternative way as follows

(ytht+1)|θ,zt,ht,yt-1~N([β0+β1yt-1+β2eht+eht2δ(zt-μz)α+ϕht],[ztehtρσηzt1/2eht/2ρσηzteht/2ση2(1-ρ2)]), (4)

From equation (4), we have that the conditional distribution yt|θ, zt, ht, ht+1, yt−1 follows a normal distribution with mean and variance given by

μt=β0+β1yt-1+β2eht+eht2δ(zt-μz)+φφ2+τ2zteht2(ht+1-α-ϕht), (5)
Vt=τ2τ2+φ2zteht, (6)

respectively, where τ=1-ρ2ση and φ = ρση. This conditional distribution will be useful in the development of the block sampler in the subsequent subsections.

2.2 Parameter estimation via MCMC

Let θ = (β0, β1, β2, α, ϕ, τ2, φ, ν)′ be the vector of parameters for the SVML-GH-ST model, where ν is the parameter of the mixing variables zt, the degrees of freedom of the GH-ST distribution. We further let h1:T = (h1, h1,, hT )′, z1:T = (z1,, zT )′ and y0:T = (y0,, yT )′ denote the the vector of the log volatilities, the mixing variables and the information available up to time T, respectively. Using the data augmentation principle, the joint posterior density of the parameters and latent unobservable variables can be written as

p(θ,h1:T,z1:Ty0:T)[t=1T-1p(yt,ht+1zt,ht,yt-1,θ)p(ztν)]×p(yTzT,hT,yT-1,θ)p(zTν)p(h1θ)p(θ), (7)

where p(yt, ht+1 | zt, ht, yt−1, θ) is given by equation (4) and p(θ) is the prior distribution. To make Bayesian analysis feasible for parameter estimation in the SVML-GH-ST model, we draw random samples from the posterior distribution of (θ, h1:T, z1:T) given y0:T using MCMC simulation methods. The sampling scheme is described in Algorithm 2.1.

Algorithm 2.1.

  1. Set i = 0 and get starting values for the parameters θ(i) and the latent quantities z1:T(i) and h1:T(i).

  2. Generate θ(i) in turn from its full conditional distribution, given y1:T, h1:T(i-1) and z1:T(i-1).

  3. Draw z1:T(i)~p(z1:Tθ(i),h1:T(i-1),y0:T).

  4. Generate h1:T by blocks as follows:

    1. For l = 1,, K, the knot positions are generated as kl, the floor of [T ×{(l+ul)/(K+2)}], where the ul’s are independent realizations of the uniform random variable on the interval (0, 1).

    2. For l = 1,, K, generate hkl−1+1:kl−1 jointly conditional on ykl−1:kl−1, θ(i), zkl-1+1:kl-1(i),hkl-1(i-1) and hkl(i-1).

    3. For l = 1,, K, draw hkl(i) conditional on y1:T, θ(i), hkl-1(i), and hkl+1(i).

  5. Set i = i + 1 and return to 2 until convergence is achieved.

The prior distributions of the parameters in the SVML–GH-ST model are specified as follows: β0~N(β¯0,σβ02),β1~N(-1,1)(β¯1,σβ12),β2~N(β¯2,σβ22), α | τ2 ~ 𝒩(α0, τ2/q0), φ | τ2 ~ 𝒩(φ0, τ2/p0), ϕ~N(-1,1)(ϕ0,sϕ2), τ2 ~ ℐG(aτ/2, Sτ/2), and ν ~ 𝒢(aν, bν), where aν, bν, α0, φ0, ϕ0, sϕ2, aτ, Sτ, p0, and q0 are known hyperparameters and 𝒩[.,](., .) and 𝒢(., .) denote the truncated normal and gamma distributions, respectively.

As described in Algorithm 2.1, the Gibbs sampler requires sampling parameters and latent variables from their full conditional distributions. Sampling the log-volatilities h1:T in Step 4, is the most difficult task due to the nonlinear setup of the observational equation (3a). An efficient strategy is to sample from the conditional posterior distribution of h1:T by dividing it into several blocks and sampling each block given the other blocks. This idea, called the block sampler or multi-move sampler, was developed by Shephard and Pitt [37] and Watanabe and Omori [44] in the context of state space modeling. They showed that the sampler can produce efficient draws from the target conditional posterior distribution in comparison with a single-move sampler that primitively samples one state, say ht, at a time given the others, hs (st). For the SV model with leverage, Omori and Watanabe [31] developed the associated multi-move sampler and showed that it produces efficient samples. In the next subsection, we extend their method to sample h1:T in the SVML-GH-ST model. The full conditional distributions of θ and the latent variables z1:T are given in Appendix A. Some of them are easy to simulate from.

2.3 A block sampler algorithm

In order to simulate h1:T = (h1,, hT )′ in the SVMLGH-ST model, we first simulate h1 conditional on h2:T and then generate h2:T conditional on h1. To sample the vector h2:T, we develop a multi-move block algorithm. In our block sampler, we divide it into K+1 blocks, hkl−1+1:kl−1 = (hkl−1+1,, hkl−1)′ for l = 1,, K + 1, with k0 = 1 and kK+1 = T, where kl − 1 − kl−1 2 is the size of the l−th block. We sample the block of disturbances ηkl−1:kl−2 = (ηkl−1,, ηkl−2)′ given the end conditions hkl−1 and hkl instead of hkl−1+1:kl−1 = (hkl−1+1,, hkl−1)′. In order to facilitate the exposition, we omit the dependence on θ and assume that kl−1 = t and kl = t + k + 1 for the l−th block such that t + k < T. Then ηt:t+k−1 = (ηt,, ηt+k−1)′ are sampled at once from their full conditional distribution f(ηt:t+k−1|ht, ht+k+1, yt:t+k, zt+1:t+k)1, which without the constant terms is expressed in log scale as

logf(ηt:t+k-1ht,ht+k+1,yt:t+k,zt+1:t+k)-s=tt+k-1ηs22+s=tt+kls-12ση2(ht+k+1-α-ϕht+k)2I(t+k<T), (8)

where 𝕀(t +k < T) is an indicator variable. Excluding the constant terms, ls denotes the conditional distribution of ys given hs and hs+1 for s < T, which is normal with mean μs and variance Vs, given by equations (5) and (6), respectively. We define

L=s=tt+kls-(ht+k+1-α-ϕht+k)22ση2I(t+k<T),

dt+1:t+k = (dt+1,, dt+k)′, ds=Lhs and Q=E(-2Lht+1:t+kht+1:l+k). See equations (B.1) and (B.2) in the Appendix B for details.

Since -12s=tt+k-1ηs2+L in (8) does not have the closed form, we use the Metropolis-Hastings algorithm [10] to sample from this distribution. To obtain the proposal density, we are going to form an approximated linear state space model that mimics (8), from which sampling is easy. Applying a second-order Taylor series expansion to L around the mode η̂t:t+k−1, we have

logf(ηt:t+k-1ht,ht+k+1,yt+1:t+k,zt+1:t+k)const-12r=t+1t+kηr2+L^+Lηt:t+k-1|ηt:t+k-1=η.t:t+k-1(ηt:t+k-1-η^t:t+k-1)+12(ηt:t+k-1-η^t:t+k-1)×E(2Lηt:t+k-1ηt:t+k-1)|ηt:t+k-1=η.t:t+k-1×(ηt:t+k-1-η^t:t+k-1)=const-12r=t+1t+kηr2+L^+d^t+1:t+k(ht+1:t+k-h^t+1:t+k)-12(ht+1:t+k-h^t+1:t+k)Q^(ht+1:t+k-h^t+1:t+k)=const+logf(ηt:t+k-1ht,ht+k+1,θ,yt+1:t+k,zt+1:t+k), (9)

where t+1:t+k, , and denote dt+1:t+k, L, and Q evaluated at ht+1:t+k = ĥt+1:t+k. The expectations are taken with respect to ys’s conditional distribution on hs’s. We use an information matrix for Q because we require that Q is everywhere strictly positive definite. It can be shown that the proposal density f*(ηt:t+k−1|ht, ht+k+1, θ, yt+1:t+k, zt+1:t+k) is the posterior density of ηt:t+k−1 for a linear Gaussian state space model given by equations (10) and (11) below [see 31, 3, for details]. The mode η̂t:t+k−1 can be found by repeating the following algorithm until convergence.

Algorithm 2.2.

  1. Initialize η̂t:t+k−1 and calculate ĥt+1:t+k using (3b).

  2. Evaluate d̂s, M̂s, and N̂s using equations (B.1), (B.3) and (B.4) respectively.

  3. Compute Gs, Js, and bs, for s = t+2,, t+k, recursively, as follows:
    Gs=M^s-N^s2Gs-1-1,Gt+1=M^t+1,Js=Ks-1-1N^s,Jt+1=0,Jt+k+1=0,bs=d^s-JsKt-1-1bs-1bt+1=d^t+1,

    where Ks=Gs.

  4. Define the auxiliary variables y^s=γ^s+Gs-1bs, where
    γ^s=h^s+Ks-1Js+1h^s+1,s=t+1,,t+k.
  5. Consider the linear Gaussian state-space model
    y^s=cs+Zshs+Hsξs,s=t+1,,t+k, (10)
    hs+1=α+ϕhs+Lsξs,s=t,t+1,,t+k, (11)

    where ξs ~ 𝒩(0, I2), cs=Ks-1Js+1α,Zs=1+Ks-1Js+1ϕ,Hs=Ks-1[1,Js+1ση], and Ls = [0, ση]. Apply the Kalman filter and a disturbance smoother [25] to the linear Gaussian state space model in equations (10) and (11) and obtain the posterior mean of ηt:t+k−1 (ht+1:t+k) and set η̂t:t+k−1 (ĥt+1:t+k) to this value.

  6. Return to Step 2 and repeat the procedure until achieving convergence.

Applying the de Jong and Shephard simulation smoother [11] to the model defined by equations (10) and (11) with the auxiliary variables ŷt+1:t+k defined in step 4 of Algorithm 2.2 enables us to sample ηt+1:t+k from the density f*. Since f is not bounded by f*, we use the Metropolis-Hastings algorithm to sample from f as recommended by Chib [10].

In the MCMC sampling procedure, we select the expansion block ĥt+1:t+k in Algorithm 2.2 as follows: the current sample of ηt:t+k=1 (ht+1:t+k) may be taken as an initial value of the η̂t:t+k=1 (ĥt+1:t+k) in Step 1. Once an initial expansion block ĥt+1:t+k is selected, we can calculate the auxiliary ŷt+1:t+k variables in Step 4. Then, applying the Kalman filter and a disturbance smoother to the linear Gaussian state space model consisting of equations (10) and (11) with the artificial ŷt+1:t+k yields the mean of ht+1:t+k conditional on ĥt+1:t+k in the linear Gaussian state space model, which is used as the next ĥ t+1:t+k. By repeating the procedure until the smoothed estimates converge, we obtain the posterior mode of ht+1:t+k. This is equivalent to the method of scoring to maximize the logarithm of the conditional posterior density. Although we have just noted that iterating the procedure achieves the mode, this will slow our simulation algorithm if we have to iterate this procedure until full convergence. Instead we suggest using only five iterations of this procedure to provide a reasonably good sequence ĥt+1:t+k instead of an optimal one.

Finally, we describe the updating procedure of the knot conditions hkl, for l = 2,, K. As the conditional density p(hkl | hkl−1, hkl+1) does not have a closed form, we use the Metropolis-Hastings algorithm with proposal density N(α(1-ϕ)+ϕ(hkl-1+hkl+1)1+ϕ2,ση21+ϕ2). Let hklp and hkl(i-1) denote the proposal value and the previous iteration value. Then, the acceptance probability is given by αMH=min{1,Q(hklp)Q(hkl(i-1))}, where Q(hkl) is the product of the conditional densities ykl−1 | zkl−1, ykl−2, hkl−1, hkl ~ 𝒩(μkl−1, Vkl−1), and ykl | zkl, ykl−1, hkl+1, hkl ~ 𝒩(μkl, Vkl) with μs and Vs defined by equations (5) and (6), respectively, for s = kl − 1 and kl.

3. NUMERICAL ILLUSTRATION WITH A SIMULATED DATASET

In order to assess the performance of the MCMC algorithms described in the previous section, we present results based on a simulated dataset. All the calculations were performed by running a stand alone code developed by the authors using the Scythe statistical library [32], which is available for free download at http://scythe.wustl.edu. We simulated a dataset of 2000 observations of the SVM-L-GH-SST distribution using β0 = 0.25, β1 = 0.03, β3 = −0.2, α = −0.008, ϕ = 0.95, ση2=0.0225, ρ = −0.35, and ν = 10, which correspond to typical values found in daily series of returns. Figure 1 shows the raw data and the histograms of the simulated dataset.

Figure 1.

Figure 1

Simulated dataset from the SVML-GH-ST: Time series of returns (left) and the histogram (right).

We set the prior distributions as follows: β0 ~ 𝒩(0, 100), β1 ~ 𝒩(−1,1)(0.1, 100), β2 ~ 𝒩(−0.1, 100), α|τ2 ~ 𝒩(0, τ2/0.002), ϕ|τ2 ~ 𝒩(−1,1)(0.95, 100), τ2 ~ ℐ𝒢 (2.5, 0.025), φ|τ2 ~ 𝒩(−0.3, τ2/0.005), δ ~ 𝒩(0, 1) and ν ~ 𝒢(12, 0.5). The prior means of β1 and ϕ are, respectively, 0.0032 and 0.0003 and the corresponding prior variances are 0.3328 and 0.3329. In both cases, the priors are equivalent to the uniform distribution on interval (−1, 1), which gives zero mean and variance of 0.3333. Thus, it is clear that the priors specified for β1 and ϕ are essentially non-informative.

The number of blocks, K, in the block sampler was set equal to 30 so that each block contained 66 hts on average. We conducted the MCMC simulation for 50,000 iterations. The first 10,000 draws were discarded as a “burn-in” period, and then the next 40,000 were recorded. In order to reduce the autocorrelation between successive values of the simulated chain, only every 10th values of the chain were stored. With the resulting 4000, we calculated the posterior means, the 95% credible intervals and the convergence diagnostic (CD) statistics proposed by Geweke [16] for all the parameters.

The proposed algorithm is evaluated in terms of how well it estimates the true parameter values. From Table 1 and Figure 2, it can be seen that the estimated results for the parameters appear quite reasonable, because all the 95% credible intervals include true values. According to the CD values, the null hypothesis that the sequence of 4000 draws is stationary was accepted at the 5% level for all the parameters in all the models considered here. The inefficiency factor is defined by 1+s=1ρs, where ρs is the sample auto-correlation at lag s. It measures how well the MCMC chain mixes [see, e.g, 23]. It is the estimated ratio of the numerical variance of the posterior sample mean to the variance of the sample mean from uncorrelated draws. When the inefficiency factor is equal to m, we need to draw MCMC samples m times as many as the number of uncorrelated samples. From Table 1, we found that our algorithm produces a good mixing of the MCMC chain. This fact is further confirmed in Figure 3, where the the autocorrelation function (acf) of the parameters shows a faster decay.

Table 1.

Simulated dataset: summary results for tge SVML-GH-ST model

Parmater True value Posterior mean 95% CI IF CD

β0 0.2500 0.2810 (0.1220, 0.4650) 6.26 −0.12
β1 0.0300 0.0260 (−0.0170, 0.0680) 1.28 0.95
β2 −0.2000 −0.2500 (−0.4450,−0.0700) 5.38 0.01
α −0.0080 −0.0160 (−0.0340,−0.0030) 10.36 −0.95
ϕ 0.9500 0.9210 (0.8680, 0.9610) 10.67 −1.01
ση2
0.0225 0.0330 (0.0160, 0.0550) 21.83 0.96
δ −0.5000 −0.7680 (−1.6100,−0.3400) 21.31 −0.32
ρ 0.3500 −0.2350 (−0.4270,−0.0420) 7.02 0.52
ν 10.0000 12.4430 (8.2330, 19.5520) 20.02 0.15

Figure 2.

Figure 2

Simulated dataset. Histograms and estimated densities from the MCMC output for the SVML-GH-ST. The solid line indicates the true value and the dotted line the 95% credible interval.

Figure 3.

Figure 3

SVML-GH-ST, simulated dataset. Autocorrelation function (acf) for the parameters obtained from the MCMC output.

In Figure 4, the smoothed mean calculated from the MCMC output (dotted line) and true values (solid line) of eht2 are shown. They show that the estimated values follow the behavior of the true volatilities.

Figure 4.

Figure 4

SVML-GH-ST, simulated dataset. True values (solid line) and posterior smoothed mean (dotted line) of eht2.

4. EMPIRICAL APPLICATION

This section analyzes the daily closing prices for the S&P 500 stock market index. The S&P 500 index contains the stocks of 500 Large-Cap corporations. Although a majority of those corporations are US based, it also includes other companies having their common stocks within the index. The data set was obtained from the Yahoo finance web site available to download at http://finance.yahoo.com. The period of analysis is January 4, 1980 – December 31, 2015, which yields 9078 observations. Throughout, we work with the compounded return expressed as yt = 100(log Pt − log Pt−1), where Pt is the closing price on day t.

The compounded S&P 500 returns are plotted in Figure 5 as a time series and also as a histogram. The mean and standard deviation (SD) of returns are 0.03 and 1.12, respectively. As shown in Figure 5, the returns are skewed (−1.16) with heavy tails. From Table 2, we also note that the returns have a large range (minimum, −22.90 and maximum, 10.95). Some extreme observations, explained by some turbulences in financial markets as the stock market crash occurred by October 1987, the Asian financial crises in July 1997, the Russian financial crises in August 1998 and the U. S. market subprime crises in December 2007, contribute to the large kurtosis (29.54) of the S&P 500 returns. As a result, the S&P 500 index returns likely depart from the underlying normality assumption.

Figure 5.

Figure 5

Compounded S & P 500 index returns from January 4, 1980 to December 31, 2015. The left panel shows the plot of the raw series and the right panel the histogram of returns.

Table 2.

Summary statistics for the S&P 500 returns

Median SD Minimum Maximum Skewness Kurtosis
0.03 1.12 −22.90 10.95 −1.16 29.54

We fitted the SVML-N, SVML-T, and SVML-GH-ST models. In all cases, we simulated the ht’s in a multi-move fashion with stochastic knots based on the method described in Section 2.2. We set the prior distributions for the common parameters as follows: β0 ~ 𝒩(0, 100), β1 ~ 𝒩(−1,1)(0.1, 100), β2 ~ 𝒩(−0.1, 100), ϕ ~ 𝒩(−1,1)(0.95, 100), τ2 ~ ℐ𝒢(2.5, 0.025), α | τ2 ~ 𝒩(0, τ2/0.002), and φ | τ2 ~ 𝒩(−0.3, τ2/0.005). The prior distribution on the shape parameter was chosen as ν ~ 𝒢(12, 0.8) for the SVML-T and SVML-GH-ST models, respectively. For the SVML-GH-ST, we set δ ~ 𝒩(0, 100). The initial values of the parameters were randomly generated from the prior distributions. We set initial values of all the log-volatilities, ht, to be zero. Finally the initial z1:T were generated from the prior p(zt | ν).

For the block sampler algorithm, we set the number of blocks K to be 180 in such a way that each block contained 50 hts on average. For the SVML-N, SVML-T, and SVML-GH-ST models, we conducted the MCMC simulation for 25,000 iterations. In all the cases, the first 5000 draws were discarded as a burn-in period. As before, in order to reduce the autocorrelation between successive values of the simulated chain, only every 10th values of the chain were stored. With the resulting 2000 values, we calculated the posterior means, the 95% credible intervals and the convergence diagnostic (CD) statistics [16]. Table 3 summarizes the results. According to the CD values, the null hypothesis that the sequence of 2000 draws is stationary was accepted at the 5% level for all the parameters in all the models considered here. From Table 3 and Figure 7, we found that our algorithm yields a good mixing of the MCMC chain.

Table 3.

Estimation results for the S & P 500 index returns. First row: Posterior mean. Second row: 95% credible interval in parentheses. Third row: CD statistics. Fourth row: Inefficiency factors

Parameter SVML-N SVML-T SVML-GH-ST
0.0614 0.1801 0.0608
β0 (0.0378,0.0863) (0.0274,0.0761) (0.0356,0.0867)
1.78 −0.70 −0.77
1.63 1.76 1.78

0.0017 −0.0081 −0.0122
β1 (−0.0193,0.0231) (−0.0271, 0.01226) (−0.0316,0.0082)
−0.01 1.08 −0.82
1.00 1.00 1.00

−0.0237 −0.0176 −0.0270
β2 (−0.0535,0.0058) (−0.0446,0.0017) (−0.0654, −0.0006)
0.24 1.78 0.49
1.54 1.25 1.26

−0.0078 −0.0063 −0.0126
α (−0.0125, −0.0037) (−0.0102, −0.0030) (−0.0184, −0.0074)
−1.72 1.82 −1.46
3.33 7.51 1.26

0.9760 0.9787 0.9743
ϕ (0.9679,0.9829) (0.9643,0.9826) (0.9661,0.9812)
−1.84 1.17 0.68
9.78 11.26 1.26

0.0332 0.0243 0.0340
ση2
(0.0251,0.0431) (0.0246,0.0918) (0.0258,0.0437)
1.51 −1.12 0.68
17.30 26.33 13.40

−0.3622 −0.5038 −0.2837
ρ (−0.4385, −0.2904) (−0.5933, −0.4145) (−0.3443, −0.2192
1.78 −1.69 −0.53
5.25 11.00 2.36

9.1988 10.7500
ν (7.9690,16.9087) (8.5897,13.7853)
1.32 0.16
26.25 46.83

−0.1534
δ (−0.3443, −0.2192)
−0.53
9.22

Figure 7.

Figure 7

S&P 500 returns dataset. Autocorrelation function (acf) for the parameters obtained from the MCMC output.

Table 3 shows that the posterior mean and 95% credible interval of ϕ. For all the models, the posterior means of ϕ are above 0.97, showing higher persistence, as expected. We found that the persistence values of the SVML-T and the SVML-GH-ST are slightly different from the one for the SVML-N. The posterior means of ση2 under the SVML-N and SVML-GH-ST models are greater than the posterior mean under the SVML-T model, indicating that the volatilities of the SVML-T models is less variable than the equivalent SVML-N and SVM-GH-ST models.

The posterior means together with the 95% credible intervals of the three parameters, which govern the mean process for each of the three models, are reported in Table 3. In all cases the posterior mean of β0 is always positive and statistically significant under each fitted model. The posterior mean of β1 is positive for the SVML-N and negative for the SVML-T and SVML-GH-ST models and similar to the first-order autocorrelation (not reported here). Since the 95% credible interval contains zero, this coefficient is not significant. The β2 parameter, which measures both the ex ante relationship between returns and volatility and the volatility feedback effect, has a negative posterior mean under all of the fitted models. Although the credible interval of β2 barely contains zero under the SVML-N and SVML-T the models, its posterior distribution is primarily located in the negative range, as shown in Table 4. The posterior mean of β2 in the SVML-GH-ST is negative and the 95% credibility interval does not contains zero. This result confirms previous results in the literature and indicates that when investors expect higher persistent levels of volatility in the future, they require compensation for this in the form of higher expected returns.

Table 4.

S & P 500 index returndataset: P(β2 < 0) estimated from the MCMC output

SVML-N SVML-T. SVML-GH-ST
P(β2 < 0) 0.9465 0.9710 0.9805

As expected for all the models considered here, the posterior means of ρ, the correlation coefficient between shocks to return at time t and shocks to volatility at time t + 1, are always negative and the 95% credible intervals do not contain zero. This result indicates that this parameter is statistically significant. Hence, we may conclude that there is a strong and significant “leverage effect” for the S & P 500 index returns returns dataset.

We found that the posterior mean of δ is −0.1534, which indicates that the returns are slightly asymmetric. We also found that the 95% credible interval does not contains zero.

The magnitude of the tail fatness is measured by the shape parameter ν in the SVML-T and SVML-GH-ST models. The posterior means of ν are almost 9.19 and 10.75 under the SVML-T and SVML-GH-ST models, respectively. This difference can be explained by δ, the extra asymmetry parameter, which is considered in the specification of the SVML-GH-ST model. These results seem to indicate that the measurement errors of the stock returns are better explained by heavy-tailed distributions.

Now, we compare the volatility estimates. In Figure 8, we plot the smoothed mean of eht2. The posterior smoothed mean of eht2 under the SVML-T, SVML-GH-ST models show smoother movements than that under the SVML-N model (solid line). Extreme returns, such as the stock market crash occurred and the U. S. market subprime crises in December 2007, clearly make the differences. The models with heavy tails accommodate possible outliers in a somewhat different way by inflating the variance eht2 by zt12eht2. This can have a substantial impact, for instance, on the evaluation of derivative instruments and several strategic or tactical asset allocation topics.

Figure 8.

Figure 8

S & P 500 returns dataset. Posterior smoothed mean (dotted line) of eht2, SVML-GH-ST (solid line), SVML-T (dotted line), SVML-N (tiny line).

To assess the goodness of the estimated models, we calculate the deviance information criteria, DIC [39], Bayesian predictive information criteria, BPIC [6, 7] and the log-predictive score, LPS [19, 18, 12, 5, 2]. The DIC is defined as

DIC=-2Eθy1:T[logp(y1:Tθ)]+pD. (12)

The second term in (12) measures the complexity of the model by the effective number of parameters, pD, defined as the difference between the posterior mean of the deviance and the deviance evaluated at the posterior mean of the parameters:

pD=2[logp(y1:Tθ¯)-Eθy1:T[logp(y1:Tθ)]]. (13)

To calculate the DIC in the context of SVML-GHST model, we use the conditional likelihood p(y1:Tα,ϕ,ση2,ν,δ,ρ,β0,β1,β2,z1:T,h0:T), in this case θ encompasses (α,ϕ,ση2,ν,δ,ρ,β0,β1,β2), z1:T and h1:T.

As pointed by Stone [40], Robert and Titterington [34], Celeux et al. [9], and Ando [7], the DIC suffers from some theoretical aspects. First, in the derivation of DIC, Spiegelhalter et al. [39] assumed that the specified parametric family of probability distributions that generate future observations encompasses the true model. This assumption may not always hold true. Secondly, the observed data are used both to construct the posterior distribution and to compute the posterior mean of the expected log-likelihood. Thus, the bias in the estimate of DIC tends to underestimate the true bias considerably. To overcome these theoretical problems in DIC, recently Ando [7] proposed the Bayesian predictive information criterion (BPIC) as an improved alternative of the DIC. The BPIC criterion is defined as

BPIC=-2Eθy1:T[log{p(y1:Tθ)}]+2Tb^, (14)

where is given by

b1T{Eθy1:T[log{p(y1:Tθ)p(θ)}]-log[p(y1:Tθ^)p(θ^)]+tr{JT-1(θ^)IT(θ^)}+0.5q}. (15)

Here q is the dimension of θ, Eθ|y1:T [.] denotes the expectation with respect to the posterior distribution, θ̂ is the posterior mode, and

IT(θ^)=1Tt=1T(ηT(yt,θ)θηT(yt,θ)θ)|θ=θ^,JT(θ^)=1Tt=1T(2ηT(yt,θ)θθ)|θ=θ^,

with ηT (yt, θ) = logp(yt | y1:t−1, θ) + log p(θ)/T.

Scoring rules provide summary measures for the evaluation of probabilistic forecast by assigning a numerical score based on the predictive distribution and on the event or value that materializes. The fit of the models studied here will be assessed using log predictive scores [19, 18, 12, 5, 2]. The average log predictive score for the one-step ahead prediction is given by

LPS=-1Tt=1Tlogp(yty1:t-1,θ^), (16)

where θ̂ is an estimate of the model parameters and p(yt | y1:t−1,θ̂ ) is the one-step ahead predictive density. The smaller the DIC, BPIC and LPS values, the better the model fits the data.

In the SVML class of models, the log-likelihood function, log p(y1:T | θ) and p(yt | y1:t−1, θ) are estimated using the auxiliary particle filter [see, e.g., 33, 30] with 10,000 particles. Table 5 shows the values of BPIC. According with the DIC, BPIC and LPS criterion, the SVML-GH-ST model fits the data better among all the considered models, suggesting that the S&P 500 index returns return data demonstrate sufficient departure from underlying normality assumptions.

Table 5.

S&P 500 returns dataset. Deviance Information Criteria (DIC), Bayesian predictive information criteria (BPIC) and Log Predictive Score (LPS)

Modelo DIC Ranking BPIC Ranking LPS Ranking
SVML-N 23579.7 3 23988.9 3 1.321 3
SVML-T 23502.3 2 23957.5 2 1.320 2
SVML-GH-ST 23452.7 1 23941.8 1 1.318 1

In order to check the distribution assumptions of the SV models, we use an approach similar to Kim, Shephard and Chib [24]. The diagnostics test is based on the probability integral transform of the realizations yt+1o taken with respect to the one-step-ahead prediction density p(yt+1 | y1:t, θ). The probability integral transform, εt+1, is simply the cumulative distribution function corresponding to the prediction density p(yt+1 | y1:t, θ) evaluated at yt+1o:εt+1=Pr(yt+1yt+1oy1:t,θ). For t = 1, . . . , T, under the null hypothesis that the true distribution of yt+1o is p(yt+1 | y1:t, θ) (or equivalently, the model is correctly specified), the εt+1 converges in distribution to independent and identically distributed uniform random variables on [0, 1] [see, 35, 38, 24, 15, 28, among others]. By letting ςt+1 = Φ−1(εt+1), where Φ() denotes the standard normal cumulative distribution function, a sequence of independent standard normal random variables ςt+1 is obtained, which are the standardized innovations. The probability Pr(yt+1yt+1oy1:t,θ) can then be approximated by

Pr(yt+1yt+1oy1:t,θ)=1ni=1NPr(yt+1yt+1oy1:t,ht+1(i),θ).

The QQ-plots for pseudo residuals the three models fitted, SVML-N, SVML-T and SVML-GH-ST are shown in Figure 9. The qq-plots indicate a lack of fit in the left tail, specially in the SVML-N and SVML-ST models. The indicated mis-specification could be solved by using the SVML with generalized skew-Student-t or skew-Student-t distributions as in Abanto-Valle et al. [5] and Abanto-Valle et al. [2].

Figure 9.

Figure 9

S & P 500 returns dataset. Quantile-Quantile plot of the residuals ςt. The solid line plots the quantiles of the 𝒩(0, 1) against the quantiles of the standard normal and the points were the sorted values of ςt against the quantiles of the standard normal.

5. CONCLUSIONS

This article presented a Bayesian implementation of a robust alternative for estimation in the stochastic volatility-in-mean model with correlated errors, as an extension of the model proposed by [26] and Abanto-Valle et al. [3] via MCMC methods. The SVML model enables us to investigate the dynamic relationship between returns and their time-varying volatility. The Gaussian assumption of the mean innovation was replaced by univariate thick-tailed processes, known as the variance-mean mixture of the normal distribution. Under a Bayesian perspective, we developed an algorithm based on MCMC simulation methods to estimate all the parameters and latent quantities in our proposed SVML-GH-ST model. We illustrated our methods through an empirical application of the S&P 500 returns series, which shows that the SVML-GH-ST model provides a better fit than the SVML-N and SVML-T models in terms of parameter estimates, interpretation, and robustness aspects. The β2 estimate, which measures both the ex ante relationship between returns and volatility and the volatility feedback effect, was found to be negative. These results are in line with those of French et al. [14], who found a similar relationship between unexpected volatility dynamics and returns, and confirm the hypothesis that investors require higher expected returns when unanticipated increases in future volatility are highly persistent. This is consistent with our findings of higher values of ϕ combined with larger negative values for the in-mean parameter. On the other hand, since the posterior mean and 95% credible interval contains only negative values, we can conclude that there is a strong and significant “leverage effect” for the S&P 500 returns dataset.

Our SVML-GH-ST models showed considerable flexibility to accommodate outliers, but their robustness aspects could be seriously affected by the prior of the ν and δ parameters. In this set-up, for example, it would be possible to study different objective priors for the parameters in the GH-ST distributions in the same spirit of the works of Fonseca et al. [13] and Salazar et al. [36] or using a different skew-student-t parameterization as in Abanto-Valle et al. [5] and [2] for example. Nevertheless, an in-depth investigation of this modification is beyond the scope of the present paper, but provides stimulating topics for future research.

Figure 6.

Figure 6

S&P 500 returns dataset. Histograms and estimated densities from the MCMC output for the SVML-GH-ST. The solid line indicates the posterior mean and the dotted line the 95% credible interval.

Acknowledgments

We would like to thank the Editor-in-chief, an associate editor and the two referees for their constructive and insightful comments, which have led to a much improved version of the paper. The first author gratefully acknowledges financial support from the Fundação de Amparo à Pesquisa do Estado de Rio de Janeiro (FAPERJ). Dr. C. A. Abanto-Valle is deeply indebted to CNPq-Brazil and FAPERJ. Dr. M.-H. Chen’s research was partially supported by NIH grants #GM70335 and #P01CA142538.

APPENDIX A: THE FULL CONDITIONAL DISTRIBUTIONS

In this appendix, we describe the full conditional distributions for the parameters and the mixing latent variables z1:T of the SVML-GH-ST model.

Full conditional distributions of β0, β1, and β2

Let mt and Vt be defined by

mt={zteht2φτ2+φ2(ht+1-α-ϕht),t<T,0,t=T,Vt={ztehtτ2τ2+φ2,t<T,zteht,t=T.

For parameters β0, β1 and β2, we set the prior distributions as: β0~N(β¯0,σβ02),β1~N(-1,1)(β¯1,σβ12),β2~N(β¯2,σβ22). Then, the full conditionals are given by

β0y0:T,h1:T,z1:T,β1,β2~N(bβ0aβ0,1aβ0), (A.1)
β1y0:T,h1:T,z1:T,β0,β1~N(bβ1aβ1,1aβ1)Iβ2<1, (A.2)
β2y0:T,h1:T,z1:T,β0,β1~N(bβ2aβ2,1aβ2), (A.3)

where aβ0=t=1T1Vt+1σβ02,bβ0=t=1TwtVt+β¯0σβ02,aβ1=t=1Tyt-12Vt+1σβ12,bβ1=t=1Tutyt-1Vt+β¯1σβ12,aβ2=t=1Te2htVt+1σβ22,bβ2=t=1TriehtVt+β¯2σβ22,wt=yt-β1yt-1-β2eht-eht2δ(zt-μz)-mt,ut=yt-β0-β2eht-eht2δ(zt-μz)-mt,rt=yt-β0-β1yt-1-eht2δ(zt-μz)-mt, and 𝕀2|<1 is the indicator variable.

Full conditional distributions of α, ϕ, φ, δ, and τ2

We assume the following prior distributions: α | τ2 ~ 𝒩(α0, τ2/q0), φ | τ2 ~ 𝒩(φ0, τ2/p0), ϕ~N(-1,1)(ϕ0,sϕ2),δ~N(δ0,sδ2), and τ2 ~ 𝒢I(aτ/2, Sτ /2), where α0, φ0, ϕ0, sϕ2, δ0, sδ2, aτ, Sτ, p0, and q0 are known hyper parameters.

After some simple but tedious algebra, we obtain

α.~N(BαAα,τ2Aα), (A.4)
φ.~N(BφAφ,τ2Aφ), (A.5)
δ.~N(BδAδ,1Aδ), (A.6)

where Aα=q0+1+ϕ1-ϕ+T-1,Bα=α0q0+(1+ϕ)h1+t=1T-1kt,kt=ht+1-ϕht-φgtzt-12e-ht2,Aφ=p0+t=1T-1gt2zt-1e-ht,Bφ=φ0p0+t=1T-1ctgtzt-12e-ht2,Aδ=-φτ2t=1T-11zt(zt-μz)(ht+1-α-ϕht)+(φ2+τ2τ2)t=1T-11zteht/2(zt-μz)(yt-β0-β1yt-1-β2eht),Bδ=(φ2+τ2τ2)t=1T-1(1zt(zt-μz)2)+1zn(zn-μz)2+1sδ2, ct = ht+1 − α − ϕht, and gt=yt-β0-β1yt-1-β2eht-eht2δ(zt-μz). In a similar way, the conditional distribution of ϕ is given by

p(ϕ.)Q(ϕ)exp{-Aϕ2(ϕ-BϕAϕ)2}, (A.7)

where

Q(ϕ)=1-ϕ2exp{-1-ϕ22τ2(h1-α1-ϕ)2},

Aϕ=1sϕ2+t=1T-1ht2τ2,Bϕ=ϕ0sϕ2+t=1T-1lthtτ2,lt=ht+1-α-φ(yt-β0-β1yt-1-β2eht)zt-12e-ht2, and 𝕀|ϕ|<1 is the indicator variable. As p(ϕ | h1:T, α, p(ϕh1:T,α,ση2)) in (A.7) does not have a closed form, we sample from it by using the Metropolis-Hastings algorithm with truncated N(-1,1)(bϕaϕ,ση2aϕ) as the proposal density. The conditional distribution of τ2 is IG(T12,M12), where T1 = aτ +T +2 and M1=(1-ϕ2)(h1-α1-ϕ)2+t=1T-1(ct-φzt-12e-ht2gt)2+p0(φ-φ0)2+q0(α-α0)2+Sτ. Once τ2 and φ are sampled, respectively, from their conditional posteriors, we can calculate ρ and ση2 through ση2=τ2+φ2 and ρ = φ/ση.

Full conditional distributions of zt and ν

The full conditional distribution of zt is given by

p(zt.)Q(zt)(γϑ)λzλ-12Kλ(γ,ϑ)exp{-12(ϑ2zt-1+γ2zt)},

where the values of λ, ϑ and γ are the parameters of a distribution GIG(λ, ϑ, γ) whose values are given by

λ=-ν+12,γ2=δ2φ2+τ2τ2,ϑ2=φ2+τ2τ2e-ht×(yt-β0-β1yt-1-β2eht+μzeht/2δ)2+ν.

We sample zt by the Metropolis-Hastings algorithm. We use GIG(λ, ϑ, γ) as the proposal distribution such that zt and zt(i-1) are the proposal value and previous iteration value, respectively. Thus, the acceptance probability is given by αMH=min{1,Q(zt)Q(zt(i-1))}, where

Q(zt)=exp{φτ2[zt-1/2e-ht/2(ht+1-α-ϕht)(yt-β0-β1yt-1-β2eht+μzeht/2δ)--zt1/2δ(ht+1-α-ϕht)]I{t<n}}.

We assume the prior distribution of ν as 𝒢(aν, bν)𝕀4≤40. Then, the full conditional distribution of ν is

p(νz1:T)ν2Tν2Γ(ν2)Texp{-t=1T12Vt[yt-β0-β1yt-1-β2eht-eht2δ(zt-νν-2)-mt]2-ν2t=1T[1zt+logzt]}νaν-1exp{-bνν}I4<ν40.

We sample ν by the Metropolis-Hastings algorithm [43, 10]. Let ν* denote the mode (or approximate mode) of p(ν | z1:T ), and let (ν) = logp(ν | z1:T ). We use the proposal density N(4,40)(μν,σν2), where μν = ν*′(ν*)/″(ν*) and σν2=-1/(ν). ′(ν*) and ″(ν*) are the first and second derivatives of (ν) evaluated at ν = ν*.

APPENDIX B: SOME DERIVATIONS OF THE BLOCK SAMPLER

First, we define

ds=Lhs=-12+(ys-μs)22Vs+(ys-μs)Vsμshs+(ys-1-μs-1)Vs-1μs-1hs-ϕ(hs+1-α-ϕhs)ση2I(t+k<T) (B.1)

for s = t + 1, . . . , t + k, and

Q=(Mt+1Nt+200Nt+2Mt+2Nt+300Nt+3Mt+3Nt+k00Nt+kMt+k), (B.2)

where

Ms=-E[2Lhs2]=12+1Vs(μshs)2+1Vs-1(μs-1hs)2+ϕ2ση2I(t+k<T),s=t+1,,t+k, (B.3)
Ns=-E[2Lhshs-1]=1Vs-1μs-1hs-1μs-1hs, (B.4)

where s = 2, . . . , T and Nt+1 = 0. Next, we define

μshs={β2ehs+12ehs2δ(zs-μz)+φφ2+τ2zsehs2[(hs+1-α-ϕhs)2-ϕ]s=1,,T-1,β2ehs+12ehs2δ(zs-μz),s=T, (B.5)
μs-1hs={0,s=1,φφ2+τ2zs-1ehs-12,s=2,,T. (B.6)

Footnotes

*

William L. Leão gratefully acknowledges financial support from the Fundação de Amparo à Pesquisa do Estado de Rio de Janeiro (FAPERJ). Dr. C. A. Abanto-Valle is deeply indebted to CNPq-Brazil and FAPERJ. Dr. M.-H. Chen’s research was partially supported by NIH grants #GM70335 and #P01CA142538.

1

For the last block, we have yT | yT−1, hT ~ 𝒩 (β0+β1yT−1+β2ehT + ehT δ(zTμz), zT ehT).

Contributor Information

William L. Leão, Departament of Statistics, Federal University of Rio de Janeiro, Caixa Postal 68530, CEP: 21945-970, Rio de Janeiro, Brazil

Carlos A. Abanto-Valle, Departament of Statistics, Federal University of Rio de Janeiro, Caixa Postal 68530, CEP: 21945-970, Rio de Janeiro, Brazil

Ming-Hui Chen, Department of Statistics, University of Connecticut, 215 Glenbrook Rd, U-4120, Storrs, CT 06269, USA.

References

  • 1.Abanto-Valle CA, Bandyopadhyay D, Lachos VH, Enriquez I. Robust Bayesian analysis of heavy-tailed stochastic volatility models using scale mixtures of normal distributions. Computational Statistics & Data Analysis. 2010;54:2883–2898. doi: 10.1016/j.csda.2009.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Abanto-Valle CA, Dey DK, Lachos VH. Bayesian estimation of a skew-Student-t stochastic volatility model. Methodology and Computing in Applied Probability. 2015;17:721–738. [Google Scholar]
  • 3.Abanto-Valle CA, Migon HS, Lachos VH. Stochastic volatility in mean models with scale mixtures of normal distributions and correlated errors: A Bayesian approach. Journal of Statistical Planning and Inference. 2011;54:1875–1887. [Google Scholar]
  • 4.Abanto-Valle CA, Migon HS, Lachos VH. Stochastic volatility in mean models with heavy-tailed distributions. Brazilian Journal of Probability and Statistics. 2012;26(4):402–422. [Google Scholar]
  • 5.Abanto-Valle CA, Wang C, Wang X, Wang F-X, Chen M-H. Bayesian inference for stochastic volatility models using the generalized skew-t distribution with applications to the Shenzhen stock exchange return. Statistical and Its Interface. 2014;7:487–502. [Google Scholar]
  • 6.Ando T. Bayesian inference for nonlinear and nongaussian stochastic volatility model with leverge effect. Journal of Japan Statistical Society. 2006;36:173–197. [Google Scholar]
  • 7.Ando T. Bayesian predictive information criterion for the evaluation of hierarchical Bayesian and empirical Bayes models. Biometrika. 2007;94:443–458. [Google Scholar]
  • 8.Bollerslev T, Zhou H. Volatility puzzles: A simple framework for gauging return-volatility regressions. Journal of Econometrics. 2005;131:123–150. [Google Scholar]
  • 9.Celeux G, Forbes F, Robert CP, Titterington DM. Deviance information criteria for missing data models. Bayesian Analysis. 2006;1:651–674. [Google Scholar]
  • 10.Chib S. Marginal likelihood from the Gibbs output. Journal of the American Statistical Association. 1995;90:1313–1321. [Google Scholar]
  • 11.de Jong P, Shephard N. The simulation smoother for time series models. Biometrika. 1995;82:339–350. [Google Scholar]
  • 12.Delatola E-I, Griffin JE. Bayesian nonparametric modelling of the return distribution with stochastic volatility. Bayesian Analysis. 2011;6:901–926. [Google Scholar]
  • 13.Fonseca TCO, Ferreira MAR, Migon HS. Objective Bayesian analysis for the Student-t regression model. Biometrika. 2008;95:325–333. [Google Scholar]
  • 14.French KR, Schert WG, Stambugh RF. Expected stock return and volatility. Journal of Financial Economics. 1987;19:3–29. [Google Scholar]
  • 15.Gerlach R, Carter C, Kohn R. Diagnostics for time series analysis. Journal of Time Series Analysis. 1999;20:309–330. [Google Scholar]
  • 16.Geweke J. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. In: Bernardo JM, Berger JO, Dawid AP, Smith AFM, editors. Bayesian Statistics. Vol. 4. Oxford, U.K: Oxford University Press; 1992. pp. 169–193. [Google Scholar]
  • 17.Ghysels E, Harvey AC, Renault E. Stochastic volatility. In: Maddala G, Rao CR, editors. Handbook of Statistics. Vol. 14. Amsterdam: North-Holland; 1996. pp. 119–191. [Google Scholar]
  • 18.Gneiting T, Raftery AE. Strictly proper scoring rules, prediction and estimation. Journal of the American Statistical Association. 2007;6:901–926. [Google Scholar]
  • 19.Good IJ. Rational decisions. Journal of the Royal Statistical Society, Series B. 1952;14:107–114. [Google Scholar]
  • 20.Harvey AC, Shephard N. The estimation of an asymmetric stochastic volatility model for asset returns. Journal of Business and Economic Statistics. 1996;14:429–434. [Google Scholar]
  • 21.Jacquier E, Polson N, Rossi P. Bayesian analysis of stochastic volatility models. Journal of Business and Economic Statistics. 1994;12:371–418. [Google Scholar]
  • 22.Jacquier E, Polson N, Rossi P. Bayesian analysis of stochastic volatility models with fat-tails and correlated errors. Journal of Econometrics. 2004;122:185–212. [Google Scholar]
  • 23.Kim S, Shepard N, Chib S. Stochastic volatility: Likelihood inference and comparison with ARCH models. Review of Economic Studies. 1998;65:361–393. [Google Scholar]
  • 24.Kim S, Shephard N, Chib S. Stochastic volatility: Likelihood inference and comparison with ARCH models. Review of Economic Studies. 1998;65:361–393. [Google Scholar]
  • 25.Koopman S. Disturbance smoothers for state space models. Biometrika. 1993;80:117–126. [Google Scholar]
  • 26.Koopman SJ, Uspensky EH. The stochastic volatility in mean model: Empirical evidence from international tock markets. Journal of Applied Econometrics. 2002;17:667–689. [Google Scholar]
  • 27.Liesenfeld R, Jung RC. Stochastic volatility models: Conditional normality versus heavy-tailed distrutions. Journal of Applied Econometics. 2000;15:137–160. [Google Scholar]
  • 28.Liesenfeld R, Richard J-F. Univariate and multivariate stochastic volatility models: Estimation and diagnostics. Journal of Empirical Finance. 2003;10:505–531. [Google Scholar]
  • 29.Nakajima J, Omori Y. Stochastic volatility model with leverage and asymmetrically heavy-tailed error using GH skew Student’s t-distribution. Computational Statistics & Data Analysis. 2012;56:3690–3704. [Google Scholar]
  • 30.Omori Y, Chib S, Shephard N, Nakajima J. Stochastic volatility with leverage: Fast likelihood inference. Journal of Econometrics. 2007;140:425–449. [Google Scholar]
  • 31.Omori Y, Watanabe T. Block sampler and posterior mode estimation for asymmetric stochastic volatility models. Computational Statistics & Data Analysis. 2008;52:2892–2910. [Google Scholar]
  • 32.Pemstein D, Quinn KV, Martin AD. The scythe statistical library: An open source C++ library for statistical computation. Journal of Statistical Software. 2011;42:1–26. [Google Scholar]
  • 33.Pitt M, Shephard N. Filtering via simulation: Auxiliary particle filter. Journal of the American Statistical Association. 1999;94:590–599. [Google Scholar]
  • 34.Robert CP, Titterington DM. Discussion on “Bayesian measures of model complexity and fit”. Biometrical Journal. 2002;64:573–590. [Google Scholar]
  • 35.Rosenblatt M. Remarks on a multivariate transformation. Annals of Mathematical Statistics. 1952;23:470–472. [Google Scholar]
  • 36.Salazar E, Migon HS, Ferreira MAR. Technical report. Federal University of Rio de Janeiro, Departament of Statistics; 2009. Objective Bayesian analysis for exponential power regression models. [Google Scholar]
  • 37.Shephard N, Pitt M. Likelihood analysis of non-Gaussian measurements time series. Biometrika. 1997;84:653–667. [Google Scholar]
  • 38.Smith JQ. Diagnostic checks of non-standard time series models. Journal of Forecasting. 1985;4:283–291. [Google Scholar]
  • 39.Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B. 2002;64:621–622. [Google Scholar]
  • 40.Stone M. Discussion on “Bayesian measures of model complexity and fit”. Journal of the Royal Statistical Society, Series B. 2002;64:621. [Google Scholar]
  • 41.Taylor S. Financial returns modelled by the product of two stochastic processes-a study of the daily sugar prices 1961–75. In: Anderson O, editor. Time Series Analysis: Theory and Practice. Vol. 1. Amsterdam: North-Holland; 1982. pp. 203–226. [Google Scholar]
  • 42.Taylor S. Modeling Financial Time Series. Chichester: Wiley; 1986. [Google Scholar]
  • 43.Tierney L. Markov chains for exploring posterior distributions (with discussion) Annal of Statistics. 1994;21:1701–1762. [Google Scholar]
  • 44.Watanabe T, Omori Y. A multi-move sampler for estimate non-Gaussian time series model: Comments on Shepard and Pitt (1997) Biometrika. 2004;91:246–248. [Google Scholar]
  • 45.Yu J. On leverage in stochastic volatility model. Journal of Econometrics. 2005;127:165–178. [Google Scholar]

RESOURCES