Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Jul 22.
Published in final edited form as: Commun Stat Theory Methods. 2007 Jun 27;27(9):2137–2141. doi: 10.1080/03610929808832218

ON THE TRACE OF A WISHART

Deborah H Glueck 1, Keith E Muller 2
PMCID: PMC8296823  NIHMSID: NIHMS1630980  PMID: 34305271

Abstract

A derivation based on spectral decomposition allows specifying the characteristic function of the trace of a singular or nonsingular, central or noncentral, true or pseudo-Wishart. The trace equals a weighted sum of noncentral chi-squared random variables and constants. We describe computational methods.

Key Words and Phrases: characteristic function, central, noncentral, pseudo, singular

1. INTRODUCTION

Johnson and Kotz (1972) gave a thorough review of many results about the Wishart. Arnold (1984, pp. 314–317) defined the noncentral Wishart in terms of the matrix normal distribution. Mathai and Provost (1992, p.258) provided definitions for singular and nonsingular, true and pseudo-Wisharts.

Saw (1973) and Shah and Khatri (1974) provided proofs for de Waal’s (1972) conjecture about the expected values of elementary symmetric functions of a noncentral nonsingular true Wishart matrix. Mathai (1980) gave the moments, cumulants and asymptotic distribution of the trace of a noncentral nonsingular true Wishart matrix. Mathai and Pillai (1982) derived the exact density of the trace both in terms of confluent hypergeometric functions and in terms of zonal polynomials. They noted in passing that the trace could also be represented as a weighted sum of noncentral chi-squared random variables, but did not give an explicit form. Kourouklis and Moschopoulos (1985) gave the distribution of the trace of a noncentral nonsingular Wishart using a single gamma series representation, by inverting the moment generating function. Muller and Barton (1989, Appendix 1; 1991) provided the explicit form, as well as a proof. Their result holds for nonsingular true and pseudo-Wisharts. Wong and Liu (1995) suggested calculating the moments of the trace of any Wishart using tensor differential forms.

We extend the result of Muller and Barton (1989; 1991) to include singular Wisharts. Note that the density of a singular or pseudo-Wishart does not exist. The traces of singular true and pseudo-Wisharts are important in the study of power in general linear multivariate models with both fixed and random predictors.

2. RESULTS

With mj an r × 1 vector, and M = [m1 m2mN], define vec(M)=[m1m2mN] (Graybill, 1983, p. 309). Define Dg(mj) as the r × r matrix with i, i element mj,i and zero elsewhere (Stewart, 1973, p.22).

Definition 1. Let N and p be positive integers. Consider the constant matrices M (N × p), Σ (p × p, symmetric, positive semidefinite), Ξ (N × N, symmetric, positive semidefinite), and the random matrix Y (N × p). Then vec(Y) = NNp[vec(M), Σ ⊗ Ξ] if and only if Y has a matrix normal distribution, written Y = NN,p(M, Ξ, Σ) (Arnold, 1981, p. 311). Here Σ ⊗ Ξ = {σij · Ξ}. If Σ or Ξ are less than full rank, Y has a singular matrix normal distribution, written Y = SNN,p(M, Ξ, Σ). Write Y = (S)NN,p(M, Ξ, Σ) when Y has a matrix normal distribution which may or may not be singular.

Definition 2. Let M be an N × p constant matrix, Δ = M′M, Σ a symmetric, positive semidefinite, constant p × p matrix, and Z = (S)NN,p(M, IN, Σ). Then W = Z′Z is a Wishart matrix, indicated

W=Wp(N,Σ,Δ).

W is said to have a Wishart distribution if 0 < p ≤ N and Σ positive definite and hence of full rank, a pseudo-Wishart distribution if 0 < N < p and Σ positive definite, a singular Wishart distribution if 0 < pN and Σ is less than full rank, and a singular pseudo-Wishart distribution if 0 < N < p and Σ is less than full rank. These are central Wisharts if M = 0, and noncentral otherwise.

Lemma. Let Σ be a positive semidefinite symmetric matrix and let Δ = M′M. If W = WP(N, Σ, Δ), then there exists a matrix Y = (S)NN,p(M, IN, Σ) such that Y′Y = W.

Proof. Let Z = NN,p(0, IN, Ip). Use the spectral decomposition to write Σ = VDg(λ)V′, and define F=VDg(λ)12. Let Y = ZF′ + M. Then Y = (S)NN,p(M, I, Σ) (Arnold, 1981, p.312, Theorem 17.2d) and hence Y′Y has the desired distribution.

Theorem. Let v, p and p* be non-negative. Let A be a constant N × N idempotent matrix of rank v ≥ p. Let Σ be a constant, symmetric, positive semidefinite matrix of rank p*p, with spectral decomposition Σ = VDg(λ)V′, with V′V = Ip. Let λ+ be the p* × 1 vector of strictly positive eigenvalues. Without loss of generality, write

Dg(λ)=[Dg(λ+)0p*×(pp*)0(pp*)×p*0(pp*)×(pp*)].

Let δk be a p × 1 vector with a 1 in the kth row and 0 elsewhere. Let Y = (S)NN,p(M, IN, Σ), and M*k = MVδk. Then

tr(YAY)=k=1p*λkχ2(ν,M*k'AM*k)+k=p*+1pM*kAM*k.

Proof. YV = (S)NN,p[MV, IN, Dg(λ)], and thus YVδk is independent of YVδk′ for k ≠ k′, since Dg(λ) ⊗ IN is diagonal. For each kp*, YVδk = NN,1[M*k, IN, λk], and thus δkVYAYVδk=λkχ2(ν,M*kAM*k). For each k such that p* < kp, YVδk = (S)NN,1 [M*k, IN, 0]. This implies that Pr{YVδk = M*k} = 1, and that Pr{δkVYAYVδk=M*kAM*k}=1. The result then follows since

tr(YAY)=tr(VYAYV)=k=1pδkVYAYVδk.

Corollary 1. The characteristic function of the trace of a Wishart may be written as

ϕ(t)=exp[k=1p*λkM*kAM*kit(12λkit)+k=p*+1pitM*kAM*k]k=1p*(12λkit)ν/2.

Corollary 2. The mean and the variance of the trace of a Wishart are

E[tr(YAY)]=νk=1p*λk+k=1p*λkM*kAM*k+k=p*+1pM*kAM*k

and

V[tr(YAY)]=k=1p*λk2(2ν+4M*kAM*k).

3. COMPUTATIONAL METHODS

Observe that λk > 0 and

Pr{tr(YAY)t}=Pr{k=1p*λkχ2(ν,M*kAM*k)tk=p*+1pM*kAM*k}.

One can use this characterization to compute the cumulative distribution function of the trace of a Wishart. Two different methods apply. The first is an approximation, and the second a numerical integration which allows specifying the degree of accuracy. Muller and Barton (1989; 1991) described how to approximate a positively weighted sum of noncentral χ2 variables with a single scaled noncentral χ2. Davies (1980) gave a computer algorithm that employs the inverse Mellin transform to invert the characteristic function of a sum of noncentral χ2 variables.

ACKNOWLEDGEMENTS

The authors gratefully acknowledge the help of Drs. G. G. Koch, R. M. Hamer, L. M. LaVange, D. F. Ransohoff and P. W. Stewart.

Contributor Information

Deborah H. Glueck, Room N-101A, Robert Wood Johnson Medical School, University of Medicine and Dentistry of NJ, 675 Hoes Lane, Piscataway, NJ 08854

Keith E. Muller, Dept. of Biostatistics, CB#7400, University of North Carolina, Chapel Hill, NC 27599

BIBLIOGRAPHY

  1. Arnold SF (1981). The Theory of Linear Models and Multivariate Analysis, Wiley, New York. [Google Scholar]
  2. Davies RB (1980). “The distribution of a linear combination of χ2 random variables,” Applied Statistics, 29, 323–333. [Google Scholar]
  3. De Waal DJ (1972). “On the expected values of the elementary symmetric functions of a noncentral Wishart matrix,” Annals of Mathematical Statistics, 43, 344–347. [Google Scholar]
  4. Graybill FA (1983). Matrices with Applications in Statistics, Wadsworth International Group, Belmont, California. [Google Scholar]
  5. Johnson NL and Kotz S (1972). Distributions in Statistics; Continuous Multivariate Distributions, John Wiley and Sons, New York. [Google Scholar]
  6. Kourouklis S and Moschopoulos PG (1985). “On the distribution of the trace of a noncentral Wishart matrix,” Metron, 43(2), 85–92. [Google Scholar]
  7. Mathai AM (1980). “Moments of the trace of noncentral Wishart matrix,” Communications in Statistics:Theory and Methods, 9(8), 795–801. [Google Scholar]
  8. Mathai AM and Pillai KCS (1982). “Further results on the trace of a noncentral Wishart matrix,” Communications in Statistics. Theory and Methods, 11(10), 1077–1086. [Google Scholar]
  9. Mathai AM and Provost SB (1992). Quadratic Forms in Random Variables: Theory and Applications, Marcel Dekker, Inc., New York. [Google Scholar]
  10. Muller KE and Barton CN (1989). “Approximate power for repeated measures ANOVA lacking sphericity,” Journal of the American Statistical Association, 84(420), 549–555. [Google Scholar]
  11. Muller KE and Barton CN (1991). “Correction to ‘Approximate power for repeated measures ANOVA lacking sphericity,’“ Journal of the American Staistical Association, 86, 255–256. [Google Scholar]
  12. Saw JG (1973). “Expectation of elementary symmetric functions of a Wishart matrix,” Annals of Statistics, 1(3), 580–582. [Google Scholar]
  13. Shah BK and Khatri CG (1974). “Proof of the conjectures about the expected values of the elementary symmetric functions of a noncentral Wishart matrix,” Annals of Statistics, 2(4), 833–836. [Google Scholar]
  14. Stewart GW (1973). Introduction to Matrix Computations, Academic Press, New York. [Google Scholar]
  15. Uhlig H (1994). “On Singular Wishart and Singular Multivariate Beta Distributions,” Annals of Statistics, 22(1), 395–405. [Google Scholar]
  16. Wong CS and Liu D (1995). “Moments of generalized Wishart distributions,” Journal of Multivariate Analysis, 52, 280–94. [Google Scholar]

RESOURCES