Eigenvalues of Random Matrices with Isotropic Gaussian Noise and the Design of Diffusion Tensor Imaging Experiments*

Dario Gasbarra; Sinisa Pajevic; Peter J Basser

doi:10.1137/16M1098693

. Author manuscript; available in PMC: 2017 Oct 6.

Published in final edited form as: SIAM J Imaging Sci. 2017 Sep 14;10(3):1511–1548. doi: 10.1137/16M1098693

Eigenvalues of Random Matrices with Isotropic Gaussian Noise and the Design of Diffusion Tensor Imaging Experiments*

Dario Gasbarra ^†, Sinisa Pajevic ^‡, Peter J Basser ^§

PMCID: PMC5630232 NIHMSID: NIHMS906978 PMID: 28989561

Abstract

Tensor-valued and matrix-valued measurements of different physical properties are increasingly available in material sciences and medical imaging applications. The eigenvalues and eigenvectors of such multivariate data provide novel and unique information, but at the cost of requiring a more complex statistical analysis. In this work we derive the distributions of eigenvalues and eigenvectors in the special but important case of m×m symmetric random matrices, D, observed with isotropic matrix-variate Gaussian noise. The properties of these distributions depend strongly on the symmetries of the mean tensor/matrix, D̄. When D̄ has repeated eigenvalues, the eigenvalues of D are not asymptotically Gaussian, and repulsion is observed between the eigenvalues corresponding to the same D̄ eigenspaces. We apply these results to diffusion tensor imaging (DTI), with m = 3, addressing an important problem of detecting the symmetries of the diffusion tensor, and seeking an experimental design that could potentially yield an isotropic Gaussian distribution. In the 3-dimensional case, when the mean tensor is spherically symmetric and the noise is Gaussian and isotropic, the asymptotic distribution of the first three eigenvalue central moment statistics is simple and can be used to test for isotropy. In order to apply such tests, we use quadrature rules of order t ≥ 4 with constant weights on the unit sphere to design a DTI-experiment with the property that isotropy of the underlying true tensor implies isotropy of the Fisher information. We also explain the potential implications of the methods using simulated DTI data with a Rician noise model.

Keywords: eigenvalue and eigenvector distribution, asymptotics, sphericity test, singular hypothesis testing, DTI, spherical t-design, Gaussian orthogonal ensemble

AMS subject classifications: 60F05, 62K05, 62E20, 68U10

1. Introduction

Tensors of second and higher order are ubiquitous in the physical sciences. Some examples include the moment of inertia tensor; electrical, hydraulic, and thermal conductivity tensors; stress and strain tensors, etc. One key advance in the field of tensor measurement was the advent of diffusion tensor imaging (DTI), a magnetic resonance–based imaging technique that provides an estimate of a second-order diffusion tensor in each voxel within an imaging volume [5, 6]. This effectively provides discrete estimates of a continuous or piecewise continuous tensor field within tissue and organs. With the possibility of measuring tensors in millions of individual voxels within, for example, a live human brain, there is a clear need for a statistical framework to be developed to (a) design optimal DTI-experiments, (b) characterize central tendencies and variability in such data, and (c) provide a family of hypothesis tests to assess and compare tensors and the quantities derived from them.

1.1. Tensor-variate normal distribution

In DTI, a tensor D is represented by a symmetric matrix D = (D_i,j : 1 ≤ i ≤ j ≤ 3), and it has been established that the measured tensor components D_ij, over multiple independent acquisitions from the same subject in the same voxel, conform to a multivariate normal distribution [34]. We previously proposed a normal distribution for tensor-valued random variables that arise in DTI whose precision and covariance structures could be written as fourth-order tensors [10]:

p (D) \propto exp (- \frac{1}{2} (D - \bar{D}) : A : (D - \bar{D})),

where A is a fourth-order precision tensor, D̄ is the mean tensor, and “:” is a tensor contraction.

There are distinct advantages to analyzing tensor or tensor-field data in the laboratory coordinate system in which their components are measured, and using the tensor-valued variates with a fourth-order tensor precision tensor rather than writing the tensor as a vector and using a square covariance matrix. For example, by retaining the tensor form, it is easy to establish the condition that the statistical properties be coordinate independent, yielding an isotropic fourth-order precision tensor

A_{ijkl}^{iso} = λ δ_{i j} δ_{k l} + μ (δ_{i k} δ_{j l} + δ_{i l} δ_{j k}),

which can be parameterized with only two constants, μ and λ. This form, if achieved, can greatly simplify statistical analysis and is the focus of this paper.

In the following sections, we switch from tensor to matrix notation [10], as the correspondence between the Gaussian tensor-variate and standard multivariate normal can be established using appropriate conversion factors [12]. The outline of the paper is as follows. First, in this section we state the properties for the m-dimensional isotropic Gaussian matrix. In section 2 we describe a spectral representation and change of variables applicable to general symmetric random matrices. In section 3 we derive distributions for the eigenvalues and eigenvectors of the isotropic Gaussian, while in section 4 we obtain the analytical expressions in the limit of small noise for different symmetries of the mean tensor D̄. In the remaining sections, we focus on the application of these results to DTI. In section 5 we develop a sphericity test, testing for the isotropy of the diffusion tensor; in section 6 we study the isotropy of the Fisher information and justify the use of spherical t-designs as gradient tables in DTI experimental design; and finally, in section 7 we test many of the mathematical results and predictions using Monte Carlo simulations of the DTI-experiment. The main theorems are proved in Appendix SM1 of the supplementary material.

1.2. Isotropic Gaussian matrix distribution

Given a fixed symmetric matrix D̄ ∈ ℝ^m×m, it is shown in [31, 10] that the probability distribution of an m×m symmetric Gaussian random matrix D = (D_ij : 1 ≤ i ≤ j ≤ m) is isotropic around D̄ if and only if it has density of the form

p (D) = C_{m} (μ, λ) exp (- μ Tr ({(D - \bar{D})}^{2}) - \frac{λ}{2} {Tr (D - \bar{D})}^{2}),

(1)

C_{m} (μ, λ) = 2^{(m - 1) m / 4} π^{- (m + 1) m / 4} μ^{(m + 1) m / 4} \sqrt{1 + λ m / (2 μ)},

(2)

with precision parameter μ > 0 and interaction parameter λ satisfying the constraint λm > −2μ. To fix the ideas, when m = 3 this corresponds to a Gaussian distribution for the vectorized matrix

vec (D) = (D_{11}, D_{22}, D_{33}, D_{12}, D_{13}, D_{23}),

(3)

with mean vec(D̄) and precision matrix

A (μ, λ) = (\begin{matrix} λ + 2 μ & λ & λ & 0 & 0 & 0 \\ λ & λ + 2 μ & λ & 0 & 0 & 0 \\ λ & λ & λ + 2 μ & 0 & 0 & 0 \\ 0 & 0 & 0 & 4 μ & 0 & 0 \\ 0 & 0 & 0 & 0 & 4 μ & 0 \\ 0 & 0 & 0 & 0 & 0 & 4 μ \end{matrix}) .

(4)

In particular (D_ij : 1 ≤ i < j ≤ m) are independent, and (D_ii : 1 ≤ i ≤ m) are negatively correlated for λ > 0, with covariance Σ(μ, λ) = A(μ, λ)⁻¹, where

\sum_{i j, i j} = E ({(D_{i j} - {\bar{D}}_{i j})}^{2}) = {(4 μ)}^{- 1}, i \neq j, \sum_{i i, j j} = E ((D_{i i} - {\bar{D}}_{i i}) (D_{j j} - {\bar{D}}_{j j})) = (δ_{i j} - \frac{λ}{2 μ + λ m}) \frac{1}{2 μ} .

Remark 1.1

When D̄ = 0, λ = 0, and μ = 1 or, depending on the scaling convention, μ = 1/2, the random matrix distribution (1) is known in the literature as the Gaussian orthogonal ensemble (GOE). The connection between general isotropic Gaussian matrices and the GOE was first noticed in [37]. The fluctuations of the diagonal elements ((D_ii − D̄_ii) : 1 ≤ i ≤ m) are exchangeable and independent of the off-diagonal elements.

2. Spectral representation and change of variables

We summarize basic facts from the random matrix literature [3, 21, 25, 32, 22, 16, 24, 40]. A symmetric matrix D ∈ ℝ^m×m has spectral decomposition D = OGO^⊤, where G is a diagonal matrix containing the m eigenvalues (γ₁, γ₂, … , γ_m) ∈ ℝ^m, and O = (O⁻¹)^⊤ is an orthogonal matrix with columns corresponding to the normalized eigenvectors. The orthogonal matrices form a compact group 𝒪(m) with respect to the matrix multiplication, which contains the special orthogonal group 𝒮𝒪(m) = {O ∈ 𝒪(m) : det(O) = 1} of rotations. The (m − 1)m/2 independent entries under the diagonal (O_ij : 1 ≤ j < i ≤ m) determine O, and the eigenvalues are distinct for symmetric matrices outside a set of Lebesgue measure zero in ℝ ⁽^m⁺¹⁾^m/². The spectral decomposition is not unique, since D = OGO^⊤ = ROPGP^⊤O^⊤R^⊤ for any permutation matrix P, and any R = (R_ij = ±δ_ij)_1≤_i_≤_j_≤_m, which form the subgroup ℛ(m) of reflections with respect to the Cartesian axes, isomorphic to {1,−1}^m. In order to determine unique O and G, we sort the eigenvalues in descending order γ₁ > γ₂ > · · · > γ_m and impose, for each column vector (O₁_j,O₂_j, … , O_mj)^⊤, j = 1, … , m, the condition that the first encountered nonzero coordinate is positive, and denote by 𝒪(m)⁺ the set of such matrices. An O ∈ 𝒪(m)⁺ is a representative of the left coset Oℛ(m). The change of variables

D \mapsto X = (γ_{i}, O_{i j} : 1 \leq j < i \leq m)

(5)

has differential

\prod_{1 \leq i \leq j \leq m} d D_{i j} = ∣ J (γ, O) ∣ \prod_{1 \leq i \leq m} d γ_{i} \prod_{j < i} d O_{i j},

where J is the Jacobian of the inverse map X ↦ D, which is evaluated by means of differential geometry. We consider a differentiable map Y : ℝ^m×m → ℝ^m×m. The matrix differential can then be written using the chain rule

d Y_{i j} = \sum_{k, h = 1}^{m} \frac{\partial Y_{i j}}{\partial X_{h k}} d X_{h k},

and the wedge product acting on the transformed differentials is

{(d Y)}^{^} : = ⋀_{i, j = 1}^{m} d Y_{i j} = det (\frac{\partial Y_{i j}}{\partial X_{h k}}) ⋀_{h, k = 1}^{m} d X_{h k} .

Note that the wedge product is taken over the independent entries of the matrix, for example, if X is symmetric,

{(d X)}^{^} = \underset{1 \leq h \leq k \leq m}{⋀} d X_{h k},

and when X is skew-symmetric,

{(d X)}^{^} = \underset{1 \leq h < k \leq m}{⋀} d X_{h k} .

The wedge product is also anticommutative, meaning that dx∧dy = −dy∧dx. However, when we compute volume elements, we always choose an ordering of the wedge product producing a nonnegative volume. The Jacobian calculation is based on the following result.

Proposition 2.1 (see [24, Prop. 1.2])

When A,D are m×m matrices and D is symmetric,

{(A^{⊤} dDA)}^{^} = det {(A)}^{m + 1} {(d D)}^{^} .

(6)

Since O^⊤O = I, it follows that the matrix differential O^⊤dO = −dO^⊤O is skew-symmetric. We also have

d D = d O G O^{⊤} + {OdGO}^{⊤} + {OGdO}^{⊤}

and

O^{⊤} d D O = d G + O^{⊤} dOG - G O^{⊤} d O,

where the differential matrix on the right-hand side has diagonal entries dγ_i and off-diagonal entries,

{(O^{⊤} d O)}_{i j} (γ_{i} - γ_{j}) = (γ_{i} - γ_{j}) \sum_{k = 1}^{m} O_{k i} d O_{k j}, i \neq j .

By using property (6), we obtain

{(d D)}^{^} = {(O^{⊤} dDO)}^{^} = V (γ) ⋀_{i = 1}^{m} d γ_{i} {(O^{⊤} d O)}^{^}, where V (γ) = | \begin{matrix} 1 & γ_{1} & γ_{1}^{2} & \dots & γ_{1}^{m - 1} \\ 1 & γ_{2} & γ_{2}^{2} & \dots & γ_{2}^{m - 1} \\ ⋱ \\ 1 & γ_{m} & γ_{m}^{2} & \dots & γ_{m}^{m - 1} \end{matrix} | = \prod_{1 \leq i < j \leq m} (γ_{i} - γ_{j})

(7)

is the Vandermonde determinant. The wedge product (O^⊤dO)^∧ defines a uniform measure on 𝒪(m) which is invariant under the group action, and the Haar probability measure given by

H_{m} (d O) = \frac{1}{Vol (O (m))} {(O^{⊤} d O)}^{^}

is obtained by normalizing with the volume measure (see Corollary 2.1.16 in [33])

Vol (O (m)) = 2^{m} π^{(m + 1) m / 4} \prod_{j = 1}^{m} Γ {(j / 2)}^{- 1} = 2^{m} Vol (O {(m)}^{+}) = 2 Vol (S O (m)) .

We rewrite (7) as

\underset{i \leq j}{⋀} d D_{i j} = Vol (O (m)) V (γ) d γ \times H_{m} (d O), O \in O {(m)}^{+}, γ_{1} > γ_{2} > \dots > γ_{m} .

(8)

3. Eigenvalue and eigenvector distribution

3.1. Zero-mean isotropic Gaussian matrix

We consider first a zero-mean symmetric random matrix D with isotropic Gaussian distribution (1), where D̄ = 0. This is an important special case to consider. While it does not satisfy the physical requirement that the eigenvalues of a diffusion (or other transport) tensor all must be nonnegative, it illustrates the mathematical machinery necessary to derive a closed-form expression for the resulting distribution of tensor eigenvalues. From the spectral decomposition D = OGO^⊤, it follows by using the change of variables (5) in the density (1) that O is independent from G and represents a random rotation distributed according to the constrained probability

2^{m} 1 (O \in O^{+} (m)) H_{m} (d O),

and the ordered D-eigenvalues have joint density on {γ ∈ ℝ^m : γ₁ > · · · > γ_m},

q_{0} (γ) = Z_{m} (μ, λ) V (γ) exp (- (μ + \frac{λ}{2}) \sum_{i = 1}^{m} γ_{i}^{2} - λ \sum_{1 \leq i < j \leq m} γ_{i} γ_{j}),

(9)

with normalizing constant

Z_{m} (μ, λ) = Z_{m} (1, 0) μ^{m (m + 1) / 4} \sqrt{1 + λ m / (2 μ)},

(10a)

Z_{m} (1, 0) = 2^{- m} Vol (O (m)) C_{m} (1, 0) = 2^{m (m - 1) / 4} \prod_{l = 1}^{m} Γ {(l / 2)}^{- 1} .

(10b)

Remark 3.1

The density (9) is not generally Gaussian, since the Vandermonde determinant induces repulsion between the eigenvalues, which are never independent, even in the case when λ = 0 and the diagonal elements D_ii are independent. When λ = 0, after rescaling, (9) is the well-known GOE eigenvalue density, which plays a special role below (see Theorem 4.1). For m = 3, $Z_{3} (μ, λ) = 4 π^{- 1} μ^{5 / 2} \sqrt{2 μ + 3 λ}$ .

3.2. General case

Theorem 3.2

Let D̄ ∈ R^m×m be a symmetric matrix with a spectral decomposition D̄ = ŌḠŌ^⊤, where Ḡ = diag(γ̄₁, γ̄₂, … , γ̄_m), γ̄₁ ≥ γ̄₂ ≥ ·· · ≥ γ̄_m, are the ordered eigenvalues of D̄ , and Ō ∈ 𝒪(m)⁺ (which is not uniquely determined when there are repeated eigenvalues), and let D be a symmetric m×m Gaussian matrix with density (1) isotropic around the mean value D̄. Then, the ordered D-eigenvalues γ₁ > γ₂ > · · · > γ_m have joint density

q_{\bar{γ}} (γ) = Z_{m} (μ, λ) V (γ) exp (- \sum_{i, j = 1}^{m} (δ_{i, j} μ + \frac{λ}{2}) (γ_{i} - {\bar{γ}}_{i}) (γ_{j} - {\bar{γ}}_{j})) \times exp (- 2 μ \sum_{i = 1}^{m} γ_{i} {\bar{γ}}_{i}) I_{m} (2 μ \bar{γ}, γ),

(11)

and ℐ_m is the spherical integral below known as the Harish–Chandra–Itzykson–Zuber (HCIZ) integral [41, 26]:

I_{m} (\bar{γ}, γ) = \int_{O (m)} exp (Tr ({OGO}^{⊤} \bar{G})) H_{m} (d O) = \int_{O (m)} exp (\sum_{i j} {O_{i j}}^{2} {\bar{γ}}_{i} γ_{j}) H_{m} (d O) .

Conditionally on the eigenvalues (γ₁, … , γ_m), the conditional probability of R = Ō^⊤O has density

q_{\bar{γ}} (R ∣ γ) = 2^{m} I_{m} {(2 μ \bar{γ}, γ)}^{- 1} exp (2 μ \sum_{i, j = 1}^{m} {\bar{γ}}_{i} γ_{j} R_{i j}^{2})

(12)

with respect to the Haar probability measure H_m(dR) on Ō^⊤𝒪(m)⁺.

Proof

As in the zero-mean case, we start from the isotropic Gaussian matrix density (1) with mean D̄. By using the spectral representations D = OGO^⊤ and D̄ = ŌḠŌ^⊤, after the change of variables described in section 2, we find the joint density of (G,O) with respect to the product measure

d γ_{1} \times \dots \times d γ_{m} \times H_{m} (d O) on {γ \in ℝ^{m} : γ_{1} > γ_{2} > \dots > γ_{m}} \times O {(m)}^{+},

given as

q_{\bar{D}} (G, O) = C_{m} (μ, λ) Vol (O (m)) V (G) \times exp (- μ Tr {(G - O^{⊤} \bar{O} \bar{G} {\bar{O}}^{⊤} O)}^{2} - \frac{λ}{2} {Tr (G - O^{⊤} \bar{O} \bar{G} {\bar{O}}^{⊤} O)}^{2}) = 2^{m} Z_{m} (μ, λ) V (γ) exp (- μ Tr (G^{2} + {\bar{G}}^{2}) - \frac{λ}{2} {Tr (G - \bar{G})}^{2}) \times exp (2 μ Tr ({\bar{O}}^{⊤} {OGO}^{⊤} \bar{O} \bar{G})) .

We change coordinates with O ↦ R = Ō^⊤O ∈ 𝒪(m)⁺, and using the invariance property of the Haar measure we see that

2^{m} \int_{O {(m)}^{+}} exp (2 μ Tr ({\bar{O}}^{⊤} {OGO}^{⊤} \bar{O} \bar{G})) H_{m} (d O) = \int_{O (m)} exp (2 μ Tr (R^{⊤} G R \bar{G})) H_{m} (d R),

which proves (11). In the new coordinates the random matrix has density

q_{\bar{γ}} (G, R) = 2^{m} Z_{m} (μ, λ) V (G) exp (- μ Tr ({(G - \bar{G})}^{2}) - \frac{λ}{2} {Tr (G - \bar{G})}^{2}) \times exp (2 μ Tr (G R^{⊤} \bar{G} R) - 2 μ γ \cdot \bar{γ})

(13)

with respect to dγ × H_m(dR) on {γ ∈ ℝ^m : γ₁ > γ₂ > · · · > γ_m}× Ō^⊤𝒪(m)⁺ which proves (12).

Remark 3.3

When Ḡ = γ̄Id we say that D̄ is spherical. In such a case, G is stochastically independent from O, which follows the Haar probability distribution. Equation (11) shows the density of the ordered eigenvalues. Often the random matrix literature deals with the density of the unordered eigenvalues on ℝ^m, which depends only on the order statistics and differs by a 1/m! factor. The HCIZ integral admits the series expansion

I_{m} (\bar{γ}, γ) = \sum_{k = 0}^{\infty} \frac{1}{k!} \sum_{α \in Π_{m}^{k}} \frac{C_{α} (\bar{γ}) C_{α} (γ)}{C_{α} (1)},

where the sum is over the set of partitions of k into at most m parts,

Π_{m}^{k} = {α \in ℕ^{m} : α_{1} \geq α_{2} \geq \dots \geq α_{m} \geq 0 and α_{1} + α_{2} + \dots + α_{m} = k},

and C_α(z₁, … , z_m) is the homogeneous zonal polynomial corresponding to the partition α [28, 33, 39, 23]. Theorem 4.6 deals with the second-order asymptotics of ℐ_m(nγ, γ̄) as n→∞. When m = 3,

I_{3} (\bar{γ}, γ) = \frac{1}{8 π} \int_{0}^{2 π} \int_{0}^{2 π} \int_{0}^{π} exp (\bar{γ} Ω (θ, ϕ, ψ) γ^{⊤}) sin (θ) d θ d ϕ d ψ, with Ω (θ, ϕ, ψ) = [\begin{matrix} {(cos ϕ cos ψ - sin ϕ sin ψ cos θ)}^{2} & {(cos ϕ sin ψ + sin ϕ cos ψ cos θ)}^{2} & {(sin ϕ sin θ)}^{2} \\ {(sin ϕ cos ψ - cos ϕ sin ψ cos θ)}^{2} & {(sin ϕ sin ψ - cos ϕ cos ψ cos θ)}^{2} & {(cos ϕ sin θ)}^{2} \\ {(sin ψ sin θ)}^{2} & {(cos ψ cos θ)}^{2} & {(cos θ)}^{2} \end{matrix}]

expressed in Euler angular coordinates.

4. Small noise asymptotics

4.1. Spectral grouping

Theorem 4.1

Let (D⁽ⁿ⁾, n ∈ ℕ) be a sequence of random m×m symmetric matrices such that, for some deterministic limit D̄ and scaling sequence a⁽ⁿ⁾ →∞,

\sqrt{a^{(n)}} (D^{(n)} - \bar{D}) \overset{law}{\to} X,

(14)

where vec(X) is Gaussian with zero-mean and covariance Σ(1, λ) for some λ > −2/m as in (4).

Denoting by ( $γ_{j}^{(n)}$ , 1 ≤ j ≤ m) and (γ̄_j : 1 ≤ j ≤ m) the ordered eigenvalues of D⁽ⁿ⁾ and D̄ , respectively, assume that D̄ has k distinct eigenvalues, i.e.,

{\bar{γ}}_{1} = \dots = {\bar{γ}}_{ℓ_{1}} > {\bar{γ}}_{ℓ_{1} + 1} = \dots = {\bar{γ}}_{ℓ_{2}} > \dots > {\bar{γ}}_{ℓ_{k - 1} + 1} = \dots = {\bar{γ}}_{ℓ_{k}},

with 1 ≤ k ≤ m, ℓ₀ = 0, ℓ_k = m, corresponding to eigenspaces of respective dimensions m_i = (ℓ_i − ℓ_i₋₁). Consider the clusters

C_{i}^{(n)} = {γ_{ℓ_{i - 1} + 1}^{(n)} > γ_{ℓ_{i - 1} + 2}^{(n)} > \dots > γ_{ℓ_{i}}^{(n)}}, 1 \leq i \leq k,

formed by the ordered eigenvalues of D⁽ⁿ⁾ corresponding to the eigenspaces of D̄ taken in the D̄-eigenvalue order, and define the corresponding cluster barycenters as

{\tilde{γ}}_{i}^{(n)} = \frac{γ_{ℓ_{i - 1} + 1}^{(n)} + \dots + γ_{ℓ_{i}}^{(n)}}{m_{i}}, 1 \leq i \leq k .

We also consider the eigenvalue fluctuations

ξ_{j}^{(n)} = \sqrt{a^{(n)}} (γ_{j}^{(n)} - {\bar{γ}}_{j}), 1 \leq j \leq m,

and the cluster barycenter fluctuations

{\tilde{ξ}}_{i}^{(n)} = \sqrt{a^{(n)}} ({\tilde{γ}}_{i}^{(n)} - {\bar{γ}}_{ℓ_{i}}) = \frac{1}{m_{i}} \sum_{j = ℓ_{i - 1} + 1}^{ℓ_{i}} ξ_{j}^{(n)}, 1 \leq i \leq k .

As n→∞, the following limiting distribution appears:

For the cluster barycenters, we have
$({\tilde{ξ}}_{1}^{(n)}, \dots, {\tilde{ξ}}_{k}^{(n)}) \overset{law}{\to} ({\tilde{X}}_{1}, \dots, {\tilde{X}}_{k}),$

where
${\tilde{X}}_{i} = \frac{X_{ℓ_{i - 1} + 1, ℓ_{i - 1} + 1} + \dots + X_{ℓ_{i}, ℓ_{i}}}{m_{i}}, 1 \leq i \leq k,$ (15)

have joint Gaussian density
$q ({\tilde{ξ}}_{1}, \dots, {\tilde{ξ}}_{k}) = \sqrt{1 + λ m / 2} \prod_{i = 1}^{k} \sqrt{\frac{m_{i}}{π}} exp (- \sum_{i = 1}^{k} m_{i} {\tilde{ξ}}_{i}^{2} - \frac{λ}{2} \sum_{i = 1}^{k} \sum_{j = 1}^{k} m_{i} m_{j} {\tilde{ξ}}_{i} {\tilde{ξ}}_{j}),$ (16)

with zero-mean and covariance
$E ({\tilde{X}}_{i} {\tilde{X}}_{j}) = \frac{1}{2} (\frac{δ_{i j}}{m_{i}} - \frac{λ}{2 + λ m}), 1 \leq i, j \leq k .$
For each cluster, the differences between the eigenvalues and their barycenter
$ξ_{j}^{(n)} - {\tilde{ξ}}_{i}^{(n)} = \sqrt{a^{(n)}} (γ_{j}^{(n)} - {\tilde{γ}}_{i}^{(n)}) : i = 1, \dots, k, j = ℓ_{i - 1} + 1, \dots, ℓ_{i},$

are asymptotically independent from their cluster barycenter and the other clusters, with limiting distribution
$(ξ_{ℓ_{i - 1} + 1}^{(n)} - {\tilde{ξ}}_{i}^{(n)}, \dots, ξ_{ℓ_{i}}^{(n)} - {\tilde{ξ}}_{i}^{(n)}) \overset{law}{\to} (γ_{1} - {\tilde{γ}}_{m_{i}}, \dots, γ_{m_{i}} - {\tilde{γ}}_{m_{i}}), 1 \leq i \leq k,$ (17)

where (γ₁ > γ₂ > · · · > γ_{m_i}) are eigenvalues of the standard m_i-dimensional GOE of symmetric Gaussian matrices with zero-mean and precision A_{m_i} (1, 0) with barycenter
${\tilde{γ}}_{m_{i}} = \frac{1}{m_{i}} \sum_{j = 1}^{m_{i}} γ_{j} ~ N (0, 1 / (2 d_{i})) .$

Moreover, the differences (γ₁ − γ̃_{m_i}, … , γ_{m_i} − γ̃_{m_i}) are independent from γ̃_{m_i}, with degenerate density
$q_{m_{i}} (ζ_{ℓ_{i - 1} + 1}, \dots, ζ_{ℓ}) = Z_{m_{i}} (1, 0) \sqrt{π m_{i}} exp (- \sum_{j = ℓ_{i - 1} + 1}^{ℓ_{i}} ζ_{j}^{2}) δ_{0} (ζ_{ℓ_{i - 1} + 1} + \dots + ζ_{ℓ_{i}}) \prod_{ℓ_{i - 1} + 1 \leq j < h \leq ℓ_{i}} | ζ_{j} - ζ_{h} |,$ (18)

where δ₀(z) denotes the Dirac distribution, which is also the conditional density of the GOE eigenvalues (γ₁, … , γ_{m_i}) conditioned on {γ₁ + · · · + γ_{m_i} = 0}.
In particular, for each cluster,
$(\sqrt{a^{(n)}} (γ_{j}^{(n)} - γ_{h}^{(n)}) : ℓ_{i - 1} + 1 \leq j < h \leq ℓ_{i - 1}) \overset{law}{\to} ((γ_{j} - γ_{h}) : 1 \leq j < h \leq m_{i}),$

and these eigenvalue differences are asymptotically independent from the cluster barycenter and the other clusters.

Remark 4.2

The weak convergence hypothesis (14) implies

D^{(n)} = O^{(n)} diag (γ^{(n)}) O^{{(n)}^{⊤}} \overset{P}{\to} \bar{D} = \bar{O} diag (\bar{γ}) {\bar{O}}^{⊤},

which means that $γ^{(n)} \overset{P}{\to} \bar{γ}$ and ${\bar{O}}^{⊤} O^{(n)} \overset{P}{\to} I$ in probability. The asymptotic distribution in (17) depends only on m_i (the size of the cluster) and not on the interaction parameter λ. When $\sqrt{a^{(n)}} (D^{(n)} - \bar{D})$ has an isotropic Gaussian distribution with covariance Σ(1, λ), and the mean D̄ = γ̄I is spherically symmetric, there is only one cluster, and the distributional equalities in Theorem 4.1 hold exactly without going to the limit in distribution. A related result is given in [44] for the joint asymptotic distribution of eigenvalues and eigenvectors. Similar results have been derived in the special cases of noncentral Wishart random matrices and sample covariance matrices which are asymptotically Gaussian [2], [33, Theorem 9.5.5].

Next, we illustrate the implications of Theorem 4.1 in the 3-dimensional situation, which is relevant for DTI.

Corollary 4.3

Let D be a 3 × 3 symmetric matrix with Gaussian density (1). As μ→∞ with λ > −2μ/3, we have four asymptotic regimes depending on the symmetries of the mean matrix D̄.

γ̄₁ > γ̄₂ > γ̄₃ (totally asymmetric tensor). The joint density of (γ₁, γ₂, γ₃) is approximated by the Gaussian density of (D₁₁,D₂₂,D₃₃), i.e.,
$q (γ_{1}, γ_{2}, γ_{3}) ≃ \frac{μ \sqrt{2 μ + 3 λ}}{π^{3 / 2} \sqrt{2}} exp (- μ \sum_{i} {(γ_{i} - {\bar{γ}}_{i})}^{2} - \frac{λ}{2} \sum_{i j} (γ_{i} - {\bar{γ}}_{i}) (γ_{j} - {\bar{γ}}_{j})) .$ (19)
γ̄₁ > γ̄₂ = γ̄₃ (prolate tensor). Let γ̃₂₃ = (γ₂+γ₃)/2. The joint distribution of (γ₁, γ̃₂₃) is approximated by the Gaussian distribution of (D₁₁, (D₂₂ + D₃₃)/2, i.e.,
$q (γ_{1}, {\tilde{γ}}_{23}) ≃ π^{- 1} \sqrt{2 μ^{2} + λ^{2} 3 / 4 + 3 μ λ} \times exp (- (μ + \frac{λ}{2}) {(γ_{1} - {\bar{γ}}_{1})}^{2} - 2 (μ + λ) {({\tilde{γ}}_{23} - {\bar{γ}}_{2})}^{2} - 2 λ (γ_{1} - {\bar{γ}}_{1}) ({\tilde{γ}}_{23} - {\bar{γ}}_{2})) .$ (20)

Conditionally on (γ₁, γ̃₂₃), the asymptotic distribution of (γ₂, γ₃) is degenerate, with γ₃ = (2γ̃₂₃ − γ₂) and
$q (γ_{2} ∣ {\tilde{γ}}_{23}) ≃ (γ_{2} - {\tilde{γ}}_{23}) exp (- 2 μ {(γ_{2} - {\tilde{γ}}_{23})}^{2}) 2 μ 1 (γ_{2} > {\tilde{γ}}_{23}),$ (21)

that is, $(γ_{2} - {\tilde{γ}}_{23}) = ({\tilde{γ}}_{23} - γ_{3}) ≃ \sqrt{τ}$ , with τ exponentially distributed with rate 2μ and independent from the barycenter γ̃₂₃.
γ̄₁ = γ̄₂ > γ̄₃ (oblate tensor). This is similar to the prolate case. Let γ̃₁₂ = (γ₁+γ₂)/2. Asymptotically the joint distribution of (γ̃₁₂, γ₃) is approximated by the Gaussian distribution of ((D₁₁ + D₂₂)/2,D₃₃, with
$q ({\tilde{γ}}_{12}, γ_{3}) ≃ π^{- 1} \sqrt{2 μ^{2} + λ^{2} 3 / 4 + 3 μ λ} \times exp (- (μ + \frac{λ}{2}) {(γ_{3} - {\bar{γ}}_{3})}^{2} - 2 (μ + λ) {({\tilde{γ}}_{12} - {\bar{γ}}_{1})}^{2} - 2 λ (γ_{3} - {\bar{γ}}_{3}) ({\tilde{γ}}_{12} - {\bar{γ}}_{1})),$ (22)

and the asymptotic conditional distribution of (γ₁, γ₂) given (γ̃₁₂, γ₃) is degenerate with γ₂ = (2γ̃₁₂ − γ₁), and
$q (γ_{1} ∣ {\tilde{γ}}_{12}) ≃ (γ_{1} - {\tilde{γ}}_{12}) exp (- 2 μ {(γ_{1} - {\tilde{γ}}_{12})}^{2}) 2 μ 1 (γ_{1} > {\tilde{γ}}_{12}),$ (23)

i.e. $(γ_{1} - {\tilde{γ}}_{12}) = ({\tilde{γ}}_{12} - γ_{2}) ≃ \sqrt{τ}$ , with τ exponentially distributed with rate 2μ, independent from γ̃₁₂.
γ̄₁ = γ̄₂ = γ̄₃ (isotropic tensor). The barycenter ${\tilde{γ}}_{123} = (γ_{1} + γ_{2} + γ_{3}) / 3 = \frac{1}{3} Tr (D)$ is Gaussian with mean γ̄₁ and variance 1/(6μ + 9λ). Conditionally on γ̃₁₂₃, (γ₁, γ₂, γ₃) is degenerate, with γ₂ = (3γ̃₁₂₃ −γ₁ −γ₃), and the conditional density of (γ₁, γ₃) given γ̃₁₂₃ is approximated as
$q (γ_{1}, γ_{3} ∣ {\tilde{γ}}_{123}) ≃ {(2 μ)}^{5 / 2} \sqrt{\frac{3}{π}} (γ_{1} - γ_{3}) (2 γ_{1} + γ_{3} - 3 {\tilde{γ}}_{123}) (3 {\tilde{γ}}_{123} - γ_{1} - 2 γ_{3}) \times exp (- 2 μ {{(γ_{1} - {\tilde{γ}}_{123})}^{2} + {(γ_{3} - {\tilde{γ}}_{123})}^{2} + (γ_{1} - {\tilde{γ}}_{123}) (γ_{3} - {\tilde{γ}}_{123})}) 1 (γ_{1} > γ_{123} > γ_{3}) .$ (24)

Asymptotically, the conditional distribution of the vector
$\sqrt{2 μ} (γ_{1} - {\tilde{γ}}_{123}, γ_{2} - {\tilde{γ}}_{123}, 2 {\tilde{γ}}_{123} - γ_{1} - γ_{2})$
coincides with the conditional distribution of the ordered eigenvalues of the 3-dimensional standard GOE, conditioned on having zero barycenter, and is independent of γ̃₁₂₃.

Remark 4.4

For a totally anisotropic mean tensor D̄, the asymptotic Gaussian density (19) for the rescaled eigenvalue fluctuations around their barycenter coincides with the Gaussian eigenvalue density (18) of [10]. However, in [10] it was erroneously postulated that the map D = (OGO^⊤) ↦ G was linear with constant Jacobian and (19) would be the eigenvalue density of a random tensor with isotropic Gaussian noise; in fact, in the nonasymptotic case the eigenvalue density is given by (11).

4.2. Axial and radial diffusivity marginals

Two eigenvalue statistics that are particularly relevant in DTI are axial diffusivity (AD), which corresponds to the largest D-eigenvalue γ₁, is measured along the principal axis of the diffusion tensor, and is considered a putative axonal damage marker, and radial diffusivity (RD), which corresponds to γ̃₂₃ = (γ₂+γ₃)/2, is measured perpendicular to the principal axis, and is thought to be sensitive to the degree of hindrance that diffusing water molecules experience due to the axonal membrane and myelin sheath. In this subsection we derive the distributions for AD and RD in dimension m = 3 when D has the density given in (1). When the mean matrix D̄ is prolate, we show in Corollary 4.3 that in the small noise limit the joint distribution of AD and RD is asymptotically Gaussian, as shown in (20).

In the case of D with spherical mean D̄ = γ̄Id, we can also derive the marginal densities of AD and RD. See also [15], which contains a recursive expression for the distribution of the largest GOE eigenvalue in arbitrary dimension. After changing variables in the joint conditional eigenvalue density (24), we see that z_i = (γ_i − γ̃₁₂₃) are independent of the barycenter γ̃₁₂₃, z₁ = (γ₁− γ̃₁₂₃) and (−z₃) = (γ̃₁₂₃−γ₃) are identically distributed, with marginal density

q (z_{1}) = {(2 μ)}^{5 / 2} \sqrt{\frac{3}{2}} exp (- 3 μ z_{1}^{2} / 2) 1 (z_{1} > 0) \times \int_{- 2 z_{1}}^{- z_{1} / 2} (z_{3} - z_{1}) (2 z_{1} + z_{3}) (2 z_{3} + z_{1}) exp (- {(z_{3} + z_{1} / 2)}^{2} 2 μ) d z_{3} = μ^{3 / 2} \sqrt{\frac{6}{π}} (\frac{9 z_{1}^{2}}{2} + \frac{exp (- 9 μ z_{1}^{2} / 2) - 1}{μ}) exp (- 3 μ z_{1}^{2} / 2) 1 (z_{1} > 0)

and cumulative distribution function

P (γ_{1} - {\tilde{γ}}_{123} \leq t) = 1 - P (γ_{3} - {\tilde{γ}}_{123} \leq - t) = 1 - P ({\tilde{γ}}_{23} - {\tilde{γ}}_{123} \leq - t / 2) = μ^{3 / 2} \sqrt{\frac{6}{π}} \int_{0}^{t} (\frac{9 z^{2}}{2} + \frac{exp (- 9 μ z^{2} / 2) - 1}{μ}) exp (- 3 μ z^{2} / 2) d z = {Φ (t \sqrt{3 μ}) + Φ (t \sqrt{12 μ}) - 1} - 3 t \sqrt{\frac{3 μ}{2 π}} exp (- 3 μ t^{2} / 2),

where

ϕ (t) = \frac{1}{\sqrt{2 π}} exp (- t^{2} / 2), Φ (t) = \int_{- \infty}^{t} ϕ (s) d s

denote the standard Gaussian density and cumulative distribution function, respectively. The cumulative distribution function of γ₁ is obtained by taking convolution with the barycenter γ̃₁₂₃ distribution 𝒩(γ̄, 1/(6μ + 9λ)), as

P (γ_{1} - \bar{γ} \leq t) = 1 - P (γ_{3} - \bar{γ} \leq - t) \int_{- \infty}^{t} P (γ_{1} - {\tilde{γ}}_{123} \leq t - x) exp (- (6 μ + 9 λ) x^{2} / 2) \sqrt{\frac{6 μ + 9 λ}{2 π}} d x = \int_{- \infty}^{t} [Φ ((t - x) \sqrt{3 μ}) + Φ ((t - x) 2 \sqrt{3 μ})] exp (- (6 μ + 9 λ) x^{2} / 2) \sqrt{\frac{6 μ + 9 λ}{2 π}} d x + \frac{μ}{(μ + λ)} {\frac{(2 μ + 3 λ) t}{\sqrt{μ + λ}} ϕ (\frac{t \sqrt{2 μ^{2} + 3 λ μ}}{\sqrt{μ + λ}}) [Φ (\frac{t (2 μ + 3 λ)}{\sqrt{μ + λ}}) - 1] + \frac{ϕ (t \sqrt{6 μ + 9 λ})}{\sqrt{2 π}}} - Φ (t \sqrt{6 μ + 9 λ}) .

The joint density of AD and RD is given by

q (γ_{1}, {\tilde{γ}}_{23}) = \frac{4 μ^{3 / 2} \sqrt{2 μ + 3 λ}}{π} ({(γ_{1} - {\tilde{γ}}_{23})}^{2} + \frac{exp (- 2 μ {(γ_{1} - {\tilde{γ}}_{23})}^{2}) - 1}{2 μ}) exp (- (μ + \frac{λ}{2}) {(γ_{1} - \bar{γ})}^{2} - (2 μ + 2 λ) {({\tilde{γ}}_{23} - \bar{γ})}^{2} - 2 λ (λ_{1} - \tilde{γ}) ({\tilde{γ}}_{23} - \tilde{γ})) 1 (γ_{1} > {\tilde{γ}}_{23}) .

4.3. Eigenvector asymptotics

In the settings of Theorem 4.1, where D⁽ⁿ⁾ and D̄ have respective spectral decompositions O⁽ⁿ⁾G⁽ⁿ⁾O⁽ⁿ^{)^⊤} and ŌḠŌ^⊤, we study the asymptotics of R⁽ⁿ⁾ = Ō^⊤O⁽ⁿ⁾ ∈ 𝒪(m). Omitting the n superscript, we use the decomposition R = ŘR̂, where

\overset{ˇ}{R} = (\begin{matrix} {\overset{ˇ}{R}}_{(1, 1)} & 0 \\ {\overset{ˇ}{R}}_{(2, 2)} \\ ⋱ \\ 0 & {\overset{ˇ}{R}}_{(k, k)} \end{matrix})

(25)

is block diagonal with blocks Ř₍_j,j₎ ∈ 𝒪(m_j) corresponding to the m_j-dimensional eigenspaces of D̄.

These matrices form a subgroup 𝒦_γ̄ ≃ 𝒪(m₁)×𝒪(m₂)×· ··×𝒪(m_k) such that ŘD̄Ř^⊤ = D̄ for all Ř ∈ 𝒦_γ̄, and the conditional eigenvector density (12) is invariant under the action of 𝒦_γ̄.

R̂ ∈ 𝒮𝒪(m) is a rotation with Lie matrix exponential representation

\hat{R} = exp (\hat{S}) = \sum_{k = 0}^{\infty} \frac{{\hat{S}}^{k}}{k!}, where \hat{S} = (\begin{matrix} 0 & {\hat{S}}_{(1, 2)} & \dots & {\hat{S}}_{(1, k - 1)} & {\hat{S}}_{(1, k)} \\ - {\hat{S}}_{(1, 2)}^{⊤} & 0 & \dots & {\hat{S}}_{(2, k - 1)} & {\hat{S}}_{(2, k)} \\ ⋱ \\ - {\hat{S}}_{(1, k - 1)}^{⊤} & - {\hat{S}}_{(2, k - 1)}^{⊤} & \dots & 0 & {\hat{S}}_{(k - 1, k)} \\ - {\hat{S}}_{(1, k)}^{⊤} & - {\hat{S}}_{(2, k)}^{⊤} & \dots & - {\hat{S}}_{(k - 1, k)}^{⊤} & 0 \end{matrix})

(26)

is skew-symmetric, with blocks ${\hat{S}}_{(j, l)} = - {\hat{S}}_{(l, j)}^{⊤} \in ℝ^{m_{j} \times m_{l}}$ for 1 ≤ j < l ≤ k, and zero (m_j × m_j)-blocks on the diagonal, with $(m^{2} - \sum_{i = 1}^{k} m_{i}^{2}) / 2$ free parameters. The subgroup

C_{\bar{γ}} = {exp (\hat{S}) : \hat{S} has the skew-symmetric structure (26)}

is a complement subgroup of 𝒦_γ̄ in 𝒪(m).

In dimension m = 3, R̂ = exp(Ŝ) is a clockwise rotation by an angle $θ = \sqrt{{\hat{S}}_{23}^{2} + {\hat{S}}_{13}^{2} + {\hat{S}}_{12}^{2}}$ around the unit vector u = (Ŝ₂₃,− Ŝ₁₃, Ŝ₁₂)/θ. The matrix exponential exp(dŜ) of an infinitesimal 3 × 3 skew-symmetric matrix is the composition of three infinitesimal rotations around the Cartesian axes x, y, z by the Euler angles dŜ₂₃ (roll), dŜ₁₃ (pitch), and dŜ₁₂ (yaw), respectively, which commute up to infinitesimals of higher order.

Theorem 4.5

In the settings of Theorem 4.1, let

{\bar{O}}^{⊤} O^{(n)} = R^{(n)} = {\overset{ˇ}{R}}^{(n)} {\hat{R}}^{(n)} = {\overset{ˇ}{R}}^{(n)} exp ({\hat{S}}^{(n)}),

with Ř⁽ⁿ⁾ ∈ 𝒦_γ̄ and Ŝ⁽ⁿ⁾ skew-symmetric. The blocks ${\overset{ˇ}{R}}_{(i, i)}^{(n)}$ ,i = 1,…, k, corresponding to the D̄ eigenspaces are asymptotically distributed according to the product of the Haar measures on the respective orthogonal groups 𝒪(m_i), with the constraint ŌŘ⁽ⁿ⁾ ∈ 𝒪(m)⁺, and asymptotically independent from the eigenvalue fluctuations.

After rescaling, the entries ( $\sqrt{a^{(n)}} {\hat{S}}_{i j}^{(n)} : {\bar{γ}}_{i} > {\bar{γ}}_{j}$ ) are asymptotically mutually independent, and independent of Ř⁽ⁿ⁾ and the eigenvalue fluctuations, with limiting Gaussian distribution

N (0, \frac{1}{4 {({\bar{γ}}_{i} - {\bar{γ}}_{j})}^{2}}) .

Remark

Theorem 4.5 extends Theorem 4.1 in [37] for D̄ with nonnegative distinct eigenvalues, given also in [35], to the case with repeated eigenvalues.

4.4. Second-order approximation of the HCIZ integral

Theorem 4.6

Let γ, γ̄ ∈ ℝ^m be ordered vectors such that the coordinates (γ₁ > γ₂ > · · · > γ_m) are distinct, while the γ̄ coordinates may coincide, with multiplicities m_i = (ℓ_i− ℓ_i₋₁) and

{\bar{γ}}_{1} = \dots = {\bar{γ}}_{ℓ_{1}} > {\bar{γ}}_{ℓ_{1} + 1} = \dots = {\bar{γ}}_{ℓ_{2}} > \dots > {\bar{γ}}_{ℓ_{k - 1} + 1} = \dots = {\bar{γ}}_{ℓ_{k}}

for 0 = ℓ₀ < ℓ₁ < · · · < ℓ_k = m, 1 ≤ k ≤ m. Then, as n→∞,

lim_{n \to \infty} I_{m} (n γ, \bar{γ}) exp (- n γ \cdot \bar{γ}) n^{(m^{2} - \sum_{i = 1}^{k} m_{i}^{2}) / 4} = \frac{\prod_{l = 1}^{m} Γ (l / 2)}{\prod_{i = 1}^{k} \prod_{l = 1}^{m_{i}} Γ (l / 2)} \prod_{i = 1}^{k - 1} \prod_{j = ℓ_{i - 1} + 1}^{ℓ_{i}} \prod_{h = ℓ_{i} + 1}^{m} {[(γ_{j} - γ_{h}) ({\bar{γ}}_{j} - {\bar{γ}}_{h})]}^{- 1 / 2} .

(27)

Remark 4.7

Theorem 4.6 was proven by [2] (see also [33, Theorem 9.5.2]) in the case of nonnegative eigenvalues without multiplicities.

5. Testing the sphericity hypothesis

In DTI, it is often desirable to establish different symmetries of the underlying tensor field. One of the often-used tests is that of isotropy of the underlying mean diffusion tensor [6, 17]. Here we also develop one such test and call it a test of sphericity to avoid confusion with the “isotropy” of the precision tensor. Consider a sequence of random symmetric matrices D⁽ⁿ⁾ such that $\sqrt{a^{(n)}} (D^{(n)} - \bar{D}) \overset{law}{\to} X$ , where the limit is a zero-mean Gaussian symmetric matrix, D̄ is deterministic, and a⁽ⁿ⁾ →∞ is a scaling sequence. For example, in section 6 the scaling sequence is given by the number of gradients in the DTI measurement. In order to test the sphericity hypothesis

H_{0} : \bar{D} = \bar{γ} Id for some unknown \bar{γ} \in ℝ,

we introduce the sampled eigenvalue central moments

κ_{1} (D) = \frac{1}{m} \sum_{i = 1}^{m} γ_{i} = \frac{1}{m} Tr (D), κ_{r} (D) = \frac{1}{m} \sum_{i = 1}^{m} {(γ_{i} - κ_{1} (D))}^{r} = \frac{1}{m} Tr ({(D - \frac{Tr (D)}{m} Id)}^{r}) = \sum_{q = 0}^{r} (\begin{matrix} r \\ q \end{matrix}) \frac{{(- 1)}^{q}}{m^{q + 1}} Tr (D^{r - q}) Tr {(D)}^{q}, 2 \leq r \in ℕ,

where γ_i are the eigenvalues of D.

Lemma 5.1

κ_r(D) is a homogeneous polynomial of degree r in the matrix entries, satisfying for all c ∈ ℝ

κ_{1} (D + c I d) = κ_{1} (D) + c, κ_{r} (D + c I d) = κ_{r} (D), r \geq 2.

(28)

This implies that the derivatives satisfy ∇^ℓκ_r(Id) = 0 for all 0 ≤ ℓ < r, while ∇^rκ_r(D) = ∇^rκ_r(0) are constant tensors such that

κ_{r} (D) = \frac{1}{r!} \nabla^{r} κ_{r} (0) \underset{r - times}{\underset{︸}{D \otimes \dots \otimes D}}, Tr (\nabla^{ℓ} κ_{r} (D)) = 0 \forall r \geq 2, 1 \leq ℓ \leq r .

Corollary 5.2

Let D⁽ⁿ⁾ be a sequence of m × m symmetric random matrices, and let X be a zero mean symmetric Gaussian matrix such that for some γ̄ ∈ ℝ and scaling sequence a⁽ⁿ⁾ →∞,

\sqrt{a^{(n)}} (D^{(n)} - \bar{γ} I d) \overset{law}{\to} X .

Then

(\sqrt{a^{(n)}} (D^{(n)} - \bar{γ} I d), {(a^{(n)})}^{r / 2} κ_{r} (D^{(n)}) : 2 \leq r \leq m) \overset{law}{\to} (κ_{r} (X) : 1 \leq r \leq m) .

When the covariance of X is isotropic, (κ_r(X) : 2 ≤ r ≤ m) are stochastically independent from κ₁(X).

Proof

For the first statement we apply the continuous mapping theorem together with (28). If X has zero-mean isotropic Gaussian distribution, the conditional distribution of (X −κ₁(X)Id) given κ₁(X) is also zero-mean isotropic Gaussian and does not depend on the value of κ₁(X).

To test the sphericity hypothesis with γ̄ ≠ 0, it is natural to use statistics of the form

τ^{(n)} = τ (κ_{1} (D^{(n)}), {(a^{(n)})}^{r / 2} κ_{r} (D^{(n)}) : 2 \leq r \leq m)

and calibrate the test against the distribution of

τ^{(\infty)} = τ (c, κ_{r} (X) : 2 \leq r \leq m),

(29)

evaluated at c = κ₁(D⁽ⁿ⁾). However, without additional assumptions on the covariance structure of X, the probability density functions of κ_r(X) for r ≥ 2 do not have closed-form expressions and can be computed only numerically, for example, by Monte Carlo simulations. Note also that, since ∇^ℓκ_r(Id) = 0 for all r ≥ 2, 0 ≤ ℓ < r, we are dealing with a singular hypothesis testing problem [19, 20, 43], where the constraints {κ_r(D̄ ) = 0, r ≥ 2} for which we are testing are singular at the true parameter D̄ = γ̄ Id; consequently, any smooth sphericity statistics τ⁽ⁿ⁾ will follow non-Gaussian higher order asymptotics. We proceed now in dimension m = 3, assuming that the Gaussian matrix limit X has zero-mean and isotropic precision matrix A(1, λ) with λ > −2/3, to explicitly compute the asymptotic density of some commonly used sphericity statistics based on eigenvalues sample mean, variance, and skewness.

Lemma 5.3

In the settings of Theorem 4.1, under the sphericity hypothesis H₀, the test statistics

τ_{1}^{(n)} = \sqrt{a^{(n)}} (κ_{1} (γ^{(n)}) - κ_{1} (\bar{γ})), τ_{2}^{(n)} = 6 a^{(n)} κ_{2} (γ^{(n)}), τ_{3}^{(n)} = \sqrt{2} κ_{3} (γ^{(n)}) κ_{2} {(γ^{(n)})}^{- 3 / 2}

(30)

are asymptotically independent, with limiting distributions

τ_{1}^{(n)} \overset{law}{\to} N (0, 1 / (6 + 9 λ)), τ_{2}^{(n)} \overset{law}{\to} χ_{5}^{2}, τ_{3}^{(n)} \overset{law}{\to} Uniform ([- 1, 1]) .

(31)

In dimension m,

((2 m + λ m^{2}) {κ_{1} (γ^{(n)}) - κ_{1} (\bar{γ})}^{2}, 2 m κ_{2} (γ^{(n)})) a^{(n)} \overset{law}{\to} (χ_{1}^{2}, χ_{(m + 1) m / 2 - 1}^{2}),

with asymptotically independent components.

Proof

We start from the asymptotic eigenvalue density (11), which under H₀ is given by

q_{\bar{γ}} (γ_{1}, γ_{2}, γ_{3}) = \frac{4 μ^{5 / 2} \sqrt{2 μ + 3 λ}}{π} V (γ_{1}, γ_{2}, γ_{3}) exp (- \frac{(6 μ + 9 λ)}{2} {(κ_{1} (γ) - κ_{1} (\bar{γ}))}^{2} - μ \sum_{i = 1}^{3} {(γ_{i} - κ_{1} (γ))}^{2}),

and we apply the continuous mapping theorem [42] to the smooth bijection

(γ_{1}, γ_{2}, γ_{3}) \mapsto (κ_{1}, κ_{2}, κ_{3}), with Jacobian [\frac{\partial κ_{i}}{\partial γ_{j}}] = (\begin{array}{l} 1 / 3 & (γ_{1} - κ_{1}) 2 / 3 & {(γ_{1} - κ_{1})}^{2} - κ_{2} \\ 1 / 3 & (γ_{2} - κ_{1}) 2 / 3 & {(γ_{2} - κ_{1})}^{2} - κ_{2} \\ 1 / 3 & (γ_{3} - κ_{1}) 2 / 3 & {(γ_{3} - κ_{1})}^{2} - κ_{2} \end{array}) satisfying det ([\frac{\partial κ_{i}}{\partial γ_{j}}]) = V (γ) 2 / 9.

By changing variables, the Vandermonde determinant cancels out, and the resulting joint central moments density is given by

q (κ_{1}, κ_{2}, κ_{3}) = \frac{18 μ^{5 / 2} \sqrt{2 μ + 3 λ}}{π} exp (- \frac{(6 μ + 9 λ)}{2} {(κ_{1} - κ_{1} (\bar{γ}))}^{2} - 3 μ κ_{2}) .

It follows by an optimization argument that the support of the κ₃ conditional distribution given κ₂ is the interval [ $- κ_{2}^{3 / 2} / \sqrt{2}, κ_{2}^{3 / 2} / \sqrt{2}$ ]. We do a further change of variables, setting κ′ = (κ₁, κ₂, τ₃) with $τ_{3} = κ_{3} κ_{2}^{- 3 / 2} \sqrt{2}$ , obtaining

q (κ_{1}, κ_{2}, τ_{3}) = \sqrt{\frac{6 μ + 9 λ}{2 π}} exp (- \frac{6 μ + 9 λ}{2} {(κ_{1} - κ_{1} (\bar{γ}))}^{2}) d κ_{1} \times 1 (κ_{2} \geq 0) \frac{{(3 μ)}^{5 / 2}}{Γ (5 / 2)} exp (- 3 μ κ_{2}) κ_{2}^{3 / 2} d κ_{2} \times 1 (∣ τ_{3} ∣ \leq 1) \frac{d τ_{3}}{2},

(32)

which factorizes as the distribution of independent random variables κ₁ ~ 𝒩(κ₁(γ̄), 1/(6μ + 9λ), $κ_{2} ~ (χ_{5}^{2} / (6 μ))$ and τ₃ uniformly distributed on [−1, 1].

Related ellipticity and sphericity measures are fractional anisotropy [7]

FA = \sqrt{\frac{3 Tr (D^{2}) - Tr {(D)}^{2}}{2 Tr (D^{2})}} = \sqrt{\frac{3 κ_{2}}{2 (κ_{1}^{2} + κ_{2})}},

relative anisotropy [10]

RA = \sqrt{\frac{3 Tr (D^{2}) - Tr {(D)}^{2}}{Tr {(D)}^{2}}} = \frac{\sqrt{κ_{2}}}{∣ κ_{1} ∣},

and volume ratio [36]

VR = 27 \frac{det (D)}{Tr {(D)}^{3}} = \frac{γ_{1} γ_{2} γ_{3}}{κ_{1} {(γ)}^{3}}, where γ_{1} γ_{2} γ_{3} = κ_{3} (γ) + κ_{1} {(γ)}^{3} - \frac{3}{2} κ_{1} (γ) κ_{2} (γ) .

Corollary 5.4

In the settings of Theorem 4.1 with dimension m = 3, under the sphericity hypothesis H₀, there are two possible asymptotic regimes:

When D̄ = 0, the sequence of statistics
$(FA (γ^{(n)}), RA (γ^{(n)}), (1 - VR (γ^{(n)})), {(τ_{1}^{(n)})}^{2}, τ_{2}^{(n)}, τ_{3}^{(n)})$ (33)

converges jointly in distribution to the random vector
$(\sqrt{\frac{3 χ_{2}^{5}}{2 χ_{5}^{2} + χ_{1}^{2} 12 / (9 λ + 6)}}, \sqrt{\frac{(3 λ + 2)}{2} \frac{χ_{5}^{2}}{χ_{1}^{2}}}, \frac{(9 λ + 6)}{4} \frac{χ_{5}^{2}}{χ_{1}^{2}} - {(3 λ + 2) \frac{χ_{5}^{2}}{χ_{1}^{2}}}^{3 / 2} \frac{U}{4}, \frac{χ_{1}^{2}}{9 λ + 6}, χ_{5}^{2}, U),$

with independent $χ_{1}^{2}, χ_{5}^{2}$ and U ~ Uniform[−1, 1].
Otherwise, the rescaled statistics
$τ_{4}^{(n)} = 2 \sqrt{a^{(n)}} ∣ κ_{1} (γ^{(n)}) ∣ FA (γ^{(n)}) ≃ 2 \sqrt{a^{(n)}} ∣ κ_{1} (\bar{γ}) ∣ FA (γ^{(n)}),$ (34)

$τ_{5}^{(n)} = 4 a^{(n)} k_{1} {(γ^{(n)})}^{2} (1 - VR (γ^{(n)})) ≃ 4 a^{(n)} k_{1} {(\bar{γ})}^{2} (1 - VR (γ^{(n)})),$ (35)

$τ_{6}^{(n)} = - 4 a^{(n)} k_{1} {(γ^{(n)})}^{2} log | VR (γ^{(n)}) | ≃ - 4 a^{(n)} k_{1} {(\bar{γ})}^{2} log | VR (γ^{(n)}) |$ (36)

are asymptotically equivalent, with
$| {(τ_{4}^{(n)})}^{2} - τ_{2}^{(n)} | \overset{P}{\to} 0, | τ_{5}^{(n)} - τ_{6}^{(n)} | \overset{P}{\to} 0, and | τ_{5}^{(n)} - τ_{2}^{(n)} | \overset{P}{\to} 0$

in probability, and $τ_{2}^{(n)} \overset{law}{\to} χ_{5}^{2}$ .

Remark 5.5

Corollary 5.4 generalizes Theorem 8.3.7 in [33] on VR asymptotics without positivity assumptions. In order to use the VR statistics to test the isotropy of the mean D̄, one should first test the hypothesis κ₁(γ̄) = 0, under which

(9 λ + 6) a^{(n)} κ_{1} {(γ^{(n)})}^{2} \overset{law}{\to} χ_{1}^{2} .

If this hypothesis is accepted, we assume that we are in the asymptotic regime (1) and construct a conditional sphericity test by using the conditional distribution of VR(γ⁽ⁿ⁾) given {a⁽ⁿ⁾κ₁(γ⁽ⁿ⁾)² = t}, which converges in distribution to the law of

1 + {(\frac{χ_{5}^{2}}{3 t})}^{3 / 2} \frac{U}{4} - \frac{χ_{5}^{2}}{4 t},

with $χ_{5}^{2}$ independent from U ~ Uniform[−1, 1]. If the hypothesis κ₁(γ̄) = 0 is rejected, we use the rescaled VR statistics $τ_{5}^{(n)}$ in (35).

Eigenvalue central moment statistics have been considered earlier in the DTI literature, the distribution of Tr(D) for D isotropic Gaussian is derived in [11], the variance is discussed in [7, 44, 37], and skewness is explored in [8]. Note that under H₀, the limit laws of $τ_{2}^{(n)}, τ_{3}^{(n)}$ are parameter free. However, evaluating $τ_{2}^{(n)}$ requires knowledge of the scaling sequence normalization, while evaluating $τ_{3}^{(n)}$ does not. $∣ τ_{3}^{(n)} ∣$ can be used as two-sided test statistics, accepting the sphericity hypothesis with confidence level α when $∣ τ_{3}^{(n)} ∣ \in ((1 - α) / 2, (1 + α) / 2)$ . The left-tail rejection region corresponds to the anomalous situation, with $(γ_{1}^{(n)} - γ_{2}^{(n)}) ≃ (γ_{2}^{(n)} - γ_{3}^{(n)})$ , and the right tail corresponds to $γ_{1}^{(n)} ≃ γ_{2}^{(n)} ≫ γ_{3}$ or $γ_{1}^{(n)} ≫ γ_{2}^{(n)} ≃ γ_{3}^{(n)}$ . We can test for symmetries with a sequence of confidence levels $p^{(n)} = ℙ (χ_{5}^{2} < c^{(n)})$ , with c⁽ⁿ⁾ →∞ and c⁽ⁿ⁾/a⁽ⁿ⁾ → 0, and construct an asymptotically superefficient eigenvalue estimator γ̂⁽ⁿ⁾ as follows:

If κ₂(γ⁽ⁿ⁾)< c⁽ⁿ⁾/(6a⁽ⁿ⁾), accept the isotropy hypothesis and set ${\hat{γ}}_{1}^{(n)} = {\hat{γ}}_{2}^{(n)} = {\hat{γ}}_{3}^{(n)} = κ_{1} (γ^{(n)})$ ;
else if
${(γ_{1}^{(n)} - γ_{2}^{(n)})}^{2} a^{(n)} < - 2 log (1 - p^{(n)}),$

accept the oblate tensor hypothesis and set ${\hat{γ}}_{1}^{(n)} = {\hat{γ}}_{2}^{(n)} = (γ_{1}^{(n)} + γ_{2}^{(n)}) / 2 > {\hat{γ}}_{3}^{(n)} = γ_{3}^{(n)}$ ;

else if
${(γ_{2}^{(n)} - γ_{3}^{(n)})}^{2} a^{(n)} < - 2 log (1 - p^{(n)}),$

accept the prolate diffusion tensor hypothesis and set ${\hat{γ}}_{1}^{(n)} = γ_{1}^{(n)} < {\hat{γ}}_{2}^{(n)} = {\hat{γ}}_{3}^{(n)} = (γ_{2}^{(n)} + γ_{3}^{(n)}) / 2$ ;
otherwise, reject the hypothesis that the tensor has symmetries and use the unmodified estimator γ̂⁽ⁿ⁾ = γ⁽ⁿ⁾.

The situation with mean matrix D̄ = 0 arises in two-sample problems. Consider two m × m symmetric random matrices D′,D″, which are measured with independent and isotropic Gaussian noises, with precision matrices A(μ, λ) and A(μ, λ) and means D̄′, D̄″, respectively. Their difference D = (D′ − D″) is again symmetric Gaussian with mean D̄ = (D̄′ − D̄″) and isotropic precision matrix A(μ, λ), with parameters

μ = \frac{μ^{'} μ^{″}}{μ^{'} + μ^{″}}, λ = \frac{2 α μ}{μ^{'} + μ^{″} - m α}, α = \frac{λ^{'} μ^{″}}{2 μ^{'} + m λ^{'}} + \frac{λ^{″} μ^{'}}{2 μ^{″} + m λ^{″}} .

In order to test the hypothesis D̄′ = D̄″, one could use the statistics

{2 m μ κ_{2} (D) + (2 m μ + λ m^{2}) κ_{1} {(D)}^{2}} ~ χ_{(m + 1) m / 2}^{2} .

(37)

Testing equality in distribution of two sample matrix eigenvalues and eigenvectors separately was discussed in [37], under the hypothesis of asymptotically Gaussian and isotropic error, and was generalized in [38] to nonisotropic error covariances.

6. Asymptotic statistics in DTI under Rician noise

We consider an ideal DTI-experiment with measurements following the Rician likelihood,

p_{S, η^{2}} (Y) = \frac{Y}{η^{2}} exp (- \frac{Y^{2} + S^{2}}{2 η^{2}}) I_{0} (Y S / η^{2}),

(38)

where S is the signal, Y is the observation, η² is the noise parameter, and I_ℓ(z) is the modified Bessel function of the first kind of order ℓ. The signal is determined by the second-order tensor model

S = S (g, D) = ρ exp (- {gDg}^{⊤}), ρ > 0, g \in ℝ^{3}, D \in ℝ^{3 \times 3},

(39)

where D is the (symmetric) diffusion tensor, ρ is the unweighted reference signal, and g is the applied magnetic field gradient. The function g ↦ S(g,D)/ρ is interpreted as the Fourier transform of the displacement distribution of a water molecule undergoing Gaussian diffusion in a unit time interval, and the problem is to estimate the diffusion tensor D from the noisy spectral measurements Y. For fixed ρ and η² we denote the log-likelihood of D as

L (D) = log (p_{S, η^{2}} (Y)) .

The observed information with respect to the tensor parameter D is given by

J_{o} (D) = - {[\frac{\partial^{2} L (D)}{\partial D_{i j} \partial D_{l r}}]}_{i \leq j, l \leq r} = \frac{S^{2}}{η^{2}} (2 + \frac{Y^{2}}{η^{2}} {\frac{I_{1} {(S Y / η^{2})}^{2}}{I_{0} {(S Y / η^{2})}^{2}} - 1}) {[(2 - δ_{i j}) (2 - δ_{l r}) g (i) g (j) g (l) g (r)]}_{i \leq j, l \leq r},

and the Fisher information is obtained by integrating out the data Y with respect to (38) under the signal model (39) with tensor parameter D, as

J (D) = E_{D} (J_{o} (D)) = E_{D} {([\frac{\partial L (D)}{\partial D_{i j}} \frac{\partial L (D)}{\partial D_{l r}}])}_{i \leq j, l \leq r} = w (S / η) {[(2 - δ_{i j}) (2 - δ_{l r}) g (i) g (j) g (l) g (m)]}_{i \leq j, l \leq r},

(40)

depending on the signal to noise ratio (SNR) S/η of the complex Gaussian error model through the weight function

w (z) = \frac{exp (- z^{2} / 2)}{z^{2}} \int_{0}^{\infty} x^{3} exp (- \frac{x^{2}}{2 z^{2}}) \frac{I_{1} {(x)}^{2}}{I_{0} (x)} d x - z^{4} \geq 0;

see [27]. Note that necessarily, J_ij,ij(D) = 4J_ii,jj(D) for all 1 ≤ j < i ≤ 3. By replacing the Rician density (38) with another likelihood which is a function of the SNR, we always obtain Fisher information of the form (40), with a different weight function.

We now consider a sequence of DTI-experiments, with measurements ( $Y_{k}^{(n)} : k = 1, \dots, M^{(n)}$ ) from respective signals ( $S_{k}^{(n)} : k = 1, \dots, M^{(n)}$ ), corresponding to the gradients $(g_{k}^{(n)} : k = 1, \dots, M^{(n)}) \subset ℝ^{3}$ , and denote the scaled Fisher information as

J^{(n)} (D) = \frac{1}{M^{(n)}} {[(2 - δ_{i j}) (2 - δ_{l r}) \sum_{k = 1}^{M^{(n)}} w (S_{k}^{(n)} / η) g_{k}^{(n)} (i) g_{k}^{(n)} (j) g_{j}^{(n)} (l) g_{k}^{(n)} (r)]}_{i \leq j, l \leq r} .

(41)

Assume that M⁽ⁿ⁾ → ∞ and that the sequence of discrete gradient distributions

π^{(n)} (d g) = \frac{1}{M^{(n)}} \sum_{k = 1}^{M^{(n)}} 1 (g_{k}^{(n)} \in d g)

converges weakly to a probability π on ℝ³, which implies

lim_{n \to \infty} J^{(n)} (D) = J^{(\infty)} (D) = {[(2 - δ_{i j}) (2 - δ_{l r}) \int_{ℝ^{3}} w (exp (- {gDg}^{⊤}) ρ / η) g (i) g (j) g (l) g (r) π (d g)]}_{i \leq j, l \leq r} .

(42)

Let D⁽ⁿ⁾ be a regular statistical estimator of the tensor parameter such as, for example, the maximum likelihood estimator (MLE), the penalized MLE, the Bayesian maximum a posteriori estimator (MAP), or the posterior mean, based on the data ( $Y_{k}^{(n)} : 1 \leq k \leq M^{(n)}$ ) with gradients ( $g_{k}^{(n)} : 1 \leq k \leq M^{(n)}$ ). When 0 < det(J^(∞)) < ∞, under the tensor model with true parameter D̄, all these regular estimators are consistent with asymptotically Gaussian error such that

\sqrt{M^{(n)}} (D^{(n)} - \bar{D}) \overset{law}{\to} X ~ N (0, {(J^{(\infty)} (\bar{D}))}^{- 1}) .

(43)

6.1. Isotropic Gaussian limit error distribution

When J^(∞)( D̄) = A(μ̄, μ̄) as in (4) for some μ̄ > 0, the Gaussian limit distribution (43) is isotropic. In such a case, Theorem 4.1, Corollary 4.3, and Lemma 5.3 apply with a⁽ⁿ⁾ = μ̄M⁽ⁿ⁾ and λ = 1. When the true tensor D̄ = γ̄ I is isotropic and the asymptotic gradient design distribution π(dg) is radially symmetric, asymptotic isotropy is achieved with

J^{(\infty)} (\bar{D}) = (\int_{0}^{\infty} w (exp (- \bar{γ} b) ρ / η) ν (d b)) {[(2 - δ_{i j}) (2 - δ_{l r}) \int_{S^{2}} u (i) u (j) u (l) u (r) σ (d u)]}_{i \leq j, l \leq r} = A (1, 1) \bar{μ}, \bar{μ} = (\int_{0}^{\infty} w (exp (- \bar{γ} b) ρ / η) ν (d b)) / 15,

(44)

where b = ||g||², referred to as the b-value, is integrated with respect to

ν (d b) = π ({g : {‖ g ‖}^{2} \in d b}),

and u = g/||g|| has uniform distribution σ(du) on the surface of the unit sphere 𝒮² = {u ∈ ℝ³: ||u|| = 1}. A more general condition implying (44) is the following: the asymptotic gradient design distribution decomposes as

π (d g) = ν (d b) s (d u ∣ b),

(45)

where for ν-almost all b-values, the conditional probability on 𝒮² is such that

\int_{S^{2}} f (u) s (d u ∣ b) = \int_{S^{2}} f (u) σ (d u)

(46)

for all homogeneous polynomials f(u₁, u₂, u₃) of degree t = 4.

Proposition 6.1

When the true diffusion tensor D̄ is isotropic, the uniform gradient distribution σ(du) maximizes det(J) among all probability distributions on the unit sphere.

Proof

When J is invertible, we have [30, Theorem 8.1]

d log det (J) = Tr (J^{- 1} d J), d^{2} log det (J) = - Tr (J^{- 1} {dJJ}^{- 1} d J) \leq 0,

(47)

which implies that the function J ↦ log det(J) ∈ ℝ ∪ {−∞} is concave, and that a local maximum is also a global maximum. Let ν(du) be a probability measure on 𝒮², and consider a small perturbation of the uniform measure σ in the direction ν. By taking the differential using (47), we obtain

lim_{ε \to 0 +} \frac{log det J ((1 - ε) σ + ε ν) - log det J (σ)}{ε} = \int_{S^{2}} (\sum_{i \leq j, l \leq r} J_{i j, l r}^{- 1} (σ) (2 - δ_{i j}) (2 - δ_{l r}) u_{i} u_{j} u_{l} u_{r}) (ν - σ) (d u) = 0,

(48)

where, since J⁻¹(σ) is also isotropic, for every u, v ∈ 𝒮² we have

\sum_{i \leq j, l \leq r} J_{i j, l r}^{- 1} (σ) (2 - δ_{i j}) (2 - δ_{l r}) u_{i} u_{j} u_{l} u_{r} = \sum_{i \leq j, l \leq r} J_{i j, l r}^{- 1} (σ) (2 - δ_{i j}) (2 - δ_{l r}) v_{i} v_{j} v_{l} v_{r},

and the integrand in (48) is constant, which means that det (J(σ)) is a global maximum.

This shows that when the true tensor D̄ is isotropic, asymptotically uniform gradient designs are most informative, minimizing the Gaussian entropy of the asymptotic estimation error

H (J^{(\infty)}) = const. - log (det (J^{(\infty)})) / 2.

In the next section, we introduce discrete gradient distributions which attain the same bound.

6.2. Spherical t-designs in diffusion tensor imaging

A spherical t-design ϒ ⊂ 𝒮^m⁻¹ is a finite subset of m-dimensional unit vectors with the property

\int_{S^{m - 1}} f (u) σ (d u) = \frac{1}{# ϒ} \sum_{v \in ϒ} f (v)

(49)

for all polynomials f(u₁, …, u_m) of degree r ≤ t, where σ is the uniform probability measure on 𝒮^m⁻¹, and #ϒ is the number of points in ϒ. In other words, a spherical t-design is a quadrature rule on S^m⁻¹ with constant weights. The algebraic theory behind such designs is deep and beautiful [18]; for a recent survey see [4, 1]. In particular, in dimension m = 3, spherical t-designs of order t ≥ 4 satisfy (46). A database of spherical t-designs on 𝒮² computed by Rob Womersley is available on his webpage http://web.maths.unsw.edu.au/~rsw/Sphere/EffSphDes/. Table 1 displays the sizes of these designs, and Figure 1 shows a spherical t-design of order 4 with 14 gradients from Womersley’s database.

Table 1.

Number n_a of points in some known antipodal spherical t-designs of order 4 ≤ t ≤ 17 in 𝒮², computed by Rob Womersley, while n is for his nonantipodal spherical t-designs.

Size of Spherical t-Designs
t	4	5	6	7	8	9	10	11	12	13	14	15	16	17
n_a	-	12	-	32	-	48	-	70	-	94	-	120	-	156
n	14	18	26	32	42	50	62	72	86	98	114	128	146	163

Open in a new tab

A nonantipodal spherical t-design of order 4, with 14 gradients, by Rob Womersley.

When ϒ = −ϒ, we say that the spherical design is antipodal. Two well-known examples (see [10, 13]) are the regular icosahedron and its dual, the regular dodecahedron, whose vertices form antipodal spherical t-designs of order 5 with sizes 12 and 20, respectively. Note that any two antipodal gradients produce the same DTI-signal. Starting from an antipodal spherical t-design ϒ and selecting one gradient from each antipodal pair {u,−u} ⊂ ϒ, we obtain a design ϒ′ of size #ϒ′ = #ϒ/2 which satisfies (49) for all homogeneous polynomials of even degree ≤ t. Figures 2 and 3 show, respectively, the intersection of the northern hemisphere with the regular icosahedron and the dodecahedron, forming gradient designs of sizes 6 and 10, which satisfy (49) for all homogeneous polynomials of degrees 2 and 4.

Gradient design based on the icosahedron with six gradients on the northern hemisphere.

Gradient design based on the dodecahedron with 10 gradients.

In the DTI-experiment, for a finite subset of b-values $0 \leq b_{1}^{(n)} \leq \dots \leq b_{n}^{(n)}$ and respective spherical t-designs $ϒ_{ℓ}^{(n)}$ of order $t_{ℓ}^{(n)} \geq 4$ , we construct the gradient set as the union of shells

G^{(n)} = \cup_{ℓ = 1}^{n} ϒ_{ℓ}^{(n)} \sqrt{b_{ℓ}^{(n)}} \subset ℝ^{3} .

The resulting gradient distribution

π^{(n)} (B) = \frac{# (G^{(n)} ⋀ B)}{# G^{(n)}}, B \subseteq ℝ^{3},

satisfies (45), and when the true tensor D̄ = γ̄ I is totally symmetric, we have

J^{(n)} (\bar{D}) = {\bar{μ}}^{(n)} A (1, 1),

with

{\bar{μ}}^{(n)} = \frac{1}{15 # G^{(n)}} \sum_{ℓ = 1}^{n} w (exp (- \bar{γ} b_{ℓ}^{(n)}) ρ / η) # ϒ_{ℓ}^{(n)};

i.e., the Fisher information coincides with the precision matrix of an isotropic Gaussian matrix distribution. When ϒ ⊂ S² is a spherical t-design and O ∈ SO(3) is a rotation matrix, the rotated design Oϒ is a spherical t-design as well. Since the true tensor D̄ is unknown (and possibly not isotropic), in practice it is advisable to choose the gradient directions covering 𝒮² as uniformly as possible. To achieve this, different t-designs can be rotated with respect to one another in order to maximize the spread between gradient directions. Namely, starting from a collection of spherical t-designs $ϒ_{1}^{0}, \dots, ϒ_{n}^{0}$ of respective orders t_k, 1 ≤ k ≤ n, we find the optimized design $ϒ_{k}^{(n)} = O_{k}^{*} ϒ_{k}^{0}$ , 1 ≤ k ≤ n, where $O_{1}^{*}, \dots, O_{n}^{*}$ are rotation matrices maximizing

max_{O_{1}, \dots, O_{n} \in S O (3)} min_{1 \leq k < l \leq n} {dist (O_{k} ϒ_{k}^{0}, O_{l} ϒ_{l}^{0})},

(50)

with dist(U, V ) = sup_u_∈_U,v_∈_V dist(u, v), and dist(u, v) is the geodesic distance on 𝒮². The maximizer can be achieved by a greedy iterative algorithm, where in turn (50) is optimized with respect to each single O_k keeping fixed the other rotations until convergence to a fixed point. Figure 4 shows a gradient sequence obtained in such a way, with colors corresponding to spherical t-designs on different shells. The benefits of these gradient designs are illustrated in the next section.

Gradient sequence based on combined antipodal spherical t-designs of orders 5 (black), 7 (red), 9 (green), and 11 (blue), of respective sizes 12, 32, 48, and 70. The spherical t-designs on different shells were rotated in order to maximize the minimal geodesic distance (50) between gradients.

7. Illustration of the methods

7.1. Monte Carlo study with isotropic Gaussian noise

Figure 5 shows the results from a Monte Carlo study with a sample of N = 10000 i.i.d. 3×3 symmetric random matrices from the isotropic Gaussian density (1) with precision parameters μ = 1/2, λ = 0 for the following various choices of the diagonal mean matrix:

10000 pairs of distinct eigenvalues of i.i.d. symmetric random matrices with isotropic Gaussian noise (μ = 1/2, λ = 0) with various mean: zero, corresponding to the 3×3 GOE (a), isotropic (b), prolate (c), oblate (d). For comparison we show i.i.d. 2 × 2 GOE eigenvalue pairs (e), and i.i.d. standard Gaussian pairs (f). Within each pair the ordering is randomized to emphasize the repulsion effect around the diagonal.

D̄ = 0, corresponding to the 3 × 3 GOE;
D̄ isotropic, with γ̄₁ = γ̄₂ = γ̄₃ = 15;
D̄ prolate, with γ̄₁ = 15 > γ̄₂ = γ̄₃ = 3;
D̄ oblate, with γ̄₁ = γ̄₂ = 15 > γ̄₃ = 3.

For comparison, in Figure 5e we show i.i.d. eigenvalue pairs from the 2 × 2 Gaussian orthogonal ensemble, and in Figure 5f we show i.i.d. pairs of independent standard Gaussian random variables. The empirical joint eigenvalue distribution avoids the diagonal, in agreement with (9). We see that the fluctuations of the eigenvalues corresponding to the same D̄ eigenspaces around their mean are distributed like the GOE corresponding to the dimension of the eigenspace. One can see also some differences between the GOE eigenvalue distribution in dimension 2 (in Figure 5e, sampled with precision parameters μ = 1/2, λ = 0, which agrees with Figures 5c and 5d), and dimension 3 (in Figure 5a, which agrees with 5b).

Figure 6 shows that in the case with prolate mean matrix, the empirical distribution of the cluster barycenter (γ₂ + γ₃)/2 fits the Gaussian distribution very well.

Histogram and fitted Gaussian curve from 10000 i.i.d. realizations of the cluster barycenter (γ₂ + γ₃)/2 in the prolate mean tensor case.

Figure 7 shows the behavior of the sphericity test statistics τ₂, τ₄, τ₅ under Gaussian matrix distributions, with the same isotropic precision matrix A(2, 2) and different means, namely, a spherical mean tensor and 15 prolate mean tensors, all with the same mean diffusivity κ₁(D̄) = 15, and FA in (0.01, 0.15]. We can see that at this noise level, under the null hypothesis, the distributions of these three test statistics fit the asymptotic $χ_{5}^{2}$ distribution very well, while under prolate alternatives the corresponding sphericity tests have approximately the same power at all significance levels.

Probability densities (left) and cumulative probabilities (right) of the sphericity test statistics τ₂(D), τ₄(D), τ₅(D), where the 3 × 3 symmetric random matrix D is Gaussian with isotropic precision A(2, 2), and there are 16 alternative mean tensors D̄, with fixed mean diffusivity κ₁(D̄) = 15. Under the null hypothesis, D̄ is spherical, while the alternatives correspond to prolate mean tensors with FA in (0.0, 0.15]. For each test statistics, the probability density and cumulative probability curves are labeled by the FA values of the corresponding mean tensors. The broken curves display the $χ_{5}^{2}$ limit distribution under the null hypothesis.

Figure 8b displays on the unit sphere the orthonormal eigenvector triples from the Gaussian model with isotropic noise parameters μ = 1/2, λ = 0, with N = 200 i.i.d. replications. On the left side of the figure, the mean tensor is diagonal and totally anisotropic with γ̄₁ = 15, γ̄₂ = 7.5, γ̄₃ = 3. On the right, the mean tensor is diagonal and oblate, with γ̄₁ = γ̄₂ = 15, γ̄₃ = 3, and the eigenvectors corresponding to the first two eigenvalues are uniformly distributed around the equator.

200 i.i.d. orthonormal eigenvector triples from the Gaussian model with isotropic noise parameters μ = 1/2, λ = 0, with totally asymmetric (left) and oblate (right) diagonal mean tensor, using a graphical construction similar to the one introduced in [9].

7.2. Monte Carlo study of sphericity test statistics based on DTI data with Rician noise

In order to validate the asymptotic results of Lemma 5.3 and Corollary 5.4, we conducted another large Monte Carlo study, with DTI data simulated under the Rician noise model with ground truth parameters η² = 64.056, ρ = 110.046 and isotropic diffusion tensor D̄ = 6.622×10⁻⁴×Id mm²/s. For each of the experimental designs 1–5 below, which have an increasing number of acquisitions, we simulated N = 50000 replications of the dataset, and for each replication we computed the MLE D⁽ⁿ⁾ based on the simulated data by using the Expectation-Maximization algorithm from [29]. The empirical distribution of the sphericity statistics $τ_{2}^{(n)}, τ_{3}^{(n)}$ (30) and $τ_{5}^{(n)}$ (35) with their theoretical limit distributions are displayed correspondingly in Figures 9–13.

Scatterplot of the eigenvalue statistics ( $τ_{2}^{(n)}, ∣ τ_{3}^{(n)} ∣$ ) in (a) and ( $τ_{5}^{(n)}, τ_{3}^{(n)}$ ) in (b), from a Monte Carlo study based on N = 50000 replications of a dataset generated under Design 1, where the true tensor and the Fisher information are isotropic. The histogram density estimators are compared with theoretical limit densities (black continuous curves), which are uniform on the vertical axes and $χ_{5}^{2}$ on the horizontal axes. The best-fitting gamma densities (red broken curves) are also shown, with shape parameter 2.4238 and scale parameter 2.0627 in (a) and with shape parameter 2.4566 and scale parameter 2.0137 in (b).

Design 1: Spherical t-design of order 4 with 14 gradients computed by R. Womersley, shown in Figure 1, with b-value 996 s/mm², and one acquisition at zero b-value, for a total of 15 acquisitions. The corresponding Fisher information is given by
$J^{(n)} (\bar{D}) = {\bar{μ}}^{(n)} A_{3} (1, 1), {\bar{μ}}^{(n)} = 4.63 \times 10^{7} s^{2} / {mm}^{4},$

and the ML estimator vec(D⁽ⁿ⁾) has a Gaussian approximation with mean vec(D̄) and isotropic covariance
$\sum^{(n)} = J^{(n)} {(\bar{D})}^{- 1} = 10^{- 9} \times (\begin{matrix} 8.64 & - 2.16 & - 2.16 & 0 & 0 & 0 \\ - 2.16 & 8.64 & - 2.16 & 0 & 0 & 0 \\ - 2.16 & - 2.16 & 8.64 & 0 & 0 & 0 \\ 0 & 0 & 0 & 5.4 & 0 & 0 \\ 0 & 0 & 0 & 0 & 5.4 & 0 \\ 0 & 0 & 0 & 0 & 0 & 5.4 \end{matrix}) \frac{{mm}^{4}}{s^{2}} .$
Design 2: This design is based on the icosahedron with the six gradients shown in Figure 2 for each b-value in the set {560, 778, 996, 1276, 1556, 1898, 2240} s/mm², and one acquisition at zero b-value, for a total of 43 acquisitions. The corresponding Fisher information is given by
$J^{(n)} (\bar{D}) = {\bar{μ}}^{(n)} A_{3} (1, 1), {\bar{μ}}^{(n)} = 1.323 \times 10^{8} s^{2} / {mm}^{4},$

and the ML estimator vec(D⁽ⁿ⁾) has a Gaussian approximation with mean vec(D̄) and isotropic covariance
$\sum^{(n)} = J^{(n)} {(\bar{D})}^{- 1} = 10^{- 9} \times (\begin{matrix} 3.02 & - 0.76 & - 0.76 & 0 & 0 & 0 \\ - 0.76 & 3.02 & - 0.76 & 0 & 0 & 0 \\ - 0.76 & - 0.76 & 3.02 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1.89 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1.89 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1.89 \end{matrix}) \frac{{mm}^{4}}{s^{2}} .$
Design 3: This design is based on the dodecahedron with the 10 gradients shown in Figure 3 for each b-value in the set
${560, 778, 996, 1276, 1556, 1898, 2240} s / {mm}^{2},$

and one acquisition at zero b-value, for a total of 71 acquisitions. The corresponding Fisher information is given by
$J^{(n)} (\bar{D}) = {\bar{μ}}^{(n)} A_{3} (1, 1), {\bar{μ}}^{(n)} = 2.205 \times 10^{8} s^{2} / {mm}^{4},$

and the ML estimator vec(D⁽ⁿ⁾) has a Gaussian approximation with mean vec(D̄) and isotropic covariance
$\sum^{(n)} = J^{(n)} {(\bar{D})}^{- 1} = 10^{- 9} \times (\begin{matrix} 1.81 & - 0.45 & - 0.45 & 0 & 0 & 0 \\ - 0.45 & 1.81 & - 0.45 & 0 & 0 & 0 \\ - 0.45 & - 0.45 & 1.81 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1.13 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1.13 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1.13 \end{matrix}) \frac{{mm}^{4}}{s^{2}} .$
Design 4: Combination of spherical t-designs of orders 5, 7, 9, and 11 shown in Figure 4 on shells corresponding to the b-values {560, 996, 1556, 2240}, respectively, with one acquisition at zero b-value, for a total of 163 acquisitions. The corresponding Fisher information is given by
$J (D^{(n)}) = {\bar{μ}}^{(n)} A_{3} (1, 1), {\bar{μ}}^{(n)} = 5.263 \times 10^{8} s^{2} / {mm}^{4},$

and the ML estimator vec(D⁽ⁿ⁾) has a Gaussian approximation with mean vec(D̄) and isotropic covariance
$\sum^{(n)} = J^{(n)} {(\bar{D})}^{- 1} = 10^{- 10} \times (\begin{matrix} 7.6 & - 1.9 & - 1.9 & 0 & 0 & 0 \\ - 1.9 & 7.6 & - 1.9 & 0 & 0 & 0 \\ - 1.9 & - 1.9 & 7.6 & 0 & 0 & 0 \\ 0 & 0 & 0 & 4.75 & 0 & 0 \\ 0 & 0 & 0 & 0 & 4.75 & 0 \\ 0 & 0 & 0 & 0 & 0 & 4.75 \end{matrix}) \frac{{mm}^{4}}{s^{2}} .$

Design 5: This design consists of three repetitions of the 32 gradients in Figure 14 for each b-value in

{62, 249, 560, 996, 1556, 2240, 3049, 3982, 5040, 6222, 7529, 8960, 10516, 12196, 14000} s / {mm}^{2},

and three acquisitions at zero b-value, for a total of 1443 acquisitions. The ML estimator vec(D⁽ⁿ⁾) has a Gaussian approximation with mean vec(D̄ ) and nonisotropic covariance

\sum^{(n)} = J^{(n)} {(\bar{D})}^{- 1} = 10^{- 10} \times (\begin{matrix} 6.77 & - 3.24 & - 2.31 & - 0.07 & - 0.08 & 0.21 \\ - 3.24 & 7.04 & - 2.53 & - 0.11 & 0.15 & - 0.05 \\ - 2.31 & - 2.53 & 6.70 & 0.10 & - 0.10 & - 0.59 \\ - 0.07 & - 0.11 & 0.10 & 1.17 & - 0.14 & 0.01 \\ - 0.08 & 0.15 & - 0.10 & - 0.14 & 1.3 & - 0.01 \\ 0.21 & - 0.05 & - 0.59 & 0.01 & - 0.01 & 1.33 \end{matrix}) \frac{{mm}^{4}}{s^{2}} .

(51)

The 32 gradients table used by default with the commercial 3T Philips Achieva MR-scanner.

All scatterplots in Figures 9–13 are consistent with the asymptotic independence of the sphericity statistics $τ_{2}^{(n)}$ and $τ_{5}^{(n)}$ from $τ_{3}^{(n)}$ . When the experimental design is based on spherical t-designs of order t ≥ 4 (Designs 1–4), with isotropic Fisher information, the empirical distributions of $τ_{2}^{(n)}$ and $τ_{5}^{(n)}$ fit the theoretical limit distribution $χ_{5}^{2}$ (Figures 9–12). Design 5 has the largest number of acquisitions and is the most informative of all; however, the Fisher information is not isotropic, and Figure 13 shows that the empirical distributions of $τ_{2}^{(n)}$ and $τ_{5}^{(n)}$ do not fit the $χ_{5}^{2}$ distribution, with the consequence of underestimating the type I error probability of rejecting an isotropic true tensor. We conclude that the distribution of these sphericity statistics is sensitive to anisotropies of the estimation error distribution. As was shown in section 5, these sphericity test statistics should be calibrated against the law of τ (c + κ₁(X), κ₂(X), κ₃(X)), evaluated at c = κ₁(D⁽ⁿ⁾), where X is the zero-mean symmetric Gaussian matrix with covariance (51).

We also remark that in part (b) of Figure 9–11, compared with the uniform density, the histogram estimator of the $τ_{3}^{(n)}$ density shows an increasing linear trend. This linear trend is not as evident in 12, which is based on a larger number of acquisitions, and the distribution of the MLE D⁽ⁿ⁾ is presumably better approximated by a Gaussian than in the previous cases. By taking absolute value $∣ τ_{3}^{(n)} ∣$ , the linear trend cancels out, and the histogram of $∣ τ_{3}^{(n)} ∣$ in part (a) of Figures 9–13 robustly fits the uniform distribution in all the situations we have considered.

8. Conclusion

We have considered the problem of estimating the spectrum γ̄₁ ≥ γ̄₂ ≥ · · · ≥ γ̄_m and the eigenvectors of a real symmetric m×m matrix D̄, possibly nonpositive, by the spectrum and the eigenvectors of a consistent and asymptotically Gaussian matrix estimator D⁽ⁿ⁾, assuming that the covariance of the rescaled limit is isotropic. When D̄ has repeated eigenvalues, the delta method does not apply, and the spectrum of the matrix estimator has a non-Gaussian limit distribution. In the limit, the random eigenvalues $γ_{1}^{(n)} > γ_{2}^{(n)} > \dots > γ_{m}^{(n)}$ of D⁽ⁿ⁾ form clusters corresponding to the D̄ eigenspaces, with jointly Gaussian barycenters. Within each cluster, the differences between eigenvalues and barycenter are independent from the barycenter and the other clusters and follow the conditional law of GOE eigenvalues conditioned on having zero barycenter.

In many applications it is important to detect the symmetries of the true matrix parameter D̄, in particular to test whether D̄ is spherical, which leads to singular hypothesis testing problems. A statistical test against D̄-symmetries needs to be calibrated, taking into account the repulsion between the random eigenvalues of D⁽ⁿ⁾ corresponding to the same D̄ -eigenspace. In dimension m = 3, we derived the asymptotic joint distribution of some commonly used sphericity statistics such as fractional anisotropy (FA), relative anisotropy (RA), and volume ratio (VR) under isotropy assumptions. We have also discussed the implications of these general results for the design and analysis of DTI measurements, and we showed that gradient designs based on spherical t-designs have isotropic Fisher information and are asympotically most informative when the true tensor is spherical. A direct application would be in denoising the FA maps derived from diffusion tensor estimates. Testing for sphericity at each volume element with a fixed confidence level corresponds to an FA cut-off threshold which is not constant over the voxels but depends locally on the estimated noise and mean diffusivity parameters. We have seen in the Monte Carlo study that the simulated sphericity statistics are well fit to their theoretical limit distribution when the Fisher information of the experiment was isotropic. However, there was a significant discrepancy under experimental Design 5, with nonisotropic Fisher information. We conclude that these findings give a strong theoretical argument in favor of using spherical t-designs in DTI, and we plan to conduct similar experiments with real DTI data in the near future. Finally, our work in progress is to generalize this theory to situations in which the covariance of the Gaussian limit matrix has symmetries without being fully isotropic.

Supplementary Material

Appendix

NIHMS906978-supplement-Appendix.pdf^{(299.4KB, pdf)}

Acknowledgments

We thank Konstantin Izyurov, Sangita Kulathinal, Antti Kupiainen, and Juha Railavo for insightful discussions, and we thank the two anonymous reviewers for their valuable questions and remarks.

References

1.An C, Chen X, Sloan IH, Womersley RS. Well conditioned spherical designs for integration and interpolation on the two-sphere. SIAM J Numer Anal. 2010;48:2135–2157. https://doi.org/10.1137/100795140. [Google Scholar]
2.Anderson GA. An asymptotic expansion for the distribution of the latent roots of the estimated covariance matrix. Ann Math Statist. 1965;36:1153–1173. [Google Scholar]
3.Anderson GW, Guionnet AA, Ofer Z. An Introduction to Random Matrices. Cambridge University Press; 2010. [Google Scholar]
4.Bannai E, Bannai E. A survey on spherical designs and algebraic combinatorics on spheres. European J Combin. 2009;30:1392–1425. [Google Scholar]
5.Basser PJ, Mattiello J, LeBihan D. MR diffusion tensor spectroscopy and imaging. Biophys J. 1994;66:259. doi: 10.1016/S0006-3495(94)80775-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Basser PJ, Mattiello J, LeBihan D. Estimation of the effective self-diffusion tensor from the NMR spin echo. J Magnetic Resonance Ser B. 1994;103:247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]
7.Basser PJ. Inferring microstructural features and the physiological state of tissues from diffusion weighted images. NMR Biomed. 1995;8:333–344. doi: 10.1002/nbm.1940080707. [DOI] [PubMed] [Google Scholar]
8.Basser PJ. New histological and physiological stains derived from diffusion-tensor MR-images. Imaging Brain Structure and Function, Ann New York Acad Sci. 1997;820:123–138. doi: 10.1111/j.1749-6632.1997.tb46192.x. [DOI] [PubMed] [Google Scholar]
9.Basser PJ, Pajevic S. Statistical artifacts in diffusion tensor MRI (DT-MRI) caused by background noise. Magnetic Resonance Med. 2000;44:41–50. doi: 10.1002/1522-2594(200007)44:1<41::aid-mrm8>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]
10.Basser PJ, Pajevic S. A normal distribution for tensor-valued random variables: Applications to diffusion tensor MRI. IEEE Trans Med Imaging. 2003;22:785–794. doi: 10.1109/TMI.2003.815059. [DOI] [PubMed] [Google Scholar]
11.Basser PJ, Pajevic S. Dealing with uncertainty in diffusion tensor MR data. Israel J Chem. 2003;43:129–144. [Google Scholar]
12.Basser PJ, Pajevic S. Spectral decomposition of a 4th-order covariance tensor: Applications to diffusion tensor MRI. Signal Process. 2007;87:220–236. [Google Scholar]
13.Batchelor PG, Atkinson D, Hill DLG, Calamante F, Connelly A. Anisotropic noise propagation in diffusion tensor MRI sampling schemes. Magnetic Resonance Med. 2003;49:1143–1151. doi: 10.1002/mrm.10491. [DOI] [PubMed] [Google Scholar]
14.Chattopadhyay AK, Pillai KC. Asymptotic expansions for the distributions of characteristic roots when the parameter matrix has several multiple roots. Multivariate Analysis, III; Proc. Third Internat. Sympos., Wright State Univ; Dayton, OH. 1972; Academic Press; 1973. pp. 117–127. [Google Scholar]
15.Chiani M. Distribution of the largest eigenvalue for real Wishart and Gaussian random matrices and a simple approximation for the Tracy–Widom distribution. J Multivariate Anal. 2014;129:69–81. [Google Scholar]
16.Chikuse Y. Statistics on Special Manifolds. Springer Lecture Notes in Statistics. 2003;174 [Google Scholar]
17.Clement-Spychala ME, Couper D, Zhu H, Muller KE. Approximating the Geisser-Greenhouse sphericity estimator and its applications to diffusion tensor imaging. Stat Interface. 2010;3:81–90. doi: 10.4310/SII.2010.v3.n1.a7. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Delsarte P, Goethals JM, Seidel JJ. Spherical codes and designs. Geom Dedicata. 1977;6:363–388. [Google Scholar]
19.Drton M. Likelihood ratio tests and singularities. Ann Statist. 2009;37:979–1012. [Google Scholar]
20.Drton M, Xiao H. Wald tests of singular hypothesis. Bernoulli. 2016;22:38–59. [Google Scholar]
21.Dyson FJ. A Brownian-motion model for the eigenvalues of a random matrix. J Math Phys. 1962;3:1191–1198. [Google Scholar]
22.Edelman A. PhD Thesis. MIT; 1989. Eigenvalue and Condition Numbers of Random Matrices. [Google Scholar]
23.Farrell RH. Multivariate Calculation Use of the Continuous Groups. Springer; 1985. [Google Scholar]
24.Forrester PJ. London Math Soc Monographs. Princeton University Press; 2010. Log-Gases and Random Matrices. [Google Scholar]
25.Guionnet A. Noncommutative Probability and Random Matrices at Saint-Flour. Springer; 2012. Large random matrices: Lectures on macroscopic asymptotics; pp. 169–463. [Google Scholar]
26.Hikami S, Brézin E. WKB-expansion of the Harish-Chandra-Itzykson-Zuber integral for arbitrary β. Progr Theoret Phys. 2006;116:441–502. [Google Scholar]
27.Idier J, Collewet G. Properties of Fisher Information for Rician Distributions and Consequences in MRI. 2014 preprint hal-01072813. [Google Scholar]
28.James AT. Distributions of matrix variates and latent roots derived from normal samples. Ann Math Statist. 1964;35:475–501. [Google Scholar]
29.Liu J, Gasbarra D, Railavo J. Fast estimation of diffusion tensors under Rician noise by the EM algorithm. J Neurosci Methods. 2016;257:147–158. doi: 10.1016/j.jneumeth.2015.09.029. [DOI] [PubMed] [Google Scholar]
30.Magnus JR, Neudecker H. Matrix Differential Calculus with Applications in Statistics and Econometrics. Wiley; 1999. [Google Scholar]
31.Mallows CL. Latent vectors of random symmetric matrices. Biometrika. 1961;48:133–149. [Google Scholar]
32.Mehta ML. Random Matrices. 3. Elsevier; 2004. [Google Scholar]
33.Muirhead RJ. Aspects of Multivariate Statistics. Wiley; 1984. [Google Scholar]
34.Pajevic S, Basser PJ. Parametric and non-parametric statistical analysis of DT-MRI data. J Magnetic Resonance. 2003;161:1–14. doi: 10.1016/s1090-7807(02)00178-7. [DOI] [PubMed] [Google Scholar]
35.Pajevic S, Basser PJ. A joint PDF for the eigenvalues and eigenvectors of a diffusion tensor. Proc Internat Soc Magnetic Resonance Med. 2010;18:303. [Google Scholar]
36.Pierpaoli C, Infante I, Mattiello J, Di Chiro G, Le Bihan D, Basser PJ. Diffusion tensor imaging of brain white matter anisotropy. Proc Internat Soc Magnetic Resonance Med. 1994;2:1038. [Google Scholar]
37.Schwartzman A, Mascarenhas WF, Taylor JE. Inference for eigenvalues and eigenvectors of Gaussian symmetric matrices. Ann Statist. 2008;36:2886–2919. [Google Scholar]
38.Schwartzman A, Dougherty RF, Taylor JE. Group comparison of eigenvalues and eigenvectors of diffusion tensors. J Amer Statist Assoc. 2010;105:588–598. doi: 10.1198/jasa.2010.ap07291. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Takemura A. Zonal Polynomials. Institute of Mathematical Statistics; 1984. IMS Lecture Notes Monogr. Ser. 4. [Google Scholar]
40.Tao T. Topics in Random Matrix Theory. American Mathematical Society; 2012. [Google Scholar]
41.Tao T. The Harish-Chandra-Itzykson-Zuber Integral Formula. 2013 https://terrytao.wordpress.com/2013/02/08/the-harish-chandra-itzykson-zuber-integral-formula/
42.Van Der Vaart AW. Asymptotic Statistics. Cambridge University Press; 2000. [Google Scholar]
43.Watanabe S. Cambridge Monogr Appl Comput Math. Vol. 25. Cambridge University Press; 2009. Algebraic Geometry and Statistical Learning Theory. [Google Scholar]
44.Zhu H, Zhang H, Ibrahim JG, Peterson BS. Statistical analysis of diffusion tensors in diffusion-weighted magnetic resonance imaging data. J Amer Statist Assoc. 2007;102:1085–1102. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix

NIHMS906978-supplement-Appendix.pdf^{(299.4KB, pdf)}

[R1] 1.An C, Chen X, Sloan IH, Womersley RS. Well conditioned spherical designs for integration and interpolation on the two-sphere. SIAM J Numer Anal. 2010;48:2135–2157. https://doi.org/10.1137/100795140. [Google Scholar]

[R2] 2.Anderson GA. An asymptotic expansion for the distribution of the latent roots of the estimated covariance matrix. Ann Math Statist. 1965;36:1153–1173. [Google Scholar]

[R3] 3.Anderson GW, Guionnet AA, Ofer Z. An Introduction to Random Matrices. Cambridge University Press; 2010. [Google Scholar]

[R4] 4.Bannai E, Bannai E. A survey on spherical designs and algebraic combinatorics on spheres. European J Combin. 2009;30:1392–1425. [Google Scholar]

[R5] 5.Basser PJ, Mattiello J, LeBihan D. MR diffusion tensor spectroscopy and imaging. Biophys J. 1994;66:259. doi: 10.1016/S0006-3495(94)80775-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Basser PJ, Mattiello J, LeBihan D. Estimation of the effective self-diffusion tensor from the NMR spin echo. J Magnetic Resonance Ser B. 1994;103:247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]

[R7] 7.Basser PJ. Inferring microstructural features and the physiological state of tissues from diffusion weighted images. NMR Biomed. 1995;8:333–344. doi: 10.1002/nbm.1940080707. [DOI] [PubMed] [Google Scholar]

[R8] 8.Basser PJ. New histological and physiological stains derived from diffusion-tensor MR-images. Imaging Brain Structure and Function, Ann New York Acad Sci. 1997;820:123–138. doi: 10.1111/j.1749-6632.1997.tb46192.x. [DOI] [PubMed] [Google Scholar]

[R9] 9.Basser PJ, Pajevic S. Statistical artifacts in diffusion tensor MRI (DT-MRI) caused by background noise. Magnetic Resonance Med. 2000;44:41–50. doi: 10.1002/1522-2594(200007)44:1<41::aid-mrm8>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]

[R10] 10.Basser PJ, Pajevic S. A normal distribution for tensor-valued random variables: Applications to diffusion tensor MRI. IEEE Trans Med Imaging. 2003;22:785–794. doi: 10.1109/TMI.2003.815059. [DOI] [PubMed] [Google Scholar]

[R11] 11.Basser PJ, Pajevic S. Dealing with uncertainty in diffusion tensor MR data. Israel J Chem. 2003;43:129–144. [Google Scholar]

[R12] 12.Basser PJ, Pajevic S. Spectral decomposition of a 4th-order covariance tensor: Applications to diffusion tensor MRI. Signal Process. 2007;87:220–236. [Google Scholar]

[R13] 13.Batchelor PG, Atkinson D, Hill DLG, Calamante F, Connelly A. Anisotropic noise propagation in diffusion tensor MRI sampling schemes. Magnetic Resonance Med. 2003;49:1143–1151. doi: 10.1002/mrm.10491. [DOI] [PubMed] [Google Scholar]

[R14] 14.Chattopadhyay AK, Pillai KC. Asymptotic expansions for the distributions of characteristic roots when the parameter matrix has several multiple roots. Multivariate Analysis, III; Proc. Third Internat. Sympos., Wright State Univ; Dayton, OH. 1972; Academic Press; 1973. pp. 117–127. [Google Scholar]

[R15] 15.Chiani M. Distribution of the largest eigenvalue for real Wishart and Gaussian random matrices and a simple approximation for the Tracy–Widom distribution. J Multivariate Anal. 2014;129:69–81. [Google Scholar]

[R16] 16.Chikuse Y. Statistics on Special Manifolds. Springer Lecture Notes in Statistics. 2003;174 [Google Scholar]

[R17] 17.Clement-Spychala ME, Couper D, Zhu H, Muller KE. Approximating the Geisser-Greenhouse sphericity estimator and its applications to diffusion tensor imaging. Stat Interface. 2010;3:81–90. doi: 10.4310/SII.2010.v3.n1.a7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Delsarte P, Goethals JM, Seidel JJ. Spherical codes and designs. Geom Dedicata. 1977;6:363–388. [Google Scholar]

[R19] 19.Drton M. Likelihood ratio tests and singularities. Ann Statist. 2009;37:979–1012. [Google Scholar]

[R20] 20.Drton M, Xiao H. Wald tests of singular hypothesis. Bernoulli. 2016;22:38–59. [Google Scholar]

[R21] 21.Dyson FJ. A Brownian-motion model for the eigenvalues of a random matrix. J Math Phys. 1962;3:1191–1198. [Google Scholar]

[R22] 22.Edelman A. PhD Thesis. MIT; 1989. Eigenvalue and Condition Numbers of Random Matrices. [Google Scholar]

[R23] 23.Farrell RH. Multivariate Calculation Use of the Continuous Groups. Springer; 1985. [Google Scholar]

[R24] 24.Forrester PJ. London Math Soc Monographs. Princeton University Press; 2010. Log-Gases and Random Matrices. [Google Scholar]

[R25] 25.Guionnet A. Noncommutative Probability and Random Matrices at Saint-Flour. Springer; 2012. Large random matrices: Lectures on macroscopic asymptotics; pp. 169–463. [Google Scholar]

[R26] 26.Hikami S, Brézin E. WKB-expansion of the Harish-Chandra-Itzykson-Zuber integral for arbitrary β. Progr Theoret Phys. 2006;116:441–502. [Google Scholar]

[R27] 27.Idier J, Collewet G. Properties of Fisher Information for Rician Distributions and Consequences in MRI. 2014 preprint hal-01072813. [Google Scholar]

[R28] 28.James AT. Distributions of matrix variates and latent roots derived from normal samples. Ann Math Statist. 1964;35:475–501. [Google Scholar]

[R29] 29.Liu J, Gasbarra D, Railavo J. Fast estimation of diffusion tensors under Rician noise by the EM algorithm. J Neurosci Methods. 2016;257:147–158. doi: 10.1016/j.jneumeth.2015.09.029. [DOI] [PubMed] [Google Scholar]

[R30] 30.Magnus JR, Neudecker H. Matrix Differential Calculus with Applications in Statistics and Econometrics. Wiley; 1999. [Google Scholar]

[R31] 31.Mallows CL. Latent vectors of random symmetric matrices. Biometrika. 1961;48:133–149. [Google Scholar]

[R32] 32.Mehta ML. Random Matrices. 3. Elsevier; 2004. [Google Scholar]

[R33] 33.Muirhead RJ. Aspects of Multivariate Statistics. Wiley; 1984. [Google Scholar]

[R34] 34.Pajevic S, Basser PJ. Parametric and non-parametric statistical analysis of DT-MRI data. J Magnetic Resonance. 2003;161:1–14. doi: 10.1016/s1090-7807(02)00178-7. [DOI] [PubMed] [Google Scholar]

[R35] 35.Pajevic S, Basser PJ. A joint PDF for the eigenvalues and eigenvectors of a diffusion tensor. Proc Internat Soc Magnetic Resonance Med. 2010;18:303. [Google Scholar]

[R36] 36.Pierpaoli C, Infante I, Mattiello J, Di Chiro G, Le Bihan D, Basser PJ. Diffusion tensor imaging of brain white matter anisotropy. Proc Internat Soc Magnetic Resonance Med. 1994;2:1038. [Google Scholar]

[R37] 37.Schwartzman A, Mascarenhas WF, Taylor JE. Inference for eigenvalues and eigenvectors of Gaussian symmetric matrices. Ann Statist. 2008;36:2886–2919. [Google Scholar]

[R38] 38.Schwartzman A, Dougherty RF, Taylor JE. Group comparison of eigenvalues and eigenvectors of diffusion tensors. J Amer Statist Assoc. 2010;105:588–598. doi: 10.1198/jasa.2010.ap07291. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Takemura A. Zonal Polynomials. Institute of Mathematical Statistics; 1984. IMS Lecture Notes Monogr. Ser. 4. [Google Scholar]

[R40] 40.Tao T. Topics in Random Matrix Theory. American Mathematical Society; 2012. [Google Scholar]

[R41] 41.Tao T. The Harish-Chandra-Itzykson-Zuber Integral Formula. 2013 https://terrytao.wordpress.com/2013/02/08/the-harish-chandra-itzykson-zuber-integral-formula/

[R42] 42.Van Der Vaart AW. Asymptotic Statistics. Cambridge University Press; 2000. [Google Scholar]

[R43] 43.Watanabe S. Cambridge Monogr Appl Comput Math. Vol. 25. Cambridge University Press; 2009. Algebraic Geometry and Statistical Learning Theory. [Google Scholar]

[R44] 44.Zhu H, Zhang H, Ibrahim JG, Peterson BS. Statistical analysis of diffusion tensors in diffusion-weighted magnetic resonance imaging data. J Amer Statist Assoc. 2007;102:1085–1102. [Google Scholar]

PERMALINK

Eigenvalues of Random Matrices with Isotropic Gaussian Noise and the Design of Diffusion Tensor Imaging Experiments*

Dario Gasbarra

Sinisa Pajevic

Peter J Basser

Abstract

1. Introduction

1.1. Tensor-variate normal distribution

1.2. Isotropic Gaussian matrix distribution

Remark 1.1

2. Spectral representation and change of variables

Proposition 2.1 (see [24, Prop. 1.2])

3. Eigenvalue and eigenvector distribution

3.1. Zero-mean isotropic Gaussian matrix

Remark 3.1

3.2. General case

Theorem 3.2

Proof

Remark 3.3

4. Small noise asymptotics

4.1. Spectral grouping

Theorem 4.1

Remark 4.2

Corollary 4.3

Remark 4.4

4.2. Axial and radial diffusivity marginals

4.3. Eigenvector asymptotics

Theorem 4.5

Remark

4.4. Second-order approximation of the HCIZ integral

Theorem 4.6

Remark 4.7

5. Testing the sphericity hypothesis

Lemma 5.1

Corollary 5.2

Proof

Lemma 5.3

Proof

Corollary 5.4

Remark 5.5

6. Asymptotic statistics in DTI under Rician noise

6.1. Isotropic Gaussian limit error distribution

Proposition 6.1

Proof

6.2. Spherical t-designs in diffusion tensor imaging

Table 1.

Figure 1.

Figure 2.

Figure 3.

Figure 4.

7. Illustration of the methods

7.1. Monte Carlo study with isotropic Gaussian noise

Figure 5.

Figure 6.

Figure 7.

Figure 8.

7.2. Monte Carlo study of sphericity test statistics based on DTI data with Rician noise

Figure 9.

Figure 13.

Figure 14.

Figure 12.

Figure 11.

8. Conclusion

Supplementary Material

Figure 10.

Acknowledgments

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases