APPROXIMATING SYMMETRIC POSITIVE SEMIDEFINITE TENSORS OF EVEN ORDER

ANGELOS BARMPOUTIS; HO JEFFREY; BABA C VEMURI

doi:10.1137/100801664

. Author manuscript; available in PMC: 2012 Dec 31.

Published in final edited form as: SIAM J Imaging Sci. 2012 Mar 20;5(1):434–464. doi: 10.1137/100801664

APPROXIMATING SYMMETRIC POSITIVE SEMIDEFINITE TENSORS OF EVEN ORDER^*

ANGELOS BARMPOUTIS, HO JEFFREY, BABA C VEMURI

PMCID: PMC3533448 NIHMSID: NIHMS356677 PMID: 23285313

Abstract

Tensors of various orders can be used for modeling physical quantities such as strain and diffusion as well as curvature and other quantities of geometric origin. Depending on the physical properties of the modeled quantity, the estimated tensors are often required to satisfy the positivity constraint, which can be satisfied only with tensors of even order. Although the space $P_{0}^{2 m}$ of 2m^th-order symmetric positive semi-definite tensors is known to be a convex cone, enforcing positivity constraint directly on $P_{0}^{2 m}$ is usually not straightforward computationally because there is no known analytic description of $P_{0}^{2 m}$ for m > 1. In this paper, we propose a novel approach for enforcing the positivity constraint on even-order tensors by approximating the cone $P_{0}^{2 m}$ for the cases 0 < m < 3, and presenting an explicit characterization of the approximation Σ₂_m ⊂ Ω₂_m for m ≥ 1, using the subset $Ω_{2 m} \subset P_{0}^{2 m}$ of semi-definite tensors that can be written as a sum of squares of tensors of order m. Furthermore, we show that this approximation leads to a non-negative linear least-squares (NNLS) optimization problem with the complexity that equals the number of generators in Σ₂_m. Finally, we experimentally validate the proposed approach and we present an application for computing 2m^th-order diffusion tensors from Diffusion Weighted Magnetic Resonance Images.

Keywords: high-order tensors, sum of squares of polynomials, diffusion tensor imaging

1. Introduction

Multi-linear algebra is a generalization of linear algebra and tensors which are multi-linear forms are widely used for modeling various physical quantities commonly encountered in engineering and physics. Elasticity [34], stress, strain and diffusion [10] are some examples. In differential geometry, tensors are used to represent metrics, curvatures [40] and other geometric quantities. In image processing, structure tensors [46] have been used for texture analysis, trifocal tensors in multi-view geometry, etc. The tensors in most of these applications are required to satisfy certain properties. For example, the tensors that approximate the Bidirectional Reflectance Distribution Function (BRDF) [7] are anti-symmetric, while the diffusion [10] and the structure tensors [46] are antipodally symmetric. Furthermore, certain applications demand that the estimated tensors be positive-definite since they model positive-valued physical quantities such as the diffusivity function or the displacement probability of water molecules [8]. In this paper, we are interested in the case of fully symmetric positive-definite tensors of various orders and hence for sake of simplicity, every reference to the term tensor will imply this particular case of tensors unless otherwise stated.

Let Inline graphic denote the set of m^th-order symmetric positive-definite tensors in ℝ³. As is well-known, positivity condition requires the order m to be even. Denote $P_{0}^{m}$ the closure of consisting of symmetric positive semi-definite tensors (PSD) in ℝ³. As subsets of the space of m^th-order symmetric tensors, Inline graphic , $P_{0}^{m}$ are cones, convex subsets that are invariant under positive scaling [18]. In most applications, the main computational problem can be formulated as data interpolation problem with the domain being $P_{0}^{2 m}$ . Specifically, the input data are often in the form {(x₁, y₁), ···, (x_k, y_k)} where x_i are directions in ℝ³ represented as points on the unit sphere S², and y_i are the values to be interpolated. The interpolation problem requires a non-negative tensor $T \in P_{0}^{2 m}$ that interpolates the input data. Formulated as a least-squares problem, it has the form

T = arg min_{p \in P_{0}^{2 m}} \sum_{i = 1}^{k} {∣ y_{i} - p (x_{i}) ∣}^{2} .

We note that both the objective function and the domain $P_{0}^{2 m}$ are convex, and therefore, the optimization problem above is in fact a convex optimization problem that, in principle, can be solved using existing techniques [12]. However, a formal and significant difficulty of applying these methods is that except for the m = 1 case, there exists no known description of the cone $P_{0}^{2 m}$ as it is well-known that the positivity test for polynomials of degree m > 2 is a difficult problem. In the second-order case, the cone $P_{0}^{2}$ is known to be self-dual in the sense that there exists an inner product < ·, · > on Inline graphic such that < A, B >≥ 0 for any $A, B \in P_{0}^{2}$ . The inner product allows the extension of the usual duality theory using Lagrange multipliers to the cone $P_{0}^{2}$ , and there is a well-developed theory of semi-definite programming (SDP) [12] that deals with linear objective functions on $P_{0}^{2}$ .

While the difficulty of providing a complete description of $P_{0}^{2 m}$ seems to be unsurmountable at this point, the main contribution of this paper is the realization of another formal difficulty that can be overcome relatively easily. A cone C in a vector space is said to be finitely-generated if there exists a finite number of elements v₁, ···, v_n ∈ C, its generators, such that every element c ∈ C can be written as a non-negative linear combination of the generators

c = a_{1} v_{1} + \dots + a_{n} v_{n}, a_{1}, \dots, a_{n} \geq 0.

If the cone $P_{0}^{2 m}$ were finitely generated, the above optimization problem becomes a non-negative linear least-squares (NNLS) problem, with complexity (number of variables) equals to the number of generators. The advantage of solving an NNLS problem is that there are software packages that can efficiently solve NNLS problems containing thousands of variables [28]. While $P_{0}^{2 m}$ is not finitely-generated, it follows naturally that we can try to approximate $P_{0}^{2 m}$ with a finitely-generated subcone, and restrict the above optimization problem to the subcone. The restriction can be justified if the subcone can be shown to be a good approximation of $P_{0}^{2 m}$ .

The second contribution of this paper is an explicit characterization of the approximations $\sum_{2 m} \subset P_{0}^{2 m}$ for 0 < m < 3, and Σ₂_m ⊂ Ω₂_m for m ≥ 1, where Σ₂_m is a finitely-generated subcone in the respective spaces. More specifically, let Ω₂_m denote the subcone in $P_{0}^{2 m}$ consisting of semi-definite tensors that can be written as a sum of squares of tensors of order m. We have the natural inclusions $\sum_{2 m} \subset Ω_{2 m} \subset P_{0}^{2 m}$ , and our result gives a detailed characterization of the approximation Σ₂_m ⊂ Ω₂_m in terms of the geometry of the generators in Σ₂_m. In particular, for m = 1, 2, it is known that $Ω_{2 m} = P_{0}^{2 m}$ , and our result then gives a detailed characterization of the approximation $\sum_{2 m} \subset P_{0}^{2 m}$ . Our analysis have shown that, for the lower-order cases m = 1, 2, 3, which are of primary interest here, for a reasonable precision requirement, Ω₂_m can be approximated by Σ₂_m containing a few hundreds or at most a few thousands of generators. It follows that the corresponding NNLS problems have the complexity that are well within the capability of currently available NNLS algorithms [28]. We quantitatively validate our method via several experiments, and we also present an application of the proposed technique for estimating the diffusivity function from diffusion-weighted MRI to demonstrate both the efficiency and accuracy of the proposed method.

The rest of this paper is organized as follows: In Sec. 2, we define the finitely-generated subcone Σ₂_m. We also develop the theory that quantifies the approximation Σ₂_m ⊂ Ω₂_m, and the main theorem proved in this section relates the approximation error with the geometry of the generators in Σ₂_m. Using the theory developed in Sec. 2, in Sec. 3 we explicitly work out the formulas for the number of generators for Σ₂_m required for a given accuracy requirement. The results show that, up to order-6 and depending on the order, it generally requires at most a few thousands of generators for Σ₂_m in order to achieve a relative approximation error of less than 10%. Finally, in Sec. 4, we validate our theoretical findings using a set of experiments and we present an application of our method on diffusion-weighted MR datasets.

Related Work

Symmetric positive-definite (SPD) tensors of order-2 have been used in modeling the diffusivity function in the so called Diffusion Tensor MR Imaging (DT-MRI) [10]. SPD matrices can be endowed with a Riemannian metric that is invariant under affine transforms. This metric or its approximations have been employed for estimating and processing diffusion tensor fields [48, 47, 29, 38, 18, 9]. Tensors of 3^rd and 5^th order can model reflectance distributions with specularities and cast shadows in facial images and have been used for re-lighting in [7]. In general, odd-order tensors are generalizations of the order-1 tensor, which have been commonly used in computer graphics for representing the Lambertian reflectance model. Similarly, 4^th, 6^th or higher even-order tensors generalize the 2^nd-order tensors and have the ability to approximate multi-lobed functions [35, 30, 36] such as the kurtosis of diffusion [26]. In particular, some 4^th-order tensors can be expressed as 2^nd-order tensors in higher dimensions and their properties have been studied in detail by Moakher in [32, 33]. They however do not span the full space of the higher-order tensors as was shown in the case of order-4 tensors in [6, 5]. In [20], Ghosh et al. used the metric proposed by Moakher in [32, 33] to represent the space of 4^th-order SPD tensors using the geometry of 2^nd-order SPD tensors in higher dimensions. Recently, an algorithm for imposing positivity constraints on 4^th-order tensors using their equivalent ternary quartic polynomial representation was proposed in [6] and this was further developed in [5] and [21, 49].

After estimating a field of high-order tensors, it can be processed using a Finsler metric by appropriately modifying the polynomial equivalent representation of the tensors that satisfy the properties of Finsler geometry [4]. This method can be used for neuronal fiber tracking from high angular resolution diffusion MRI data. Further processing of higher-order tensor fields can be achieved by using the eigenvalue decomposition of matrices which has been extended for the case of high-order tensors in [23]. In this framework, the eigenvalues correspond to the extreme values (minima or maxima) of a tensor and they can be used to extract useful information from the kurtosis tensor [42] as well as the orientation of maximum diffusion [11, 22]. Another method for extracting the principal orientation of diffusion from a higher-order tensor was recently described in [44].

Although, high-order tensors have been employed in most of the aforementioned methods due to their simple polynomial form and their ability to model multi-lobed spherical functions, there are no existing methods for imposing positivity constraints in symmetric tensors of any order higher than two and four. The need to impose positivity constraints becomes essential especially in the case where the tensors approximate positive-valued physical quantities, and it has been shown that imposing the positivity constraint on the tensors approximating the diffusivity function being estimated reduces the approximation errors significantly [5]. Recently, Pasternak et al. [37] also emphasized the importance of enforcing positivity constraints in processing diffusion tensor MR images.

Finally, although Cartesian tensors basis have been widely used for modeling the diffusivity function in DW-MRI, we would like to mention that Spherical Harmonic basis have been employed in approximating other spherical functions involved in DW-MRI processing such as the diffusion propagator. A detailed review of several multi fiber reconstruction methods that employ spherical harmonic basis can be found in the recent article by Descoteaux et al. on Diffusion Propagator Imaging [17]. The orientation distribution function (ODF) is another example of a DW-MRI related spherical function, which can be reconstructed from Q-ball imaging data [16, 13, 3] and was recently done in [2] by using the mathematically correct definition of ODF and deriving a closed form expression for the same. In this article, however, our main focus is on the use of Cartesian tensor basis for parameterizing the diffusivity function in DW-MR datasets.

2. Theory

We will consider symmetric tensors of order m as functions defined on the unit sphere S² in ℝ³. In particular, symmetric tensors of order m can be identified with homogeneous polynomials of degree m: for a symmetric tensor T of order m, its associated homogeneous polynomial P(x, y, z) is given as

P (x, y, z) = T (\underset{m}{\underset{︸}{x, \dots, x}}),

where x = [x y z]^⊤. Under this identification, Inline graphic are homogeneous polynomials of degree m that do not vanish on S², and similarly, $P_{0}^{m}$ are degree-m homogeneous polynomials that do not take negative values in ℝ³. Both are now considered as cones in H_m, the set of homogeneous polynomials of degree m. For even degree 2m, let Ω₂_m denote the subset of $P_{0}^{2 m}$ consisting of polynomials that can be written as a sum of squares of polynomials of degree m. Ω₂_m is clearly a subcone of $P_{0}^{2 m}$ for all m ≥ 1, and for m = 1, 2, it is known that $Ω_{2 m} = P_{0}^{2 m}$ : the m = 1 case follows easily from linear algebra and m = 2 case is the content of Hilbert’s theorem on ternary quartics [24]. For m > 2, however, the inclusion is strict $Ω_{2 m} ⊊ P_{0}^{2 m}$ . In this section, we will describe a general method for approximating Ω₂_m using a finitely-generated subcone Σ₂_m in Ω₂_m, and we will provide a characterization of the approximation error in terms of the geometry of the generators of Σ₂_m. For the important quadratic and quartic cases m = 1, 2, our result provides an approximation of the full PSD cone $P_{0}^{2 m}$ using a finitely-generated subcone Ω₂_m.

The basic norm used in this paper is the L¹-norm over the sphere S². More specifically, for any P ∈ H_m, its L¹-norm ||P ||₁ is the integral over S²

{| | P | |}_{1} = \int_{S^{2}} ∣ P (x) ∣ d x .

That it is indeed a norm follows from the fact that for two homogeneous polynomials P, Q, P = Q as polynomials if and only if ||P − Q||₁ = 0. Note that the other norm properties are trivial to prove. For any $P \in P_{0}^{2 m}$ and a subcone $\sum_{2 m} \subset P_{0}^{2 m}$ , we define the relative L¹-approximation error of P as

E_{\sum_{2 m}} (P) = \frac{{min}_{p \in \sum_{2 m}} {| | P - p | |}_{1}}{{| | P | |}_{1}} .

(2.1)

Proposition 2.1

Let Σ₂_m be a closed subcone in $P_{0}^{2 m}$ and $P \in P_{0}^{2 m}$ .

The L¹-norm is convex: for any $p \in P_{0}^{2 m}$ , the function g(q), $q \in P_{0}^{2 m}$
$g (q) = {| | p - q | |}_{1}$

is a convex function on $P_{0}^{2 m}$ .
For P ≠ 0, E_{Σ_2m}(P) = 0 if and only if P ∈ Σ₂_m. For any s > 0,
$E_{\sum_{2 m}} (s P) = E_{\sum_{2 m}} (P) .$

Proof

For any $q_{1}, q_{2} \in P_{0}^{2 m}$ ,

g ({t q}_{1} + (1 - t) q_{2}) = \int_{S^{2}} ∣ p - {t q}_{1} - (1 - t) q_{2} ∣ d x \leq \int_{S^{2}} ∣ t p - {t q}_{1} ∣ d x + \int_{S^{2}} ∣ (1 - t) p - (1 - t) q_{2} ∣ d x,

and the convexity of the norm on $P_{0}^{2 m}$ follows. (2) is clear because Σ₂_m is closed. The invariance of E_{Σ_2m} under positive scaling follows readily from the definition.

Let m₁(x), ···, m_d₍_m₎(x) denote the $d (m) = \frac{(m + 2) (m + 1)}{2}$ monomials in H_m. (Note that d(m) also equals to the number of symmetric spherical harmonic basis elements, which can be mapped to the monomials in H_m using an one-to-one transformation [35, 15].) The monomials form a basis in H_m that identifies H_m with ℝ^d⁽^m⁾. We will denote HS_m the unit sphere in H_m, consisting of polynomials

p (x) = \sum_{i = 1}^{d (m)} a_{i} m_{i} (x)

such that $a_{1}^{2} + \dots + a_{d (m)}^{2} = 1$ . The subcone Σ₂_m will be defined using polynomials in HS_m, and this is accomplished through the square map $F_{m}^{2} : H_{m} \to H_{2 m}$ :

F_{m}^{2} (p) = p^{2} .

Clearly $F_{m}^{2}$ is a smooth map, and $F_{m}^{2} (p) = F_{m}^{2} (q)$ if and only if p = ±q. While $F_{m}^{2}$ is not linear, it maps rays in H_m to rays in $H_{2 m} : F_{m}^{2} (t p) = t^{2} F_{m}^{2} (p)$ . The geometry of the map $F_{m}^{2}$ will play a crucial role in our analysis below, and it is quantified by its condition number η_m. First, we define two quantities.

η_{m}^{max} = max_{p \in {HS}_{m}} {| | F_{m}^{2} (p) | |}_{1}, η_{m}^{min} = min_{p \in {HS}_{m}} {| | F_{m}^{2} (p) | |}_{1} .

Clearly we have $η_{m}^{min} > 0$ since HS_m does not contain the zero polynomial. The two numbers measure the amount of stretching and shrinking $F_{m}^{2}$ does to the sphere HS_m. Their ratio gives the condition number η _m for $F_{m}^{2}$

η_{m} = \frac{η_{m}^{max}}{η_{m}^{min}} .

In the following, we will often drop the subscript and denote the condition number simply as η when the degree m in the context is clear. Figure 2.1 illustrates the effect of $F_{m}^{2}$ and its condition number η.

Fig. 2.1 — **Left:** Comparison between $F_{1}^{2}$ and $F_{2}^{2}$ . Let $S_{r}^{2 m}$ , r > 0 denote the circle with radius r centered at origin in H_2m. $F_{1}^{2}$ is isotropic in the sense that ${| | F_{1}^{2} (p) | |}_{1} \in S_{4 π / 3}^{2}$ for all p ∈ HS₁. $F_{2}^{2}$ , on the other hand, is not isotropic. HS₂ is the five-dimensional sphere S⁵. Its equator can be identified with S⁴, and the two polynomials $\pm 1 / \sqrt{3} (x^{2} + y^{2} + z^{2})$ form the two poles. Inside the equator, are embedded S¹ and S². $F_{2}^{2}$ maps the poles ±(x² + y² + z²) to $S_{4 π / 3}^{4}$ , and it maps the embedded S¹ and S² to $S_{8 π / 15}^{4}, S_{4 π / 15}^{4}$ , respectively. The condition number η for $F_{2}^{2}$ on HS₂ is 5. Restricting $F_{2}^{2}$ to the equator S⁴, the condition number improves to 2. **Right:** Local and non-local approximations. For each p ∈ HS_m, Lemma 2.8 approximates p first with the vertices of the simplex containing p. This local approximation is further improved using non-local approximations as the polynomials (p_i − p_j)² are approximated by polynomials in HS_m that are generally far from p.

Proposition 2.2

$η_{m}^{max}, η_{m}^{min}$ and hence η can be determined by evaluating $\frac{d {(m)}^{2} + d (m)}{2}$ trigonometric integrals.

Proof

Let m₁, ···, m_d₍_m₎ denote the d(m) monomials in H_m. A polynomial p ∈ HS_m is identified with the vector of coefficients a = [a₁, ···, a_d₍_m₎]^⊤ as p = a₁m_i + ··· + a_d₍_m₎m_d₍_m₎. The L¹-norm || Inline graphic (p)||₁ is the integral of p² over S² that can be written as

{| | F_{2} (p) | |}_{1} = \sum_{i, j = 1}^{d (m)} a_{i} a_{j} \int_{S^{2}} m_{i} (x) m_{j} (x) d x .

Let Λ^m denote the d(m) × d(m) matrix whose components $Λ_{i j}^{m}$ are the integrals ∫_S² m_i(x)m_j(x) dx, we have

{| | F_{2} (p) | |}_{1} = a^{⊤} Λ^{m} a .

It follows that $η_{m}^{max}, η_{m}^{min}$ can be determined as

η_{m}^{max} = min_{a^{⊤} a = 1} a^{⊤} Λ^{m} a, η_{m}^{min} = min_{a^{⊤} a = 1} a^{⊤} Λ^{m} a,

both of which can be solved once Λ^m is known using Singular Value Decomposition. The integrals ∫_S² m_i(x)m_j(x) dx can be computed in closed form since using spherical coordinates, x = sin ψ cos θ, y = sin ψ sin θ, z = cos ψ, each integral is a product of two trigonometric integrals

\int_{S^{2}} m_{i} (x) m_{j} (x) d x = (\int_{θ = 0}^{2 π} {cos}^{b_{1}} θ {sin}^{b_{2}} θ d θ) (\int_{ψ = 0}^{π} {cos}^{b_{3}} ψ {sin}^{b_{4}} ψ d ψ),

with exponents b₁, b₂, b₃, b₄ depending on m_i, m_j.

In practice, $Λ_{i j}^{m}$ can be numerically evaluated to any desired accuracy without appealing to the closed-form integral formulas. Next we prove a simple result that partially explains why the linear case m = 1 is substantially easier than the nonlinear cases m > 1.

Proposition 2.3

η_m = 1 if and only if m = 1. That is, $F_{1}^{2}$ is isotropic with respect to the L¹-norm in H₂.

Proof

The ‘if’ part follows readily from the fact that

\int_{S^{2}} x y d x = \int_{S^{2}} x z d x = \int_{S^{2}} y z d x = 0,

and

\int_{S^{2}} x^{2} d x = \int_{S^{2}} y^{2} d x = \int_{S^{2}} z^{2} d x = \frac{4 π}{3} .

The matrix Λ¹ is therefore diagonal with constant diagonal element $\frac{4 π}{3}$ , and $F_{1}^{2}$ is isotropic with respect to the L¹-norm in H₂.

Conversely, for m > 1, let p = x^m, q = x^m⁻¹y. We show that || Inline graphic (p)||₁ ≠ || (q)||₁:

\begin{array}{l} {| | F_{2} (p) | |}_{1} = \int_{S^{2}} x^{2 m} d x = \int_{ψ = 0}^{π} \int_{θ = 0}^{2 π} {sin}^{2 m} ψ {cos}^{2 m} θ sin ψ d θ d ψ, \\ {| | F_{2} (q) | |}_{1} = \int_{S^{2}} x^{2 m - 2} y^{2} d x = \int_{ψ = 0}^{π} \int_{θ = 0}^{2 π} {sin}^{2 m} ψ {cos}^{2 m - 2} θ {sin}^{2} θ sin ψ d θ d ψ . \end{array}

Let $c = \int_{ψ = 0}^{π} {sin}^{2 m + 1} ψ d ψ$ , we have

\begin{array}{l} {| | F_{2} (p) | |}_{1} = c \int_{θ = 0}^{2 π} {cos}^{2 π} θ d θ, \\ {| | F_{2} (q) | |}_{1} = c \int_{θ = 0}^{2 π} {cos}^{2 m - 2} θ {sin}^{2} θ d θ . \end{array}

Therefore,

\begin{array}{l} {| | F_{2} (p) | |}_{1} - {| | F_{2} (q) | |}_{1} = c \int_{θ = 0}^{2 π} {cos}^{2 m - 2} θ ({cos}^{2} θ - {sin}^{2} θ) d θ \\ = c \int_{θ = 0}^{2 π} {cos}^{2 m - 2} θ (2 {cos}^{2} θ - 1) d θ . \end{array}

Since $\int_{π = 0}^{2 π} {cos}^{n} θ d θ = \frac{n - 1}{n} \int_{π = 0}^{2 π} {cos}^{n - 2} θ d θ$ for any n ≥ 2, we have

{| | F_{2} (p) | |}_{1} - {| | F_{2} (q) | |}_{1} = c (\frac{2 m - 1}{m} - 1) \int_{θ = 0}^{2 π} {cos}^{2 m - 2} θ d θ,

which shows that || Inline graphic (p)||₁ − || (q)||₁ ≠ 0 if m > 1. This implies η_m > 1 if m > 1.

Using the square map $F_{m}^{2}$ , we will define the approximating subcone $\sum_{2 m}^{C}$ by specifying its generators as polynomials in HS_m. More specifically, let Inline graphic = {p₁, ···, p_k} denote a finite set of k polynomials (points) in HS_m. Its associated cone $\sum_{2 m}^{C}$ in H₂_m is generated by the finite set of generators $F_{m}^{2} (C) = {p_{1}^{2}, \dots, p_{k}^{2}}$ : elements in $\sum_{2 m}^{C}$ are non-negative linear combinations of $F_{m}^{2} (p_{i})$ :

p = a_{1} p_{1}^{2} + \dots + a_{k} p_{k}^{2},

for some a₁, ···, a_k ≥ 0. It is immediately clear that $\sum_{2 m}^{C} \subset Ω_{2 m} \subset P_{0}^{2 m}$ for any finite subset Inline graphic ⊂ HS_m. Since $F_{m}^{2} (p) = F_{m}^{2} (- p)$ , we can restrict points in to lie in one chosen hemisphere of HS_m. For such , its completion ⊂ is obtained by joining all antipodal points of points in ,

\bar{C} = {p_{1}, \dots, p_{k}, - p_{1}, \dots, - p_{k}} .

Examples

For m = 1, H₁ is ℝ³ and HS₁ is S². If Inline graphic consists of four points {[1, 0, 0]^⊤, [0, 1, 0]^⊤, [0, 0, 1]^⊤, [ $\sqrt{1 / 3}, \sqrt{1 / 3}, \sqrt{1 / 3}$ ]}, the four polynomials p₁, p₂, p₃, p₄ are x, y, z and $\sqrt{1 / 3} x + \sqrt{1 / 3} y + \sqrt{1 / 3} z$ , respectively. Elements in $\sum_{2}^{C}$ are non-negative combinations of the four polynomials $p_{1}^{2}, p_{2}^{2}, p_{3}^{2}, p_{4}^{2}$ . More precisely, any $p \in \sum_{2}^{C}$ is determined (in this case, uniquely) by four non-negative numbers a₁, a₂, a₃, a₄ ≥ 0 such that

p (x, y, z) = (a_{1} + \frac{a_{4}}{3}) x^{2} + (a_{2} + \frac{a_{4}}{3}) y^{2} + (a_{3} + \frac{a_{4}}{3}) z^{2} + \frac{2 a_{4}}{3} (x y + x z + y z) .

For m = 2, H₂ can be identified with ℝ⁶ using the monomial basis {x², y², z², xy, xz, yz} and HS₂ is S⁵. If Inline graphic consists of three points

{[λ, λ, 0, 0, 0, 0]}^{⊤}, {[λ, 0, 0, - λ, 0, 0]}^{⊤}, {[0, - λ, 0, 0, 0, λ]}^{⊤},

where $λ = \sqrt{1 / 2}$ , the three polynomials p₁, p₂, p₃ are λ(x² + y²), λ(x² − xy), λ(yz − y²). Any $p \in \sum_{4}^{C}$ can be written (again uniquely) as

p (x, y, z) = \frac{(a_{1} + a_{2})}{2} x^{4} + \frac{(a_{1} + a_{3})}{2} y^{4} + (a_{1} + \frac{a_{2}}{2}) x^{2} y^{2} - a_{2} x^{3} y - a_{3} y^{3} z + \frac{a_{3}}{2} y^{2} z^{2}

for three non-negative a₁, a₂, a₃.

The inclusion $\sum_{2 m}^{C} \subset Ω_{2 m}$ gives an approximation of Ω₂_m by $\sum_{2 m}^{C}$ , and it involves two main components: the square map $F_{m}^{2}$ and the chosen polynomials in Inline graphic that provide the generators in $\sum_{2 m}^{C}$ through $F_{m}^{2}$ . The main result of our analysis on the approximation error of $\sum_{2 m}^{C} \subset Ω_{2 m}$ is given in the next theorem, which asserts that the approximation error can be bounded by a product of contributions from both components: the condition number η_m of $F_{m}^{2}$ and the condition number θ( Inline graphic ) of the set whose definition we now turn to.

Condition Number θ( ) of

We use θ( Inline graphic ) as the measure that quantifies the approximation of any q ∈ HS_m, considered as a point on the sphere, by the finite set . We will use the spherical distance d_{HS_m}(p, q) (arc-length in radians) to measure the distance between a pair of points p, q on the sphere HS_m, and in particular, d_{HS_m}(p, q) is the angle between the two unit vectors p, q in HS_m. A set Inline graphic is said to be good if there is a triangulation of HS_m as a simplicial complex whose vertex set is the completion of . Since HS_m has dimension d(m) − 1, the top-dimensional simplexes in have dimension d(m) − 1 as well. Therefore, for any q ∈ HS_m, there is a d(m) − 1-simplex σ ∈ Inline graphic containing q. In particular, we will assume that q can be written as a non-negative linear combination of the vertices of σ: q = a₀p₀ + ··· + a_d₍_m₎₋₁p_d₍_m₎₋₁ with a₀, ···, a_d₍_m₎₋₁ ≥ 0. While this is in general not true for an arbitrary triangulation Inline graphic of HS_m, it is not difficult to show that can be modified (without changing its underlying abstract simplicial complex) to satisfy this property, e.g., by first defining a triangulation of the vertices in considered as points in the Euclidean space ℝ^d⁽^m⁾ using the same abstract simplicial complex as Inline graphic and radially projecting the simplices onto HS_m. For 0 ≤ k ≤ d(m) − 1, will denote the set of k-simplices in , and for a k-simplex σ ∈ , its width δ(σ) is defined as the maximal distance between its vertices, p₀, · · ·, p_k,

δ (σ) = max_{0 \leq i, j \leq k} d_{{HS}_{m}} (p_{i}, p_{j}) .

For a triangulation Inline graphic , we define its width to be the maximal width of its top-dimensional simplices:

σ (T) = max_{σ \in T^{d (m) - 1}} δ (σ) .

The condition number of Inline graphic is then defined as the minimal width of the triangulations that have as its vertex set:

σ (C) = min_{T, T^{0} = \bar{C}} δ (T) .

Since Inline graphic is finite, there exists a triangulation Δ( ) whose width gives the condition number θ( ). We note that 0 < θ( ) < π, and for a good set , the following conditions hold,

For each q ∈ HS_m, there are d(m) elements, p₀, ···, p_d₍_m₎₋₁, in such that q = a₀p₀ + ··· a_d₍_m₎₋₁p_d₍_m₎₋₁ for a₀, ···, a_d₍_m₎₋₁ ≥ 0 and d_{HS_m}(p_i, p_j) < θ( ) for any 0 ≤ i, j ≤ d(m).
For each q ∈ HS_m, there exists p ∈ such that d_{HS_m}(q, p) < θ( ).

Property (1) follows immediately from the definition. Property (2) can be shown to follow from the requirement that if q ∈ σ ∈ Inline graphic , q is a non-negative linear combination of vertices in σ.

Theorem 2.4

Let Inline graphic denote a good finite subset in HS_m and $\sum_{2 m}^{C}$ its associated finitely-generated subcone in H₂_m. Let θ = θ( ) denote the condition number of as defined above and η_m the condition number of $F_{m}^{2}$ . Then, for any polynomial r ∈ Ω₂_m, its L¹-relative approximation error $E_{\sum_{2 m}^{C}} (r)$ satisfies

E_{\sum_{C}^{2 m}} (r) \leq 4 tan θ {sin}^{2} \frac{θ}{2} η_{m}^{2} .

The bound above constitutes our quantitative characterization of the approximation $\sum_{2 m}^{C} \subset Ω_{2 m}$ . Not surprisingly, the bound provided above depends on both the map $F_{m}^{2}$ as well as the set Inline graphic through θ and η. The error measured by $E_{\sum_{C}^{2 m}}$ takes place in H₂_m, and the bound on the right factored into two components with contribution from θ that essentially measures how well an arbitrary point q ∈ HS_m can be approximated using and its associated triangulation Δ( Inline graphic ). In particular, as will be seen from the proof, tan θ arises from approximating q using its nearest neighbor in as in Property (2) above while ${sin}^{2} \frac{θ}{2}$ comes from approximating q using the simplex σ containing it as in Property (1).

We will prove the theorem through a sequence of lemmas given below. However, before delving into the proof, we remark that although using the triangulation Δ( Inline graphic ) to define θ( ) may seem unnecessary at first, it is in fact crucial to have Property (1) in order to produce a smaller bound on the error. For example, it is possible to define θ( ) using only Property (2), i.e., each q ∈ HS_m can be approximated by a p ∈ such that d_{HS_m}(q, p) < θ( Inline graphic ). However, this hypothesis itself is only strong enough to produce the bound given in Lemma 2.6 (Equation 2.4). Disregarding η_m, the bound given in Equation 2.4 is 2 sin θ, which is considerably inferior to the bound of 4 tan θ sin² given in Theorem 2.4. In particular, for small θ, the former is approximately 2θ while the latter is θ³ (See Equation 3.1), two order of magnitude less. As will be clear in the proof, the main issue is to approximate the polynomial q² for any q ∈ HS_m with a sum of squares of polynomials in Inline graphic . Using only Property (2), it is difficult to determine what polynomials in HS_m can be used to approximate q² other than the polynomial p ∈ that is closest to q. With Property (1), we have more choices at our disposal as we can approximate q² using the vertices p_i of the simplex σ that contains q, and more importantly, the remainder of this approximation (sum of (p_i − p_j)²) can be further approximated using polynomials in Inline graphic . This is the content of Lemma 2.8. In particular, when approximating q², Property (1) allows the access of not only the polynomials p_i ∈ that are neighbors of q but also polynomials in that are usually far away from q. See Figure 2.1. Furthermore, as will be detailed in Section 3, Property (1) allows us to formulate a simple method for estimating the minimal number of points (polynomials) in Inline graphic needed for a given precision requirement.

Lemma 2.5

Let p, q be two polynomials in HS_m and θ = d_{HS_m}(p, q) denote their geodesic distance considered as points on the sphere HS_m. We have

\int_{S^{2}} {∣ p (x) - q (x) ∣}^{2} d x \leq 4 {sin}^{2} \frac{θ}{2} η_{m}^{max} .

Proof

Let r = p − q. As a vector in H_m, |r| = |p − q|. Using the law of cosines,

γ = ∣ r ∣ = ∣ p - q ∣ = \sqrt{2 - 2 cos θ} = 2 sin \frac{θ}{2} .

(2.2)

Therefore, r/γ ∈ HS_m, and we have

\int_{S^{2}} r^{2} (x) d x = γ^{2} \int_{S^{2}} {(r (x) / γ)}^{2} d x \leq γ^{2} η_{m}^{max},

and the result follows.

Next we prove an important lemma which shows that for two nearby p, q in HS_m, we can approximate q² using p² such that the L¹-approximation error is a fraction (depending on the geodesic distance) of the L¹-norm of p².

Lemma 2.6

Let p, q be two polynomials in HS_m and θ = d_{HS_m}(p, q) denote their geodesic distance considered as points on the sphere HS_m. Let || Inline graphic (p) − (q)||₁ denote the L¹-difference between (p), (q)

{| | F_{2} (p) - F_{2} (q) | |}_{1} = \int_{S^{2}} ∣ p^{2} (x) - q^{2} (x) ∣ d x .

(2.3)

We have

{| | F_{2} (p) - F_{2} (q) | |}_{1} \leq 2 sin θ η_{m} {| | F_{2} (p) | |}_{1} .

(2.4)

Proof

Using Hölder’s inequality, we have

\begin{array}{l} \int_{S^{2}} ∣ p^{2} (x) - q^{2} (x) ∣ d x = \int_{S^{2}} ∣ p (x) - q (x) ∣ ∣ p (x) + q (x) ∣ d x \\ \leq {(\int_{S^{2}} {∣ p (x) - q (x) ∣}^{2} d x)}^{\frac{1}{2}} {(\int_{S^{2}} {∣ p (x) + q (x) ∣}^{2} d x)}^{\frac{1}{2}} . \end{array}

The proof will proceed to bound the two terms on the right. By the preceding lemma, we have

{(\int_{S^{2}} {∣ p (x) - q (x) ∣}^{2} d x)}^{\frac{1}{2}} \leq 2 sin \frac{θ}{2} \sqrt{η_{m}^{max}} .

For the second term, we will consider the polynomial $r = γ \frac{p + q}{2}$ , where γ > 1 ensures that r ∈ HS_m. A quick calculation shows that $γ = 1 / cos \frac{θ}{2}$ . Since, by definition,

\int_{S^{2}} r^{2} (x) d x \leq η_{m}^{max},

we have

{(\int_{S^{2}} {∣ p (x) + q (x) ∣}^{2} d x)}^{\frac{1}{2}} = {(\int_{S^{2}} \frac{4}{γ^{2}} r^{2} (x) d x)}^{\frac{1}{2}} \leq \frac{2}{γ} \sqrt{η_{m}^{max}} .

Combining the two inequalities, we have

\int_{S^{2}} ∣ p^{2} (x) - q^{2} (x) ∣ d x \leq 4 cos \frac{θ}{2} sin \frac{θ}{2} η_{m}^{max} = 2 sin θ η_{m}^{max} .

Since

{| | F_{2} (p) | |}_{1} = \int_{S^{2}} p^{2} (x) d x \leq η_{m}^{min},

it follows that

{| | F_{2} (p) - F_{2} (q) | |}_{1} \leq 2 sin θ \frac{η_{m}^{max}}{η_{m}^{min}} {| | F_{2} (p) | |}_{1} .

This completes the proof.

We will use the preceding lemma to prove two basic error estimates. For any two points p, q in Inline graphic , the following lemma provides a bound on the approximation error for points that lie on the arc (geodesic path) joining p, q.

Lemma 2.7

Let p, q be two neighboring points in Δ( Inline graphic ), i.e., there is a 1-simplex σ¹ in Δ( ) with p, q as its two vertices. Let r = ap + bq be a convex combination of p, q with a, b ≥ 0 and a + b = 1. If θ = θ( ) denotes the condition number of , then

E_{\sum_{2 m}^{C}} (r^{2}) \leq 2 sin θ {tan}^{2} \frac{θ}{2} η_{m}^{2} .

Proof

By definition of θ, d_{HS_m}(p, q) ≤ θ. Let ϕ = a²p² + b²q² + abp² + abq² be an element in $\sum_{C}^{2 m}$ . We have

\begin{array}{l} ϕ - r^{2} = (a^{2} p^{2} + b^{2} q^{2} + {abp}^{2} + {abq}^{2}) - {(a p + b q)}^{2} \\ = a b {(p - q)}^{2} . \end{array}

Let γ = |p − q| and t = (p − q)/γ ∈ HS_m. There exists s ∈ Inline graphic such that the geodesic distance between t and s is less than θ. By the preceding lemma,

\int_{S^{2}} ∣ t^{2} (x) - s^{2} (x) ∣ d x \leq 2 sin θ η_{m} {| | t^{2} (x) | |}_{1} .

Now let φ = ϕ + abγ²s² be another element in $\sum_{C}^{2 m}$ . We have

\int_{S^{2}} ∣ r^{2} (x) - φ (x) ∣ d x = a b \int_{S^{2}} ∣ {(γ t)}^{2} - {(γ s)}^{2} ∣ d x \leq 2 ab sin θ η_{m} {| | {(γ t)}^{2} (x) | |}_{1} .

By Lemma 2.5, ${∣ {(γ t)}^{2} (x) ∣}_{1} \leq 4 {sin}^{2} \frac{θ}{2} η_{m}^{max}$ . This gives

\int_{S^{2}} ∣ r^{2} (x) - φ (x) ∣ d x \leq 2 sin θ {sin}^{2} \frac{θ}{2} η_{m} η_{m}^{max} .

(2.5)

as $a b \leq \frac{1}{4}$ for a, b ≥ 0 and a + b = 1. We next bound the L¹-norm of r²(x). Since r = ap + bq, there exists $1 \leq γ \leq 1 / cos \frac{θ}{2}$ such that γr ∈ HS_m. This implies that

\int_{W S} γ^{2} r^{2} (x) d x \geq η_{m}^{min},

\int_{S^{2}} r^{2} (x) d x \geq {cos}^{2} \frac{θ}{2} η_{m}^{min} .

(2.6)

Combining Equations 2.5 and 2.6 gives the desired result.

The preceding lemma can be generalized immediately to higher-order convex combinations.

Lemma 2.8

Let p₁, ···, p_k denote the vertices of a k − 1-simplex σ^k⁻¹ in Δ( Inline graphic ) as well as the corresponding homogeneous polynomials in HS_m. Let r = a₁p₁ + ··· +a_kp_k be a convex combination of p₁, ···, p_k with a₁, ···, a_k ≥ 0 and a₁ + ··· + a_k = 1. If θ denote the condition number of , Then

E_{\sum_{2 m}^{C}} (r^{2}) \leq 4 tan θ {sin}^{2} \frac{θ}{2} η_{m}^{2} .

Proof

Expanding r², we have

r^{2} = \sum_{i = 1}^{k} a_{i}^{2} p_{i}^{2} + 2 \sum_{i < j} a_{i} a_{j} p_{i} p_{j} .

The second sum contains $C_{2}^{k} = \frac{k (k - 1)}{2}$ terms. To approximate r² using an element $φ \in \sum_{C}^{2 m}$ , we proceed similarly as before. We start with ϕ equals the first sum above. For each cross-term 2a_ia_jp_ip_j in the second sum, we add $a_{i} a_{j} (p_{i}^{2} + p_{j}^{2})$ to ϕ. This gives

ϕ = \sum_{i = 1}^{k} a_{i}^{2} p_{i}^{2} + \sum_{i \leq j} a_{i} a_{j} (p_{i}^{2} + p_{j}^{2}) .

It follows that

ϕ - r^{2} = \sum_{i < j} a_{i} a_{j} {(p_{i} + p_{j})}^{2} .

Next, we will approximate the squares (p_i − p_j)² using elements in $\sum_{C}^{2 m}$ exactly as before. More specifically, let γ_ij = |p_i − q_j| and t_ij = (p_i − q_j)/γ_ij. There exists s_ij ∈ Inline graphic such that the geodesic distance between t_ij and s_ij is less than θ. Now let φ = ϕ + Σ_i_<_j a_ib_j(γ_ijs_ij)² be an element in $\sum_{C}^{2 m}$ . We have

\int_{S^{2}} ∣ r^{2} (x) - φ (x) ∣ d x \leq \sum_{i < j} a_{i} a_{j} \int_{S^{2}} ∣ {(γ_{i j} t_{i j})}^{2} (x) - {(γ_{i j} s_{i j})}^{2} (x) ∣ d x .

It follows from Equations 2.2 and 2.4 that all the integrals on the right can be uniformly bounded

\int_{S^{2}} ∣ {(γ_{i j} t_{i j})}^{2} (x) - {(γ_{i j} s_{i j})}^{2} (x) ∣ d x \leq 8 sin θ {sin}^{2} \frac{θ}{2} η_{m}^{max} η_{m},

and this gives

\int_{S^{2}} ∣ r^{2} (x) - φ (x) ∣ d x \leq 8 sin θ {sin}^{2} \frac{θ}{2} η_{m}^{max} η_{m} \sum_{i < j} a_{i} a_{j} .

Since a₁ + ··· + a_k = 1,

\begin{array}{l} \sum_{i \leq j} a_{i} a_{j} = \frac{{(a_{1} + \dots + a_{k})}^{2} - (a_{1}^{2} + \dots + a_{k}^{2})}{2} = \frac{1 - (a_{1}^{2} + \dots + a_{k}^{2})}{2} \\ \leq \frac{1 - \frac{1}{k}}{2} = \frac{k - 1}{2 k} < \frac{1}{2} \end{array}

(2.7)

as $a_{1}^{2} + \dots + a_{k}^{2} \geq \frac{1}{k}$ by Cauchy-Schwarz inequality. This yields the bound

\int_{S^{2}} ∣ r^{2} (x) - φ (x) ∣ d x \leq 4 sin θ {sin}^{2} \frac{θ}{2} η_{m}^{max} η_{m} .

(2.8)

We next bound the L¹-norm of r². Given that r = a₁p₁ + ··· a_kp_k, the following lemma shows that the L²-magnitude |r| of the vector r satisfies

∣ r ∣ \geq \sqrt{cos θ} .

Hence, there exists $1 \leq γ \leq \frac{1}{\sqrt{cos θ}}$ such that γr ∈ HS_m. Exactly as before, we have

\int_{S^{2}} r^{2} (x) d x \geq \frac{1}{γ^{2}} η_{m}^{min} \geq cos θ η_{m}^{min} .

(2.9)

Equations 2.8 and 2.9 together complete the proof.

Lemma 2.9

Let Δ denote a k-simplex in ℝ^d⁽^m⁾ whose vertices p₀, ···, p_k are on the unit sphere, i.e., ||p₀||₂ = ··· = ||p_k||₂ = 1. If there exists some α such that 1 > α > 0 and $p_{i}^{⊤} p_{j} \geq α$ for all i ≠ j, then for any x ∈ Δ,

{| | x | |}_{2} > \sqrt{α} .

Proof

Let x = a₁p₁ + ··· a_kp_k with a_i ≥ 0 and a₁ + ··· + a_k = 1. It follows that

x^{⊤} x \geq \sum_{i = 0}^{k} a_{i}^{2} + 2 α \sum_{i < j} a_{i} a_{j} .

Let s = 2Σ_i_<_j a_ia_j and the above inequality becomes x^⊤x ≥ 1 − (1 − α)s. From Equation 2.7, we have $0 \leq s \leq \frac{k - 1}{k} < 1$ . It follows that

x^{⊤} x > 1 - (1 - α) = α .

We remark that when k = 2, $\frac{k - 1}{k} = \frac{1}{2}$ and the bound becomes tighter $x^{⊤} x \geq \frac{1}{2} + \frac{α}{2}$ . This gives the cos θ term in Equation 2.9. Finally, we are ready to complete the proof of Theorem 2.4:

Proof

Since r(x) can be written as a sum of squares, by Proposition 2.10, it can be written as a sum of no more than d(m) terms with p_i ∈ HS_m:

r (x) = \sum_{i = 1}^{d (m)} a_{i} p_{i}^{2} (x) .

Each p_i belongs to a (d(m) − 1)-dimensional simplex σ_i ∈ Δ( Inline graphic ). By the preceding lemma, each $p_{i}^{2}$ can be approximated by an element p̃i in $\sum_{C}^{2 m}$ with uniformly bounded relative L¹-error

{| | p_{i}^{2} (x) - {\tilde{p}}_{i} (x) | |}_{1} \leq C {| | p_{i}^{2} (x) | |}_{1},

where $C = 4 tan θ {sin}^{2} \frac{θ}{2} η_{m}^{2}$ . Define $\tilde{r} \in \sum_{C}^{2 m}$ as

\tilde{r} = \sum_{i = 1}^{d (m)} a_{i} {\tilde{p}}_{i},

and we have

{| | r (x) - \tilde{r} (x) | |}_{1} \leq \sum_{i = 1}^{d (m)} a_{i} {| | p_{i}^{2} (x) - {\tilde{p}}_{i} (x) | |}_{1} \leq C \sum_{i = 1}^{d (m)} a_{i} {| | p_{i}^{2} (x) | |}_{1} .

On the other hand, we also have

{| | r (x) | |}_{1} \leq \sum_{i = 1}^{d (m)} a_{i} \int_{S^{2}} p_{i}^{2} (x) d x = \sum_{i = 1}^{d (m)} a_{i} {| | p_{i}^{2} (x) | |}_{1}

Combining both inequalities yields the desired result.

In the proof above we made use of the following proposition.

Proposition 2.10

Let r denote a homogeneous polynomial of degree 2m that can be written as a sum of squares of homogeneous polynomials of degree m. Then, r can be written as a sum of at most d(m) squares

r (x) = \sum_{i = 1}^{d (m)} a_{i} p_{i}^{2} (x),

where a₁, ···, a_d₍_m₎ ≥ 0 and p₁, ···, p_d₍_m₎ ∈ HS_m.

Proof

Suppose r is a sum of k squares of homogeneous polynomials q̃₁, ···, q̃_k of degree m

r (x) = {\tilde{q}}_{1}^{2} (x) + \dots + {\tilde{q}}_{k}^{2} (x) .

Denote m₁, ···, m_d₍_m₎ the d(m) monomials of degree m, and X the vector

X = {[m_{1} (x), m_{2} (x), \dots, m_{d (m)} (x)]}^{⊤}

whose components are the monomials. It follows that ${\tilde{q}}_{i} (x) = a_{i}^{⊤} X$ with a_i the vector whose components are coefficients of q̃_i(x), and

r (x) = X^{⊤} (a_{1} a_{1}^{⊤} + \dots + a_{k} a_{k}^{⊤}) X = X^{⊤} SX .

The matrix S is symmetric and positive semi-definite with non-negative eigenvalues. Let λ₁, ··· λ_d₍_m₎ denote its complete set of eigenvalues and v₁, ···, v_d₍_m₎ their associated unit eigenvectors, |v_i|₂ = 1. It follows that

S = λ_{1} v_{1} v_{1}^{⊤} + \dots + λ_{d (m)} v_{d (m)} v_{d (m)}^{⊤},

and

\begin{array}{l} r (x) = λ_{1} X^{⊤} v_{1} v_{1}^{⊤} X + \dots + λ_{d (m)} X^{⊤} v_{d (m)} v_{d (m)}^{⊤} X \\ = λ_{1} q_{1}^{2} (x) + \dots λ_{d (m)} q_{d (m)}^{2} (x), \end{array}

where $q_{i} (x) = v_{i}^{⊤} X \in {HS}_{m}$ as |v_i|₂ = 1 for i = 1, ···, d(m).

3. Approximating PSD Tensors of Orders two, four and six

In this section, we apply Theorem 2.4 to derive formulas for the minimal number of generators in $\sum_{2 m}^{C}$ needed to ensure that the approximation $\sum_{2 m}^{C} \subset Ω_{2 m}$ is within a given accuracy requirement. Specifically, the accuracy requirement is presented in the form of the relative L¹-approximation error $E_{2 m}^{C}$ (cf. Equation 2.1): for 0 < ε < 1, we derive a formula that gives the (approximated) minimal number Inline graphic (ε, m) of generators in $\sum_{2 m}^{C}$ such that any r ∈ Ω₂_m can be approximated within ε using $\sum_{2 m}^{C}$ , i.e.,

E_{2 m}^{C} (r) < ε .

For PSD ternary tensors of orders two and four, it is known that they can be written as sums of squares of three tensors of order one and two, respectively. This follows from the well-known result that any ternary positive semi-definite homogeneous polynomial p(x) of degree two and four can be written as a sum of three squares of polynomials of degree one and two, respectively. The quadratic case follows easily from linear algebra while the quartic case follows from the celebrated theorem of Hilbert on ternary quartics [24]. We will first describe a general method for obtaining the formula Inline graphic (ε, m) for any order m, and we will then explicitly work out the three cases m = 1, 2, 3 that are of most interest for various applications.

3.1. Preliminaries

Given a required precision ε > 0, the bound provided by Theorem 2.4 allows us to determine the condition number θ = θ( Inline graphic ) for the point set in HS_m to ensure that the precision requirement is satisfied. The main result in this section is a simple estimate on the number (ε, m) of points in needed to achieve the desired θ on the sphere HS_m. Let $C_{η} (θ) = 4 tan θ {sin}^{2} \frac{θ}{2} η^{2}$ denote the bound given in Theorem 2.4. Since

tan θ {sin}^{2} \frac{θ}{2} = \frac{1}{2} (tan θ - sin θ),

C_η(θ) is a monotonically increasing function for $0 \leq θ \leq \frac{π}{2}$ , and we will denote its inverse by $f_{η} (ε) = C_{η}^{- 1} (ε)$ . f_η can be numerically evaluated and the plots for f_η over the range 0.01 ≤ ε ≤ 0.1 for several different η-values are shown in Figure 3.1. If θ is assumed to be small,

Fig. 3.1 — **Left:** Plots of f_η for η = 1, 2, 4 in red, blue and green, respectively. ε varies from 0.01 to 0.1 and θ is given in degree. **Right:** Comparison plot of (ε, 1) according to Equations 3.7 (in red) and 3.8 (in blue). The estimate using Equation 3.7 is between 17% and 20% less than the estimate using Equation 3.8.

Inline graphic — **Left:** Plots of f_η for η = 1, 2, 4 in red, blue and green, respectively. ε varies from 0.01 to 0.1 and θ is given in degree. **Right:** Comparison plot of (ε, 1) according to Equations 3.7 (in red) and 3.8 (in blue). The estimate using Equation 3.7 is between 17% and 20% less than the estimate using Equation 3.8.

tan θ {sin}^{2} \frac{θ}{2} \approx \frac{θ^{3}}{4} .

(3.1)

Therefore, $4 tan θ {sin}^{2} \frac{θ}{2} η^{2} \approx ε$ implies that

θ \approx {(\frac{ε}{η^{2}})}^{\frac{1}{3}} .

(3.2)

The formula above gives an estimate on the condition number θ = θ( Inline graphic ) given ε and η. We next give an estimate on the size of for the given θ( ). Let n = d(m) − 1 denote the dimension of the sphere HS_m and Δ( ) denote the triangulation associated with . A simplex in Δ( ) is said to be θ-regular if the distance between any pair of its vertices equals θ, and the edge joining any pair of vertices is a geodesics on HS_m. Due to the curvature on the sphere HS_m, it is not possible to cover HS_m with only θ-regular simplices. Therefore, we assume that the n-simplices in Δ( Inline graphic ) are approximately θ-regular in the sense that the geodesic distance between any pair of vertices of a n-simplex in HS_m is approximately θ and the edge joining them is approximately a geodesic as well. For each vertex v in Δ( ), its degree is the number of n-dimensional simplices having it as a vertex. To estimate the number of points in Inline graphic , we will estimate two quantities: the number K of n-dimensional simplices in Δ( ) and the average degree ν of the vertices. The number of points in can then be estimated as

N (ε, m) = # of points in C ≃ \frac{(n + 1) K}{2 ν} .

The occurrence of 2 in the denominator accounts for the fact that points in Inline graphic are located only on a hemisphere.

Estimate on K

Since HS_m is covered by a collection of θ-regular n-simplices, K can be estimated by taking the ratio between the volume of the sphere HS_m and the volume of a θ-regular n-simplex. Since θ is in general assumed to be small, we will approximate the volume of a θ-regular n-simplex on the sphere HS_m with the volume ω_n(θ) of a corresponding θ-regular n-simplex in the Euclidean space ℝⁿ:

ω_{n} (θ) = \frac{\sqrt{n + 1}}{n! \sqrt{2^{n}}} θ^{n} .

(3.3)

It then follows that the number K of n-simplexes can be estimated as

K = \frac{V_{n}}{ω_{n} (θ)},

(3.4)

where the volume of the sphere V_n is given by the formula [25]

V_{n} = {\begin{array}{l} \frac{{(2 π)}^{(n + 1) / 2}}{2 \cdot 4 \dots (n - 1)} & if n is odd; \\ \frac{2 {(2 π)}^{n / 2}}{1 \cdot 3 \dots (n - 1)} & if n is even . \end{array}

Estimate on ν

For a typical vertex v in Δ( Inline graphic ), a small neighborhood around v in HS_m is covered by the θ-regular n-simplices having v as one of their vertices. Again, assuming θ is small, we can approximate this using Euclidean geometry, by transforming the neighborhood U onto the tangent space T_v at v using the log map. The geodesic ball B_θ of radius θ on HS_m is mapped to the Euclidean ball of radius θ and the image of each n-simplex under the log map can be approximated by a regular n-simplex in the Euclidean space with side length θ. See Figure 3.2. It follows that the degree of v can be estimated as the ratio between the volume of the unit n-dimensional ball and the volume of regular n-simplex in ℝⁿ with side length θ. The volume Vⁿ of an n-ball in ℝⁿ with radius r = 1 is given by the formula [25]

Fig. 3.2 — **Left:** For small θ, we can approximate the volume of a θ-regular spherical simplex by the volume of a θ-regular Euclidean simplex. The exponential map **Exp**_p maps a neighborhood of the origin in the tangent space T_p diffeomorphically onto a neighborhood at p. Since the derivative of **Exp**_p at p is the identity, for small enough θ, **Exp**_p is close to an isometry in B_θ. **Right:** The average degree of a vertex, ν, can be approximated by the number of θ-regular simplexes contained in the ball of radius θ.

V^{n} = {\begin{array}{l} \frac{{(2 π)}^{n / 2}}{2 \cdot 4 \dots n} & if n is even; \\ \frac{2 {(2 π)}^{(n - 1) / 2}}{1 \cdot 3 \dots n} & if n is odd . \end{array}

The degree ν is then estimated as

ν = \frac{V^{n}}{ω_{n} (1)} .

(3.5)

Combining Equations 3.3, 3.4, 3.5, we have

\begin{array}{l} N (ε, m) = # of points in C \approx \frac{1}{2} \frac{(n + 1) V_{n}}{ω_{n} (θ) \frac{V^{n}}{ω_{n} (1)}} = \frac{(n + 1) V_{n}}{2 V^{n} θ^{n}} \\ = \frac{(n + 1) V_{n}}{2 V^{n}} f_{η} {(ε)}^{- n} . \end{array}

(3.6)

In the remaining section, we will work out the implication of the above estimate for 2^nd, 4^th and 6^th-order tensors.

3.2. Second-Order Tensors

A quadratic homogeneous polynomial P(x, y, z) in ℝ³ has six coefficients P(x, y, z) = ax² + by² + cz² + dxy + exz + fyz. It can be written in a matrix form as,

P (x, y, z) = [\begin{array}{l} x & y & z \end{array}] [\begin{array}{l} a & \frac{d}{2} & \frac{e}{2} \\ \frac{d}{2} & b & \frac{f}{2} \\ \frac{e}{2} & \frac{f}{2} & c \end{array}] [\begin{array}{l} x \\ y \\ z \end{array}] = x^{⊤} Sx .

Positive semi-definiteness of the polynomial P(x, y, z) is equivalent to the positive semi-definiteness of the matrix S. It follows that determining positive semi-definiteness of a homogeneous quadratic polynomial is straightforward by examining eigenvalues of S: S is positive semi-definite if and only its eigenvalues λ₁, λ₂, λ₃ are all non-negative and S can be written as

S = λ_{1} v_{1}^{⊤} v_{1} + λ_{2} v_{2}^{⊤} v_{2} + λ_{3} v_{3}^{⊤} v_{3},

where v_i is the unit eigenvector with eigenvalue λ_i for i = 1, 2, 3. It follows that P(x, y, z) can be written as a sum of three linear polynomials p₁(x), p₂(x), p₃(x),

P (x) = p_{1} {(x)}^{2} + p_{2} {(x)}^{2} + p_{3} {(x)}^{2},

with $p_{i} (x) = \sqrt{λ_{i}} v^{⊤} x$ .

With m = 1, the sphere HS_m has dimension n = 2. According to Proposition 2.3, the map $F_{1}^{2}$ is isotropic with respect to the L¹-norm and η = 1. Equation 3.6 (together with Equation 3.2) then gives

N (ε, 1) \approx \frac{3 V_{2}}{2 V^{2}} {(\frac{1}{ε})}^{\frac{2}{3}} = 6 {(\frac{1}{ε})}^{\frac{2}{3}} .

(3.7)

More Precise Estimate

For the linear case m = 1, since HS_m is the two-sphere S², its geometry is well-known and a better estimate on N can be obtained. Given θ, S² is covered by geodesic triangles whose sides have lengths of approximately θ. Approximating the areas of these geodesic triangles with the area of an Euclidean equilateral triangles with side θ gives $θ^{2} \sqrt{3} / 4$ . Let F, E, V denote the number of triangles, edges and vertices in the triangulation Δ( Inline graphic ). According to Euler’s formula

F - E + V = χ (S^{2}) = 2

where χ(S²) is the Euler characteristic of S². Since E = 3F/2, V = 2 + F/2 ≈ F/2. This gives ν = 6 as the average degree of a vertex on S². Our estimate on the degree ν in Equation 3.5 in this case gives $ν = 4 π / \sqrt{3} \approx 7.2$ , which gives a 20% overestimate.

The area A of a geodesic triangle on S² with three interior angles α, β, γ is given as [1]

A = α + β + γ - π .

In particular, for a geodesic equilateral triangle on S² with side length θ, its angle α is given as

α = {cos}^{- 1} (\frac{cos θ - {cos}^{2} θ}{{sin}^{2} θ}),

and the estimate on the number of triangles is

K = \frac{4 π}{3 {cos}^{- 1} (\frac{cos θ - {cos}^{2} θ}{{sin}^{2} θ}) - π} .

Let $4 tan θ {sin}^{2} \frac{θ}{2} = ε$ and θ = f(ε) be the solution to the trigonometric equation. It then follows that

N (ε, 1) = \frac{π}{3 {cos}^{- 1} (\frac{cos (f (ε)) - {cos}^{2} (f (ε))}{{sin}^{2} (f (ε))}) - π} .

(3.8)

In Figure 3.1, we compare the two estimates using Equations 3.8 and 3.7. For ε = 0.1, Equation 3.7 gives Inline graphic ≈ 30. And for ε = 0.01 and 0.001, it gives ≈ 130 and 600, respectively. As for Equation 3.8 it gives ≈ 34, 156, 725 for ε = 0.1, 0.01, 0.001, respectively.

3.3. Fourth-Order Tensors

In this case, m = 2 and H₂ and HS₂ have dimensions six and five, respectively. The map $F_{2}^{2}$ is no longer isotropic with respect to L¹-norm in HS₂. An analytic evaluation of the matrix Λ² gives

Λ^{2} = \frac{4 π}{5} (\begin{matrix} 1 & 1 / 3 & 1 / 3 & 0 & 0 & 0 \\ 1 / 3 & 1 & 1 / 3 & 0 & 0 & 0 \\ 1 / 3 & 1 / 3 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 / 3 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 / 3 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 3 \end{matrix}) .

The singular values of Λ² arranged in the descending order are

σ (Λ^{2}) = \frac{4 π}{3} [1, 2 / 5, 2 / 5, 1 / 5, 1 / 5, 1 / 5] .

This gives η = 5, and Equation 3.6 gives

N \approx \frac{3 V_{5}}{V^{5}} f_{η = 5} {(ε)}^{- 5} = \frac{90 π}{16} f_{η = 5} {(ε)}^{- 5} .

For ε = 0.1, this yields N ≈ 176790. However, in H₂, the polynomial v(x, y, z) = x²+y²+z² is the constant function 1 on S². In particular, $u (x, y, z) = v (x, y, z) / \sqrt{3} \in {HS}_{2}$ , and ${| | F_{2}^{2} (u) | |}_{1} = 4 π / 3$ . The map $F_{2}^{2}$ stretches the constant polynomial considerably more than any other quadratic polynomials, and this is the reason for the large condition number η. Let ℝu denote the one-dimensional subspace in H₂ spanned by the constant polynomial u(x, y, z), and W its orthogonal complement,

H_{2} = R u \oplus W .

The intersection of the sphere HS₂ with the subspace W is a four-sphere S⁴. If we specialize to this four-sphere, i.e., polynomials orthogonal to the constant polynomial x² + y² + z², the condition number η becomes 2 and the dimension of the sphere drops by one. Theorem 2.4 then provides the following estimate on the number of points

N \approx \frac{5 V_{4}}{2 V^{4}} f_{η = 2} {(ε)}^{- 4} = \frac{40}{3} f_{η = 2} {(ε)}^{- 4} .

This number is considerably less than 176790. For example, for ε = 0.1, we have Inline graphic ≈ 1800 and for ε = 0.05, 0.01, ≈ 4670, 39620, respectively.

3.4. Sixth-Order Tensors

In this case, m = 3 and H₃, HS_m have dimensions 10, 9, respectively. The map $F_{3}^{2}$ is again non-isotropic with respect to L¹-norm in H₆. The singular values of Λ³ arranged in the descending order are

σ (Λ^{3}) = 4 π [\frac{19 + \sqrt{193}}{210}, \frac{19 + \sqrt{193}}{210}, \frac{19 + \sqrt{193}}{210}, \frac{19 - \sqrt{193}}{210}, \frac{19 - \sqrt{193}}{210}, \frac{19 - \sqrt{193}}{210}, 2 / 105, 2 / 105, 2 / 105, 1 / 105] .

The condition number η = 16.44, which is quite substantial. However, similar analysis as above can be applied to eliminate polynomials in HS₃ coming from polynomials of lower degree to substantially decrease the condition number. First, the three linear polynomials x, y, z are now embedded in H₃ as x(x² + y² + z²), y(x² + y² + z²), z(x² + y² + z²). Let r̂(x), ŝ(x), t̂(x), r(x), s(x), t(x) be the following polynomials

\begin{array}{l} \hat{r} (x) = x (x^{2} + y^{2} + z^{2}) / \sqrt{3}, r (x) = 0.7184 x^{3} + 0.3951 \hat{r} (x), \\ \hat{s} (x) = y (x^{2} + y^{2} + z^{2}) / \sqrt{3}, s (x) = 0.7184 y^{3} + 0.3951 \hat{s} (x), \\ \hat{t} (x) = z (x^{2} + y^{2} + z^{2}) / \sqrt{3}, t (x) = 0.7184 z^{3} + 0.3951 \hat{t} (x) . \end{array}

The three polynomials r(x), s(x), t(x) are responsible for the three largest singular values of Λ³. The smallest singular value of 4π/105 comes from the polynomial q(x) = xyz. Let W denote the six-dimensional subspace in H₃ that is the orthogonal complement of the subspace spanned by r(x), s(x), t(x) and q(x),

H_{3} = R r \oplus R s \oplus R t \oplus R q \oplus W .

The sphere in W is five-dimensional, and the condition number of $F_{3}^{2}$ on S⁵ is η = 1.2769.

N \approx \frac{3 V_{5}}{V^{5}} f_{η = 1.27} {(ε)}^{- 5} = \frac{90 π}{16} f_{η = 1.27} {(ε)}^{- 5} .

For ε = 0.1, 0.05, 0.01, the result above gives Inline graphic ≈ 1943, 6021, 85495, respectively.

4. Experimental Results

In this section we experimentally validate the proposed theory and at the end of this section we present an application to Diffusion-Weighted MRI. In all the experiments we use tensors in ℝ³, which can be visualized by plotting the corresponding homogeneous polynomial P(x, y, z) as a spherical function (see Fig. 4.1). Such tensor glyphs can be generated by scaling the radius of a unit sphere at orientation x = [x y z]^T with the value of P(x, y, z). Additionally, we assign a color to each tensor glyph by using the following coloring scheme: we use the method in [11, 22] to compute the unit vector [x y z]^T that maximizes P(x, y, z) and then we assign to the R, G, B color channels the squares of the three components in the vector x (i.e. R = x², G = y², B = z²). This color map produces smooth color transitions when visualizing fields of tensors such as the diffusion tensor fields.

Fig. 4.1 — Examples of randomly computed symmetric positive semi-definite tensors in Ω₂, Ω₄, Ω₆. The tensor glyphs are shown.

First, we construct a dataset with samples from Ω₂_m as follows: we first generate random vectors in ℝ^d⁽^m⁾ using the normal distribution N(μ = 0, σ² = 1) in d(m) = 3, 6, and 10 dimensions, and we use them as coefficients of linear, quadratic and cubic homogeneous polynomials p ∈ HS₁, HS₂, HS₃ in three variables, respectively. Then we construct 2^nd, 4^th and 6^th-order positive semi-definite tensors that belong to Ω₂_m by taking sums of squares of the polynomials in HS₁, HS₂, HS₃, respectively. This process is repeated for 5000 times for each order, producing a dataset of 15000 tensors in total. Several of the generated tensors are shown in Fig. 4.1(right). The primary goal of the aforementioned process is to generate samples from Ω₂_m in order to test the error analysis presented in Section 3, and it should not be perceived as a DW-MRI simulation as in this section we do not discuss any application of the proposed method to DW-MR imaging.

In order to investigate how many generators in the finitely-generated cone $\sum_{2 m}^{C}$ are necessary for our algorithm to approximate accurately a set of given tensors, we apply our framework to the previously described synthetic dataset using finite subsets Inline graphic ∈ HS_m of various sizes N. The sets are constructed as the vertices computed by triangulating the unit n-sphere. The triangulation is based on a variation of the algorithm for mesh generation presented in [39], which extends to any dimension n of the n-sphere. This method is an iterative force-based technique that uses a force displacement function to move the nodes of the mesh and the Delaunay triangulation [14], which is a fundamental and widely used triangulation process, to adjust the topology (i.e. the edges). Obviously, in our particular case we discard the edge information since we only need the finite set of nodes. This algorithm produces at the end the finite subsets Inline graphic ∈ HS_m for different predefined sizes N.

We first use the constructed finite sets C in a numerical framework for approximating the error rate ε achieved by the finitely-generated cone $\sum_{2 m}^{C}$ for m = 1, 2, 3. The numerical calculations were performed by randomly generating points in the n-sphere and testing if each point lies inside or outside the cone $\sum_{2 m}^{C}$ . The error rate ε is the ratio of the points outside the cone over the total number of generated points. For each numerical computation we used 100k points. The numerical approximations are shown as circles in Fig. 4.2. By observing the figures we can see that in most of the cases the numerical approximations are close to the proposed formulas for computing N. We should note that the results are based on the computed sets Inline graphic using the method in [39]. One may expect that the results will be slightly different if another method is employed for triangulating the n-sphere.

Fig. 4.2 — Comparison of the proposed formulas for computing N for m=1,2,3 with results produced using a numerical approximation algotithm. The horizontal axis show the the accuracy achieved by N finite generators (vertical axis) in the unit n-sphere. The circles show the numerical results produced for specific sets of various sizes N.

We also use the sets Inline graphic in a non-negative least squares (NNLS) optimization framework [28] in order to estimate tensors from the finitely-generated cone $\sum_{2 m}^{C}$ that approximate the given 15000 tensors. For each order of tensors, the NNLS system is formulated as Aw = b, where A a matrix constructed from Inline graphic , w the unknown solution vector and b contained the values of the given positive-semidefinite homogeneous polynomial at K = 81 three-dimensional unit vectors x₁ ··· x₈₁ (producing 81 components of b as b₁ = P (x₁) ··· b₈₁ = P (x₈₁)) for each tensor in the dataset. Although this problem seems extremely unconstrained in general, in our particular case the NNLS algorithm by definition constrains the number of non-zero elements in the solution vector to be at most d(2m), which is significantly smaller than the number of data points K in all of our experiments. In order to estimate such a constrained solution the NNLS algorithm implements a basis selection mechanism that starts with a set of possible basis vectors in Inline graphic , computes the associated dual vector, and then reselects the basis in the solution by iteratively performing swaps in order to minimize the entries in the dual vector until they are all non positive. In our particular case of m = 1, 2, 3 the estimated unknown non-zero entries are 6, 15, 28 respectively which are all significantly smaller than the number of given samples K = 81. For a detailed description of the NNLS algorithm the reader is referred to [28].

The solutions w provide tensors in $\sum_{2 m}^{C}$ that approximate the given tensors in Ω₂_m, for m = 1, 2, and 3. The computed tensors are compared to the ground truth (given) tensors using the relative L¹-error (fitting error):

\frac{\int_{S^{2}} ∣ P_{given} (x) - P (x) ∣ d x}{\int_{S^{2}} ∣ P_{given} (x) ∣ d x} .

(4.1)

The histograms of the errors found in the experiments (measured by Eq. 4.1) are plotted in Fig. 4.3 for the case of 2^nd, 4^th, and 6^th-order tensors, respectively. Obviously, by increasing Inline graphic , i.e. the number of generators in the finitely-generated subcone $\sum_{2 m}^{C}$ , the error decreases correspondingly. The table in Fig. 4.3 reports the mean errors for various difference sizes N of the generator set.

The experimental results presented in Fig. 4.3 and Fig. 4.4 validate empirically our method as the results corroborate well with our previous analysis on the number of generators required for a given relative error bound. For 2^nd-order tensors, the analysis in Section 3 shows that for the error to be less than ε = 10%, 1%, 0.1%, it requires approximately N ≈ 30, 130 and 600 generators, respectively. The first plot in Fig. 4.3 shows that with N = 45, there are no occurrences of error greater than 10%, and with N = 150, there are no occurrences of error greater than 1%. With 321 generators, the error becomes negligible. For 4^th-order tensors, our analysis shows that for the error to be less than ε = 10%, 5%, it requires approximately N ≈ 1800, 4670 generators, respectively. This can be seen from the second plot in Fig. 4.3. With N < 1500 generators, there are occurrences of 10% error, and with N ≥ 1500, there are no occurrences of error greater than 10%. To decrease the error under 5% level, the plot shows that we need at least N = 3000 generators. Finally, for 6^th-order tensors, our analysis shows that for the error to be less than ε = 10% and 5%, it requires approximately N ≈ 1943 and 6021 generators, respectively. The third plot in the figure show that at N = 3000, there is only a small percentage of errors greater than 10%, and with N = 6000, there is an even smaller percentage (less than 1%) of errors greater than 5%. In most cases, our earlier analysis underestimate the required numbers of generators, and this is not surprising as these analysis are themselves based on several approximations. Nevertheless, the experimental results do agree in general with the predictions made in Section 3.

Figure 4.4 shows the running time of the optimization method for fitting one tensor versus the approximation error for various orders and number of generators N in the set Inline graphic . The running times are measured using an Intel Pentium Dual CPU at 1.60 GHz and 1GB RAM. The plots demonstrate that the proposed technique can efficiently estimate positive tensors of various orders. More specifically, 2^nd, 4^th, and 6^th-order tensors can be estimated using finitely-generated subcones of size N = 45, N = 900, and N = 6000 at 0.5ms, 12ms, and 243ms, respectively.

4.1. Application: Diffusion-Weighted MRI

Finally, we present an application of the proposed tensor approximation theory to Diffusion-Weighted MRI (DW-MRI). In several DW-MRI processing methods, a diffusion tensor is computed from the acquired diffusion-weighted signals. Negative diffusion values are non-physical; therefore, appropriate methods such as our proposed framework are necessary to ensure positive semi-definiteness of the estimated Diffusion tensors.

In order to demonstrate the necessity for estimating tensors with the positivity constraints, we compare our method with an existing one that computes tensors without the constraints [35]. In this experiment, we use the aforementioned synthetic dataset of 6^th-order tensors, and we sample the corresponding homogeneous polynomials using K = 81 3-dimensional unit vectors x₁ ··· x₈₁ in the Stejskal-Tanner model [45], producing 81 DW-MRI samples for each tensor in the dataset. Various levels of Rician noise are added to the samples with standard deviations ranging from σ = 0.04 up to σ = 0.12. The noisy datasets are given as inputs to: a) the proposed algorithm (using N = 6000), and b) the method proposed in in [35], which is one of the several existing methods in the literature [15, 19] that estimate 6^th-order tensors. For both, the computed 6^th – order tensors P(x) are compared to the ground truth tensors using the error defined in Eq. 4.1.

Figure 4.5 shows the comparison of the fitting errors between the two methods for various levels of noise in the data. The results conclusively demonstrate that tensors estimated using positivity constraints approximate the data significantly better than the ones without. We also note that this result agrees with similar comparisons reported earlier for tensors of lower orders (e.g. 4^th-order comparison in [5]), showing that the errors incurred in approximating positive-valued functions are significantly smaller when positivity constraints are enforced in the process. Our current results have provided further evidence that supports the importance of imposing positivity constraints in this context.

Fig. 4.5 — Comparison of the 6^th-order tensor fitting errors obtained by the proposed method and the technique in [35] for various Rician noise levels in the data.

In order to illustrate the performance of our framework on real data sets, we applied the method to a DW-MRI data set of an excised rat hippocampus (shown in Fig. 4.6). The data set contains 46 images acquired using a pulsed gradient spin echo pulse sequence, with 45 different diffusion gradients and approximate b value of 1250s/mm². Figure 4.6 shows the computed 6^th-order diffusion tensor field. The highlighted regions of interest demonstrate the variability of the estimated structures. At each voxel, the fiber orientations can be estimated from the peaks of the displacement probability, which can be computed from the diffusion tensors as was shown in [5].

Fig. 4.6 — DW-MRI dataset from an isolated rat hippocampus. The image without diffusion weighting (S₀) is shown on the top left. The 6^th-order diffusion tensors estimated by the proposed method are shown as a field of spherical functions. The three regions of interest depict 6^th-order diffusion tensors that model one, two, and three fiber structures.

Finally, Fig. 4.7 presents the results obtained by applying our method to a DW-MRI dataset from an excised rat optic chiasm. The data acquisition protocol was the same as in the rat hippocampus dataset. The computed field of 4^th-order diffusion tensors is shown in the center. Using the estimated diffusion tensors, we can compute the underlying fiber orientations by computing the orientations that correspond to the maxima of the water molecule displacement probabilities. The computed fiber orientations are shown on the right and they agree with the known fiber orientations in the optic chiasm. Further quantitative validations of these orientations with respect to those from histology will be performed as part of our future work.

5. Discussion and conclusions

Symmetric positive semi-definite tensors have been used in many applications. Although there are existing methods for imposing positivity constraints on the estimated tensors of order two and four, none of these techniques can be easily extended to higher orders. In this paper, we presented a framework for estimating PSD tensors of any order by approximating the space (cone) of PSD tensors with a finitely-generated subcone Σ₂_m. We discussed in detail the geometry of the higher-order tensors, and we presented an explicit characterization of the approximation, using the subset of semi-definite tensors that can be written as a sum of squares of tensors of order m. This approximation leads to a non-negative linear least-squares (NNLS) optimization problem, which can be efficiently solved, as it was demonstrated using synthetic datasets and real diffusion-weighted MR images.

An interesting property of the NNLS optimization algorithm is that it produces sparse solution vectors. In our particular case, although the problem seems significantly unconstrained, the solution vector contains at most d(2m) non-zero weights, which corresponds to the rank of the basis matrix. Therefore if the finitely-generated set Inline graphic contains a few thousands bases, the algorithm will select only 6, 15, 28 for tensors of order 2, 4, and 6 respectively. Note that the number of non-zero weights in the solution vector equals to the number of the unique unknown parameters of the symmetric tensor in each case. The sparsity of NNLS in comparison with other optimization techniques for modeling the diffusion-weighted MR signal has also been studied in [27].

In our experiments the sets Inline graphic were generated by tessellating the unit n-sphere using the iterative force-based technique in [39]. The vertices produced by this algorithm form the finite subset ∈ HS_m for different predefined sizes N. An alternative approach could involve constructing as a finite dictionary of elements in HS_m by running a training algorithm on a control dataset [31]. A finite set of diffusion basis for multi-fiber reconstruction is also employed by the method in [43].

One of the advantages of the proposed algorithm is that it enforces positive semi-definite constraints to the estimated tensors. The need for positivity constraints in DW-MRI has been demonstrated in [6] and [5]. It has been shown that unconstrained methods may yield negative diffusivities in real datasets, especially in voxels with high anisotropy or in the presence of noise in the data.

Finally, although high order tensors can approximate several distinct fiber orientations, in the current standard clinical settings for DW-MRI acquisition most of the multi-fiber reconstruction techniques cannot estimate more than two fiber orientations [41], due to the low diffusion weighting (b-value) and the small number of gradient orientations. However, theoretically or in experimental settings with higher b-values and larger sets of diffusion gradient orientations, the proposed technique can estimate up to 2 and 3 distinct fiber orientations using tensors of order 4 and 6 respectively, which also agrees with the results presented in [35].

Fig. 3.3 — The geometry of the map $F_{3}^{2}$ . **Left: HS**₃ is the nine-dimensional sphere S9. The decomposition of H₃ into four subspaces of dimensions of 3, 3, 3, 1 respectively implies that HS₃ contains separate copies of sphere S², S², S² and S⁰. $F_{3}^{2}$ maps these spheres to spheres of radii 52π/83, 68π/699, 8π/105 and 4π/105, respectively. **Right:** The number of generators in Σ₂, Σ₄ and Σ₆ that can ensure the given accuracy requirement. The plots for m = 1, 2, 3 are in red, blue and green, respectively.

Footnotes

This research was supported by the NIH grant EB007082 & NSF066340 to BCV.

References

1.Abramowitz M, Stegun IA. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. New York: Dover; 1972. [Google Scholar]
2.Aganj Iman, Lenglet Christophe, Sapiro Guillermo. Odf reconstruction in q-ball imaging with solid angle consideration. ISBI; 2009. pp. 1398–1401. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Alexander Daniel C. Maximum entropy spherical deconvolution for diffusion MRI. IPMI; 2005. pp. 76–87. [DOI] [PubMed] [Google Scholar]
4.Astola L, Florack L. Finsler geometry on higher order tensor fields and applications to high angular resolution diffusion imaging. Scale Space and Variational Methods in Computer Vision. 2009:224–234. [Google Scholar]
5.Barmpoutis A, Hwang MS, Howland D, Forder JR, Vemuri BC. Regularized positive-definite fourth-order tensor field estimation from DW-MRI. NeuroImage. 2009;45:153–162. doi: 10.1016/j.neuroimage.2008.10.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Barmpoutis Angelos, Jian Bing, Vemuri Baba C, Shepherd Timothy M. Symmetric positive 4th order tensors and their estimation from diffusion weighted MRI. IPMI. 2007;4584:308–319. doi: 10.1007/978-3-540-73273-0_26. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Barmpoutis A, Kumar R, Vemuri BC, Banerjee A. Beyond the Lambertian assumption: A generative model for apparent BRDF fields of faces using anti-symmetric tensor splines. Proceedings of CVPR08: IEEE Conference on Computer Vision and Pattern Recognition; 2008. pp. 1–6. [Google Scholar]
8.Barmpoutis A, Vemuri BC, Forder JR. Fast displacement probability profile approximation from hardi using 4th-order tensors. Proceedings of ISBI08: IEEE International Symposium on Biomedical Imaging; 2008. pp. 911–914. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Barmpoutis A, Vemuri BC, Shepherd TM, Forder JR. Tensor splines for interpolation and approximation of DT-MRI with applications to segmentation of isolated rat hippocampi. TMI: Transactions on Medical Imaging. 2007;26:1537–1546. doi: 10.1109/TMI.2007.903195. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Basser PJ, Mattiello J, Lebihan D. Estimation of the Effective Self-Diffusion Tensor from the NMR Spin Echo. J Magn Reson B. 1994;103:247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]
11.Bloy L, Verma R. On computing the underlying fiber directions from the diffusion orientation distribution function. In the proceedings of MICCAI; 2008. pp. 1–8. [DOI] [PubMed] [Google Scholar]
12.Boyd SP, Vandenberghe L. Convex optimization. Cambridge University Press; 2004. [Google Scholar]
13.Cho Kuan-Hung, Yeh Chun-Hung, Tournier Jacques-Donald, Chao Yi-Ping, Chen Jyh-Horng, Lin Ching-Po. Evaluation of the accuracy and angular resolution of q-ball imaging. NeuroImage. 2008;42:262–271. doi: 10.1016/j.neuroimage.2008.03.053. [DOI] [PubMed] [Google Scholar]
14.Delaunay B. Sur la sphre vide. Izvestia Akademii Nauk SSSR, Otdelenie Matematicheskikh i Estestvennykh Nauk. 1934;7:793800. [Google Scholar]
15.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Apparent diffusion coefficients from high angular resolution diffusion imaging: Estimation and applications. Magnetic Resonance in Medicine. 2006;56:395–410. doi: 10.1002/mrm.20948. [DOI] [PubMed] [Google Scholar]
16.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Regularized, fast and robust analytical q-ball imaging. MRM. 2007;58:497–510. doi: 10.1002/mrm.21277. [DOI] [PubMed] [Google Scholar]
17.Descoteaux Maxime, Deriche Rachid, Le Bihan Denis, Mangin Jean-Francois, Poupon Cyril. Diffusion propagator imaging: Using laplace’s equation and multiple shell acquisitions to reconstruct the diffusion propagator. IPMI; 2009. pp. 1–13. [DOI] [PubMed] [Google Scholar]
18.Fletcher PT, Lu Conglin, Pizer SM, Joshi Sarang. Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Transactions on Medical Imaging. 2004;23:995–1005. doi: 10.1109/TMI.2004.831793. [DOI] [PubMed] [Google Scholar]
19.Florack LMJ, Balmachnov Sizykh EG. Two canonical representations for regularized high angular resolution diffusion imaging. MICCAI Workshop on Computational Diffusion MRI; 2008. pp. 94–105. [Google Scholar]
20.Ghosh A, Descoteaux M, Deriche R. Riemannian framework for estimating symmetric positive definite 4th order diffusion tensors. Proceedings of MICCAI; 2008. pp. 858–865. [DOI] [PubMed] [Google Scholar]
21.Ghosh A, Moakher M, Deriche R. Ternary quartic approach for positive 4th-order diffusion tensors revisited. Proceedings of ISBI; 2009. pp. 618–621. [Google Scholar]
22.Ghosh A, Tsigaridas E, Descoteaux M, Comon P, Mourrain B, Deriche R. A polynomial based approach to extract the maxima of an antipodally symmetric spherical function and its application to extract directions from the orientation distribution function in diffusion MRI. Workshop on Computational Diffusion MRI; MICCAI. 2008. [Google Scholar]
23.Han D, Qi L, Wu EX. Extreme diffusion values for non-gaussian diffusions. Optimization Methods and Software. 2008;23:703–716. [Google Scholar]
24.Hilbert D. Über die darstellung definiter formen als summe von formenquadraten. Math Ann. 1888;32:342–350. [Google Scholar]
25.Huber Greg. Gamma function derivation of n-sphere volumes. Am Math Monthly. 1982;89:301–302. [Google Scholar]
26.Jensen JH, Helpern JA, Ramani A, Lu H, Kaczynski K. Diffusional kurtosis imaging: The quantification of non-gaussian water diffusion by means of magnetic resonance imaging. MRM. 2005;53:1432–1440. doi: 10.1002/mrm.20508. [DOI] [PubMed] [Google Scholar]
27.Jian B, Vemuri BC. Multi-fiber reconstruction from diffusion mri using mixture of wisharts and sparse deconvolution. In the proceedings of IPMI; 2007. pp. 384–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Lawson CL, Hanson RJ. Solving Least Squares Problems. Prentice-Hall; 1974. [Google Scholar]
29.Lenglet C, Rousson M, Deriche R. DTI segmentation by statistical surface evolution. IEEE Trans Med Imaging. 2006;25:685–700. doi: 10.1109/tmi.2006.873299. [DOI] [PubMed] [Google Scholar]
30.Liu C, Acar B, Moseley ME. Characterizing non-gaussian diffusion by using generalized diffusion tensors. Magnetic Resonance in Medicine. 2004;51:924–937. doi: 10.1002/mrm.20071. [DOI] [PubMed] [Google Scholar]
31.Mallat S, Zhang Z. Matching pursuits with time-frequency dictionaries. Trans on Signal Processing. 1993;41:3397–2415. [Google Scholar]
32.Moakher M. Fourth-order cartesian tensors old and new facts, notions applications. Quarterly Journal of Mechanics and Applied Mathematics. 2008;61:181–203. [Google Scholar]
33.Moakher M. The algebra of fourth-order tensors with applications to diffusion MRI. In: Laidlaw D, Weickert J, editors. Visualization and Processing of Tensor Fields. 2009. pp. 57–80. [Google Scholar]
34.Moakher M, Norris AN. The closest elastic tensor of arbitrary symmetry to an elasticity tensor of lower symmetry. Journal of Elasticity. 2006;85(3):215–263. [Google Scholar]
35.Ozarslan E, Mareci TH. Generalized diffusion tensor imaging and analytical relationships between DTI and HARDI. MRM. 2003;50:955–965. doi: 10.1002/mrm.10596. [DOI] [PubMed] [Google Scholar]
36.Ozarslan E, Vemuri BC, Mareci TH. Generalized scalar measures for diffusion MRI using trace, variance, and entropy. Magn Reson Med. 2005;53:866–76. doi: 10.1002/mrm.20411. [DOI] [PubMed] [Google Scholar]
37.Pasternak O, Sochen N, Basser PJ. The effect of metric selection on the analysis of diffusion tensor mri data. NeuroImage. 2010;49:2190–2204. doi: 10.1016/j.neuroimage.2009.10.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Pennec X, Fillard P, Ayache N. A Riemannian framework for tensor computing. International Journal of Computer Vision. 2005;65 doi: 10.1007/11566489_116. [DOI] [PubMed] [Google Scholar]
39.Persson PO, Strang G. A simple mesh generator in matlab. SIAM Review. 2004;46:329–345. [Google Scholar]
40.Petrovic V. Concircular curcature tensor. Pub Inst Math. 1979;25:131–137. [Google Scholar]
41.Prckovska V, et al. Optimal acquisition schemes in high angular resolution diffusion weighted imaging. In the proceedings of MICCAI; 2008. pp. 9–17. [DOI] [PubMed] [Google Scholar]
42.Qi L, Han D, Wu EX. Principal invariants and inherent parameters of diffusion kurtosis tensors. Journal of Mathematical Analysis and Applications. 2009;349:165–180. [Google Scholar]
43.Ramirez-Manzanares A, et al. Diffusion basis functions decomposition for estimating white matter intravoxel fiber geometry. IEEE Transactions on Medical Imaging. 2007;26:1091–1102. doi: 10.1109/TMI.2007.900461. [DOI] [PubMed] [Google Scholar]
44.Schultz T, Seidel HP. Estimating crossing fibers: A tensor decomposition approach. IEEE Trans Vis Comput Graph. 2008;14:1635–1642. doi: 10.1109/TVCG.2008.128. [DOI] [PubMed] [Google Scholar]
45.Stejskal EO, Tanner JE. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. Journal of Chemical Physics. 1965;42:288–292. [Google Scholar]
46.Wang Wei, Gao Jinghuai, Li Kang. Structure-adaptive anisotropic filter with local structure tensors. Intelligent Information Technology Applications, 2007 Workshop on; 2008. pp. 1005–1010. [Google Scholar]
47.Wang Z, Vemuri BC. DTI segmentation using an information theoretic tensor dissimilarity measure. IEEE Transactions on Medical Imaging. 2005;24:1267–1277. doi: 10.1109/TMI.2005.854516. [DOI] [PubMed] [Google Scholar]
48.Wang Zhizhou, Vemuri Baba C, Chen Yunmei, Mareci Thomas H. A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from complex dwi. IEEE Trans Med Imaging. 2004;23:930–939. doi: 10.1109/TMI.2004.831218. [DOI] [PubMed] [Google Scholar]
49.Yassine I, McGraw T. 4th order diffusion tensor interpolation with divergence and curl constrained bezier patches. In Proceedings of ISBI; 2009. pp. 634–637. [Google Scholar]

[R1] 1.Abramowitz M, Stegun IA. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. New York: Dover; 1972. [Google Scholar]

[R2] 2.Aganj Iman, Lenglet Christophe, Sapiro Guillermo. Odf reconstruction in q-ball imaging with solid angle consideration. ISBI; 2009. pp. 1398–1401. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Alexander Daniel C. Maximum entropy spherical deconvolution for diffusion MRI. IPMI; 2005. pp. 76–87. [DOI] [PubMed] [Google Scholar]

[R4] 4.Astola L, Florack L. Finsler geometry on higher order tensor fields and applications to high angular resolution diffusion imaging. Scale Space and Variational Methods in Computer Vision. 2009:224–234. [Google Scholar]

[R5] 5.Barmpoutis A, Hwang MS, Howland D, Forder JR, Vemuri BC. Regularized positive-definite fourth-order tensor field estimation from DW-MRI. NeuroImage. 2009;45:153–162. doi: 10.1016/j.neuroimage.2008.10.056. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Barmpoutis Angelos, Jian Bing, Vemuri Baba C, Shepherd Timothy M. Symmetric positive 4th order tensors and their estimation from diffusion weighted MRI. IPMI. 2007;4584:308–319. doi: 10.1007/978-3-540-73273-0_26. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Barmpoutis A, Kumar R, Vemuri BC, Banerjee A. Beyond the Lambertian assumption: A generative model for apparent BRDF fields of faces using anti-symmetric tensor splines. Proceedings of CVPR08: IEEE Conference on Computer Vision and Pattern Recognition; 2008. pp. 1–6. [Google Scholar]

[R8] 8.Barmpoutis A, Vemuri BC, Forder JR. Fast displacement probability profile approximation from hardi using 4th-order tensors. Proceedings of ISBI08: IEEE International Symposium on Biomedical Imaging; 2008. pp. 911–914. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Barmpoutis A, Vemuri BC, Shepherd TM, Forder JR. Tensor splines for interpolation and approximation of DT-MRI with applications to segmentation of isolated rat hippocampi. TMI: Transactions on Medical Imaging. 2007;26:1537–1546. doi: 10.1109/TMI.2007.903195. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Basser PJ, Mattiello J, Lebihan D. Estimation of the Effective Self-Diffusion Tensor from the NMR Spin Echo. J Magn Reson B. 1994;103:247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]

[R11] 11.Bloy L, Verma R. On computing the underlying fiber directions from the diffusion orientation distribution function. In the proceedings of MICCAI; 2008. pp. 1–8. [DOI] [PubMed] [Google Scholar]

[R12] 12.Boyd SP, Vandenberghe L. Convex optimization. Cambridge University Press; 2004. [Google Scholar]

[R13] 13.Cho Kuan-Hung, Yeh Chun-Hung, Tournier Jacques-Donald, Chao Yi-Ping, Chen Jyh-Horng, Lin Ching-Po. Evaluation of the accuracy and angular resolution of q-ball imaging. NeuroImage. 2008;42:262–271. doi: 10.1016/j.neuroimage.2008.03.053. [DOI] [PubMed] [Google Scholar]

[R14] 14.Delaunay B. Sur la sphre vide. Izvestia Akademii Nauk SSSR, Otdelenie Matematicheskikh i Estestvennykh Nauk. 1934;7:793800. [Google Scholar]

[R15] 15.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Apparent diffusion coefficients from high angular resolution diffusion imaging: Estimation and applications. Magnetic Resonance in Medicine. 2006;56:395–410. doi: 10.1002/mrm.20948. [DOI] [PubMed] [Google Scholar]

[R16] 16.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Regularized, fast and robust analytical q-ball imaging. MRM. 2007;58:497–510. doi: 10.1002/mrm.21277. [DOI] [PubMed] [Google Scholar]

[R17] 17.Descoteaux Maxime, Deriche Rachid, Le Bihan Denis, Mangin Jean-Francois, Poupon Cyril. Diffusion propagator imaging: Using laplace’s equation and multiple shell acquisitions to reconstruct the diffusion propagator. IPMI; 2009. pp. 1–13. [DOI] [PubMed] [Google Scholar]

[R18] 18.Fletcher PT, Lu Conglin, Pizer SM, Joshi Sarang. Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Transactions on Medical Imaging. 2004;23:995–1005. doi: 10.1109/TMI.2004.831793. [DOI] [PubMed] [Google Scholar]

[R19] 19.Florack LMJ, Balmachnov Sizykh EG. Two canonical representations for regularized high angular resolution diffusion imaging. MICCAI Workshop on Computational Diffusion MRI; 2008. pp. 94–105. [Google Scholar]

[R20] 20.Ghosh A, Descoteaux M, Deriche R. Riemannian framework for estimating symmetric positive definite 4th order diffusion tensors. Proceedings of MICCAI; 2008. pp. 858–865. [DOI] [PubMed] [Google Scholar]

[R21] 21.Ghosh A, Moakher M, Deriche R. Ternary quartic approach for positive 4th-order diffusion tensors revisited. Proceedings of ISBI; 2009. pp. 618–621. [Google Scholar]

[R22] 22.Ghosh A, Tsigaridas E, Descoteaux M, Comon P, Mourrain B, Deriche R. A polynomial based approach to extract the maxima of an antipodally symmetric spherical function and its application to extract directions from the orientation distribution function in diffusion MRI. Workshop on Computational Diffusion MRI; MICCAI. 2008. [Google Scholar]

[R23] 23.Han D, Qi L, Wu EX. Extreme diffusion values for non-gaussian diffusions. Optimization Methods and Software. 2008;23:703–716. [Google Scholar]

[R24] 24.Hilbert D. Über die darstellung definiter formen als summe von formenquadraten. Math Ann. 1888;32:342–350. [Google Scholar]

[R25] 25.Huber Greg. Gamma function derivation of n-sphere volumes. Am Math Monthly. 1982;89:301–302. [Google Scholar]

[R26] 26.Jensen JH, Helpern JA, Ramani A, Lu H, Kaczynski K. Diffusional kurtosis imaging: The quantification of non-gaussian water diffusion by means of magnetic resonance imaging. MRM. 2005;53:1432–1440. doi: 10.1002/mrm.20508. [DOI] [PubMed] [Google Scholar]

[R27] 27.Jian B, Vemuri BC. Multi-fiber reconstruction from diffusion mri using mixture of wisharts and sparse deconvolution. In the proceedings of IPMI; 2007. pp. 384–395. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Lawson CL, Hanson RJ. Solving Least Squares Problems. Prentice-Hall; 1974. [Google Scholar]

[R29] 29.Lenglet C, Rousson M, Deriche R. DTI segmentation by statistical surface evolution. IEEE Trans Med Imaging. 2006;25:685–700. doi: 10.1109/tmi.2006.873299. [DOI] [PubMed] [Google Scholar]

[R30] 30.Liu C, Acar B, Moseley ME. Characterizing non-gaussian diffusion by using generalized diffusion tensors. Magnetic Resonance in Medicine. 2004;51:924–937. doi: 10.1002/mrm.20071. [DOI] [PubMed] [Google Scholar]

[R31] 31.Mallat S, Zhang Z. Matching pursuits with time-frequency dictionaries. Trans on Signal Processing. 1993;41:3397–2415. [Google Scholar]

[R32] 32.Moakher M. Fourth-order cartesian tensors old and new facts, notions applications. Quarterly Journal of Mechanics and Applied Mathematics. 2008;61:181–203. [Google Scholar]

[R33] 33.Moakher M. The algebra of fourth-order tensors with applications to diffusion MRI. In: Laidlaw D, Weickert J, editors. Visualization and Processing of Tensor Fields. 2009. pp. 57–80. [Google Scholar]

[R34] 34.Moakher M, Norris AN. The closest elastic tensor of arbitrary symmetry to an elasticity tensor of lower symmetry. Journal of Elasticity. 2006;85(3):215–263. [Google Scholar]

[R35] 35.Ozarslan E, Mareci TH. Generalized diffusion tensor imaging and analytical relationships between DTI and HARDI. MRM. 2003;50:955–965. doi: 10.1002/mrm.10596. [DOI] [PubMed] [Google Scholar]

[R36] 36.Ozarslan E, Vemuri BC, Mareci TH. Generalized scalar measures for diffusion MRI using trace, variance, and entropy. Magn Reson Med. 2005;53:866–76. doi: 10.1002/mrm.20411. [DOI] [PubMed] [Google Scholar]

[R37] 37.Pasternak O, Sochen N, Basser PJ. The effect of metric selection on the analysis of diffusion tensor mri data. NeuroImage. 2010;49:2190–2204. doi: 10.1016/j.neuroimage.2009.10.071. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.Pennec X, Fillard P, Ayache N. A Riemannian framework for tensor computing. International Journal of Computer Vision. 2005;65 doi: 10.1007/11566489_116. [DOI] [PubMed] [Google Scholar]

[R39] 39.Persson PO, Strang G. A simple mesh generator in matlab. SIAM Review. 2004;46:329–345. [Google Scholar]

[R40] 40.Petrovic V. Concircular curcature tensor. Pub Inst Math. 1979;25:131–137. [Google Scholar]

[R41] 41.Prckovska V, et al. Optimal acquisition schemes in high angular resolution diffusion weighted imaging. In the proceedings of MICCAI; 2008. pp. 9–17. [DOI] [PubMed] [Google Scholar]

[R42] 42.Qi L, Han D, Wu EX. Principal invariants and inherent parameters of diffusion kurtosis tensors. Journal of Mathematical Analysis and Applications. 2009;349:165–180. [Google Scholar]

[R43] 43.Ramirez-Manzanares A, et al. Diffusion basis functions decomposition for estimating white matter intravoxel fiber geometry. IEEE Transactions on Medical Imaging. 2007;26:1091–1102. doi: 10.1109/TMI.2007.900461. [DOI] [PubMed] [Google Scholar]

[R44] 44.Schultz T, Seidel HP. Estimating crossing fibers: A tensor decomposition approach. IEEE Trans Vis Comput Graph. 2008;14:1635–1642. doi: 10.1109/TVCG.2008.128. [DOI] [PubMed] [Google Scholar]

[R45] 45.Stejskal EO, Tanner JE. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. Journal of Chemical Physics. 1965;42:288–292. [Google Scholar]

[R46] 46.Wang Wei, Gao Jinghuai, Li Kang. Structure-adaptive anisotropic filter with local structure tensors. Intelligent Information Technology Applications, 2007 Workshop on; 2008. pp. 1005–1010. [Google Scholar]

[R47] 47.Wang Z, Vemuri BC. DTI segmentation using an information theoretic tensor dissimilarity measure. IEEE Transactions on Medical Imaging. 2005;24:1267–1277. doi: 10.1109/TMI.2005.854516. [DOI] [PubMed] [Google Scholar]

[R48] 48.Wang Zhizhou, Vemuri Baba C, Chen Yunmei, Mareci Thomas H. A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from complex dwi. IEEE Trans Med Imaging. 2004;23:930–939. doi: 10.1109/TMI.2004.831218. [DOI] [PubMed] [Google Scholar]

[R49] 49.Yassine I, McGraw T. 4th order diffusion tensor interpolation with divergence and curl constrained bezier patches. In Proceedings of ISBI; 2009. pp. 634–637. [Google Scholar]

PERMALINK

APPROXIMATING SYMMETRIC POSITIVE SEMIDEFINITE TENSORS OF EVEN ORDER*

ANGELOS BARMPOUTIS

HO JEFFREY

BABA C VEMURI

Abstract

1. Introduction

Related Work

2. Theory

Proposition 2.1

Proof

Fig. 2.1.

Proposition 2.2

Proof

Proposition 2.3

Proof

Examples

Condition Number θ( ) of

Theorem 2.4

Lemma 2.5

Proof

Lemma 2.6

Proof

Lemma 2.7

Proof

Lemma 2.8

Proof

Lemma 2.9

Proof

Proof

Proposition 2.10

Proof

3. Approximating PSD Tensors of Orders two, four and six

3.1. Preliminaries

Fig. 3.1.

Estimate on K

Estimate on ν

Fig. 3.2.

3.2. Second-Order Tensors

More Precise Estimate

3.3. Fourth-Order Tensors

3.4. Sixth-Order Tensors

4. Experimental Results

Fig. 4.1.

Fig. 4.2.

Fig. 4.3.

Fig. 4.4.

4.1. Application: Diffusion-Weighted MRI

Fig. 4.5.

Fig. 4.6.

Fig. 4.7.

5. Discussion and conclusions

Fig. 3.3.

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

APPROXIMATING SYMMETRIC POSITIVE SEMIDEFINITE TENSORS OF EVEN ORDER^*