Abstract
Tensors of various orders can be used for modeling physical quantities such as strain and diffusion as well as curvature and other quantities of geometric origin. Depending on the physical properties of the modeled quantity, the estimated tensors are often required to satisfy the positivity constraint, which can be satisfied only with tensors of even order. Although the space of 2mth-order symmetric positive semi-definite tensors is known to be a convex cone, enforcing positivity constraint directly on is usually not straightforward computationally because there is no known analytic description of for m > 1. In this paper, we propose a novel approach for enforcing the positivity constraint on even-order tensors by approximating the cone for the cases 0 < m < 3, and presenting an explicit characterization of the approximation Σ2m ⊂ Ω2m for m ≥ 1, using the subset of semi-definite tensors that can be written as a sum of squares of tensors of order m. Furthermore, we show that this approximation leads to a non-negative linear least-squares (NNLS) optimization problem with the complexity that equals the number of generators in Σ2m. Finally, we experimentally validate the proposed approach and we present an application for computing 2mth-order diffusion tensors from Diffusion Weighted Magnetic Resonance Images.
Keywords: high-order tensors, sum of squares of polynomials, diffusion tensor imaging
1. Introduction
Multi-linear algebra is a generalization of linear algebra and tensors which are multi-linear forms are widely used for modeling various physical quantities commonly encountered in engineering and physics. Elasticity [34], stress, strain and diffusion [10] are some examples. In differential geometry, tensors are used to represent metrics, curvatures [40] and other geometric quantities. In image processing, structure tensors [46] have been used for texture analysis, trifocal tensors in multi-view geometry, etc. The tensors in most of these applications are required to satisfy certain properties. For example, the tensors that approximate the Bidirectional Reflectance Distribution Function (BRDF) [7] are anti-symmetric, while the diffusion [10] and the structure tensors [46] are antipodally symmetric. Furthermore, certain applications demand that the estimated tensors be positive-definite since they model positive-valued physical quantities such as the diffusivity function or the displacement probability of water molecules [8]. In this paper, we are interested in the case of fully symmetric positive-definite tensors of various orders and hence for sake of simplicity, every reference to the term tensor will imply this particular case of tensors unless otherwise stated.
Let
denote the set of mth-order symmetric positive-definite tensors in ℝ3. As is well-known, positivity condition requires the order m to be even. Denote
the closure of
consisting of symmetric positive semi-definite tensors (PSD) in ℝ3. As subsets of the space
of mth-order symmetric tensors,
,
are cones, convex subsets that are invariant under positive scaling [18]. In most applications, the main computational problem can be formulated as data interpolation problem with the domain being
. Specifically, the input data are often in the form {(x1, y1), ···, (xk, yk)} where xi are directions in ℝ3 represented as points on the unit sphere S2, and yi are the values to be interpolated. The interpolation problem requires a non-negative tensor
that interpolates the input data. Formulated as a least-squares problem, it has the form
We note that both the objective function and the domain
are convex, and therefore, the optimization problem above is in fact a convex optimization problem that, in principle, can be solved using existing techniques [12]. However, a formal and significant difficulty of applying these methods is that except for the m = 1 case, there exists no known description of the cone
as it is well-known that the positivity test for polynomials of degree m > 2 is a difficult problem. In the second-order case, the cone
is known to be self-dual in the sense that there exists an inner product < ·, · > on
such that < A, B >≥ 0 for any
. The inner product allows the extension of the usual duality theory using Lagrange multipliers to the cone
, and there is a well-developed theory of semi-definite programming (SDP) [12] that deals with linear objective functions on
.
While the difficulty of providing a complete description of seems to be unsurmountable at this point, the main contribution of this paper is the realization of another formal difficulty that can be overcome relatively easily. A cone C in a vector space is said to be finitely-generated if there exists a finite number of elements v1, ···, vn ∈ C, its generators, such that every element c ∈ C can be written as a non-negative linear combination of the generators
If the cone were finitely generated, the above optimization problem becomes a non-negative linear least-squares (NNLS) problem, with complexity (number of variables) equals to the number of generators. The advantage of solving an NNLS problem is that there are software packages that can efficiently solve NNLS problems containing thousands of variables [28]. While is not finitely-generated, it follows naturally that we can try to approximate with a finitely-generated subcone, and restrict the above optimization problem to the subcone. The restriction can be justified if the subcone can be shown to be a good approximation of .
The second contribution of this paper is an explicit characterization of the approximations for 0 < m < 3, and Σ2m ⊂ Ω2m for m ≥ 1, where Σ2m is a finitely-generated subcone in the respective spaces. More specifically, let Ω2m denote the subcone in consisting of semi-definite tensors that can be written as a sum of squares of tensors of order m. We have the natural inclusions , and our result gives a detailed characterization of the approximation Σ2m ⊂ Ω2m in terms of the geometry of the generators in Σ2m. In particular, for m = 1, 2, it is known that , and our result then gives a detailed characterization of the approximation . Our analysis have shown that, for the lower-order cases m = 1, 2, 3, which are of primary interest here, for a reasonable precision requirement, Ω2m can be approximated by Σ2m containing a few hundreds or at most a few thousands of generators. It follows that the corresponding NNLS problems have the complexity that are well within the capability of currently available NNLS algorithms [28]. We quantitatively validate our method via several experiments, and we also present an application of the proposed technique for estimating the diffusivity function from diffusion-weighted MRI to demonstrate both the efficiency and accuracy of the proposed method.
The rest of this paper is organized as follows: In Sec. 2, we define the finitely-generated subcone Σ2m. We also develop the theory that quantifies the approximation Σ2m ⊂ Ω2m, and the main theorem proved in this section relates the approximation error with the geometry of the generators in Σ2m. Using the theory developed in Sec. 2, in Sec. 3 we explicitly work out the formulas for the number of generators for Σ2m required for a given accuracy requirement. The results show that, up to order-6 and depending on the order, it generally requires at most a few thousands of generators for Σ2m in order to achieve a relative approximation error of less than 10%. Finally, in Sec. 4, we validate our theoretical findings using a set of experiments and we present an application of our method on diffusion-weighted MR datasets.
Related Work
Symmetric positive-definite (SPD) tensors of order-2 have been used in modeling the diffusivity function in the so called Diffusion Tensor MR Imaging (DT-MRI) [10]. SPD matrices can be endowed with a Riemannian metric that is invariant under affine transforms. This metric or its approximations have been employed for estimating and processing diffusion tensor fields [48, 47, 29, 38, 18, 9]. Tensors of 3rd and 5th order can model reflectance distributions with specularities and cast shadows in facial images and have been used for re-lighting in [7]. In general, odd-order tensors are generalizations of the order-1 tensor, which have been commonly used in computer graphics for representing the Lambertian reflectance model. Similarly, 4th, 6th or higher even-order tensors generalize the 2nd-order tensors and have the ability to approximate multi-lobed functions [35, 30, 36] such as the kurtosis of diffusion [26]. In particular, some 4th-order tensors can be expressed as 2nd-order tensors in higher dimensions and their properties have been studied in detail by Moakher in [32, 33]. They however do not span the full space of the higher-order tensors as was shown in the case of order-4 tensors in [6, 5]. In [20], Ghosh et al. used the metric proposed by Moakher in [32, 33] to represent the space of 4th-order SPD tensors using the geometry of 2nd-order SPD tensors in higher dimensions. Recently, an algorithm for imposing positivity constraints on 4th-order tensors using their equivalent ternary quartic polynomial representation was proposed in [6] and this was further developed in [5] and [21, 49].
After estimating a field of high-order tensors, it can be processed using a Finsler metric by appropriately modifying the polynomial equivalent representation of the tensors that satisfy the properties of Finsler geometry [4]. This method can be used for neuronal fiber tracking from high angular resolution diffusion MRI data. Further processing of higher-order tensor fields can be achieved by using the eigenvalue decomposition of matrices which has been extended for the case of high-order tensors in [23]. In this framework, the eigenvalues correspond to the extreme values (minima or maxima) of a tensor and they can be used to extract useful information from the kurtosis tensor [42] as well as the orientation of maximum diffusion [11, 22]. Another method for extracting the principal orientation of diffusion from a higher-order tensor was recently described in [44].
Although, high-order tensors have been employed in most of the aforementioned methods due to their simple polynomial form and their ability to model multi-lobed spherical functions, there are no existing methods for imposing positivity constraints in symmetric tensors of any order higher than two and four. The need to impose positivity constraints becomes essential especially in the case where the tensors approximate positive-valued physical quantities, and it has been shown that imposing the positivity constraint on the tensors approximating the diffusivity function being estimated reduces the approximation errors significantly [5]. Recently, Pasternak et al. [37] also emphasized the importance of enforcing positivity constraints in processing diffusion tensor MR images.
Finally, although Cartesian tensors basis have been widely used for modeling the diffusivity function in DW-MRI, we would like to mention that Spherical Harmonic basis have been employed in approximating other spherical functions involved in DW-MRI processing such as the diffusion propagator. A detailed review of several multi fiber reconstruction methods that employ spherical harmonic basis can be found in the recent article by Descoteaux et al. on Diffusion Propagator Imaging [17]. The orientation distribution function (ODF) is another example of a DW-MRI related spherical function, which can be reconstructed from Q-ball imaging data [16, 13, 3] and was recently done in [2] by using the mathematically correct definition of ODF and deriving a closed form expression for the same. In this article, however, our main focus is on the use of Cartesian tensor basis for parameterizing the diffusivity function in DW-MR datasets.
2. Theory
We will consider symmetric tensors of order m as functions defined on the unit sphere S2 in ℝ3. In particular, symmetric tensors of order m can be identified with homogeneous polynomials of degree m: for a symmetric tensor T of order m, its associated homogeneous polynomial P(x, y, z) is given as
where x = [x y z]⊤. Under this identification,
are homogeneous polynomials of degree m that do not vanish on S2, and similarly,
are degree-m homogeneous polynomials that do not take negative values in ℝ3. Both are now considered as cones in Hm, the set of homogeneous polynomials of degree m. For even degree 2m, let Ω2m denote the subset of
consisting of polynomials that can be written as a sum of squares of polynomials of degree m. Ω2m is clearly a subcone of
for all m ≥ 1, and for m = 1, 2, it is known that
: the m = 1 case follows easily from linear algebra and m = 2 case is the content of Hilbert’s theorem on ternary quartics [24]. For m > 2, however, the inclusion is strict
. In this section, we will describe a general method for approximating Ω2m using a finitely-generated subcone Σ2m in Ω2m, and we will provide a characterization of the approximation error in terms of the geometry of the generators of Σ2m. For the important quadratic and quartic cases m = 1, 2, our result provides an approximation of the full PSD cone
using a finitely-generated subcone Ω2m.
The basic norm used in this paper is the L1-norm over the sphere S2. More specifically, for any P ∈ Hm, its L1-norm ||P ||1 is the integral over S2
That it is indeed a norm follows from the fact that for two homogeneous polynomials P, Q, P = Q as polynomials if and only if ||P − Q||1 = 0. Note that the other norm properties are trivial to prove. For any and a subcone , we define the relative L1-approximation error of P as
| (2.1) |
Proposition 2.1
Let Σ2m be a closed subcone in and .
-
The L1-norm is convex: for any , the function g(q),
is a convex function on .
- For P ≠ 0, EΣ2m(P) = 0 if and only if P ∈ Σ2m. For any s > 0,
Proof
For any ,
and the convexity of the norm on follows. (2) is clear because Σ2m is closed. The invariance of EΣ2m under positive scaling follows readily from the definition.
Let m1(x), ···, md(m)(x) denote the monomials in Hm. (Note that d(m) also equals to the number of symmetric spherical harmonic basis elements, which can be mapped to the monomials in Hm using an one-to-one transformation [35, 15].) The monomials form a basis in Hm that identifies Hm with ℝd(m). We will denote HSm the unit sphere in Hm, consisting of polynomials
such that . The subcone Σ2m will be defined using polynomials in HSm, and this is accomplished through the square map :
Clearly is a smooth map, and if and only if p = ±q. While is not linear, it maps rays in Hm to rays in . The geometry of the map will play a crucial role in our analysis below, and it is quantified by its condition number ηm. First, we define two quantities.
Clearly we have since HSm does not contain the zero polynomial. The two numbers measure the amount of stretching and shrinking does to the sphere HSm. Their ratio gives the condition number η m for
In the following, we will often drop the subscript and denote the condition number simply as η when the degree m in the context is clear. Figure 2.1 illustrates the effect of and its condition number η.
Fig. 2.1.
Left: Comparison between and . Let , r > 0 denote the circle with radius r centered at origin in H2m. is isotropic in the sense that for all p ∈ HS1. , on the other hand, is not isotropic. HS2 is the five-dimensional sphere S5. Its equator can be identified with S4, and the two polynomials form the two poles. Inside the equator, are embedded S1 and S2. maps the poles ±(x2 + y2 + z2) to , and it maps the embedded S1 and S2 to , respectively. The condition number η for on HS2 is 5. Restricting to the equator S4, the condition number improves to 2. Right: Local and non-local approximations. For each p ∈ HSm, Lemma 2.8 approximates p first with the vertices of the simplex containing p. This local approximation is further improved using non-local approximations as the polynomials (pi − pj)2 are approximated by polynomials in HSm that are generally far from p.
Proposition 2.2
and hence η can be determined by evaluating trigonometric integrals.
Proof
Let m1, ···, md(m) denote the d(m) monomials in Hm. A polynomial p ∈ HSm is identified with the vector of coefficients a = [a1, ···, ad(m)]⊤ as p = a1mi + ··· + ad(m)md(m). The L1-norm ||
(p)||1 is the integral of p2 over S2 that can be written as
Let Λm denote the d(m) × d(m) matrix whose components are the integrals ∫S2 mi(x)mj(x) dx, we have
It follows that can be determined as
both of which can be solved once Λm is known using Singular Value Decomposition. The integrals ∫S2 mi(x)mj(x) dx can be computed in closed form since using spherical coordinates, x = sin ψ cos θ, y = sin ψ sin θ, z = cos ψ, each integral is a product of two trigonometric integrals
with exponents b1, b2, b3, b4 depending on mi, mj.
In practice, can be numerically evaluated to any desired accuracy without appealing to the closed-form integral formulas. Next we prove a simple result that partially explains why the linear case m = 1 is substantially easier than the nonlinear cases m > 1.
Proposition 2.3
ηm = 1 if and only if m = 1. That is, is isotropic with respect to the L1-norm in H2.
Proof
The ‘if’ part follows readily from the fact that
and
The matrix Λ1 is therefore diagonal with constant diagonal element , and is isotropic with respect to the L1-norm in H2.
Conversely, for m > 1, let p = xm, q = xm−1y. We show that ||
(p)||1 ≠ ||
(q)||1:
Let , we have
Therefore,
Since for any n ≥ 2, we have
which shows that ||
(p)||1 − ||
(q)||1 ≠ 0 if m > 1. This implies ηm > 1 if m > 1.
Using the square map
, we will define the approximating subcone
by specifying its generators as polynomials in HSm. More specifically, let
= {p1, ···, pk} denote a finite set of k polynomials (points) in HSm. Its associated cone
in H2m is generated by the finite set of generators
: elements in
are non-negative linear combinations of
:
for some a1, ···, ak ≥ 0. It is immediately clear that
for any finite subset
⊂ HSm. Since
, we can restrict points in
to lie in one chosen hemisphere of HSm. For such
, its completion
⊂
is obtained by joining all antipodal points of points in
,
Examples
For m = 1, H1 is ℝ3 and HS1 is S2. If
consists of four points {[1, 0, 0]⊤, [0, 1, 0]⊤, [0, 0, 1]⊤, [
]}, the four polynomials p1, p2, p3, p4 are x, y, z and
, respectively. Elements in
are non-negative combinations of the four polynomials
. More precisely, any
is determined (in this case, uniquely) by four non-negative numbers a1, a2, a3, a4 ≥ 0 such that
For m = 2, H2 can be identified with ℝ6 using the monomial basis {x2, y2, z2, xy, xz, yz} and HS2 is S5. If
consists of three points
where , the three polynomials p1, p2, p3 are λ(x2 + y2), λ(x2 − xy), λ(yz − y2). Any can be written (again uniquely) as
for three non-negative a1, a2, a3.
The inclusion
gives an approximation of Ω2m by
, and it involves two main components: the square map
and the chosen polynomials in
that provide the generators in
through
. The main result of our analysis on the approximation error of
is given in the next theorem, which asserts that the approximation error can be bounded by a product of contributions from both components: the condition number
ηm of
and the condition number
θ(
) of the set
whose definition we now turn to.
Condition Number θ(
) of
We use θ(
) as the measure that quantifies the approximation of any q ∈ HSm, considered as a point on the sphere, by the finite set
. We will use the spherical distance dHSm(p, q) (arc-length in radians) to measure the distance between a pair of points p, q on the sphere HSm, and in particular, dHSm(p, q) is the angle between the two unit vectors p, q in HSm. A set
is said to be good if there is a triangulation of HSm as a simplicial complex
whose vertex set
is the completion
of
. Since HSm has dimension d(m) − 1, the top-dimensional simplexes in
have dimension d(m) − 1 as well. Therefore, for any q ∈ HSm, there is a d(m) − 1-simplex σ ∈
containing q. In particular, we will assume that q can be written as a non-negative linear combination of the vertices of σ: q = a0p0 + ··· + ad(m)−1pd(m)−1 with a0, ···, ad(m)−1 ≥ 0. While this is in general not true for an arbitrary triangulation
of HSm, it is not difficult to show that
can be modified (without changing its underlying abstract simplicial complex) to satisfy this property, e.g., by first defining a triangulation of the vertices in
considered as points in the Euclidean space ℝd(m) using the same abstract simplicial complex as
and radially projecting the simplices onto HSm. For 0 ≤ k ≤ d(m) − 1,
will denote the set of k-simplices in
, and for a k-simplex σ ∈
, its width δ(σ) is defined as the maximal distance between its vertices, p0, · · ·, pk,
For a triangulation
, we define its width to be the maximal width of its top-dimensional simplices:
The condition number of
is then defined as the minimal width of the triangulations
that have
as its vertex set:
Since
is finite, there exists a triangulation Δ(
) whose width gives the condition number θ(
). We note that 0 < θ(
) < π, and for a good set
, the following conditions hold,
For each q ∈ HSm, there are d(m) elements, p0, ···, pd(m)−1, in
such that q = a0p0 + ··· ad(m)−1pd(m)−1 for a0, ···, ad(m)−1 ≥ 0 and dHSm(pi, pj) < θ(
) for any 0 ≤ i, j ≤ d(m).For each q ∈ HSm, there exists p ∈
such that dHSm(q, p) < θ(
).
Property (1) follows immediately from the definition. Property (2) can be shown to follow from the requirement that if q ∈ σ ∈
, q is a non-negative linear combination of vertices in σ.
Theorem 2.4
Let
denote a good finite subset in HSm and
its associated finitely-generated subcone in H2m. Let θ = θ(
) denote the condition number of
as defined above and ηm the condition number of
. Then, for any polynomial r ∈ Ω2m, its L1-relative approximation error
satisfies
The bound above constitutes our quantitative characterization of the approximation
. Not surprisingly, the bound provided above depends on both the map
as well as the set
through θ and η. The error measured by
takes place in H2m, and the bound on the right factored into two components with contribution from θ that essentially measures how well an arbitrary point q ∈ HSm can be approximated using
and its associated triangulation Δ(
). In particular, as will be seen from the proof, tan θ arises from approximating q using its nearest neighbor in
as in Property (2) above while
comes from approximating q using the simplex σ containing it as in Property (1).
We will prove the theorem through a sequence of lemmas given below. However, before delving into the proof, we remark that although using the triangulation Δ(
) to define θ(
) may seem unnecessary at first, it is in fact crucial to have Property (1) in order to produce a smaller bound on the error. For example, it is possible to define θ(
) using only Property (2), i.e., each q ∈ HSm can be approximated by a p ∈
such that dHSm(q, p) < θ(
). However, this hypothesis itself is only strong enough to produce the bound given in Lemma 2.6 (Equation 2.4). Disregarding ηm, the bound given in Equation 2.4 is 2 sin θ, which is considerably inferior to the bound of 4 tan θ sin2 given in Theorem 2.4. In particular, for small θ, the former is approximately 2θ while the latter is θ3 (See Equation 3.1), two order of magnitude less. As will be clear in the proof, the main issue is to approximate the polynomial q2 for any q ∈ HSm with a sum of squares of polynomials in
. Using only Property (2), it is difficult to determine what polynomials in HSm can be used to approximate q2 other than the polynomial p ∈
that is closest to q. With Property (1), we have more choices at our disposal as we can approximate q2 using the vertices pi of the simplex σ that contains q, and more importantly, the remainder of this approximation (sum of (pi − pj)2) can be further approximated using polynomials in
. This is the content of Lemma 2.8. In particular, when approximating q2, Property (1) allows the access of not only the polynomials pi ∈
that are neighbors of q but also polynomials in
that are usually far away from q. See Figure 2.1. Furthermore, as will be detailed in Section 3, Property (1) allows us to formulate a simple method for estimating the minimal number of points (polynomials) in
needed for a given precision requirement.
Lemma 2.5
Let p, q be two polynomials in HSm and θ = dHSm(p, q) denote their geodesic distance considered as points on the sphere HSm. We have
Proof
Let r = p − q. As a vector in Hm, |r| = |p − q|. Using the law of cosines,
| (2.2) |
Therefore, r/γ ∈ HSm, and we have
and the result follows.
Next we prove an important lemma which shows that for two nearby p, q in HSm, we can approximate q2 using p2 such that the L1-approximation error is a fraction (depending on the geodesic distance) of the L1-norm of p2.
Lemma 2.6
Let p, q be two polynomials in HSm and θ = dHSm(p, q) denote their geodesic distance considered as points on the sphere HSm. Let ||
(p) −
(q)||1 denote the L1-difference between
(p),
(q)
| (2.3) |
We have
| (2.4) |
Proof
Using Hölder’s inequality, we have
The proof will proceed to bound the two terms on the right. By the preceding lemma, we have
For the second term, we will consider the polynomial , where γ > 1 ensures that r ∈ HSm. A quick calculation shows that . Since, by definition,
we have
Combining the two inequalities, we have
Since
it follows that
This completes the proof.
We will use the preceding lemma to prove two basic error estimates. For any two points p, q in
, the following lemma provides a bound on the approximation error for points that lie on the arc (geodesic path) joining p, q.
Lemma 2.7
Let p, q be two neighboring points in Δ(
), i.e., there is a 1-simplex σ1 in Δ(
) with p, q as its two vertices. Let r = ap + bq be a convex combination of p, q with a, b ≥ 0 and a + b = 1. If θ = θ(
) denotes the condition number of
, then
Proof
By definition of θ, dHSm(p, q) ≤ θ. Let ϕ = a2p2 + b2q2 + abp2 + abq2 be an element in . We have
Let γ = |p − q| and t = (p − q)/γ ∈ HSm. There exists s ∈
such that the geodesic distance between t and s is less than θ. By the preceding lemma,
Now let φ = ϕ + abγ2s2 be another element in . We have
By Lemma 2.5, . This gives
| (2.5) |
as for a, b ≥ 0 and a + b = 1. We next bound the L1-norm of r2(x). Since r = ap + bq, there exists such that γr ∈ HSm. This implies that
or
| (2.6) |
Combining Equations 2.5 and 2.6 gives the desired result.
The preceding lemma can be generalized immediately to higher-order convex combinations.
Lemma 2.8
Let p1, ···, pk denote the vertices of a k − 1-simplex σk−1 in Δ(
) as well as the corresponding homogeneous polynomials in HSm. Let r = a1p1 + ··· +akpk be a convex combination of p1, ···, pk with a1, ···, ak ≥ 0 and a1 + ··· + ak = 1. If θ denote the condition number of
, Then
Proof
Expanding r2, we have
The second sum contains terms. To approximate r2 using an element , we proceed similarly as before. We start with ϕ equals the first sum above. For each cross-term 2aiajpipj in the second sum, we add to ϕ. This gives
It follows that
Next, we will approximate the squares (pi − pj)2 using elements in
exactly as before. More specifically, let γij = |pi − qj| and tij = (pi − qj)/γij. There exists sij ∈
such that the geodesic distance between tij and sij is less than θ. Now let φ = ϕ + Σi<j aibj(γijsij)2 be an element in
. We have
It follows from Equations 2.2 and 2.4 that all the integrals on the right can be uniformly bounded
and this gives
Since a1 + ··· + ak = 1,
| (2.7) |
as by Cauchy-Schwarz inequality. This yields the bound
| (2.8) |
We next bound the L1-norm of r2. Given that r = a1p1 + ··· akpk, the following lemma shows that the L2-magnitude |r| of the vector r satisfies
Hence, there exists such that γr ∈ HSm. Exactly as before, we have
| (2.9) |
Equations 2.8 and 2.9 together complete the proof.
Lemma 2.9
Let Δ denote a k-simplex in ℝd(m) whose vertices p0, ···, pk are on the unit sphere, i.e., ||p0||2 = ··· = ||pk||2 = 1. If there exists some α such that 1 > α > 0 and for all i ≠ j, then for any x ∈ Δ,
Proof
Let x = a1p1 + ··· akpk with ai ≥ 0 and a1 + ··· + ak = 1. It follows that
Let s = 2Σi<j aiaj and the above inequality becomes x⊤x ≥ 1 − (1 − α)s. From Equation 2.7, we have . It follows that
We remark that when k = 2, and the bound becomes tighter . This gives the cos θ term in Equation 2.9. Finally, we are ready to complete the proof of Theorem 2.4:
Proof
Since r(x) can be written as a sum of squares, by Proposition 2.10, it can be written as a sum of no more than d(m) terms with pi ∈ HSm:
Each pi belongs to a (d(m) − 1)-dimensional simplex σi ∈ Δ(
). By the preceding lemma, each
can be approximated by an element p̃i in
with uniformly bounded relative L1-error
where . Define as
and we have
On the other hand, we also have
Combining both inequalities yields the desired result.
In the proof above we made use of the following proposition.
Proposition 2.10
Let r denote a homogeneous polynomial of degree 2m that can be written as a sum of squares of homogeneous polynomials of degree m. Then, r can be written as a sum of at most d(m) squares
where a1, ···, ad(m) ≥ 0 and p1, ···, pd(m) ∈ HSm.
Proof
Suppose r is a sum of k squares of homogeneous polynomials q̃1, ···, q̃k of degree m
Denote m1, ···, md(m) the d(m) monomials of degree m, and X the vector
whose components are the monomials. It follows that with ai the vector whose components are coefficients of q̃i(x), and
The matrix S is symmetric and positive semi-definite with non-negative eigenvalues. Let λ1, ··· λd(m) denote its complete set of eigenvalues and v1, ···, vd(m) their associated unit eigenvectors, |vi|2 = 1. It follows that
and
where as |vi|2 = 1 for i = 1, ···, d(m).
3. Approximating PSD Tensors of Orders two, four and six
In this section, we apply Theorem 2.4 to derive formulas for the minimal number of generators in
needed to ensure that the approximation
is within a given accuracy requirement. Specifically, the accuracy requirement is presented in the form of the relative L1-approximation error
(cf. Equation 2.1): for 0 < ε < 1, we derive a formula that gives the (approximated) minimal number
(ε, m) of generators in
such that any r ∈ Ω2m can be approximated within ε using
, i.e.,
For PSD ternary tensors of orders two and four, it is known that they can be written as sums of squares of three tensors of order one and two, respectively. This follows from the well-known result that any ternary positive semi-definite homogeneous polynomial p(x) of degree two and four can be written as a sum of three squares of polynomials of degree one and two, respectively. The quadratic case follows easily from linear algebra while the quartic case follows from the celebrated theorem of Hilbert on ternary quartics [24]. We will first describe a general method for obtaining the formula
(ε, m) for any order m, and we will then explicitly work out the three cases m = 1, 2, 3 that are of most interest for various applications.
3.1. Preliminaries
Given a required precision ε > 0, the bound provided by Theorem 2.4 allows us to determine the condition number θ = θ(
) for the point set
in HSm to ensure that the precision requirement is satisfied. The main result in this section is a simple estimate on the number
(ε, m) of points in
needed to achieve the desired θ on the sphere HSm. Let
denote the bound given in Theorem 2.4. Since
Cη(θ) is a monotonically increasing function for , and we will denote its inverse by . fη can be numerically evaluated and the plots for fη over the range 0.01 ≤ ε ≤ 0.1 for several different η-values are shown in Figure 3.1. If θ is assumed to be small,
Fig. 3.1.
Left: Plots of fη for η = 1, 2, 4 in red, blue and green, respectively. ε varies from 0.01 to 0.1 and θ is given in degree. Right: Comparison plot of
(ε, 1) according to Equations 3.7 (in red) and 3.8 (in blue). The estimate using Equation 3.7 is between 17% and 20% less than the estimate using Equation 3.8.
| (3.1) |
Therefore, implies that
| (3.2) |
The formula above gives an estimate on the condition number θ = θ(
) given ε and η. We next give an estimate on the size of
for the given θ(
). Let n = d(m) − 1 denote the dimension of the sphere HSm and Δ(
) denote the triangulation associated with
. A simplex in Δ(
) is said to be θ-regular if the distance between any pair of its vertices equals θ, and the edge joining any pair of vertices is a geodesics on HSm. Due to the curvature on the sphere HSm, it is not possible to cover HSm with only θ-regular simplices. Therefore, we assume that the n-simplices in Δ(
) are approximately θ-regular in the sense that the geodesic distance between any pair of vertices of a n-simplex in HSm is approximately θ and the edge joining them is approximately a geodesic as well. For each vertex v in Δ(
), its degree is the number of n-dimensional simplices having it as a vertex. To estimate the number of points in
, we will estimate two quantities: the number K of n-dimensional simplices in Δ(
) and the average degree ν of the vertices. The number of points in
can then be estimated as
The occurrence of 2 in the denominator accounts for the fact that points in
are located only on a hemisphere.
Estimate on K
Since HSm is covered by a collection of θ-regular n-simplices, K can be estimated by taking the ratio between the volume of the sphere HSm and the volume of a θ-regular n-simplex. Since θ is in general assumed to be small, we will approximate the volume of a θ-regular n-simplex on the sphere HSm with the volume ωn(θ) of a corresponding θ-regular n-simplex in the Euclidean space ℝn:
| (3.3) |
It then follows that the number K of n-simplexes can be estimated as
| (3.4) |
where the volume of the sphere Vn is given by the formula [25]
Estimate on ν
For a typical vertex v in Δ(
), a small neighborhood
around v in HSm is covered by the θ-regular n-simplices having v as one of their vertices. Again, assuming θ is small, we can approximate this using Euclidean geometry, by transforming the neighborhood U onto the tangent space Tv at v using the log map. The geodesic ball Bθ of radius θ on HSm is mapped to the Euclidean ball of radius θ and the image of each n-simplex under the log map can be approximated by a regular n-simplex in the Euclidean space with side length θ. See Figure 3.2. It follows that the degree of v can be estimated as the ratio between the volume of the unit n-dimensional ball and the volume of regular n-simplex in ℝn with side length θ. The volume Vn of an n-ball in ℝn with radius r = 1 is given by the formula [25]
Fig. 3.2.
Left: For small θ, we can approximate the volume of a θ-regular spherical simplex by the volume of a θ-regular Euclidean simplex. The exponential map Expp maps a neighborhood of the origin in the tangent space Tp diffeomorphically onto a neighborhood
at p. Since the derivative of Expp at p is the identity, for small enough θ, Expp is close to an isometry in Bθ. Right: The average degree of a vertex, ν, can be approximated by the number of θ-regular simplexes contained in the ball of radius θ.
The degree ν is then estimated as
| (3.5) |
Combining Equations 3.3, 3.4, 3.5, we have
| (3.6) |
In the remaining section, we will work out the implication of the above estimate for 2nd, 4th and 6th-order tensors.
3.2. Second-Order Tensors
A quadratic homogeneous polynomial P(x, y, z) in ℝ3 has six coefficients P(x, y, z) = ax2 + by2 + cz2 + dxy + exz + fyz. It can be written in a matrix form as,
Positive semi-definiteness of the polynomial P(x, y, z) is equivalent to the positive semi-definiteness of the matrix S. It follows that determining positive semi-definiteness of a homogeneous quadratic polynomial is straightforward by examining eigenvalues of S: S is positive semi-definite if and only its eigenvalues λ1, λ2, λ3 are all non-negative and S can be written as
where vi is the unit eigenvector with eigenvalue λi for i = 1, 2, 3. It follows that P(x, y, z) can be written as a sum of three linear polynomials p1(x), p2(x), p3(x),
with .
With m = 1, the sphere HSm has dimension n = 2. According to Proposition 2.3, the map is isotropic with respect to the L1-norm and η = 1. Equation 3.6 (together with Equation 3.2) then gives
| (3.7) |
More Precise Estimate
For the linear case m = 1, since HSm is the two-sphere S2, its geometry is well-known and a better estimate on N can be obtained. Given θ, S2 is covered by geodesic triangles whose sides have lengths of approximately θ. Approximating the areas of these geodesic triangles with the area of an Euclidean equilateral triangles with side θ gives
. Let F, E, V denote the number of triangles, edges and vertices in the triangulation Δ(
). According to Euler’s formula
where χ(S2) is the Euler characteristic of S2. Since E = 3F/2, V = 2 + F/2 ≈ F/2. This gives ν = 6 as the average degree of a vertex on S2. Our estimate on the degree ν in Equation 3.5 in this case gives , which gives a 20% overestimate.
The area A of a geodesic triangle on S2 with three interior angles α, β, γ is given as [1]
In particular, for a geodesic equilateral triangle on S2 with side length θ, its angle α is given as
and the estimate on the number of triangles is
Let and θ = f(ε) be the solution to the trigonometric equation. It then follows that
| (3.8) |
In Figure 3.1, we compare the two estimates using Equations 3.8 and 3.7. For ε = 0.1, Equation 3.7 gives
≈ 30. And for ε = 0.01 and 0.001, it gives
≈ 130 and 600, respectively. As for Equation 3.8 it gives
≈ 34, 156, 725 for ε = 0.1, 0.01, 0.001, respectively.
3.3. Fourth-Order Tensors
In this case, m = 2 and H2 and HS2 have dimensions six and five, respectively. The map is no longer isotropic with respect to L1-norm in HS2. An analytic evaluation of the matrix Λ2 gives
The singular values of Λ2 arranged in the descending order are
This gives η = 5, and Equation 3.6 gives
For ε = 0.1, this yields N ≈ 176790. However, in H2, the polynomial v(x, y, z) = x2+y2+z2 is the constant function 1 on S2. In particular, , and . The map stretches the constant polynomial considerably more than any other quadratic polynomials, and this is the reason for the large condition number η. Let ℝu denote the one-dimensional subspace in H2 spanned by the constant polynomial u(x, y, z), and W its orthogonal complement,
The intersection of the sphere HS2 with the subspace W is a four-sphere S4. If we specialize to this four-sphere, i.e., polynomials orthogonal to the constant polynomial x2 + y2 + z2, the condition number η becomes 2 and the dimension of the sphere drops by one. Theorem 2.4 then provides the following estimate on the number of points
This number is considerably less than 176790. For example, for ε = 0.1, we have
≈ 1800 and for ε = 0.05, 0.01,
≈ 4670, 39620, respectively.
3.4. Sixth-Order Tensors
In this case, m = 3 and H3, HSm have dimensions 10, 9, respectively. The map is again non-isotropic with respect to L1-norm in H6. The singular values of Λ3 arranged in the descending order are
The condition number η = 16.44, which is quite substantial. However, similar analysis as above can be applied to eliminate polynomials in HS3 coming from polynomials of lower degree to substantially decrease the condition number. First, the three linear polynomials x, y, z are now embedded in H3 as x(x2 + y2 + z2), y(x2 + y2 + z2), z(x2 + y2 + z2). Let r̂(x), ŝ(x), t̂(x), r(x), s(x), t(x) be the following polynomials
The three polynomials r(x), s(x), t(x) are responsible for the three largest singular values of Λ3. The smallest singular value of 4π/105 comes from the polynomial q(x) = xyz. Let W denote the six-dimensional subspace in H3 that is the orthogonal complement of the subspace spanned by r(x), s(x), t(x) and q(x),
The sphere in W is five-dimensional, and the condition number of on S5 is η = 1.2769.
For ε = 0.1, 0.05, 0.01, the result above gives
≈ 1943, 6021, 85495, respectively.
4. Experimental Results
In this section we experimentally validate the proposed theory and at the end of this section we present an application to Diffusion-Weighted MRI. In all the experiments we use tensors in ℝ3, which can be visualized by plotting the corresponding homogeneous polynomial P(x, y, z) as a spherical function (see Fig. 4.1). Such tensor glyphs can be generated by scaling the radius of a unit sphere at orientation x = [x y z]T with the value of P(x, y, z). Additionally, we assign a color to each tensor glyph by using the following coloring scheme: we use the method in [11, 22] to compute the unit vector [x y z]T that maximizes P(x, y, z) and then we assign to the R, G, B color channels the squares of the three components in the vector x (i.e. R = x2, G = y2, B = z2). This color map produces smooth color transitions when visualizing fields of tensors such as the diffusion tensor fields.
Fig. 4.1.
Examples of randomly computed symmetric positive semi-definite tensors in Ω2, Ω4, Ω6. The tensor glyphs are shown.
First, we construct a dataset with samples from Ω2m as follows: we first generate random vectors in ℝd(m) using the normal distribution N(μ = 0, σ2 = 1) in d(m) = 3, 6, and 10 dimensions, and we use them as coefficients of linear, quadratic and cubic homogeneous polynomials p ∈ HS1, HS2, HS3 in three variables, respectively. Then we construct 2nd, 4th and 6th-order positive semi-definite tensors that belong to Ω2m by taking sums of squares of the polynomials in HS1, HS2, HS3, respectively. This process is repeated for 5000 times for each order, producing a dataset of 15000 tensors in total. Several of the generated tensors are shown in Fig. 4.1(right). The primary goal of the aforementioned process is to generate samples from Ω2m in order to test the error analysis presented in Section 3, and it should not be perceived as a DW-MRI simulation as in this section we do not discuss any application of the proposed method to DW-MR imaging.
In order to investigate how many generators in the finitely-generated cone
are necessary for our algorithm to approximate accurately a set of given tensors, we apply our framework to the previously described synthetic dataset using finite subsets
∈ HSm of various sizes N. The sets
are constructed as the vertices computed by triangulating the unit n-sphere. The triangulation is based on a variation of the algorithm for mesh generation presented in [39], which extends to any dimension n of the n-sphere. This method is an iterative force-based technique that uses a force displacement function to move the nodes of the mesh and the Delaunay triangulation [14], which is a fundamental and widely used triangulation process, to adjust the topology (i.e. the edges). Obviously, in our particular case we discard the edge information since we only need the finite set of nodes. This algorithm produces at the end the finite subsets
∈ HSm for different predefined sizes N.
We first use the constructed finite sets C in a numerical framework for approximating the error rate ε achieved by the finitely-generated cone
for m = 1, 2, 3. The numerical calculations were performed by randomly generating points in the n-sphere and testing if each point lies inside or outside the cone
. The error rate ε is the ratio of the points outside the cone over the total number of generated points. For each numerical computation we used 100k points. The numerical approximations are shown as circles in Fig. 4.2. By observing the figures we can see that in most of the cases the numerical approximations are close to the proposed formulas for computing N. We should note that the results are based on the computed sets
using the method in [39]. One may expect that the results will be slightly different if another method is employed for triangulating the n-sphere.
Fig. 4.2.
Comparison of the proposed formulas for computing N for m=1,2,3 with results produced using a numerical approximation algotithm. The horizontal axis show the the accuracy achieved by N finite generators (vertical axis) in the unit n-sphere. The circles show the numerical results produced for specific sets
of various sizes N.
We also use the sets
in a non-negative least squares (NNLS) optimization framework [28] in order to estimate tensors from the finitely-generated cone
that approximate the given 15000 tensors. For each order of tensors, the NNLS system is formulated as Aw = b, where A a matrix constructed from
, w the unknown solution vector and b contained the values of the given positive-semidefinite homogeneous polynomial at K = 81 three-dimensional unit vectors x1 ··· x81 (producing 81 components of b as b1 = P (x1) ··· b81 = P (x81)) for each tensor in the dataset. Although this problem seems extremely unconstrained in general, in our particular case the NNLS algorithm by definition constrains the number of non-zero elements in the solution vector to be at most d(2m), which is significantly smaller than the number of data points K in all of our experiments. In order to estimate such a constrained solution the NNLS algorithm implements a basis selection mechanism that starts with a set of possible basis vectors in
, computes the associated dual vector, and then reselects the basis in the solution by iteratively performing swaps in order to minimize the entries in the dual vector until they are all non positive. In our particular case of m = 1, 2, 3 the estimated unknown non-zero entries are 6, 15, 28 respectively which are all significantly smaller than the number of given samples K = 81. For a detailed description of the NNLS algorithm the reader is referred to [28].
The solutions w provide tensors in that approximate the given tensors in Ω2m, for m = 1, 2, and 3. The computed tensors are compared to the ground truth (given) tensors using the relative L1-error (fitting error):
| (4.1) |
The histograms of the errors found in the experiments (measured by Eq. 4.1) are plotted in Fig. 4.3 for the case of 2nd, 4th, and 6th-order tensors, respectively. Obviously, by increasing
, i.e. the number of generators in the finitely-generated subcone
, the error decreases correspondingly. The table in Fig. 4.3 reports the mean errors for various difference sizes N of the generator set.
Fig. 4.3.
Histograms of tensor fitting errors obtained by our method for the case of 2nd, 4th, and 6th-order tensors respectively, using various sizes N of the set
. The vertical axis corresponds to the percentage of the tensors in the dataset (i.e. number of occurances), and the horizontal axis corresponds to the given fitting error.
The experimental results presented in Fig. 4.3 and Fig. 4.4 validate empirically our method as the results corroborate well with our previous analysis on the number of generators required for a given relative error bound. For 2nd-order tensors, the analysis in Section 3 shows that for the error to be less than ε = 10%, 1%, 0.1%, it requires approximately N ≈ 30, 130 and 600 generators, respectively. The first plot in Fig. 4.3 shows that with N = 45, there are no occurrences of error greater than 10%, and with N = 150, there are no occurrences of error greater than 1%. With 321 generators, the error becomes negligible. For 4th-order tensors, our analysis shows that for the error to be less than ε = 10%, 5%, it requires approximately N ≈ 1800, 4670 generators, respectively. This can be seen from the second plot in Fig. 4.3. With N < 1500 generators, there are occurrences of 10% error, and with N ≥ 1500, there are no occurrences of error greater than 10%. To decrease the error under 5% level, the plot shows that we need at least N = 3000 generators. Finally, for 6th-order tensors, our analysis shows that for the error to be less than ε = 10% and 5%, it requires approximately N ≈ 1943 and 6021 generators, respectively. The third plot in the figure show that at N = 3000, there is only a small percentage of errors greater than 10%, and with N = 6000, there is an even smaller percentage (less than 1%) of errors greater than 5%. In most cases, our earlier analysis underestimate the required numbers of generators, and this is not surprising as these analysis are themselves based on several approximations. Nevertheless, the experimental results do agree in general with the predictions made in Section 3.
Fig. 4.4.
Plots of the running time of our method for fitting one tensor versus the approximation error for the case of 2nd, 4th, and 6th-order tensors, using various sizes N of the set
. The vertical axis corresponds to the obtained mean fitting error, and the horizontal axis corresponds to the execution time.
Figure 4.4 shows the running time of the optimization method for fitting one tensor versus the approximation error for various orders and number of generators N in the set
. The running times are measured using an Intel Pentium Dual CPU at 1.60 GHz and 1GB RAM. The plots demonstrate that the proposed technique can efficiently estimate positive tensors of various orders. More specifically, 2nd, 4th, and 6th-order tensors can be estimated using finitely-generated subcones of size N = 45, N = 900, and N = 6000 at 0.5ms, 12ms, and 243ms, respectively.
4.1. Application: Diffusion-Weighted MRI
Finally, we present an application of the proposed tensor approximation theory to Diffusion-Weighted MRI (DW-MRI). In several DW-MRI processing methods, a diffusion tensor is computed from the acquired diffusion-weighted signals. Negative diffusion values are non-physical; therefore, appropriate methods such as our proposed framework are necessary to ensure positive semi-definiteness of the estimated Diffusion tensors.
In order to demonstrate the necessity for estimating tensors with the positivity constraints, we compare our method with an existing one that computes tensors without the constraints [35]. In this experiment, we use the aforementioned synthetic dataset of 6th-order tensors, and we sample the corresponding homogeneous polynomials using K = 81 3-dimensional unit vectors x1 ··· x81 in the Stejskal-Tanner model [45], producing 81 DW-MRI samples for each tensor in the dataset. Various levels of Rician noise are added to the samples with standard deviations ranging from σ = 0.04 up to σ = 0.12. The noisy datasets are given as inputs to: a) the proposed algorithm (using N = 6000), and b) the method proposed in in [35], which is one of the several existing methods in the literature [15, 19] that estimate 6th-order tensors. For both, the computed 6th – order tensors P(x) are compared to the ground truth tensors using the error defined in Eq. 4.1.
Figure 4.5 shows the comparison of the fitting errors between the two methods for various levels of noise in the data. The results conclusively demonstrate that tensors estimated using positivity constraints approximate the data significantly better than the ones without. We also note that this result agrees with similar comparisons reported earlier for tensors of lower orders (e.g. 4th-order comparison in [5]), showing that the errors incurred in approximating positive-valued functions are significantly smaller when positivity constraints are enforced in the process. Our current results have provided further evidence that supports the importance of imposing positivity constraints in this context.
Fig. 4.5.

Comparison of the 6th-order tensor fitting errors obtained by the proposed method and the technique in [35] for various Rician noise levels in the data.
In order to illustrate the performance of our framework on real data sets, we applied the method to a DW-MRI data set of an excised rat hippocampus (shown in Fig. 4.6). The data set contains 46 images acquired using a pulsed gradient spin echo pulse sequence, with 45 different diffusion gradients and approximate b value of 1250s/mm2. Figure 4.6 shows the computed 6th-order diffusion tensor field. The highlighted regions of interest demonstrate the variability of the estimated structures. At each voxel, the fiber orientations can be estimated from the peaks of the displacement probability, which can be computed from the diffusion tensors as was shown in [5].
Fig. 4.6.
DW-MRI dataset from an isolated rat hippocampus. The image without diffusion weighting (S0) is shown on the top left. The 6th-order diffusion tensors estimated by the proposed method are shown as a field of spherical functions. The three regions of interest depict 6th-order diffusion tensors that model one, two, and three fiber structures.
Finally, Fig. 4.7 presents the results obtained by applying our method to a DW-MRI dataset from an excised rat optic chiasm. The data acquisition protocol was the same as in the rat hippocampus dataset. The computed field of 4th-order diffusion tensors is shown in the center. Using the estimated diffusion tensors, we can compute the underlying fiber orientations by computing the orientations that correspond to the maxima of the water molecule displacement probabilities. The computed fiber orientations are shown on the right and they agree with the known fiber orientations in the optic chiasm. Further quantitative validations of these orientations with respect to those from histology will be performed as part of our future work.
Fig. 4.7.
DW-MRI dataset from an isolated rat optic chiasm. A field of 4th-order diffusion tensors computed by the proposed method is shown in the central plate. The corresponding estimated fiber orientations are shown on the right.
5. Discussion and conclusions
Symmetric positive semi-definite tensors have been used in many applications. Although there are existing methods for imposing positivity constraints on the estimated tensors of order two and four, none of these techniques can be easily extended to higher orders. In this paper, we presented a framework for estimating PSD tensors of any order by approximating the space (cone) of PSD tensors with a finitely-generated subcone Σ2m. We discussed in detail the geometry of the higher-order tensors, and we presented an explicit characterization of the approximation, using the subset of semi-definite tensors that can be written as a sum of squares of tensors of order m. This approximation leads to a non-negative linear least-squares (NNLS) optimization problem, which can be efficiently solved, as it was demonstrated using synthetic datasets and real diffusion-weighted MR images.
An interesting property of the NNLS optimization algorithm is that it produces sparse solution vectors. In our particular case, although the problem seems significantly unconstrained, the solution vector contains at most d(2m) non-zero weights, which corresponds to the rank of the basis matrix. Therefore if the finitely-generated set
contains a few thousands bases, the algorithm will select only 6, 15, 28 for tensors of order 2, 4, and 6 respectively. Note that the number of non-zero weights in the solution vector equals to the number of the unique unknown parameters of the symmetric tensor in each case. The sparsity of NNLS in comparison with other optimization techniques for modeling the diffusion-weighted MR signal has also been studied in [27].
In our experiments the sets
were generated by tessellating the unit n-sphere using the iterative force-based technique in [39]. The vertices produced by this algorithm form the finite subset
∈ HSm for different predefined sizes N. An alternative approach could involve constructing
as a finite dictionary of elements in HSm by running a training algorithm on a control dataset [31]. A finite set of diffusion basis for multi-fiber reconstruction is also employed by the method in [43].
One of the advantages of the proposed algorithm is that it enforces positive semi-definite constraints to the estimated tensors. The need for positivity constraints in DW-MRI has been demonstrated in [6] and [5]. It has been shown that unconstrained methods may yield negative diffusivities in real datasets, especially in voxels with high anisotropy or in the presence of noise in the data.
Finally, although high order tensors can approximate several distinct fiber orientations, in the current standard clinical settings for DW-MRI acquisition most of the multi-fiber reconstruction techniques cannot estimate more than two fiber orientations [41], due to the low diffusion weighting (b-value) and the small number of gradient orientations. However, theoretically or in experimental settings with higher b-values and larger sets of diffusion gradient orientations, the proposed technique can estimate up to 2 and 3 distinct fiber orientations using tensors of order 4 and 6 respectively, which also agrees with the results presented in [35].
Fig. 3.3.
The geometry of the map . Left: HS3 is the nine-dimensional sphere S9. The decomposition of H3 into four subspaces of dimensions of 3, 3, 3, 1 respectively implies that HS3 contains separate copies of sphere S2, S2, S2 and S0. maps these spheres to spheres of radii 52π/83, 68π/699, 8π/105 and 4π/105, respectively. Right: The number of generators in Σ2, Σ4 and Σ6 that can ensure the given accuracy requirement. The plots for m = 1, 2, 3 are in red, blue and green, respectively.
Footnotes
This research was supported by the NIH grant EB007082 & NSF066340 to BCV.
References
- 1.Abramowitz M, Stegun IA. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. New York: Dover; 1972. [Google Scholar]
- 2.Aganj Iman, Lenglet Christophe, Sapiro Guillermo. Odf reconstruction in q-ball imaging with solid angle consideration. ISBI; 2009. pp. 1398–1401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Alexander Daniel C. Maximum entropy spherical deconvolution for diffusion MRI. IPMI; 2005. pp. 76–87. [DOI] [PubMed] [Google Scholar]
- 4.Astola L, Florack L. Finsler geometry on higher order tensor fields and applications to high angular resolution diffusion imaging. Scale Space and Variational Methods in Computer Vision. 2009:224–234. [Google Scholar]
- 5.Barmpoutis A, Hwang MS, Howland D, Forder JR, Vemuri BC. Regularized positive-definite fourth-order tensor field estimation from DW-MRI. NeuroImage. 2009;45:153–162. doi: 10.1016/j.neuroimage.2008.10.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Barmpoutis Angelos, Jian Bing, Vemuri Baba C, Shepherd Timothy M. Symmetric positive 4th order tensors and their estimation from diffusion weighted MRI. IPMI. 2007;4584:308–319. doi: 10.1007/978-3-540-73273-0_26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Barmpoutis A, Kumar R, Vemuri BC, Banerjee A. Beyond the Lambertian assumption: A generative model for apparent BRDF fields of faces using anti-symmetric tensor splines. Proceedings of CVPR08: IEEE Conference on Computer Vision and Pattern Recognition; 2008. pp. 1–6. [Google Scholar]
- 8.Barmpoutis A, Vemuri BC, Forder JR. Fast displacement probability profile approximation from hardi using 4th-order tensors. Proceedings of ISBI08: IEEE International Symposium on Biomedical Imaging; 2008. pp. 911–914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Barmpoutis A, Vemuri BC, Shepherd TM, Forder JR. Tensor splines for interpolation and approximation of DT-MRI with applications to segmentation of isolated rat hippocampi. TMI: Transactions on Medical Imaging. 2007;26:1537–1546. doi: 10.1109/TMI.2007.903195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Basser PJ, Mattiello J, Lebihan D. Estimation of the Effective Self-Diffusion Tensor from the NMR Spin Echo. J Magn Reson B. 1994;103:247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]
- 11.Bloy L, Verma R. On computing the underlying fiber directions from the diffusion orientation distribution function. In the proceedings of MICCAI; 2008. pp. 1–8. [DOI] [PubMed] [Google Scholar]
- 12.Boyd SP, Vandenberghe L. Convex optimization. Cambridge University Press; 2004. [Google Scholar]
- 13.Cho Kuan-Hung, Yeh Chun-Hung, Tournier Jacques-Donald, Chao Yi-Ping, Chen Jyh-Horng, Lin Ching-Po. Evaluation of the accuracy and angular resolution of q-ball imaging. NeuroImage. 2008;42:262–271. doi: 10.1016/j.neuroimage.2008.03.053. [DOI] [PubMed] [Google Scholar]
- 14.Delaunay B. Sur la sphre vide. Izvestia Akademii Nauk SSSR, Otdelenie Matematicheskikh i Estestvennykh Nauk. 1934;7:793800. [Google Scholar]
- 15.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Apparent diffusion coefficients from high angular resolution diffusion imaging: Estimation and applications. Magnetic Resonance in Medicine. 2006;56:395–410. doi: 10.1002/mrm.20948. [DOI] [PubMed] [Google Scholar]
- 16.Descoteaux Maxime, Angelino Elaine, Fitzgibbons Shaun, Deriche Rachid. Regularized, fast and robust analytical q-ball imaging. MRM. 2007;58:497–510. doi: 10.1002/mrm.21277. [DOI] [PubMed] [Google Scholar]
- 17.Descoteaux Maxime, Deriche Rachid, Le Bihan Denis, Mangin Jean-Francois, Poupon Cyril. Diffusion propagator imaging: Using laplace’s equation and multiple shell acquisitions to reconstruct the diffusion propagator. IPMI; 2009. pp. 1–13. [DOI] [PubMed] [Google Scholar]
- 18.Fletcher PT, Lu Conglin, Pizer SM, Joshi Sarang. Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Transactions on Medical Imaging. 2004;23:995–1005. doi: 10.1109/TMI.2004.831793. [DOI] [PubMed] [Google Scholar]
- 19.Florack LMJ, Balmachnov Sizykh EG. Two canonical representations for regularized high angular resolution diffusion imaging. MICCAI Workshop on Computational Diffusion MRI; 2008. pp. 94–105. [Google Scholar]
- 20.Ghosh A, Descoteaux M, Deriche R. Riemannian framework for estimating symmetric positive definite 4th order diffusion tensors. Proceedings of MICCAI; 2008. pp. 858–865. [DOI] [PubMed] [Google Scholar]
- 21.Ghosh A, Moakher M, Deriche R. Ternary quartic approach for positive 4th-order diffusion tensors revisited. Proceedings of ISBI; 2009. pp. 618–621. [Google Scholar]
- 22.Ghosh A, Tsigaridas E, Descoteaux M, Comon P, Mourrain B, Deriche R. A polynomial based approach to extract the maxima of an antipodally symmetric spherical function and its application to extract directions from the orientation distribution function in diffusion MRI. Workshop on Computational Diffusion MRI; MICCAI. 2008. [Google Scholar]
- 23.Han D, Qi L, Wu EX. Extreme diffusion values for non-gaussian diffusions. Optimization Methods and Software. 2008;23:703–716. [Google Scholar]
- 24.Hilbert D. Über die darstellung definiter formen als summe von formenquadraten. Math Ann. 1888;32:342–350. [Google Scholar]
- 25.Huber Greg. Gamma function derivation of n-sphere volumes. Am Math Monthly. 1982;89:301–302. [Google Scholar]
- 26.Jensen JH, Helpern JA, Ramani A, Lu H, Kaczynski K. Diffusional kurtosis imaging: The quantification of non-gaussian water diffusion by means of magnetic resonance imaging. MRM. 2005;53:1432–1440. doi: 10.1002/mrm.20508. [DOI] [PubMed] [Google Scholar]
- 27.Jian B, Vemuri BC. Multi-fiber reconstruction from diffusion mri using mixture of wisharts and sparse deconvolution. In the proceedings of IPMI; 2007. pp. 384–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lawson CL, Hanson RJ. Solving Least Squares Problems. Prentice-Hall; 1974. [Google Scholar]
- 29.Lenglet C, Rousson M, Deriche R. DTI segmentation by statistical surface evolution. IEEE Trans Med Imaging. 2006;25:685–700. doi: 10.1109/tmi.2006.873299. [DOI] [PubMed] [Google Scholar]
- 30.Liu C, Acar B, Moseley ME. Characterizing non-gaussian diffusion by using generalized diffusion tensors. Magnetic Resonance in Medicine. 2004;51:924–937. doi: 10.1002/mrm.20071. [DOI] [PubMed] [Google Scholar]
- 31.Mallat S, Zhang Z. Matching pursuits with time-frequency dictionaries. Trans on Signal Processing. 1993;41:3397–2415. [Google Scholar]
- 32.Moakher M. Fourth-order cartesian tensors old and new facts, notions applications. Quarterly Journal of Mechanics and Applied Mathematics. 2008;61:181–203. [Google Scholar]
- 33.Moakher M. The algebra of fourth-order tensors with applications to diffusion MRI. In: Laidlaw D, Weickert J, editors. Visualization and Processing of Tensor Fields. 2009. pp. 57–80. [Google Scholar]
- 34.Moakher M, Norris AN. The closest elastic tensor of arbitrary symmetry to an elasticity tensor of lower symmetry. Journal of Elasticity. 2006;85(3):215–263. [Google Scholar]
- 35.Ozarslan E, Mareci TH. Generalized diffusion tensor imaging and analytical relationships between DTI and HARDI. MRM. 2003;50:955–965. doi: 10.1002/mrm.10596. [DOI] [PubMed] [Google Scholar]
- 36.Ozarslan E, Vemuri BC, Mareci TH. Generalized scalar measures for diffusion MRI using trace, variance, and entropy. Magn Reson Med. 2005;53:866–76. doi: 10.1002/mrm.20411. [DOI] [PubMed] [Google Scholar]
- 37.Pasternak O, Sochen N, Basser PJ. The effect of metric selection on the analysis of diffusion tensor mri data. NeuroImage. 2010;49:2190–2204. doi: 10.1016/j.neuroimage.2009.10.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Pennec X, Fillard P, Ayache N. A Riemannian framework for tensor computing. International Journal of Computer Vision. 2005;65 doi: 10.1007/11566489_116. [DOI] [PubMed] [Google Scholar]
- 39.Persson PO, Strang G. A simple mesh generator in matlab. SIAM Review. 2004;46:329–345. [Google Scholar]
- 40.Petrovic V. Concircular curcature tensor. Pub Inst Math. 1979;25:131–137. [Google Scholar]
- 41.Prckovska V, et al. Optimal acquisition schemes in high angular resolution diffusion weighted imaging. In the proceedings of MICCAI; 2008. pp. 9–17. [DOI] [PubMed] [Google Scholar]
- 42.Qi L, Han D, Wu EX. Principal invariants and inherent parameters of diffusion kurtosis tensors. Journal of Mathematical Analysis and Applications. 2009;349:165–180. [Google Scholar]
- 43.Ramirez-Manzanares A, et al. Diffusion basis functions decomposition for estimating white matter intravoxel fiber geometry. IEEE Transactions on Medical Imaging. 2007;26:1091–1102. doi: 10.1109/TMI.2007.900461. [DOI] [PubMed] [Google Scholar]
- 44.Schultz T, Seidel HP. Estimating crossing fibers: A tensor decomposition approach. IEEE Trans Vis Comput Graph. 2008;14:1635–1642. doi: 10.1109/TVCG.2008.128. [DOI] [PubMed] [Google Scholar]
- 45.Stejskal EO, Tanner JE. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. Journal of Chemical Physics. 1965;42:288–292. [Google Scholar]
- 46.Wang Wei, Gao Jinghuai, Li Kang. Structure-adaptive anisotropic filter with local structure tensors. Intelligent Information Technology Applications, 2007 Workshop on; 2008. pp. 1005–1010. [Google Scholar]
- 47.Wang Z, Vemuri BC. DTI segmentation using an information theoretic tensor dissimilarity measure. IEEE Transactions on Medical Imaging. 2005;24:1267–1277. doi: 10.1109/TMI.2005.854516. [DOI] [PubMed] [Google Scholar]
- 48.Wang Zhizhou, Vemuri Baba C, Chen Yunmei, Mareci Thomas H. A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from complex dwi. IEEE Trans Med Imaging. 2004;23:930–939. doi: 10.1109/TMI.2004.831218. [DOI] [PubMed] [Google Scholar]
- 49.Yassine I, McGraw T. 4th order diffusion tensor interpolation with divergence and curl constrained bezier patches. In Proceedings of ISBI; 2009. pp. 634–637. [Google Scholar]










