Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Feb 9.
Published in final edited form as: Med Image Comput Comput Assist Interv. 2011;14(0 2):182–190. doi: 10.1007/978-3-642-23629-7_23

Probabilistic ODF Estimation from Reduced HARDI Data with Sparse Regularization

Antonio Tristán-Vega 1, Carl-Fredrik Westin 1
PMCID: PMC4747241  NIHMSID: NIHMS754672  PMID: 21995028

Abstract

High Angular Resolution Diffusion Imaging (HARDI) demands a higher amount of data measurements compared to Diffusion Tensor Imaging (DTI), restricting its use in practice. We propose to represent the probabilistic Orientation Distribution Function (ODF) in the frame of Spherical Wavelets (SW), where it is highly sparse. From a reduced subset of measurements (nearly four times less than the standard for HARDI), we pose the estimation as an inverse problem with sparsity regularization. This allows the fast computation of a positive, unit-mass, probabilistic ODF from 14–16 samples, as we show with both synthetic diffusion signals and real HARDI data with typical parameters.

1 Introduction

Diffusion Magnetic Resonance Imaging (MRI) provides an unparalleled tool to probe the connectivity of the nerve fibers within the white matter of the brain in vivo. At the core of this technique is the relationship between the signal E(q), acquired by the MRI scanner when a pulsed gradient encoded by the wave-vector q is applied, and the random process P(R) driving the restricted diffusion of water molecules. Such relationship is modeled as the following Fourier transform [1]:

P(R)=3|E(q)|exp(j2πq·R)dq. (1)

The obvious way to recover P(R) is to sample E(q) and approximate eq. (1) as a discrete Fourier transform, what is known as Diffusion Spectrum Imaging [2]. The main drawback of this technique is the need to sample the entire space of wave-vectors q, since each of them implies the acquisition of a whole MRI volume. Though it has been recently shown [3] how this amount of data can be drastically reduced using Compressed Sensing (CS, [4]), this approach still requires several hundreds of samples to attain a reliable reconstruction.

High Angular Resolution Diffusion Imaging (HARDI, [5]) permits reduction of the measurements to a sampling of the space of orientations with constant ‖q‖. This is achieved at the expense of losing the radial information in P(R), computing some sort of projection or Orientation Distribution Function (ODF) [2]:

Φ(r)=P(Rr)R2dR=18π2rΔE(q)dq. (2)

There are some other alternative definitions for the orientation function (see [6] for an excellent review on this topic), but eq. (2) has the advantage of being a true probability law [7]. HARDI typically requires 60–100 wave-vectors q, but considerations similar to those in [3] allow relaxation of this requirement to a few tens (16–30) [8, 9], whenever a sparse representation of E(q) may be achieved. This condition is met by representing E(q) in a suitable vector frame, namely Spherical Ridgelets (SR), whose components closely resemble the shape of E(q) at different scales. This frame is adequate in [8] since the object of interest is the diffusion signal. Unfortunately, these plate-like functions do not fit the cigar-shaped appearance of the probabilistic ODF, Φ(r), and cannot be expected to provide a sparse representation of it.

In section 2, we postulate Spherical Wavelets [10] as a suitable frame to sparsely represent the Φ(r) in eq. (2), as depicted in Fig. 1. In section 3 we formulate the estimation of the ODF as an inverse problem, by relating the SW functions to their counterparts in the q-space. We propose two alternative solutions based on either ℓ1 or ℓ2 regularization, justifying that non-negative ℓ2 provides sparse representations analogous to ℓ1. Finally, the results in section 4 illustrate how positive, unit-mass, probabilistic ODFs can be recovered from as few as 14 gradient directions. With ℓ2 regularization, the minimization problem can be solved with Newton-like methods, becoming computationally efficient.

Fig. 1.

Fig. 1

Examples of: a) the true Φ(r) when P(R) is Gaussian; b) its corresponding diffusion signal E(q); c) a SW, Ψv,j (r). These wavelets aim to sparely represent signals of the type a), which seems quite reasonable; d) the response Ξ(q) to Ψv,j (r) in the space of measurements; e) a SR, which in [8] aims to sparsely represent signals of the type b). Note this shape prevents the use of SR to represent ODFs like a).

2 Spherical Wavelets to Sparsely Represent Φ(r)

The shape of the ODF in Fig. 1 implies that a combination of a few Spherical Wavelets (SW, [10]) with different widths and orientations may accurately represent Φ(r) for an arbitrary ensemble of diffusion compartments, i.e. the coefficients of Φ(r) in its SW expansion are sparse. The domain of definition of SW is the set Ω ≡ {r ∈ ℝ3 : ‖r‖ = 1} of unit directions (i.e. diffusion gradients). Their definition is founded on the basis of Spherical Harmonics (SH), for which the following summation formula holds:

m=llYlm(u)Ylm(v)=2l+14πPl(u·v), (3)

where Ylm is the real SH basis function of degree l ≥ 0 and order −lml, and Pl is the Legendre polynomial of degree l. Note Pl(u · v) is a rotation of Yl0(u) to an auxiliary system for which the ‘z’ axis is aligned with v. Any symmetric (i.e. f(u) = f(−u)), square-integrable function f𝕃sym2(Ω), can be written as [10]:

f(u)=Ωf(v)l=0,2,2l+14πPl(u·v)dv=Ωf(v)Θv(u)dv, (4)

where Θv(u) is a spherical convolution kernel. To achieve a scale-space representation of f(u), the following scaling kernel Kv,j(u) and semi-discrete frame j=1vΩΨv,j(u), that spans 𝕃sym2(Ω), are introduced in [10]:

Kv,j(u)=l=0,2,2l+14πκρ(2jl)Pl(u·v), (5)
Ψv,j(u)={Kv,0(u),j=1;Kv,j+1(u)Kv,j(u),j0, (6)

where we choose κρ(x) = e−ρx(x + 1). When the resolution j → ∞, κρ (2j l) → 1 for each l, and we have the exact recovery in eq. (4). When j → −∞, κρ (2jl) → 0 for l ≠ 0, and we have the DC component of f(u). This reasoning clarifies the multiscale nature of Kv, j, from coarse (j → −∞) to fine details (j → ∞).

3 ODF Reconstruction from Sparse Representations

From the representation provided by SW as described in the previous section, we aim to estimate the ODF, Φ(r), as the sparsest combination of SW that can explain the data acquired in the q-space, E(q). Therefore, we need to compute the q-space response of each SW defined in r ∈ Ω (section 3.1), and then discretize the problem to account for the discrete nature of the measurements (section 3.2).

3.1 Response of SW Ψv, j(r) in the Space of Wave-Vectors q

We are interested in probabilistic ODF estimators [7], so that the response pursued obeys the following form:

Φ(r)=14π+116π2𝒢{Δbζ(E(q))}(r), (7)

Among the chances provided in [7], we choose ζ(·) = −Ei(−log(·)) as it seems to be a trade-off between angular resolution and noise sensitivity. Also, the unit-mass constraint comes for free with this model, since the Beltrami operator Δb eliminates all DC components so that we keep only the 1/4π factor. We denote by 𝒢 the Funk-Radon Transform (FRT, [5]).

Since SH are eigenfunctions for both 𝒢 and Δb, it is straightforward to prove this property holds also for the rotated versions of Yl0(u), i.e. Pl(u · v). If we insert eqs. (5) and (6) into (7), we get the following duality between the SW, Ψv,j(r), and their counterparts in the q-space, Ξv,j(q) (see Fig. 1):

Ψv,j(r)=l=2,4,2l+14πνρ,j(l)Pl(r·v)Ξv,j(q)=l=2,4,4π(2l+1)l(l+1)νρ,j(l)λlPl(q·v),
νρ,j(l)={κρ(l),j=1;κρ(2j1l)κρ(2jl),j0, (8)

where λl and −l(l + 1) are respectively the eigenvalues for 𝒢 and Δb. Note we drop the DC term l = 0 from the sums, according to our previous statement.

3.2 Optimization Problem with ℓp Regularization

Like in [8], we need a fully discrete version of the frame we use for sparse representation: let {qn}n=1N be the set of wave-vectors in the HARDI data-set; let {rm}m=1M be the set of directions for which the ODF is computed; let {vj(k),k}k=1K be a collection of discretizations of Ω for each resolution level −1 ≤ jJ. The estimation of x ≡ Φ(r) is arranged as the ℓp inverse problem:

minaap,s.t.Aay22<ηwith:x=Ba+14π|An,k=Ξvj(k),k,j(k)(qn);yn=E(qn);Bm,k=Ψvj(k),k,j(k)(rm);xm=Φ(rm). (9)

In brief, we discretize the responses Ξv,j(q) into a N × K sampling matrix A to find that a best describing the N × 1 vector yE(q). These coefficients are used to recover x from the discretized M × K sparsifying matrix B. We deliberately choose the same names used in [4], given the formal similarity of eq. (9) with CS for p = 1. Typically, NK, so that Aa = y is highly underdetermined. This is the reason why a has to be assumed sparse a priori. Though sparsity is only enforced in case p ≤ 1 [4], we also consider here the case p = 2 for the reasons detailed below. Note the ℓ2 problem has a closed-form solution for eq. (9):

a=AT(AAT+τ(η)·IN)1y, (10)

3.3 Positivity Constraints on the Probabilistic ODF

Since Φ(r) is a probability law, it has to be positive for all r. Given a set of directions {rm}m=1M, we add M′ additional linear constraints to eq. (9):

Ba+14π0,Bm,k=Ψvj(k),k,j(k)(rm). (11)

Two important remarks arise here regarding the ℓ2 problem: 1) the solution in eq. (10) no longer holds, but eq. (9) becomes a quadratic program which is still very efficient to solve compared to ℓ1. 2) Non-negative least squares in fact provide sparse solutions [11], so that we may reasonably hypothesize (see [9]) that the solution of the constrained ℓ2 will approach that of the constrained ℓ1.

4 Results

Synthetic Experiments

We generate a two-tensor model simulating two fibers crossing in angles 45°, 60°, or 90°. The results are the average for 300 trials with random rotations of the tensor ensemble, with the diffusion signal generated as:

S(u)=(12exp (buTD1u)+12exp (buTD2u)+σ·nc)2+(σ·ns)2, (12)

for eigenvalues of Dn: [1.7, 0.3, 0.3] · 10−3 mm2/s, b = 2, 000 s/mm2, nc,s ~ 𝒩(0, 1), and Signal to Noise Ratios (SNR): 1/σ =100, 40.

All the methods are coded in MatLab®, using the ℓ1-magic library (http://www.acm.caltech.edu/l1magic/) to solve ℓ1 and quadprog to solve ℓ2.

The parameters are fixed following the guidelines in [8]: a width ρ = 0.75 of the kernel in eq. (5) is chosen by trial and error for best performance, and so it is the error threshold η in eq. (9) (its value depends on the scenario). For the directions r, r′, and q, we choose regular samplings of Ω in all cases, with respective set sizes M = 214, M′ = 50, and N (the number of HARDI measurements) ranging from 12 to 28. For the discretization of the SW frame, we uniformly sample Ω at each level −1 ≤ jJ, taking (2j+1m0 + 1)2 evenly spaced samples, with m0 = 4 and J = 1.

For the sake of comparison, we have adapted the approach in [8] (namely, SR) to interpolate ζ(E), and used eq. (7) to evaluate the probabilistic ODF. We have also adapted the method described in section 3 by alternatively representing the ODF in the basis of SH with l ≤ 4, where Φ(r) is not sparse1. The comparison is based on the relative Mean Squared Error (rMSE) between the recovered ODF and the ground-truth, computed using eq. (7) over a SH fitting (l ≤ 8) of 214 noise-free q-space samples, see [7, Table 2] for details.

Discussion

Numerical results are shown in Fig. 2, using the optimal parameters in each case. It is worth to stress the following remarks:

  • 1)

    With ℓ1 regularization, our approach (SW) provides the lowest reconstruction errors (see Fig. 2.a, top). Though this numeric difference may seem subtle, the ODFs reconstructed in Fig. 2.c suggest it might be critical to correctly resolve the fiber crossings in poor SNR scenarios.

  • 2)

    As the number of gradients acquired increases, the rMSE with SW and SR rapidly decreases, but not with SH. Since the ODFs are not actually sparse in the SH basis (see Fig. 2.b), the a priori sparse behavior does not hold.

  • 3)

    From Fig. 2.a, bottom, our SW approach provides very similar rMSE (and visually identical ODFs, see Fig. 2.c) with either ℓ1 or ℓ2 regularization. As stated before, non-negative ℓ2 yield sparse solutions [11], close to those with ℓ1 in the space of coefficients. We show in Appendix A that SW, as used here, nearly meet the Parseval property: if the ℓ1 and ℓ2 solutions are alike, the Euclidean distance between the corresponding ODFs will also be small. The SR in [8] do not exhibit this feature. Moreover, according to eq (7), a differential operator Δb is applied to compute Φ from ζ(E), so the difference in the reconstruction of ζ(E) (which is not bounded by the distance between the ℓ1 and ℓ2 solutions) is amplified when computing the ODF. Accordingly, the corresponding errors in Fig. 2.a, bottom (ℓ2) for [8] are far larger compared to the ℓ1 case.

  • 4)

    The minimum number of gradients to attain a reliable ODF reconstruction seems to be 14–16 (Fig. 2.a, top and bottom), and below this threshold the rMSE blows. With SW, this range coincides with the minimum number of non-zero coefficients of the solution (see remark 5), so we may hypothesize that a minimum of 3% of the SW coefficients are required to properly describe ODFs.

  • 5)

    Our method, SW, provides also the sparsest solution (Fig. 2.b). As the number of measurements grows, we have more information on the ODF, so the solution may be refined with more SW coefficients and it becomes less sparse.

Fig. 2.

Fig. 2

Results obtained with our approach (SW), our adaptation of [8] to estimate ODFs (SR), and an implementation of our method based on SH. a) Mean and variance of the rMSE for ℓ1 (top row) and ℓ2 (bottom row). b) Sparsity of the solutions for a 60° crossing. c) Examples of the ODFs reconstructed for SNR=40 and a 60° crossing.

In vivo Experiments

A HARDI data set with 8 unweighted T2 baseline images and 16 evenly spaced gradients (b = 3, 000 s/mm2) is used in this section.

Fig. 3 shows an axial slice intersecting the corpus callosum (red lobes aligned with the ‘x’ axis) and the uppermost part of the cingulum (green lobes, ‘y’ axis).

Fig. 3.

Fig. 3

Axial slice of a real data set with 16 gradients. The ODFs field has been recovered with our method. The highlighted regions have been further analyzed with both our method (SW) and the method in [8] (SR), with either ℓ1 or ℓ2 regularization.

We are able to correctly resolve the crossings between these structures in the top row, or even the complex three-compartment crossings in the bottom row (blue lobes along the ‘z’ axis correspond to the corona radiata).

While for ℓ1 the improvement with our method over [8] is not dramatic, the reconstructed ODFs are still sharper (see blue square), according to remark 1.

With ℓ2 regularization, the method in [8] completely fails, while our method outputs accurate ODFs, and even the three-compartment crossings are better resolved than with ℓ1 (see green square).

This is in agreement with remark 3. Our final remark is that:

  • 6)

    1-based reconstruction takes ~200 s, while ℓ2-based takes ~1.5 s, for Fig. 3 (the speedup is ~130), which is an additional argument supporting ℓ2.

5 Conclusion

We have shown that as few as 14–16 diffusion directions suffice to obtain accurate probabilistic ODFs estimates. Positivity constraints, which are not considered in [7], can be easily incorporated into our model. Although sparse regularization has been previously used in [8] to fit the diffusion signal, our approach is, to our knowledge, the very first to use this technique to reconstruct the ODF from HARDI data with reduced measurements. Though we have discussed in the results section a work-around to compute the ODF from the diffusion signal provided by [8], the results with our own approach are clearly better with ℓ1, and the only usable at all with ℓ2. This is specially important, since ℓ2 reconstruction is two orders of magnitude faster than ℓ1: for this latter approach, the overwhelming duration of the estimation for an entire volume (in the order of 100 days with our Matlab implementation) makes the overall acquisition/reconstruction time unattractive. Our ℓ2 approach provides virtually identical results as ℓ1 with reasonable reconstruction times, while allowing an acquisition speedup in the order of 4.

Acknowledgments

Work funded by grant numbers: R01 MH074794, R01 R01MH092862, P41 RR013218 (NIH); TEC2010-17982, FMECD-2010/71131616E (Ministry of Education, Spain/Fulbright Committee).

Appendix

A Parseval Property in the Semi-Discrete SW Frame

A frame is said to be tight or Parseval if the projections of a vector f in each frame element conserve the energy of f. If C is the frame bound, clm the coefficient of f(u) for the SH basis function Ylm(u), and fl(u) the orthogonal projection of f(u) in the subspace of SH of degree l, this condition reads:

j=1Ω|f(u),Ψv,j(u)|2dv=?Cf(u)22 (13)

for the semi-discrete frame of SW. From its definition in eq. (6), eq. (13) reads:

j=1Ωdv|f(u),l=0,2,νρ,j(l)m=llYlm(u)Ylm(v)|2=j=1Ωdv|l=0,2,νρ,j(l)m=llclmYlm(v)|2=j=1l,l=0,2,νρ,j(l)νρ,j(l)m,m=llclmclmYlm,Ylm=j=1l=0,2,νρ,j(l)2m=ll(clm)2=l=0,2,slfl22. (14)

The condition (13) is equivalent to sl=j=1νρ,j(l)2 (see eq. (8)) being constant for all l. Fig. 4 shows the partial sums for the first few l. SW are near Parseval (i.e. the curves approach the same value when j → ∞) if l = 0 is ignored, but this is the DC component, unnecessary for our derivations.

Fig. 4.

Fig. 4

Partial sums k=1Jνρ,k(l)2 for different degrees of l (for l = 0, the curve is a line with constant value 1). We use ρ = 0.75 as in all the experiments throughout the results section (this behavior does not hold for all ρ).

Footnotes

1

We cannot reproduce the respective derivations due to space limits, but they are straightforward by updating eq. (8) using that SH are eigenfunctions of 𝒢, Δb.

Contributor Information

Antonio Tristán-Vega, Email: atriveg@bwh.harvard.edu.

Carl-Fredrik Westin, Email: westin@bwh.harvard.edu.

References

  • 1.Callaghan P. Principles of Nuclear Magnetic Resonance Microscopy. Oxford: Clarendon Press; 1991. [Google Scholar]
  • 2.Wedeen VJ, Hagmann P, Tseng W-YI, Reese TG, Weisskoff RM. Mapping complex tissue architecture with DSI. Mag. Res. Med. 2005;54:1377–1386. doi: 10.1002/mrm.20642. [DOI] [PubMed] [Google Scholar]
  • 3.Merlet S, Deriche R. MICCAI Workshop Comp. Beijing (China): Diffusion MRI; 2010. Sep, Compressed sensing for accelerated EAP recovery in diffusion MRI. [Google Scholar]
  • 4.Donoho DL. Compressed sensing. IEEE Trans. Info. Th. 2006;52(4):1289–1306. [Google Scholar]
  • 5.Tuch DS. Q–Ball imaging. Mag. Res. Med. 2004;52:1358–1372. doi: 10.1002/mrm.20279. [DOI] [PubMed] [Google Scholar]
  • 6.Assemlal H-E, Tschumperlé D, Brun L, Siddiqui K. Recent advances in diffusion MRI modeling: Angular and radial reconstruction. Med. Im. Anal. 2011 doi: 10.1016/j.media.2011.02.002. [DOI] [PubMed] [Google Scholar]
  • 7.Tristán-Vega A, Westin C-F, Aja-Fernández S. A new methodology for the estimation of fiber populations in the white matter of the brain with the Funk-Radon transform. NeuroIm. 2010;49:1301–1315. doi: 10.1016/j.neuroimage.2009.09.070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Michailovich O, Rathi Y. Fast and accurate reconstruction of HARDI data using compressed sensing. In: Jiang T, Navab N, Pluim JPW, Viergever MA, editors. MICCAI. Vol. 6361. Heidelberg: Springer; 2010. pp. 607–614. LNCS. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Jian B, Vemuri BC. Multi-fiber reconstruction from diffusion MRI using mixture of wisharts and sparse deconvolution. In: Karssemeijer N, Lelieveldt B, editors. IPMI. Vol. 4584. Heidelberg: Springer; 2007. pp. 384–395. LNCS. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Freeden W, Schreiner M. Orthogonal and non-orthogonal multirresolution analysis, scale discrete and exact fully discrete wavelet transform on the sphere. Const. Approx. 1998;14:493–515. [Google Scholar]
  • 11.Lawson CL, Hanson RJ. Solving Least Squares Problems. Englewood Cliffs: Prentice-Hall; 1974. [Google Scholar]

RESOURCES