Abstract
Estimating diffusion tensors is an essential step in many applications-such as diffusion tensor image (DTI) registration, segmentation and fiber tractography. Most of the methods proposed in the literature for this task are not simultaneously statistically robust and feature preserving techniques. In this paper, we propose a novel and robust variational framework for simultaneous smoothing and estimation of diffusion tensors from diffusion MRI. Our variational principle makes use of a recently introduced total Kullback-Leibler (tKL) divergence for DTI regularization. tKL is a statistically robust dissimilarity measure for diffusion tensors, and regularization using tKL ensures the symmetric positive definiteness of tensors automatically. Further, the regularization is weighted by a non-local factor adapted from the conventional nonlocal means filters. Finally, for the data fidelity, we use the nonlinear least-squares term derived from the Stejskal-Tanner model. We present experimental results depicting the positive performance of our method in comparison to competing methods on synthetic and real data examples.
Keywords: DTI estimation, variational principle, non-local means, total Kullback-Leibler divergence, diffusion MRI, limited memory quasi-Newton method
1. Introduction
Diffusion weighted magnetic resonance imaging (MRI) is a very popular imaging technique that has been widely applied (Jones, 2010) in recent times. It uses diffusion sensitizing gradients to non-invasively capture the anisotropic properties of the tissue being imaged. Diffusion tensor imaging (DTI) approximates the diffusivity function by a symmetric positive definite tensor of order two (Basser et al., 1994). DTI is an MRI modality that provides information about the movement of water molecules in a tissue. DTI describes the diffusion direction of water molecules in the brain which is associated with the direction of fiber tracts in the white matter. When this movement is hindered by membranes and macromolecules, water diffusion becomes anisotropic. Therefore, in highly structured tissues such as nerve fibers, this anisotropy can be used to characterize local structure of the tissue. Consequently, many applications are based on the estimated diffusion tensor fields, such as registration (Gur and Sochen, 2007; Jia et al., 2011; Yang et al., 2008; Yeo et al., 2009; Wang et al., 2011), segmentation (Descoteaux et al., 2008; Hasan et al., 2007; Liu et al., 2007; Goh and Vidal, 2008; Savadjiev et al., 2008; Lenglet et al., 2006; Motwani et al., 2010; Wang and Vemuri, 2005; Vemuri et al., 2011), atlas construction (Assemlal et al., 2011; Mori et al., 2008; Barmpoutis and Vemuri, 2009; Xie et al., 2010), anatomy modeling (Faugeras et al., 2004), fiber tract related applications (Mori and van Zijl, 2002; Burgela et al., 2006; Durrleman et al., 2011; Lenglet et al., 2009; Savadjiev et al., 2008; Wang et al., 2010, 2012; Zhu et al., 2011) and so on. All of these latter tasks will benefit from the estimation of smooth diffusion tensors.
Estimating the diffusion tensors (DTs) from DWI is a challenging problem, since the DWI data are invariably affected by noise during its acquisition process (Poupon et al., 2008b; Tang et al., 2009; Tristan-Vega and Aja-Fernandez, 2010). Therefore, a robust DTI estimation method which is able to perform feature preserving de-noising is desired. For most of the existing methods, the DTs are estimated using the raw diffusion weighted echo intensity image (DWI). At each voxel of the 3D image lattice, the diffusion signal intensity S is related with its diffusion tensor D ∈ SPD(3) 1 via the Stejskal-Tanner equation (Stejskal and Tanner, 1965)
| (1) |
where S0 is the signal intensity without diffusion, b is the b-value and g is the direction of the diffusion sensitizing gradient.
There are various methods (Salvador et al., 2005; Batchelor et al., 2005; Fillard et al., 2007; Hamarneh and Hradsky, 2007; Pennec et al., 2006; Niethammer et al., 2006; Tang et al., 2009; Tristan-Vega and Aja-Fernandez, 2010; Wang et al., 2004; Vemuri et al., 2001; Wang et al., 2003; Chefd’hotel et al., 2004; Tschumperle and Deriche, 2003, 2005; Mishra et al., 2006; Poupon et al., 2008b; Barmpoutis et al., 2009a; Chang et al., 2005; Mangin et al., 2002) in existing literature, to estimate D from S. A very early one is direct tensor estimation (Westin et al., 2002), which gives an explicit solution for D and S0. Though time efficient, it is sensitive to noise because only 7 gradient directions are used to estimate D and S0. Another method is the minimum recovery error (MRE) estimation or least squares fitting (Basser et al., 1994) which minimizes the error when recovering the DTs from the DWI. MRE is better than direct estimation because it uses more gradient directions, which increases its reliability. However, it does not smooth the DWI or the DTI, and thus it is subject to noise in the input data.
With this in mind, many denoising frameworks (Gilboa et al., 2004; Spira et al., 2007) have been proposed to improve the signal to noise ratio (SNR). Some methods perform denoising on the DWI and then estimate the DTI. Typical approaches to DWI denoising are designed according to the statistical properties of the noise. Most of these approaches assume the noise follows the Rician distribution (Bennett A. Landman, 2007; Piurica et al., 2003; Koay and Basser, 2006; Descoteaux et al., 2008), and when denoising, they use the second order moment of Rician noise (McGibney and Smith, 1993), Maximum Likelihood (ML) (Sijbers and den Dekker, 2004) and Expectation Maximization (EM) approaches (Marzetta, 1995; DeVore et al., 2000), wavelets (Nowak, 1999), anisotropic Wiener filtering (Martin-Fernndez et al., 2007), total variation schemes (McGraw et al., 2004), Markov Random Fields (Zhang et al., 2001), nonparametric neighborhood statistics techniques like non-local means (NLM) (Awate and Whitaker, 2005; Coupe et al., 2008) and unbiased NLM (Manjon et al., 2008; Wiest-Daessle et al., 2008) algorithms, Perona-Malik-like smoothing (Basu et al., 2006) or linear minimum mean square error (LMMSE) scheme (Aja-Fernandez et al., 2008).
Alternatively, some methods first estimate the diffusion tensors from the raw DWI and then perform denoising on the tensor field (Moraga et al., 2007). One representative method is using the NLM framework incorporating a Log-Euclidean metric (Fillard et al., 2005). The drawback of such two-stage processes is that the errors might be accumulated from one stage to the other.
Bearing these deficiencies in mind, researchers developed variational framework (VF) based estimation (Tschumperle and Deriche, 2003; Chefd’hotel et al., 2004; Wang et al., 2003, 2004). These approaches take into account the SPD (symmetric positive definite) constraint on the diffusion tensors. The smoothing in all these approaches involves some kind of weighted averaging over neighborhoods which define the smoothing operators resulting from the variational principles. Some of these smoothing operators are locally defined and do not capture global geometric structure present in the image. Moreover, they are not statistically robust.
To overcome the aforementioned drawbacks, we propose a novel statistically robust variational non-local approach for simultaneous smoothing and tensor estimation from the raw DWI data. This approach combines the variational framework, non-local means and a statistically robust regularizer on the tensor field. The main contributions of this approach are three-fold. First, we use a statistically robust divergence measure total Bregman divergence to regularize the smoothness measure on the tensor field. Combined with the Cholesky decomposition of the diffusion tensors, this automatically ensures the positive definiteness of the estimated diffusion tensors, which overcomes the common problem for many techniques that manually force the tensor to be positive definite or resort to accuracy of finite precision arithmetic leading to the equivalence between testing for positive definiteness and positive semidefiniteness as in (Wang et al., 2003, 2004). Second, it uses an adaptation of the NLM to find the weight for the smoothness regularization terms. This preserves the global structure of the tensor field while denoising. Finally, it achieves simultaneous denoising and DTI estimation, which is able to avoid the error propagation of a two stage approach described earlier. Besides, this method can be easily extended to higher order tensor estimation. We will explain these points at length in the rest of the paper.
The rest of the paper is organized as follows. In Section 2, we introduce our proposed method, followed by the empirical validation in Section 3. Finally we conclude in Section 4.
2. Proposed method
Our proposed integrated variational non-local approach has three components, minimizing the data fidelity error, smoothing over S0 and the tensor field. The proposed model is given by the following equation:
| (2) |
where Ω is the domain of the image, n is the number of diffusion gradients,
(x) is the search window at voxel x, and δ(D, D(y)) is the total Kullback-Leibler (tKL) divergence (Vemuri et al., 2011) between tensors D and D(y) which will be explained in detail later. The first term captures the non-linear data fitting error, the second and third terms are smoothness constraints on S0 and D. α and β are constants balancing the fitting error and the smoothness 2. w1(x, y) and w2(x, y) are the regularization weights for S0 and D. Since S0 and S are linearly related, while D and S are “logarithmically” related, so we use different methods to calculate w1(x, y) and w2(x, y). Note, Si, S0 and D by default represent the values at voxel x, unless specified otherwise.
The discrete case of (2) is
| (3) |
Since most of the time, DTI estimation problems are in discrete case, we will focus on the discrete case in this work.
2.1. Computation of the weights w1(x, y) and w2(x, y)
w1(x, y) and w2(x, y) are the regularization weights of the smoothness terms. If w1(x, y) is large, it requires S0 and S0(y) to be similar. Similarly, if w2(x, y) is large, it requires D and D(y) to be similar. Usually, one requires S0’s and D’s to be respectively similar only if the corresponding diffusion signals are similar. We will compute w1(x, y) and w2(x, y) according to the statistical properties of the diffusion weighted signals. It has been recognized that the diffusion signal S follows the Rician distribution (Piurica et al., 2003; Koay and Basser, 2006; Descoteaux et al., 2008), i.e.
| (4) |
where S̄ is the signal without noise, σ is the variance of the Rician noise. Since S0 is linearly related with S and D is “logarithmically” related with S, we set the regularization weights for S0 and D to be
| (5) |
| (6) |
where Z1 and Z2 are normalizers, h is the filtering parameter (Coupe et al., 2006), and σ is the standard variation of the noise, which is estimated using the first mode of the background (Aja-Fernandez et al., 2009; Tristan-Vega and Aja-Fernandez, 2010).
(x) and
(y) denote the neighborhoods of x and y respectively. The neighborhood of x can be viewed as the voxels around x or a square centered at x with a user defined radius. Furthermore,
where μj and νj are the jth voxels in the neighborhoods
(x) and
(y) respectively, and m is the number of voxels in each neighborhood.
From (5), we can see that if the signal intensities for two voxels are similar, w1(x, y) and w2(x, y) are large. Consequently according to (3), S0 and S0(y), D and D(y) should be similar respectively.
NLM is known for its high accuracy and high computational complexity. To address the computational load problem, we use two methods. One is to decrease the number of computations performed by selecting voxels in the search window, and the other is to make use of parallel computing. Concretely, we will prefilter the voxels in the search window which are not similar to the voxel under consideration if their diffusion weighted signal intensities are not similar. This is specified as
τ1 and τ2 are the thresholds for prefiltering. We set τ1 = 0.1 and τ2 = 10 in our experiments.
In the context of parallel computing, we divide the computations into smaller parts and assign the computations to several processors. Since the smaller parts for NLM are not correlated, thus it can improve the efficiency a lot using parallel computing. In our case, we divide the volumes into 8 subvolumes, and assign each subvolume to one processor, and a desktop with 8 processors is used. This multi-threading technique greatly enhances the efficiency.
2.2. Computation of the tKL Divergence
tKL divergence is a special case of the recently proposed total Breman divergence (tBD) (Liu et al., 2010; Vemuri et al., 2011). This divergence measure is based on the orthogonal distance between the convex generating function of the divergence and its tangent approximation at the second argument of the divergence. The total Bregman divergence δf associated with a real valued strictly convex and differentiable function f defined on a convex set X between points x, y ∈ X is defined as,
| (7) |
where 〈·, ·〉 is inner product, and ||∇f(y)||2 = 〈∇f(y), ∇f(y)〉 generally. tBD has been proven to have the property of being intrinsically robust to noise and outliers. Furthermore, it yields a closed form formula for computing the median (an ℓ1-norm average) for a set of symmetric positive definite tensors. When f(x) = − log x and X is the set of probability density functions (pdf), Eq. (7) becomes the tKL divergence, which is
| (8) |
Motivated by earlier use of tKL divergence as a dissimilarity measure between DTs for DTI segmentation (Vemuri et al., 2011), we use tKL to measure the dissimilarity between tensors and apply it in DTI regularization. It has been shown that the tKL divergence (Vemuri et al., 2011) based ℓ1-norm average, termed by t-center, is invariant to special linear group transformations (denoted by SL(n))3. This is detailed in the following.
Since order-2 SPD tensors can be seen as covariance matrices of zero mean Gaussian pdf (Wang et al., 2004). Let P, Q ∈ SPD(l), then their corresponding pdf are
and the tKL between them is explicitly given by,
where and .
Moreover, the partial minimization of the third term in (3)
leads to the t-center for the set of D(y). The t-center has been well studied in (Vemuri et al., 2011). Given a set of tensors {Qi}, the t-center P* minimizes the ℓ1-norm divergence to all the tensors, i.e.,
| (9) |
and P* is explicitly expressed as
| (10) |
The t-center for a set of DTs is the weighted harmonic mean, which is in closed form. Moreover, the weight is invariant to SL(n) transformations, i.e., ai(Qi) = ai(AT QiA), ∀A ∈ SL(n). The t-center after the transformation becomes
| (11) |
This means that if are transformed by some member of SL(n), the t-center will undergo the same transformation. It was also found that the t-center will be robust to noise in that the weight will be smaller if the tensor has more noise (Vemuri et al., 2011). These properties make it an appropriate tool for DTI applications.
2.3. The SPD Constraint
It is known that if a matrix D ∈ SPD, there exists a unique lower diagonal matrix L with its diagonal values all positive, and D = LLT (Golub and Loan, 1996). This is the well known Cholesky factorization theorem. Wang et al. (Wang et al., 2003, 2004) were the first to use Cholesky factorization to enforce the positive definiteness condition on the estimated smooth diffusion tensors from the DWI data. They use the argument that testing for positive definiteness is equivalent to testing for positive semidefiniteness under finite precision arithmetic and hence their cost function minimization is set on the space of positive semidefinite matrices, which is a closed set that facilitates the existence of a solution within that space. Unlike Wang et al. (Wang et al., 2003, 2004), we use Cholesky decomposition and tKL divergence to regularize the smoothness of the tensor field, and this automatically ensures the diagonal values of L to be positive. This argument is validated as follows.
Substituting D = LLT into (9), we get
| (12) |
Because of using the “log” function in the computation, Eq. (12) automatically requires Liis to be positive, therefore we do not need to manually force the tensor to be SPD. The detailed explanation is given in Appendix A.
2.4. Numerical Solution
In this section, we present the numerical solution to the variational principle (3). The partial derivative equations of (3) with respect to S0 and L can be computed explicitly and are,
| (13) |
where
| (14) |
To solve (13), we use the limited memory quasi-Newton method described in (Nocedal and Wright, 2000). This method is useful for solving large problems with a lot of variables, as is in our case. This method maintains simple and compact approximations of Hessian matrices making them require, as the name suggests, modest storage, besides yielding linear rate of convergence. Specifically, we use linear Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) method (Nocedal and Wright, 2000) to construct the Hessian approximation.
3. Experiments
We evaluated our method on synthetic datasets with various levels of noise, and real datasets, including rat spinal cord datasets and human brain datasets. Based on the estimated tensor fields using the technique presented in this paper, we achieved DTI segmentation for the rat spinal cord datasets, and some preliminary fiber tracking on human brain datasets. However, since the main thrust of this paper is estimation of smooth diffusion tensor fields, the segmentation and fiber tracking results are not presented here.
We compared our method with other state-of-the-art techniques including VF (Tschumperle and Deriche, 2003), NLM (Wiest-Daessle et al., 2008) and LMMSE (Tristan-Vega and Aja-Fernandez, 2010) respectively. We also presented the MRE method for comparison since several software packages (3DSlicer 3.6 4 and fanDTasia (Barmpoutis et al., 2009b)) use this technique due to its simplicity. We implemented VF by ourselves since we did not find any open source versions on the web. For LMMSE, we used the implementation in 3DSlicer 3.6. For NLM, we used existing code 5 for DWI denoising and used our own implementation of the least squares fitting to estimate DTI from the denoised DWI. To ensure fairness, we tuned all the parameters of each method for every experiment, and chose the set of parameters yielding the best results. The visual and numerical results show that our method yields better results than competing methods.
3.1. DTI estimation from synthetic datasets
There are two groups of synthetic datasets. The first one is a 16 × 16 DTI with two homogeneous regions as shown in Fig. 1(a). Each region is a repetition of a tensor, and the two tensors are D1 = [3.3, 1.8, 1.3, 0, 0, 1.2]′ and D2 = [3, 2.2, 3, −1, 0, 0]′ 6. To generate the DWI based on this DTI, we let S0 = 5, b = 1500s/mm2, and g be 22 uniformly-spaced directions on the unit sphere starting from (1, 0, 0). Substituting the DTI, S0, b, g into the Stejskal-Tanner equation, we generate a 16 × 16 × 22 DWI S. One representative slice of S is shown in Fig. 1(b). Then following the method proposed in (Koay and Basser, 2006), we add Rician noise to S and get S̃, using the formula , where nr, ni ~ N(0, σ2). By varying σ, we get different levels of noise and therefore a wide range of .
Figure 1.

(a) Ground truth synthetic DTI field, (b) the original DWI corresponding to g1, (c) the Rician noise affected DWI, the DTI estimation using (d) MRE, (e) VF, (f) LMMSE, (g) NLM, and (h) the proposed technique.
In our experiments, we set α = 0.1 and β = 0.4. The search window size is set to be 25 and the neighborhood size is 9. Fig. 1(c) shows the slice in Fig. 1(b) after adding noise (SNR=60). The estimated DTI from using MRE, VF, NLM, LMMSE and the proposed method are shown in Fig. 1. The figure visually depicts that our method can estimate the tensor field more accurately.
To quantitatively assess the proposed variational unified model, we determine the accuracy of the computed principle eigenvectors of the tensors. Let εθ be the average angle between the principle eigenvectors of the estimated tensor field and the original known tensor field. Besides we compare the difference, denoted as εS0, between the estimated and ground truth S0. The results are shown in Table 1, from which it is evident that our method outperforms others and the significance in performance is more evident at higher noise levels. Even though the accuracy of NLM and our proposed method is very similar at high SNR. however, our method is much more computationally efficient than NLM. The average CPU time taken to converge for our method on a desktop computer with Intel 8 Core 2.8GHz, 24GB of memory, GNU Linux and MAT-LAB (Version 2010a) is 3.51s, whereas, NLM requires 5.26s (note both methods are executed using multi-core processors).
Table 1.
Error in estimated DTI and S0, using different methods, from synthetic DWI with different levels of noise.
| SNR | error | MRE | VF | LMMSE | NLM | proposed | α = 0 | β = 0 |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| 50 | εθ | 0.204 ± 0.064 | 0.107 ± 0.038 | 0.096 ± 0.060 | 0.085 ± 0.054 | 0.082 ± 0.045 | 0.112 ± 0.049 | 0.179 ± 0.062 |
| εS0 | 0.269 ± 0.138 | 0.217 ± 0.106 | 0.208 ± 0.080 | 0.191 ± 0.112 | 0.133 ± 0.103 | 0.171 ± 0.098 | 0.224 ± 0.082 | |
|
| ||||||||
| 40 | εθ | 0.550 ± 0.351 | 0.275 ± 0.312 | 0.203 ± 0.304 | 0.195 ± 0.301 | 0.108 ± 0.102 | 0.169 ± 0.192 | 0.539 ± 0.348 |
| εS0 | 0.584 ± 0.354 | 0.278 ± 0.359 | 0.215 ± 0.320 | 0.216 ± 0.328 | 0.140 ± 0.226 | 0.218 ± 0.225 | 0.542 ± 0.355 | |
|
| ||||||||
| 30 | εθ | 0.588 ± 0.359 | 0.434 ± 0.394 | 0.230 ± 0.363 | 0.217 ± 0.348 | 0.144 ± 0.130 | 0.214 ± 0.251 | 0.570 ± 0.359 |
| εS0 | 0.753 ± 0.441 | 0.485 ± 0.370 | 0.282 ± 0.401 | 0.275 ± 0.352 | 0.173 ± 0.231 | 0.286 ± 0.432 | 0.607 ± 0.451 | |
|
| ||||||||
| 15 | εθ | 0.727 ± 0.561 | 0.504 ± 0.388 | 0.396 ± 0.426 | 0.384 ± 0.422 | 0.218 ± 0.252 | 0.372 ± 0.283 | 0.711 ± 0.577 |
| εS0 | 0.968 ± 0.570 | 0.622 ± 0.517 | 0.479 ± 0.482 | 0.478 ± 0.457 | 0.235 ± 0.272 | 0.481 ± 0.474 | 0.921 ± 0.580 | |
|
| ||||||||
| 7 | εθ | 1.092 ± 0.595 | 0.688 ± 0.656 | 0.460 ± 0.526 | 0.519 ± 0.469 | 0.265 ± 0.276 | 0.457 ± 0.472 | 1.001 ± 0.591 |
| εS0 | 1.334 ± 1.094 | 0.827 ± 0.680 | 0.543 ± 0.583 | 0.562 ± 0.554 | 0.273 ± 0.283 | 0.592 ± 0.57 | 1.306 ± 1.095 | |
We also evaluated the importance of the two regularization terms separately. α = 0 means removing the regularization on S0, while β = 0 means removing the regularization on D. For these two cases, we evaluate the εθ and εS0, and the results are shown in the last two columns of Table 1. The results show that removing either regularization term will increase the DTI estimation error. This implies that both regularization terms are necessary in order to get accurate estimation results.
We also evaluated our method on the 64 × 64 fibercup dataset (Poupon et al., 2008a; Fillard et al., 2011) with a voxel size of 3 × 3 × 3mm3, a b-value of 1500s/mm2 and 130 gradient directions. For the parameter settings in the proposed model, we chose α = 0.1 and β = 0.4. The search window size is 9 and the neighborhood size is 4. We showed the estimated S0, D11, D12, D13, D22, D23, D33, FA and the visualization of the estimated DTI using fanDTasia (Barmpoutis et al., 2009a) in Fig. 2. The results depict that the proposed method can give a well smoothed and feature preserved tensor field.
Figure 2.

From left to right, top to bottom are the estimated S0, D11, D12, D13, D22, D23, D33, FA and the visualization of the estimated DTI using fanDTasia.
3.2. DTI estimation from real datasets
We also did DTI estimation on a 100 × 80 × 32 × 52 3D rat brain DWI. The data was acquired using a PGSE technique with TR=1.5s, TE=28.3ms, bandwidth=35Khz, 52 diffusion weighted images with a b-value of 1334s/mm2.
We compared with several other methods on the DTI estimation, however, to save space, we only show the results of MRE, LMMSE and our proposed method. We present D11, D22, D33, S0, FA, and mean trace for each estimated result. The DTI estimation results of MRE, LMMSE and our proposed method are shown in Fig. 3, 4 and 5 respectively.
Figure 3.
From left to right, top to bottom are D11, D22, D33, S0, FA, and mean trace of the estimated tensor field using MRE on the rat cord data set.
Figure 4.
From left to right, top to bottom are D11, D22, D33, S0, FA, and mean trace of the estimated tensor field using LMMSE on the rat cord data set.
Figure 5.
From left to right, top to bottom are D11, D22, D33, S0, FA, and mean trace of the estimated tensor field using our proposed method on the rat cord data set.
We used a human brain DWI dataset (256 × 256 × 72) provided by Alfred Anwander of the Max Planck Institute for Human Neuroscience (Makuuchi et al., 2009). The DWIs were acquired with a whole-body 3 Tesla Magnetom TRIO operating at 3T (Siemens Medical Solutions) equipped with an 8-channel head array coil. The twice-refocused spin-echo EPI sequence (TR = 12 s, TE = 100 ms) consists of 22 diffusion gradients with a b-value of 800s/mm2.
For the parameters of our model, we chose α = 0.15 and β = 0.45. The search window size is 64 and the neighborhood size is 27. We compared with several other methods on the DTI estimation, however, to save space, we only show the results of MRE, LMMSE and our proposed method. We present D11, D22, D33, S0, FA, and mean trace for each estimated result. The DTI estimation results of MRE, LMMSE and our proposed method are shown in Fig. 6, 7 and 8 respectively. The comparisons indicate that the proposed DTI estimation method generates better results.
Figure 6.
From left to right, top to bottom are D11, D22, D33, S0, FA, and mean trace of the estimated tensor field using MRE on the human brain dataset.
Figure 7.
From left to right, top to bottom are D11, D22, D13, S0, FA, and mean trace of the estimated tensor field using LMMSE on the human brain dataset.
Figure 8.
From left to right, top to bottom are D11, D22, D33, S0, FA, and mean trace of the estimated tensor field using our proposed method on the human brain dataset.
4. Conclusions
We proposed a robust variational non-local means based unified approach for simultaneous denoising and DTI estimation. The proposed method is a combination of a variational framework, non-local means and an intrinsically robust divergence measure to regularize the DTI estimation. In the variational principle, we used non-linear diffusion tensor fitting term, along with a combination of non-local means and the tKL based smoothness measure for denoising. To speed up the NLM method, we performed pre-filtering on the voxels in the search window to reduce the number of computations and made use of parallel computing to distribute the computational load. This variational non-local approach was validated with both synthetic and real datasets and was shown to be more accurate than competing methods in the literature. The results show that our method depicts better noise removal while preserving the structure information even at high levels of noise.
For future work, we plan to develop a GPU-based implementation to further reduce the computation time. After getting a more comprehensive tensor estimation technique, we will utilize it as a preprocessing step in applications to fiber tracking and DTI segmentation.
Highlights.
We use a statistically robust divergence-tBD to regularize the smoothness.
We use NLM as the weight for the regularization terms. This preserves the global structure.
This method enables simultaneous denoising and DTI estimation
Acknowledgments
This work was in part supported by the NIH grant NS066340 to Vemuri, the University of Florida Allumni Fellowship and the INRIA Internships program to Liu, the Association France Parkinson and the French National Research Agency (ANR, Neurodegenerative and Psychiatric Diseases) to Deriche.
We thank Dr. Alfred Anwander of the Max Planck Institute for providing the human brain datasets and Dr. John Forder of the Department of Radiology, University of Florida, McKnight Brain Institute for providing the rat spinal cord datasets.
Appendix A. Automatically ensuring the positivity of diag(L)
Since
and
Let Lii = exp(ai), i = 1, 2, 3, then by solving ai, we can ensure the Lii is positive. Therefore, the positiveness of the diagonal values of L is transferred to solve ai.
Now L is converted to L̂:
| (A.1) |
where “·” is element-wise product, and
Therefore, the partial derivative is in closed form. This can be solved using the L-BFGS method.
Footnotes
SPD(3) represents the space of 3×3 symmetric positive definite matrices
Usually, the noisier the image, larger will α and β be, and vice-versa. The noise in the images can be estimated using any of the popular methods described in (Aja-Fernandez et al., 2009; Tristan-Vega and Aja-Fernandez, 2010)
An n × n matrix A ∈ SL(n) implies det(A) = 1
D is written as [D11,D22,D33,D12,D23,D13]
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Aja-Fernandez S, Alberola-Lpez C, Westin CF. Noise and signal estimation in magnitude MRI and Rician distributed images: A LMMSE approach. IEEE Trans Image Proccess. 2008;17(8):1383–1398. doi: 10.1109/TIP.2008.925382. [DOI] [PubMed] [Google Scholar]
- Aja-Fernandez S, Tristan-Vega A, Alberola-Lopez C. Noise estimation in single- and multiple-coil magnetic resonance data based on statistical models. Magn Res Imag. 2009;27:1397–1409. doi: 10.1016/j.mri.2009.05.025. [DOI] [PubMed] [Google Scholar]
- Assemlal H, Tschumperle D, Brun L, Siddiqi K. Recent advances in diffusion MRI modeling: Angular and radial reconstruction. Med Image Anal. 2011;15(4):369–396. doi: 10.1016/j.media.2011.02.002. [DOI] [PubMed] [Google Scholar]
- Awate S, Whitaker R. Nonparametric neighborhood statistics for MRI denoising. Infor Proc Med Imag. 2005;3565:677–688. doi: 10.1007/11505730_56. [DOI] [PubMed] [Google Scholar]
- Barmpoutis A, Hwang MS, Howland D, Forder JR, Vemuri BC. Regularized positive-definite fourth-order tensor field estimation from DW-MRI. NeuroImage. 2009a;45(1):153–162. doi: 10.1016/j.neuroimage.2008.10.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barmpoutis A, Jian B, Vemuri BC. Adaptive kernels for multi-fiber reconstruction. Infor Proc Med Imag. 2009b:338–349. doi: 10.1007/978-3-642-02498-6_28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barmpoutis A, Vemuri BC. Groupwise registration and atlas construction of 4th-order tensor fields using the Riemannian metric. MICCAI. 2009;1:640–647. [PubMed] [Google Scholar]
- Basser P, Mattiello J, LeBihan D. Estimation of the effective self-diffusion tensor from the NMR spin echo. J Magn Res. 1994;103(3):247–254. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]
- Basu S, Fletcher T, Whitaker R. Rician noise removal in diffusion tensor MRI. MICCAI. 2006:117–125. doi: 10.1007/11866565_15. [DOI] [PubMed] [Google Scholar]
- Batchelor PG, Moakher M, Atkinson D, Calamante F, Connelly A. A rigorous framework for diffusion tensor calculus. Magn Reson Med. 2005;53(1):221–225. doi: 10.1002/mrm.20334. [DOI] [PubMed] [Google Scholar]
- Bennett A, Landman P-LBJLP. Diffusion tensor estimation by maximizing Rician likelihood. IEEE ICCV. 2007:1–8. doi: 10.1109/iccv.2007.4409140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burgela U, Amuntsb K, Hoemkec L, Mohlbergc H, Gilsbacha JM, Zilles K. White matter fiber tracts of the human brain: Three-dimensional mapping at microscopic resolution, topography and intersubject variability. NeuroImage. 2006;29(4):1092–1105. doi: 10.1016/j.neuroimage.2005.08.040. [DOI] [PubMed] [Google Scholar]
- Chang LC, Jones DK, Pierpaoli C. RESTORE: Robust estimation of tensors by outlier rejection. Magn Reson Med. 2005;53:1088–1095. doi: 10.1002/mrm.20426. [DOI] [PubMed] [Google Scholar]
- Chefd’hotel C, Tschumperle D, Deriche R, Faugeras O. Regularizing flows for constrained matrix valued images. J Mathematical Imag Vision. 2004;20(1–2):147–162. [Google Scholar]
- Coupe P, Yger P, Barillot C. Fast non local means denoising for 3d MR images. MICCAI. 2006:33–40. doi: 10.1007/11866763_5. [DOI] [PubMed] [Google Scholar]
- Coupe P, Yger P, Prima S, Hellier P, Kervrann C, Barillot C. An optimized blockwise non local means denoising filter for 3d magnetic resonance images. IEEE Trans Med Imag. 2008;27(4):425–441. doi: 10.1109/TMI.2007.906087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Descoteaux M, Wiest-Daessl N, Prima S, Barillot C, Deriche R. Impact of Rician adapted non-local means filtering on HARDI. MICCAI. 2008:122–130. doi: 10.1007/978-3-540-85990-1_15. [DOI] [PubMed] [Google Scholar]
- DeVore MD, Lanterman AD, Sullivan OJA. ATR performance of a Rician model for SAR images. Proc SPIE. 2000:34–37. [Google Scholar]
- Durrleman S, Fillard P, Pennec X, Trouve A, Ayache N. Registration, atlas estimation and variability analysis of white matter fiber bundles modeled as currents. NeuroImage. 2011 doi: 10.1016/j.neuroimage.2010.11.056. [DOI] [PubMed] [Google Scholar]
- Faugeras O, Adde G, Charpiat G, Chefd’Hotel C, Clerc M, Deneux T, Deriche R, Hermosillo G, Keriven R, Kornprobst P, Kybic J, Lenglet C, Lopez-Perez L, Papadopoulo T, Pons J, Segonne F, Thirion B, Tschumperlé D, Viéville T, Wotawa N. Variational, geometric and statistical methods for modeling brain anatomy and function. NeuroImage. 2004;23(1):S46–S55. doi: 10.1016/j.neuroimage.2004.07.015. [DOI] [PubMed] [Google Scholar]
- Fillard P, Arsigny V, Pennec X, Thompson PM, Ayache N. Extrapolation of sparse tensor fields: Application to the modeling of brain variability. Infor Proc Med Imag. 2005;19:27–38. doi: 10.1007/11505730_3. [DOI] [PubMed] [Google Scholar]
- Fillard P, Descoteaux M, Goh A, Gouttard S, Jeurissen B, Malcolm J, Ramirez-Manzanares A, Reisert M, Sakaie K, Tensaouti F, Yo T, Mangin JF, Poupon C. Quantitative evaluation of 10 tractography algorithms on a realistic diffusion MR phantom. NeuroImage. 2011;56(1):220–234. doi: 10.1016/j.neuroimage.2011.01.032. [DOI] [PubMed] [Google Scholar]
- Fillard P, Pennec X, Arsigny V, Ayache N. Clinical DT-MRI estimation, smoothing, and fiber tracking with Log-Euclidean metrics. IEEE Trans Med Imag. 2007;26(11):1472–1482. doi: 10.1109/TMI.2007.899173. [DOI] [PubMed] [Google Scholar]
- Gilboa G, Sochen N, Zeevi YY. Image enhancement and denoising by complex diffusion processes. IEEE Trans on PAMI. 2004;26(8):1020–1036. doi: 10.1109/TPAMI.2004.47. [DOI] [PubMed] [Google Scholar]
- Goh A, Vidal R. Segmenting fiber bundles in diffusion tensor images. ECCV. 2008;24(10):238–250. [Google Scholar]
- Golub GH, Loan CFV. Matrix Computations. JHU Press; 1996. [Google Scholar]
- Gur Y, Sochen N. Fast invariant riemannian DT-MRI regularization. ICCV. 2007:1–7. [Google Scholar]
- Hamarneh G, Hradsky J. Bilateral filtering of diffusion tensor magnetic resonance images. IEEE Trans Im Proc. 2007;16:2463–2475. doi: 10.1109/tip.2007.904964. [DOI] [PubMed] [Google Scholar]
- Hasan KM, Halphen C, Sankar A, Eluvathingal TJ, Kramer L, Stuebing KK, Ewing-Cobbs L, Fletcher JM. Diffusion tensor imaging based tissue segmentation: Validation and application to the developing child and adolescent brain. NeuroImage. 2007;34(4):1497–1505. doi: 10.1016/j.neuroimage.2006.10.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jia H, Yap P, Wu G, Wang Q, Shen D. Intermediate templates guided groupwise registration of diffusion tensor images. NeuroImage. 2011;54:928–939. doi: 10.1016/j.neuroimage.2010.09.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones DK. Diffusion MRI Theory, Methods, and Applications. Oxford University Press; 2010. [Google Scholar]
- Koay CG, Basser PJ. Analytically exact correction scheme for signal extraction from noisy magnitude MR signals. J Magn Res. 2006;179(2):317–322. doi: 10.1016/j.jmr.2006.01.016. [DOI] [PubMed] [Google Scholar]
- Lenglet C, Prados E, Pons J, Deriche R, Faugeras O. Brain connectivity mapping using Riemannian geometry, control theory and PDEs. SIAM J on Imag Sciences. 2009;2(2):285–322. [Google Scholar]
- Lenglet C, Rousson M, Deriche R. DTI segmentation by statistical surface evolution. IEEE Trans Med Imag. 2006;25(6):685–700. doi: 10.1109/tmi.2006.873299. [DOI] [PubMed] [Google Scholar]
- Liu M, Vemuri BC, Amari S-I, Nielsen F. Total bregman divergence and its applications to shape retrieval. IEEE CVPR. 2010:3463–3468. doi: 10.1109/CVPR.2010.5539979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu T, Li H, Wong K, Tarokh A, Guo L, Wong ST. Brain tissue segmentation based on DTI data. NeuroImage. 2007;38(1):114–123. doi: 10.1016/j.neuroimage.2007.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Makuuchi M, Bahlmann J, Anwander A, Friederici AD. Segregating the core computational faculty of human language from working memory. Proc National Acad Sciences. 2009;106(20):8362–8367. doi: 10.1073/pnas.0810928106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mangin JF, Poupon C, Clark C, Le Bihan D, Bloch I. Distortion correction and robust tensor estimation for MR diffusion imaging. Med Image Anal. 2002;6(3):191–198. doi: 10.1016/s1361-8415(02)00079-8. [DOI] [PubMed] [Google Scholar]
- Manjon JV, Carbonell-Caballero J, Lull JJ, García-Marti G, Marti-Bonmati L, Robles M. MRI denoising using non-local means. Med Image Anal. 2008;12:514–523. doi: 10.1016/j.media.2008.02.004. [DOI] [PubMed] [Google Scholar]
- Martin-Fernndez M, Alberola-Lpez C, Ruiz-Alzola J, Westin CF. Sequential anisotropic wiener filtering applied to 3D MRI data. Magn Res Imag. 2007;25:278–292. doi: 10.1016/j.mri.2006.05.001. [DOI] [PubMed] [Google Scholar]
- Marzetta T. EM algorithm for estimating the parameters of multivariate complex Rician density for polarimetric SAR. Procs of ICASSP. 1995;5:3651–3654. [Google Scholar]
- McGibney G, Smith M. Unbiased signal to noise ratio measure for magentic resonance images. Med Phys. 1993;20(4):1077–1078. doi: 10.1118/1.597004. [DOI] [PubMed] [Google Scholar]
- McGraw T, Vemuri BC, Chen Y, Rao M, Mareci T. DT-MRI denoising and neuronal fiber tracking. Med Imag Anal. 2004;8:95–111. doi: 10.1016/j.media.2003.12.001. [DOI] [PubMed] [Google Scholar]
- Mishra A, Lu Y, Meng J, Anderson AW, Ding Z. Unified framework for anisotropic interpolation and smoothing of diffusion tensor images. NeuroImage. 2006;31(4):1525–1535. doi: 10.1016/j.neuroimage.2006.02.031. [DOI] [PubMed] [Google Scholar]
- Moraga CC, Lenglet C, Deriche R, Ruiz-Alzola J. A Riemannian approach to anisotropic filtering of tensor fields. Signal Processing. 2007;87(2):217–352. [Google Scholar]
- Mori S, Oishi K, Jiang H, Jiang L, Li X, Akhter K, Hua K, Faria A, Mahmood A, Woods R, Toga A, Pike G, Neto P, Evans A, Zhang J, Huang H, Miller M, van Zijl P, Mazziotta J. Stereotaxic white matter atlas based on diffusion tensor imaging in an ICBM template. NeuroImage. 2008;40(2):570–582. doi: 10.1016/j.neuroimage.2007.12.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mori S, van Zijl PC. Fiber tracking: principles and strategies - a technical review. NMR Biomed. 2002;15(7):464–480. doi: 10.1002/nbm.781. [DOI] [PubMed] [Google Scholar]
- Motwani K, Adluru N, Hinrichs C, Alexander AL, Singh V. Epitome driven 3-D diffusion tensor image segmentation: on extracting specific structures. NIPS. 2010:1696–1704. [PMC free article] [PubMed] [Google Scholar]
- Niethammer M, Estepar RSJ, Bouix S, Shenton M, Westin CF. On diffusion tensor estimation. IEEE Eng Med Biol Soc. 2006;1:2622–2625. doi: 10.1109/IEMBS.2006.259826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nocedal J, Wright SJ. Numerical Optimization. Springer; 2000. [Google Scholar]
- Nowak R. Wavelet-based Rician noise removal for magnetic resonance imaging. IEEE Trans Im Proc. 1999;8(10):1408–1419. doi: 10.1109/83.791966. [DOI] [PubMed] [Google Scholar]
- Pennec X, Fillard P, Ayache N. A Riemannian framework for tensor computing. Int J Comp Vision. 2006;66(1):41–66. [Google Scholar]
- Piurica A, Philips W, Lemahieu I, Acheroy M. A versatile wavelet domain noise filtration technique for medical imaging. IEEE Trans Med Imag. 2003;22(3):323–331. doi: 10.1109/TMI.2003.809588. [DOI] [PubMed] [Google Scholar]
- Poupon C, Rieul B, Kezele I, Perrin M, Poupon F, Mangin J. New diffusion phantoms dedicated to the study and validation of high-angular-resolution diffusion imaging (HARDI) models. Magn Reson Med. 2008a;60:1276–1283. doi: 10.1002/mrm.21789. [DOI] [PubMed] [Google Scholar]
- Poupon C, Roche A, Dubois J, Mangin JF, Poupon F. Real-time MR diffusion tensor and Q-ball imaging using Kalman filtering. Med Image Anal. 2008b;12:527–534. doi: 10.1016/j.media.2008.06.004. [DOI] [PubMed] [Google Scholar]
- Salvador R, Pena A, Menon DK, Carpenter TA, Pickard JD, Bullmore ET. Formal characterization and extension of the linearized diffusion tensor model. Human Brain Mapping. 2005;24(2):144–155. doi: 10.1002/hbm.20076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savadjiev P, Campbell JSW, Descoteaux M, Deriche R, Pike GB, Siddiqi K. Labeling of ambiguous subvoxel fibre bundle configurations in high angular resolution diffusion MRI. NeuroImage. 2008;41(1):58–68. doi: 10.1016/j.neuroimage.2008.01.028. [DOI] [PubMed] [Google Scholar]
- Sijbers J, den Dekker A. Maximum likelihood estimation of signal amplitude and noise variance from MR data. Magn Res Imag. 2004;51(3):586–594. doi: 10.1002/mrm.10728. [DOI] [PubMed] [Google Scholar]
- Spira A, Kimmel R, Sochen N. A short-time Beltrami kernel for smoothing images and manifolds. IEEE Trans Im Proc. 2007;16(6):1628–1636. doi: 10.1109/tip.2007.894253. [DOI] [PubMed] [Google Scholar]
- Stejskal EO, Tanner JE. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. J Chem Phys. 1965;42:288–292. [Google Scholar]
- Tang S, Fan Y, Zhu H, Yap P, Gao W, Lin W, Shen D. Regularization of diffusion tensor field using coupled robust anisotropic diffusion. MMBIA. 2009:52–57. [Google Scholar]
- Tristan-Vega A, Aja-Fernandez S. DWI filtering using joint information for DTI and HARDI. Med Image Anal. 2010;14(2):205–218. doi: 10.1016/j.media.2009.11.001. [DOI] [PubMed] [Google Scholar]
- Tschumperle D, Deriche R. Variational frameworks for DT-MRI estimation, regularization and visualization. IEEE ICCV. 2003;1:116–121. [Google Scholar]
- Tschumperle D, Deriche R. Vector-valued image regularization with PDEs: A common framework for different applications. IEEE Trans on PAMI. 2005;27(4):506–517. doi: 10.1109/TPAMI.2005.87. [DOI] [PubMed] [Google Scholar]
- Vemuri BC, Chen Y, Rao M, McGraw T, Wang Z, Mareci T. Fiber tract mapping from diffusion tensor MRI. Proceedings IEEE Workshop on Variational and Level Set Methods in Computer Vision (VLSM) 2001:81–88. [Google Scholar]
- Vemuri BC, Liu M, Amari SI, Nielsen F. Total Bregman divergence and its applications to DTI analysis. IEEE Trans Med Imag. 2011;30(2):475–483. doi: 10.1109/TMI.2010.2086464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Q, Yap P, Wu G, Shen D. Application of neuroanatomical features to tractography clustering. Human Brain Mapping. 2012 doi: 10.1002/hbm.22051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y, Gupta A, Liu Z, Zhang H, Escolar ML, Gilmore JH, Gouttard S, Fillard P, Maltbie E, Gerig G, Styner M. DTI registration in atlas based fiber analysis of infantile Krabbe disease. NeuroImage. 2011;55(4):1577–1586. doi: 10.1016/j.neuroimage.2011.01.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y, Zhang J, Gutman B, Chan TF, Becker JT, Aizenstein HJ, Lopez OL, Tamburo RJ, Toga AW, Thompson PM. Multivariate tensor-based morphometry on surfaces: application to mapping ventricular abnormalities in HIV/AIDS. NeuroImage. 2010;49(3):2141–2157. doi: 10.1016/j.neuroimage.2009.10.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z, Vemuri BC. DTI segmentation using an information theoretic tensor dissimilarity measure. IEEE Trans Med Imag. 2005;24(10):1267–1277. doi: 10.1109/TMI.2005.854516. [DOI] [PubMed] [Google Scholar]
- Wang Z, Vemuri BC, Chen Y, Mareci T. A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from DWI. Infor Proc Med Imag 2732/2003. 2003:660–671. doi: 10.1007/978-3-540-45087-0_55. [DOI] [PubMed] [Google Scholar]
- Wang Z, Vemuri BC, Chen Y, Mareci TH. A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from complex DWI. IEEE Trans Med Imag. 2004;23:930–939. doi: 10.1109/TMI.2004.831218. [DOI] [PubMed] [Google Scholar]
- Westin CF, Maier S, Mamata H, Nabavi A, Jolesz F, Kikinis R. Processing and visualization for diffusion tensor MRI. Med Image Anal. 2002;6:93–108. doi: 10.1016/s1361-8415(02)00053-1. [DOI] [PubMed] [Google Scholar]
- Wiest-Daessle N, Prima S, Coupé P, Morrissey SP, Barillot C. Rician noise removal by non-local means filtering for low signal-to-noise ratio MRI: Applications to DT-MRI. MICCAI. 2008:171–179. doi: 10.1007/978-3-540-85990-1_21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xie Y, Vemuri BC, Ho J. Statistical analysis of tensor fields. MICCAI. 2010;1:682–689. doi: 10.1007/978-3-642-15705-9_83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang J, Shen D, Davatzikos C, Verma R. Diffusion tensor image registration using tensor geometry and orientation features. MICCAI. 2008:905–913. doi: 10.1007/978-3-540-85990-1_109. [DOI] [PubMed] [Google Scholar]
- Yeo B, Vercauteren T, Fillard P, Peyrat JM, Pennec X, Golland P, Ayache N, Clatz O. DT-REFinD: Diffusion tensor registration with exact finite-strain differential. IEEE Tran Med Imag. 2009;28(12):1914–1928. doi: 10.1109/TMI.2009.2025654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y, Brady M, Smith S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans Med Imag. 2001;2(1):45–57. doi: 10.1109/42.906424. [DOI] [PubMed] [Google Scholar]
- Zhu H, Kong L, Li R, Styner M, Gerig G, Lin W, Gilmore JH. FADTTS: Functional analysis of diffusion tensor tract statistics. NeuroImage. 2011;56(3):1412–1425. doi: 10.1016/j.neuroimage.2011.01.075. [DOI] [PMC free article] [PubMed] [Google Scholar]






