Abstract
In this paper, we present a method based on sparse non-negative matrix factorization (NMF) for brain tissue segmentation using diffusion MRI (DMRI) data. Unlike existing NMF-based approaches, in our method NMF is applied to the spherical mean data, computed on a per-shell basis, instead of the original diffusion-weighted images. This is motivated by the fact that the spherical mean is independent of the fiber orientation distribution and is only dependent on tissue microstructure. Applying NMF to the spherical mean data will hence allow tissue signal separation based solely on the microstructural properties, unconfounded by factors such as fiber dispersion and crossing. We show results explaining why applying NMF directly on the diffusion-weighted images fails and why our method is able to yield the expected outcome, producing tissue segmentation with greater accuracy.
Keywords: Tissue segmentation, Sparse NMF, Spherical mean, Diffusion MRI
1. Introduction
In diffusion MRI (DMRI), tissue segmentation is a key step to tease apart signal contributions of white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) for fine-grained analysis [1]. For example, it has been demonstrated in [2] that the overestimation of apparent WM density in voxels containing GM and CSF can be mitigated by explicitly considering the response functions associated with the individual tissue types. Unlike segmentation of anatomical (e.g., T1-weighted) images, which is typically intensity-based, DMRI segmentation is based on diffusion-attenuated signals stemming from the motion of water molecules.
Recently, nonnegative matrix factorization (NMF) has been proposed for tissue segmentation in DMRI [3]. NMF is a data-driven approach for blind source separation and can be viewed as a clustering algorithm [4–9]. The goal of NMF is to approximate a nonnegative data matrix by the product of two unknown matrices: A basis matrix and a weight matrix. The rank of the factorization is generally smaller than that of the data matrix and the product can hence be seen as a compressed representation of the data. The solution to the plain vanilla NMF [4] is not unique. However, this non-uniqueness problem can be mitigated by imposing sparsity or convexity constraints [3, 9, 10].
Jeurissen et al. [3] applied NMF directly to the diffusion-weighted (DW) images for tissue-type segmentation. The data matrix is formed with each row corresponding to a different DW image and each column corresponding to a different voxel. The factorization rank is set to 3 to recover the three main tissue types in the brain. Each column of the basis matrix returned by NMF is a basis vector that best represents a certain tissue type and the corresponding row of the weight matrix contains the weights of the voxels in relation to the tissue type.
We note that the method described above, while seemingly effective, suffers from a major limitation. Although it is straightforward to understand that CSF and GM can each be represented by a basis vector in the basis matrix, it is unclear how WM, which can come in a large variety of orientations and configurations, can be sufficiently captured using a single basis vector. In fact, we will show in this paper that this is an unreasonable representation and causes significant errors in segmentation.
We will address the above problem by applying NMF to spherical mean DMRI data. The spherical mean is computed for each shell associated with a diffusion weighting (i.e., b-value). The spherical mean is independent of the fiber orientation distribution and is only dependent on tissue microstructure [11]. Applying NMF on the spherical mean data will hence allow tissue classification based solely on the microstructural properties, unconfounded by factors such as fiber dispersion and crossing. We will show experimental results demonstrating why applying NMF directly on the DW images is not ideal and why our method is able to overcome the problem, producing tissue segmentation results with greater accuracy.
2. Approach
Our approach to tissue segmentation is inspired by the work of Jeurissen et al. [3]. However, instead of applying NMF directly to the DW images, we apply NMF on the spherical mean, which is computed on a per-shell basis.
2.1. The Spherical Mean
The spherical mean is not dependent on the fiber orientation distribution and is a function of per-axon diffusion characteristics [11]. This is based on the observation that for a fixed b-value the spherical mean of the diffusion-attenuated signal over the gradient directions, i.e.,
| (1) |
does not depend on the fiber orientation distribution. Assuming that the signal can be represented as the spherical convolution of a fiber orientation distribution p(ω) (p(ω) ≥ 0, , ) with an axial and antipodal symmetric kernel hb(g, ω) ≡ hb (|〈g, ω〉|), i.e.,
| (2) |
it can be shown that [11]
| (3) |
This shows that the spherical mean is independent of the orientation distribution.
2.2. Tissue Segmentation via Sparse NMF
Brain tissue segmentation can be cast as a blind source separation problem, where the sources and the mixing weights are determined concurrently. NMF is an ideal tool for this problem owing to its ability to represent the data as an additive nonnegative combination of a set of non-negative basis vectors, resulting in a parts-based decomposition. NMF factorizes a nonnegative data matrix into two unknown nonnegative matrices: The basis matrix and the weight matrix :
| (4) |
In Jeurissen et al.’s [3] approach, each row of V corresponds to a different DW image and each column corresponds to a different voxel inside the brain. Instead of taking this approach, we show that a better approach is to perform segmentation based on the spherical mean. That is, each row of V should correspond to a shell associated with a b-value and each column to the voxels. Upon performing NMF, each column of W is a basis vector that best represents a certain tissue type and the corresponding row of the H contains the weights (also known as volume fractions when normalized to have a sum of 1) of the voxels in relation to the tissue type.
The parameter K of the factorization should be usually chosen to be smaller than M and N. Naturally, for brain tissue segmentation into WM, GM, and CSF, it should be set to 3. The value can be increased beyond 3 to account for pathology such as lesions. In general, NMF is undetermined and the factorization is obtained by solving the following problem
| (5) |
The non-uniqueness of NMF can be controlled by sparsity penalization [9]:
| (6) |
where h j is j-th column vector of H. This form of penalization is reasonable since partial volume occurs mostly at tissue boundaries and hence most parts of the brain should be comprised of single tissue types [3].
The solution to (6) can be obtained by alternating between solving two subproblems, i.e., a non-negative least-squares problem:
| (7) |
when H is fixed, and an elastic net problem
| (8) |
when W is fixed. We use SPAMS [12–14] in our implementation.
Using the spherical mean instead of the original signal in V allows us to solve the problem inherent in Jeurissen et al.’s method [3]. This is because the spherical mean captures the per-axon diffusion characteristics independent of the orientation distribution. For example, voxels with very different fiber configuration, but with the same per-axon microstructural parameters, can be represented using exactly the same spherical mean vector.
Our formulation of the problem also means that a smaller matrix V needs to be decomposed since it has a significantly smaller number of rows. This implies that tissue segmentation can be performed much faster using our method.
3. Experiment
We evaluated our method with Jeurissen et al.’s [3] method as the comparison baseline. Both qualitative and quantitative results are reported.
3.1. Synthetic Data
Three different sets of synthetic datasets were generated for evaluation. The parameters used for generating the synthetic data were consistent with the real data described in the next section: b = 500, 1000, 1500, 2000, 2500, 3000 s/mm2, with respectively 9, 12, 17, 24, 34, and 48 non-collinear gradient directions, 20 × 20 voxels. Rician noise with level comparable to the real data was added to the synthetic data. The three sets of synthetic datasets model three different scenarios: (1) Single: WM with single fiber orientations + GM + CSF; (2) Crossing: WM with two crossing fiber directions + GM + CSF; and (3) Partial Volume: WM with two crossing fiber directions with partial volume of varying degree with GM + CSF. The voxel number ratio is 9:9:2 for WM:GM:CSF, giving 45% WM voxels, 45% GM voxels, and 10% CSF voxels. The ground truth synthetic data with the fiber orientations (FOs) and color-coded tissue volume fractions are shown on the far left of Fig. 1.
Fig. 1.

Synthetic Datasets. Synthetic datasets with different configurations and with/without partial volume effects. Different fiber orientations are simulated. Volume fractions are colored coded with blue for WM, green for GM, and red for CSF. The ground truth volume fractions and those estimated by the baseline and the proposed method are shown
Comparing the estimation results obtained using the baseline and the proposed method, we can see that the baseline method is easily confused by the different WM orientations, resulting in incorrect volume fraction estimates. Note that the GM and CSF volume fraction estimates are also affected by the WM errors. On the other hand, the proposed method performs consistently for the different datasets, confirming the fact that it is not sensitive to variation in WM configurations.
We compute the mean absolute difference (MAD) between the estimated and the ground truth volume fractions. Table 1 shows the MAD values for the datasets, indicating that the proposed method yields estimates that are significantly closer to the ground truth. The baseline method, on the other, deviates significantly from the ground truth. Note that since the volume fraction is a normalized quantity, the estimation errors of one tissue type will affect others.
Table 1.
MAD between estimated volume fraction and ground truth
| Method | Dataset | WM | GM | CSF |
|---|---|---|---|---|
| Baseline | Single | 0.6206 | 0.5584 | 0.0167 |
| Crossing | 0.4411 | 0.6512 | 0.0237 | |
| Mixture | 0.4775 | 0.2728 | 0.0219 | |
| Proposed | Single | 0.0050 | 0.0043 | 0 |
| Crossing | 0.0020 | 0.0028 | 0 | |
| Mixture | 0.0430 | 0.0403 | 0 |
3.2. Real Data
The DW images of an adult were acquired using a Siemens 3T Prisma MR scanner with the following protocol: 140 × 140 imaging matrix, 1.5mm × 1.5mm × 1.5mm resolution, TE = 88ms, TR = 2365ms, 32-channel receiver coil, and multi-band factor 9, 12, 17, 24, 34, and 48 non-collinear directions respectively for b = 500,1000,1500,2000,2500,3000s/mm2. A non-DW image with b = 0s/mm2 was collected for every 24 images, giving a total of 6.
Figure 2 shows the volume fraction maps of the baseline and the proposed method. One can appreciate that with the proposed method a clearer separation of the different types can be obtained. This is apparent from the better contrast of the volume fraction maps given by the proposed method. The WM map given by the proposed method is also more homogeneous in terms of intensity. This is an indicator that the proposed method is less affected by the WM orientation distribution.
Fig. 2.

Volume Fractions. Volume fraction maps given by the baseline and the proposed method
Figure 3 shows the plots for the basis vectors determined using the two methods. For the baseline method, the values (in W) are averaged over each shell to obtain the curves. The curves are expected to decay with diffusion weighting due to the increase in signal attenuation. It can be observed from the figure that the proposed method yields results that are closer to this expectation.
Fig. 3.

Basis. Plots of basis (normalized to 1 at b = 0s/mm2)
4. Conclusion
In this paper, we have shown that improved brain tissue segmentation can be improved by applying NMF on the spherical mean DMRI data. Specifically, we have demonstrated that, in contrast to Jeurissen et al.’s method, the proposed method is unconfounded by the differences in fiber orientation distributions. The proposed method will hence allow tissue segmentation to be performed more robustly using DMRI data.
Acknowledgments
This work was supported in part by NIH grants (NS093842, EB022880, EB006733, EB009634, AG041721, MH100217, and AA012388) and NSFC grants (11671022 and 61502392).
Contributor Information
Peng Sun, Department of Radiology and BRIC, University of North Carolina, Chapel Hill, NC, USA; School of Computer Science, Northwestern Polytechnical University, Xi An, China.
Ye Wu, Department of Radiology and BRIC, University of North Carolina, Chapel Hill, NC, USA; Institute of Information Processing and Automation, Zhejiang University of Technology, Hangzhou, China.
Geng Chen, Department of Radiology and BRIC, University of North Carolina, Chapel Hill, NC, USA.
Jun Wu, School of Electronics and Information, Northwestern Polytechnical University, Xi An, China.
Dinggang Shen, Department of Radiology and BRIC, University of North Carolina, Chapel Hill, NC, USA.
Pew-Thian Yap, Department of Radiology and BRIC, University of North Carolina, Chapel Hill, NC, USA.
References
- 1.Yap PT, Zhang Y, Shen D: Multi-tissue decomposition of diffusion MRI signals via ℓ0 sparse-group estimation. IEEE Trans. Image Process. 25(9), 4340–4353 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jeurissen B, Tournier JD, Dhollander T, Connelly A, Sijbers J: Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. NeuroImage 103, 411–426 (2014) [DOI] [PubMed] [Google Scholar]
- 3.Jeurissen B, Tournier JD, Sijbers J: Tissue-type segmentation using non-negative matrix factorization of multi-shell diffusion-weighted MRI images. In: The Annual Meeting of the Internal Society for Magnetic Resonance in Medicine, vol. 23, p. 0349 (2015) [Google Scholar]
- 4.Lee DD, Seung HS: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999) [DOI] [PubMed] [Google Scholar]
- 5.Zdunek R, Cichocki A: Blind image separation using nonnegative matrix factorization with gibbs smoothing. In: International Conference on Neural Information Processing, Springer, pp. 519–528 (2007) [Google Scholar]
- 6.Wang J, Lai S, Li M, Lies D: Medical image retrieval based on an improved non-negative matrix factorization algorithm. J. Digit. Inf. Manag. 13(6) (2015) [Google Scholar]
- 7.Matloff N: Quick introduction to nonnegative matrix factorization (2016) [Google Scholar]
- 8.Rajapakse M, Wyse L: Face recognition with non-negative matrix factorization. In: Visual Communications and Image Processing, International Society for Optics and Photonics, vol. 5150, pp. 1838–1848 (2003) [Google Scholar]
- 9.Hoyer PO: Non-negative matrix factorization with sparseness constraints. J. Mach. Learn. Res. 5(Nov), 1457–1469 (2004) [Google Scholar]
- 10.Christiaens D, Maes F, Sunaert S, Suetens P: Convex non-negative spherical factorization of multi-shell diffusion-weighted images. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, pp. 166–173 (2015) [Google Scholar]
- 11.Kaden E, Kruggel F, Alexander DC: Quantitative mapping of the per-axon diffusion coefficients in brain white matter. Magn. Reson. Med. 75(4), 1752–1763 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mairal J, Bach F, Ponce J, Sapiro G, Jenatton R, Obozinski G: SPAMS: A SPArse Modeling Software, v2.3 (2012)
- 13.Mairal J, Bach F, Ponce J, Sapiro G: Online dictionary learning for sparse coding. In: International Conference on Machine Learning, ACM, pp. 689–696 (2009) [Google Scholar]
- 14.Mairal J, Bach F, Ponce J, Sapiro G: Online learning for matrix factorization and sparse coding. J. Mach. Learn. Res. 11(Jan), 19–60 (2010) [Google Scholar]
