Abstract
In this paper we present a framework for unsupervised segmentation of white matter fiber traces obtained from diffusion weighted MRI data. Fiber traces are compared pairwise to create a weighted undirected graph which is partitioned into coherent sets using the normalized cut (N cut) criterion. A simple and yet effective method for pairwise comparison of fiber traces is presented which in combination with the N cut criterion is shown to produce plausible segmentations of both synthetic and real fiber trace data. Segmentations are visualized as colored stream-tubes or transformed to a segmentation of voxel space, revealing structures in a way that looks promising for future explorative studies of diffusion weighted MRI data.
1 Introduction
Diffusion Weighted MRI (DWI) makes it possible to non-invasively measure water diffusion within tissue. In a volume acquired using DWI, each voxel contains a diffusion tensor or other higher order descriptor for the local water diffusion. In fibrous tissue such as muscles and human brain white matter, water tend to diffuse less in the directions perpendicular to the fiber structure. This makes it possible to study the local fiber orientations indirectly by interpreting the water diffusion within the voxel. From DWI data it is therefore possible to create so called fiber traces from virtual particles, traveling along the direction of maximum diffusion, starting from a set of seed points [1,2,13], a.k.a fiber tracking. Performing fiber tracking in DWI data from the human brain gives valuable insights about fiber tract connectivity, useful in for instance surgical planning and for the study of various diseases such as schizophrenia.
In our experiments we have exclusively used data from so called diffusion tensor MRI (DT-MRI) [9], where the diffusion inside a voxelis described by a second order symmetric positive definite 3 × 3 tensor, which may be thought of as an ellipsoid. An elongated ellipsoid represent high diffusivity in a particular direction, which may be interpreted as the dominant orientation of the fibers going thru that particular voxel. From this data, fiber traces were created within the white matter areas using a standard fiber tracking algorithm following the principal direction of diffusion based on a fourth-order Runge-Kutta integration scheme.
The contribution of this paper is a novel post processing method for clustering or segmentation of such fiber traces. Fiber traces are grouped according to a pairwise similarity function which takes into account the shape and connectivity of fiber traces. The clustering method we propose builds on so called normalized cuts, which have previously been introduced in the computer vision community by Shi and Malik [10] for automatic segmentation of digital images. This results in an unsupervised segmentation of human brain white matter, in which fiber traces are grouped into coherent bundles, applicable to any DWI technology able to produce fiber traces. For an overview of the method, see figure 1.
1.1 Previous Work
There are numerous examples where fiber traces from DWI have successfully revealed fiber tracts in the human brain, see for instance [2,3,13]. Stream-tubes have often been used for visualization [14], sometimes in combination with coloring schemes and variation of the stream-tube thickness according to some aspect of the underlying local diffusion descriptor. The idea of using fiber traces to obtain segmentations of white matter fiber tracts, as well as gray matter areas, have been explored in a number of papers recently. In [3] a segmentation of deep gray matter structures is performed using probabilistic fiber tracking, which connects pre-segmented areas of the human cortex with the thalamus. There also exist manual approaches to organize fiber traces into fiber bundles, such as the virtual dissection proposed in [5]. In [4] the idea of pseudo-coloring (soft clustering) fiber traces to enhance the perception of connectivity in visualizations of human brain white matter was presented. Some unsupervised approaches to clustering of fiber traces, similar to the one in this paper, have also been reported. For instance fuzzy c-means clustering [11] and K nearest neighbors [6]. Outside the area of medical image processing, clustering of curves [8] has been reported.
2 Determining Fiber Similarity
Many clustering methods, including the N Cut being used in this paper, operate on a graph with undirected weighted edges describing the pairwise similarity of the objects to be clustered. This graph may be described using a weight matrix W, which is symmetric and has values ranging from 0 (dissimilar) to 1 (similar).
A fiber trace, represented as an ordered set of points in space, is a fairly high-dimensional object. Therefore the pairwise comparison of all fiber traces could potentially be a time-demanding task if fiber trace similarity is cumbersome to calculate and the number of fiber traces is high. In this paper we propose to split the computation of similarity into two steps:
Mapping high-dimensional fiber traces to a relatively low-dimensional Euclidean feature space, preserving some but not all information about fiber shape and fiber connectivity. This mapping is oblivious, acting on each fiber separately.
The use of a Gaussian kernel for comparison of points in the Euclidean feature space. This function acts on pairs of fiber traces.
It is important to point out early that even though the above mapping to a feature space may seem to be crude at a first glance, it works surprisingly well for fiber traces in practice. For a set of N fiber traces the first calculation above cost O(N), while the second calculation cost O(N2) operations. This is also the reason for pre-processing the fiber data in the first step, making the second calculation more computationally efficient.
2.1 Mapping Fiber Traces to an Euclidean Feature Space
The position, shape and connectivity are important properties of a fiber trace to preserve in the mapping to a feature space. If we regard a fiber trace as just a set of points in space, we capture a sketch of these properties by calculating the mean vector m and the covariance matrix C of the points building up the fiber trace. In order to avoid non-linear scaling behavior, we take the square root of the covariance matrix, Now the mapping of a fiber F may be described
(1) |
which is a 9-dimensional vector. This mapping has the desirable property that it is rotation and translation invariant in the sense that the Euclidean distance between two fiber traces mapped to the 9-dimensional space is invariant to any rotations or translations in the original space. For applications where mean and covariance is not enough to discriminate between different clusters of fiber traces, higher order central moments [12] could add more feature dimensions to the above mapping. Also, in cases when fiber connectivity is more important than shape, a higher weight could be given to the fiber trace end-points in the calculation of the mean vector and covariance matrix above. Different weights could also be given to the mean vector and the covariance components in the mapping in eq (1).
2.2 Using the Gaussian Kernel for Pairwise Comparison
When fiber traces have been mapped to points in a Euclidean feature space, they may be compared relatively easy for similarity. We choose Gaussian kernels (a.k.a. Radial Basis Functions in Neural Networks literature)
(2) |
which are symmetric and contain a parameter σ which we may use to adjust the sensitivity of the similarity function. This function maps similar points in feature space to unity and dissimilar points to zero.
2.3 Constructing W
By combining the mapping to a Euclidean feature space with the Gaussian kernel, we obtain the weights of an undirected graph describing the similarity between all pairs of fiber traces. The weights are stored in a matrix W defined as
(3) |
This matrix is expected to be sparse, having most of the values close to zero.
3 Normalized Cut and Clustering
Clustering, segmentation and perceptual grouping using normalized cuts was introduced to the computer vision community by Shi and Malik in [10]. The points to be clustered are represented by a undirected graph G = (V, E), where the nodes correspond to the points to be clustered and each edge has weight w(i, j) which represent the similarity between point i and j. The cut is a graph theoretical concept which for a partition of the nodes into two disjunct sets A and B bipartitioning V is defined as
(4) |
Using the cut, an optimal partitioning of the nodes may be defined as one that minimizes the cut. Intuitively this could be used for segmentation, while the minimum cut corresponds to a partitioning which keeps well connected components of the graph together. However, there is no bias in the minimum cut which says it should partition the graph in two parts of equal size. Shi and Malik therefore defined the normalized cut, which is defined as
(5) |
where asso(A, V) = Σu∈A,t∈V w(u, t). This new measure N cut tries to minimize the cut, while at the same time penalizing partitions in which one set of nodes is only loosely connected to the graph at large. If we define xi = 1 when node i ∈ A, xi = −1 when node i ∈ B, d(i) = Σj w(i, j), , D is a matrix with d = d(i) in it’s diagonal and W is the matrix defined by w(i, j) then it is shown in [10] that
(6) |
which can be shown to be equivalent to
(7) |
where yi ∈ {1, −b}, b = k/(1 − k), for y = (1 + x) − b(1 − x) while yT d = 0. Relaxing the problem by allowing y to take any real values results in the minimization of the so called Rayleigh quotient, which can be minimized by solving
(8) |
It is shown in [10] that the second smallest eigenvector of eq. (8) minimizes the real valued version of the normalized cut.
In our implementation we used the second smallest eigenvector to obtain a 1-d ordering of the vertices of the graph. A random search was then performed to determine a good threshold for the bipartitioning of the graph. To test the goodness of a specific threshold the true discrete N cut was calculated. When the graph has been split into two, [10] recommends the partitioning continues recursively until the N cut raises above a certain value.
4 Results
The method was tested on both synthetic datasets and fiber traces from real diffusion weighted MRI. Results are visualized using both stream-tubes and color coded voxel data. All algorithms were implemented in Matlab.
– In figure 2 the method is tested on synthetically generated fiber traces in 2-D.
– In figure 3 real fiber traces obtained from DT-MRI was used as input.
– In figure 4 the method was tested with a very large value of σ and direct mapping of the second, third and fourth eigenvector to colors similar to the approach described in [4].
5 Discussion
We have not yet investigated the effects of noise to the clustering. Also, the maximum number of fiber traces we have analyzed so far is only about 5000, due to the current implementation in Matlab which does not fully exploit the sparsity of W. Never the less, the experiments show the potential of the N Cut criterion and the proposed measure of fiber similarity.
One insight from the experiments is that pseudo-coloring of fiber traces is very effective to reveal errors in fiber tracking algorithms. A collection of fiber traces may look ok at a first glance, but after pseudo-coloring, the anomalies are easily spotted. One example of this is in figure 4 (middle) where the red fiber traces may be identified instantly as outliers because they have a very different color than surrounding fiber traces.
In our experiments we have segmented fiber traces and then sometimes transformed the results back to voxel space. One may ask if it would be possible to segment voxels directly, and what features to use to discriminate voxels. A solution with obvious similarities to the approach presented in this paper would be to perform fiber tracking, possibly stochastic [3], from each voxel inside white matter and regard the fiber traces as a non-local features of the voxels – macro features.
The continuous coloring in figure 4 appears to be more visually pleasing than the discrete coloring according to the segmentation in figure 3. One may in fact ask if a structure such as the corpus callosum is meaningful to partition into several clusters or whether it is better described as one fiber bundle parameterized by a coordinate system going from anterior to posterior. One could think of the smoothly varying colors in figure 4 as coordinate systems, parameterizing all fiber traces in the cross bundle directions. It is in fact then also natural to add a parameterization of each bundle in the fiber direction.
In conclusion the proposed clustering method seems to be a promising new way to automatically reveal the global structure of white matter by segmentation of fiber traces obtained from DWI data. We believe this to be useful in for instance explorative studies of the brain and for visualization of DWI data in surgical planning applications. Important to note though is that all post processing methods for fiber traces share the same weakness: they all rely on a good fiber tracking algorithm to perform well.
Acknowledgments
Thanks to Marek Kubicki, Lauren O’Donnell, William Wells, Steven Haker, Stephan Maier, Robert McCarley and Ron Kikinis for interesting discussions and to Raúl San José Estépar for valuable comments on the manuscript.
This work was funded in part by NIH grants P41-RR13218 and R01 MH 50747, and CIMIT. It was also supported in part by the Post-doctoral Fellowship Program of Korea Science & Engineering Foundation (KOSEF).
Contributor Information
Anders Brun, Email: andbr@imt.liu.se.
Hans Knutsson, Email: knutte@imt.liu.se.
Hae-Jeong Park, Email: parkhj@yumc.yonsei.ac.kr.
Martha E. Shenton, Email: martha_shenton@hms.harvard.edu.
Carl-Fredrik Westin, Email: westin@bwh.harvard.edu.
References
- 1.Basser PJ. Inferring microstructural features and the physiological state of tissues from diffusion-weighted images. NMR in Biomedicine. 1995;8:333–344. doi: 10.1002/nbm.1940080707. [DOI] [PubMed] [Google Scholar]
- 2.Basser PJ, Pajevic S, Pierpaoli C, Duda J, Aldroubi A. In vivo fiber tractography using DT-MRI data. Magn Reson Med. 2000;44:625–632. doi: 10.1002/1522-2594(200010)44:4<625::aid-mrm17>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]
- 3.Behrens„ TEJ, Johansen-Berg H, Woolrich MW, Smith SM, Wheeler-Kingshott CAM, Boulby PA, Barker GJ, Sillery EL, Sheehan K, Ciccarelli O, Thompson AJ, Brady JM, Matthews PM. Non-invasive mapping of connections between human thalamus and cortex using diffusion imaging. Nature Neuroscience. 2003;6(7):750–757. doi: 10.1038/nn1075. [DOI] [PubMed] [Google Scholar]
- 4.Brun A, Park H-J, Knutsson H, Westin Carl-Fredrik. Coloring of DT-MRI Fiber Traces using Laplacian Eigenmaps. Computer Aided Systems Theory - EUROCAST 2003; LNCS; Spain: Las Palmas de Gran Canaria; Feb, 2003. [Google Scholar]
- 5.Catani M, Howard RJ, Pajevic S, Jones DK. Virtual in vivo interactive dissection of white matter fasciculi in the human brain. Neuroimage. 2002;17:77–94. doi: 10.1006/nimg.2002.1136. [DOI] [PubMed] [Google Scholar]
- 6.Ding Z, Gore JC, Anderson AW. Classification and quantification of neuronal fiber pathways using diffusion tensor MRI. Mag Reson Med. 2003;49:716–721. doi: 10.1002/mrm.10415. [DOI] [PubMed] [Google Scholar]
- 7.Fowlkes C, Belongie S, Malik J. Efficient Spatiotemporal Grouping Using the Nyström Method. CVPR; Hawaii: Dec, 2001. [Google Scholar]
- 8.Gaffney SJ, Smyth P. In: Bishop CM, Frey BJ, editors. Curve clustering with random effects regression mixtures; Proc. Ninth Int. Workshop on AI and Stat; Florida: 2003. [Google Scholar]
- 9.Lebihan D, Breton E, Lallemand D, Grenier P, Cabanis E, LavalJeantet M. MR imaging of intravoxel incoherent motions: application to diffusion and perfusion in neurologic disorders. Radiology. 1986;161:401–407. doi: 10.1148/radiology.161.2.3763909. [DOI] [PubMed] [Google Scholar]
- 10.Shi J, Malik J. Normalized Cust and Image Segmentation. PAMI. 2000;22(8) [Google Scholar]
- 11.Shimony JS, Snyder AZ, Lori N, Conturo TE. Proc Intl Soc Mag Reson Med. Vol. 10. Honolulu, Hawaii: May, 2002. Automated fuzzy clustering of neur-Sonal pathways in diffusion tensor tracking. [Google Scholar]
- 12.Weisstein E. Eric Weisstein’s world of mathematics. 2004 March 5; http://mathworld.wolfram.com/
- 13.Westin CF, Maier SE, Mamata H, Nabavi A, Jolesz FA, Kikinis R. Processing and Visualization of Diffusion Tensor MRI. Medical Image Analysis. 2002;6(8) doi: 10.1016/s1361-8415(02)00053-1. [DOI] [PubMed] [Google Scholar]
- 14.Zhang S, Curry T, Morris DS, Laidlaw DH. Streamtubes and streamsurfaces for visualizing diffusion tensor MRI volume images. Visualization ’00 Work in Progress. 2000 October; [Google Scholar]