Abstract
In this paper, we propose an efficient framework for parcellation of white matter tractograms using discriminative dictionary learning. Key to our framework is the learning of a compact dictionary for each fiber bundle so that the streamlines within the bundle can be sufficiently represented. Dictionaries for multiple bundles are combined for whole-brain tractogram representation. These dictionaries are learned jointly to encourage inter-bundle incoherence for discriminative power. The proposed method allows tractograms to be assigned to more than one bundle, catering to scenarios where tractograms cannot be clearly separated. Experiments on a bundle-labeled HCP dataset and an infant dataset highlight the ability of our framework in grouping streamlines into anatomically plausible bundles.
Keywords: Diffusion MRI, Tractography, Fiber bundle, Dictionary learning
1. Introduction
Diffusion magnetic resonance imaging (DMRI) [1] allows analysis of white matter (WM) axonal trajectories via streamlines (i.e., tractogram) generated by a process called tractography. Automatically grouping several millions of streamlines into anatomically meaningful bundles is an important yet challenging task [2–4], since streamlines within a bundle can have different shapes, lengths, and endpoints.
A fiber tract is a collection of axons having common origin and destination sites. Automatic tract identification strategies can be based on regions of interest (ROIs) [5], clustering [2,3], atlases [2,6], and segmentation [7,8]. Many existing methods rely on point correspondences between streamlines [9]. However, even within the same bundle, streamlines can have different lengths and endpoints. Therefore, standard geometric distance measures often lead to poor results. Moreover, the streamlines are often first aligned to a common space defined for example by a tract atlas and then resampled with the same number of points to facilitate computation of distances between streamlines [3,10]. If the underlying anatomy is altered due to pathology or development, meaningful information might be lost in the process of resampling [11].
In this paper, we propose a method to efficiently parcellate the whole-brain tractogram, catering especially to scenarios where streamline separation is challenging. This is achieved via tract dictionary learning (TractDL) to learn tractogram exemplars for fast and automated analysis of large datasets. Unlike existing dictionary-based approaches [12,13], our method adopts a natural means of enforcing consistency across hemispheres and is unconfounded by the lengths of streamlines. Instead of learning the dictionary in the native point space, we map the tractograms to a Hilbert space via cosine series representation. Parcellation is performed in the cosine representation space via a sparse framework with a set of learned dictionaries, each for a bundle. We exploit structured incoherence and shared features across bundles so that within-bundle variability is small but between-bundle variability is large. Using the dictionaries, whole-brain bilateral clustering is applied for simultaneous bundle parcellation in both hemispheres. We evaluated TractDL based on HCP data and infant data in comparison with existing bundle parcellation methods.
2. Methods
2.1. Modeling Fiber Streamlines
We represent the streamlines in Hilbert space [11]. Let denote a streamline of a fiber bundle, consisting of n points . Note that n can be different for each streamline. We consider a function f−1 that maps these points onto the unit interval [0, 1] as
(1) |
That is, γ(j) indicates the ratio between the arc-length from the point p(1) to p(j), and the total arc-length from p(1) to p(n). We can parameterize the smooth inverse map f : [0, 1] → l as a linear combination of smooth basis function.
Let the coordinates of f be (f1, f2, f3). Then, each coordinate can be approximated using K degree cosine series expansion [11]:
(2) |
where {ψk} is a cosine basis. Then, for corresponding to n sampling points , we have
(3) |
The matrix of cosine coefficients can be estimated from in the least-squares manner as
(4) |
where . We note that the dimension of the coefficients only depends on the degree K of cosine series (K = 19 in this paper), but not on the number of sampling points along the streamline.
2.2. Tract Dictionary Learning
Let Xt represent the set of cosine coefficients of the streamlines of t-th bundle. We learn a dictionary Dt for the t-th bundle using sparse dictionary learning (DL) by minimizing the following objective function:
(5) |
Each column of Xt can be represented as a linear combination of a set of vectors in an over-complete dictionary Dt with sparse coefficients in the corresponding column of At. Based on the coefficients, the streamlines can be classified into different bundles.
We train the dictionaries of multiple fiber bundles jointly. Note that if the dictionaries of the individual bundles are trained separately, they are not necessarily discriminative enough to help distinguish the streamlines from the different bundles. To obtain incoherent discriminative dictionaries, we let X = [X1, X2, …], D = [D1, D2, …], and , and solve the following problem:
(6) |
where encourages orthogonality between each dictionary pair [14]. λ and η are scalar tuning parameters. The size of each bundle dictionary is fixed as m. Each row of X is normalized to have zero mean and unit variance to remove the effects of distribution location and scale. In this paper, we solved (6) using an open source dictionary learning toolbox (DICTOL)1.
2.3. Streamline Classification
The sparse code at of the cosine representation x of a streamline with respect to the dictionary of bundle t can be used for streamline classification. This is achieved by solving
(7) |
where t* is the label of streamline x, and λ is a regularization parameter.
To rapidly classify whole-brain streamlines into bundles, we first perform unsupervised clustering to cluster the streamlines into a number of clusters. We employ a fast k-medoids clustering algorithm on the cosine representation coefficients of the streamlines. To improve clustering robustness, bilateral clustering is applied for simultaneous bundle parcellation in both cerebral hemispheres. A total of 2000 of fiber clusters are generated for good anatomical separation of the streamlines while maintaining good clustering consistency across subjects [3]. Figure 1 shows the clustering results, color-coded by local streamline orientations. We then randomly select 50 streamlines from each cluster and label each streamline via solving (7). The final bundle label assigned to each cluster is the label with count fraction (out of 50) greater than a threshold v.
Fig. 1.
Inter-hemispheric bilateral clustering of the whole-brain tractogram.
3. Experimental Results
3.1. Dataset
The dataset used for evaluation consisted of 72 WM tract bundles for each of 105 young adults from the Human Connectome Project (HCP) [15]. The ground-truth tracts were extracted semi-automatically from whole-brain tractograms [16]. 70 subjects were used for training and the remaining 35 subjects were used for testing. The fiber streamlines of each subject were transformed to a common space via affine transformation determined by volumetric registration. For each subject, 1000 streamlines were randomly selected from each bundle for training. For bundles with less than 1000 streamlines, all streamlines were selected. The abbreviations of the fiber bundles are defined in [16].
3.2. Validation
To choose the optimal dictionary size per bundle, we evaluated the leave-one-out cross-validation identification accuracy (ACC) [16,17] of association and projection tracts of the training subjects as a function of the dictionary size m. Figure 2 indicates a clear convergence trend when m is greater than 2000. Therefore, we used m = 2000 in our experiments. We also repeated tract classification 100 times with different tract samples and Fig. 3 indicates that classification variability across repetitions is low.
Fig. 2.
ACC as a function of the dictionary size m.
Fig. 3.
ACC variation over 100 classification repetitions.
We used RecoBundles [2], a state-of-the-art bundle parcellation method, as the comparison method. RecoBundles registers the tractograms to a reference subject and uses clustering to detect streamlines that are similar to reference tracts. We used the default parameters while running RecoBundles on the 35 test subjects. For evaluation, we combined the 72 fiber bundles for each test subject to form whole-brain tractograms. To compare the accuracy of the methods, we let and be the ground-truth streamlines and the identified streamlines, respectively, and define the ACC of the tract identification in two ways:
(8) |
Figure 4 indicates that TractDL yields higher classification accuracy in general, especially in identifying association and projection tracts.
Fig. 4.
The identification ACC of each tract across subjects measured in two ways; ACC(, ) (top) and ACC(, ) (bottom).
To evaluate the robustness of TractDL, we regenerated the whole-brain tractogram of one subject randomly chosen from the testing dataset. We used MRtrix to compute the fiber orientation distribution functions with multi-shell multi-tissue constrained spherical deconvolution and generated two million fibers with anatomically constrained probabilistic tractography. By employing TractDL, we were able to identify difficult bundles such as UF, OR, CST, FPT, and POPT (Fig. 5).
Fig. 5.
The reference bundles and the bundles identified by white matter query language (WMQL) [5], RecoBundles, and TractDL.
To study hemispheric asymmetry of the bundles, we computed the laterality index: LI = (L − R)/(L + R), where R and L are the streamline counts in the right and left hemispheres, respectively. Table 1 shows the absolute values of LI [18,19], which ranges from 0 (not lateralized) to 1 (completely lateralized). This confirms that fiber bundles identified by TractDL are more symmetric for most of the bundles, which is in line with previous studies [19,20].
Table 1.
Laterality index.
Laterality | ATR | UF | ST_FO | ICP | OR | T_PREM | ST_PREM | CST | POPT | FPT |
---|---|---|---|---|---|---|---|---|---|---|
Reference | 0.2640 | 0.5539 | 0.2796 | 0.3311 | 0.3339 | 0.1066 | 0.2019 | 0.3669 | 0.0058 | 0.0887 |
WMQL | 0.5236 | 0.5870 | 0.0470 | 0.0877 | 0.4539 | 0.5322 | 0.1483 | 0.0178 | 0.0714 | 0.3850 |
RecoBundles | 0.5459 | 0.9872 | 0.0292 | 0.0178 | 0.2288 | 0.1581 | 0.1044 | 0.1706 | 0.4079 | 0.4501 |
TractDL | 0.3071 | 0.2172 | 0.0844 | 0.0494 | 0.1143 | 0.2206 | 0.2188 | 0.0023 | 0.0896 | 0.0905 |
To evaluate the generalizability of TractDL, we applied it to a 54-days-old infant subject image acquired as part of Baby Connectome Project (BCP) [21], with a protocol that is different from the training subjects. Tractography was conducted with asymmetry spectrum imaging (ASI) as described in [22,23]. ASI fits a mixture of asymmetric fiber orientation distribution function (AFODF) to the diffusion signal. Fiber streamlines were then generated by successively following local directions determined from the AFODF [23]. Figure 6 indicates that the bundles are correctly identified by TractDL.
Fig. 6.
Fiber bundles identified by TractDL in an infant subject.
4. Conclusion
We presented a method for fast and robust identification of fiber bundles of interest from a large number of streamlines. The coordinates of each streamline were parameterized as coefficients of cosine series expansions. Based on this representation, we constructed dictionaries for multiple bundles of the whole-brain tractogram. The dictionaries were learned jointly to encourage incoherence for tractogram classifications. The effectiveness, robustness, generalizability, and bilaterality of our method were validated by the experimental results.
Acknowledgments
This work was supported in part by NIH grants (NS093842, EB006733, and MH110274) and the efforts of the UNC/UMN Baby Connectome Project Consortium.
Footnotes
References
- 1.Johansen-Berg H, Behrens TE: Diffusion MRI: From Quantitative Measurement to In Vivo Neuroanatomy. Academic Press, London: (2013) [Google Scholar]
- 2.Garyfallidis E, et al. : Recognition of white matter bundles using local and global streamline-based registration and clustering. NeuroImage 170, 283–295 (2018) [DOI] [PubMed] [Google Scholar]
- 3.Zhang F, et al. : An anatomically curated fiber clustering white matter atlas for consistent white matter tract parcellation across the lifespan. NeuroImage 179, 429–447 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kumar K, Desrosiers C, Siddiqi K: Brain fiber clustering using non-negative kernelized matching pursuit. In: Zhou L, Wang L, Wang Q, Shi Y (eds.) MLMI 2015. LNCS, vol. 9352, pp. 144–152. Springer, Cham: (2015). 10.1007/978-3-319-24888-218 [DOI] [Google Scholar]
- 5.Wassermann D, et al. : The white matter query language: a novel approach for describing human white matter anatomy. Brain Struct. Funct 221(9), 4705–4721 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Eckstein I, et al. : Active fibers: matching deformable tract templates to diffusion tensor images. Neuroimage 47, T82–T89 (2009) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hua K, et al. : Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract-specific quantification. NeuroImage 39(1), 336–347 (2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wassermann D, Bloy L, Kanterakis E, Verma R, Deriche R: Unsupervised white matter fiber clustering and tract probability map generation: applications of a Gaussian process framework for white matter fibers. NeuroImage 51(1), 228–241 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.O’Donnell LJ, Golby AJ, Westin CF: Fiber clustering versus the parcellation-based connectome. NeuroImage 80, 283–289 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.O’Donnell LJ, Westin CF: Automatic tractography segmentation using a high-dimensional white matter atlas. IEEE Trans. Med. Imaging 26(11), 1562–1575 (2007) [DOI] [PubMed] [Google Scholar]
- 11.Chung MK, Adluru N, Lee JE, Lazar M, Lainhart JE, Alexander AL: Cosine series representation of 3D curves and its application to white matter fiber bundles in diffusion tensor imaging. Stat. Interface 3(1), 69–80 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Alexandroni G, et al. : White matter fiber representation using continuous dictionary learning. In: Descoteaux M, Maier-Hein L, Franz A, Jannin P, Collins DL, Duchesne S (eds.) MICCAI 2017. LNCS, vol. 10433, pp. 566–574. Springer, Cham: (2017). 10.1007/978-3-319-66182-765 [DOI] [Google Scholar]
- 13.Kumar K, Siddiqi K, Desrosiers C: White matter fiber analysis using kernel-dictionary learning and sparsity priors. Pattern Recogn. 95, 83–95 (2019) [Google Scholar]
- 14.Ramirez I, Sprechmann P, Sapiro G: Classification and clustering via dictionary learning with structured incoherence and shared features. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3501–3508. IEEE (2010) [Google Scholar]
- 15.Van Essen DC, et al. : The WU-Minn human connectome project: an overview. NeuroImage 80, 62–79 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wasserthal J, Neher P, Maier-Hein KH: TractSeg - fast and accurate white matter tract segmentation. NeuroImage 183, 239–253 (2018) [DOI] [PubMed] [Google Scholar]
- 17.Taha AA, Hanbury A: Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool. BMC Med. Imaging 15(1), 29 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.O’Donnell LJ, et al. : The fiber laterality histogram: a new way to measure white matter asymmetry. In: Jiang T, Navab N, Pluim JPW, Viergever MA (eds.) MICCAI 2010. LNCS, vol. 6362, pp. 225–232. Springer, Heidelberg: (2010). 10.1007/978-3-642-15745-528 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Propper RE, et al. : A combined fMRI and DTI examination of functional language lateralization and arcuate fasciculus structure: effects of degree versus direction of hand preference. Brain Cogn. 73(2), 85–92 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.de SchottenSchotten MT, et al. : Atlasing location, asymmetry and inter-subject variability of white matter tracts in the human brain with MR diffusion tractography. NeuroImage 54(1), 49–59 (2011) [DOI] [PubMed] [Google Scholar]
- 21.Howell BR, et al. : The UNC/UMN baby connectome project (BCP): an overview of the study design and protocol development. NeuroImage 185, 891–905 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wu Y, Lin W, Shen D, Yap P-T: Asymmetry spectrum imaging for baby diffusion tractography. In: Chung ACS, Gee JC, Yushkevich PA, Bao S (eds.) IPMI 2019. LNCS, vol. 11492, pp. 319–331. Springer, Cham: (2019). 10.1007/978-3-030-20351-124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wu Y, Hong Y, Feng Y, Shen D, Yap PT: Mitigating gyral bias in cortical tractography via asymmetric fiber orientation distributions. Med. Image Anal 59, 101543 (2020) [DOI] [PMC free article] [PubMed] [Google Scholar]