PLOS Computational Biology. 2021 Jun 28;17(6):e1009136. doi: 10.1371/journal.pcbi.1009136

Multidimensional analysis and detection of informative features in human brain white matter

Adam Richie-Halford 1,*, Jason D Yeatman 2, Noah Simon 3, Ariel Rokem 1,4
Editor: Roberto Toro 5
PMCID: PMC8270416  PMID: 34181648

Abstract

The white matter contains long-range connections between different brain regions and the organization of these connections holds important implications for brain function in health and disease. Tractometry uses diffusion-weighted magnetic resonance imaging (dMRI) to quantify tissue properties along the trajectories of these connections. Statistical inference from tractometry usually either averages these quantities along the length of each fiber bundle or computes regression models separately for each point along every one of the bundles. These approaches are limited in their sensitivity, in the former case, or in their statistical power, in the latter. We developed a method based on the sparse group lasso (SGL) that takes into account tissue properties along all of the bundles and selects informative features by enforcing both global and bundle-level sparsity. We demonstrate the performance of the method in two settings: i) in a classification setting, patients with amyotrophic lateral sclerosis (ALS) are accurately distinguished from matched controls. Furthermore, SGL identifies the corticospinal tract as important for this classification, correctly finding the parts of the white matter known to be affected by the disease. ii) In a regression setting, SGL accurately predicts “brain age.” In this case, the weights are distributed throughout the white matter indicating that many different regions of the white matter change over the lifespan. Thus, SGL leverages the multivariate relationships between diffusion properties in multiple bundles to make accurate phenotypic predictions while simultaneously discovering the most relevant features of the white matter.

Author summary

The connections between different parts of the brain form networks that are important for information transmission and for brain health. These connections are composed of nerve fibers that travel through the white matter portion of the brain. Thus, mapping tissue properties in white matter pathways can help us understand which connections and what tissue properties are relevant to brain diseases, or explain differences in behavior and cognition between different individuals. We developed a new statistical method that helps map the white matter by automatically identifying the features of the white matter that correspond to individual differences. Our approach relies on incorporating our knowledge of the anatomical sub-divisions of the white matter into the statistical model itself. We demonstrate that the model accurately captures differences between individuals with amyotrophic lateral sclerosis (ALS) and healthy controls. It also accurately captures brain changes that correspond with the ages of different individuals. We tested our method on four different datasets and found that it is more accurate than previous methods used with these datasets, while also allowing us to highlight which white matter tissue properties and which connections account for individual differences, thus providing interpretable results.


This is a PLOS Computational Biology Methods paper.

Introduction

Non-invasive methods for measuring human brain structure and function have revolutionized our understanding of brain function. These measurements have demonstrated that interactions between networks of brain regions give rise to coordinated information processing and to the complex adaptive behavior that characterizes human cognition. Diffusion-weighted Magnetic Resonance Imaging (dMRI) provides a unique view into the physical properties of the connections that comprise these networks, by sensitizing the measurement to the directional diffusion of water in each voxel [1, 2]. Methods for computational tract-tracing from diffusion MRI, or tractography, combine the estimates of fiber orientations in each voxel to form streamlines that traverse the volume of the white matter [3, 4]. A variety of methods can be used to delineate the trajectory of major neural pathways among these streamlines [5]. Tractometry uses the results of tractography and models of tissue biophysics based on the patterns of diffusion in each measurement voxel to assess the physical properties of the white matter along specific pathways [6, 7]. In some previous tractometry-based studies, tissue properties along the length of each tract were summarized by taking the mean along each bundle, but there is a large body of evidence showing that there is systematic variability in the values of diffusion metrics along the trajectory of each bundle. This justifies retaining the individual samples along the length of each bundle [5, 6, 8, 9], hereafter referred to as tract profiling. While this retains important information about each individual’s white matter, it also presents statistical challenges due to the dimensionality of the data. In past work, comparisons between groups or across individuals were done independently at each node of each bundle, for each diffusion metric. This approach is exhaustive, but statistical power is compromised by a multiple comparison problem [8, 10–12].
An alternative that circumvents the multiple comparison problem is to select just a few tracts to compare in each individual, or even segments of these tracts based on a priori hypotheses. This approach is appropriate when the biological basis of the process of interest is relatively well understood (for a recent example, see [13]). Sometimes, these approaches are combined: a bundle is selected based on a priori knowledge, and all the data in the bundle of interest are used together to fit a model that can predict differences between individuals [14].

The present work aims to balance predictive accuracy with descriptive power [15, 16] by capitalizing on all of the available data across all bundles, while also retaining and elucidating spatial information about the locations that are most informative for discriminative performance. Unlike many previous analyses of white matter (e.g., using TBSS [17] and connectometry [18]), which focus on classical inference about the differences between groups or individuals in terms of their white matter properties, the focus of the present work is on the combination of features that facilitate accurate prediction of the individual differences in a particular phenotype. For example, an accurate prediction of which individuals in a group have a particular disease, or a prediction of their age with small error. The distinct goals of inference and prediction are not necessarily incongruent, but can sometimes be in tension [19]. This distinction is also similar to the distinction between “encoding” and “decoding” used in the functional MRI literature [20]. In the present work, we predict the phenotypical variance in a group of subjects, or classify group membership, based on a linear combination of the features estimated with tract profiling.

Using this approach, we first need to deal with the large and asymmetric dimensionality of the data: tract profile data usually has many more features (i.e., number of measurements per individual) than samples (number of subjects), which makes inferences from the data about phenotypical differences between individuals ill-posed. This regime is the target of several statistical learning techniques, and is often solved by various forms of regularization.

The Lasso algorithm minimizes the sum of the absolute values of contributions of each feature [21]. This tends to shrink to zero the contributions of many of the features, providing results that are both accurate and interpretable. When additional structure is available in the organization of the data, regularization algorithms can take advantage of this structure. For example, if the features lend themselves to a natural division into different groups, the Group Lasso (GL) can be used to select groups of features, rather than individual features [22]. The Sparse Group Lasso (SGL) elaborates on this idea by providing control both of group sparsity, as well as overall sparsity of the solutions [23]. Because the features measured with tractometry lend themselves to grouping based on the tracts from which each measurement is taken, GL and SGL provide useful tools for linear model fitting in problems of this form. Here we develop an implementation of SGL that is well suited to the analysis of tract profile data. In addition, we demonstrate the power and flexibility of this approach by applying it to both classification and continuous prediction problems. Our data flow is represented in Fig 1 and explained in further detail in the Methods section.
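To make the difference between the lasso and group lasso terms concrete, the following is a minimal numeric sketch. It assumes the SGL penalty takes the commonly used form λ[α‖β‖₁ + (1 − α) Σ_g √p_g ‖β_g‖₂], where p_g is the size of group g; this form, the function name `sgl_penalty`, and the toy coefficients are illustrative assumptions and are not taken from this section.

```python
import numpy as np


def sgl_penalty(beta, groups, alpha, lam=1.0):
    """Sparse group lasso penalty (assumed form, see the lead-in).

    beta   : 1-D array of coefficients
    groups : list of index arrays, one per group of features
    alpha  : mixing weight; 1 -> pure lasso, 0 -> pure group lasso
    lam    : overall regularization strength
    """
    beta = np.asarray(beta, dtype=float)
    # Lasso term: sum of absolute values over all coefficients.
    lasso_term = np.abs(beta).sum()
    # Group lasso term: size-weighted Euclidean norm of each group.
    group_term = sum(
        np.sqrt(len(idx)) * np.linalg.norm(beta[idx]) for idx in groups
    )
    return lam * (alpha * lasso_term + (1.0 - alpha) * group_term)


# Two groups of three features each; only the first group is active.
groups = [np.arange(0, 3), np.arange(3, 6)]
beta = np.array([3.0, 0.0, 4.0, 0.0, 0.0, 0.0])

pure_lasso = sgl_penalty(beta, groups, alpha=1.0)  # |3| + |4|
pure_group = sgl_penalty(beta, groups, alpha=0.0)  # sqrt(3) * ||(3, 0, 4)||
```

The lasso term charges each nonzero coefficient individually, whereas the group term charges a group once it has any nonzero coefficient, which is why the latter favors solutions confined to few groups.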

Fig 1. Tractometry data flow.

Fig 1

(a) Whole brain tractography generates streamlines approximating the trajectories of white matter connections. (b) Tractometry classifies these streamlines into anatomical bundles. In this case, we show the left corticospinal tract (CSTL) and the left arcuate fasciculus (ARCL) over a mid-sagittal anatomical slice. Tract profiling further extracts bundle profiles, quantifications of various diffusion metrics along the length of the fiber bundle. Here, we show one subject’s fractional anisotropy (FA) profile for (c) the CSTL and (d) the ARCL. (e) The phenotypical target data and tract profile features can be organized into a linear model, ŷ = Xβ̂. The feature matrix X is color-coded to reveal a natural group structure: the left (orange) group contains k features from the CSTL, the middle (green) group contains k features from the left cingulum cingulate (CGCL), and the right (blue) group contains k features from the ARCL. The coefficients in β̂ follow the same natural grouping. Panels (a) and (b) are adapted from https://figshare.com/articles/figure/example_tractography-segmentation/14485350, and reproduced under the CC-BY license (https://creativecommons.org/licenses/by/4.0/).
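The organization described in panel (e) can be sketched with synthetic data as follows. The number of nodes per bundle (k = 100), the number of subjects, the random FA-like values, and the variable names are assumptions chosen for illustration; they do not reproduce the actual pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)

n_subjects = 5
n_nodes = 100  # k samples along each bundle (assumed value)
bundles = ["CSTL", "CGCL", "ARCL"]

# One (n_subjects, n_nodes) block of synthetic FA-like values per bundle.
profiles = {
    name: rng.uniform(0.2, 0.8, size=(n_subjects, n_nodes)) for name in bundles
}

# Concatenate the blocks column-wise: X = [CSTL | CGCL | ARCL].
X = np.hstack([profiles[name] for name in bundles])

# Column indices belonging to each bundle; this is the group structure.
groups = [np.arange(i * n_nodes, (i + 1) * n_nodes) for i in range(len(bundles))]

# Coefficients share the same grouping; here only the CSTL carries weight.
beta_hat = np.zeros(X.shape[1])
beta_hat[groups[0]] = 0.01

# Linear model prediction, one value per subject.
y_hat = X @ beta_hat
```

Keeping an explicit list of column indices per bundle is one simple way to pass the anatomical grouping to a group-aware estimator.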

One more approach to multivariate analysis of neuroimaging data is to perform inference or prediction in a transformed feature space rather than the original diffusion metric feature space. For example, in Lasso PCR, diffusion features can be projected onto a principal components (PC) basis for use in a sparsity constrained model [24, 25]. While these approaches achieve high predictive performance and may even yield biologically interpretable principal components [12], they neglect anatomical grouping information. We view these approaches as complementary to the SGL-based approach: PCR based approaches seek to model brain-phenotype relationships using the most parsimonious representation of variance in the diffusion measures. In contrast, our SGL-based approach seeks to establish whether prior knowledge of anatomical grouping improves modeling of brain-phenotype relationships. Motivated by this distinction, we also introduce a union of the two approaches called PCR-SGL, in which each group of features is independently transformed into its PC basis, thereby retaining the anatomical grouping information of the original feature space. We demonstrate circumstances in which this approach both helps and hinders predictive performance.
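The group-wise projection behind PCR-SGL can be illustrated with a short sketch. It assumes a plain SVD-based PCA applied to each group's centered columns; the helper name `groupwise_pca` and the random data are hypothetical, and the actual implementation may differ (for example, in scaling or in the number of components retained).

```python
import numpy as np


def groupwise_pca(X, groups):
    """Project each group of columns onto its own principal components.

    Returns the transformed matrix and the new column indices per group,
    so that the group structure survives the transformation.
    """
    blocks, new_groups, start = [], [], 0
    for idx in groups:
        block = X[:, idx] - X[:, idx].mean(axis=0)  # center within the group
        # Rows of vt are the principal axes of this group only.
        _, _, vt = np.linalg.svd(block, full_matrices=False)
        scores = block @ vt.T
        blocks.append(scores)
        new_groups.append(np.arange(start, start + scores.shape[1]))
        start += scores.shape[1]
    return np.hstack(blocks), new_groups


rng = np.random.default_rng(0)
X = rng.normal(size=(20, 12))
groups = [np.arange(0, 6), np.arange(6, 12)]

Z, z_groups = groupwise_pca(X, groups)

# Within a group the PC scores are mutually orthogonal.
within = Z[:, z_groups[0]].T @ Z[:, z_groups[0]]
```

Because each basis is computed from one group in isolation, no component can summarize variance shared across groups, which is the trade-off discussed in the text.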

Results

We developed a method for analyzing dMRI tract profile data that uses the Sparse Group Lasso (SGL) to select features that are sparse both at the group (bundle) level, as well as overall. We demonstrate the use of this method on four different datasets in both a classification setting and a regression setting.

SGL accurately detects ALS from tractometry data

Using data from a previous study of patients with amyotrophic lateral sclerosis (ALS) [26], we tested the performance of SGL in a classification setting. The previous study predicted ALS status with a mean accuracy of 80% using a random forest algorithm based on a priori selection of features only within the CST bundle-of-interest. SGL delivers improved predictive performance, with a cross-validated accuracy of 83% and an area under the receiver operating characteristic curve (ROC AUC) of 0.88, without the need for a priori feature engineering. We also predicted ALS diagnosis using the PCR-SGL, wherein each group of features is independently transformed to its PC basis and an SGL model is fit to the transformed features. This model achieves 88% accuracy and an ROC AUC of 0.9 (Fig 2a). In addition to this classification performance, both SGL and PCR-SGL also identify the white matter tracts most important for ALS classification. Fig 2b shows that PCR-SGL identified as potential disease biomarkers the diffusion measures in the CST from the cerebral peduncle to the corona radiata, agreeing with the previous study from which these data were extracted [26]. The relative importance of white matter features is captured in the β coefficients from Eq (3). Fig 2c depicts these coefficients along the right CST, plotted over the FA values for the control and ALS subject groups (see S1 and S2 Figs for tract profiles of all 18 tracts in these groups). We find that SGL and PCR-SGL select FA metrics in the corticospinal tract and particularly in the right corticospinal tract as most important to ALS classification, confirming previous findings [27–36] and identifying the portions of the brain that were selected a priori in the previous study from which we obtained the data [26].

Fig 2. PCR-SGL accurately and interpretably predicts ALS diagnosis.

Fig 2

(a) Classification probabilities for ALS diagnosis, with controls on the left, patients on the right, predicted controls in blue, and predicted patients in orange. That is, orange dots on the left represent false positives, while blue dots on the right represent false negatives. We achieve 83% accuracy with an ROC AUC of 0.88. (b) PCR-SGL coefficients are presented on the core fibers of major fiber bundles. They exhibit high group sparsity and are concentrated in the FA of the corticospinal tract (CST). The brain is oriented with the right hemisphere in the foreground and anterior to the right of the page. The CSTL, CSTR, callosum forceps anterior (CFA), left arcuate (ARCL), and right arcuate (ARCR) bundles are indicated for orientation. (c) PCR-SGL identifies three portions of the CST as important, where β̂ (dashed line, right axis) has large values. These are centered around nodes 30, 65, and 90, corresponding to locations of substantial differences in FA between the ALS and control groups (shaded areas indicate standard error of the mean). (d) Bundle profiles for false positive classifications. Line colors correspond to the marker edge color in the top left plot. These individuals have reduced FA in the CST portions which SGL identified as important. Their misclassification is coherent with the feature importance and the group differences in FA. (e) Individual bundle profiles for false negative classifications. These individuals have bundle profiles which oscillate between the group means.

To assess the added value of incorporating anatomical knowledge into our model, we compared our classification results to those obtained from four other models: a pure lasso model, an elastic net model, a bundle-mean lasso (i.e. a lasso model trained only on the mean metric values from each bundle), and a principal components regression lasso (Lasso PCR) model [24, 25]. These models achieved accuracies of 76%, 76%, 79.5%, and 71.5% respectively and ROC AUC values of 0.76, 0.75, 0.82, and 0.71, respectively using the same cross-validation strategy used to assess the SGL model. This difference in performance justifies the additional complexity of the SGL over simpler sparsity-inducing regression strategies. The relatively poor performance of Lasso PCR suggests that features with small variance are nonetheless relevant for predicting ALS diagnosis. Or equivalently, that group differences between ALS diagnoses are not the dominant source of variance in the diffusion metrics.
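The bundle-mean baseline discards along-tract information by collapsing each bundle to a single number per subject. A minimal sketch of that feature reduction is shown below; the helper name `bundle_means` and the toy matrix are illustrative assumptions, not the code used in the study.

```python
import numpy as np


def bundle_means(X, groups):
    """Collapse each group of columns to its per-subject mean."""
    return np.column_stack([X[:, idx].mean(axis=1) for idx in groups])


# Two subjects, two bundles with two nodes each (toy values).
X = np.array([[1.0, 3.0, 10.0, 20.0],
              [2.0, 4.0, 30.0, 50.0]])
groups = [np.array([0, 1]), np.array([2, 3])]

# One column per bundle instead of one column per node.
X_mean = bundle_means(X, groups)
```

A lasso fit to `X_mean` can still select bundles, but it cannot localize an effect to a portion of a bundle.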

The β coefficients exhibit high bundle level sparsity; only some bundles are important, which can be confirmed by observing the value of α, the regularization hyperparameter that controls the mixture of the Group Lasso and lasso penalties, selected through nested cross-validation (see Eq (3)). If α is closer to zero, it indicates that the phenotype in question preferentially correlates with only a few groups of covariates. For the ALS dataset, the SGL model has α = 0.21 and the PCR-SGL model has α = 0.4, confirming that the white matter correlates of ALS reside mostly in one bundle, namely the CST.
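One way to see how α shifts the solution between group sparsity and element-wise sparsity is through the proximal operator of the penalty. The sketch below assumes the penalty form λ[α‖β‖₁ + (1 − α) Σ_g √p_g ‖β_g‖₂], for which the proximal step is element-wise soft-thresholding followed by group-wise shrinkage. The function name `sgl_prox`, the toy coefficients, and the choice of λ are assumptions for illustration, and this is not a description of the solver used in the paper.

```python
import numpy as np


def sgl_prox(beta, groups, alpha, lam):
    """Proximal operator of the (assumed) sparse group lasso penalty."""
    beta = np.asarray(beta, dtype=float)
    # Lasso part: shrink every coefficient toward zero by alpha * lam.
    out = np.sign(beta) * np.maximum(np.abs(beta) - alpha * lam, 0.0)
    # Group lasso part: shrink each group's norm, zeroing weak groups.
    for idx in groups:
        norm = np.linalg.norm(out[idx])
        thresh = (1.0 - alpha) * lam * np.sqrt(len(idx))
        scale = max(0.0, 1.0 - thresh / norm) if norm > 0 else 0.0
        out[idx] = scale * out[idx]
    return out


groups = [np.arange(0, 3), np.arange(3, 6)]
beta = np.array([2.0, 0.3, 0.2, 0.65, 0.1, 0.1])

# alpha near 0: the weaker group is removed as a whole,
# while every coefficient in the stronger group survives.
group_sparse = sgl_prox(beta, groups, alpha=0.2, lam=0.5)

# alpha near 1: both groups survive, but only their largest entries do.
lasso_like = sgl_prox(beta, groups, alpha=0.8, lam=0.5)
```

In this toy case a small α keeps one dense group and drops the other, while a large α keeps a single coefficient in each group, mirroring the interpretation of α given above.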

Analyzing the ways in which the model mislabels individuals also provides insight. We found that mislabelled subjects are outliers relative to their group with respect to diffusion features of the CST (Fig 2d and 2e). The false positive classifications have reduced FA in one or more of the three sections of the CST where β̂ is large in Fig 2c. The false negative subjects have FA profiles that oscillate between the two group means. Thus, when the SGL method predicts incorrectly, it does so in a comprehensible manner.

SGL accurately predicts age from tractometry data

To test the performance of SGL in a continuous regression task, we focus here on the prediction of biological age in three datasets named WH, HBN, and Cam-CAN (see the Methods section for a description of each dataset). Prediction of “brain age” is a commonly undertaken task in neuroimaging machine learning, in part because these predictions, and deviations therefrom, may be diagnostic of overall brain health (for a review, see Cole et al. [37]). However, as Nelson et al. [38] have observed, aging biomarkers are subject to unique challenges and tend to be noncausative. Our interest in aging here relies on its utility as a methodological benchmark. Biological age operates on a natural scale, with meaningful and easily understood units, and its popularity as a machine learning target makes it valuable for comparisons with other studies.

The WH [39], HBN [40], and Cam-CAN [41, 42] datasets used here contain data from 76, 1651, and 640 subjects, respectively, ranging from 6–50, 5–21, and 18–88 years of age, respectively. In each case, biological age was used as the predicted variable y. SGL was fit to the tract profile features FA and mean diffusivity (MD) in 18 major brain tracts, with diffusion metrics extracted from diffusion tensor imaging (DTI) for the WH dataset and diffusion kurtosis imaging (DKI) [43] for the HBN and Cam-CAN datasets (see S3–S8 Figs for the full tract profile information in all tracts and datasets).

To evaluate the fit of the model, we used a nested cross-validation procedure. In this procedure, batches of subjects are held out. For each batch (or fold), the model is fully fit without this data. Then, once the parameters are fixed, the model is applied to predict the ages of held out subjects based on the linear coefficients. This scheme automatically finds the right level of sparsity and fits the coefficients to the ill-posed linear model, while guarding against overfitting. SGL accurately predicts the age of the subjects in this procedure, with a median absolute error of 2.67, 1.45, and 6.02 years for the WH, HBN, and Cam-CAN datasets, respectively and coefficients of determination R² = 0.52, 0.57, and 0.77, respectively (see Fig 3, top panels). The predictions for Cam-CAN are competitive with a recent state of the art prediction [44], which used streamline density to estimate the brain’s structural connectivity and achieved R² = 0.63. The median absolute errors are also lower than the results of a recent study that predicted age in a large sample that included the Cam-CAN data, and was based on diffusion MRI features [45].
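The nested cross-validation scheme can be sketched with scikit-learn as follows. This is an illustration under stated assumptions: a plain `Lasso` stands in for the SGL estimator, the data are synthetic, and the fold counts and hyperparameter grid are arbitrary. Note that the `alpha` argument of scikit-learn's `Lasso` is the regularization strength, not the SGL mixing parameter α discussed in the text.

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.metrics import median_absolute_error, r2_score
from sklearn.model_selection import GridSearchCV, KFold, cross_val_predict

# Synthetic "more features than subjects" regression problem.
rng = np.random.default_rng(0)
n_subjects, n_features = 60, 200
X = rng.normal(size=(n_subjects, n_features))
true_beta = np.zeros(n_features)
true_beta[:5] = 3.0
y = 30.0 + X @ true_beta + rng.normal(scale=1.0, size=n_subjects)

# Inner loop: choose the regularization strength on training folds only.
inner_cv = KFold(n_splits=3, shuffle=True, random_state=0)
search = GridSearchCV(
    Lasso(max_iter=10_000),  # stand-in for an SGL estimator
    param_grid={"alpha": np.logspace(-2, 1, 8)},
    cv=inner_cv,
    scoring="neg_median_absolute_error",
)

# Outer loop: each subject is predicted by a model that never saw it,
# including during hyperparameter selection.
outer_cv = KFold(n_splits=5, shuffle=True, random_state=1)
y_pred = cross_val_predict(search, X, y, cv=outer_cv)

mae = median_absolute_error(y, y_pred)
r2 = r2_score(y, y_pred)
```

Wrapping the hyperparameter search inside the outer loop is what keeps the reported error from being optimistically biased by the tuning step.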

Fig 3. Predicting age with tractometry and SGL.

Fig 3

(top) The predicted age vs. true age of each individual from the test splits (i.e., when each subject’s data was held out in fitting the model) for the (a) WH, (b) HBN, and (c) Cam-CAN datasets; an accurate prediction falls close to the y = x line (dashed). The mean absolute error (MAE) and coefficient of determination R² are presented in the lower right of each scatter plot. (middle) Feature importance for predicting age from tract profile in the (d) WH, (e) HBN, and (f) Cam-CAN datasets. The orientation of the brain is the same as in Fig 2b, however because the coefficients exhibit high global sparsity (as opposed to group sparsity), we plot the mean of the absolute value of β̂ for each bundle on the core fiber. The global distribution of the β̂ coefficients reflects the fact that aging is not confined to a single white matter bundle. (bottom) Age quintile bundle profiles for the (g) WH, (h) HBN, and (i) Cam-CAN datasets.
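The per-bundle summary used in the middle panels, the mean absolute coefficient within each bundle, can be computed as in the sketch below. The helper name, bundle labels, group sizes, and coefficient values are hypothetical and serve only to show the calculation.

```python
import numpy as np


def mean_abs_coef_by_bundle(beta, groups, names):
    """Return {bundle name: mean |beta| over that bundle's features}."""
    beta = np.asarray(beta, dtype=float)
    return {
        name: float(np.abs(beta[idx]).mean())
        for name, idx in zip(names, groups)
    }


names = ["CSTL", "ARCL"]
groups = [np.arange(0, 4), np.arange(4, 8)]
beta_hat = np.array([0.5, -0.5, 0.0, 1.0, 0.0, 0.0, -0.2, 0.2])

# One importance score per bundle, insensitive to coefficient sign.
importance = mean_abs_coef_by_bundle(beta_hat, groups, names)
```

Taking absolute values before averaging prevents positive and negative coefficients within a bundle from cancelling each other in the summary.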

In contrast to the ALS classification case, the selected α values indicate high global sparsity over group sparsity, with α = 0.83, 0.67, and 0.68, for the WH, HBN, and Cam-CAN datasets, respectively. The model weights are distributed over many different tracts and dMRI tissue properties (see Fig 3d–3f and S3–S8 Figs). This demonstrates that SGL is not coerced to produce overly sparse results when a more accurate model requires a dense selection of features. Furthermore, inspecting the portions of bundles with larger coefficients in Fig 3g–3i reveals that SGL selects informative regions where diffusion properties are different between the age quintiles.

As with ALS classification, we also compared SGL performance with results obtained using the pure lasso, elastic net, bundle-mean lasso, and Lasso PCR. The pure lasso models achieved R² = 0.47, 0.54, and 0.70 for the WH, HBN, and Cam-CAN datasets respectively using the same cross-validation strategy used to assess the SGL. The elastic net models achieved R² = 0.46, 0.59, and 0.73 for the WH, HBN, and Cam-CAN datasets, respectively. The bundle-mean lasso models achieved R² = 0.33, 0.22, and 0.55, respectively. The Lasso PCR models achieved R² = 0.54, 0.57, and 0.74, respectively. The SGL models’ performance improvement over lasso is more modest than in the ALS classification case, which accords with the selected values of the α hyperparameter. These SGL models were more lasso-like in their sparsity penalties so they are more lasso-like in their predictive performance.

Model performance across all four datasets and all six model types is summarized in Fig 4. In contrast to the ALS classification case, the PCR Lasso performs competitively in age regression, suggesting that aging is a significant source of global variance in the tract profiles. On the other hand, the PCR-SGL models destroy cross-bundle covariance information in their initial group-wise PC projection step (see Eq (4)), limiting performance for globally distributed phenomena. Conversely, the SGL models are able to adapt to this regime.

Fig 4. Model performance across all datasets.

Fig 4

Each panel shows model performance measured on the test set for each cross-validation split, with each black dot representing a split, box plots representing the quartiles, and white diamonds representing the mean performance. The y-scale varies in each subplot. (a) Accuracy of test set predictions for the ALS dataset. Because group differences in ALS diagnosis are mostly confined to a single bundle, the group structure-preserving methods, SGL and PCR-SGL, outperform the other models. The remaining frames show coefficient of determination, R², in test sets for the (b) WH, (c) HBN, and (d) Cam-CAN datasets. Because aging affects the white matter globally, group structure-blind methods like elastic net and PCR Lasso perform well. Nonetheless, the SGL models show competitive predictive performance, adapting to a problem where group structure is not as informative. PCR-SGL performs poorly in this regime because its initial group-wise PC projection destroys between-bundle covariance. The bundle-mean lasso performs poorly, demonstrating the value of along-tract profiling.

Discussion

We present here a novel method for analysis of dMRI tract profile data that relies on the Sparse Group Lasso [46] to make accurate predictions of phenotypic properties of individual subjects while, simultaneously, identifying the features of the white matter that are most important for this prediction. The method is data-driven and it is broadly applicable to a wide range of research questions: it performs well in predicting both continuous variables, such as biological age, as well as categorical variables, such as whether a person is a patient or a healthy control. In both of these cases, SGL out-performs previous algorithms that have been developed for these tasks [26, 44, 45]. The nested cross-validation approach used to fit the model and make both predictions and inferences from the model guards against overfitting and tunes the degree of sparseness required by the algorithm. This means that SGL can accurately describe phenomena that are locally confined to a particular anatomical location or diffusion property (e.g., FA in the CST) as well as phenomena that are widely distributed amongst brain regions and measured diffusion properties.

Specifically, we demonstrated that the algorithm correctly detects the fact that ALS, which is a disease of motor neurons, is localized to the cortico-spinal tract. This recapitulates the results of previous analysis of these same data, using a bundle-of-interest approach [26]. As with the original study, it is unclear whether our strong predictive performance is based on image properties that would be visible in patient scans or whether it is identifying a new subclinical disease biomarker. In contrast, for the analysis of biological age, the coefficients identified by the algorithm are very widely distributed across many parts of the white matter, mirroring previous results that show a large and continuous distribution of life-span changes in white matter properties [39]. We present age quintile bundle profiles and SGL β coefficients for all four datasets in S3–S8 Figs. For age regression, substantial differences between the datasets preclude generalization across all three: (i) different acquisition parameters, which can be challenging to harmonize [47], (ii) different diffusion models, with DTI for the WH dataset and DKI for the HBN and Cam-CAN datasets, (iii) different age ranges and distributions (which is evident in the figure legends for Fig 3), with HBN being a developmental dataset, while WH and Cam-CAN are lifespan maturation datasets, and (iv) different anatomical extents, with the WH streamlines truncated to remain within the bundle’s bounding regions of interest (the default behavior in the legacy mAFQ) and the HBN and Cam-CAN streamlines allowed to retain their full extent from whole-brain tractography (the default behavior in pyAFQ). In a biological characteristic that has widespread effects in the brain, validity is difficult to assess [48], so it is both unsurprising and vexing that we can do so well at predicting within each dataset, and yet do poorly in interpreting informative features across instruments.

We also implemented a union of the PCR Lasso and SGL approaches by first projecting each group onto its PC basis. This PCR-SGL approach transformed the feature space into a more parsimonious representation of its variance while also preserving group structure. But it was unable to efficiently represent cross-bundle covariance, which is an advantage of PCR Lasso. As a result, PCR-SGL performed well in ALS classification, where the white matter interactions are highly localized, and poorly in age regression, where the white matter interactions are global.

One drawback of our approach is also evident in the age regression results. There are portions of the bundle profiles in Fig 3g–3i for which the β̂ coefficients are zero but for which the bundle profiles clearly contain an age differentiating signal. While SGL correctly identifies certain features as important, it is not guaranteed to identify all of the important features. It will identify a parsimonious set of important features, dense enough to predict the phenotype and sparse enough (at either the group or global level) to satisfy the sparsity constraints.

Another limitation of our approach is in its computational execution time, which, in our experience, is roughly five times slower than simpler models like the lasso. This slow-down is likely to be insignificant for “production” cycles in which researchers report their findings, but may be prohibitive for “development” cycles in which researchers repeatedly train models while adjusting their analysis pipeline. In practice, we circumvent this limitation by developing our analysis infrastructure with fast, simple models, and substituting slower, more performant models afterwards. The consistent scikit-learn-based application programming interface of AFQ-Insight provides a good basis for this pattern.
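The develop-fast, then-substitute pattern described above relies on estimators sharing the scikit-learn interface. The sketch below illustrates the idea with estimators that ship with scikit-learn: `Lasso` plays the role of the fast development model and `ElasticNet` is only a placeholder for a slower, more performant model. The actual AFQ-Insight estimators are not shown here, and the synthetic data and parameter values are assumptions.

```python
import numpy as np
from sklearn.linear_model import ElasticNet, Lasso
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(40, 30))
y = X[:, 0] * 2.0 + rng.normal(scale=0.1, size=40)

# Development cycle: build and debug the pipeline with a fast model.
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("model", Lasso(alpha=0.1)),
])
pipe.fit(X, y)
fast_pred = pipe.predict(X)

# Production cycle: swap only the final step; the rest is unchanged.
# ElasticNet is a placeholder for a slower group-aware estimator.
pipe.set_params(model=ElasticNet(alpha=0.1, l1_ratio=0.5))
pipe.fit(X, y)
final_pred = pipe.predict(X)
```

Because every step exposes `fit` and `predict`, the preprocessing, cross-validation, and plotting code written during development does not need to change when the final estimator is replaced.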

Taken together, our results demonstrate the promise of the group-regularized regression approach. Even at the scale of dozens of subjects, the results provided by SGL are both accurate and interpretable [15]: tract profiling capitalizes on domain knowledge to engineer meaningful features; SGL scores these features based on their relative importance, enables a visualization of these feature importance scores in the anatomical coordinate frame of the bundles (e.g., Figs 2b and 3d–3f), and provides a means to understand model errors (e.g., Fig 2d and 2e). Thus, this multivariate analysis approach achieves high cross-validated accuracy for precision medicine applications of dMRI data and identifies relevant features of brain anatomy that can further our neuroscientific understanding of clinical disorders.

Neuroscience has entered an era in which consortium efforts are amassing large datasets of high-quality dMRI measurements to address a variety of scientific questions [40, 49–52], but the volume and complexity of these data pose substantial challenges. Tract profiling followed by analysis with SGL provides a promising approach to distill meaningful insights from the wealth of data measured in these efforts.

SGL has many other potential applications in neuroscience, because of the hierarchical and grouped nature of many data types that are collected in multiple sample points within anatomically-defined areas. For example, this method may be useful to understand the relationship between fMRI recordings and behavior, where activity in each voxel may co-vary with voxels within the same anatomical region and form features and groups of features. Similarly, large-scale multi-electrode recordings of neural activity in awake behaving animals are becoming increasingly feasible [53, 54] and these recordings can form features (neurons) and groups (neurons within an anatomical region).

The results we present here also motivate extensions of the method using more sophisticated cost functions. For example, the fused sparse group lasso (FSGL) [55] extends SGL to enforce additional spatial structure: smoothness in the variation of diffusion metrics along the bundles. As brain measurements include additional structure (for example, bilateral symmetry), future work could also incorporate overlapping group membership for each entry in the tract profiles [56]. For example, a measurement could come from the corpus callosum, but also from the right hemisphere. This would also require extending the cost function used here to incorporate these constraints. Similarly, unsupervised dimensionality reduction of tractometry data (e.g., [12]) could also benefit from constraints based on grouping, as our implementation of PCR-SGL suggests.

The method is packaged as open-source software called AFQ-Insight that is openly available at https://github.com/richford/AFQ-Insight, and provides a clear API to allow for extensions of the method. The software integrates within a broader automated fiber quantification software ecosystem: AFQ [5] and pyAFQ [57], which extract tract profile data from raw and processed dMRI datasets, as well as AFQ-Browser, which visualizes tract profile data and facilitates sharing of the results of dMRI studies [58]. To facilitate reproducibility and ease use of the software, the results presented in this paper are also provided in https://github.com/richford/afq-insight-paper as a series of Jupyter notebooks [59].

Materials and methods

Data

Four different datasets were used here:

  1. Diffusion MRI from a previous study of the corticospinal tract (CST) in patients with amyotrophic lateral sclerosis (ALS [26]), containing data from 24 ALS patients and 24 demographically matched healthy controls. These data were measured in a GE Discovery 750 3T MRI scanner at the Institute of Bioimaging and Molecular Physiology in Catanzaro. Informed consent was provided as approved by the Ethical Committee of the University “Magna Graecia” of Catanzaro. Voxel resolution was 2 × 2 × 2 mm³ and 27 non-colinear directions were measured with b = 1000 s/mm². Data was preprocessed to correct for subject motion and for eddy currents. The diffusion tensor model [60] was fit in every voxel. We will refer to this dataset as ALS.

  2. Diffusion MRI data from a previous study of properties of the white matter across the lifespan [39], containing dMRI data from 76 subjects with ages 6–50. These data were measured in a GE Discovery 750 3T MRI scanner at the Stanford Center for Cognitive and Neurobiological Imaging. The Stanford University IRB approved the procedures of this study. Informed consent was obtained from each adult participant, and assent for participation was provided by parents/guardians for children. Voxel resolution was 2 × 2 × 2 mm³ with 96 non-colinear directions measured with b = 2000 s/mm² and 30 non-colinear directions measured with b = 1000 s/mm². These data were acquired using a twice refocused spin echo sequence, in which there is sufficient time for eddy currents to subside between the application of the gradients and the image acquisition, so no eddy current correction was applied. Motion correction was applied before fitting the diffusion tensor model [60] in every voxel using a robust fit [61] on the b = 1000 s/mm² shell only. We will refer to this dataset as WH.

  3. Diffusion MRI data from the Healthy Brain Network pediatric mental health study [40], containing dMRI data from 1651 subjects with ages 5–21. These data were measured in 3T Siemens MRI scanners at two sites in the New York area: Rutgers University Brain Imaging Center and the CitiGroup Cornell Brain Imaging Center. Age distributions of participants in the different sites are in S9 Fig. Informed consent was obtained from each participant aged 18 or older. For participants younger than 18, written consent was obtained from their legal guardians and written assent was obtained from the participant. Voxel resolution was 1.8 × 1.8 × 1.8 mm³ with 64 non-colinear directions measured for each of b = 1000 s/mm² and b = 2000 s/mm². Preprocessing was performed using QSIPrep 0.12.1, which is based on Nipype 1.5.1 [62, 63], RRID:SCR_002502.
    • Anatomical data preprocessing
      The T1-weighted (T1w) image was corrected for intensity non-uniformity (INU) using N4BiasFieldCorrection [64, ANTs 2.3.1], and used as T1w-reference throughout the workflow. The T1w-reference was then skull-stripped using antsBrainExtraction.sh (ANTs 2.3.1), using OASIS as target template. Spatial normalization to the ICBM 152 Nonlinear Asymmetrical template version 2009c [65], RRID:SCR_008796 was performed through nonlinear registration with antsRegistration [66], ANTs 2.3.1, RRID:SCR_004757, using brain-extracted versions of both T1w volume and template. Brain tissue segmentation of cerebrospinal fluid (CSF), white-matter (WM) and gray-matter (GM) was performed on the brain-extracted T1w using FAST [67], FSL 6.0.3:b862cdd5, RRID:SCR_002823.
    • Diffusion data preprocessing
      Any images with a b-value less than 100 s/mm² were treated as a b = 0 image. MP-PCA denoising as implemented in MRtrix3’s dwidenoise [68] was applied with a 5-voxel window. After MP-PCA, B1 field inhomogeneity was corrected using dwibiascorrect from MRtrix3 with the N4 algorithm [64]. After B1 bias correction, the mean intensity of the DWI series was adjusted so that the mean intensity of the b = 0 images matched across each separate DWI scanning sequence.
      FSL (version 6.0.3:b862cdd5)’s eddy was used for head motion correction and Eddy current correction [69]. Eddy was configured with a q-space smoothing factor of 10, a total of 5 iterations, and 1000 voxels used to estimate hyperparameters. A linear first level model and a linear second level model were used to characterize Eddy current-related spatial distortion. q-space coordinates were forcefully assigned to shells. Field offset was attempted to be separated from subject movement. Shells were aligned post-eddy. Eddy’s outlier replacement was run [70]. Data were grouped by slice, only including values from slices determined to contain at least 250 intracerebral voxels. Groups deviating by more than 4 standard deviations from the prediction had their data replaced with imputed values. Data was collected with reversed phase-encode blips, resulting in pairs of images with distortions going in opposite directions. Here, b = 0 reference images with reversed phase encoding directions were used along with an equal number of b = 0 images extracted from the DWI scans. From these pairs the susceptibility-induced off-resonance field was estimated using a method similar to that described in [71]. The fieldmaps were ultimately incorporated into the Eddy current and head motion correction interpolation. Final interpolation was performed using the jac method.
      Several confounding time-series were calculated based on the preprocessed DWI: framewise displacement (FD) using the implementation in Nipype following the definitions by [72]. The DWI time-series were resampled to ACPC, generating a preprocessed DWI run in ACPC space.
    • MRtrix3 Reconstruction
      Reconstruction was performed using QSIprep 0.12.1. Multi-tissue fiber response functions were estimated using the dhollander algorithm. FODs were estimated via constrained spherical deconvolution (CSD, [73, 74]) using an unsupervised multi-tissue method [75, 76]. Reconstruction was done using MRtrix3 [77]. FODs were intensity-normalized using mtnormalize [78].

    Many internal operations of QSIPrep use Nilearn 0.6.2 [79], RRID:SCR_001362 and DIPY [80]. For more details of the pipeline, see the section corresponding to workflows in QSIPrep’s documentation. We will refer to this dataset as HBN.

  4. Diffusion MRI data from the Cambridge Centre for Ageing and Neuroscience (Cam-CAN) “CC700” dataset [41, 42], containing data from 640 subjects with ages 18–88. These data were measured on a 3T Siemens TIM Trio system and written informed consent was obtained from each participant. Voxel resolution was 2 × 2 × 2 mm³ with 30 non-colinear directions measured for each of b = 1000 s/mm² and b = 2000 s/mm². The diffusion weighted images were acquired with a twice refocused spin-echo sequence and the same preprocessing and reconstruction pipelines used for the HBN dataset were applied to these data. We will refer to this dataset as Cam-CAN.

Data from the ALS and WH studies was processed in a similar manner, using the Matlab Automated Fiber Quantification toolbox (mAFQ, version 1.1 for WH and version 1.2 for ALS) [5]: streamlines representing fascicles of white matter tracts were generated using a deterministic tractography algorithm that follows the principal diffusion direction of the diffusion tensor in each voxel (STT) [81]. Eighteen major tracts, which are enumerated in S1 Fig, were identified using multiple criteria: streamlines are selected as candidates for inclusion in a bundle of streamlines that represents a tract if they pass through known inclusion ROIs and do not pass through exclusion ROIs [82]. In addition, a probabilistic atlas is used to exclude streamlines which are unlikely to be part of a tract [83]. Each streamline is resampled to 100 nodes and the robust mean at each location is calculated by estimating the 3D covariance of the location of each node and excluding streamlines that are more than 5 standard deviations from the mean location in any node. Finally, a bundle profile of tissue properties in each bundle was created by interpolating the value of MRI maps of these tissue properties to the location of the nodes of the resampled streamlines designated to each bundle. In each of 100 nodes, the values are summed across streamlines, weighting the contribution of each streamline by the inverse of the Mahalanobis distance of the node from the average of that node across streamlines. This means that streamlines that are more representative of the tract contribute more to the bundle profile, relative to streamlines that are on the edge of the tract.
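The inverse-distance weighting described above can be sketched in a few lines of NumPy. This is an illustration only, not the mAFQ code: the function name, the array layout, and the normalization of the weights so that they sum to one at each node are all assumptions made for the example.

```python
import numpy as np

def weighted_bundle_profile(coords, values):
    """Weighted profile along one bundle (hypothetical helper).

    coords: (n_streamlines, n_nodes, 3) node positions
    values: (n_streamlines, n_nodes) tissue property sampled at each node
    """
    n_streamlines, n_nodes, _ = coords.shape
    profile = np.empty(n_nodes)
    for j in range(n_nodes):
        points = coords[:, j, :]
        diff = points - points.mean(axis=0)
        # Inverse of the 3D covariance of this node's location across streamlines
        cov_inv = np.linalg.pinv(np.cov(points, rowvar=False))
        # Mahalanobis distance of each streamline's node from the mean location
        sq_dist = np.einsum("ij,jk,ik->i", diff, cov_inv, diff)
        dist = np.sqrt(np.maximum(sq_dist, 0.0))
        # Streamlines closer to the core of the bundle receive larger weights
        weights = 1.0 / np.maximum(dist, 1e-12)
        weights /= weights.sum()  # assumption: weights normalized per node
        profile[j] = weights @ values[:, j]
    return profile

# Synthetic example: 50 streamlines, 100 nodes
rng = np.random.default_rng(0)
coords = rng.normal(size=(50, 100, 3))
values = rng.uniform(0.2, 0.8, size=(50, 100))
profile = weighted_bundle_profile(coords, values)
```

Because the normalized weights form a convex combination, each node of the profile stays within the range of the streamline values at that node.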

Data from the HBN and Cam-CAN studies were processed using the updated Python Automated Fiber Quantification toolbox (pyAFQ; [57]). In addition to demonstrating that our analysis pipeline is robust to changes in tractometry software, the use of the updated pyAFQ capitalized upon the following improvements over the legacy Matlab version: (i) the ability to ingest data provided in the BIDS format [84], and (ii) the calculation of diffusion kurtosis imaging (DKI [43]) metrics. We will refer to the mAFQ and pyAFQ pipelines collectively as AFQ.

These processes create bundle profiles, in which diffusion measures are quantified and averaged along eighteen major fiber tracts, which are enumerated in S1 Fig. See S3 Fig of [57] for a depiction of these white matter bundles. Here, we use only the mean diffusivity (MD) and the fractional anisotropy (FA) of the diffusion tensor, but additional dMRI-based maps or maps based on other quantitative MRI measurements can also be used. The resulting feature space was the same for all four datasets, with the FA and MD metrics at each of 100 nodes in eighteen bundles comprising 3600 features per subject. These bundle profiles, along with the phenotypical data we wish to explain or predict, form the input to the SGL algorithm. In a domain-agnostic machine learning context, the phenotypical data comprise the target variables while the bundle profiles form the feature or predictor variables (see Fig 1e).
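As a quick check of the feature count, the layout of the feature matrix can be written down explicitly. The column ordering used here (metric first, then bundle, then node) is an assumption for illustration; the text only fixes the totals.

```python
import numpy as np

n_bundles = 18          # major fiber tracts
n_nodes = 100           # nodes per bundle profile
metrics = ["FA", "MD"]  # diffusion metrics used in this work

n_features = n_bundles * n_nodes * len(metrics)

# One group per bundle/metric combination, each holding 100 consecutive columns
# (assumed ordering: all nodes of one bundle/metric pair are contiguous).
n_groups = n_bundles * len(metrics)
groups = [np.arange(g * n_nodes, (g + 1) * n_nodes) for g in range(n_groups)]
```

With these values, `n_features` is 3600 and there are 36 groups of 100 columns each.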

Data harmonization for HBN

For the multisite HBN study, we use the ComBat harmonization method to robustly adjust for site effects in the tract profiles. Initially designed to correct for site effects in gene expression studies [85], ComBat employs a parametric empirical Bayes approach to adjust for batch effects and has since been applied to multi-site cortical thickness measurements [86], multi-site DTI studies [87], and functional MRI data in the Adolescent Brain Cognitive Development Study (ABCD) [88]. We rely on the neurocombat_sklearn library [89] to apply ComBat in our scikit-learn analysis pipeline and present bundle profile site differences and ComBat correction in S10–S13 Figs.

Sparse group lasso

Before fitting a model to the data, imputation and standardization are performed. Missing node values (e.g., in cases where AFQ designates a node as not-a-number) are imputed via linear interpolation. Care is taken not to interpolate across the boundaries between different bundles. Some diffusion metrics will have naturally larger variance than others and may therefore dominate the objective function and make the SGL estimator unable to learn from the lower variance metrics. For example, fractional anisotropy (FA) is bounded between zero and one and could be overwhelmed by an unscaled higher-variance metric like the mean diffusivity (MD). To prevent this we remove each feature’s mean and scale it to unit variance (z-score) using the StandardScaler from scikit-learn [90]. The scaling parameters are learned only from the training data and then applied equally to the training and test data in order to prevent leakage of information between the testing and training sets [91].
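The two preprocessing steps can be sketched as follows. The helper `interpolate_within_bundles` is hypothetical and is not the AFQ-Insight implementation; it assumes that each bundle occupies a contiguous block of 100 columns, and it relies on `np.interp`, which holds the nearest valid value constant for missing nodes at the ends of a bundle.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

def interpolate_within_bundles(X, n_nodes=100):
    """Fill NaNs by linear interpolation along each bundle, never across bundles."""
    X = np.array(X, dtype=float)  # work on a copy
    nodes = np.arange(n_nodes)
    for start in range(0, X.shape[1], n_nodes):
        block = X[:, start:start + n_nodes]  # view into X for one bundle
        for row in block:
            missing = np.isnan(row)
            if missing.any() and not missing.all():
                row[missing] = np.interp(nodes[missing], nodes[~missing], row[~missing])
    return X

# Synthetic data: 20 subjects, 2 bundles x 100 nodes, a few missing nodes
rng = np.random.default_rng(0)
X_raw = rng.normal(size=(20, 200))
X_raw[3, 50] = np.nan        # interior node
X_raw[7, 100:103] = np.nan   # first nodes of the second bundle
X = interpolate_within_bundles(X_raw, n_nodes=100)

# Learn scaling parameters from the training subjects only, then apply to both sets
X_train, X_test = X[:15], X[15:]
scaler = StandardScaler().fit(X_train)
X_train_z = scaler.transform(X_train)
X_test_z = scaler.transform(X_test)
```

Fitting the scaler on `X_train` alone is what keeps test-set statistics from leaking into the model.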

After scaling and imputation, the tract profile data and target phenotypical data can be organized in a linear model:

y=Xβ+ϵ, (1)

where y is the phenotype, which is either categorical, such as a clinical diagnosis, or continuous, such as the subject’s age. The tract profile data is represented in the feature matrix X, with rows corresponding to different subjects, and columns corresponding to diffusion measures at different nodes within each bundle. The relationship between tractometric features and the phenotypic target is characterized by the coefficients in β. The error term, ϵ, is an unobserved random variable that captures the error in the model. We denote our prediction of the target phenotype as y^ and the coefficients that produce this prediction as β^, so that

y^=Xβ^, (2)

without the error term, ϵ. In general, the feature matrix X has S rows and B × N × M columns, where S is the number of subjects, B is the number of white matter bundles, N is the number of nodes in each bundle, and M is the number of diffusion metrics calculated at each node. Typically, B = 18, N = 100, and 2 ≤ M ≤ 8, resulting in ∼4,000–14,400 features. Conversely, many dMRI studies have between a few dozen and a few hundred subjects, yielding a feature matrix that is wide and short. Even in cases where more than a thousand subjects are measured (e.g., in the Human Connectome Project, where 1,200 subjects were measured [52]), the problem is ill-posed: the high dimensionality of this data requires regularization to avoid overfitting and generate interpretable results.

We propose that in addition to regularizing the coefficients in β^, we can also capitalize on our knowledge of the group structure of the bundle profile features in X. The bundle-metric combinations form a natural grouping. For example, we expect that MD features within the left arcuate fasciculus will co-vary across individuals. Likewise for FA values within the right corticospinal tract (CST) and so on. This group structure is represented in Fig 1e, which depicts the linear model y^=Xβ^. Thus, we seek a regularization approach that will fit a linear model with anatomically-grouped covariates, where we expect to observe both groupwise sparsity, where the number of groups (bundle/metric combinations) with at least one non-zero coefficient is small, as well as within-group sparsity, where the number of non-zero coefficients within each non-zero group is small. The sparse group lasso (SGL) is a penalized regression technique that satisfies these criteria [46]. It solves for a coefficient vector β^ that satisfies

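\hat{\beta} = \min_{\beta} L_{mse} + (1 - \alpha) \lambda \sum_{\ell=1}^{G} \sqrt{p_\ell} \, \| \beta^{(\ell)} \|_2 + \alpha \lambda \| \beta \|_1, \quad \text{where} \quad L_{mse} = \frac{1}{2} \left\| y - \sum_{\ell=1}^{G} X^{(\ell)} \beta^{(\ell)} \right\|_2^2. (3)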

Here, G is the number of groups, X(ℓ) is the submatrix of X corresponding to group ℓ, β(ℓ) is the coefficient vector for group ℓ and pℓ is the length of β(ℓ). In the tractometry setting, G = B × M and ∀ℓ: pℓ = 100. The first term is the mean square error loss, Lmse, as in the standard linear regression framework. The second and third terms encourage groupwise sparsity and overall sparsity, respectively. If α = 1 the SGL reduces to the traditional lasso [92]. Conversely, if α = 0 the SGL reduces to the group lasso [93]. Thus, the model hyperparameter α controls the combination of group-lasso and lasso. The hyperparameter λ controls the strength of the regularization.
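To make the roles of α and λ concrete, the objective in Eq (3) can be evaluated directly. The function below is a minimal sketch for illustration; its name and the representation of groups as lists of column indices are assumptions, and it only evaluates the cost rather than minimizing it.

```python
import numpy as np

def sgl_objective(X, y, beta, groups, alpha, lam):
    """Evaluate the sparse group lasso cost of Eq (3) for a given beta."""
    residual = y - X @ beta
    loss = 0.5 * residual @ residual
    # Group lasso term: sqrt(group size) times the l2 norm of each group's coefficients
    group_penalty = sum(
        np.sqrt(len(idx)) * np.linalg.norm(beta[idx]) for idx in groups
    )
    # Lasso term: l1 norm of the full coefficient vector
    lasso_penalty = np.abs(beta).sum()
    return loss + (1 - alpha) * lam * group_penalty + alpha * lam * lasso_penalty

# Tiny example with two groups of two features each
X = np.eye(4)
y = np.zeros(4)
beta = np.array([3.0, 4.0, 0.0, 0.0])
groups = [np.array([0, 1]), np.array([2, 3])]

cost_lasso = sgl_objective(X, y, beta, groups, alpha=1.0, lam=2.0)        # pure lasso
cost_group_lasso = sgl_objective(X, y, beta, groups, alpha=0.0, lam=2.0)  # pure group lasso
```

Setting `alpha=1.0` leaves only the lasso penalty and `alpha=0.0` leaves only the group lasso penalty, matching the two limiting cases described above.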

SGL with principal components

SGL may be combined with principal components regression (PCR-SGL) by performing dimensionality reduction separately for each group of covariates. Let

X^{(\ell)} = U \Sigma V^{T}, (4)

be the compact singular value decomposition (SVD) of X(ℓ), the n × pℓ submatrix of X corresponding to group ℓ. Here, Σ is an r × r matrix, where r = min(n, pℓ), that contains the non-zero singular values of X(ℓ). VT is an r × pℓ semi-orthogonal matrix containing the principal axes of X(ℓ). The product Z = UΣ is an n × r matrix containing the principal component row vectors needed to reproduce X(ℓ) in the basis provided by VT.

Since this decomposition is performed separately for each group of covariates, the grouping information is preserved in the transformation from X to Z. We may then build an SGL model relating y and Z,

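\hat{y} = Z \hat{\theta}, (5)
\hat{\theta} = \min_{\theta} L_{mse} + (1 - \alpha) \lambda \sum_{\ell=1}^{G} \sqrt{p_\ell} \, \| \theta^{(\ell)} \|_2 + \alpha \lambda \| \theta \|_1, (6)
\text{where} \quad L_{mse} = \frac{1}{2} \left\| y - \sum_{\ell=1}^{G} Z^{(\ell)} \theta^{(\ell)} \right\|_2^2. (7)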

The PCR-SGL coefficients θ^ may be projected back onto the original feature space using β^=Vθ^.
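The group-wise decomposition and the back-projection can be illustrated with NumPy. This is a sketch under the assumption that groups are given as lists of column indices and that the back-projection is applied group by group; the function names are hypothetical.

```python
import numpy as np

def groupwise_pca(X, groups):
    """Compact SVD of each group submatrix; returns Z = U @ Sigma and V per group."""
    Z_blocks, V_blocks = [], []
    for idx in groups:
        U, s, Vt = np.linalg.svd(X[:, idx], full_matrices=False)
        Z_blocks.append(U * s)   # principal components, n x r
        V_blocks.append(Vt.T)    # principal axes, p x r
    return Z_blocks, V_blocks

def back_project(theta_blocks, V_blocks):
    """Map per-group PCR coefficients back to the original feature space."""
    return np.concatenate([V @ theta for V, theta in zip(V_blocks, theta_blocks)])

# Synthetic example: 10 subjects, 2 groups of 3 features
rng = np.random.default_rng(0)
X = rng.normal(size=(10, 6))
groups = [np.arange(0, 3), np.arange(3, 6)]
Z_blocks, V_blocks = groupwise_pca(X, groups)

theta_blocks = [rng.normal(size=Z.shape[1]) for Z in Z_blocks]
beta = back_project(theta_blocks, V_blocks)

# Predictions agree in both bases because V has orthonormal columns
y_hat_from_Z = sum(Z @ theta for Z, theta in zip(Z_blocks, theta_blocks))
y_hat_from_X = X @ beta
```

The two predictions coincide, which is why coefficients fit in the principal component basis can be interpreted along the original bundle profiles.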

Bagging meta-estimators

The previous section describes a single SGL model. To further improve model performance, we create ensemble models composed of m = 20 individual SGL models using bootstrap aggregation (bagging) [94]. Bagging relies on the underlying assumption that some of the error in a single SGL model’s prediction stems from a mismatch in the distributions of training data used to fit the model and test data used to evaluate its performance. To overcome this, bagging invokes the same base estimator (e.g. SGL) many times with different training sets, which are created by sampling the original training samples with replacement. The bagging meta-estimator’s prediction is then the average of its constituent estimators’ predictions. Likewise, when we report the hyperparameter values α and λ, or regression coefficients β^, we are referring to these values averaged over 20 estimators in the bagging meta-estimator.
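The bagging procedure can be sketched by hand in a few lines. Note the substitution: scikit-learn's `Lasso` stands in for the SGL base estimator so that the example is self-contained, and the data are synthetic.

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n_subjects, n_features = 40, 10
X = rng.normal(size=(n_subjects, n_features))
y = 2.0 * X[:, 0] + 0.1 * rng.normal(size=n_subjects)

n_estimators = 20
models = []
for _ in range(n_estimators):
    # Bootstrap: sample training subjects with replacement
    idx = rng.integers(0, n_subjects, size=n_subjects)
    # Lasso is a stand-in for the SGL base estimator (assumption for this sketch)
    models.append(Lasso(alpha=0.1).fit(X[idx], y[idx]))

# The ensemble prediction and the reported coefficients are averages over estimators
y_pred = np.mean([m.predict(X) for m in models], axis=0)
coef_mean = np.mean([m.coef_ for m in models], axis=0)
```

Averaging `coef_` across the constituent models mirrors how the averaged β^ values are reported for the ensemble.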

Incorporating target transformations

Often, the target variable y is not in the domain in which the linear model can be best fit to it. Eq (2) can be slightly modified as:

\hat{y} = f^{-1}(X \hat{\beta}), (8)

where the transformation function f−1 characterizes the transform applied to the data before fitting the linear coefficients. This is similar to the use of a link function in a generalized linear model, but without the adoption of an exponential probability distribution [95]. For the WH, HBN, and Cam-CAN datasets, we use a logarithmic transform,

f(y^)=ln(y^), (9)

implemented using scikit-learn’s TransformedTargetRegressor meta-estimator [90] with numpy’s log and exp as the transform and inverse transform functions, respectively [96].
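A minimal sketch of this target transformation is shown below. `TransformedTargetRegressor`, `np.log`, and `np.exp` are the components named in the text; the `Lasso` regressor and the synthetic age data are stand-ins chosen only to keep the example self-contained.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(80, 5))
# Synthetic positive "ages" that are linear in the features on a log scale
age = np.exp(3.0 + 0.3 * X[:, 0] + 0.05 * rng.normal(size=80))

model = TransformedTargetRegressor(
    regressor=Lasso(alpha=0.01),  # stand-in for the SGL estimator
    func=np.log,                  # f: applied to the target before fitting
    inverse_func=np.exp,          # f^{-1}: applied to the linear prediction
)
model.fit(X, age)
age_pred = model.predict(X)
```

The inner regressor is fit to ln(age), and `predict` returns values on the original age scale, so predictions are always positive.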

Classification of categorical targets

When the phenotypical target variable is categorical, as in the case of explaining or predicting the presence of a clinical diagnosis, the SGL is readily adapted to logistic regression, where the probability of a target variable belonging to an arbitrary defined “true” class is the logistic function of the result of the linear model,

p(\hat{y} = 1) = \frac{1}{1 + \exp(-X \hat{\beta})}, (10)

or equivalently, the mean squared error loss function in Eq (3) is replaced with the cross-entropy loss, which for binary classification is the negative log likelihood of the SGL classifier giving the “true” label:

L_{mse} \rightarrow L_{log} = -\left( y \log(p) + (1 - y) \log(1 - p) \right). (11)
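Eqs (10) and (11) translate directly into code. In this sketch the loss is summed over subjects and the probabilities are clipped for numerical safety; both choices, and the function names, are assumptions of the example.

```python
import numpy as np

def predict_proba(X, beta):
    """Logistic function of the linear model, as in Eq (10)."""
    return 1.0 / (1.0 + np.exp(-(X @ beta)))

def cross_entropy(y, p, eps=1e-12):
    """Negative log likelihood of Eq (11), summed over subjects."""
    p = np.clip(p, eps, 1.0 - eps)  # avoid log(0)
    return -np.sum(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

rng = np.random.default_rng(0)
X = rng.normal(size=(48, 6))
y = (X[:, 0] > 0).astype(float)

# With all coefficients at zero the model assigns probability 0.5 to every subject
p_null = predict_proba(X, np.zeros(6))
loss_null = cross_entropy(y, p_null)

# Coefficients aligned with the informative feature reduce the loss
beta = np.array([3.0, 0.0, 0.0, 0.0, 0.0, 0.0])
loss_fit = cross_entropy(y, predict_proba(X, beta))
```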

SGL implementation, cross-validation and metaparameter optimization

Because the SGL is not specific to tractometry, we implemented its solution as a general-purpose Python package called groupyr [97]. Groupyr solves the cost function in Eq (3) using proximal gradient descent [98] by implementing a custom proximal operator and relying on the C-OPT optimization library [99], providing a fitted SGL model as a scikit-learn compatible estimator [100]. Groupyr also selects the hyperparameters α and λ that yield the best cross-validated performance using either: (i) an exhaustive grid search of hyperparameter combinations, or (ii) sequential model based optimization using the scikit-optimize library [101].
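For intuition about the proximal step, the SGL penalty has a well-known closed-form proximal operator: element-wise soft-thresholding followed by group-wise shrinkage. The sketch below writes out that standard form; it is not copied from groupyr, and the function name and signature are hypothetical.

```python
import numpy as np

def sgl_prox(beta, groups, alpha, lam, step=1.0):
    """Proximal operator of the sparse group lasso penalty (standard form)."""
    # Lasso part: element-wise soft-thresholding
    out = np.sign(beta) * np.maximum(np.abs(beta) - step * alpha * lam, 0.0)
    # Group lasso part: shrink each group's l2 norm, zeroing small groups entirely
    for idx in groups:
        norm = np.linalg.norm(out[idx])
        thresh = step * (1 - alpha) * lam * np.sqrt(len(idx))
        if norm <= thresh:
            out[idx] = 0.0
        else:
            out[idx] = out[idx] * (1.0 - thresh / norm)
    return out

beta = np.array([3.0, -4.0, 0.5, 0.1])
groups = [np.array([0, 1]), np.array([2, 3])]

prox_lasso = sgl_prox(beta, groups, alpha=1.0, lam=1.0)  # only soft-thresholding acts
prox_group = sgl_prox(beta, groups, alpha=0.0, lam=1.0)  # only group shrinkage acts
```

In a proximal gradient iteration, this operator would be applied after each gradient step on the smooth loss, with `step` set to the step size.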

To objectively evaluate model performance and guard against over-fitting, we used a nested cross-validation scheme, which is depicted for the binary classification case in Fig 5. The subjects (i.e. rows of the feature matrix X in Fig 1e and Eq (1)) are first shuffled and then decomposed into k0 batches, hereafter referred to as folds. For the ALS and WH datasets, we used k0 = 10 and for the HBN and Cam-CAN datasets, k0 = 5. For each unique fold, we hold that fold out as the test0 set and let the remaining data comprise the train0 set, with the subscript indicating the depth of the nested decomposition. We further decompose each train0 set into three folds, and again for each unique fold, we hold out that fold as the test1 set and let the remaining data comprise the train1 set. At level-1 of the decomposition, we fit an SGL model using fixed regularization meta-parameters α and λ, training the model using train1 and evaluating the fit on test1. We find the optimal values for α and λ using sequential model-based optimization, implemented using the scikit-optimize BayesSearchCV class [101]. For continuous numerical y, BayesSearchCV searches for meta-parameter values that maximize the R2 averaged over test sets. For binary categorical y BayesSearchCV seeks to maximize the classification accuracy. Using hyperoptimization, we find optimal regularization parameters and β^ for each train0 set and then use those to predict values for data in test0. Thus each subject in the dataset has a predicted phenotype derived from a model that never saw its particular subject’s data. In the classification case, the shuffling into folds is stratified such that each fold has a population that preserves the percentage of each class found in the larger input data.
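The nested scheme can be sketched with scikit-learn alone. Two substitutions are made to keep the example self-contained, and both are assumptions rather than the method used here: `ElasticNet` stands in for SGL (its `alpha` and `l1_ratio` play roles loosely analogous to λ and α), and an exhaustive `GridSearchCV` replaces `BayesSearchCV`.

```python
import numpy as np
from sklearn.linear_model import ElasticNet
from sklearn.metrics import r2_score
from sklearn.model_selection import GridSearchCV, KFold, cross_val_predict
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 30))
y = X[:, :3] @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=60)

# Level 1: hyperparameter search with 3-fold cross-validation inside each train0 set.
# Scaling lives inside the pipeline so it is re-fit on every training split.
inner_search = GridSearchCV(
    make_pipeline(StandardScaler(), ElasticNet(max_iter=10000)),
    param_grid={
        "elasticnet__alpha": [0.01, 0.1, 1.0],
        "elasticnet__l1_ratio": [0.1, 0.5, 0.9],
    },
    cv=KFold(n_splits=3, shuffle=True, random_state=0),
    scoring="r2",
)

# Level 0: each subject is predicted by a model tuned and fit without that subject
outer_cv = KFold(n_splits=5, shuffle=True, random_state=0)
y_pred = cross_val_predict(inner_search, X, y, cv=outer_cv)
r2 = r2_score(y, y_pred)
```

For a classification target, `StratifiedKFold` and an accuracy scorer would take the place of `KFold` and `"r2"`.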

Fig 5. Nested cross-validation.

We evaluate model quality using a nested k-fold cross validation scheme. At level-0, the input data is decomposed into k0 shuffled groups and optimal hyperparameters are found for the level-0 training set. To avoid overfitting, the optimal hyperparameters are themselves evaluated using a cross-validation scheme taking place at level-1 of the decomposition, where each level-0 training set is further decomposed into k1 = 3 shuffled groups. In the classification case, the training and test splits are stratified by diagnosis. For the ALS and WH data, k0 = 10, while for the HBN and Cam-CAN data, k0 = 5.

For each dataset, we also perform a randomization test by training similar models on copies of the data for which the rows of the target variable y have been shuffled while the feature matrix X remains the same. This effectively destroys any relationship between the diffusion data and the outcome. Indeed, all models trained on the shuffled data perform no better than random guessing. This is expected: better-than-chance performance on shuffled targets would indicate data leakage between train and test sets [91] or other common machine learning pitfalls.
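A randomization test of this kind takes only a few lines. In this sketch `Lasso` again stands in for SGL and the data are synthetic, so the scores illustrate the logic rather than reproduce any result in this paper.

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 20))
y = X[:, :3] @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=100)

# Shuffle the target only; the feature matrix is left untouched
y_shuffled = rng.permutation(y)

model = Lasso(alpha=0.05)  # stand-in for the SGL estimator
score_real = cross_val_score(model, X, y, cv=5, scoring="r2").mean()
score_null = cross_val_score(model, X, y_shuffled, cv=5, scoring="r2").mean()
```

A cross-validated score on the shuffled target that is clearly above chance would be a warning sign of leakage somewhere in the pipeline.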

Software implementation

The full software implementation of the SGL approach presented here is available in a Python software package called AFQ-Insight, which is developed publicly in https://github.com/richford/afq-insight. The version of the code used to produce the results herein is also available in https://doi.org/10.5281/zenodo.4316000. AFQ-Insight reads the target and feature data that has been processed by AFQ from comma separated value (CSV) files conforming to the AFQ-Browser data format [58] and represents them internally as DataFrame objects from the pandas Python library [102]. The software provides different options for imputing missing data in the feature matrix. Missing interior nodes are imputed using linear interpolation. For missing exterior nodes, the user may choose between linear extrapolation and constant forward- or back-fill. Imputation uses only values from adjacent nodes in the same white matter bundle in the same subject so there is no danger of data leakage from other subjects. It uses the scikit-learn [90] library to decompose input data into separate test and train datasets, to scale each feature to have zero mean and unit variance, and to evaluate each fit in the hyperparameter search using appropriate classification and regression metrics such as accuracy, area under the receiver operating curve (AUC ROC), and coefficient of determination (R2). Internally, AFQ-Insight uses the groupyr software library [97] mentioned above to solve the SGL.

Supporting information

S1 Fig. Bundle profiles and regression coefficients for the ALS dataset FA.

Diffusion metrics are plotted along the length of eighteen bundles: right corticospinal (CSTR), left corticospinal (CSTL), right uncinate (UNCR), left uncinate (UNCL), left inferior fronto-occipital fasciculus (IFOL), right inferior fronto-occipital fasciculus (IFOR), right arcuate (ARCR), left arcuate (ARCL), right thalamic radiation (ATRR), left thalamic radiation (ATRL), right cingulum cingulate (CGCR), left cingulum cingulate (CGCL), callosum forceps posterior (CFP), callosum forceps anterior (CFA), right inferior longitudinal fasciculus (ILFR), left inferior longitudinal fasciculus (ILFL), right superior longitudinal fasciculus (SLFR), and left superior longitudinal fasciculus (SLFL). FA is plotted on the left y-axis while the β^ coefficients are displayed on the twin axis on the right-hand-side. SGL selected the right corticospinal tract (CSTR) as important and regularized coefficients in the CSTL. Yet, there are also group FA differences in the CSTL. This highlighted a potential drawback of the SGL method, discussed in the main text in the context of age regression. Namely, SGL is not guaranteed to identify all important features. In this case, if the diagnostic signal in the CSTL is redundant to that in the CSTR, SGL will regularize the CSTL features, thereby reducing its sparsity penalty without any corresponding increase in loss. This parsimony cuts both ways; it is a feature of the method when one seeks an efficient predictive model, but is a disadvantage of the method when one wants an exhaustive explanation of feature importance. We use the phrase “parsimony pitfall” to refer to the case when SGL regularizes away redundant but obviously important features.

(TIF)

S2 Fig. Bundle profiles and regression coefficients for the ALS dataset MD.

The scale of the β^-axis is identical to that used in S1 Fig, to facilitate the comparison of the relative importance of each metric.

(TIF)

S3 Fig. Bundle profiles and regression coefficients for the WH dataset FA.

In contrast to the ALS classification case, the β^ coefficients are distributed widely through the brain, supporting the interpretation that aging is a large and continuous whole-brain process.

(TIF)

S4 Fig. Bundle profiles and regression coefficients for the WH dataset MD.

This and S3 Fig demonstrate that SGL behaves much more like the lasso than the group lasso, as discussed in the main text. The parsimony pitfall is most evident in the IFOL and IFOR bundles.

(TIF)

S5 Fig. FA bundle profiles and β^ coefficients for age regression in the HBN dataset.

Like the WH dataset, the β^ coefficients are distributed widely through the brain and SGL behaves more like the lasso than the group lasso.

(TIF)

S6 Fig. MD bundle profiles and β^ coefficients for age regression in the HBN dataset.

In contrast to the WH results, the bundle profiles show different behaviors. For example, the SLFL and SLFR bundle profiles have different concavity. This is unsurprising, however, given the differences between these datasets (see also Discussion). The parsimony pitfall is most evident in the UNCL, UNCR, ARCL, SLFL, and SLFR bundles.

(TIF)

S7 Fig. FA bundle profiles and β^ coefficients for age regression in the Cam-CAN dataset.

The β^ coefficients are distributed widely through the brain and SGL behaves more like the lasso than the group lasso. As before, one must be cautious about comparing bundle profiles and β^ coefficients between models. While the HBN and Cam-CAN datasets share the same diffusion model and refrain from clipping streamlines, the age distributions for the two are roughly disjoint, with the WH age distribution straddling the two.

(TIF)

S8 Fig. Mean diffusivity (MD) bundle profiles and β^ coefficients for age regression in the Cam-CAN dataset.

(TIF)

S9 Fig. Age distributions are similar between the different HBN sites.

Rutgers University Brain Imaging Center (RU) and the CitiGroup Cornell Brain Imaging Center (CBIC).

(TIF)

S10 Fig. FA bundle profiles exhibit strong site differences in the HBN dataset.

(TIF)

S11 Fig. MD bundle profiles exhibit strong site differences in the HBN dataset.

(TIF)

S12 Fig. Site differences in FA are eliminated by ComBat harmonization.

(TIF)

S13 Fig. Site differences in MD are eliminated by ComBat harmonization.

(TIF)

Acknowledgments

We would like to thank Scott Murray for a useful discussion of the SGL method and Mareike Grotheer and John Kruper for helpful comments on the manuscript. Data collection and sharing for this project was provided by the Cambridge Centre for Ageing and Neuroscience (CamCAN). CamCAN funding was provided by the UK Biotechnology and Biological Sciences Research Council (grant number BB/H008217/1), together with support from the UK Medical Research Council and University of Cambridge, UK. This manuscript was prepared using a limited access dataset obtained from the Child Mind Institute Biobank, The Healthy Brain Network dataset. This manuscript reflects the views of the authors and does not necessarily reflect the opinions or views of the Child Mind Institute.

Data Availability

The main software described in this study is available through GitHub at https://github.com/richford/AFQ-Insight. The version of software used in this study is also available at https://doi.org/10.5281/zenodo.4316000. To facilitate reproducibility and ease use of the software, the results presented in this paper are also provided in https://github.com/richford/afq-insight-paper as a series of Jupyter notebooks. We refer to four datasets in this work: ALS, WH, HBN, and Cam-CAN. The ALS dataset is openly available at https://doi.org/10.5281/zenodo.1161864. The WH dataset is openly available at https://doi.org/10.5281/zenodo.1161846. To facilitate acquisition of these data, we provide a ‘make data’ command in the paper repository at https://github.com/richford/afq-insight-paper. Data from the HBN study is available to access through: http://fcon_1000.projects.nitrc.org/indi/cmi_healthy_brain_network/. MRI data from this study is openly available. Phenotypic data can be freely accessed, upon establishing a data usage agreement with the Child Mind Institute, which is the data provider for this study. Data from the Cam-CAN study is available to access through: https://www.cam-can.org/index.php?content=dataset, and can be freely accessed upon agreeing to terms and conditions set forth by the Cambridge Centre for Ageing and Neuroscience, which is the data provider for this study.

Funding Statement

ARH, AR, JDY and NS are supported by BRAIN grant 1RF1MH121868-01 from the National Institute of Mental Health. ARH and AR are also supported by a grant from the Gordon & Betty Moore Foundation (https://www.moore.org/) and the Alfred P. Sloan Foundation (https://sloan.org/) to the University of Washington eScience Institute Data Science Environment. Computational resources for this study were provided by the Google Cloud Platform Academic Research Credits Program (AR). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Stejskal EO, Tanner JE. Spin Diffusion Measurements: Spin Echoes in the Presence of a Time-Dependent Field Gradient. The Journal of Chemical Physics. 1965;42(1):288–292. doi: 10.1063/1.1695690 [DOI] [Google Scholar]
  • 2. Wandell BA. Clarifying human white matter. Annual review of neuroscience. 2016;39:103–128. doi: 10.1146/annurev-neuro-070815-013815 [DOI] [PubMed] [Google Scholar]
  • 3. Conturo TE, Lori NF, Cull TS, Akbudak E, Snyder AZ, Shimony JS, et al. Tracking neuronal fiber pathways in the living human brain. Proc Natl Acad Sci U S A. 1999;96(18):10422–10427. doi: 10.1073/pnas.96.18.10422 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Mori S, Van Zijl PCM. Fiber tracking: principles and strategies–a technical review. NMR in Biomedicine: An International Journal Devoted to the Development and Application of Magnetic Resonance In Vivo. 2002;15(7-8):468–480. doi: 10.1002/nbm.781 [DOI] [PubMed] [Google Scholar]
  • 5. Yeatman JD, Dougherty RF, Myall NJ, Wandell BA, Feldman HM. Tract profiles of white matter properties: automating fiber-tract quantification. PloS one. 2012;7(11):e49790. doi: 10.1371/journal.pone.0049790 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Jones DK, Travis AR, Eden G, Pierpaoli C, Basser PJ. PASTA: pointwise assessment of streamline tractography attributes. Magn Reson Med. 2005;53(6):1462–1467. doi: 10.1002/mrm.20484 [DOI] [PubMed] [Google Scholar]
  • 7. Bells S, Cercignani M, Deoni S, Assaf Y, Pasternak O, Evans C, et al. Tractometry–comprehensive multi-modal quantitative assessment of white matter along specific tracts. In: Proc. ISMRM. vol. 678; 2011. p. 1. [Google Scholar]
  • 8. Colby JB, Soderberg L, Lebel C, Dinov ID, Thompson PM, Sowell ER. Along-tract statistics allow for enhanced tractography analysis. Neuroimage. 2012;59(4):3227–3242. doi: 10.1016/j.neuroimage.2011.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. O’Donnell LJ, Westin CF, Golby AJ. Tract-based morphometry for white matter group analysis. Neuroimage. 2009;45(3):832–844. doi: 10.1016/j.neuroimage.2008.12.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Nichols TE, Holmes AP. Nonparametric permutation tests for functional neuroimaging: a primer with examples. Hum Brain Mapp. 2002;15(1):1–25. doi: 10.1002/hbm.1058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Nichols T, Hayasaka S. Controlling the familywise error rate in functional neuroimaging: a comparative review. Stat Methods Med Res. 2003;12(5):419–446. doi: 10.1191/0962280203sm341ra [DOI] [PubMed] [Google Scholar]
  • 12. Chamberland M, Raven EP, Genc S, Duffy K, Descoteaux M, Parker GD, et al. Dimensionality reduction of diffusion MRI measures for improved tractometry of the human brain. NeuroImage. 2019;200:89–100. doi: 10.1016/j.neuroimage.2019.06.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Huber E, Donnelly PM, Rokem A, Yeatman JD. Rapid and widespread white matter plasticity during an intensive reading intervention. Nature communications. 2018;9(1):2260. doi: 10.1038/s41467-018-04627-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Dayan M, Monohan E, Pandya S, Kuceyeski A, Nguyen TD, Raj A, et al. Profilometry: a new statistical framework for the characterization of white matter pathways, with application to multiple sclerosis. Human brain mapping. 2016;37(3):989–1004. doi: 10.1002/hbm.23082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, Yu B. Definitions, methods, and applications in interpretable machine learning. Proc Natl Acad Sci U S A. 2019;116(44):22071–22080. doi: 10.1073/pnas.1900654116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Breiman L. Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Stat Sci. 2001;16(3):199–231. doi: 10.1214/ss/1009213725 [DOI] [Google Scholar]
  • 17. Smith SM, Jenkinson M, Johansen-Berg H, Rueckert D, Nichols TE, Mackay CE, et al. Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. Neuroimage. 2006;31(4):1487–1505. doi: 10.1016/j.neuroimage.2006.02.024 [DOI] [PubMed] [Google Scholar]
  • 18. Yeh FC, Badre D, Verstynen T. Connectometry: a statistical approach harnessing the analytical potential of the local connectome. Neuroimage. 2016;125:162–171. doi: 10.1016/j.neuroimage.2015.10.053 [DOI] [PubMed] [Google Scholar]
  • 19. Bzdok D, Engemann D, Thirion B. Inference and Prediction Diverge in Biomedicine. Patterns (N Y). 2020;1(8):100119. doi: 10.1016/j.patter.2020.100119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Naselaris T, Kay KN, Nishimoto S, Gallant JL. Encoding and decoding in fMRI. Neuroimage. 2011;56(2):400–410. doi: 10.1016/j.neuroimage.2010.07.073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Tibshirani R. Regression Shrinkage and Selection via the Lasso. J R Stat Soc Series B Stat Methodol. 1996;58(1):267–288. [Google Scholar]
  • 22. Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. J R Stat Soc Series B Stat Methodol. 2006;68(1):49–67. doi: 10.1111/j.1467-9868.2005.00532.x [DOI] [Google Scholar]
  • 23. Simon N, Friedman J, Hastie T, Tibshirani R. A sparse-group lasso. Journal of Computational and Graphical Statistics. 2013;22(2):231–245. doi: 10.1080/10618600.2012.681250 [DOI] [Google Scholar]
  • 24. Powell MA, Garcia JO, Yeh FC, Vettel JM, Verstynen T. Local connectome phenotypes predict social, health, and cognitive factors. Network Neuroscience. 2018;2(1):86–105. doi: 10.1162/NETN_a_00031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Rasero J, Sentis AI, Yeh FC, Verstynen T. Integrating across neuroimaging modalities boosts prediction accuracy of cognitive ability. PLOS Computational Biology. 2021;17(3):1–25. doi: 10.1371/journal.pcbi.1008347 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Sarica A, Cerasa A, Valentino P, Yeatman J, Trotta M, Barone S, et al. The corticospinal tract profile in amyotrophic lateral sclerosis. Human brain mapping. 2017;38(2):727–739. doi: 10.1002/hbm.23412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. van der Graaff MM, Sage CA, Caan MW, Akkerman EM, Lavini C, Majoie CB, et al. Upper and extra-motoneuron involvement in early motoneuron disease: a diffusion tensor imaging study. Brain. 2011;134(4):1211–1228. doi: 10.1093/brain/awr016 [DOI] [PubMed] [Google Scholar]
  • 28. Toosy A, Werring D, Orrell R, Howard R, King M, Barker G, et al. Diffusion tensor imaging detects corticospinal tract involvement at multiple levels in amyotrophic lateral sclerosis. Journal of Neurology, Neurosurgery & Psychiatry. 2003;74(9):1250–1257. doi: 10.1136/jnnp.74.9.1250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Sarica A, Cerasa A, Vasta R, Perrotta P, Valentino P, Mangone G, et al. Tractography in amyotrophic lateral sclerosis using a novel probabilistic tool: a study with tract-based reconstruction compared to voxel-based approach. Journal of neuroscience methods. 2014;224:79–87. doi: 10.1016/j.jneumeth.2013.12.014 [DOI] [PubMed] [Google Scholar]
  • 30. Sage CA, Peeters RR, Görner A, Robberecht W, Sunaert S. Quantitative diffusion tensor imaging in amyotrophic lateral sclerosis. Neuroimage. 2007;34(2):486–499. doi: 10.1016/j.neuroimage.2006.09.025 [DOI] [PubMed] [Google Scholar]
  • 31. Sage CA, Van Hecke W, Peeters R, Sijbers J, Robberecht W, Parizel P, et al. Quantitative diffusion tensor imaging in amyotrophic lateral sclerosis: revisited. Human brain mapping. 2009;30(11):3657–3675. doi: 10.1002/hbm.20794 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Karlsborg M, Rosenbaum S, Wiegell MR, Simonsen H, Larsson HB, Werdelin LM, et al. Corticospinal tract degeneration and possible pathogenesis in ALS evaluated by MR diffusion tensor imaging. Amyotrophic Lateral Sclerosis and Other Motor Neuron Disorders. 2004;5(3):136–140. doi: 10.1080/14660820410018982 [DOI] [PubMed] [Google Scholar]
  • 33. Ellis C, Simmons A, Jones D, Bland J, Dawson J, Horsfield M, et al. Diffusion tensor MRI assesses corticospinal tract damage in ALS. Neurology. 1999;53(5):1051–1051. doi: 10.1212/WNL.53.5.1051 [DOI] [PubMed] [Google Scholar]
  • 34. Cosottini M, Giannelli M, Siciliano G, Lazzarotti G, Michelassi MC, Del Corona A, et al. Diffusion-tensor MR imaging of corticospinal tract in amyotrophic lateral sclerosis and progressive muscular atrophy. Radiology. 2005;237(1):258–264. doi: 10.1148/radiol.2371041506 [DOI] [PubMed] [Google Scholar]
  • 35. Ciccarelli O, Behrens TE, Johansen-Berg H, Talbot K, Orrell RW, Howard RS, et al. Investigation of white matter pathology in ALS and PLS using tract-based spatial statistics. Human brain mapping. 2009;30(2):615–624. doi: 10.1002/hbm.20527 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Abe O, Takao H, Gonoi W, Sasaki H, Murakami M, Kabasawa H, et al. Voxel-based analysis of the diffusion tensor. Neuroradiology. 2010;52(8):699–710. doi: 10.1007/s00234-010-0716-3 [DOI] [PubMed] [Google Scholar]
  • 37. Cole JH, Marioni RE, Harris SE, Deary IJ. Brain age and other bodily’ages’: implications for neuropsychiatry. Mol Psychiatry. 2019;24(2):266–281. doi: 10.1038/s41380-018-0098-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Nelson PG, Promislow DEL, Masel J. Biomarkers for Aging Identified in Cross-sectional Studies Tend to Be Non-causative. The Journals of Gerontology: Series A. 2019;75(3):466–472. doi: 10.1093/gerona/glz174 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Yeatman JD, Wandell BA, Mezer AA. Lifespan maturation and degeneration of human brain white matter. Nature communications. 2014;5:4932. doi: 10.1038/ncomms5932 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Alexander LM, Escalera J, Ai L, Andreotti C, Febre K, Mangone A, et al. An open resource for transdiagnostic research in pediatric mental health and learning disorders. Scientific Data. 2017;4:170181. doi: 10.1038/sdata.2017.181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Shafto MA, Tyler LK, Dixon M, Taylor JR, Rowe JB, Cusack R, et al. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing. BMC Neurology. 2014;14:204. doi: 10.1186/s12883-014-0204-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Taylor JR, Williams N, Cusack R, Auer T, Shafto MA, Dixon M, et al. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. NeuroImage. 2017;144:262–269. doi: 10.1016/j.neuroimage.2015.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Jensen JH, Helpern JA, Ramani A, Lu H, Kaczynski K. Diffusional kurtosis imaging: The quantification of non-gaussian water diffusion by means of magnetic resonance imaging. Magnetic Resonance in Medicine. 2005;53(6):1432–1440. doi: 10.1002/mrm.20508 [DOI] [PubMed] [Google Scholar]
  • 44. McPherson B, Pestilli F. A single-mode associates global patterns of brain network structure and behavior across the human lifespan. bioRxiv. 2020. [Google Scholar]
  • 45. Richard G, Kolskår K, Sanders AM, Kaufmann T, Petersen A, Doan NT, et al. Assessing distinct patterns of cognitive aging using tissue-specific brain age prediction based on diffusion tensor imaging and brain morphometry. PeerJ. 2018;6:e5908. doi: 10.7717/peerj.5908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Simon N, Friedman J, Hastie T, Tibshirani R. A sparse-group lasso. Journal of Computational and Graphical Statistics. 2013;22(2):231–245. doi: 10.1080/10618600.2012.681250 [DOI] [Google Scholar]
  • 47. Pinto MS, Paolella R, Billiet T, Van Dyck P, Guns PJ, Jeurissen B, et al. Harmonization of Brain Diffusion MRI: Concepts and Methods. Frontiers in Neuroscience. 2020;14:396. doi: 10.3389/fnins.2020.00396 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Lerma-Usabiaga G, Mukherjee P, Ren Z, Perry ML, Wandell BA. Replication and generalization in applied neuroimaging. NeuroImage. 2019;202:116048. doi: 10.1016/j.neuroimage.2019.116048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Jernigan TL, Brown TT, Hagler DJ Jr, Akshoomoff N, Bartsch H, Newman E, et al. The Pediatric Imaging, Neurocognition, and Genetics (PING) Data Repository. Neuroimage. 2016;124(Pt B):1149–1154. doi: 10.1016/j.neuroimage.2015.04.057 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Jernigan TL, Brown SA, Dowling GJ. The Adolescent Brain Cognitive Development Study. J Res Adolesc. 2018;28(1):154–156. doi: 10.1111/jora.12374 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat Neurosci. 2016;. doi: 10.1038/nn.4393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Van Essen DC, Ugurbil K, Auerbach E, Barch D, Behrens TEJ, Bucholz R, et al. The Human Connectome Project: A data acquisition perspective. NeuroImage. 2012;62(4):2222–2231. doi: 10.1016/j.neuroimage.2012.02.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Steinmetz N, Zatka-Haas P, Carandini M, Harris K. Distributed correlates of visually-guided behavior across the mouse brain. Nature. 2018; p. in press. [Google Scholar]
  • 54. Jun JJ, Steinmetz NA, Siegle JH, Denman DJ, Bauza M, Barbarits B, et al. Fully integrated silicon probes for high-density recording of neural activity. Nature. 2017;551(7679):232–236. doi: 10.1038/nature24636 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Zhou J, Liu J, Narayan VA, Ye J. Modeling Disease Progression via Fused Sparse Group Lasso. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD’12. New York, NY, USA: ACM; 2012. p. 1095–1103. [DOI] [PMC free article] [PubMed]
  • 56. Rao N, Nowak R, Cox C, Rogers T. Classification with Sparse Overlapping Groups; 2014. [Google Scholar]
  • 57. Kruper J, Yeatman JD, Richie-Halford A, Bloom D, Grotheer M, Caffarra S, et al. Evaluating the reliability of human brain white matter tractometry. Aperture. 2021. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Yeatman JD, Richie-Halford A, Smith JK, Keshavan A, Rokem A. A browser-based tool for visualization and analysis of diffusion MRI data. Nature communications. 2018;9(1):940. doi: 10.1038/s41467-018-03297-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Kluyver T, Ragan-Kelley B, Pérez F, Granger BE, Bussonnier M, Frederic J, et al. Jupyter Notebooks-a publishing format for reproducible computational workflows. In: ELPUB; 2016. p. 87–90. [Google Scholar]
  • 60. Basser PJ, Mattiello J, LeBihan D. MR diffusion tensor spectroscopy and imaging. Biophysical journal. 1994;66(1):259–267. doi: 10.1016/S0006-3495(94)80775-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Chang LC, Jones DK, Pierpaoli C. RESTORE: robust estimation of tensors by outlier rejection. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine. 2005;53(5):1088–1095. doi: 10.1002/mrm.20426 [DOI] [PubMed] [Google Scholar]
  • 62. Gorgolewski K, Burns CD, Madison C, Clark D, Halchenko YO, Waskom ML, et al. Nipype: a flexible, lightweight and extensible neuroimaging data processing framework in Python. Frontiers in Neuroinformatics. 2011;5:13. doi: 10.3389/fninf.2011.00013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Gorgolewski KJ, Esteban O, Markiewicz CJ, Ziegler E, Ellis DG, Notter MP, et al. Nipype. Software. 2018. [Google Scholar]
  • 64. Tustison NJ, Avants BB, Cook PA, Zheng Y, Egan A, Yushkevich PA, et al. N4ITK: Improved N3 Bias Correction. IEEE Transactions on Medical Imaging. 2010;29(6):1310–1320. doi: 10.1109/TMI.2010.2046908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Fonov V, Evans A, McKinstry R, Almli C, Collins D. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. NeuroImage. 2009;47, Supplement 1:S102. doi: 10.1016/S1053-8119(09)70884-5 [DOI] [Google Scholar]
  • 66. Avants BB, Epstein CL, Grossman M, Gee JC. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Medical Image Analysis. 2008;12(1):26–41. doi: 10.1016/j.media.2007.06.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Zhang Y, Brady M, Smith S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Transactions on Medical Imaging. 2001;20(1):45–57. doi: 10.1109/42.906424 [DOI] [PubMed] [Google Scholar]
  • 68. Veraart J, Novikov DS, Christiaens D, Ades-Aron B, Sijbers J, Fieremans E. Denoising of diffusion MRI using random matrix theory. NeuroImage. 2016;142:394–406. doi: 10.1016/j.neuroimage.2016.08.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Andersson JL, Sotiropoulos SN. An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. Neuroimage. 2016;125:1063–1078. doi: 10.1016/j.neuroimage.2015.10.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Andersson JL, Graham MS, Zsoldos E, Sotiropoulos SN. Incorporating outlier detection and replacement into a non-parametric framework for movement and distortion correction of diffusion MR images. Neuroimage. 2016;141:556–572. doi: 10.1016/j.neuroimage.2016.06.058 [DOI] [PubMed] [Google Scholar]
  • 71. Andersson JL, Skare S, Ashburner J. How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. Neuroimage. 2003;20(2):870–888. doi: 10.1016/S1053-8119(03)00336-7 [DOI] [PubMed] [Google Scholar]
  • 72. Power JD, Mitra A, Laumann TO, Snyder AZ, Schlaggar BL, Petersen SE. Methods to detect, characterize, and remove motion artifact in resting state fMRI. NeuroImage. 2014;84(Supplement C):320–341. doi: 10.1016/j.neuroimage.2013.08.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Tournier JD, Calamante F, Gadian DG, Connelly A. Direct estimation of the fiber orientation density function from diffusion-weighted MRI data using spherical deconvolution. NeuroImage. 2004;23(3):1176–1185. doi: 10.1016/j.neuroimage.2004.07.037 [DOI] [PubMed] [Google Scholar]
  • 74. Tournier JD, Yeh CH, Calamante F, Cho KH, Connelly A, Lin CP. Resolving crossing fibres using constrained spherical deconvolution: validation using diffusion-weighted imaging phantom data. Neuroimage. 2008;42(2):617–625. doi: 10.1016/j.neuroimage.2008.05.002 [DOI] [PubMed] [Google Scholar]
  • 75. Dhollander T, Mito R, Raffelt D, Connelly A. Improved white matter response function estimation for 3-tissue constrained spherical deconvolution. In: Proc. Intl. Soc. Mag. Reson. Med; 2019. p. 555. [Google Scholar]
  • 76.Dhollander T, Raffelt D, Connelly A. Unsupervised 3-tissue response function estimation from single-shell or multi-shell diffusion MR data without a co-registered T1 image. In: ISMRM Workshop on Breaking the Barriers of Diffusion MRI. vol. 5; 2016. p. 5.
  • 77. J-Donald, Smith R, Raffelt D, Tabbara R, Dhollander T, Pietsch M, et al. MRtrix3: A fast, flexible and open software framework for medical image processing and visualisation. NeuroImage. 2019;202:116137. doi: 10.1016/j.neuroimage.2019.116137 [DOI] [PubMed] [Google Scholar]
  • 78. Raffelt D, Dhollander T, Tournier JD, Tabbara R, Smith RE, Pierre E, et al. Bias field correction and intensity normalisation for quantitative analysis of apparent fibre density. In: Proc. Intl. Soc. Mag. Reson. Med. vol. 25; 2017. p. 3541. [Google Scholar]
  • 79. Abraham A, Pedregosa F, Eickenberg M, Gervais P, Mueller A, Kossaifi J, et al. Machine learning for neuroimaging with scikit-learn. Frontiers in Neuroinformatics. 2014;8. doi: 10.3389/fninf.2014.00014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Garyfallidis E, Brett M, Amirbekian B, Rokem A, Van Der Walt S, Descoteaux M, et al. DIPY, a library for the analysis of diffusion MRI data. Frontiers in neuroinformatics. 2014;8:8. doi: 10.3389/fninf.2014.00008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Basser PJ, Pajevic S, Pierpaoli C, Duda J, Aldroubi A. In vivo fiber tractography using DT-MRI data. Magnetic resonance in medicine. 2000;44(4):625–632. doi: [DOI] [PubMed] [Google Scholar]
  • 82. Wakana S, Caprihan A, Panzenboeck MM, Fallon JH, Perry M, Gollub RL, et al. Reproducibility of quantitative tractography methods applied to cerebral white matter. Neuroimage. 2007;36(3):630–644. doi: 10.1016/j.neuroimage.2007.02.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Hua K, Zhang J, Wakana S, Jiang H, Li X, Reich DS, et al. Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract-specific quantification. Neuroimage. 2008;39(1):336–347. doi: 10.1016/j.neuroimage.2007.07.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Gorgolewski KJ, Auer T, Calhoun VD, Craddock RC, Das S, Duff EP, et al. The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Scientific data. 2016;3(1):1–9. doi: 10.1038/sdata.2016.44 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8(1):118–127. doi: 10.1093/biostatistics/kxj037 [DOI] [PubMed] [Google Scholar]
  • 86. Fortin JP, Cullen N, Sheline YI, Taylor WD, Aselcioglu I, Cook PA, et al. Harmonization of cortical thickness measurements across scanners and sites. Neuroimage. 2018;167:104–120. doi: 10.1016/j.neuroimage.2017.11.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Fortin JP, Parker D, Tunç B, Watanabe T, Elliott MA, Ruparel K, et al. Harmonization of multi-site diffusion tensor imaging data. NeuroImage. 2017;161:149–170. doi: 10.1016/j.neuroimage.2017.08.047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Nielson DM, Pereira F, Zheng CY, Migineishvili N, Lee JA, Thomas AG, et al. Detecting and harmonizing scanner differences in the ABCD study—annual release 1.0. bioRxiv. 2018. [Google Scholar]
  • 89.Pinaya WHL. NeuroCombat-sklearn; 2020. Available from: https://github.com/Warvito/neurocombat_sklearn.
  • 90. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830. [Google Scholar]
  • 91. Kaufman S, Rosset S, Perlich C, Stitelman O. Leakage in data mining: Formulation, detection, and avoidance. ACM Transactions on Knowledge Discovery from Data (TKDD). 2012;6(4):15. doi: 10.1145/2382577.2382579 [DOI] [Google Scholar]
  • 92. Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996; p. 267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x [DOI] [Google Scholar]
  • 93. Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2006;68(1):49–67. doi: 10.1111/j.1467-9868.2005.00532.x [DOI] [Google Scholar]
  • 94. Breiman L. Bagging predictors. Machine learning. 1996;24(2):123–140. doi: 10.1007/BF00058655 [DOI] [Google Scholar]
  • 95. Nelder JA, Wedderburn RWM. Generalized Linear Models. Journal of the Royal Statistical Society Series A (General). 1972;135(3):370–384. doi: 10.2307/2344614 [DOI] [Google Scholar]
  • 96. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020;585(7825):357–362. doi: 10.1038/s41586-020-2649-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Richie Halford A, Narayan M, Simon N, Yeatman J, Rokem A. Groupyr: Sparse Group Lasso in Python. Journal of Open Source Software. 2021;6(58):3024. doi: 10.21105/joss.03024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Parikh N, Boyd S, et al. Proximal algorithms. Foundations and Trends in Optimization. 2014;1(3):127–239. doi: 10.1561/2400000003 [DOI] [Google Scholar]
  • 99.Pedregosa F. C-OPT: composite optimization in Python; 2018. Available from: http://openopt.github.io/copt/.
  • 100.Buitinck L, Louppe G, Blondel M, Pedregosa F, Mueller A, Grisel O, et al. API design for machine learning software: experiences from the scikit-learn project. In: ECML PKDD Workshop: Languages for Data Mining and Machine Learning; 2013. p. 108–122.
  • 101.Head T, Kumar M, Nahrstaedt H, Louppe G, Shcherbatyi I. scikit-optimize/scikit-optimize; 2020. Available from: 10.5281/zenodo.4014775. [DOI]
  • 102.McKinney W, et al. Data structures for statistical computing in python. In: Proceedings of the 9th Python in Science Conference. vol. 445. Austin, TX; 2010. p. 51–56.
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009136.r001

Decision Letter 0

Roberto Toro, Daniele Marinazzo

26 Mar 2021

Dear Richie-Halford,

Thank you very much for submitting your manuscript "Multidimensional analysis and detection of informative features in human brain white matter" for consideration at PLOS Computational Biology. Three reviewers have provided their comments and they are all positive about the quality of your contribution. They ask, however, for some additional precision in the discussion of alternative methods and the description of the results that I encourage you to take into consideration, as they should help increase the impact of your work. We will be looking forward to receiving a revised version of your manuscript, taking these comments into account.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roberto Toro

Associate Editor

PLOS Computational Biology

Daniele Marinazzo

Deputy Editor

PLOS Computational Biology

***********************

Dear Authors,

Thank you for submitting your article. Three reviewers have provided their comments and they are all positive about the quality of your contribution. They ask, however, for some additional precision in the discussion of alternative methods and the description of the results that I encourage you to take into consideration, as they should help increase the impact of your work. We will be looking forward to receiving a revised version of your manuscript, taking these comments into account.

Best regards,

Roberto Toro

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: This article describes a systematic evaluation of sparse group lasso (SGL) applied to tractometry output of dMRI-derived streamlines as a classification and individual differences method. Using SGL+tractometry in the classification setting, the authors reliably distinguish between ALS patients and matched controls at a classification accuracy of 83%, with the major contributing pathways to classification being the descending corticospinal (CST) pathways, as would be expected for an upper motor neuron disease. In the regression context, the authors found that they could reliably predict chronological age in 3 different data sets, with r^2 values ranging from 0.52 to 0.77. The authors conclude that this approach provides an effective way of identifying white matter phenotypes from dMRI data that also identifies contributing pathways relevant to model accuracy.

Overall, I found the results to be quite compelling. There are some outstanding questions about how this method relates to prior approaches and details about the modeling work itself that should be addressed. But in general I think this work will be a valuable contribution to the field.

MAJOR COMMENTS

- The authors make a compelling case that standard voxelwise approaches to developing brain-based phenotypes, particularly from dMRI data, are quite limited due to their high dimensionality and lack of specificity. One thing that is not clear up front, and could help in the interpretation, is that the authors are specifically talking about it in the context of decoding models (e.g., Y = group/individual measure), unlike many previous models (e.g., TBSS, connectometry) that are encoding models (e.g., Y= brain measure). For example, when I first read the Introduction, the discussion of prior approaches not characterizing specificity of bundles relevant to individual differences seemed incorrect as TBSS, connectometry, and even the initial tractometry approaches do this to some degree (and even control for the multiple comparisons). But the authors are correct that there are not many approaches that try to answer decoding questions while maintaining inferential specificity of individual pathways. Since the field is largely dominated by encoding models, I recommend making this clearer up front to avoid similar confusion in the future.

- Along the same lines as my prior comment, I wonder if the authors could compare their approach to the commonly used sparsity-constrained principal component regression (PCR) approach, i.e., LASSO-PCR. This is becoming widely used in neuroimaging and has been applied specifically to dMRI data (see: 1) Powell, M. A., Garcia, J. O., Yeh, F. C., Vettel, J. M., & Verstynen, T. (2018). Local connectome phenotypes predict social, health, and cognitive factors. Network neuroscience, 2(1), 86-105. 2) Rasero, Javier, Amy Isabella Sentis, Fang-Cheng Yeh, and Timothy Verstynen. "Integrating across neuroimaging modalities boosts prediction accuracy of cognitive ability." PLOS Computational Biology 17, no. 3 (2021): e1008347.)*. I do not mean that it needs to be tested directly against the SGL+tractometry method itself, but maybe add a section in the Discussion describing how this method compares to other common multivariate approaches used in computational neuroimaging like LASSO-PCR.

*Full disclosure: I am the senior author of these papers and am fully aware of how annoying it is for reviewers to ask that you cite their work. I’m not asking for citations per se, but a comparison of these PCR-based methods.

- The comparison of the SGL approach to traditional lasso seems limited. Traditional lasso will pick the best predictor out of a correlated set of predictors. Adding in the group constraint allows for keeping some degree of correlations/clustering in the final model (a good assumption when building predictors from brain imaging data sets). It seems to me that the appropriate comparison would be against a ridge model, since ridge allows for clustering of predictors, but in a different way than group lasso does. If I were to place money, my guess is that the performance of a ridge or elastic net model would sit somewhere in the middle of the lasso results and SGL results.

- There is substantial variability in the contributing pathways for age prediction across data sets. This seems incredibly important, but overlooked. One of the interpretive validations of the classifier results is that the pathways contributing to the classification make sense given the underlying pathologies of ALS. You’d assume that if age impacted the same white matter pathways, a validity test would be to show (i.e., quantify) that the same pathways predict age across data sets. But that’s not what we really see here beyond just a qualitative comparison of the arcuate fasciculus. Why is that? How similar or dissimilar are the tractometry results across datasets for this analysis?

- In the Methods it appears that the 3 data sets used in the age prediction differ substantially in their preprocessing routines. Is this a correct reading of the Methods? If so, how much could this variability explain the differences in model accuracies across data sets? Is there a way of understanding how specific preprocessing steps may impact model accuracy using this data?
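
The contrast drawn in the comments above between lasso and ridge on correlated predictors can be illustrated with a minimal sketch. It is not taken from the manuscript or from AFQ-Insight: the function names, the penalty strength `alpha=0.1`, and the toy data are all assumptions chosen for illustration, and exact duplication of a column is used as the limiting case of correlation. The sketch uses plain coordinate descent on the elastic net objective, which reduces to the lasso at `l1_ratio=1` and to ridge at `l1_ratio=0`.

```python
import random


def soft_threshold(z, t):
    """Soft-thresholding operator used in the lasso coordinate update."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def elastic_net_cd(X, y, alpha, l1_ratio, n_sweeps=500):
    """Coordinate descent for the (hypothetical, illustrative) objective
    (1/2n)||y - Xb||^2 + alpha*l1_ratio*||b||_1
                       + 0.5*alpha*(1 - l1_ratio)*||b||_2^2.
    """
    n, p = len(X), len(X[0])
    b = [0.0] * p
    resid = list(y)  # residual y - Xb; equals y because b starts at zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            # Correlation of column j with the residual that excludes column j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * b[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, alpha * l1_ratio) / (
                col_sq[j] + alpha * (1.0 - l1_ratio)
            )
            delta = new_bj - b[j]
            for i in range(n):
                resid[i] -= X[i][j] * delta
            b[j] = new_bj
    return b


# Toy data (made up): two identical predictors and a noisy linear target.
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(50)]
X = [[v, v] for v in x]
y = [2.0 * v + 0.1 * rng.gauss(0, 1) for v in x]

lasso = elastic_net_cd(X, y, alpha=0.1, l1_ratio=1.0)
ridge = elastic_net_cd(X, y, alpha=0.1, l1_ratio=0.0)
enet = elastic_net_cd(X, y, alpha=0.1, l1_ratio=0.5)

print("lasso:", lasso)        # weight concentrated on one of the two copies
print("ridge:", ridge)        # weight shared equally between the copies
print("elastic net:", enet)   # weight shared equally, with some shrinkage
```

In this toy setting the lasso keeps one copy of the duplicated predictor and zeroes the other, while ridge and elastic net spread the weight across both. The sketch only shows how the penalties distribute weights; it says nothing about the relative predictive accuracy of these models on tractometry data.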

MINOR POINTS

- Fig 1. Define acronyms (e.g., CFA, CST) in the caption.

- Since the arcuate is a major predicting pathway in the age prediction, it makes sense to show/highlight it in Fig. 1 for contrastive comparison, in the same way that CST pathways are highlighted in Fig. 2.

- Fig. 2b is not mentioned in the Results.

- Is one of the reasons that the prediction is so good in the ALS case that there are lesions in the descending CST pathways? These should be visible in the B0 images (or T2 FLAIRs if they are available). If so, this should be added to the text, as the interpretation of these results differs depending on whether the model is picking up on subclinical pathologies in the white matter signal or just detecting visible lesions.

Reviewer #2: The paper presents a method (Sparse Group Lasso, SGL) to analyse along-tract data derived from diffusion MRI. Along-tract profiling of various microstructural measures typically poses statistical problems due to the high-dimensional nature of the data. Here, data from 4 databases were employed to test the proposed SGL framework under two scenarios: classification and regression. In particular, the authors looked at the ROC AUC scores to classify ALS subjects based on their tract profiles. The authors found that SGL generally improved classification over a previously published approach based on Random Forest, while detecting the CST, a WM bundle typically affected by ALS, as an important feature. For regression, the authors used SGL to predict brain age on 3 different datasets.

The paper is well written and well-structured. As a recommendation, I believe that the reader would benefit from a summary of the input format for each dataset (i.e., it is unclear if the feature space was the same across datasets regarding the number of tracts and metrics.)

My major comment concerns the classification scenario. Although results may seem impressive, I wonder how difficult the task was at first glance. The authors failed to provide a comparison with standard classification approaches relying on more standard tract-averaging approaches. Would a simple tract average of FA disentangle both groups (e.g., using a standard deviation threshold)? I appreciate that the framework identified the CST as an important feature in an unsupervised fashion, but in this “straightforward” clinico-radiological experiment, one expects the CST to be the driving feature for ALS. In other words, it is unclear if along-tract profiling provides added benefit for classification here (i.e., are 100 features per tract, per metric really necessary over 1 single feature per tract per metric?)
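The trade-off raised in this comment, 100 features per tract per metric versus a single tract average, can be made concrete with a small synthetic sketch. Everything here is an assumption for illustration: the profiles are invented, the Gaussian dip is hypothetical, and the feature counts simply take the eighteen bundles named in the S1 Fig caption and two metrics (FA and MD). It says nothing about whether tract averages would suffice on the ALS data.

```python
import math

N_BUNDLES = 18   # assumption: the eighteen bundles named in the S1 Fig caption
N_METRICS = 2    # assumption: FA and MD
N_NODES = 100    # per the comment's "100 features per tract, per metric"


def synthetic_profile(n_nodes, dip_center, dip_depth, dip_width=5.0):
    """Hypothetical FA-like profile: flat baseline minus a localized Gaussian dip."""
    return [
        0.5 - dip_depth * math.exp(-((node - dip_center) ** 2) / (2.0 * dip_width ** 2))
        for node in range(n_nodes)
    ]


def tract_average(profile):
    """Collapse an along-tract profile into a single feature."""
    return sum(profile) / len(profile)


# Two invented subjects that differ only in a short segment of one bundle.
control = synthetic_profile(N_NODES, dip_center=50, dip_depth=0.0)
patient = synthetic_profile(N_NODES, dip_center=50, dip_depth=0.1)

mean_diff = tract_average(control) - tract_average(patient)
max_node_diff = max(c - p for c, p in zip(control, patient))

profile_features = N_BUNDLES * N_METRICS * N_NODES
averaged_features = N_BUNDLES * N_METRICS

print("features (profiles):", profile_features)
print("features (averages):", averaged_features)
print("difference in tract average:", mean_diff)
print("largest node-wise difference:", max_node_diff)
```

In this invented example the localized difference is several times larger at its peak node than in the tract average, at the cost of a feature space that is 100 times larger. Whether that extra resolution helps on real data is exactly the empirical question the comment poses.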

Minor comments:

“directional diffusion of water in each voxel [1].”

Stejskal, E. O., & Tanner, J. E. (1965). Spin diffusion measurements: spin echoes in the presence of a time‐dependent field gradient. The journal of chemical physics, 42(1), 288-292.

This approach is exhaustive, but statistical power is compromised by a multiple comparison problem [7, 9, 10].

Chamberland, M., Raven, E. P., Genc, S., Duffy, K., Descoteaux, M., Parker, G. D., ... & Jones, D. K. (2019). Dimensionality reduction of diffusion MRI measures for improved tractometry of the human brain. NeuroImage, 200, 89-100.

Fig. 1b) The red ROIs appear to be arbitrarily hand drawn over the corticospinal tract, which makes it confusing; e.g., were those starting-ending ROIs used to clip the CST profile or were they used for dissection only? I would say it is probably best to remove the drawing. The tract profile depicted in c) represents the traditional “bottom-to-top” CST FA profile with the well-known dip due to crossing fibers in the centrum semi-ovale. I would add the position along the tract along the x axis (c) so that uninitiated readers better appreciate the mapping from b to c.

Material, dataset 2: “…fitting the diffusion tensor model [51] in every voxel using a robust fit [52].” Please specify that this was done on the b=1000 s/mm2 shell only.

Typo: mahalnobis

Tractography: Please add details of missing parameters (step size, angular threshold, number of seeds, seeding method, number of streamlines per bundle per subject…)

Tractometry vs. along-tract profiling: I recommend paying special attention as to how both terms are mutually employed. In essence, tractometry can refer to the quantitative mapping of multiple measures averaged over a set of tracts, as defined by Bells et al. 2011 and de Santis et al. 2014, whereas along-tract profiling specifically refers to the process of profiling the various MR-derived metrics, as defined by co-author J. Yeatman.

Overall, the framework provides a great addition to the existing AFQ software suite, and I find it worthy of publication at PLOS Computational Biology, conditional on the aforementioned comments.

Reviewer #3: The authors present a multidimensional analysis method of white matter diffusion-based measures. It is based on sparse group lasso and allows informative features to be obtained. It has several advantages, such as being able to perform analysis along the tracts, to automatically determine relevant features, and to perform classification or regression.

The method is interesting and very sound and can be very useful for neuroscientists.

The manuscript is well written and explains quite clearly the method, experiments, and results.

However, I think that more detail could be given on the results obtained. These should be better explained and analyzed. The figures contain considerable information but are not sufficiently explained in the text.

In addition, for example, I do not see displays of the tracts, where the different zones detected as relevant are highlighted (especially for the study of ALS). There are some small plots of the centroids, but these are very small and do not show the actual tracts. What is CFA?

Also, there is not enough comparison with previous methods.

On the other hand, the limitations of the method with respect to previous methods need to be better discussed, considering different aspects such as complexity, processing time, the minimum sample size required, selection of hyperparameters. Is it always better to use this method?

Also, how is the transformation function specified?

Finally, there is not enough reference to the supplementary material in the text (or I did not see it). A discussion of the most important results could be in the main manuscript.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Timothy Verstynen

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009136.r003

Decision Letter 1

Roberto Toro, Daniele Marinazzo

31 May 2021

Dear Dr. Rokem,

We are pleased to inform you that your manuscript 'Multidimensional analysis and detection of informative features in human brain white matter' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Roberto Toro

Associate Editor

PLOS Computational Biology

Daniele Marinazzo

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have done an exceptional job addressing my concerns.

Reviewer #2: The authors have addressed most of my concerns. I recommend acceptance of the manuscript, although relevant information regarding the tract-generation process is still missing. See original comment:

Tractography: Please add a simple description of the tractography parameters that were employed (step size, angular threshold, number of seeds, seeding placement (WM, GM...), number of streamlines per bundle per subject…).

LaTeX link seems to be broken + typo: (see Figs 3d to 3f and supporing figures in ??).

Reviewer #3: The authors addressed all the comments and have largely improved the manuscript.

I think it is a very good work and a valuable contribution to the field.

My only observation is about the supplementary material. It has too many separate files. I suggest creating a single supplementary file with all figures with captions, and comments.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Timothy Verstynen

Reviewer #2: Yes: Maxime Chamberland

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009136.r004

Acceptance letter

Roberto Toro, Daniele Marinazzo

24 Jun 2021

PCOMPBIOL-D-21-00231R1

Multidimensional analysis and detection of informative features in human brain white matter

Dear Dr Richie-Halford,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofi Zombor

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Bundle profiles and regression coefficients for the ALS dataset FA.

    Diffusion metrics are plotted along the length of eighteen bundles: right corticospinal (CSTR), left corticospinal (CSTL), right uncinate (UNCR), left uncinate (UNCL), left inferior fronto-occipital fasciculus (IFOL), right inferior fronto-occipital fasciculus (IFOR), right arcuate (ARCR), left arcuate (ARCL), right thalamic radiation (ATRR), left thalamic radiation (ATRL), right cingulum cingulate (CGCR), left cingulum cingulate (CGCL), callosum forceps posterior (CFP), callosum forceps anterior (CFA), right inferior longitudinal fasciculus (ILFR), left inferior longitudinal fasciculus (ILFL), right superior longitudinal fasciculus (SLFR), and left superior longitudinal fasciculus (SLFL). FA is plotted on the left y-axis while the β̂ coefficients are displayed on the twin axis on the right-hand side. SGL selected the right corticospinal tract (CSTR) as important and regularized coefficients in the CSTL. Yet, there are also group FA differences in the CSTL. This highlights a potential drawback of the SGL method, discussed in the main text in the context of age regression. Namely, SGL is not guaranteed to identify all important features. In this case, if the diagnostic signal in the CSTL is redundant to that in the CSTR, SGL will regularize the CSTL features, thereby reducing its sparsity penalty without any corresponding increase in loss. This parsimony cuts both ways; it is a feature of the method when one seeks an efficient predictive model, but is a disadvantage of the method when one wants an exhaustive explanation of feature importance. We use the phrase “parsimony pitfall” to refer to the case when SGL regularizes away redundant but obviously important features.

    (TIF)
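    The two levels of sparsity mentioned in the S1 Fig caption can be sketched with the proximal operator of the sparse group lasso penalty. This is a minimal illustration under stated assumptions, not the AFQ-Insight implementation: the penalty is taken in the standard form λ(α‖β‖₁ + (1 − α) Σ_g √p_g ‖β_g‖₂), the function name `sgl_prox` is hypothetical, and the coefficient values are invented, with bundle names borrowed from the caption only as labels.

```python
import math


def sgl_prox(groups, lam, alpha, step=1.0):
    """Proximal operator of the sparse group lasso penalty
    lam * (alpha * ||b||_1 + (1 - alpha) * sum_g sqrt(p_g) * ||b_g||_2),
    applied group by group: elementwise soft-thresholding, then group shrinkage."""
    result = {}
    for name, coefs in groups.items():
        # Lasso part: shrink each coefficient toward zero.
        t1 = step * lam * alpha
        soft = [math.copysign(max(abs(c) - t1, 0.0), c) for c in coefs]
        # Group lasso part: shrink the whole group, possibly to exactly zero.
        norm = math.sqrt(sum(s * s for s in soft))
        t2 = step * lam * (1.0 - alpha) * math.sqrt(len(coefs))
        scale = max(1.0 - t2 / norm, 0.0) if norm > 0 else 0.0
        result[name] = [scale * s for s in soft]
    return result


# Invented coefficients: one bundle with a strong signal, one with a weaker one.
coefs = {
    "CSTR": [0.9, 1.2, 0.8, 0.1],
    "CSTL": [0.3, 0.4, 0.2, 0.05],
}
shrunk = sgl_prox(coefs, lam=0.5, alpha=0.5)

for bundle, values in shrunk.items():
    print(bundle, values)
```

    With these invented numbers the weaker group is set to zero as a whole, while the stronger group keeps its three largest coefficients and loses its smallest one. The sketch shows only the shrinkage mechanics; it does not model the redundancy between bundles that the caption describes as the parsimony pitfall.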

    S2 Fig. Bundle profiles and regression coefficients for the ALS dataset MD.

    The scale of the β̂-axis is identical to that used in S1 Fig, to facilitate the comparison of the relative importance of each metric.

    (TIF)

    S3 Fig. Bundle profiles and regression coefficients for the WH dataset FA.

    In contrast to the ALS classification case, the β̂ coefficients are distributed widely through the brain, supporting the interpretation that aging is a large and continuous whole-brain process.

    (TIF)

    S4 Fig. Bundle profiles and regression coefficients for the WH dataset MD.

    This and S3 Fig demonstrate that SGL behaves much more like the lasso than the group lasso, as discussed in the main text. The parsimony pitfall is most evident in the IFOL and IFOR bundles.

    (TIF)

    S5 Fig. FA bundle profiles and β̂ coefficients for age regression in the HBN dataset.

    Like the WH dataset, the β̂ coefficients are distributed widely through the brain and SGL behaves more like the lasso than the group lasso.

    (TIF)

    S6 Fig. MD bundle profiles and β̂ coefficients for age regression in the HBN dataset.

    In contrast to the WH results, the bundle profiles show different behaviors. For example, the SLFL and SLFR bundle profiles have different concavity. This is unsurprising, however, given the differences between these datasets (see also Discussion). The parsimony pitfall is most evident in the UNCL, UNCR, ARCL, SLFL, and SLFR bundles.

    (TIF)

    S7 Fig. FA bundle profiles and β̂ coefficients for age regression in the Cam-CAN dataset.

    The β̂ coefficients are distributed widely through the brain and SGL behaves more like the lasso than the group lasso. As before, one must be cautious about comparing bundle profiles and β̂ coefficients between models. While the HBN and Cam-CAN datasets share the same diffusion model and refrain from clipping streamlines, the age distributions for the two are roughly disjoint, with the WH age distribution straddling the two.

    (TIF)

    S8 Fig. Mean diffusivity (MD) bundle profiles and β̂ coefficients for age regression in the Cam-CAN dataset.

    (TIF)

    S9 Fig. Age distributions are similar between the different HBN sites.

    Rutgers University Brain Imaging Center (RU) and the CitiGroup Cornell Brain Imaging Center (CBIC).

    (TIF)

    S10 Fig. FA bundle profiles exhibit strong site differences in the HBN dataset.

    (TIF)

    S11 Fig. MD bundle profiles exhibit strong site differences in the HBN dataset.

    (TIF)

    S12 Fig. Site differences in FA are eliminated by ComBat harmonization.

    (TIF)

    S13 Fig. Site differences in MD are eliminated by ComBat harmonization.

    (TIF)

    Attachment

    Submitted filename: R1ReviewerComments.pdf

    Data Availability Statement

    The main software described in this study is available through GitHub at https://github.com/richford/AFQ-Insight. The version of software used in this study is also available at https://doi.org/10.5281/zenodo.4316000. To facilitate reproducibility and ease use of the software, the results presented in this paper are also provided in https://github.com/richford/afq-insight-paper as a series of Jupyter notebooks. We refer to four datasets in this work: ALS, WH, HBN, and Cam-CAN. The ALS dataset is openly available at https://doi.org/10.5281/zenodo.1161864. The WH dataset is openly available at https://doi.org/10.5281/zenodo.1161846. To facilitate acquisition of these data, we provide a ‘make data’ command in the paper repository at https://github.com/richford/afq-insight-paper. Data from the HBN study is available to access through: http://fcon_1000.projects.nitrc.org/indi/cmi_healthy_brain_network/. MRI data from this study is openly available. Phenotypic data can be freely accessed, upon establishing a data usage agreement with the Child Mind Institute, which is the data provider for this study. Data from the Cam-CAN study is available to access through: https://www.cam-can.org/index.php?content=dataset, and can be freely accessed upon agreeing to terms and conditions set forth by the Cambridge Centre for Ageing and Neuroscience, which is the data provider for this study.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS
