Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Jan 24.
Published in final edited form as: Apert Neuro. 2021 Nov 17;1(1):10.52294/e6198273-b8e3-4b63-babb-6e6b0da10669. doi: 10.52294/e6198273-b8e3-4b63-babb-6e6b0da10669

Evaluating the Reliability of Human Brain White Matter Tractometry

John Kruper a,b, Jason D Yeatman c,d, Adam Richie-Halford b, David Bloom a,b, Mareike Grotheer e,f, Sendy Caffarra c,d,g, Gregory Kiar h, Iliana I Karipidis i, Ethan Roy c, Bramsh Q Chandio j, Eleftherios Garyfallidis j, Ariel Rokem a,b
PMCID: PMC8785971  NIHMSID: NIHMS1737034  PMID: 35079748

Abstract

The validity of research results depends on the reliability of analysis methods. In recent years, there have been concerns about the validity of research that uses diffusion-weighted MRI (dMRI) to understand human brain white matter connections in vivo, in part based on the reliability of analysis methods used in this field. We defined and assessed three dimensions of reliability in dMRI-based tractometry, an analysis technique that assesses the physical properties of white matter pathways: (1) reproducibility, (2) test-retest reliability, and (3) robustness. To facilitate reproducibility, we provide software that automates tractometry (https://yeatmanlab.github.io/pyAFQ). In measurements from the Human Connectome Project, as well as clinical-grade measurements, we find that tractometry has high test-retest reliability that is comparable to most standardized clinical assessment tools. We find that tractometry is also robust: showing high reliability with different choices of analysis algorithms. Taken together, our results suggest that tractometry is a reliable approach to analysis of white matter connections. The overall approach taken here both demonstrates the specific trustworthiness of tractometry analysis and outlines what researchers can do to establish the reliability of computational analysis pipelines in neuroimaging.

Keywords: Diffusion MRI, Brain Connectivity, Tractography, Reproducibility, Robustness

INTRODUCTION

The white matter of the brain contains the long-range connections between distant cortical regions. The integration and coordination of brain activity through the fascicles containing these connections are important for information processing and for brain health (1, 2). Using voxel-specific directional diffusion information from diffusion-weighted MRI (dMRI), computational tractography produces three-dimensional trajectories through the white matter within the MRI volume that are called streamlines (3, 4). Collections of streamlines that match the location and direction of major white matter pathways within an individual can be generated with different strategies: using probabilistic (5, 6) or streamline-based (7, 8) atlases or known anatomical landmarks (912). Because these are models of the anatomy, we refer to these estimates as bundles to distinguish them from the anatomical pathways themselves. The delineation of well-known anatomical pathways overcomes many of the concerns about confounds in dMRI-based tractography (13, 14), because “brain connections derived from diffusion MRI tractography can be highly anatomically accurate – if we know where white matter pathways start, where they end, and where they do not go” (15).

The physical properties of brain tissue affect the diffusion of water, and the microstructure of tissue within the white matter along the length of computationally generated bundles can be assessed using a variety of models (16, 17). Taken together, computational tractography, bundle recognition, and diffusion modeling provide so-called tract profiles: estimates of microstructural properties of tissue along the length of major pathways. This is the basis of tractometry: statistical analysis that compares different groups or assesses individual variability in brain connection structure (9, 1821). For the inferences made from tractometry to be valid and useful, tract profiles need to be reliable.

In the present work, we provide an assessment of three different ways in which scientific results can be reliable: reproducibility, test-retest reliability (TRR), and robustness. These terms are often debated, and conflicting definitions for these terms have been proposed (22, 23). Here, we use the definitions proposed in (24). Reproducibility is defined as the case in which data and methods are fully accessible and usable: running the same code with the same data should produce an identical result. Use of different data (e.g., in a test-retest experiment) resulting in quantitatively comparable results would denote TRR. In clinical science and psychology in general, TRR (e.g., in the form of inter-rater reliability) is considered a key metric of the reliability of a measurement. Use of a different analysis approach or different analysis system (e.g., different software implementation of the same ideas) could result in similar conclusions, denoting their robustness to implementation details. The recent findings of Botvinik-Nezer et al. (25) show that even when full computational reproducibility is achieved, the results of analyzing a single functional MRI (fMRI) dataset can vary significantly between teams and analysis pipelines, demonstrating issues of robustness.

The contribution of the present work is three-fold: to support reproducible research using tractometry, we developed an open-source software library called Automated Fiber Quantification in Python (pyAFQ; https://yeatmanlab.github.io/pyAFQ). Given dMRI data that has undergone standard preprocessing (e.g., using QSIprep (26)), pyAFQ automatically performs tractography, classifies streamlines into bundles representing the major tracts, and extracts tract profiles of diffusion properties along those bundles, producing “tidy” CSV output files (27) that are amenable to further statistical analysis (Fig. S1). The library implements the major functionality provided by a previous MATLAB implementation of tractometry analysis (9) and offers a menu of configurable algorithms allowing researchers to tune the pipeline to their specific scientific questions (Fig. S2). Second, we use pyAFQ to assess TRR of tractometry results. Third, we assess robustness of tractometry results to variations across different models of the diffusion in individual voxels, across different bundle recognition approaches, and across different implementations.

MATERIALS AND METHODS pyAFQ

We developed an open-source tractometry software library to support computational reproducibility: pyAFQ. The software relies heavily on methods implemented in Diffusion Imaging in Python (DIPY) (28). Our implementation was also guided by a previous MATLAB implementation of tractometry (mAFQ) (9). More details are available in the “Automated Fiber Quantification in Python (pyAFQ)” section of Supplementary Methods.

Tractometry

The pyAFQ software is configurable, allowing users to specify methods and parameters for different stages of the analysis (Fig. S2). Here, we will describe the default setting. In the first step, computational tractography methods, implemented in DIPY (28), are used to generate streamlines throughout the brain white matter (Fig. S1A). Next, the T1-weighted Montreal Neurological Institute (MNI) template (29, 30) is registered to the anisotropic power map (APM) (31, 32) computed from the diffusion data that has a T1-like contrast (Fig. S1B) using the symmetric image normalization method (33) implemented in DIPY (28). The next step is to perform bundle recognition, where each tractography streamline is classified as either belonging to a particular bundle or discarded. We use the transformation found during registration to bring canonical anatomical landmarks, such as waypoint regions of interest (ROIs) and probability maps, from template space to the individual subject’s native space. Waypoint ROIs are used to delineate the trajectory of the bundles (34). See Table S1 for the bundle abbreviations we use in this paper. Streamlines that pass through inclusion waypoint ROIs for a particular bundle, and do not pass through exclusion ROI, are selected as candidates to include in the bundle. In addition, a probabilistic atlas (35) is used as a tiebreaker to determine whether a streamline is more likely to belong to one bundle or another (in cases where the streamline matches the criteria for inclusion in either). For example, the corticospinal tract is identified by finding streamlines that pass through an axial waypoint ROI in the brainstem and another ROI axially oriented in the white matter of the corona radiata but that do not pass through the midline (Fig. S1C). The final step is to extract the tract profile: each streamline is resampled to a fixed number of points, and the mean value of a diffusion-derived scalar (e.g., fractional anisotropy (FA) and mean diffusivity (MD)) is found for each one of these nodes. The values are summarized by weighting the contribution of each streamline, based on how concordant the trajectory of this streamline is with respect to the other streamlines in the bundle (Fig. S1D). To make sure that profiles represent properties of the core white matter, we remove the first and last five nodes of the profile, then further remove any nodes where either the FA is less than 0.2 or the MD is greater than 0.002. This removes nodes that contain partial volume artifacts (16).

Data

We used two datasets with test-retest measurements. We used Human Connectome Project test-retest (HCP-TR) measurements of dMRI for 44 neurologically healthy subjects aged 22–35 (36). The other is an experimental dataset, with dMRI from 48 children, aged 5 years old, collected at the University of Washington (UW-PREK). More details about the measurement are available in the “Data” section of Supplementary Methods.

HCP-TR configurations

We processed HCP-TR with three different pyAFQ configurations. In the first configuration, we used the diffusional kurtosis imaging (DKI) model as the orientation distribution function (ODF) model. In the second configuration, we used constrained spherical deconvolution (CSD) as the ODF model. For the final configuration, we used RecoBundles (8) for bundle recognition instead of the default waypoint ROI approach, and DKI as the ODF model. More details are available in the “Configurations” section of Supplementary Methods.

Measures of reliability

Tract recognition of each bundle was compared across measurements and methods using the Dice coefficient, weighted by streamline count (wDSC) (37). Tract profiles were compared with three measures: (1) profile reliability: mean intraclass correlation coefficient (ICC) across points in different tract profiles for different data, which quantifies the agreement of tract profiles (38, 39); (2) subject reliability: Spearman’s rank correlation coefficient (Spearman’s ρ) between the means of the tract profiles across individuals, which quantifies the consistency of the mean of tract profiles; and (3) an adjusted contrast index profile (ACIP): to directly compare the values of individual nodes in the tract profiles in different measurements. To estimate TRR, the above measures were calculated for each individual across different measurements, and to estimate robustness, these were calculated for each individual across different analysis methods. For example, if we calculated the subject reliability across measurements, we would call that “subject TRR,” and if we calculated the subject reliability across analysis methods, we would call that “subject robustness.” We explain profile and subject reliability in more detail below; we explain wDSC and ACIP in more detail in equations 1 and 2 in the “Measures of Reliability” section of the Supplementary Methods.

Profile reliability

We use profile reliability to compare the shapes of profiles per bundle and per scalar. Given two sets of data (either from test-retest analysis or from different analyses), we first calculate the ICC between tract profiles for each subject in a given bundle and scalar. Then, we take the mean of those correlations. We do this for every bundle and for every scalar. We call this profile reliability because larger differences in the overall values along the profiles will result in a smaller mean of the ICC. Consistent profile shapes are important for distinguishing bundles. Profile reliability provides an assessment of the overall reliability of the tract profiles, summarizing over the full length of the bundle, for a particular scalar. We calculate the 95% confidence interval on profile reliabilities using the standard error of the measurement.

In some cases, there is low between-subject variance in tract profile shape (e.g., this is often the case in corticospinal tract (CST)). We use ICC to account for this, as ICC will penalize low between-subject variance in addition to rewarding high within-subject variance. Profile reliability is a way of quantifying the agreement between profiles. Qualitatively, we use four descriptions for profile reliability: excellent (ICC > 0.75), good (ICC = 0.60 to 0.74), fair (ICC = 0.40 to 0.59), and poor (ICC < 0.40) (40).

Subject reliability

We calculate subject reliability to compare individual differences in profiles, per bundle and per scalar, following (41). Given two measurements for each subject, we first take the mean of each profile within each individual, measurement and scalar. Then, we calculate Spearman’s ρ from the means from different subjects for a given bundle and scalar across the measurements. High subject reliability means the ordering of an individual’s tract profile mean among other individuals is consistent across measurements or methods. This is akin to test reliability that is computed for any clinical measure.

One downside of subject reliability is that the shape of the extracted profile is not considered. Additionally, if one measurement or method produces higher values for all subjects uniformly, subject reliability would not be affected. Instead, the intent of subject reliability is to well summarize the preservation of relative differences between individuals for mean tract profiles. In other words, subject reliability quantifies the consistency of mean profiles. The 95% confidence interval on subject reliabilities is parametric.

RESULTS

Tractometry using pyAFQ classifies streamlines into bundles that represent major anatomical pathways. The streamlines are used to sample dMRI-derived scalars into bundle profiles that are calculated for every individual and can be summarized for a group of subjects. An example of the process and result of the tract profile extraction process is shown in Fig. S3 together with the results of this process across the 18 major white matter pathways for all subjects in the HCP-TR dataset.

Assessing TRR of tractometry

In datasets with scan-rescan data, we can assess TRR at several different levels of tractometry. For example, the correlation between two profiles provides a measure of the reliability of the overall tract profile in that subject. Analyzing the HCP-TR dataset, we find that for FA calculated using DKI, the values of profile reliability vary across subjects (Fig. 1A), but they overall tend to be rather high, with the average value within each bundle in the range of 0.77 ± 0.05 to 0.92 ± 0.02 and a median across bundles of 0.86 (Fig. 1B). We find similar results for MD (Fig. S4) and replicate similar results in a second dataset (Fig. 3B).

Fig. 1. Fractional anisotropy (FA) profile test-retest reliability (TRR).

Fig. 1.

(A) Histograms of individual subject intraclass correlation coefficient (ICC) between the FA tract profiles across sessions for a given bundle. Colors encode the bundles, matching the diagram showing the rough anatomical positions of the bundles for the left side of the brain (center). (B) Mean (± 95% confidence interval) TRR for each bundle, color-coded to match the histograms and the bundles diagram, with median across bundles in red.

Fig. 3. Weighted Dice similarity coefficient (wDSC), profile, and subject test-retest reliability (TRR) of Python Automated Fiber Quantification (pyAFQ) and MATLAB Automated Fiber Quantification (mAFQ) on University of Washington (UW-PREK); pyAFQ on Human Connectome Project test-retest (HCP-TR) using different orientation distribution function (ODF) models; and Reproducible Tract Profile (RTP) on HCP-TR.

Fig. 3.

Colors indicate bundle. (A) Texture indicates the dataset and methods being compared. Error bars show the 95% confidence interval. (B, D, and F) Profile TRR and (C, E, and G) subject TRR. Profile and subject TRR calculations are demonstrated with HCP-TR using diffusion kurtosis model (DKM) in figures 1 and 2 respectively. (B, C) Comparison of the TRR of mAFQ and pyAFQ on UW-PREK. (D, E) Comparison of pyAFQ and RTP on HCP-TR using only single shell data. (F, G) Comparison of DKI and CSD TRR on HCP-TR. Point shapes indicate the extracted scalar. The red dotted line is equal TRR between methods.

Subject reliability assesses the reliability of mean tract profiles across individuals. Subject FA TRR in the HCP-TR also tends to be high, but the values vary more across bundles with a range of 0.57 ± 0.24 to 0.85 ± 0.12 and a median across bundles of 0.73. We can see that subject TRR is lower than profile TRR (Fig. 2). This trend is consistent for MD (Fig. S5) as well as for another dataset (Fig. 3C).

Fig. 2. Subject test-retest reliability.

Fig. 2.

(A) Mean tract profiles for a given bundle and the fractional anisotropy (FA) scalar for each subject using the first and second session of Human Connectome Project test-retest (HCP-TR). Colors encode bundle information, matching the core of the bundles (center). (B) Subject reliability is calculated from the Spearman’s ρ of these distributions, with median across bundles in red (± 95% confidence interval).

TRR of tractometry in different implementations, datasets, and tractography methods

We compared TRR across datasets and implementations. In both datasets, we found high TRR in the results of tractography and bundle recognition: wDSC was larger than 0.7 for all but one bundle (Fig. 3A): the delineation of the anterior forceps (FA bundle) seems relatively unreliable using pyAFQ in the UW-PREK dataset (using the FA scalar, pyAFQ subject TRR is only 0.37 ± 0.28 compared to mAFQ’s 0.84 ± 0. 10). We found overall high-profile TRR that did not always translate to high subject TRR (Fig. 3BG). For example, for FA in UW-PREK, median profile TRRs are 0.75 for pyAFQ and 0.77 for mAFQ, while median subject TRRs are 0.70 for pyAFQ and 0.75 for mAFQ. Note that profile and subject TRRs have different denominators (e.g., subjects that have similar mean profiles to each other would have low subject TRR, even if the profiles are reliable, because it is harder to distinguish between subjects in this case). mAFQ is one of the most popular software pipelines currently available for tractometry analysis, so it provides an important point for comparison. In comparing different software implementations, we found that mAFQ has higher subject TRR relative to pyAFQ in the UW-PREK dataset, when TRR is relatively low for pyAFQ (see the FA bundle, CST L, and ATR L in Fig. 3C). On the other hand, in the HCP-TR dataset pyAFQ, we used the Reproducible Tract Profile (RTP) pipeline (42, 43), which is an extension of mAFQ, and found that pyAFQ tends to have slightly higher profile TRR than RTP for MD but slightly lower profile TRR for FA (Fig. 3D). The pyAFQ and RTP subject TRR are highly comparable (Fig. 3E). In FA, the median pyAFQ subject TRR for FA is 0.76, while the median RTP subject TRR is 0.74. Comparing different ODF models in pyAFQ, we found that the DKI and CSD ODF models have highly similar TRR, both at the level of wDSC (Fig. 3A) and at the level of profile and subject TRRs (Fig. 3F, G).

Robustness: comparison between distinct tractography models and bundles recognition algorithms

To assess the robustness of tractometry results to different models and algorithms, we used the same measures that were used to calculate TRR.

Tractometry results can be robust to differences in ODF models used in tractography

We compared two algorithms: tractography using DKI- and CSD-derived ODFs. The weighted Dice similarity coefficient (wDSC) for this comparison can be rather high in some cases (e.g., the uncinate and corticospinal tracts, Fig. 4A) but produce results that appear very different for some bundles, such as the arcuate and superior longitudinal fasciculi (ARC and SLF) (see also Fig. 4D). Despite these discrepancies, profile and subject robustness are high for most bundles (median FA of 0.77 and 0.75, respectively) (Fig. 4B, C). In contrast to the results found in TRR, MD subject robustness is consistently higher than FA subject robustness. The two bundles with the most marked differences between the two ODF models are the SLF and ARC (Fig. 4D). These bundles have low wDSC and profile robustness, yet their subject robustness remains remarkably high (in FA, 0.75 ± 0.17 for ARC R and 0.88 ± 0.09 for SLF R) (Fig. 4C). These differences are partially explained due to the fact that there are systematic biases in the sampling of white matter by bundles generated with these two ODF models, as demonstrated by the non-zero ACIP between the two models (Fig. 4E).

Fig. 4. Orientation distribution function (ODF) model robustness.

Fig. 4.

We compared diffusion kurtosis model (DKI)- and constrained spherical deconvolution (CSD)-derived tractography. Colors encode bundle information as in Figs. 1 and 2. Textured hatching encodes fractional anisotropy/mean diffusivity (FA/MD) information. (A) weighted Dice similarity coefficient (wDSC) robustness. (B) Profile robustness. (C) Subject robustness. Error bars represent 95% confidence interval. (D, E) Adjusted contrast index profile (ACIP) between left arcuate and left superior longitudinal fasciculi (ARC L and SLF L) tract profiles of each algorithm. Positive adjusted contrast index (ACI) indicates DKI found a higher value of FA than CSD at that node. The 95% confidence interval on the mean is shaded. (F) Tractography and bundle recognition results for ARC L and SLF L, respectively, for one example subject.

Most white matter bundles are highly robust across bundle recognition methods

We compared bundle recognition with the same tractography results using two different approaches: the default waypoint ROI approach (9) and an alternative approach (RecoBundles) that uses atlas templates in the space of the streamlines (44). Between these algorithms, wDSC is around or above 0.6 for all but one bundle, Right Inferior Longitudinal Fasciculus (ILF R) (Fig. 5). There is an asymmetry in the ILF atlas bundle (7), which results in discrepancies between ILF R recognized with waypoint ROIs and with RecoBundles. Despite this bundle, we find high robustness overall. For MD, the first quartile subject robustness is 0.82 (Fig. 5C, D).

Fig. 5. Recognition algorithm robustness.

Fig. 5.

(A) Weighted Dice similarity coefficient (wDSC). (B) Profile robustness. (C) Subject robustness. Error bars show the 95% confidence interval. (D) The right inferior longitudinal fasciculus (ILF R) fractional anisotropy (FA) adjusted contrast index profile (ACIP), where positive ACI indicating RecoBundles found a higher value of FA than the waypoint regions of interest (ROIs) approach at that node. (E) The ILF R found by each algorithm for an example subject.

Tractometry results are robust to differences in software implementation

Overall, we found that robustness of tractometry across these different software implementations is high in most white matter bundles. In the mAFQ/pyAFQ comparison, most bundles have a wDSC around or above 0.8, except the two callosal bundles (FA bundle and forceps posterior (FP)), which have a much lower overlap (Fig. 6A). Consistent with this pattern, profile and subject robustness are also overall rather high (Fig. 6B, C). The median values across bundles are 0.71 and 0.77 for FA profile and subject robustness, respectively.

Fig. 6. Robustness between Python Automated Fiber Quantification (pyAFQ) and MATLAB Automated Fiber Quantification (mAFQ) on University of Washington (UW-PREK) session #1 data.

Fig. 6.

(A) Adjusted contrast index profile (ACIP) between the fractional anisotropy (FA) tract profiles from UW-PREK using pyAFQ and mAFQ. Positive ACI indicates pyAFQ found a higher value than mAFQ at that node. The 95% confidence interval on the mean is shaded. Robustness in wDSC (B) bundle profiles (C) and across subjects (D). Error bars show the 95% confidence interval.

For some bundles, like the right and left uncinate (UNC R and UNC L), there is large agreement between pyAFQ and mAFQ (for subject FA: UNC L ρ = 0.90 ± 0.07, UNC R ρ = 0.89 ± 0.08). However, the callosal bundles have particularly low MD profile robustness (0.07 ± 0.09 for FP, 0.18 ± 0.09 for FA) (Fig. 6B).

The robustness of tractometry to the differences between the pyAFQ and mAFQ implementation depends on the bundle, scalar, and reliability metric. In addition, for many bundles, the ACIP between mAFQ and pyAFQ results is very close to 0, indicating no systematic differences (Fig. 6D). In some bundles – the CST and the anterior thalamic radiations (ATR) – there are small systematic differences between mAFQ and pyAFQ. In the forceps posterior (FP), pyAFQ consistently finds smaller FA values than mAFQ in a section on the left side. Notice that the forceps anterior has an ACIP that deviates only slightly from 0, even though the forceps recognitions did not have as much overlap as other bundle recognitions (see Fig. 6A).

DISCUSSION

Previous work has called into question the reliability of neuroimaging analysis (e.g., (25, 45, 46)). We assessed the reliability of a specific approach, tractometry, which is grounded in decades of anatomical knowledge, and we demonstrated that this approach is reproducible, reliable, and robust. A tractometry analysis typically combines the outputs of tractography with diffusion reconstruction at the level of the individual voxels within each bundle. One of the major challenges facing researchers who use tractometry is that there are many ways to analyze diffusion data, including different models of diffusion at the level of individual voxels; techniques to connect voxels through tractography; and approaches to classify tractography results into major white matter bundles. Here, we analyzed the reliability of tractometry analysis at several different levels. We analyzed both TRR of tractometry results and their robustness to changes in analytic details, such as choice of tractography method, bundle recognition algorithm, and software implementation (Fig. 6).

Test-retest reliability of tractometry

TRR of tractometry is usually rather high, comparable in some tracts and measurements to the TRR of the measurement. In comparing the HCP-TR analysis and UW-PREK analysis, we note that higher measurement reliability goes hand in hand with tractometry reliability.

In terms of the anatomical definitions of the bundles, quantified as the TRR wDSC, we find reliable results in both datasets and with both software implementations and both tractography methods that we tested. With pyAFQ, we found a relatively low TRR in the frontal callosal bundle (FA bundle) in the UW-PREK dataset. This could be due to the sensitivity of the definition of this bundle to susceptibility distortion artifacts in the frontal poles of the two hemispheres. This low TRR was not found with mAFQ, suggesting that this low TRR is not a necessary feature of the analysis and is a potential avenue for improvement to pyAFQ. While the two implementations were created by teams with partial overlap and despite the fact that pyAFQ implementation drew both inspiration as well as specific implementation details from mAFQ, many details of implementation still differ substantially. For example, the implementations of tractography algorithms are quite different – pyAFQ relies on DIPY (28) for its tractography, while mAFQ uses implementations provided in Vistasoft (47). The two pipelines also use different registration algorithms, with pyAFQ relying on the symmetric diffeomorphic registration (SyN) algorithm (33), while mAFQ relies on registration methods implemented as part of the Statistical Parametric Mapping (SPM) software (48). These differences may explain the discrepancies observed.

We also find that TRR is high at the level of profiles within subjects and mean tract profiles across subjects. This is generally observed in both datasets that we examined and using different analysis methods and software implementations. For the UW-PREK dataset, subject TRR tends to be higher in mAFQ than in pyAFQ. On the other hand, for the HCP-TR dataset, pyAFQ subject TRR tends to be higher than that obtained with RTP, which is a fork and extension of mAFQ (42, 43). Generally, TRR of FA profiles and TRR of mean FA across subjects tend to be higher than those of MD. This could be because the assessment of MD is more sensitive to partial volume effects. In contrast to FA, MD is also not bounded, which means that extreme values at the boundaries of tissue types can have a substantial effect on TRR.

Robustness of tractometry

As highlighted in the recent work by Botvinik-Nezer et al. (25) and in parallel by Schilling et al. (45), inferences from even a single dataset can vary significantly, depending on the decisions and analysis pipelines that are used. The analysis approaches used in tractometry embody many assumptions made at the different stages of analysis: the model of the signal in each individual voxel, the manner in which streamlines are generated in tractography, the definition of bundles, and the extraction of tract profiles. While TRR is important, it does not guard against systematic errors in the analysis approach. One way to test model assumptions and software failures is to create ground truth data against which different methods and implementations can be tested (13, 49, 50). However, this approach also relies on certain assumptions about the mechanisms that generate the data that is considered ground truth, making this approach more straightforward for some methods than others. Here, we instead assessed the robustness of tractometry results to perturbations of analytic components, focusing on the modeling of ODFs in individual voxels and the approach taken to bundle recognition.

Subject robustness remains high despite differences in the spatial extent of bundles

We replicated previous findings that the definition of major bundles can vary in terms of their spatial extent (quantified via wDSC) (13, 37, 40, 45), depending on the software implementation or the ODF model used. As we showed, low wDSC robustness often corresponds to low profile robustness and vice versa (Figs. 4A and B, 5A and B, 6B and C). That is, when two algorithms detect bundles with small spatial overlap, the shape of the resulting tract profiles is also different from each other. However, low wDSC and profile robustness does not always translate to low subject robustness. Algorithms can detect bundles with low spatial overlap and of different shapes yet still agree on the ordering of the mean of the profiles, that is, which subjects have high or low FA in a given bundle. A clear example of this is the SLF and ARC in Fig. 4 (wDSC and profile robustness are low, yet subject robustness is very high). This suggests that tractometry can overcome failures in precise delineation of the major bundles by averaging tissue properties within the core of the white matter. Conversely, important details that are sensitive to these choices may be missed when averaging along the length of the tracts. Moreover, this may also reflect biases in the measurement that cannot be overcome at either stage of the analysis: tractography or bundle recognition.

Our high subject-level robustness results (Figs. 4C, 5C, 6C) dovetail with the results of a recently published study that used tractometry in a sample of 45 participants (51) and found high subject-level correlations between the mean tract values of FA and MD for two different pipelines: deterministic tractography using the diffusion tensor model (DTI) as the ODF model (essentially identical to a pipeline used in our supplementary analysis, described in “DTI Configuration”) and probabilistic tractography using CSD as the ODF model. Consistent with our results on the HCP-TR dataset, slightly higher subject robustness was found for MD than for FA.

Exceptions and limitations

High profile robustness did not always imply high subject robustness (e.g., the FP in Fig. 4 has high profile robustness but low subject robustness). This suggests that there are other sources of between-subject variance that do not correspond directly to profile robustness within an individual.

There are still significant challenges to robustness that arise from the way in which the major bundles are defined. This problem was highlighted in recent work that demonstrated that different researchers use different criteria to define bundles of streamlines that represent the same tract (45). In our case, this challenge is represented by the relatively low robustness between the waypoint ROI algorithm for bundle definition and the RecoBundles algorithm. In this comparison, the wDSC exceeds 0.8 in only one bundle and is below 0.4 in two cases. While both algorithms identify a bundle of streamlines that represents the right ILF, this bundle differs substantially between the two algorithms. Even so, profile and subject robustness can still be rather high, even in cases in which a rather middling overlap is found between the anatomical extents of the bundles. This challenge not only highlights the need for more precise definitions of the models of brain tracts that are derived from dMRI but also highlights the need for clear, automated, and reproducible software to perform bundle recognition.

In addition to decisions about analysis approach, which may be theoretically motivated, software implementations may contain systematic errors in executing the different steps and different software may be prone to different kinds of failure modes. Since other software implementations (9, 42) of the AFQ approach have been in widespread use in multiple different datasets and research settings, we also compared the results across different software implementations (Fig. 6). While there are some systematic differences between implementations, tractometry is overall quite robust to differences between software implementations.

Another important limitation of this work is that we have only analyzed samples of healthy individuals. Where brains are severely deformed (e.g., in TBI, brain tumors, and so forth), particular care would be needed to check the results of bundle recognition, and separate considerations would be needed in order to reach conclusions about the reliability of the inferences made.

Computational reproducibility via open-source software

Reproducibility is a bedrock of science, but achieving full computational reproducibility is a high bar that requires access to the software, data, and computational environment that a researcher uses (22). One of the goals of pyAFQ is to provide a platform for reproducible tractometry. It is embedded in an ecosystem of tools for reproducible neuroimaging and is extensible. This is shown in Fig. S6 and Fig. S2 and is further discussed in “Supplementary Discussion of pyAFQ.” Results from the present article and supplements can be reproduced using a set of Jupyter notebooks provided here: https://github.com/36000/Tractometry_TRR_and_robustness. After installing the version of pyAFQ that we used (0.6), reproduction should be straightforward on standard operating systems and architectures or in cloud computing systems (see the set of Jupyter notebooks linked to above, and Supplementary Methods). In the UW-PREK dataset, we shared the tract profiles and we provide web-based visualizations using a tool that was previously developed for transparent data sharing of tractometry data (52): https://yeatmanlab.github.io/UW_PREK_pyAFQ_pre_browser and https://yeatmanlab.github.io/UW_PREK_pyAFQ_post_browser.

The HCP-TR dataset is relatively straightforward for others to access in its preprocessed form through the HCP, and because the study IDs can be openly shared in our code, anyone with such access should be able to reproduce the figures in full. Using these resources, it should be possible to re-execute our workflows and replicate most of our results (53). For example, if other researchers would be interested in comparing our TRR results to another tractometry pipeline (e.g., TRACULA (11), another popular tractometry pipeline) or another bundle recognition algorithm (e.g., TractSeg (54), which uses a neural network to recognize bundles, or Classifyber (55), which uses a linear classifier), they could do so with the HCP-TR dataset, inspired by our scripts and the visualization tools in the pyAFQ software.

Future work

There are many aspects of reliability that could be further explored. We explored robustness with respect to ODF models and bundle recognition algorithms; robustness could also be explored with respect to data acquisition parameters within the same subject; preprocessing methods; profile extraction method (e.g., comparing our current approach with the BUndle ANalytics (BUAN) (56)); and the effects of profile realignment on tract profile reliability (57). Another possibility for teasing apart measurement and tractography effects would be to test profile TRR using the streamline of one scan on the results of the second scan (by registering the streamline themselves, to avoid data interpolation in volume registration). This could tease apart the effects of tractography from the voxel-level models of tissue properties, because it is not necessary that these would be sensitive to the same constraints (e.g., different sensitivity to noise). The methods we demonstrate and resources we provide in this paper should be useful for anyone wishing to further explore reliability in tractometry.

Supplementary Material

1

ACKNOWLEDGMENTS

This work was supported through grant 1RF1MH121868–01 from the National Institute of Mental Health/the BRAIN Initiative, through grant 5R01EB027585–02 to Eleftherios Garyfallidis (Indiana University) from the National Institute of Biomedical Imaging and Bioengineering, through Azure Cloud Computing Credits for Research & Teaching provided through the University of Washington’s Research Computing unit and the University of Washington eScience Institute, and NICHD R21HD092771 to Jason D. Yeatman. We are also grateful for support from the Gordon and Betty Moore Foundation and the Alfred P. Sloan Foundation to the University of Washington eScience Institute Data Science Environment, as well as support from the Washington Research Foundation to the eScience Institute and to the University of Washington Institute for Neuroengineering. Thanks to Andreas Neef for feedback on the pyAFQ software. Data was provided in part by the Human Connectome Project, WU-Minn Consortium (principal investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657), funded by the 16 National Institutes of Health (NIH) institutes and centers that support the NIH Blueprint for Neuroscience Research, and by the McDonnell Center for Systems Neuroscience at Washington University.

BIBLIOGRAPHY

  • 1.Petersen Steven E. and Sporns Olaf. Brain Networks and Cognitive Architectures. Neuron, 88(1):207–219, October 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bassett Danielle S. and Sporns Olaf. Network neuroscience. Nat. Neurosci, 20(3):353–364, February 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Conturo Thomas E., Lori Nicolas F., Cull Thomas S., Akbudak Erbil, Snyder Abraham Z., Shimony Joshua S., McKinstry Robert C., Burton Harold, and Raichle Marus. Tracking neuronal fiber pathways in the living human brain. Proc. Natl. Acad. Sci. U. S. A, 96(18):10422–10427, August 1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mori Susumu and Van Zijl Peter C. M. Fiber tracking: principles and strategies–a technical review. NMR Biomed, 15(7–8):468–480, 2002. [DOI] [PubMed] [Google Scholar]
  • 5.Wakana Setsu, Jiang Hangyi, Nagae-Poetscher Lidia M., van Zijl Peter C. M., and Mori Susumu. Fiber tract-based atlas of human white matter anatomy. Radiology, 230(1):77–87, January 2004. [DOI] [PubMed] [Google Scholar]
  • 6.Oishi Kenichi, Zilles Karl, Amunts Katrin, Faria Andreia, Jiang Hangyi, Li Xin, Akhter Kazi, Hua Kegang, Woods Roger, Toga Arthur W., Bruce Pike G, Rosa-Neto Pedro, Evans Alan, Zhang Jiangyang, Huang Hao, Miller Michael I., van Zijl Peter C. M., Mazziotta John, and Mori Susumu. Human brain white matter atlas: identification and assignment of common anatomical structures in superficial white matter. Neuroimage, 43(3):447–457, November 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Yeh Fang-Cheng, Panesar Sandip, Fernandes David, Meola Antonio, Yoshino Masanori, Fernandez-Miranda Juan C., Vettel Jean M., and Verstynen Timothy. Population-averaged atlas of the macroscale human structural connectome and its network topology. Neuroimage, 178:57–68, 2018. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2018.05.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Garyfallidis Eleftherios, Côté Marc-Alexandre, Rheault Francois, Sidhu Jasmeen, Hau Janice, Petit Laurent, Fortin David, Cunanne Stephen, and Descoteaux Maxime. Recognition of white matter bundles using local and global streamline-based registration and clustering. Neuroimage, July 2017. doi: 10.1016/j.neuroimage.2017.07.015. [DOI] [PubMed]
  • 9.Yeatman Jason D., Dougherty Robert F., Myall Nathaniel J., Wandell Brian A., and Feldman Heidi M. Tract profiles of white matter properties: automating fiber-tract quantification. PLOS ONE, 7(11):e49790, November 2012. ISSN 1932–6203. doi: 10.1371/journal.pone.0049790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Catani Marco and Schotten Michel Thiebaut de. A diffusion tensor imaging tractography atlas for virtual in vivo dissections. Cortex, 44(8):1105–1132, September 2008. [DOI] [PubMed] [Google Scholar]
  • 11.Yendiki Anastasia, Panneck Patricia, Srinivasan Priti, Stevens Allison, Zöllei Lilla, Augustinack Jean, Wang Ruopeng, Salat David, Ehrlich Stefan, Behrens Tim, Jbabdi Saad, Gollub Randy, and Fischl Bruce. Automated probabilistic reconstruction of white-matter pathways in health and disease using an atlas of the underlying anatomy. Front. Neuroinform, 5:23, October 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wassermann Demian, Makris Nikos, Rathi Yogesh, Shenton Martha, Kikinis Ron, Kubicki Marek, and Westin Carl-Fredrik. The white matter query language: a novel approach for describing human white matter anatomy. Brain Struct. Funct, 221(9):4705–4721, December 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Maier-Hein Klaus H., Neher Peter F., Houde Jean-Christophe, MarcAlexandre Côté Eleftherios Garyfallidis, Zhong Jidan, Chamberland Maxime, Yeh Fang-Cheng, Lin Ying-Chia, Ji Qing, Reddick Wilburn E., Glass John O., David Qixiang Chen Yuanjing Feng, Gao Chengfeng, Wu Ye, Ma Jieyan, He Renjie, Li Qiang, Westin Carl-Fredrik, Deslauriers-Gauthier Samuel, Omar Ocegueda González J, Paquette Michael, Samuel St-Jean Gabriel Girard, Rheault François, Sidhu Jasmeen, Tax Chantal M. W., Guo Fenghua, Mesri Hamed Y., Szabolcs Dávid Martijn Froeling, Heemskerk Anneriet M., Leemans Alexander, Arnaud Boré Basile Pinsard, Bedetti Christophe, Desrosiers Matthieu, Brambati Simona, Doyon Julien, Sarica Alessia, Vasta Roberta, Cerasa Antonio, Quattrone Aldo, Yeatman Jason, Khan Ali R., Hodges Wes, Alexander Simon, Romascano David, Barakovic Muhamed, Anna Auría Oscar Esteban, Lemkaddem Alia, Thiran Jean-Philippe, Ertan Cetingul H, Odry Benjamin L., Mailhe Boris, Nadar Mariappan S., Pizzagalli Fabrizio, Prasad Gautam, Villalon-Reina Julio E., Galvis Justin, Thompson Paul M., Requejo Francisco De Santiago, Laguna Pedro Luque, Lacerda Luis Miguel, Barrett Rachel, Dell’Acqua Flavio, Catani Marco, Petit Laurent, Caruyer Emmanuel, Daducci Alessandro, Dyrby Tim B., Tim Holland-Letz Claus C. Hilgetag, Stieltjes Bram, and Descoteaux Maxime. The challenge of mapping the human connectome based on diffusion tractography. Nat. Commun, 8(1):1349, November 2017. ISSN 2041–1723. doi: 10.1038/s41467-017-01285-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Thomas Cibu, Ye Frank Q., Irfanoglu M. Okan, Modi Pooja, Saleem Kadharbatcha S., Leopold David A., and Pierpaoli Carlo. Anatomical accuracy of brain connections derived from diffusion MRI tractography is inherently limited. Proc. Natl. Acad. Sci. U. S. A, 111(46):16574–16579, November 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Schilling Kurt G., Petit Laurent, Rheault Francois, Remedios Samuel, Pierpaoli Carlo, Anderson Adam W., Landman Bennett A., and Descoteaux Maxime. Brain connections derived from diffusion MRI tractography can be highly anatomically accurate—if we know where white matter pathways start, where they end, and where they do not go. Brain Struct. Funct, 225(8):2387–2402, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rokem Ariel, Yeatman Jason D., Pestilli Franco, Kay Kendrick N., Mezer Aviv, van der Walt Stefan, and Wandell Brian A. Evaluating the accuracy of diffusion MRI models in white matter. PLOS ONE, 10(4):e0123272, April 2015. ISSN 1932–6203. doi: 10.1371/journal.pone.0123272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Novikov Dmitry S., Kiselev Valerij G., and Jespersen Sune N. On modeling. Magn. Reson. Med, 79(6):3172–3193, June 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Jones Derek K., Travis Adam R., Eden Greg, Pierpaoli Carlo, and Basser Peter J. PASTA: pointwise assessment of streamline tractography attributes. Magn. Reson. Med, 53(6):1462–1467, June 2005. [DOI] [PubMed] [Google Scholar]
  • 19.Colby John B., Soderberg Lindsay, Lebel Catherine, Dinov Ivo D., Thompson Paul M., and Sowell Elizabeth R. Along-tract statistics allow for enhanced tractography analysis. Neuroimage, 59(4):3227–3242, February 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Adam Richie-Halford Jason Yeatman, Simon Noah, and Rokem Ariel. Multidimensional analysis and detection of informative features in human brain white matter. PLoS Comput. Biol, 17(6):e1009136, 2021. doi: 10.1371/journal.pcbi.1009136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Dayan Michael, Monohan Elizabeth, Pandya Sneha, Kuceyeski Amy, Nguyen Thanh D., Raj Ashish, and Gauthier Susan A. Profilometry: a new statistical framework for the characterization of white matter pathways, with application to multiple sclerosis. Hum. Brain Mapp, 37(3):989–1004, December 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Donoho David L. An invitation to reproducible computational research. Biostatistics, 11(3):385–388, July 2010. [DOI] [PubMed] [Google Scholar]
  • 23.Ivie Peter and Thain Douglas. Reproducibility in scientific computing. ACM Comput. Surv, 51(3):1–36, July 2018. [Google Scholar]
  • 24.The Turing Way Community, Arnold Becky, Bowler Louise, Gibson Sarah, Herterich Patricia, Higman Rosie, Krystalli Anna, Morley Alexander, O’Reilly Martin, and Whitaker Kirstie. The Turing Way: A Handbook for Reproducible Data Science (Version v0.0.4). Zenodo 2019, March 25. doi: 10.5281/zenodo.3233986. [DOI]
  • 25.Botvinik-Nezer Rotem, Holzmeister Felix, Camerer Colin F., Dreber Anna, Huber Juergen, Johannesson Magnus, Kirchler Michael, Iwanir Roni, Mumford Jeanette A., Alison Adcock R, Avesani Paolo, Baczkowski Blazej M., Bajracharya Aahana, Bakst Leah, Ball Sheryl, Barilari Marco, Bault Nadège, Beaton Derek, Beitner Julia, Benoit Roland G., Berkers Ruud M. W. J., Bhanji Jamil P., Biswal Bharat B., Bobadilla-Suarez Sebastian, Bortolini Tiago, Bottenhorn Katherine L., Bowring Alexander, Braem Senne, Brooks Hayley R., Brudner Emily G., Calderon Cristian B., Camilleri Julia A., Castrellon Jaime J., Cecchetti Luca, Cieslik Edna C., Cole Zachary J., Collignon Olivier, Cox Robert W., Cunningham William A., Czoschke Stefan, Dadi Kamalaker, Davis Charles P., De Luca Alberto, Delgado Mauricio R., Demetriou Lysia, Dennison Jeffrey B., Di Xin, Dickie Erin W., Dobryakova Ekaterina, Donnat Claire L., Dukart Juergen, Duncan Niall W., Durnez Joke, Eed Amr, Eickhoff Simon B., Erhart Andrew, Fontanesi Laura, Matthew Fricke G, Fu Shiguang, Galván Adriana, Gau Remi, Genon Sarah, Glatard Tristan, Glerean Enrico, Goeman Jelle J., Golowin Sergej A. E., González-García Carlos, Gorgolewski Krzysztof J., Grady Cheryl L., Green Mikella A., Guassi Moreira João F., Guest Olivia, Hakimi Shabnam, Paul Hamilton J, Hancock Roeland, Handjaras Giacomo, Harry Bronson B., Hawco Colin, Herholz Peer, Herman Gabrielle, Heunis Stephan, Hoffstaedter Felix, Hogeveen Jeremy, Holmes Susan, Hu Chuan-Peng, Huettel Scott A., Hughes Matthew E., Iacovella Vittorio, Iordan Alexandru D., Isager Peder M., Isik Ayse I., Jahn Andrew, Johnson Matthew R., Johnstone Tom, Joseph Michael J. E., Juliano Anthony C., Kable Joseph W., Kassinopoulos Michalis, Koba Cemal, Kong Xiang-Zhen, Koscik Timothy R., Kucukboyaci Nuri Erkut, Kuhl Brice A., Kupek Sebastian, Laird Angela R., Lamm Claus, Langner Robert, Lauharatanahirun Nina, Lee Hongmi, Lee Sangil, Leemans Alexander, Leo Andrea, Lesage Elise, Li Flora, Li Monica Y. C., Cheng Lim Phui, Lintz Evan N., Liphardt Schuyler W., Losecaat Vermeer Annabel B., Love Bradley C., Mack Michael L., Malpica Norberto, Marins Theo, Maumet Camille, McDonald Kelsey, McGuire Joseph T., Melero Helena, Méndez Leal Adriana S., Meyer Benjamin, Meyer Kristin N., Mihai Glad, Mitsis Georgios D., Moll Jorge, Nielson Dylan M., Nilsonne Gustav, Notter Michael P., Olivetti Emanuele, Onicas Adrian I., Papale Paolo, Patil Kaustubh R., Peelle Jonathan E., Pérez Alexandre, Pischedda Doris, Poline Jean-Baptiste, Prystauka Yanina, Ray Shruti, Reuter-Lorenz Patricia A., Reynolds Richard C., Ricciardi Emiliano, Rieck Jenny R., Rodriguez-Thompson Anais M., Romyn Anthony, Salo Taylor, Samanez-Larkin Gregory R., Sanz-Morales Emilio, Schlichting Margaret L., Schultz Douglas H., Shen Qiang, Sheridan Margaret A., Silvers Jennifer A., Skagerlund Kenny, Smith Alec, Smith David V., Sokol-Hessner Peter, Steinkamp Simon R., Tashjian Sarah M., Thirion Bertrand, Thorp John N., Tinghög Gustav, Tisdall Loreen, Tompson Steven H., Toro-Serey Claudio, Torre Tresols Juan Jesus, Tozzi Leonardo, Truong Vuong, Turella Luca, van ‘t Veer Anna E., Verguts Tom, Vettel Jean M., Vijayarajah Sagana, Vo Khoi, Wall Matthew B., Weeda Wouter D., Weis Susanne, White David J., Wisniewski David, Xifra-Porxas Alba, Yearling Emily A., Yoon Sangsuk, Yuan Rui, Yuen Kenneth S. L., Zhang Lei, Zhang Xu, Zosky Joshua E., Nichols Thomas E., Poldrack Russell A., and Schonberg Tom. Variability in the analysis of a single neuroimaging dataset by many teams. Nature, 582(7810):84–88, June 2020. ISSN 1476–4687. doi: 10.1038/s41586-020-2314-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cieslak Matthew, Cook Philip A., He Xiaosong, Yeh Fang-Cheng, Dhollander Thijs, Adebimpe Azeez, Aguirre Geoffrey K., Bassett Danielle S., Betzel Richard F., Bourque Josiane, Cabral Laura M., Davatzikos Christos, Detre John, Earl Eric, Elliott Mark A., Fadnavis Shreyas, Fair Damien A., Foran Will, Fotiadis Panagiotis, Garyfallidis Eleftherios, Giesbrecht Barry, Gur Ruben C., Gur Raquel E., Kelz Max, Keshavan Anisha, Larsen Bart S., Luna Beatriz, Mackey Allyson P., Milham Michael, Oathes Desmond J., Perrone Anders, Pines Adam R., Roalf David R., Adam Richie-Halford Ariel Rokem, Sydnor Valerie J., Tapera Tinashe M., Tooley Ursula A., Vettel Jean M., Yeatman Jason D., Grafton Scott T., and Satterthwaite Theodore D. QSIPrep: an integrative platform for preprocessing and reconstructing diffusion MRI data. Nat. Methods, 18(7):775–778, 2021. doi: 10.1038/s41592-021-01185-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wickham Hadley. Tidy data. J. Stat. Softw, 59(10):1–23, 2014.26917999 [Google Scholar]
  • 28.Garyfallidis Eleftherios, Brett Matthew, Amirbekian Bagrat, Rokem Ariel, Van Der Walt Stefan, Descoteaux Maxime, and Nimmo-Smith Ian. Dipy, a library for the analysis of diffusion MRI data. Front. Neuroinform, 8:8, 2014. ISSN 1662–5196. doi: 10.3389/fninf.2014.00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fonov Vladimir, Evans Alan C., Botteron Kelly, Robert Almli C, McKinstry Robert C., Louis Collins D, and Brain Development Cooperative Group. Unbiased average age-appropriate atlases for pediatric studies. Neuroimage, 54(1):313–327, January 2011. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2010.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Fonov Vladimir S., Evans Alan C., Botteron Kelly, McKinstry Robert C., Robert Almli C, and Louis Collins D Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. Neuroimage, 47:S102, July 2009. ISSN 1053–8119. doi: 10.1016/S1053-8119(09)70884-5. [DOI] [Google Scholar]
  • 31.Dell’Acqua Flavio, Lacerda Luis, Catani Marco, and Simmons Andrew. Anisotropic Power Maps: a diffusion contrast to reveal low anisotropy tissues from HARDI data. Proc. Intl. Soc. Mag. Reson. Med, 22:29960–29967, 2014. [Google Scholar]
  • 32.Chen David Qixiang, Dell’Acqua Flavio, Rokem Ariel, Garyfallidis Eleftherios, Hayes David J., Zhong Jidan, and Hodaie Mojgan. Diffusion weighted image co-registration: investigation of best practices. bioRxiv, December 2019. doi: 10.1101/864108. [DOI]
  • 33.Avants BB, Epstein Charles L., Grossman M, and Gee James C. Symm etric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal, 12(1):26–41, February 2008. ISSN 1361–8415. doi: 10.1016/j.media.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Catani Marco, Howard Robert J., Pajevic Sinisa, and Jones Derek K. Virtual in vivo inter-active dissection of white matter fasciculi in the human brain. Neuroimage, 17(1):77–94, September 2002. ISSN 1053–8119. doi: 10.1006/nimg.2002.1136. [DOI] [PubMed] [Google Scholar]
  • 35.Hua Kegang, Zhang Jiangyang, Wakana Setsu, Jiang Hangyi, Li Xin, Reich Daniel S., Calabresi Peter A., Pekar James J., van Zijl Peter C. M., and Mori Susumu. Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract specific quantification. Neuroimage, 39(1):336–347, January 2008. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2007.07.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Sotiropoulos Stamatios N., Jbabdi Saad, Xu Junqian, Andersson Jesper L., Moeller Steen, Auerbach Edward J., Glasser Matthew F., Hernandez Moises, Sapiro Guillermo, Jenkinson Mark, Feinberg David A., Yacoub Essa, Lenglet Christophe, Van Essen David C., Ugurbil Kamil, Behrens Timothy E. J., and WU-Minn HCP Consortium. Advances in diffusion MRI acquisition and processing in the human connectome project. Neuroimage, 80:125–143, October 2013. doi: 10.1016/j.neuroimage.2013.05.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cousineau Martin, Jodoin Pierre-Marc, Garyfallidis Eleftherios, Côté Marc-Alexandre, Morency Félix C., Rozanski Verena, Grand’Maison Marilyn, Bedell Barry J., and Descoteaux Maxime. A test-retest study on Parkinson’s PPMI dataset yields statistically significant white matter fascicles. Neuroimage Clin, 16:222, 2017. doi: 10.1016/j.nicl.2017.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.McGraw Kenneth O. and Wong SP Forming inferences about some intraclass correlation coefficients. Psychol. Methods, 1(1):30–46, 1996. ISSN 1939–1463(Electronic),1082–989X(Print). doi: 10.1037/1082-989X.1.1.30. [DOI] [Google Scholar]
  • 39.Boukadi Mariem, Marcotte Karine, Bedetti Christophe, Houde Jean-Christophe, Desautels Alex, Deslauriers-Gauthier Samuel, Chapleau Marianne, Boré Arnaud, Descoteaux Maxime, and Brambati Simona M. Test-Retest reliability of diffusion measures extracted along white matter language fiber bundles using HARDI-based tractography. Front. Neurosci, 12:1055, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Boukadi Mariem, Marcotte Karine, Bedetti Christophe, Houde Jean-Christophe, Desautels Alex, Deslauriers-Gauthier Samuel, Chapleau Marianne, Boré Arnaud, Descoteaux Maxime, and Brambati Simona M. Test-Retest reliability of diffusion measures extracted along white matter language fiber bundles using HARDI-based tractography. Front. Neurosci, 12:1055, January 2019. ISSN 1662–4548. doi: 10.3389/fnins.2018.01055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Huber Elizabeth, Neto Henriques Rafael, Owen Julia P., Rokem Ariel, and Yeatman Jason D. Applying microstructural models to understand the role of white matter in cognitive development. Dev. Cogn. Neurosci, 36:100624, February 2019. ISSN 1878–9293. doi: 10.1016/j.dcn.2019.100624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lerma-Usabiaga Garikoitz, Perry Michael L., and Wandell Brian A. Reproducible tract profiles (RTP): from diffusion MRI acquisition to publication. bioRxiv, 680173, 2019. [Google Scholar]
  • 43.Garikoitz Lerma-Usabiaga Pratik Mukherjee, Perry Michael L., and Wandell Brian A. Data-science ready, multisite, human diffusion MRI white-matter-tract statistics. Sci. Data, 7:Article number 422, 2020. doi: 10.1038/s41597-020-00760-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Garyfallidis Eleftherios, Marc-Alexandre Côté Francois Rheault, Sidhu Jasmeen, Hau Janice, Petit Laurent, Fortin David, Cunanne Stephen, and Descoteaux Maxime. Recognition of white matter bundles using local and global streamline-based registration and clustering. Neuroimage, 170:283–295, 2018. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2017.07.015. [DOI] [PubMed] [Google Scholar]
  • 45.Schilling Kurt G., Rheault François, Petit Laurent, Hansen Colin B., Nath Vishwesh, Yeh Fang-Cheng, Girard Gabriel, Barakovic Muhamed, Rafael-Patino Jonathan, Yu Thomas, Fischi-Gomez Elda, Pizzolato Marco, Ocampo-Pineda Mario, Schiavi Simona, Canales-Rodríguez Erick J., Daducci Alessandro, Granziera Cristina, Innocenti Giorgio, Thiran Jean-Philippe, Mancini Laura, Wastling Stephen, Cocozza Sirio, Petracca Maria, Pontillo Giuseppe, Mancini Matteo, Vos Sjoerd B., Vakharia Vejay N., Duncan John S., Melero Helena, Manzanedo Lidia, Emilio Sanz-Morales Ángel Peña-Melián, Calamante Fernando, Arnaud Attyé Ryan P. Cabeen, Korobova Laura, Toga Arthur W., Anupa Ambili Vijayakumari Drew Parker, Verma Ragini, Radwan Ahmed, Sunaert Stefan, Emsell Louise, De Luca Alberto, Leemans Alexander, Bajada Claude J., Haroon Hamied, Azadbakht Hojjatollah, Chamberland Maxime, Genc Sila, Tax Chantal M. W., Yeh Ping-Hong, Srikanchana Rujirutana, Mcknight Colin, Joseph Yuan-Mou Yang Jian Chen, Kelly Claire E., Yeh Chun-Hung, Cochereau Jerome, Maller Jerome J., Welton Thomas, Almairac Fabien, Seunarine Kiran K., Clark Chris A., Zhang Fan, Makris Nikos, Golby Alexandra, Rathi Yogesh, O’Donnell Lauren J., Xia Yihao, Dogu Baran Aydogan Yonggang Shi, Francisco Guerreiro Fernandes Mathijs Raemaekers, Warrington Shaun, Michielse Stijn, Alonso Ramírez-Manzanares Luis Concha, Aranda Ramón, Mariano Rivera Meraz Garikoitz Lerma-Usabiaga, Roitman Lucas, Fekonja Lucius S., Calarco Navona, Joseph Michael, Nakua Hajer, Voineskos Aristotle N., Karan Philippe, Grenier Gabrielle, Jon Haitz Legarreta Nagesh Adluru, Nair Veena A., Prabhakaran Vivek, Alexander Andrew L., Kamagata Koji, Saito Yuya, Uchida Wataru, Andica Christina, Masahiro Abe, Bayrak Roza G., Gandini Claudia A., Egidio D’Angelo Fulvia Palesi, Savini Giovanni, Rolandi Nicolò, Guevara Pamela, Houenou Josselin, Narciso López-López Jean-François Mangin, Poupon Cyril, Claudio Román Andrea Vázquez, Maffei Chiara, Arantes Mavilde, Andrade José Paulo, Maria Silva Susana, Raja Rajikha, Calhoun Vince D., Caverzasi Eduardo, Sacco Simone, Lauricella Michael, Pestilli Franco, Bullock Daniel, Zhan Yang, Brignoni-Perez Edith, Lebel Catherine, Reynolds Jess E., Nestrasil Igor, Labounek René, Lenglet Christophe, Paulson Amy, Aulicka Stefania, Heilbronner Sarah, Heuer Katja, Anderson Adam W., Landman Bennett A., and Descoteaux Maxime. Tractography dissection variability: what happens when 42 groups dissect 14 white matter bundles on the same dataset? Neuroimage 2021. Aug 22;243:118502. doi: 10.1016/j.neuroimage.2021.118502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kiar Gregory, Chatelain Yohan, Castro Pablo de Oliveira, Petit Eric, Rokem Ariel, Varoquaux Gaël, Misic Bratislav, Evans Alan C., and Glatard Tristan. Numerical instabilities in analytical pipelines lead to large and meaningful variability in brain networks. PLoS One, in press, 2020.10.15.341495, October 2020. doi: 10.1101/2020.10.15.341495. [DOI] [PMC free article] [PubMed]
  • 47.Dougherty Robert F., Michal Ben-Shachar Roland Bammer, Brewer Alyssa A., and Wandell Brian A. Functional organization of human occipital-callosal fiber tracts. Proc. Natl. Acad. Sci. U. S. A, 102(20):7350–7355, May 2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Friston Karl J. Statistical parametric mapping. In Kötter Rolf, editor, Neuroscience Databases: A Practical Guide, pp. 237–250. Springer US, Boston, MA, 2003. ISBN 978–1-4615–1079-6. doi: 10.1007/978-1-4615-1079-6_16. [DOI] [Google Scholar]
  • 49.Lerma-Usabiaga Garikoitz, Benson Noah, Winawer Jonathan, and Wandell Brian A. A validation framework for neuroimaging software: the case of population receptive fields. PLoS Comput. Biol, 16(6):e1007924, June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Neher Peter F., Laun Frederik B., Stieltjes Bram, and Maier-Hein Klaus H. Fiberfox: facilitating the creation of realistic white matter software phantoms. Magn. Reson. Med, 72(5):1460–1470, November 2014. [DOI] [PubMed] [Google Scholar]
  • 51.Yablonski Maya, Menashe Benjamin, and Ben-Shachar Michal. A general role for ventral white matter pathways in morphological processing: going beyond reading. Neuroimage, 226:117577, November 2020. [DOI] [PubMed] [Google Scholar]
  • 52.Yeatman Jason D., Richie-Halford Adam, Smith Josh K., Keshavan Anisha, and Rokem Ariel. A browser-based tool for visualization and analysis of diffusion MRI data. Nat. Commun, 9(1):940, March 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ghosh Satrajit S., Poline Jean-Baptiste, Keator David B., Halchenko Yaroslav O., Thomas Adam G., Kessler Daniel A., and Kennedy David N. A very simple, re-executable neuroimaging publication. F1000Res, 6:124, June 2017. ISSN 2046–1402. doi: 10.12688/f1000research.10783.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wasserthal Jakob, Neher Peter, and Maier-Hein Klaus H. Tractseg-fast and accurate white matter tract segmentation. Neuroimage, 183:239–253, 2018. [DOI] [PubMed] [Google Scholar]
  • 55.Giulia Bertò Daniel Bullock, Astolfi Pietro, Hayashi Soichi, Zigiotto Luca, Annicchiarico Luciano, Corsini Francesco, Alessandro De Benedictis Silvio Sarubbo, Pestilli Franco, Avesani Paolo, and Olivetti Emanuele. Classifyber, a robust streamline-based linear classifier for white matter bundle segmentation. NeuroImage, 224:117402, 2021. doi: 10.1016/j.neuroimage.2020.117402. [DOI] [PubMed] [Google Scholar]
  • 56.Qamar Chandio Bramsh, Risacher Shannon Leigh, Pestilli Franco, Bullock Daniel, Yeh Fang-Cheng, Koudoro Serge, Rokem Ariel, Harezlak Jaroslaw, and Garyfallidis Eleftherios. Bundle analytics, a computational framework for investigating the shapes and profiles of brain pathways across populations. Sci. Rep, 10(1):17149, October 2020. ISSN 2045–2322. doi: 10.1038/s41598-020-74054-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.St-Jean Samuel, Chamberland Maxime, Viergever Max A., and Leemans Alexander. Reducing variability in along-tract analysis with diffusion profile realignment. Neuroimage, 199:663–679, October 2019. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2019.06.016. [DOI] [PubMed] [Google Scholar]
  • 58.Virtanen Pauli, Gommers Ralf, Oliphant Travis E., Haberland Matt, Reddy Tyler, Cournapeau David, Burovski Evgeni, Peterson Pearu, Weckesser Warren, Bright Jonathan, van der Walt Stéfan J., Brett Matthew, Wilson Joshua, Jarrod Millman K, Mayorov Nikolay, Nelson Andrew R. J., Jones Eric, Kern Robert, Larson Eric, Carey CJ, Polat İlhan, Feng Yu, Moore Eric W., VanderPlas Jake, Laxalde Denis, Perktold Josef, Cimrman Robert, Henriksen Ian, Quintero EA, Harris Charles R., Archibald Anne M., Ribeiro Antônio H., Pedregosa Fabian, Mulbregt Paul van, and SciPy 1.0 Contributors. SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods, 17(3):261–272, March 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Nájera Óscar, Larson Eric, Loïc Estève Lucy Liu, Varoquaux Gael, Grobler Jaques, Andrade Elliott Sales de, Holdgraf Chris, Gramfort Alexandre, Jas Mainak, Nothman Joel, Grisel Olivier, Varoquaux Nelle, Gouillart Emmanuelle, Lee Antony, Luessi Martin, Hiscocks Steven, Vanderplas Jake, Hoffmann Tim, Caswell Thomas A., Shih Albert Y., Batula Alyssa, Sullivan Bane, Stań czak Dominik, Sunden Kyle, Lars Matthias Feurer, Geier Matthias, Maximilian Nicolas Hug. sphinx-gallery/sphinx-gallery: Release v0.9.0 (v0.9.0). Zenodo, 2021. doi: 10.5281/zenodo.4718153. [DOI]
  • 60.Hansen Brian and Jespersen Sune Nørhøj. Data for evaluation of fast kurtosis strategies, b-value optimization and exploration of diffusion MRI contrast. Sci. Data, 3(1):160072, August 2016. ISSN 2052–4463. doi: 10.1038/sdata.2016.72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Rocklin Matthew. Dask: parallel computation with Blocked algorithms and task scheduling In Python in Science Conference, Austin, Texas, pp. 126–132, 2015. doi: 10.25080/Majora-7b98e3ed-013. [DOI] [Google Scholar]
  • 62.Richie-Halford Adam and Rokem Ariel. Cloudknot: a Python library to run your existing code on AWS batch In Proceedings of the 17th Python in Science Conference, pp. 8–14, 2018. doi: 10.25080/Majora-4af1f417-001. [DOI] [Google Scholar]
  • 63.Glatard Tristan, Kiar Gregory, Tristan Aumentado-Armstrong Natacha Beck, Bellec Pierre, Bernard Rémi, Bonnet Axel, Brown Shawn T., Camarasu-Pop Sorina, Cervenan-sky Frédéric, Das Samir, Silva Rafael Ferreira da, Flandin Guillaume, Girard Pascal, Gorgolewski Krzysztof J., Guttmann Charles R. G., Valérie Hayot-Sasson Pierre-Olivier Quirion, Rioux Pierre, Rousseau Marc-Étienne, and Evans Alan C. Boutiques: a flexible framework to integrate command-line applications in computing platforms. Gigascience, 7(5):giy016, May 2018. doi: 10.1093/gigascience/giy016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Yarkoni Tal, Markiewicz Christopher J., de la Vega Alejandro, Gorgolewski Krzysztof J., Salo Taylor, Halchenko Yaroslav O., Quinten McNamara Krista DeStasio, Poline Jean-Baptiste, Petrov Dmitry, Valérie Hayot-Sasson Dylan M. Nielson, Carlin Johan, Kiar Gregory, Whitaker Kirstie, Elizabeth DuPre Adina Wagner, Tirrell Lee S., Jas Mainak, Hanke Michael, Poldrack Russell A., Esteban Oscar, Appelhoff Stefan, Holdgraf Chris, Staden Isla, Thirion Bertrand, Kleinschmidt Dave F., Lee John A., Castello Matteo Visconti Oleggio di, Notter Michael P., and Blair Ross. PyBIDS: Python tools for BIDS datasets. J. Open Source Softw, 4(40):1294, August 2019. ISSN 2475–9066. doi: 10.21105/joss.01294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Gorgolewski Krzysztof J., Auer Tibor, Calhoun Vince D., Craddock R. Cameron, Das Samir, Duff Eugene P., Flandin Guillaume, Ghosh Satrajit S., Glatard Tristan, Halchenko Yaroslav O., Handwerker Daniel A., Hanke Michael, Keator David, Li Xiangrui, Michael Zachary, Maumet Camille, Nolan Nichols B, Nichols Thomas E., Pellman John, Poline Jean-Baptiste, Rokem Ariel, Schaefer Gunnar, Sochat Vanessa, Triplett William, Turner Jessica A., Varoquaux Gaël, and Poldrack Russell A. The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Sci. Data, 3(1):160044, June 2016. ISSN 2052–4463. doi: 10.1038/sdata.2016.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Brett Matthew, Markiewicz Christopher J., Hanke Michael, Marc-Alexandre Côté Ben Cipollini, Paul McCarthy Dorota Jarecka, Cheng Christopher P., Halchenko Yaroslav O., Cottaar Michiel, Larson Eric, Ghosh Satrajit, Wassermann Demian, Gerhard Stephan, Lee Gregory R., Wang Hao-Ting, Kastman Erik, Kaczmarzyk Jakub, Guidotti Roberto, Duek Or, Daniel Jonathan, Rokem Ariel, Madison Cindee, Moloney Brendan, Morency Félix C., Goncalves Mathias, Markello Ross, Riddell Cameron, Burns Christopher, Millman Jarrod, Gramfort Alexandre, Leppäkangas Jaakko, Sólon Anibal, van den Bosch Jasper J. F., Vincent Robert D., Braun Henry, Subramaniam Krish, Gorgolewski Krzysztof J., Raamana Pradeep Reddy, Klug Julian, Nolan Nichols B, Baker Eric M., Hayashi Soichi, Pinsard Basile, Haselgrove Christian, Hymers Mark, Esteban Oscar, Koudoro Serge, Fernando Pérez-García Nikolaas N. Oosterhof, Amirbekian Bago, Ian Nimmo-Smith Ly Nguyen, Reddigari Samir, St-Jean Samuel, Panfilov Egor, Garyfallidis Eleftherios, Varoquaux Gael, Legarreta Jon Haitz, Hahn Kevin S., Hinds Oliver P., Fauber Bennet, Poline Jean-Baptiste, Stutters Jon, Jordan Kesshi, Cieslak Matthew, Miguel Estevan Moreno Valentin Haenel, Schwartz Yannick, Baratz Zvi, Darwin Benjamin C., Thirion Bertrand, Gauthier Carl, Dimitri Papadopoulos Orfanos Igor Solovey, Gonzalez Ivan, Palasubramaniam Jath, Lecher Justin, Leinweber Katrin, Raktivan Konstantinos, Markéta Calábková Peter Fischer, Gervais Philippe, Gadde Syam, Ballinger Thomas, Roos Thomas, Reddam Venkateswara Reddy, and freec84. nipy/nibabel: 3.2.0, October 2020. nipy/nibabel: 3.2.1 (3.2.1). 10.5281/zenodo.4295521 [DOI]
  • 67.Descoteaux Maxime, Deriche Rachid, Knösche Thomas R., and Anwander Alfred. Deterministic and probabilistic tractography based on complex fibre orientation distributions. IEEE Trans. Med. Imaging, 28(2):269–286, February 2009. ISSN 1558–254X. doi: 10.1109/TMI.2008.2004424. [DOI] [PubMed] [Google Scholar]
  • 68.Basser PJ, Mattiello J, and LeBihan D Estimation of the effective self-diffusion tensor from the NMR spin echo. J. Magn. Reson. B, 103(3):247–254, March 1994. ISSN 1064–1866. doi: 10.1006/jmrb.1994.1037. [DOI] [PubMed] [Google Scholar]
  • 69.Basser Peter J. and Pierpaoli Carlo. Microstructural and physiological features of tissues elucidated by quantitative-diffusion-tensor MRI. J Magn Reson B, 111(3):209–219, 1996. doi: 10.1006/jmrb.1996.0086. [DOI] [PubMed] [Google Scholar]
  • 70.Tabesh Ali, Jensen Jens H., Ardekani Babak A., and Helpern Joseph A. Estimation of tensors and tensor-derived measures in diffusional kurtosis imaging. Magn. Reson. Med, 65(3):823–836, March 2011. ISSN 1522–2594. doi: 10.1002/mrm.22655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Donald Tournier J, Calamante Fernando, Gadian David G., and Connelly Alan. Direct estimation of the fiber orientation density function from diffusion-weighted MRI data using spherical deconvolution. Neuroimage, 23(3):1176–1185, November 2004. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2004.07.037. [DOI] [PubMed] [Google Scholar]
  • 72.Donald Tournier J, Calamante Fernando, and Connelly Alan. Robust determination of the fibre orientation distribution in diffusion MRI: non-negativity constrained super-resolved spherical deconvolution. Neuroimage, 35(4): 1459–1472, May 2007. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2007.02.016. [DOI] [PubMed] [Google Scholar]
  • 73.Jeurissen Ben, Tournier Jacques-Donald, Dhollander Thijs, Connelly Alan, and Sijbers Jan. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. Neuroimage, 103:411–426, December 2014. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2014.07.061. [DOI] [PubMed] [Google Scholar]
  • 74.Girard Gabriel, Whittingstall Kevin, Deriche Rachid, and Descoteaux Maxime. Towards quantitative connectivity analysis: reducing tractography biases. Neuroimage, 98:266–278, September 2014. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2014.04.074. [DOI] [PubMed] [Google Scholar]
  • 75.Smith Robert E., Tournier Jacques-Donald, Calamante Fernando, and Connelly Alan. Anatomically-constrained tractography: improved diffusion MRI streamlines tractography through effective use of anatomical information. Neuroimage, 62(3):1924–1938, September 2012. ISSN 1095–9572. doi: 10.1016/j.neuroimage.2012.06.005. [DOI] [PubMed] [Google Scholar]
  • 76.Côté Marc-Alexandre, Girard Gabriel, Boré Arnaud, Garyfallidis Eleftherios, Houde Jean-Christophe, and Descoteaux Maxime. Tractometer: towards validation of tractography pipelines. Med. Image Anal, 17(7):844–857, October 2013. ISSN 1361–8423. doi: 10.1016/j.media.2013.03.009. [DOI] [PubMed] [Google Scholar]
  • 77.Fidel Alfaro-Almagro Mark Jenkinson, Bangerter Neal K., Andersson Jesper L. R., Griffanti Ludovica, Douaud Gwenaëlle, Sotiropoulos Stamatios N., Jbabdi Saad, Moises Hernandez-Fernandez Emmanuel Vallee, Vidaurre Diego, Webster Matthew, Paul McCarthy Christopher Rorden, Daducci Alessandro, Alexander Daniel C., Zhang Hui, Dragonu Iulius, Matthews Paul M., Miller Karla L., and Smith Stephen M. Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage, 166:400–424, February 2018. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2017.10.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Miller Karla L., Fidel Alfaro-Almagro Neal K. Bangerter, Thomas David L., Yacoub Essa, Xu Junqian, Bartsch Andreas J., Jbabdi Saad, Sotiropoulos Stamatios N., Andersson Jesper L. R., Griffanti Ludovica, Douaud Gwenaëlle, Okell Thomas W., Weale Peter, Dragonu Iulius, Garratt Steve, Hudson Sarah, Collins Rory, Jenkinson Mark, Matthews Paul M., and Smith Stephen M. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci, 19(11):1523–1536, November 2016. ISSN 1546–1726. doi: 10.1038/nn.4393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Garyfallidis Eleftherios, Ocegueda Omar, Wassermann Demian, and Descoteaux Maxime. Robust and efficient linear registration of white-matter fascicles in the space of streamlines. Neuroimage, 117:124–140, August 2015. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2015.05.016. [DOI] [PubMed] [Google Scholar]
  • 80.Ciric Rastko, Thompson William H., Lorenz Romy, Goncalves Mathias, MacNicol Eilidh, Markiewicz Christopher J., Halchenko Yaroslav O., Ghosh Satrajit S., Gorgolewski Krzysztof J., Poldrack Russell A., and Esteban Oscar. Template-Flow: standardizing standard 3D spac es in neuroimaging. bioRxiv, 2021.02.10.430678. doi: 10.1101/2021.02.10.430678. [DOI]
  • 81.Otsu Nobuyuki. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern, 9(1):62–66, January 1979. ISSN 2168–2909. doi: 10.1109/TSMC.1979.4310076. [DOI] [Google Scholar]
  • 82.Wakana Setsu, Caprihan Arvind, Panzenboeck Martina M., Fallon James H., Perry Michele, Gollub Randy L., Hua Kegang, Zhang Jiangyang, Jiang Hangyi, Dubey Prachi, Blitz Ari, Zijl Peter van, and Mori Susumu. Reproducibility of quantitative tractography methods applied to cerebral white matter. Neuroimage, 36(3):630–644, July 2007. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2007.02.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Tzourio-Mazoyer Nathalie, Landeau Brigitte, Papathanassiou DF, Crivello Fabrice, Etard OND, Delcroix Nicolas, Mazoyer Bernard, and Marc Joliot. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage, 15(1):273–289, January 2002. ISSN 1053–8119. doi: 10.1006/nimg.2001.0978. [DOI] [PubMed] [Google Scholar]
  • 84.Bradford Barber C, David P Dobkin, and Hannu Huhdanpaa. The quickhull algorithm for convex hulls. ACM Trans. Math. Softw, 22(4):469–483, December 1996. ISSN 0098–3500, 1557–7295. doi: 10.1145/235815.235821. [DOI] [Google Scholar]
  • 85.Garyfallidis Eleftherios, Koudoro Serge, Guaje Javier, Marc-Alex Côté Soham Biswas, Reagan David, Anousheh Nasim, Silva Filipi, Fox Geoffrey, and FURY Contributors. FURY: advanced scientific visualization. Journal of Open Source Software, 6(64):3384, August 2021. doi: 10.21105/joss.03384. [DOI] [Google Scholar]
  • 86.Van Essen David C., Smith Stephen M., Barch Deanna M., Behrens Timothy E. J., Yacoub Essa, and Ugurbil Kamil. The WU-Minn Human Connectome Project: an overview. Neuroimage, 80:62–79, October 2013. ISSN 1053–8119. doi: 10.1016/j.neuroimage.2013.05.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Chang Lin-Ching, Jones Derek K., and Pierpaoli Carlo. RESTORE: robust estimation of tensors by outlier rejection. Magn. Reson. Med, 53(5):1088–1095, May 2005. ISSN 0740–3194. doi: 10.1002/mrm.20426. [DOI] [PubMed] [Google Scholar]
  • 88.Tournier J-Donald, Smith Robert, Raffelt David, Tabbara Rami, Dhollander Thijs, Pietsch Maximilian, Christiaens Daan, Jeurissen Ben, Yeh Chun-Hung, and Connelly Alan. MRtrix3: a fast, flexible and open software framework for medical image processing and visualisation. Neuroimage, 202:116137, November 2019. [DOI] [PubMed] [Google Scholar]
  • 89.Dice Lee R. Measures of the amount of ecologic association between species. Ecology, 26(3):297–302, 1945. ISSN 00129658, 19399170. doi: 10.2307/1932409. [DOI] [Google Scholar]
  • 90.Alexander Lindsay M., Escalera Jasmine, Ai Lei, Andreotti Charissa, Febre Karina, Mangone Alexander, Natan Vega-Potler Nicolas Langer, Alexander Alexis, Kovacs Meagan, Litke Shannon, Bridget O’Hagan Jennifer Andersen, Bronstein Batya, Bui Anastasia, Bushey Marijayne, Butler Henry, Castagna Victoria, Camacho Nicolas, Chan Elisha, Citera Danielle, Clucas Jon, Cohen Samantha, Dufek Sarah, Eaves Megan, Fradera Brian, Gardner Judith, Natalie Grant-Villegas Gabriella Green, Gregory Camille, Hart Emily, Harris Shana, Horton Megan, Kahn Danielle, Kabotyanski Katherine, Karmel Bernard, Kelly Simon P., Kleinman Kayla, Koo Bonhwang, Kramer Eliza, Lennon Elizabeth, Lord Catherine, Mantello Ginny, Margolis Amy, Merikangas Kathleen R., Milham Judith, Minniti Giuseppe, Neuhaus Rebecca, Levine Alexandra, Osman Yael, Parra Lucas C., Pugh Ken R., Racanello Amy, Restrepo Anita, Saltzman Tian, Septimus Batya, Tobe Russell, Waltz Rachel, Williams Anna, Yeo Anna, Castellanos Francisco X., Klein Arno, Paus Tomas, Leventhal Bennett L., Cameron Craddock R, Koplewicz Harold S., and Milham Michael P. An open resource for transdiagnostic research in pediatric mental health and learning disorders. Sci. Data, 4:170181, December 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Lindquist Martin. Neuroimaging results altered by varying analysis pipelines. Nature, 582(7810):36–37, June 2020. doi: 10.1038/d41586-020-01282-z. [DOI] [PubMed] [Google Scholar]
  • 92.Dougherty Robert F., Michal Ben-Shachar Gayle K. Deutsch, Hernandez Arvel, Fox Glenn R., and Wandell Brian A. Temporal-callosal pathway diffusivity predicts phonological skills in children. Proc. Natl. Acad. Sci. U. S. A, 104(20):8556–8561, May 2007. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES