Abstract
Introduction
Thresholding of low‐weight connections of diffusion MRI‐based brain networks has been proposed to remove false‐positive connections. It has been previously established that this yields more reproducible scan–rescan network architecture in healthy subjects. In patients with brain disease, network measures are applied to assess inter‐individual variation and changes over time. Our aim was to investigate whether thresholding also achieves improved consistency in network architecture in patients, while maintaining sensitivity to disease effects for these applications.
Methods
We applied fixed‐density and absolute thresholding on brain networks in patients with cerebral small vessel disease (SVD, n = 86; ≈24 months follow‐up), as a clinically relevant exemplar condition. In parallel, we applied the same methods in healthy young subjects (n = 44; scan–rescan interval ≈4 months) as a frame of reference. Consistency of network architecture was assessed with dice similarity of edges and intraclass correlation coefficient (ICC) of edge‐weights and hub‐scores. Sensitivity to disease effects in patients was assessed by evaluating interindividual variation, changes over time, and differences between those with high and low white matter hyperintensity burden, using correlation analyses and mixed ANOVA.
Results
Compared to unthresholded networks, both thresholding methods generated more consistent architecture over time in patients (unthresholded: dice = .70; ICC: .70–.78; thresholded: dice = .77; ICC: .73–.83). However, absolute thresholding created fragmented nodes. Similar observations were made in the reference group. Regarding sensitivity to disease effects in patients, fixed‐density thresholds that were optimal in terms of consistency (densities: .10–.30) preserved interindividual variation in global efficiency and node strength as well as the sensitivity to detect effects of time and group. Absolute thresholding produced larger fluctuations of interindividual variation.
Conclusions
Our results indicate that thresholding of low‐weight connections, particularly when using fixed‐density thresholding, results in more consistent network architecture in patients with longer rescan intervals, while preserving sensitivity to disease effects.
Keywords: cerebral small vessel disease, diffusion tensor imaging, network density, network reproducibility, network thresholding
In this study, we investigated whether thresholding methods that have been shown to improve scan–rescan reproducibility of diffusion‐based brain networks in healthy young subjects are also applicable to datasets of elderly patients with brain pathology, scanned over much longer time frames. We examined whether thresholding achieves improved consistency in network architecture, while maintaining sensitivity to biological effects in patients with cerebral small vessel disease.
1. INTRODUCTION
Diffusion‐weighted imaging (DWI) and fiber tractography enable us to map cerebral white matter pathways and reconstruct large‐scale brain networks (Jeurissen et al., 2017). Subsequently, graph theory can be applied to quantify the properties of such networks (Hagmann et al., 2007; Sporns et al., 2005). This framework has been widely used to investigate not only normal brain development but also a variety of neurological and psychiatric disorders (Fornito et al., 2013; Tijms et al., 2013).
A major challenge in structural network analysis is the limited reproducibility of networks obtained with diffusion MRI, due to the presence of false‐negative and false‐positive connections (Buchanan et al., 2014; Zalesky et al., 2016). Essentially, false negatives represent white matter connections that are undetected by the tractography algorithm. By contrast, false positives are edges in the reconstructed network that do not represent true white matter connections. These errors can result from numerous processing steps (Sotiropoulos & Zalesky, 2017), particularly the choice of parcellation scheme (Zalesky et al., 2010), tractography algorithm (Sarwar et al., 2018), and weighting strategy (Dimitriadis et al., 2017).
The most common solution to reduce false positives is to employ weight‐based thresholding by removing so‐called “weak” connections. In diffusion‐based brain networks, weak connections are usually defined as having a low number of streamlines (NOS). Many network studies have used thresholding strategies such as absolute thresholding (Garrison et al., 2015; Nicols et al., 2016), where a uniform threshold is applied to remove all connections below a certain edge‐weight (e.g., below five streamlines). Another popular method is fixed‐density thresholding (Rubinov & Sporns, 2010; van den Heuvel et al., 2017), where a relative threshold derived from each individual's data is applied to remove the weakest connections, such that an equal network density is achieved across subjects. Recent scan–rescan studies have investigated the impact of thresholding on the reproducibility of structural brain networks in healthy subjects (Buchanan et al., 2020; Messaritaki et al., 2019; Sarwar et al., 2018; Tsai, 2018; Welton et al., 2015). Their results suggest that applying thresholds to remove false positives can improve network similarity between scan and rescan, not only in terms of graph metrics but also by replicating the same network architecture (e.g., the same set of edges and edge‐weights).
Previous scan–rescan studies that tested reproducibility typically focused on datasets of healthy subjects, with state‐of‐the art MRI sequences, high imaging quality, and short rescan intervals (Van Essen et al., 2013). Thresholding methods may also be of value to increase consistency in network architecture in datasets of patient populations. In this setting, diffusion‐based network studies are primarily used to study disease effects, cross‐sectionally and over time. Before application in such clinical studies, it is essential to understand if thresholding indeed also produces more consistent network architecture in scans from patients, acquired in a clinical setting, containing various degrees of pathology, and across longer rescan intervals where further pathology has likely occurred. It is also important to determine if gain in network architecture consistency in this setting does not come at the cost of reduced sensitivity to detect disease effects, reflected in interindividual variation in diffusion metrics, and disease‐related network changes over time.
In this study, we therefore investigated whether thresholding methods that were previously shown to improve reproducibility in repeated scans of healthy young subjects also generate more consistent network architectures (e.g., the same set edges, edge‐weighs, and hubs‐scores) in patients who were scanned over longer time periods. To this end, we used longitudinal data of patients with cerebral small vessel disease (SVD), a condition known to affect cerebral white matter integrity that is often investigated with network analysis (Lawrence et al., 2018; Reijmer et al., 2013). In addition, we evaluated in these patients how thresholding affects sensitivity to disease effects, which was defined as (1) interindividual variation in network measures often examined in SVD (e.g., global efficiency and node strength) and (2) differences in global efficient and node strength between patients with low versus high SVD disease burden. We focused on two thresholding methods commonly applied in brain network studies: absolute thresholding and fixed‐density thresholding, both of which remove low‐weight connections and allow analysis of scan–rescan reproducibility on an individual patient level. As a frame of reference, we processed a dataset of healthy controls with the same methodology.
2. MATERIALS AND METHODS
2.1. Dataset 1—Memory clinic patients with SVD
We included 228 patients from the Parelsnoer longitudinal study (Aalten et al., 2014). Patients who were referred to the memory‐clinic of the UMC Utrecht for evaluation of cognitive problems, with a clinical dementia rating scale (CDR) (Morris, 1993) score of 0, .5, or 1, and a Mini Mental State Examination (MMSE) (Folstein et al., 1975) of 20 or higher were eligible. Exclusion criteria were: normal pressure hydrocephalus, Morbus Huntington, recent transient ischemic attack (TIA) or cerebrovascular accident (CVA) (<2 years), TIA/CVA followed by cognitive decline (within 3 months), history of major psychiatric disease or brain disease other than neurodegeneration or vascular disease, causing cognitive decline (e.g., brain tumor, epilepsy). Patients were eligible for the current analysis if they had a structural MRI and DTI scan at baseline and after ≈2‐year follow‐up visit date (N = 90). We additionally excluded three patients with cognitive complaints due to a diagnosis other than SVD or Alzheimer's disease (AD) to obtain a more homogenous study sample and one patient who was an extreme outlier in the network analysis. Thus, the total number of subjects included in the analysis was 86 (59% male). The follow‐up time ranged between 22 and 35 months (mean ± SD: 27 ± 3 months) and age of the patients varied between 56 and 86 years (mean ± SD: 73 ± 7 years). MRI data were acquired on a 3 tesla Philips scanner (Achieva, Philips, Best, the Netherlands) using a standardized clinical protocol that included a 3D T1‐weighted image and a diffusion weighted sequence. T1‐weighted scans were acquired with a voxel size of 1 mm3. DWI scans had an isotropic acquisition voxel size of 2.50 mm3, 45 diffusion‐sensitizing gradients with a b‐value of 1200 s/mm2, and 1 b = 0 s/mm2. Fluid‐attenuated inversion recovery (FLAIR; TR/TE/inversion time: 11,000/125/2800 ms) were also obtained.
The study was approved by the institutional review board of the UMC Utrecht, and all participants provided written informed consent prior to any research procedure.
2.2. Dataset 2 – Reference data of healthy young adults
As a frame of reference, we included a second dataset with repeated scans from healthy young adults from the Human Connectome Project (HCP, Van Essen et al., 2013). Previous studies have already tested the effects of thresholding using on this dataset but since network reconstruction pipelines always differ slightly across studies. We included these controls in our study to have a high‐quality reference, reconstructed with the exact same software packages, and tractography algorithm as our patient data. We selected 44 healthy participants (32% male) with scan–rescan DWI and T1‐weighted images. The rescan interval ranged between 1.5 and 11 months (mean ± SD: 4.7 ± 2 months) and the age of the participants varied between 22 and 35 years old. MRI was acquired on a Siemens Skyra 3 tesla scanner (Siemens, Erlangen, Germany). T1‐weighted images had an isotropic voxel size of 1.25 mm3. The multi‐shell DWI were acquired with an isotropic voxel size of 1.25 mm3 and three diffusion weightings (b‐values: 1000, 2000, and 3000 s/mm2). For each b‐value, 90 diffusion‐sensitizing gradients directions were measured. Additionally, 18 images with no diffusion weighting (b‐values = 0 s/mm2) were obtained. Here, we selected only a single shell (b‐value 1000 s/mm2), since it was more comparable to the patient dataset described above.
2.3. Diffusion processing and fiber tractography
All DWI scans were processed using ExploreDTI version 4.8.6 (Leemans et al., 2009) running on MATLAB R2018a (MATLAB and Statistics Toolbox Release 2018a, The MathWorks, Inc., Natick, MA, USA). Data were corrected for signal drift (Vos et al., 2017), eddy currents, subject motion with rotation of the B‐matrix (Leemans & Jones, 2009), and susceptibility distortions (Veraart et al., 2013). The DWI volumes were nonlinearly registered to the T1 images prior to estimation of the diffusion tensors. Diffusion tensors were estimated using a robust method to account for outliers (Tax et al., 2015), and fiber tracts were reconstructed using deterministic fiber tractography. Seed points were distributed uniformly throughout the whole brain with 2 mm isotropic resolution. The streamlines were propagated using integration over fiber orientation distributions (FOD), with a step size of 1 mm. The orientation distributions were inferred using constrained spherical deconvolution (CSD) with a maximum harmonic order (l‐max) of 6 (Jeurissen et al., 2011). Fiber tracking was terminated when streamlines entered a voxel with FOD < .1, or when the deflection angle between two successive 1 mm steps was > 45°. When tractography was concluded, streamlines with a length outside of the range between 10 and 500 mm were excluded.
2.4. Network definition
Figure 1 illustrates the processing steps for network definition. The T1‐weighted scans were preprocessed using the FMRIB Software Library (FSL v5.0, Smith et al., 2004) and the Computational Anatomy Toolbox (CAT12, http://dbm.neuro.uni‐jena.de/cat). First, each subject's T1 image was skull‐stripped using FSL BET (Smith et al., 2004). Next, the gray matter volume was parcellated into 90 cortical and subcortical regions of interest (ROIs) using the automated anatomical labeling (AAL) atlas (Tzourio‐Mazoyer et al., 2002). The parcellations were performed in the native T1 space, with the AAL template being nonlinearly registered to each subjects T1 image. The parcellated regions and the tractography data were combined to reconstruct the whole‐brain network. Each ROI represented a node in the network, and two nodes were considered to be connected when they contained the end‐points of at least one streamline, resulting in a 90 × 90 binary connectivity matrices. We also computed three weighted three matrices, where the edges were weighted by the number of streamlines (NOS) connecting the two nodes, the mean diffusivity (MD), and the fractional anisotropy (FA).
2.5. Network thresholding
Thresholding is frequently applied after network reconstruction, aiming to reduce the number of false‐positive connections. In this work, we employed the two most common weight‐based thresholding strategies: fixed‐density thresholding and absolute thresholding. The fixed‐density approach involved removing the edges with the lowest NOS until an equal density was achieved for all subjects. Network density is defined as the proportion of actual connections in the network, relative to all possible connections. For example, when a network has a density of .15, it means that 15% of all possible connections were detected in that network. To ensure that the networks did not become disconnected after thresholding we incorporated the minimal spanning tree (MST), an acyclic subgraph that connects all N nodes in the network (Tewarie et al., 2015). The MST is computed at the beginning of the thresholding step by selecting only edges with the highest NOS unless an edge forms a cycle. When all nodes are connected, the MST has N − 1 connections and a density of 2/N (.02), with N being the number of nodes (N = 90 nodes in our case). Using the MST as a starting point, fixed‐density thresholding is applied by adding more edges to the network (from strongest to weakest weights) until a certain density is achieved. We varied the density level between the density of the MST (.02) and the mean density of all unthresholded networks (density = .40) in steps of .01. Note that the stronger the threshold level, the lower the network density. The absolute thresholding approach involved removing all edges with a weight below an absolute number of streamlines. We varied the absolute threshold level between 1 and 40 streamlines in steps of 1. In this case, the stronger the threshold, the higher the number of streamlines removed. For example, a threshold of 20 streamlines means that that have fewer than 20 streamlines are removed from the network. This thresholding method does not ensure that nodes will not become disconnected or that networks of different subjects will have the same density after thresholding.
2.6. WMH volume segmentation
WMH hyperintensity volumes were segmented from the FLAIR images using and automated pipeline, kNN‐TTP (Steenwijk et al., 2013).
2.7. Consistency of network architecture
To examine the consistency of network architecture between scan and rescan, we focused on characteristics that represent the building blocks of structural networks, such as edges detected, edge‐weight distribution, and hub‐scores (node degree and betweenness centrality).
2.7.1. Similarity in edges detected
The most direct manner of measuring similarity between networks of scan and rescan is to overlap the edges detected at both time points. Using the binary connectivity matrix, we computed the dice similarity coefficient between edges detected at scan and rescan:
Here, represent edges in common between the two scans, whereas |scan| and |rescan| represent unique edges of scan and rescan, respectively. The dice coefficient ranges from 0 to 1, with 0 indicating no overlap and 1 representing a complete overlap between the two sets of edges.
2.7.2. Similarity in edge‐weight
We also evaluated whether the edges detected in both scans have similar weights, namely the number of streamlines (i.e., the weight used to determine which edges should be retained or removed). Thus, we first computed edges in common between scan and rescan and calculated the agreement in edge‐weight using the intraclass correlation coefficient (ICC) (Shrout & Fleiss, 1979). The ICC was originally created to assess the reliability of multiple raters measuring the same variable, but it is also often utilized in network studies to assess the consistency of graph measure over multiple sessions (Andreotti et al., 2014; Buchanan et al., 2014; Messaritaki et al., 2019):
where is the between‐subject variance, represents within‐subject variance, and is the number of repeated measurements. ICC values range between 0 and 1 and are typically interpreted as poor (<.40), fair (.40−.59), good (.60−.74), and excellent (>.75) (Cicchetti, 1994; Wang et al., 2019; Yuan et al., 2019).
2.7.3. Similarity in hub score
Another relevant feature of network architecture is the location of hub nodes. Hubs are topologically central regions and are positioned to make strong contributions for global network function (van den Heuvel & Sporns, 2013). The two most common graph metrics used to define node importance and to identify hubs are node degree and node betweenness centrality. Degree refers to the number of connections that link one node to adjacent nodes. Betweenness centrality is defined as the fraction of all shortest paths in the network that pass through a given node (Bullmore & Sporns, 2009). These nodes with high “hub‐score” (i.e., high degree and/or betweenness centrality) contribute to an efficient communication between distant brain regions. We compared the similarity in hub scores of all nodes between scan and rescan by computing the ICC of betweenness centrality and ICC of degree.
2.8. Interindividual variation and sensitivity to changes over time in patients
Here, we examined how thresholding affects the natural interindividual variation in the data, necessary to test associations with external variables and perform group comparisons (Bagarinao et al., 2019). We focused on metrics such as global efficiency and FA‐ and MD‐weighted node strength. Global efficiency was defined as the inverse of the average shortest path length and quantifies how efficiently information is exchanged over the network (Rubinov & Sporns, 2010). Node strength was defined as the average FA or MD of all edges connected to a node. Clearly, disturbances of these network metrics are not specific to only SVD. Moreover, a range of other metrics exist. We choose these particular measures because they are known to be affected by SVD (Lawrence et al., 2014; Reijmer et al., 2013). Moreover, these metrics (i.e., lower global efficiency and FA, and a higher MD) are known to be related to disease burden and progression over time. Patients with larger WMH volumes show lower global efficiency and FA, and a higher MD, and as the disease progresses over time, global efficiency, FA and MD are expected to decline further (Tuladhar et al., 2020). Therefore, we also tested among the patients how thresholding affected the effect size of the difference in diffusion metrics between high and low disease burden and sensitivity to detect changes over time. The paragraphs below describe how we evaluated the interindividual variation of these metrics and sensitivity to detect time and group effects for different threshold levels.
2.8.1. Assessing interindividual variation
For each threshold level, global efficiency and node strength were calculated and transformed to z‐scores. Follow‐up z‐scores were calculated using the mean and standard deviation of baseline. To assess whether thresholding affects interindividual variation, we computed a correlation matrix containing Pearson's correlation coefficients between z‐scores of different threshold levels. Figure 2 shows an example of how this correlation matrix was calculated for global efficiency (GE). First, global efficiency z‐scores were calculated for each subject (S) and for each threshold level (T). Then, z‐scores of one threshold level were correlated with z‐scores of another threshold level, yielding a correlation coefficient (r). The strength of this correlation indicates whether the interindividual variation in global efficiency is preserved between threshold levels. In other words, if baseline scores change between threshold levels, this results in low correlation coefficients. We calculated r between all combinations thresholds to fill the correlation matrix. In this manner, it is possible to observe which threshold levels generate substantially different global efficiency values. We also computed the same correlation matrix for rate of the change in global efficiency over time (i.e., baseline minus follow‐up), to understand whether thresholding affects the estimated individual rate of change over time. Essentially, for a hypothetical subject with a baseline global efficiency of 1.5×standard deviation (SD) relative to the group mean, if the interindividual variation is not affected, the same score should be obtained for that subject when a different threshold is used. Similarly, if the global efficiency of that subject declined −.5 SD from baseline to follow‐up, the same rate of decline should be observed at different thresholds. The range of thresholds that shows the highest r values thus represent the thresholds were sensitivity to interindividual variation is optimal. Differences in follow‐up time between subjects were not adjusted in the analyses, because such differences only further contribute to interindividual variation and our intention was to assess that variation.
2.8.2. Assessing changes over time
To examine whether thresholding affects the sensitivity to detect network changes over time in patients, we used mixed ANOVA to compare baseline versus follow‐up global efficiency and node strength. Patients were stratified into two groups using a median split of WMH volume. For this analysis, 13 patients were excluded due to lack of FLAIR images for the segmentation of WMH, resulting in 73 patients. We evaluated whether the sensitivity to detect an effect of time (within‐subject factor), effect of group (between‐subject factor), and interaction time × group is preserved across thresholds.
3. RESULTS
3.1. Consistency of network architecture
Figure 3 shows the effect of thresholding on the consistency of network architecture in patients. In each panel, the first plot contains similarity scores for different characteristics of the network architecture, and the second plot shows the number of nodes that remain in the network at each threshold level. Quantitate values are shown in Table S1. Results with the reference dataset of controls are shown in the Supporting Information (Figure S1 and Table S2).
3.1.1. Similarity in edges detected
In the patients, the dice similarity of edges was .70 before thresholding. When fixed density thresholding was applied, this score increased with stronger thresholds (i.e., with lower densities), reaching a maximum of .76 (at a density = .08, p < .001 compared to unthresholded, Figure 3a, blue line, Table S1). For densities lower than .08, dice scores decreased sharply to .67 (density = .02). As expected, the number of nodes (N = 90) in the network did not change with fixed‐density thresholding. Regarding absolute thresholding, the dice score also increased with stronger thresholds (i.e., with larger number of streamlines removed), from .70 (unthresholded) to a maximum of .76 (# streamlines removed = 40, p < .001, Figure 3b, blue, Table S1). With absolute thresholding, the maximum dice score was produced by the strongest threshold. However, at this threshold level, only 38 of the initial 90 nodes remained in the network. Overall, these results indicate that compared to not applying any threshold whatsoever, thresholding generates a more similar set of edges between scan and rescan.
In controls, thresholding had a similar effect on dice similarity, albeit with higher scores than in patients as expected, because of higher quality scans and absence of pathology (Figure S1A and B, blue, Table S2).
3.1.2. Similarity in edge‐weight
The ICC of edge‐weight in patients was .71 for unthresholded networks (Figure 3a, black). After fixed‐density thresholding, ICC scores increased with stronger thresholds to a maximum of .75 (density = .12, p < .001, Figure 3a, black). When absolute thresholding was applied, the ICC also increased with stronger thresholds, reaching a maximum of .76, again at the strongest threshold level (# streamlines removed = 40 streamlines, Figure 3b, black, Table S1). These results indicate that the edges retained after thresholding have more consistent weight distributions between scan and rescan.
Similar results were observed in controls, but with higher ICC scores (Figure S1).
3.1.3. Similarity in hub score
The ICC degree also increased after thresholding, from .78 (unthresholded) to .83 when fixed‐density thresholding was used (density = .08, p < .001, Figure 3a, red, Table S1), whereas the ICC of betweenness centrality showed an unstable profile and did not increase significantly after thresholding (Figure 3a, green). Regarding absolute thresholding, the ICC degree also increased from .78 to a maximum of .83 (Figure 3b, red), and ICC‐betweenness centrality did not increase with stronger thresholds.
For controls, the effect of thresholding on hubs scores was analogous (see Figure S1), albeit with higher scores.
3.2. Interindividual variation and sensitivity to changes over time in patients
The analyses assessing interindividual variation and sensitivity to changes in global efficiency, MD‐ and FA‐weighted node strength in patients are shown in Figures 4, 5, 6. In each figure, “panel a” contains spaghetti plots summarizing baseline and follow‐up z‐scores, as well as a correlations between baseline and follow‐up, within each threshold level. From a biological perspective, these correlations indicate whether patients maintain their relative position in group from baseline to follow‐up, which would then be reflected in parallel lines in the spaghetti plots, and strong correlations. “Panel b (left)” shows correlation matrices between baseline values of different threshold levels. These correlations indicate whether individual patients maintain their relative baseline scores across different thresholds. Threshold levels that do not disturb this interindividual variation of baseline scores should show high correlations. Likewise, “panel b (right)” shows a correlation matrix but for the rate of decline over time across thresholds, indicating whether individual patients maintain the same rate of decline between thresholds. “Panel c” depicts the sensitivity to detect time and group effects in patients stratified by WMH volume for different threshold levels. If a statistically significant change over time is found, this will be reflected in an F‐value (time) > critical value. The magnitude of this effect is given by the Cohen's d. Similarly, if there is significant difference between patients with low and high WMH volume, this group‐effect is given by the F‐value (group) and corresponding effect size.
Since absolute thresholding produced a large number of disconnected nodes, some brain regions got excluded from the analysis, not allowing us to always evaluate global efficiency over the same network of 90 nodes. It is well known that global network measures are highly dependent on the number of nodes (van Wijk et al., 2010). Therefore, for the analysis regarding the sensitivity to disease effects, we focused on the fixed‐density approach where the size of the network is maintained regardless of the threshold level, whereas the results for absolute thresholding are shown in the Supporting Information (Figures S1–S3).
3.2.1. Global efficiency
Figure 4 shows results for global efficiency. In Figure 4a, the spaghetti plots indicate that before thresholding there is a clear variation in global efficiency between subjects, with an apparent decline from baseline to follow‐up. The lines of the subject initially run in parallel before thresholding, reflected in high a correlation between baseline and follow‐up scores (r = .81). After thresholding, this variation of baseline and follow‐up scores was maintained (r ≈ .8). The relation between baseline and follow‐up scores was only disturbed when networks were thresholded to densities < .05, implying that interindividual differences between subjects is no longer maintained.
In Figure 4b, we quantified whether the interindividual variation of subjects and the individual rate of change over time are maintained between different threshold levels. Regarding baseline values (left matrix), z‐scores of global efficiency remained robust between density thresholds of .40 and .05 (r > .9). In other words, the distance of each subject to the group mean at baseline was unchanged, as long as networks were not thresholded to densities below .05. The same was true for the change over time (right matrix). The individual rate of change from baseline to follow‐up was unaffected for densities between .40 and .10 (r > .9). Overall, these correlations analyses show that global efficiency scores of individual subjects do not change after thresholding (.10 < density < .40).
In Figure 4c, we statistically tested whether the apparent decline over time in global efficacy observed in the spaghetti plots was statistically significant, and whether patients with low WMH differed from high WMH volume. Before thresholding (density = .40), there was significant effect of time (blue curve), indicating a decline in global efficiency from baseline to follow‐up (F (1, 72) = 22; p < .01; Cohen's d = .29). This decline is in line with previous reports (Tuladhar et al., 2020). Regarding the group effect (gray curve), patients with larger WMH volumes had a significantly lower global efficiency (F (1, 72) = 45; p < .01; Cohen's d = .4). There was no interaction of time × group, with both groups of patients declining at the same rate. After thresholding, the sensitivity to detect time and group effects was robust, with F‐values and effect sizes remaining relatively constant for a wide range of densities between .40 and .10. This result indicates that these threshold levels preserve the sensitivity to effects of time and group observed originally before thresholding. The absolute thresholding method resulted in larger changes on the interindividual variation and sensitivity do detect time and group effects (Figure S1). Only a narrow range of thresholds preserved the sensitivity to detect time and group‐effects.
3.2.2. MD‐weighted node strength
Figure 5 describes the same analysis as Figure 4, but for MD‐weighted node strength. By observing the spaghetti plots on Figure 5a, MD appears unchanged from baseline to follow‐up on a group level.
However, the correlation matrices on Figure 5b reveal that between a density of .4 and .30, MD scores remain initially unchanged (r > .9, note the block of high correlations on the bottom right of the matrix). When stronger thresholds were applied (i.e., when more noisy connections were removed, densities < .30), MD scores obtained at these threshold levels no longer resemble those obtained for thresholds between .40 and .30 but did not change further for the remaining thresholds (r > .9, for densities between .30 and .02).
In Figure 5c, we tested the sensitivity to detect changes over time. Before thresholding, there was a significant but small effect of time on MD (F (1, 72) = 12; p < .01; Cohen's d = .1). The group‐effect and the interaction time × group were not significant. After thresholding, the sensitivity to detect an effect of time was increased (i.e., larger F‐values) and highest for density thresholds around .12. Note that these were the threshold levels that also improved the consistency of network architecture over time. This indicates that removing noisy connections improves the sensitivity to detect small changes within individuals over time in local weight‐based metrics such as MD. Absolute thresholding produced larger changes on interindividual variation, but did not eliminate the sensitivity to effects of time and group (Figure S2).
3.2.3. FA‐weighted node strength
Figure 6 shows the results for FA‐weighted node strength. Thresholding appears to have a small effect on the relation between baseline and follow‐up FA values as illustrated by the spaghetti plots and the correlations between baseline and follow‐up (Figure 6a).
The correlations matrices (Figure 6b) further show that the interindividual variation of baseline values (left matrix) remained robust for all threshold levels between .37 and .02 (r > .90). In other words, the position of each individual subject relative to the group mean did not change for thresholds between .37 and .02. The same result was obtained for the individual rate of change over time (right matrix).
Figure 6c shows that before thresholding no significant time and group effects were detected. Thresholding did not improve the sensitivity and effect of time, but did improve the sensitivity do detect a small effect of group for densities < .30: (F (1, 72) = 5; p < .05; Cohen's d = .05).
Again, absolute thresholding caused larger changes on interindividual variation but also improved the detection group‐effects after the first threshold levels (Figure S2).
4. DISCUSSION
In this work, we evaluated the impact of thresholding on scan–rescan brain networks of patients with SVD to assess how thresholds that improve scan–rescan network reproducibility in healthy young subjects affect (1) consistency in network architecture in these patients over a longer time period and (2) sensitivity to detect biological effects. Our results indicate that threshold levels that improve the reproducibility in controls also generate more consistent network architecture over time in patients. The similarity between scan and rescan for characteristics such as the location of edges detected, edge‐weights, and hub scores improved after thresholding. We also showed that the natural interindividual variation in outcome measures used to assess disease effect is preserved within threshold levels where the network architecture is consistent. Furthermore, the sensitivity to detect statistical group differences between patients with low vs. high WMH burden was maintained.
Preceding our work, several studies had examined the effect of thresholding on reproducibility of network architecture, including the binary topology, edges detected, edge‐weights, and graph metrics in healthy, mostly young, controls (Andreotti et al., 2014; Buchanan et al., 2014; Owen et al., 2013). As expected, our results with controls were in line with studies that explicitly showed that networks become more reproducible after thresholding (e.g., Buchanan et al., 2020; Messaritaki et al., 2019; Roine et al., 2019). In patients where disease effects are monitored over time, the rescan interval is typically much longer. Therefore, the same thresholding methods and threshold levels that are reported to improve reproducibility in controls might not directly apply. We used SVD as an exemplar condition, because network metrics have been shown to be relevant for this disease, but also because SVD‐related brain injury such as white matter hyperintensities, brain atrophy, and enlarged ventricles can impact the performance of tractography and network reconstruction. Current fiber tractography methods do not explicitly account for such factors, which can lead to erroneous estimations of white matter pathways and presence of more false positives in the network. Our results with patients show that the consistency of edges detected and their respective weights are improved when both thresholding methods are used, suggesting that the edges removed are noisy connections with a more random weight distribution (Messaritaki et al., 2019; Zalesky et al., 2016). Thresholding also improved the ICC of degree, meaning that if we were to define hubs nodes based on degree, a more consistent set of hubs would be detected between scans. Notably, the betweenness centrality was less consistent between scans, which could be explained by the fact that this metric depends not only on edges directly connected to a specific node but also on edges connected to distant nodes. Since the betweenness centrality quantifies the proportion shortest paths that go through given node, removing only one edge (which can be directly or indirectly connected to that node) can have a large impact on that shortest path. Thus, the betweenness centrality is more susceptible to disruptions when edges are removed (Drakesmith et al., 2015; Segarra & Ribeiro, 2014).
The two thresholding methods had distinct threshold levels to achieve optimal reproducibility. For the fixed‐density approach, similarity scores improved with decreasing density (i.e., stronger thresholds), before drastically decreasing for densities < .05. This reproducibility profile could be explained by the proportion of false positive at low densities and by the MST (Zalesky et al., 2016). The MST was incorporated to ensure that networks remain connected and avoid fragmented nodes. Since, by definition, the MST cannot contain connections that form cycles, a certain proportion of low‐weight connections (i.e., potential false positives) must be included in order to keep the network connected, meaning that at very low densities, the effect of these false positives is stronger. A potential disadvantage of thresholding networks to fixed‐density is that it can lead to confounding effects when comparing groups or datasets with different distributions of edge weights. In a group with higher edge weights, this would lead to ignoring potentially important edges with strong weights, while in a group with lower edge weights, this would lead to including weak or potentially spurious edges (van Wijk et al., 2010). Regarding the absolute thresholding approach, the similarity scores increased with mild thresholds (2–5 streamlines). Since this approach works by removing all connections with a weight below a certain number of streamlines, only the highest weighted connections survive when strong thresholds are used. A major downside of this method, also evident in our dataset, is that it quickly creates fragmented nodes, which means that some brain regions are no longer part of the network. Furthermore, differences in brain size or absolute number of streamlines computed for each subject results in largely different networks between subjects or over time. Thus, a “one size fits all” type of threshold is not ideal. For datasets of patients with similar characteristics as those included in this study, we therefore recommend using fixed‐density thresholds between .08 and .20 to achieve optimal network consistency while keeping all network nodes connected. This is also in line with previous research that estimated the density of structural connectomes to lie between .05 and .30 (Hagmann et al., 2008; Roberts et al., 2017). One of the main arguments against thresholding networks of patients in the attempt to improve network consistency over repeated scans is that this procedure could remove biological or disease‐related effects (Drakesmith et al., 2015; McColgan et al., 2018). This concern can be relevant in studies trying to identify disease effects at the level of subnetwork and/or individual connections, rather than pathological changes in large‐scale brain network topology (Petersen et al., 2020). In the SVD field, this type of analyses could help understanding how diffuse and/or focal damage in certain brain areas affects cognitive function. In those scenarios, applying thresholds could erroneously cut connections that are affected by pathology. On the other hand, for global network metrics, it is more desirable to assess disease effects on networks with a more consistent scan–rescan architecture, less affected by noisy connections that can confound the results. Detection of intraindividual changes over time in patients is generally one of the more challenging tasks in terms of sensitivity. Such changes may be detected at a group level, but the individual trajectories of patients tend to get overshadowed by noise if techniques are insensitive. Therefore, analysis of interindividual variation over time can be performed with more confidence if networks are voided of noisy connections. In the second part of our analysis, we show that the concerns about removal of disease effects by thresholding may not always be justified. For global efficiency, interindividual variation of subjects at baseline and their individual rate of change over time remained robust for fixed‐density thresholds > .10. Before thresholding, global efficiency was able to capture significant differences between patients with low and high WMH volume, which is in line with previous research where white matter lesion load was associated with lower global efficiency (Heinen et al., 2018). After fixed‐density thresholding, a wide‐range of densities (.40–.10) preserved the sensitivity to detect these disease‐related effects, suggesting that changes in global metrics can be consistently detected over multiple threshold levels (de Brito Robalo et al., 2020; Drakesmith et al., 2015). Absolute thresholding had a stronger impact interindividual variation and sensitivity to time and group effects (see Supporting Information), due to the fact that this thresholding approach creates disconnected nodes during, thereby changing the size of the network and disrupting global network metrics (van Wijk et al., 2010).
Regarding MD‐weighted node strength, the sensitivity to detect time effects was improved after thresholding (density ≈ .12). Since this network metric is based on edge weights, the measurements after thresholding are obtained over a smaller and more consistent set of connections, thereby decreasing the standard error and improving the sensitivity to detect small effect sizes. This hypothesis is also supported by previous work that examined associations between edge‐weights and age and showed that connections retained in the network after thresholding were significantly more associated with age than those removed (Buchanan et al., 2020). FA‐weighted node strength was lower in patients with higher WMH volume but did not significantly decline over time. These effects were not affected by thresholding.
Strengths of this work include the use of two distinct datasets, with different MRI protocols, different subject groups, and rescan intervals. In this manner, it was possible to directly test whether thresholds that improve the reproducibility in high‐quality scans with short rescan intervals also have the same effect on scans of patients, with longer follow‐up intervals. Our study also had some limitations. We analyzed the effects of thresholding only on network metrics that have shown association with disease effects in patients with SVD, such global efficiency and node strength. Thus, our results cannot be directly generalized to all network measures and all patient populations. Further investigation on disease effects reflected by other network metrics is required (e.g., local metrics or disease effects at the level of subnetworks). Our analysis was focused on the most popular weight‐based thresholding methods that directly remove connections from each individual network, without the need to create a group‐level network to determine a criterion to remove connections. In this manner, test–retest consistency of network architecture can be evaluated on an individual basis. Other scan–rescan studies have examined thresholding approaches that are not based on edge‐weights but rather on group‐level consistency and is less biased towards the length of streamlines (Buchanan et al., 2020; Roberts et al., 2017). Their findings also point that stringency of threshold can improve network consistency. However, future work should also analyze how consistency‐based thresholding improves group‐level consistency of network architecture (e.g., across scanners) in similar datasets and whether disease effects are preserved. Our results could also have been influenced by the choice of parcellation scheme (Zalesky et al., 2010), tractography algorithm (Bastiani et al., 2012), weighting scheme, among other factors. Thus, these results need to be tested using different network reconstruction pipelines.
5. CONCLUSION
Our study demonstrates the effects of weight‐based thresholding on longitudinal brain networks of patients with SVD. We showed that thresholding, particularly with fixed‐density approaches, can produce more consistent network architectures in patients scanned over longer time periods, while preserving disease‐related effects. Our work sheds a light on how to make informed decisions when applying thresholds in studies with a longitudinal design and how such choices can potentially influence the statistical significance of the results. A good practice for longitudinal studies that intend to apply weight‐based thresholds would be to first examine which threshold levels generate the most consistent network architecture over time and then verify if those threshold levels also preserve the interindividual variation in metrics that will be used as outcome for the study (e.g., global efficiency).
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supporting information
ACKNOWLEDGMENTS
This work was supported by ZonMw, The Netherlands Organisation for Health Research and Development (VICI grant 91816616 to Geert Jan Biessels). Yael D. Reijmer received funding from Alzheimer Nederland and ZonMw/Deltaplan Dementie (grant #733050503) and a Young Talent Fellowship from the Brain Center Rudolf Magnus, University Medical Center Utrecht. The research of Alexander Leemans is supported by VIDI Grant 639.072.411 from the Netherlands Organization for Scientific Research (NWO). Members of the Utrecht Vascular Cognitive Impairment (VCI) Study group involved in the present study (in alphabetical order by department): University Medical Center Utrecht, The Netherlands, Department of Neurology: E. van den Berg, J. M. Biesbroek, G. J. Biessels, M. Brundel, W. H. Bouvy, L.G. Exalto, C. J. M. Frijns, O. Groeneveld, S. M. Heringa, N. Kalsbeek, L. J. Kappelle, Y. D. Reijmer, J. Verwer; Department of Radiology/Image Sciences Institute: J. de Bresser, H. J. Kuijf, A. Leemans, P. R. Luijten, M. A. Viergever, K. L. Vincken, J. J. M. Zwanenburg; Department of Geriatrics: H. L. Koek; Hospital Diakonessenhuis Zeist, The Netherlands: M. Hamaker, R. Faaij, M. Pleizier, E. Vriens.
De Brito Robalo, B. M. , Vlegels, N. , Leemans, A. , Reijmer, Y. D. , & Biessels, G. J. (2022). Impact of thresholding on the consistency and sensitivity of diffusion MRI‐based brain networks in patients with cerebral small vessel disease. Brain and Behavior, 12, e2523. 10.1002/brb3.2523
DATA AVAILABILITY STATEMENT
The data that support the findings of this study are available from the corresponding author upon request.
REFERENCES
- Aalten, P. , Ramakers, I. H. G. B. , Biessels, G. J. , de Deyn, P. P. , Koek, H. L. , OldeRikkert, M. G. M. , Oleksik, A. M. , Richard, E. , Smits, L. L. , van Swieten, J. C. , Teune, L. K. , van der Lugt, A. , Barkhof, F. , Teunissen, C. E. , Rozendaal, N. , Verhey, F. R. J. , & van der Flier, W. M. (2014). The Dutch Parelsnoer Institute – Neurodegenerative diseases; methods, design and baseline results. BMC Neurology, 14(1), 1–8. 10.1186/s12883-014-0254-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andreotti, J. , Jann, K. , Melie‐Garcia, L. , Dierks, T. , Federspiel, A. , & Giezendanner, S. (2014). Repeatability analysis of global and local metrics of brain structural networks. Brain Connectivity, 4(3), 203–220. 10.1089/brain.2013.0202 [DOI] [PubMed] [Google Scholar]
- Bagarinao, E. , Watanabe, H. , Maesawa, S. , Mori, D. , Hara, K. , Kawabata, K. , Yoneyama, N. , Ohdake, R. , & Imai, K. (2019). Reorganization of brain networks and its association with general cognitive performance over the adult lifespan. Scientific Reports, 9, 1–15. 10.1038/s41598-019-47922-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bastiani, M. , Shah, N. J. , Goebel, R. , & Roebroeck, A. , (2012). Human cortical connectome reconstruction from diffusion weighted MRI: The effect of tractography algorithm. Neuroimage, 62(3), 1732–1749. 10.1016/j.neuroimage.2012.06.002 [DOI] [PubMed] [Google Scholar]
- Buchanan, C. R. , Bastin, M. E. , Ritchie, S. J. , Liewald, D. C. , Madole, J. W. , Tucker‐Drob, E. M. , Deary, I. J. , & Cox, S. R. (2020). The effect of network thresholding and weighting on structural brain networks in the UK Biobank. Neuroimage, 211, 116443. 10.1016/j.neuroimage.2019.116443 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchanan, C. R. , Pernet, C. R. , Gorgolewski, K. J. , Storkey, A. J. , & Bastin, M. E. (2014). Test–retest reliability of structural brain networks from diffusion MRI. Neuroimage, (86), 231–243. 10.1016/j.neuroimage.2013.09.054 [DOI] [PubMed] [Google Scholar]
- Bullmore, E. , & Sporns, O. (2009) Complex brain networks: Graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10(3), 186–198. 10.1038/nrn2575 [DOI] [PubMed] [Google Scholar]
- Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological Assessment, 6(4), 284–290 10.1037/1040-3590.6.4.284 [DOI] [Google Scholar]
- de Brito Robalo, B. M. , Vlegels, N. , Meier, J. , Leemans, A. , & Biessels, G. J. (2020). Effect of fixed‐density thresholding on structural brain networks : A demonstration in cerebral small vessel disease. Brain Connectivity, 10, 121–133. 10.1089/brain.2019.0686 [DOI] [PubMed] [Google Scholar]
- Dimitriadis, S. I. , Drakesmith, M. , Bells, S. , Parker, G. D. , Linden, D. E. , & Jones, D. K. (2017). Improving the reliability of network metrics in structural brain networks by integrating different network weighting strategies into a single graph. Frontiers in Neuroscience, 11(DEC), 1–17. 10.3389/fnins.2017.00694 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drakesmith, M. , Caeyenberghs, K. , Dutt, A. , Lewis, G. , David, A. S. , & Jones, D. K. (2015). Overcoming the effects of false positives and threshold bias in graph theoretical analyses of neuroimaging data. Neuroimage, 118, 313–333. 10.1016/j.neuroimage.2015.05.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Folstein, M. F. , Folstein, S. E. , & McHugh, P. R. (1975). Mini‐mental state. Journal of Psychiatric Research, 12, 189–198. 10.1016/0022-3956(75)90026-6 [DOI] [PubMed] [Google Scholar]
- Fornito, A. , Zalesky, A. , & Breakspear, M. (2013). Graph analysis of the human connectome: Promise, progress, and pitfalls. Neuroimage, 80, 426–444. 10.1016/j.neuroimage.2013.04.087 [DOI] [PubMed] [Google Scholar]
- Garrison, K. A. , Scheinost, D. , Finn, E. S. , Shen, X. , & Constable, R. T. (2015). The (in)stability of functional brain network measures across thresholds. Neuroimage, 118, 651–661. 10.1016/j.neuroimage.2015.05.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hagmann, P. , Cammoun, L. , Gigandet, X. , Meuli, R. , Honey, C. J. , Van Wedeen, J. , & Sporns, O. (2008). Mapping the structural core of human cerebral cortex. PLoS Biology, 6(7), 1479–1493. 10.1371/journal.pbio.0060159 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hagmann, P. , Kurant, M. , Gigandet, X. , Thiran, P. , Wedeen, V. J. , Meuli, R. , & Thiran, J. (2007). Mapping human whole‐brain structural networks with diffusion MRI. Plos One, 2, e597. 10.1371/journal.pone.0000597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinen, R. , Vlegels, N. , de Bresser, J. , Leemans, A. , Biessels, G. J. , & Reijmer, Y. D. , & Utrecht Vascular Cognitive Impairment Study Group . (2018). The cumulative effect of small vessel disease lesions is reflected in structural brain networks of memory clinic patients. Neuroimage Clinical, 19, 963–969. 10.1016/j.nicl.2018.06.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeurissen, B. , Descoteaux, M. , Mori, S. , & Leemans, A. (2017). Diffusion MRI fiber tractography of the brain. NMR in Biomedicine, 32, e3785. 10.1002/nbm.3785 [DOI] [PubMed] [Google Scholar]
- Jeurissen, B. , Leemans, A. , Jones, D. K. , Tournier, J. D. , & Sijbers, J. (2011). Probabilistic fiber tracking using the residual bootstrap with constrained spherical deconvolution. Human Brain Mapping, 32, 461–479. 10.1002/hbm.21032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawrence, A. J. , Chung, A. W. , Morris, R. G. , Markus, H. S. , & Barrick, T. R. (2014). Structural network efficiency is associated with cognitive impairment in small‐vessel disease. Neurology, 83, 304–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawrence, A. J. , Zeestraten, E. A. , Benjamin, P. , Lambert, C. P. , Morris, R. G. , Barrick, T. R. , & Markus, H. S. (2018). Longitudinal decline in structural networks predicts dementia in cerebral small vessel disease. Neurology, 90(21), e1898–e1910. 10.1212/WNL.0000000000005551 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leemans, A. , Jeruissen, B. , Sijbers, J. , & Jones, D. K. (2009). ExploreDTI: A graphical toolbox for processing, analyzing, and visualizing diffusion MR data. Proceedings of the International Society for Magnetic Resonance in Medicine, 245, 3537. [Google Scholar]
- Leemans, A. , & Jones, D. K. (2009). The B‐matrix must be rotated when correcting for subject motion in DTI data. Magnetic Resonance in Medicine, 61, 1336–1349. 10.1002/mrm.21890 [DOI] [PubMed] [Google Scholar]
- McColgan, P. , Blom, T. , Rees, G. , Seunarine, K. K. , Gregory, S. , Johnson, E. , Durr, A. , Roos, R. A. C. , Scahill, R. I. , Clark, C. A. , Tabrizi, S. J. , & Razi, A. ; the Track‐HD Investigators . (2018). Stability and sensitivity of structural connectomes: Effect of thresholding and filtering and demonstration in neurodegeneration . bioRxiv. 10.1101/416826 [DOI]
- Messaritaki, E. , Dimitriadis, S. I. , & Jones, D. K. (2019). Optimization of graph construction can significantly increase the power of structural brain network studies. Neuroimage, 199(May), 495–511. 10.1016/j.neuroimage.2019.05.052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morris, J. C. (1993). The clinical dementia rating (CDR): Current vision and scoring rules. Neurology, 43, 2412.2–2412–a. 10.1212/WNL.43.11.2412-a [DOI] [PubMed] [Google Scholar]
- Nicols, T. E. , Das, S. , Eickhoff, S. B. , Evans, A. C. , Glastard, T. G. , Hanke, M. , Kriegeskorte, N. , Milham, M. P. , Poldrack, R. A. , Poline, J.‐B. , Proal, E. , Thirion, B. , Van Essen, D. C. , White, T. , & Yeo, B. T. T. (2016). Best practices in data analysis and sharing in neuroimaging using MRI. bioRxiv. 10.1101/054262 [DOI] [PMC free article] [PubMed]
- Owen, J. P. , Ziv, E. , Bukshpun, P. , Pojman, N. , Wakahiro, M. , Berman, J. I. , Roberts, T. P. L. , Friedman, E. J. , Sherr, E. H. , & Mukherjee, P. (2013). Test–retest reliability ofcomputational network measurements derived from the structural connectome of the human brain. Brain Connect, 3, 160–176. 10.1089/brain.2012.0121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Petersen, M. , Frey, B. M. , Schlemm, E. , Mayer, C. , Hanning, U. , Engelke, K. , Fiehler, J. , Borof, K. , Jagodzinski, A. , Gerloff, C. , Thomalla, G. , & Cheng, B. (2020). Network localisation of white matter damage in cerebral small vessel disease. Scientific Reports, 10, 9210. 10.1038/s41598-020-66013-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reijmer, Y. D. , Leemans, A. , Brundel, M. , Kappelle, L. J. , & Biessels, G. J. (2013) Utrecht Vascular Cognitive Impairment (VCI) Study GroupDisruption of the cerebral white matter network is related to slowing of information processing speed in patients with type 2 diabetes. Diabetes, 62, 2112–2115. 10.2337/db12-1644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts, J. A. , Perry, A. , Roberts, G. , Mitchell, P. B. , & Breakspear, M. (2017) Consistency‐based thresholding of the human connectome. Neuroimage, 145(Pt A), 118–129. 10.1016/j.neuroimage.2016.09.053 [DOI] [PubMed] [Google Scholar]
- Roine, T. , Jeurissen, B. , Perrone, D. , Aelterman, J. , Philips, W. , Sijbers, J. , & Leemans, A. (2019). Reproducibility and intercorrelation of graph theoretical measures in structural brain connectivity networks. Medical Image Analysis, 52, 56–67. 10.1016/j.media.2018.10.009 [DOI] [PubMed] [Google Scholar]
- Rubinov, M. , & Sporns, O. (2010). Complex network measures of brain connectivity: Uses and interpretations. Neuroimage, 52(3), 1059–1069. 10.1016/j.neuroimage.2009.10.003 [DOI] [PubMed] [Google Scholar]
- Sarwar, T. , Ramamohanarao, K. , & Zalesky, A. (2018). Mapping connectomes with diffusion MRI: Deterministic or probabilistic tractography? Magnetic Resonance in Medicine, 81, 1368–1384. 10.1002/mrm [DOI] [PubMed] [Google Scholar]
- Segarra, S. , & Ribeiro, A. (2014). A stable betweenness centrality measure in networks. In IEEE International Conference on acoustics, speech, and signal processing . Florence, Italy. [Google Scholar]
- Shrout, P. E. , & Fleiss, J. L. (1979). Intraclass correlations: Uses in assessing rater reliability. Psychological Bulletin, 86(2), 420–428. 10.1037//0033-2909.86.2.420 [DOI] [PubMed] [Google Scholar]
- Smith, S. M. , Jenkinson, M. , Woolrich, M. W. , Beckmann, C. F. , Behrens, T. E. J. , Johansen‐berg, H. , Bannister, P. R. , Luca, M. D. , Drobnjak, I. , Flitney, D. E. , Niazy, R. K. , Saunders, J. , Vickers, J. , Zhang, Y. , De Stefano, N. , Brady, J. M. , & Matthews, P. M. (2004). Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage, 23, S208–S219. 10.1016/j.neuroimage.2004.07.051 [DOI] [PubMed] [Google Scholar]
- Sotiropoulos, S. N. , & Zalesky, A. (2017). Building connectomes using diffusion MRI: Why, how and but. NMR in Biomedicine, 32, 1–23. 10.1002/nbm.3752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sporns, O. , Tononi, G. , & Kötter, R. (2005). The human connectome: A structural description of the human brain. PLoS Computational Biology, 1(4), 0245–0251. 10.1371/journal.pcbi.0010042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steenwijk, M. D. , Pouwels, P. J. W. , Daams, M. , van Dalen, J. W. , Caan, M. W. A. , Richard, E. , Barkhof, F. , & Vrenken, H. (2013) Accurate white matter lesion segmentation by k nearest neighbor classification with tissue type priors (kNN‐TTPs). NeuroImage Clin, 3, 462–469. 10.1016/j.nicl.2013.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tax, C. M. W. , Otte, W. M. , Viergever, M. A. , Dijkhuizen, R. M. , & Leemans, A. (2015). REKINDLE: Robust extraction of Kurtosis INDices with linear estimation. Magnetic Resonance in Medicine, 73, 794–808. 10.1002/mrm.25165 [DOI] [PubMed] [Google Scholar]
- Tewarie, P. , van Dellen, E. , Hillebrand, A. , & Stam, C. J. (2015). The minimum spanning tree: An unbiased method for brain network analysis. Neuroimage, 104, 177–188. 10.1016/j.neuroimage.2014.10.015 [DOI] [PubMed] [Google Scholar]
- Tijms, B. M. , Wink, A. M. , de Haan, W. , van der Flier, W. M. , Stam, C. J. , Scheltens, P. , & Barkhof, F. (2013). Alzheimer's disease: Connecting findings from graph theoretical studies of brain networks. Neurobiology of Aging, 34(8), 2023–2036. 10.1016/j.neurobiolaging.2013.02.020 [DOI] [PubMed] [Google Scholar]
- Tsai, S. Y. (2018). Reproducibility of structural brain connectivity and network metrics using probabilistic diffusion tractography. Scientific Reports, 8(1), 1–12. 10.1038/s41598-018-29943-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tuladhar, A. M. , Tay, J. , van Leijsen, E. , Lawrence, A. J. , van Uden, I. W. M. , Bergkamp, M. , van der Holst, E. , Kessels, R. P. C. , Norris, D. , Markus, H. S. , & De leeuw, F. (2020). Structural network changes in cerebral small vessel disease. Journal of Neurology, Neurosurgery, and Psychiatry, 91(2), 196–203. 10.1136/jnnp-2019-321767 [DOI] [PubMed] [Google Scholar]
- Tzourio‐Mazoyer, N. , Landeau, B. , Papathanassiou, D. , Crivello, F. , Etard, O. , Delcroix, N. , Mazoyer, B. , & Joliot, M. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single‐subject brain. Neuroimage, 15(1), 273–289. 10.1006/nimg.2001.0978 [DOI] [PubMed] [Google Scholar]
- van den Heuvel, M. P. , de Lange, S. C. , Zalesky, A. , Seguin, C. , Yeo, B. T. T. , & Schmidt, R. (2017). Proportional thresholding in resting‐state fMRI functional connectivity networks and consequences for patient‐control connectome studies: Issues and recommendations. Neuroimage, 152(December 2016), 437–449. 10.1016/j.neuroimage.2017.02.005 [DOI] [PubMed] [Google Scholar]
- van den Heuvel, M. P. , & Sporns, O. (2013). Network hubs in the human brain. Trends in Cognitive Sciences, 17(12), 683–696. 10.1016/j.tics.2013.09.012 [DOI] [PubMed] [Google Scholar]
- Van Essen, D. C. , Smith, S. M. , Barch, D. M. , Behrens, T. E. J. , Yacoub, E. , & Ugurbil, K. (2013). The WU‐Minn Human Connectome Project: An overview. Neuroimage, 80, 62–79. 10.1016/j.neuroimage.2013.05.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Wijk, B. C. M. , Stam, C. J. , & Daffertshofer, A. (2010). Comparing brain networks of different size and connectivity density using graph theory. PLoS One, 5(10), e13701. 10.1371/journal.pone.0013701 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Veraart, J. , Sijbers, J. , Sunaert, S. , Leemans, A. , & Jeurissen, B. (2013). Weighted linear least squares estimation of diffusion MRI parameters: Strengths, limitations, and pitfalls. Neuroimage, 81, 335–346. 10.1016/j.neuroimage.2013.05.028 [DOI] [PubMed] [Google Scholar]
- Vos, S. B. , Tax, C. M. , Luijten, P. R. , Ourselin, S. , Leemans, A. , & Froeling, M. (2017) The importance of correcting for signal drift in diffusion MRI. Magnetic Resonance in Medicine, 77(1), 285–299. 10.1002/mrm.26124 [DOI] [PubMed] [Google Scholar]
- Wang, M. , Yuan, Z. , & Niu, H. (2019). Reliability evaluation on weighted graph metrics of fNIRS brain networks. Quantitative Imaging in Medicine and Surgery, 9(5), 832–841. 10.21037/qims.2019.05.08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Welton, T. , Kent, D. A. , Auer, D. P. , & Dineen, R. A. (2015). Reproducibility of graph‐theoretic brain network metrics: A systematic review. Brain Connectivity, 5(4), 193–202. 10.1089/brain.2014.0313 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan, J. P. , Henje Blom, E. , Flynn, T. , Chen, Y. , Ho, T. C. , Connolly, C. G. , Dumont Walter, R. A. , Yang, T. T. , Xu, D. , & Tymofiyeva, O. (2019). Test‐retest reliability of graph theoretic metrics in adolescent brains. Brain Connectivity, 9(2), 144–154. 10.1089/brain.2018.0580 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zalesky, A. , Fornito, A. , Cocchi, L. , Gollo, L. L. , van den Heuvel, M. P. , & Breakspear, M. (2016). Connectome sensitivity or specificity: Which is more important? Neuroimage, 142, 407–420. 10.1016/j.neuroimage.2016.06.035 [DOI] [PubMed] [Google Scholar]
- Zalesky, A. , Fornito, A. , Harding, I. H. , Cocchi, L. , Yücel, M. , Pantelis, C. , & Bullmore, E. T. (2010). Whole‐brain anatomical networks: Does the choice of nodes matter? Neuroimage, 50(3), 970–983. 10.1016/j.neuroimage.2009.12.027 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon request.