Author manuscript; available in PMC: 2021 Feb 1.
Published in final edited form as: Neuroimage. 2019 Oct 11;206:116276. doi: 10.1016/j.neuroimage.2019.116276

Deep Neural Networks and Kernel Regression Achieve Comparable Accuracies for Functional Connectivity Prediction of Behavior and Demographics

Tong He 1,2, Ru Kong 1,2, Avram J Holmes 3, Minh Nguyen 1,2, Mert R Sabuncu 4, Simon B Eickhoff 5,6, Danilo Bzdok 7,8,9, Jiashi Feng 2, BT Thomas Yeo 1,2,10,11,12
PMCID: PMC6984975  NIHMSID: NIHMS1542375  PMID: 31610298

Abstract

There is significant interest in the development and application of deep neural networks (DNNs) to neuroimaging data. A growing literature suggests that DNNs outperform their classical counterparts in a variety of neuroimaging applications, yet there are few direct comparisons of relative utility. Here, we compared the performance of three DNN architectures and a classical machine learning algorithm (kernel regression) in predicting individual phenotypes from whole-brain resting-state functional connectivity (RSFC) patterns. One of the DNNs was a generic fully-connected feedforward neural network, while the other two DNNs were recently published approaches specifically designed to exploit the structure of connectome data. By using a combined sample of almost 10,000 participants from the Human Connectome Project (HCP) and UK Biobank, we showed that the three DNNs and kernel regression achieved similar performance across a wide range of behavioral and demographic measures. Furthermore, the generic feedforward neural network exhibited similar performance to the two state-of-the-art connectome-specific DNNs. When predicting fluid intelligence in the UK Biobank, performance of all algorithms dramatically improved when sample size increased from 100 to 1000 subjects. Improvement was smaller, but still significant, when sample size increased from 1000 to 5000 subjects. Importantly, kernel regression was competitive across all sample sizes. Overall, our study suggests that kernel regression is as effective as DNNs for RSFC-based behavioral prediction, while incurring significantly lower computational costs. Therefore, kernel regression might serve as a useful baseline algorithm for future studies.

Keywords: Fingerprinting, deep learning, resting-state fMRI, graph convolutional neural network, kernel ridge regression

1. Introduction

Deep neural networks (DNNs) have enjoyed tremendous success in machine learning (Lecun et al., 2015). As such, there has been significant interest in the application of DNNs to neuroscience research. DNNs have been applied to neuroscience in at least two main ways. First, deep learning models have been used to simulate actual brain mechanisms, such as in vision (Khaligh-Razavi and Kriegeskorte, 2014; Yamins et al., 2014; Eickenberg et al., 2017) and auditory perception (Kell et al., 2018). Second, DNNs have been applied as tools to analyze neuroscience data, including lesion and tumor segmentation (Pinto et al., 2016; Havaei et al., 2017; Kamnitsas et al., 2017b; G. Zhao et al., 2018), anatomical segmentation (Wachinger et al., 2018; X. Zhao et al., 2018), image modality/quality transfer (Bahrami et al., 2016; Nie et al., 2017; Blumberg et al., 2018), image registration (Yang et al., 2017; Dalca et al., 2018), as well as behavioral and disease prediction (Plis et al., 2014; van der Burgh et al., 2017; Vieira et al., 2017; Nguyen et al., 2018).

Deep neural networks can perform well in certain scenarios where large quantities of data are unavailable, e.g., they have won multiple MICCAI predictive modeling challenges involving image segmentation (Choi et al., 2016; Kamnitsas et al., 2017a; Hongwei Li et al., 2018). Yet, the conventional wisdom is that DNNs perform especially well when applied to well-powered samples, for instance, the 14 million images in ImageNet (Russakovsky et al., 2015) or the Google 1 Billion Word Corpus (Chelba et al., 2014). However, in many neuroimaging applications, the available data often involve only hundreds or thousands of participants, while the associated feature dimensions can be significantly larger, such as entries of connectivity matrices with upwards of 100,000 edges. Consequently, we hypothesize that in certain neuroimaging applications, DNNs might not be the optimal machine learning choice (Bzdok and Yeo, 2017). Here, we investigated whether DNNs can outperform classical machine learning for behavioral prediction using resting-state functional connectivity (RSFC).

RSFC measures the synchrony of resting-state functional magnetic resonance imaging (rs-fMRI) signals between brain regions while participants lie at rest without any explicit task (Biswal et al., 1995; Fox and Raichle, 2007; Buckner et al., 2013). RSFC has been widely used to explore human brain organization and mental disorders (Smith et al., 2009; Assaf et al., 2010; Power et al., 2011; Yeo et al., 2011; Bertolero et al., 2017). For a given brain parcellation scheme (e.g., Shen et al., 2013; Gordon et al., 2016; Glasser et al., 2017; Eickhoff et al., 2018), the parcels can be used as regions of interest (ROIs), such that a whole-brain (or cortical) RSFC matrix can be computed for each participant. Each entry of the RSFC matrix corresponds to the strength of functional connectivity between two brain regions. In recent years, one of the most influential developments in neuroimaging has been the use of RSFC matrices for predicting the attributes (e.g., age or fluid intelligence) of individual participants (Dosenbach et al., 2010; Finn et al., 2015; Smith et al., 2015; Rosenberg et al., 2016; Dubois et al., 2018; Reinen et al., 2018; Weis et al., 2019). Consequently, many studies have developed new techniques to improve RSFC-based behavioral prediction (Amico and Goñi, 2018; Nostro et al., 2018; Parisot et al., 2018; Kashyap et al., 2019; Yoo et al., 2019).

In this work, we compared kernel regression with three DNN architectures in RSFC-based behavioral prediction. Kernel regression is a non-parametric classical machine learning algorithm (Murphy, 2012) that has previously been utilized in various neuroimaging prediction problems, including RSFC-based behavioral prediction (Raz et al., 2017; Zhu et al., 2017; Kong et al., 2019; Li et al., 2019). Our three DNN implementations included a generic, fully-connected feedforward neural network, and two state-of-the-art DNNs specifically developed for RSFC-based prediction (Kawahara et al., 2017; Parisot et al., 2017, 2018). An initial version of this study utilizing only the fluid intelligence measure in the HCP dataset has been previously presented at a workshop (He et al., 2018). By using RSFC data from nearly 10,000 participants and a broad range of behavioral (and demographic) measures from the HCP (Smith et al., 2013; Van Essen et al., 2013) and UK Biobank (Sudlow et al., 2015; Miller et al., 2016), this current extended study represents one of the largest empirical evaluations of DNN’s utility in RSFC-based fingerprinting.

2. Methods

2.1. Datasets

Two datasets were considered: the Human Connectome Project (HCP) S1200 release (Van Essen et al., 2013) and the UK Biobank (Sudlow et al., 2015; Miller et al., 2016). Both datasets contained multiple types of neuroimaging data, including structural MRI, rs-fMRI, and multiple behavioral and demographic measures for each subject.

The HCP S1200 release comprised 1,206 healthy young adults (age 22–35), of whom 1,094 had both structural MRI and rs-fMRI. Both structural MRI and rs-fMRI were acquired on a customized Siemens 3T "Connectome Skyra" scanner at Washington University in St. Louis. The structural MRI was 0.7mm isotropic. The rs-fMRI was 2mm isotropic with a TR of 0.72s and 1200 frames per run (14.4 minutes). Each subject had two sessions of rs-fMRI, and each session contained two rs-fMRI runs. A number of behavioral measures were also collected by the HCP. More details can be found elsewhere (Van Essen et al., 2012; Barch et al., 2013; Smith et al., 2013).

The UK Biobank is a prospective epidemiological study that recruited 500,000 adults (age 40–69) between 2006 and 2010 (Sudlow et al., 2015). 100,000 of these 500,000 participants will be brought back for multimodal imaging by 2022 (Miller et al., 2016). Here we considered an initial release of 10,065 subjects with both structural MRI and rs-fMRI data. Both structural MRI and rs-fMRI were acquired on harmonized Siemens 3T Skyra scanners at three UK Biobank imaging centres (Cheadle Manchester, Newcastle, and Reading). The structural MRI was 1.0mm isotropic. The rs-fMRI was 2.4mm isotropic with a TR of 0.735s and 490 frames per run (6 minutes). Each subject had one rs-fMRI run. A number of behavioral measures were also collected by the UK Biobank. More details can be found elsewhere (Elliott and Peakman, 2008; Sudlow et al., 2015; Miller et al., 2016; Alfaro-Almagro et al., 2018).

2.2. Preprocessing and RSFC

We utilized the ICA-FIX MSM-All grayordinate rs-fMRI data provided by the HCP S1200 release (HCP S1200 manual; Van Essen et al., 2012, 2013; Glasser et al., 2013; Smith et al., 2013; Griffanti et al., 2014; Salimi-Khorshidi et al., 2014). To eliminate residual motion and respiratory-related artifacts (Burgess et al., 2016), we performed further censoring and nuisance regression (Kong et al., 2019; Li et al., 2019). Runs with more than 50% censored frames were discarded (Pruett et al., 2015; Gordon et al., 2016; Smyser et al., 2016; Kong et al., 2019; Li et al., 2019). Figure S1 shows the distribution of the number of uncensored frames across subjects.

Consistent with previous studies from our group (Kebets et al., 2019; Li et al., 2019), we considered 400 cortical (Schaefer et al., 2018) and 19 sub-cortical (Fischl et al., 2002; Glasser et al., 2013) ROIs to ensure whole-brain coverage. The preprocessed rs-fMRI time courses were averaged across all grayordinate locations within each ROI. RSFC was then computed using Pearson’s correlation of the averaged time courses for each run of each subject (with the censored frames excluded for the computation). The RSFC was averaged across all runs, resulting in one 419 x 419 RSFC matrix for each subject.
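As a concrete illustration, the per-subject computation can be sketched as follows (a minimal sketch assuming NumPy; the function name and data layout are ours, and censored frames are assumed to have been removed already):

```python
import numpy as np

def rsfc_matrix(runs):
    # runs: list of (n_rois, n_frames) arrays of ROI-averaged time
    # courses, one per rs-fMRI run, with censored frames excluded.
    fc_per_run = [np.corrcoef(ts) for ts in runs]  # N x N Pearson's correlations
    return np.mean(fc_per_run, axis=0)             # average RSFC across runs
```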

In the case of the UK Biobank, we utilized the 55 x 55 RSFC (Pearson’s correlation) matrices provided by the Biobank (Miller et al., 2016; Alfaro-Almagro et al., 2018). The 55 ROIs were obtained from a 100-component whole-brain spatial-ICA (Beckmann and Smith, 2004), of which 45 components were considered to be artifactual (Miller et al., 2016).

2.3. FC-based prediction setup

We considered 58 behavioral measures across cognition, emotion and personality from the HCP (Table S1; Kong et al., 2019). By restricting the dataset to participants with at least one run that survived censoring and all 58 behavioral measures, we were left with 953 subjects, of whom 23, 67, 62 and 801 had 1, 2, 3 and 4 runs, respectively.

In the case of the UK Biobank, we considered four behavioral and demographic measures: age, sex, fluid intelligence and pairs matching (number of incorrect matches). By restricting the dataset to participants with 55 x 55 RSFC matrices and all four measures, we were left with 8868 subjects.

For both datasets, kernel regression and three DNNs were applied to predict the behavioral and demographic measures of individual subjects based on individuals’ RSFC matrices. More specifically, the RSFC data of each participant was summarized as an N x N matrix, where N is the number of brain ROIs. Each entry in the RSFC matrix represented the strength of functional connectivity between two ROIs. The entries of the RSFC matrix were then used as features to predict behavioral and demographic measures in individual participants.

2.4. Kernel ridge regression

Kernel regression (Murphy, 2012) is a non-parametric classical machine learning algorithm. Let y be the behavioral measure (e.g., fluid intelligence) and c be the RSFC matrix of a test subject. Let y_i be the behavioral measure (e.g., fluid intelligence) and c_i be the RSFC matrix of the i-th training subject. Roughly speaking, kernel regression predicts the test subject's behavioral measure as a weighted average of the behavioral measures of all training subjects: $y \approx \sum_{i \in \text{training set}} \text{Similarity}(c_i, c) \, y_i$, where $\text{Similarity}(c_i, c)$ is the similarity between the RSFC matrices of the test subject and the i-th training subject. Here, we simply set $\text{Similarity}(c_i, c)$ to be the Pearson's correlation between the lower triangular entries of matrices c_i and c, which is effectively a linear kernel. In practice, an l2 regularization term is needed to avoid overfitting (i.e., kernel ridge regression). The level of l2 regularization is controlled by the hyperparameter λ. More details are found in Appendix A1.
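To make the similarity concrete, a minimal sketch is shown below (assuming NumPy; the function name is ours, not from the released code):

```python
import numpy as np

def rsfc_similarity(C_i, C_test):
    # Pearson's correlation between the lower-triangular entries
    # (diagonal excluded) of two subjects' N x N RSFC matrices.
    tril = np.tril_indices(C_i.shape[0], k=-1)
    return np.corrcoef(C_i[tril], C_test[tril])[0, 1]
```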

2.5. Fully-connected neural network (FNN)

Fully-connected neural networks (FNNs) belong to a generic class of feedforward neural networks (Lecun et al., 2015) illustrated in Figure 1. An FNN takes in vector data as an input and outputs a vector. An FNN consists of several fully connected layers, each comprising multiple nodes. Data enters the FNN via the input layer nodes. Each node (except the input layer nodes) is connected to all nodes in the previous layer, and its value is a weighted sum of the node values from the previous layer. The weights are the trainable parameters of the FNN. The outputs of the hidden layer nodes typically pass through a nonlinear activation function, e.g., Rectified Linear Units (ReLU; f(x) = max(0,x)), while the output layer tends to be linear. The value at each output layer node typically represents a predicted quantity. Thus, FNNs (and neural networks in general) allow the prediction of multiple quantities simultaneously. In this work, the inputs to the FNN are the vectorized RSFC (i.e., lower triangular entries of the RSFC matrices) and the outputs are the behavioral or demographic variables we seek to predict. A minimal sketch of such an architecture is given after Figure 1.

Figure 1. Schematic of a feedforward neural network (FNN).

Figure 1.

An FNN takes in vectorized RSFC matrix entries as inputs and outputs behavioral or demographic predictions. An FNN consists of an input layer, several hidden layers (three layers are shown here) and an output layer. The number of nodes in the input layer is equal to the number of elements in the lower triangular portion of the RSFC matrix. The number of nodes in the output layer is typically equal to the number of behavioral measures we are predicting. The number of hidden layers and number of nodes in the hidden layers are among the many hyperparameters that have to be tuned.
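To make the architecture concrete, here is a minimal PyTorch sketch of a generic FNN regressor; the layer sizes and dropout rate are illustrative placeholders, not the tuned architecture reported in this study:

```python
import torch.nn as nn

def make_fnn(n_edges, n_targets, hidden=(256, 128, 64), dropout=0.5):
    # n_edges: number of lower-triangular RSFC entries (input layer size).
    # n_targets: number of behavioral measures predicted (output layer size).
    layers, d_in = [], n_edges
    for h in hidden:
        layers += [nn.Linear(d_in, h), nn.ReLU(), nn.Dropout(dropout)]
        d_in = h
    layers.append(nn.Linear(d_in, n_targets))  # linear output layer
    return nn.Sequential(*layers)

# For the 419-ROI parcellation: 419 * 418 / 2 = 87,571 input edges.
fnn = make_fnn(n_edges=87571, n_targets=58)
```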

2.6. BrainNetCNN

One potential weakness of the FNN is that it does not exploit the (mathematical and neurobiological) structure of the RSFC matrix, e.g., the RSFC matrix is symmetric, positive definite and represents a network. On the other hand, BrainNetCNN (Kawahara et al., 2017) is a DNN specially designed for connectivity data, illustrated in Figure 2. BrainNetCNN allows the application of convolution to connectivity data, resulting in significantly fewer trainable parameters than the FNN, which should theoretically ease training and reduce overfitting. In this work, the input to the BrainNetCNN is the N × N RSFC matrix and the outputs are the behavioral or demographic variables we seek to predict.

Figure 2. Schematic of the BrainNetCNN (Kawahara et al., 2017).

Figure 2.

The BrainNetCNN takes in the RSFC matrix as an input and outputs behavioral or demographic predictions. BrainNetCNN consists of four types of layers: the Edge-to-Edge (E2E) layer, the Edge-to-Node (E2N) layer, the Node-to-Graph (N2G) layer, and a final fully connected (linear) layer. The number of E2E layers can be zero or more. On the other hand, there is exactly one E2N layer and one N2G layer. The number of convolution filters and the number of nodes in these layers are among the many hyperparameters that have to be tuned.

The BrainNetCNN takes in any connectivity matrix directly as an input and outputs behavioral or demographic predictions. Kawahara et al. (2017) used this model for predicting age and neurodevelopmental outcomes from structural connectivity data. BrainNetCNN consists of four types of layers: Edge-to-Edge (E2E) layer, Edge-to-Node (E2N) layer, Node-to-Graph (N2G) layer and a final fully connected (linear) layer. The first three types of layers are specially designed layers introduced in the BrainNetCNN. The final fully connected layer is the same as that used in FNNs.

The Edge-to-Edge (E2E) layer is a convolution layer using cross-shaped filters (Figure 2). The cross-shaped filter is applied to each element of the input matrix. Thus, for each filter, the E2E layer takes in an N × N matrix and outputs an N × N matrix. The number of E2E layers is arbitrary and is a tunable hyperparameter. The outputs of the final E2E layer are the inputs to the E2N layer. The E2N layer is similar to the E2E layer, except that the cross-shaped filter is applied only to the diagonal entries of the input matrix. Thus, for each filter, the E2N layer takes in an N × N matrix and outputs an N × 1 vector. There is one E2N layer in BrainNetCNN. The outputs of the E2N layer are the inputs to the Node-to-Graph (N2G) layer. The N2G layer is simply a fully connected hidden layer similar to an FNN's hidden layer. Finally, the outputs of the N2G layer are linearly summed by the final fully connected layer to produce the final set of prediction values. A minimal sketch of the E2E layer follows.
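To illustrate the cross-shaped convolution, here is a minimal PyTorch sketch of an E2E layer, expressed as the broadcast sum of a 1 × N row filter and an N × 1 column filter (our own simplified re-expression, not the authors' released code):

```python
import torch.nn as nn

class E2E(nn.Module):
    # For each entry (i, j), the cross-shaped filter combines a weighted
    # sum over row i with a weighted sum over column j.
    def __init__(self, in_ch, out_ch, n_rois):
        super().__init__()
        self.row = nn.Conv2d(in_ch, out_ch, kernel_size=(1, n_rois))
        self.col = nn.Conv2d(in_ch, out_ch, kernel_size=(n_rois, 1))

    def forward(self, x):   # x: (batch, in_ch, N, N)
        r = self.row(x)     # (batch, out_ch, N, 1): weighted row sums
        c = self.col(x)     # (batch, out_ch, 1, N): weighted column sums
        return r + c        # broadcasts back to (batch, out_ch, N, N)
```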

2.7. Graph convolutional neural network (GCNN)

Standard convolution applies to data that lie on a Euclidean grid (e.g., images). Graph convolution exploits the graph Laplacian to generalize standard convolution to data lying on the nodes of a graph. This allows the extension of the standard CNN to graph convolutional neural networks (GCNNs; Defferrard et al., 2016; Bronstein et al., 2017; Kipf and Welling, 2017). There are many different ways that GCNNs can be applied to neuroimaging data (Kipf and Welling, 2017; Ktena et al., 2018; Zhang et al., 2018). Here we considered the innovative GCNN developed by Kipf and Welling (2017) and extended to neuroimaging data by Parisot and colleagues (Parisot et al., 2017, 2018). Figure 3 illustrates this approach.

Figure 3. Schematic of a graph convolution neural network (GCNN; Parisot et al., 2017, 2018).

Figure 3.

This particular GCNN takes in vectorized RSFC matrices of all subjects as input and outputs behavioral (or demographic) prediction of all subjects. (A) Vectorized FC of all subjects (subject 1 to subject n). (B) The input of GCNN is a graph, where each node represents a subject and is associated with the vectorized FC from the corresponding subject in (A). An edge in the graph represents the similarity between two subjects. Here, the similarity is defined in terms of the similarity of the subjects’ RSFC matrices. (C) Output of the first graph convolutional layer. Graph convolutional layer extends standard convolution to graph convolution. Each node is associated with a vector, whose length is determined by the number of filters in the first graph convolutional layer. (D) Final output of GCNN after one or more graph convolutional layers. Each node contains the predicted behavioral measure(s).

The input to an FNN (Figure 1) or a BrainNetCNN (Figure 2) is the RSFC data of a single subject. By contrast, the GCNN takes in data (e.g., vectorized RSFC) of all subjects as input and outputs behavioral (or demographic) predictions of all subjects (Parisot et al., 2017, 2018). In other words, data from the training, validation, and testing sets are all input into the GCNN at the same time. To avoid leakage of information across training, validation and test sets, masking of data is applied during the calculation of the loss function and gradient descent.
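A minimal sketch of this masking (our illustration; variable names are assumptions) shows how only training-set nodes contribute to the loss even though all subjects pass through the network:

```python
import torch

def masked_mse(pred, target, train_mask):
    # pred, target: (n_subjects, n_targets); train_mask: boolean (n_subjects,).
    # Validation and test nodes are present in the graph but excluded here,
    # so gradients reflect training subjects only.
    diff = pred[train_mask] - target[train_mask]
    return (diff ** 2).mean()
```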

More importantly, the graph in the GCNN does not represent connectivity matrices (as in BrainNetCNN). Instead, each node represents a subject and edges are determined by the similarity between subjects. This similarity is problem dependent. For example, in the case of autism spectrum disorder (ASD) classification, similarity between two subjects was defined based on sex, site and RSFC, i.e., two subjects are more similar if they have the same sex, are from the same site and have similar RSFC patterns (Parisot et al., 2017, 2018). The use of sex and site in the graph definition was particularly important for this specific application, since ASD is characterized by strong sex-specific effects and the database included data from multiple unharmonized sites (Di Martino et al., 2014).

Similar to the original studies (Parisot et al., 2017, 2018), we utilized vectorized RSFC (lower triangular entries of the RSFC matrix) of all subjects as inputs to the GCNN. Edges between subjects were defined based on Pearson’s correlation between lower triangular portions of RSFC matrices.
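For reference, here is a minimal dense-matrix sketch of the Kipf and Welling (2017) graph convolution propagation rule underlying this GCNN (our simplified illustration, not the released implementation):

```python
import torch

def gcn_layer(A, H, W):
    # A: (n, n) subject-similarity adjacency matrix; H: (n, d_in) node
    # features (e.g., vectorized RSFC); W: (d_in, d_out) trainable weights.
    A_hat = A + torch.eye(A.shape[0])                    # add self-connections
    d_inv_sqrt = A_hat.sum(dim=1).pow(-0.5)
    A_norm = d_inv_sqrt[:, None] * A_hat * d_inv_sqrt[None, :]
    return torch.relu(A_norm @ H @ W)                    # renormalized propagation
```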

2.8. HCP training, validation and testing

For the HCP dataset, 20-fold cross-validation was performed. The 953 subjects were divided into 20 folds, such that family members were not split across folds. Inner-loop cross-validation was performed for hyperparameter tuning. More specifically, for a given test fold, cross-validation was performed on the remaining 19 folds with different hyperparameters. The best hyperparameters were then used to train on the 19 folds. The trained model was then applied to the test fold. This was repeated for all 20 test folds.
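A minimal sketch of such a family-aware split (assuming scikit-learn; X are features, y a behavioral measure and family_ids the family labels) is:

```python
from sklearn.model_selection import GroupKFold

def family_aware_folds(X, y, family_ids, n_folds=20):
    # GroupKFold guarantees that subjects sharing a family label
    # never appear in both the training and test folds.
    gkf = GroupKFold(n_splits=n_folds)
    return list(gkf.split(X, y, groups=family_ids))
```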

In the case of kernel regression, there was only a single hyperparameter λ (which controls the l2 regularization; see Appendix A1). A separate value was tuned for each fold and each behavioral measure based on a grid search over the hyperparameter.

In the case of the DNNs, there was a large number of hyperparameters, e.g., number of layers, number of nodes, number of training epochs, dropout rate, optimizer (e.g., stochastic gradient descent or ADAM), weight initialization, activation functions, regularization, etc. GCNN also had additional hyperparameters to tune, e.g., the definition of the graph and the graph Laplacian estimation. Therefore, instead of training a separate DNN for each behavioral measure, a single FNN (or BrainNetCNN or GCNN) was trained to predict all 58 behavioral measures jointly, because tuning hyperparameters separately for each behavioral measure would have been too time consuming. We note that the joint prediction of multiple behavioral measures might not be a disadvantage for the DNNs and might even improve prediction performance because of shared structure among the target behavioral variables (Rahim et al., 2017). Furthermore, we tried tuning each DNN (FNN, BrainNetCNN or GCNN) for only fluid intelligence, but the performance for fluid intelligence prediction was not better than when predicting all 58 behavioral measures simultaneously.

Furthermore, a proper inner-loop 20-fold cross-validation would involve tuning the hyperparameters for each DNN 20 times (once for each split of the data into training-test folds), which was computationally prohibitive. Thus, for each DNN (FNN, BrainNetCNN and GCNN), we tuned the hyperparameters once, using the first split of the data into training-test folds, and simply re-used the optimal hyperparameters for the remaining training-test splits of the data. Such a procedure biases the prediction performance in favor of the DNNs (relative to kernel regression), so the results should be interpreted accordingly (see Discussion). Such a bias is avoided in the UK Biobank dataset (see below). Further details about DNN hyperparameters are found in Appendix A2.

As is common in the FC-based prediction literature (Finn et al., 2015), model performance was evaluated based on the Pearson’s correlation between predicted and actual behavioral measures across subjects within each test fold. Furthermore, since certain behavioral measures were correlated with motion (Siegel et al., 2017), age, sex, and motion (FD) were regressed from the behavioral measures from the training and test folds (Kong et al., 2019; Li et al., 2019). Regression coefficients were estimated from the training folds and applied to the test folds. Mean absolute error (MAE) and coefficient of determination (COD) will also be reported.
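A minimal sketch of this deconfounding step (assuming scikit-learn; the function name and argument layout are ours) makes the train/test separation explicit:

```python
from sklearn.linear_model import LinearRegression

def deconfound(y_train, y_test, conf_train, conf_test):
    # conf_*: (n_subjects, 3) arrays with columns for age, sex and motion (FD).
    # Coefficients are estimated on the training folds only, then applied
    # to both training and test folds.
    reg = LinearRegression().fit(conf_train, y_train)
    return (y_train - reg.predict(conf_train),
            y_test - reg.predict(conf_test))
```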

2.9. UK Biobank training, validation and testing

The large UK Biobank dataset allowed us the luxury of splitting the 8868 subjects into training (N = 6868), validation (N = 1000) and test (N = 1000) sets, instead of employing an inner-loop cross-validation procedure like in the HCP dataset. Care was taken so that the distributions of various attributes (sex, age, fluid intelligence and pairs matching) were similar across training, validation and test sets.

Hyperparameters were tuned using the training and validation sets. The test set was only utilized to evaluate the final prediction performance. A separate DNN was trained for each of the four behavioral and demographic measures; thus, the hyperparameters were tuned independently for each measure. Further details about DNN hyperparameters are found in Appendix A2. Initial experiments using a single neural network to predict all four measures simultaneously (as in the HCP dataset) did not appear to improve performance, so this approach was not pursued further. In the case of kernel regression, the hyperparameter λ was tuned using the validation set based on a grid search over the hyperparameter.

Like before, prediction accuracies for age, fluid intelligence and pairs matching were evaluated based on the Pearson's correlation between predicted and actual measures across subjects within the test set. Since the age prediction literature often uses mean absolute error (MAE) as an evaluation metric (Liem et al., 2017; Cole et al., 2018; Varikuti et al., 2018), we also report MAE for age and, for completeness, for pairs matching and fluid intelligence.

In the case of sex, accuracy was defined as the fraction of participants whose sex was correctly predicted. Like before, we regressed age, sex and motion from the fluid intelligence and pairs matching measures in the training set and applied the regression coefficients to the validation and test sets. When predicting age and sex, no regression was performed. The coefficient of determination (COD) for age, pairs matching and fluid intelligence is also reported in the Supplemental Material.

2.10. Deep neural network implementation

The DNNs were implemented using Keras (Chollet, 2015) or PyTorch (Paszke et al., 2017) and run on NVIDIA Titan Xp GPU using CUDA. Our implementation of BrainNetCNN and GCNN were based on GitHub code from the original papers (Kawahara et al., 2017; Kipf and Welling, 2017). Our implementations achieved similar results as the original implementations when using the toy datasets and hyperparameters provided by the original GitHub implementations. More details about hyperparameter tuning can be found in Appendix A2.

2.11. Statistical tests

For the HCP dataset, we performed 20-fold cross-validation, yielding a prediction accuracy for each test fold. To compare two algorithms, the corrected resampled t-test was performed (Nadeau and Bengio, 2003; Bouckaert and Frank, 2004). The corrected resampled t-test accounts for the fact that the accuracies across test folds are not independent. In the case of the UK Biobank, there was only a single test fold, so the corrected resampled t-test could not be applied. Instead, when comparing correlations from two algorithms, Steiger's Z-test was utilized (Steiger, 1980). When comparing MAE, a two-tailed paired-sample t-test was performed. When comparing prediction accuracies for sex, McNemar's test was utilized (McNemar, 1947).
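For illustration, a minimal sketch of the corrected resampled t-test for K-fold cross-validation accuracies (our implementation of the Nadeau and Bengio correction; function and variable names are assumptions):

```python
import numpy as np
from scipy import stats

def corrected_resampled_ttest(acc_a, acc_b, n_train, n_test):
    # acc_a, acc_b: per-fold accuracies of the two algorithms.
    # The variance term is inflated by n_test / n_train to account
    # for the overlap between training sets across folds.
    d = np.asarray(acc_a) - np.asarray(acc_b)
    k = len(d)
    t = d.mean() / np.sqrt((1.0 / k + n_test / n_train) * d.var(ddof=1))
    return 2 * stats.t.sf(abs(t), df=k - 1)  # two-tailed p-value
```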

2.12. Scaling of prediction performance as a function of sample size

The large UK Biobank dataset allowed us to explore the effect of sample size on predicting fluid intelligence. The test set (N = 1000) was the same as before to allow for meaningful comparisons. We considered 100, 500, 1000, 2000, 3000, 4000, 5000, 6000 and 7868 subjects for training and validation. The case of 7868 subjects was identical to the analysis from the previous sections.

In the case of 3000, 4000, 5000 and 6000 subjects, the validation set comprised the same set of 1000 subjects as in the previous sections. The training set was obtained by randomly sampling the appropriate number of subjects from the original training set of 6868 participants. For example, in the case of 3000 training and validation subjects, we randomly sampled 2000 training subjects from the original training set. However, the training subjects were selected so that the distribution of fluid intelligence matched the distributions of the validation and test sets.

In the case of 100, 500, 1000 and 2000 subjects, we split the participants with a 3:1 ratio. For example, in the case of 100 subjects, there were 75 training and 25 validation subjects. Like before, the participants were randomly selected but we ensured the distributions of fluid intelligence in the training and validation sets were similar to the distribution of the test set.

The hyperparameter tuning for the three DNNs and kernel regression was the same as in previous sections. See Appendices A1 and A2 for more details.

2.13. Control analysis

We repeated our analyses using hyperparameters as close as possible to the original BrainNetCNN hyperparameters (provided by the BrainNetCNN code repository; Kawahara et al., 2017) and original GCNN hyperparameters (provided by the GCNN code repository; Parisot et al., 2017; 2018). In the case of FNN, we utilized hyperparameters as close as possible to the FC90net baseline in the BrainNetCNN paper (Kawahara et al., 2017).

2.14. Data and code availability

This study utilized publicly available data from the HCP (https://www.humanconnectome.org/) and UK Biobank (https://www.ukbiobank.ac.uk/). The 400 cortical ROIs (Schaefer et al., 2018) can be found here (https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/brain_parcellation/Schaefer2018_LocalGlobal). The kernel regression and DNNs code utilized in this study can be found here (https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/predict_phenotypes/He2019_KRDNN). The trained models for the UK Biobank dataset can also be found in the above GitHub link. The code was reviewed by one of the co-authors (MN) before merging into the GitHub repository to reduce the chance of coding errors.

3. Results

3.1. HCP behavioral prediction

Figure 4 shows the prediction accuracy (Pearson's correlation coefficient) averaged across 58 HCP behavioral measures and 20 test folds. Statistical tests were performed between kernel regression and the three DNNs (see Methods). False discovery rate correction (q < 0.05) was applied to account for multiple comparisons.

Figure 4. Prediction accuracy (Pearson’s correlation coefficient) averaged across 58 HCP behavioral measures and 20 test folds.

Figure 4.

Correlation was computed for each test fold and each behavior, and then averaged across the 58 behaviors. Bars show mean across test folds. Error bars show standard error of model performance across cross-validation folds. Kernel regression and FNN performed the best. There was no statistical difference between kernel regression and FNN or BrainNetCNN. Kernel regression was statistically better than GCNN (p = 3e-4).

FNN achieved the highest average prediction accuracy with Pearson's correlation r = 0.121 ± 0.063 (mean ± std), while kernel regression achieved an average prediction accuracy of r = 0.115 ± 0.036. However, there was no statistical difference between FNN and kernel regression (p = 0.60). Interestingly, BrainNetCNN (r = 0.114 ± 0.048) and GCNN (r = 0.072 ± 0.044) did not outperform FNN, even though these two DNNs were designed for neuroimaging data. Kernel regression was significantly better than GCNN (p = 3e-4), but not BrainNetCNN (p = 0.93).

For completeness, Figures 5, S2, and S3 show the prediction accuracies for all 58 behavioral measures. Figures S4 to S7 show scatterplots of predicted versus actual values for 13 cognitive measures. Kernel regression was significantly better than FNN for predicting grip strength (p = 2.65e-4) and significantly better than GCNN for predicting picture vocabulary (p = 6.91e-5). No other difference survived the FDR correction.

Figure 5. Prediction accuracies (Pearson’s correlation coefficient) in a curated set of 13 HCP cognitive measures averaged across 20 test folds.

Figure 5.

Correlation was computed for each test fold and each behavior. Bars show mean across test folds. Error bars show standard errors of model performance across cross-validation folds. Prediction accuracies of the remaining 45 behavioral measures are found in Figures S2 and S3.

Similar conclusions were obtained when using mean absolute error (Figure 6) and coefficient of determination (Figure S8) as measures of prediction performance.

Figure 6. Prediction MAE averaged across 58 HCP behavioral measures and 20 test folds.

Figure 6.

Lower is better. MAE was computed for each test fold and each behavior and then averaged across the 58 behaviors. Bars show the mean across test folds. Error bars show standard error of model performance across cross-validation folds. There was no statistical difference between kernel regression and all DNNs after correcting for multiple comparisons.

3.2. UK Biobank behavioral and demographics prediction

Table 1 and Figure 7 show the prediction performance of sex, age, pairs matching and fluid intelligence. Figure S9 shows the scatterplots of predicted versus actual values for age, pairs matching and fluid intelligence. Kernel regression, FNN, and GCNN achieved the highest accuracy for sex prediction. Kernel regression performed the best for fluid intelligence and age (measured using Pearson’s correlation). BrainNetCNN performed the best for age (measured using MAE) and pairs matching.

Table 1. Prediction performance of four behavioral and demographic measures in the UK Biobank.

For age (MAE), lower values imply better performance. For all the other measures, larger values imply better performance. Bold indicates the best performance, although it does not imply statistical significance. There was no statistical difference between kernel regression and the DNNs for all behavioral and demographic measures after correcting for multiple comparisons (q < 0.05). MAE refers to mean absolute error. Correlation refers to Pearson’s correlation. We note that simply predicting the median age in the training set would have yielded an MAE of 6.194.

Model             | Sex (Accuracy) | Age (Correlation) | Age (MAE) | Pairs matching (Correlation) | Fluid intelligence (Correlation)
Kernel Regression | 0.916          | 0.599             | 4.826     | 0.061                        | 0.239
FNN               | 0.916          | 0.599             | 4.899     | 0.045                        | 0.239
BrainNetCNN       | 0.914          | 0.598             | 4.824     | 0.067                        | 0.235
GCNN              | 0.916          | 0.593             | 4.895     | 0.008                        | 0.232

Figure 7. Prediction performance of four behavioral and demographic measures in the UK Biobank.

Figure 7.

For age (MAE), lower values imply better performance. For all the other measures, larger values imply better performance. The horizontal lines represent statistical tests between kernel regression and the DNNs. “n.s” stands for not significant after FDR (q < 0.05) correction.

Statistical tests were performed between kernel regression and the three DNNs (see Methods). False discovery rate correction (q < 0.05) was applied to account for multiple comparisons. There was no statistical difference between kernel regression and the DNNs for any behavioral or demographic measure.

Interestingly, the GCNN achieved poor performance in the case of pairs matching (Pearson’s correlation r = 0.008), although it was not statistically worse than kernel regression. Upon further investigation, we found that GCNN achieved an accuracy of r = 0.106 in the UK Biobank validation set. When using the initial set of hyperparameters (before hyperparameter tuning using HORD), GCNN achieved accuracies of r = 0.047 and r = 0.056 in the validation and test sets respectively. Overall, this suggests that the hyperparameter tuning overfitted the validation set, despite the rather large sample size.

Similar conclusions were obtained when using mean absolute error (MAE) as a performance measure for fluid intelligence and pairs matching (Table 2 and Figure S10), or when using coefficient of determination (COD) as a performance measure for age, pairs matching and fluid intelligence (Table S2).

Table 2. Prediction MAE of pairs matching and fluid intelligence in the UK Biobank.

Lower values imply better performance. Bold indicates the best performance. We note that simply predicting the median of the pairs matching value in the training set would have yielded an MAE of 0.400, which was better than kernel regression and all DNNs.

Model             | Pairs matching (MAE) | Fluid intelligence (MAE)
Kernel regression | 0.551                | 1.608
FNN               | 0.567                | 1.613
BrainNetCNN       | 0.553                | 1.610
GCNN              | 0.497                | 1.612
Median            | 0.400                | 1.656

3.3. Effect of sample size on predicting fluid intelligence in the UK Biobank

Figure 8 shows the prediction performance (Pearson’s correlation) of fluid intelligence in the UK Biobank as the training and validation sample sizes were varied, while the same test set of 1000 subjects was used throughout. All algorithms performed poorly with 100 subjects but improved with more subjects. There was more than 300% improvement when increasing the sample size from 100 to 1000 subjects and more than 35% improvement when increasing the sample size from 1000 to 5000 subjects. However, the improvement tapered off from 5000 to 7868 subjects. GCNN was highly volatile as the sample size was varied, suggesting its sensitivity to particular choices of training and validation subjects. Kernel regression was competitive across all sample sizes. Similar conclusions were obtained when MAE was used as a performance metric (Figure S11).

Figure 8. Prediction performance (Pearson’s correlation coefficient) of fluid intelligence in the UK Biobank dataset with different number of training and validation subjects.

Figure 8.

The performance of all algorithms generally increased with more training and validation subjects. In the case of 100, 500, 1000 and 2000 subjects, 3/4 of the subjects were used for training and 1/4 of the subjects were used for validation. In the remaining cases, 1000 subjects were used for validation, while the remaining subjects were used for training. For all cases, test set comprised the same set of 1000 subjects. Kernel regression was highly competitive across all sample sizes. See Figure S11 for MAE results.

3.4. Control analysis

Tables S3 and S4 show the performance of the DNNs using hyperparameters from the original publications (Kawahara et al., 2017; Parisot et al., 2017; 2018) versus our tuned hyperparameters. The performance of our hyperparameters compared favorably to the performance of the original hyperparameters. This is not surprising, since our hyperparameters were obtained by tuning using the datasets shown in this paper.

3.5. Computational costs

Kernel regression has a closed-form solution (Appendix A1) and only one hyperparameter, so its computational cost is extremely low. For example, kernel regression training and a grid search over 32 hyperparameter values in the UK Biobank validation set took about 20 minutes (single CPU core) for one behavioral measure. This is one reason why we considered kernel regression instead of slower classical approaches (e.g., support vector regression or elastic net) requiring iterative optimization. On the other hand, FNN training and hyperparameter tuning in the UK Biobank validation set took around 80 hours (single GPU) for one behavioral measure, excluding the man-hours required for manual tuning.

4. Discussion

In this study, we showed that kernel regression and DNNs achieved similar performance in RSFC-based prediction of a wide range of behavioral and demographic measures across two large-scale datasets totaling almost 10,000 participants. Furthermore, FNN performed as well as the two DNNs that were specifically designed for connectome data. Given comparable performance between kernel regression and the DNNs and the significantly greater computational costs associated with DNNs, our results suggest that kernel regression might be more suitable than DNNs in some neuroimaging applications.

4.1. Potential reasons why DNNs did not outperform kernel regression for RSFC-based prediction

There are a few potential reasons why DNNs did not outperform kernel regression in our experiments on RSFC-based behavioral prediction. First, while the human brain is nonlinear and hierarchically organized (Deco et al., 2011; Breakspear, 2017; Wang et al., 2019), such a structure might not be reflected in the RSFC matrix in a way that was exploitable by the DNNs we considered. This could be due to the measurements themselves (Pearson’s correlations of rs-fMRI timeseries), the particular representation (N x N connectivity matrices) or particular choices of DNNs, although we again note that BrainNetCNN and GCNN were specifically developed for connectome data.

Second, given the much larger datasets used in computer vision and natural language processing (Chelba et al., 2014; Russakovsky et al., 2015), it is possible that there was not enough neuroimaging data (even in the UK Biobank) to fully exploit DNNs. However, our experiments show that kernel regression was highly competitive across all sample sizes from 100 to 7868 subjects. In fact, all approaches (except GCNN) improved in almost lockstep with greater sample size, suggesting that even larger sample sizes might benefit DNNs and kernel regression equally.

Third, it is well-known that hyper-parameter settings and architectural details can impact the performance of DNNs. Thus, it is possible that the benchmark DNNs we implemented in this work can be further optimized. However, we do not believe this would alter our conclusions for two reasons. First, for some measures (e.g., sex classification in the UK Biobank), we were achieving performance at or near the state-of-the-art. Second, an earlier version of this paper relied completely on manual tuning of hyperparameters. In the current version of this paper, we utilized an automatic algorithm to tune a subset of hyperparameters for the UK Biobank experiments (Appendix A2), yielding essentially the same conclusions.

It is also worth pointing out that while deep learning has won several predictive modeling challenges, these have mostly involved image segmentation (Choi et al., 2016, Kamnitsas et al., 2017a, Hongwei Li et al., 2018). The success of DNNs has been less clear in other neuroimaging challenges. For example, in the 2019 ABCD challenge to predict fluid intelligence from structural MRI, kernel regression was the winner, beating other deep learning algorithms (Mihalik et al., 2019). Similarly, in the recent TADPOLE challenge to predict Alzheimer’s Disease progression (Marinescu et al., 2018), the top entry did not utilize deep learning (https://tadpole.grand-challenge.org/Results/).

4.2. Hyperparameters

There are significantly more hyperparameters in DNNs compared with classical machine learning approaches. For example, for a fixed kernel (e.g., correlation metric in our study), kernel regression has one single regularization parameter. Even with a nonlinear kernel (e.g. radial basis function), there would only be two hyperparameters. This is in contrast to DNNs, where there are easily more than ten hyperparameters.

Because of the large number of hyperparameters, most applications involving DNNs currently require some level of manual hyperparameter tuning. Therefore, we suggest that manual hyper-parameter tuning should be performed within a training-validation-test framework (like in our UK Biobank experiments), rather than a nested (inner-loop) cross-validation framework (like in HCP experiments). The reason is that within a nested (inner-loop) cross-validation framework, information from tuning one fold might leak to another fold (via the person tuning the hyperparameters).

To elaborate, recall that we divided the HCP dataset into 20 folds. We tuned the hyperparameters of the DNNs using folds 2 to 20 and applied the trained DNNs to fold 1. Since fold 1 was not used in tuning the hyperparameters, the performance of the DNNs in fold 1 was unbiased. However, when fold 2 became the test fold, we utilized the same hyperparameters to train using folds 1, 3 to 20. This is problematic because fold 2 was originally utilized to tune the hyperparameters, so consequently the performance of the DNNs in test fold 2 was inflated.

One could try to tune the hyperparameters for each fold independently. However, complete independence between folds is unlikely because the person performing the manual tuning cannot possibly forget his/her tuning experience with the other folds. As such, this would still yield overly optimistic results.

On the other hand, the test set in the UK Biobank was only utilized after the hyperparameters had been determined from the training and validation sets. Therefore, the performance of the DNNs was unbiased. It is worth noting that our motivation for advocating the training-validation-test framework is to prevent overly optimistic results in the test set; it does not necessarily eliminate overfitting. For example, in the case of pairs matching in the UK Biobank, our tuning procedure overfitted the validation set, yielding poor performance in the test set (Table 1). Thus, the overfitting was "caught" in the test set, which highlights the benefits of adopting a training-validation-test framework.

Finally, we note that there are generally too many DNN hyperparameters (and design choices) to be listed in a paper. In fact, there were hyperparameters too complex to completely specify in this paper. However, we have made our code publicly available, so researchers can refer to the code for the exact hyperparameters. We encourage future neuroimaging DNN studies to also make their code publicly available.

4.3. Prediction performance in the literature

Comparing our prediction performance with the literature is difficult because of different datasets, sample sizes, cross-validation procedures and de-confounding strategies. For example, we regressed age, sex, and motion (FD) from the behavioral measures, but other studies might not perform any regression or use a different set of regressors. Nevertheless, we believe that our prediction performance is generally consistent with the literature.

As mentioned earlier, our sex prediction accuracy of 91.6% in the UK Biobank is among the best in the literature. For example, Ktena and colleagues (2018) reported a sex prediction accuracy of around 80% when using 55 x 55 functional connectivity matrices from 2500 UK Biobank subjects. On the other hand, Chekroud and colleagues (2016) reported sex prediction accuracy of 93% when using cortical thickness and subcortical volumes of 1566 subjects from the Brain Genomics Superstruct Project (Holmes et al., 2015).

In the case of fluid intelligence, our prediction accuracies (Pearson's correlation) ranged from around 0.257 to 0.297 (excluding GCNN, which performed poorly) in the HCP dataset. Although earlier RSFC-based behavioral prediction studies reported high fluid intelligence prediction accuracy in the HCP dataset (Finn et al., 2015), newer studies using more subjects reported lower accuracies comparable with our results. For example, Dubois and colleagues (2018) reported a prediction accuracy (Pearson's correlation) of 0.27 for fluid intelligence in the HCP dataset, while Greene and colleagues (2018) reported a prediction accuracy of 0.17 (but only using data from a single resting-fMRI session). Thus, our prediction accuracies for fluid intelligence are consistent with the literature.

In the case of age prediction, we achieved a prediction accuracy (Pearson’s correlation) of 0.6 and an MAE of 4.8 in the UK Biobank dataset. Comparing these results with the literature is difficult because of sensitivity to age range in the dataset. For example, many studies utilized either lifespan (Cole et al., 2017; Liem et al., 2017) or developmental (Sturmfels et al., 2018; Nielsen et al., 2019) cohorts, while the UK Biobank comprised older adults (more than 45 years old). Furthermore, many studies preferred to use structural MRI, instead of RSFC, for predicting age (Cole et al., 2017; Sturmfels et al., 2018; Varikuti et al., 2018). Liem and colleagues (2017) achieved MAEs ranging from 5.25 to 5.99 when using RSFC for predicting age in a lifespan dataset comprising 2354 subjects, which was worse than our MAE. On the other hand, their prediction accuracies (Pearson’s correlation) ranged from 0.79 to 0.93, which was better than our prediction accuracy (Pearson’s correlation). Overall, this suggests that our prediction performance is probably comparable with other RSFC studies, although we emphasize that comparing age prediction performance across datasets is non-trivial.

It is important to mention that prediction performance was poor for a number of target variables across all four prediction algorithms. For example, in the case of pairs matching in the UK Biobank dataset, predicting the median of the training set yielded lower MAE than all four models, suggesting that pairs matching is not an easily predictable trait using RSFC. Therefore, it might not be meaningful to compare the models for pairs matching. On the other hand, we note that for both age and fluid intelligence prediction, all four models performed better than predicting the median of the training set. Similarly, sex prediction was a lot better than chance, given that there were roughly equal number of males and females in the dataset. For these three target variables (age, sex and fluid intelligence), all four models exhibited very similar performance.

It is also worth noting that the poor average COD in the HCP dataset is consistent with the literature. For example, of the 58 behavioral measures, 48 of them were also utilized in the HCP MegaTrawl (https://db.humanconnectome.org/megatrawl/). For the 300-dimensional group-ICA results, HCP MegaTrawl achieved an average COD of −0.177 (original data space), while kernel regression in the current study achieved an average COD of −0.0875. Overall, this suggests that certain target variables are not easily predicted using RSFC.

4.4. Limitations and caveats

Although the current study suggests that kernel regression and DNNs achieved similar performance for RSFC-based behavioral prediction, it is possible that other DNNs (we have not considered) might outperform kernel regression. Furthermore, our study focused on the use of N x N static RSFC matrices for behavioral prediction. Other RSFC features, such as dynamic RSFC features (Calhoun et al., 2014; Preti et al., 2017; Liégeois et al., 2019), in combination with DNNs might potentially yield better performance (Hongming Li et al., 2018; Khosla et al., 2019).

We also note that our evaluation procedure was performed on the HCP and UK Biobank datasets independently. Therefore, we expect the reported prediction performance to be maintained if new participants were recruited in the respective studies under the same experimental conditions (e.g., no scanner upgrade, same population, same acquisition protocol and preprocessing, etc). However, the reported prediction performance would likely drop if the trained models (from the UK Biobank or HCP) were applied to other datasets (Arbabshirani et al., 2017; Woo et al., 2017). At this point, it is unclear which approach (kernel regression, FNN, BrainNetCNN or GCNN) would generalize better to a completely new dataset. This is obviously an active area of research given the increasing number of large-scale publicly available brain imaging datasets.

5. Conclusion

By using a combined sample of nearly 10,000 participants, we showed that kernel regression and three types of DNN architectures achieved similar performance for RSFC-based prediction of a wide range of behavioral and demographic measures. Overall, our study suggests that kernel regression might be just as effective as DNNs for certain neuroimaging applications, while incurring significantly lower computational costs.

Supplementary Material


Acknowledgment

We would like to thank the anonymous reviewers for their very helpful feedback. We would also like to thank Christine Annette, Taimoor Akhtar, and Li Zhenhua for their help with the HORD algorithm. This work was supported by Singapore MOE Tier 2 (MOE2014-T2–2-016), NUS Strategic Research (DPRT/944/09/14), NUS SOM Aspiration Fund (R185000271720), Singapore NMRC (CBRG/0088/2015), NUS YIA and the Singapore National Research Foundation (NRF) Fellowship (Class of 2017). Our research also utilized resources provided by the Center for Functional Neuroimaging Technologies, P41EB015896 and instruments supported by 1S10RR023401, 1S10RR019307, and 1S10RR023043 from the Athinoula A. Martinos Center for Biomedical Imaging at the Massachusetts General Hospital. Our computational work was partially performed on resources of the National Supercomputing Centre, Singapore (https://www.nscc.sg). The Titan Xp GPUs used for this research were donated by the NVIDIA Corporation. This research has been conducted using the UK Biobank resource under application 25163 and Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

Appendix

A1. Kernel Regression

In this section, we describe kernel regression in detail (Liu et al., 2007; Murphy, 2012). The kernel matrix K encodes the similarity between pairs of subjects. Motivated by Finn and colleagues (2015), the i-th row and j-th column of the kernel matrix is defined as the Pearson’s correlation between the i-th subject’s vectorized RSFC and j-th subject’s vectorized RSFC (considering only the lower triangular portions of the RSFC matrices). The behavioral measure yi of subject i can be written as:

$y_i = \sum_{j=1}^{M} \alpha_j K(c_i, c_j) + e_i \quad (1)$

where c_i is the vectorized RSFC of the i-th subject, K(c_i, c_j) is the element at the i-th row and j-th column of the kernel matrix, M is the total number of training subjects, e_i is the noise term, and α_j is a trainable weight. The goal of kernel regression is to find an optimal set of weights α. To achieve this goal, we minimize the cost function:

$J = \frac{1}{2} \sum_{i=1}^{M} \left\{ y_i - \sum_{j=1}^{M} \alpha_j K(c_i, c_j) \right\}^2 \quad (2)$

with respect to $\alpha = [\alpha_1, \alpha_2, \ldots, \alpha_M]^T$. To avoid overfitting, an l2 regularization term can be added (i.e., kernel ridge regression), so the resulting optimization problem becomes:

$\alpha^{*} = \operatorname*{arg\,min}_{\alpha} \; \frac{1}{2} (y - K\alpha)^{T} (y - K\alpha) + \frac{\lambda}{2} \alpha^{T} K \alpha \quad (3)$

where K is the M × M kernel matrix, $y = [y_1, y_2, \ldots, y_M]^T$, and λ is a hyperparameter controlling the l2 regularization. By solving equation (3) with respect to α, we can predict a test subject's behavioral measure y_s as:

$y_s = K_s \alpha^{*} = K_s (K + \lambda I)^{-1} y \quad (4)$

where $K_s = [K(c_s, c_1), K(c_s, c_2), \ldots, K(c_s, c_M)]$.

In the case of the HCP, λ was selected via inner-loop cross-validation. In the case of the UK Biobank, λ was tuned on the validation set. For sex prediction in the UK Biobank, each continuous prediction y_sex was converted to a male/female label based on whether it was larger or smaller than a threshold. We tuned the threshold to obtain the best accuracy on the UK Biobank validation set and used this threshold in the test set.
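For completeness, a minimal sketch of this closed-form solution with the correlation (linear) kernel is given below. It assumes NumPy, with rows of vectorized lower-triangular RSFC as inputs; the function names are ours, not from the released code:

```python
import numpy as np

def corr_kernel(X, Z):
    # Pearson's correlation between every row of X and every row of Z.
    Xc = (X - X.mean(1, keepdims=True)) / X.std(1, keepdims=True)
    Zc = (Z - Z.mean(1, keepdims=True)) / Z.std(1, keepdims=True)
    return Xc @ Zc.T / X.shape[1]

def krr_predict(X_train, y_train, X_test, lam):
    K = corr_kernel(X_train, X_train)                           # M x M kernel
    alpha = np.linalg.solve(K + lam * np.eye(len(K)), y_train)  # (K + λI)^-1 y
    return corr_kernel(X_test, X_train) @ alpha                 # K_s α*, eq. (4)
```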

A2. More details of deep neural networks

In this section, we describe further details of our DNN implementation.

  • For GCNN, we adopted Keras code from the GCNN GitHub repository (https://github.com/tkipf/keras-gcn; Kipf and Welling, 2017). We made some minor modifications to the code, e.g., the modified code directly loaded the graph adjacency matrix, instead of loading the edges and generating the adjacency matrix. As another example, our graph convolution layer loaded the graph matrix as parameters rather than as an input. However, we emphasize that the core functionalities (e.g., graph convolution) remained unchanged. As a sanity check, we applied our modified code to the original toy data using the original hyperparameters provided by the original GitHub repository. Our results were comparable to the original implementation (Table S5).

  • The original BrainNetCNN implementation used the Caffe framework (https://github.com/jeremykawahara/ann4brains; Kawahara et al., 2017). We re-implemented BrainNetCNN in Keras and PyTorch, following the original Caffe code as closely as possible. The Keras version was applied to the HCP data, while the PyTorch version was applied to the UK Biobank data. The reason for this inconsistency was that, after our experiments with the HCP dataset, we realized that the Keras framework yielded slightly different results each time the code was run, a known non-determinism issue with the framework. We therefore implemented a second version in PyTorch, which was then applied to the UK Biobank. As a sanity check, we applied both implementations (Keras and PyTorch) to the original toy data using the original hyperparameters provided in the original GitHub repository. Our implementations achieved comparable results with the original implementation (Table S6).

  • In the case of the FNN, since it is a generic feedforward neural network, we implemented it using standard layers in Keras and PyTorch. The Keras version was applied to the HCP data, while the PyTorch version was applied to the UK Biobank data. The reason for this inconsistency is the same as in the previous bullet point. A minimal sketch of such a generic FNN is shown after this list.

  • Representative learning curves for the HCP dataset are shown in Figure S12. Learning curves for the UK Biobank are shown in Figures S13 to S15. The training curves showed good accuracy/error, suggesting that we were not underfitting the data. The validation curves plateaued, suggesting that we were not stopping training too early. Since the validation and test curves progressed almost in lockstep (except for certain instances of GCNN), our stopping criterion (based on the peaks of the validation curves) was reasonable. For most behavioral measures, there were relatively large gaps between the training and validation/test curves, suggesting overfitting, even though we had already deployed several standard strategies to reduce overfitting, including dropout, L2 regularization/weight decay and batch normalization.
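Because the FNN is a generic feedforward network, it can be written compactly. Below is a minimal PyTorch sketch using the HCP architecture from Table 3 (hidden layers of 223, 128 and 192 nodes, 58 outputs) and assuming a 400-region parcellation (400 × 399 / 2 = 79,800 lower-triangular RSFC edges); the ReLU activation, learning rate and momentum are illustrative placeholders rather than the tuned values.

```python
import torch
import torch.nn as nn

class FNN(nn.Module):
    def __init__(self, n_edges, hidden_sizes, n_outputs, dropout=0.6):
        super().__init__()
        layers, in_dim = [], n_edges
        for h in hidden_sizes:
            # Dropout before each fully-connected layer, as described above.
            layers += [nn.Dropout(dropout), nn.Linear(in_dim, h), nn.ReLU()]
            in_dim = h
        layers += [nn.Dropout(dropout), nn.Linear(in_dim, n_outputs)]
        self.net = nn.Sequential(*layers)

    def forward(self, x):          # x: batch x n_edges vectorized RSFC
        return self.net(x)

# Assumed input size: 400-region parcellation -> 79,800 edges; 58 behaviors.
model = FNN(n_edges=79800, hidden_sizes=[223, 128, 192], n_outputs=58)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
loss_fn = nn.MSELoss()
```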

In the case of the HCP dataset:

  • For all three DNNs, all behavioral measures were z-normalized based on the training data. The loss function was mean squared error (MSE) and the optimizer was stochastic gradient descent (SGD). With the MSE loss, the output layer had 58 nodes (FNN and BrainNetCNN) or filters (GCNN), one per behavioral measure.

  • In the case of the main results (Figures 4, 5, S2 and S3), the hyperparameters were tuned manually by trial and error. Since each test fold contained 47 or 48 subjects, we simply set the batch size to 48 (except for GCNN, which utilized the whole dataset in a single mini-batch). We initialized with a default set of hyperparameters (e.g., learning rate = 0.01, dropout rate = 0.5, number of filters/nodes = 32) and then tuned the optimizer (learning rate, momentum, and learning rate decay), the layer structure (number of layers, number of nodes/filters), the dropout rate, regularization and weight initialization. There was no fixed order for hyperparameter tuning; we generally started with the layer structure, followed by the optimizer and then the remaining hyperparameters. For GCNN, we also tuned the graph-related hyperparameters at the beginning of the tuning process.

  • The final FNN structure is shown in Table 3. Dropout of 0.6 was applied before each fully-connected layer, and L2 regularization of 0.02 was applied to layer 2.

  • The final BrainNetCNN structure is shown in Table 4. Dropout of 0.4 was applied after the E2N layer. LeakyReLU (Maas et al., 2013) with an alpha of 0.3 was used as the activation function for the first three layers.

  • The final GCNN structure is shown in Table 5. Dropout of 0.3 was applied to each layer, and L2 regularization of 8e-4 was applied to layer 1. The nodes of the graph corresponded to subjects, and edges were constructed based on Pearson's correlation between subjects' vectorized RSFC. The graph was thresholded by retaining only the edges in the top 5% of correlations (across the entire graph). Because this could result in a disconnected graph, the top five correlated edges of each node were also retained (even if these edges were not among the top 5%). The graph convolution filters were estimated using a 5-degree Chebyshev polynomial (Defferrard et al., 2016). A sketch of this graph construction is shown after this list.
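To make the graph construction concrete, the following is a minimal sketch under the assumptions above: nodes are subjects, edges are inter-subject Pearson correlations, and the graph keeps the top 5% of correlations overall plus each subject's five strongest neighbors. The function name build_adjacency is illustrative, and details such as the handling of ties and negative correlations may differ from the released code.

```python
import numpy as np

def build_adjacency(rsfc, pct=95, k=5):
    # rsfc: subjects x edges array of vectorized RSFC; graph nodes = subjects.
    A = np.corrcoef(rsfc)                   # inter-subject correlations
    np.fill_diagonal(A, 0.0)                # no self-loops
    keep = A >= np.percentile(A, pct)       # top 5% of correlations overall
    rows = np.arange(A.shape[0])[:, None]
    topk = np.argsort(A, axis=1)[:, -k:]    # each node's k strongest neighbors
    keep[rows, topk] = True                 # retain them even if below top 5%
    keep = keep | keep.T                    # keep the graph symmetric
    return np.where(keep, A, 0.0)
```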

Table 3. FNN architecture and hyperparameters for HCP and UK Biobank.

Under “Model architecture”, the numbers represent the number of nodes at each fully connected layer. For example, “256, 96, 256, 58” represents a 4-layer FNN with 256, 96, 256 and 58 nodes, respectively.

Dataset Predicting Model architecture Optimizer
HCP 58 behaviors 223, 128, 192, 58 SGD

UK Biobank Sex 3, 2 SGD
Age 9, 1 SGD
Pairs matching 415, 437, 1 SGD
Fluid intelligence 318, 357, 1 SGD

Table 4. BrainNetCNN architecture and hyperparameters for HCP and UK Biobank.

Under “Model architecture”, the numbers represent the number of filters or nodes at each layer. For example, “15, 93, 106, 2” represents a BrainNetCNN with 15 filters for the E2E layer, 93 filters for the E2N layer, 106 filters (nodes) for the N2G layer and 2 nodes in the final fully connected layer. All BrainNetCNNs follow the same layer order: E2E, E2N, N2G and then a final fully connected layer.

Dataset Predicting Model architecture Optimizer
HCP 58 behaviors 18, 19, 84, 58 SGD

UK Biobank Sex 38, 58, 7, 2 SGD
Age 22, 79, 91, 1 SGD
Pairs matching 27, 29, 54, 1 SGD
Fluid intelligence 40, 60, 41, 1 SGD

Table 5. GCNN architecture and hyperparameters for HCP and UK Biobank.

Under “Model architecture”, the numbers represent the number of filters for each graph convolutional layer. For example, “64, 1” represents a 2-layer GCNN with 64 and 1 filters, respectively.

Dataset Predicting Model architecture Optimizer
HCP 58 behaviors 256, 58 SGD

UK Biobank Sex 71, 2 Adam
Age 10, 1 SGD
Pairs matching 3, 1 Adam
Fluid intelligence 72, 1 Adam

In the case of the UK Biobank:

  • For all three DNNs, model ensembling was used to improve the final test results: for each DNN and each measure, five models were trained separately (with different random initializations), and their predictions were averaged to yield the final prediction (see the sketch after this list). All four behavioral and demographic measures were z-normalized based on the training data. The loss function for sex prediction was cross entropy, i.e., the output layer for sex prediction had 2 nodes (FNN and BrainNetCNN) or filters (GCNN). The loss function for the other three measures was MSE, with an output layer of 1 node (FNN and BrainNetCNN) or filter (GCNN). Adam (Kingma and Ba, 2015) or SGD was used as the optimizer; see Tables 3, 4 and 5 for details.

  • For all three DNNs, we utilized the HORD algorithm (Regis et al., 2013; Ilievski et al., 2017; Eriksson et al., 2019) to assist in hyperparameter tuning on the UK Biobank validation dataset. For each DNN, HORD automatically tuned the hyperparameters within user-specified ranges. Not all hyperparameters were tuned by HORD, because the speed and performance of HORD worsened when too many hyperparameters were tuned. Therefore, we fixed several hyperparameters based on our previous manual tuning experience: momentum = 0.9, batch size = 128 (except GCNN, whose batch size was 1 since it loads all data at once), weight initialization = Xavier uniform (PyTorch) or Glorot uniform (Keras), and Chebyshev polynomial filters of degree 1 for GCNN.

  • For FNN, we used HORD to tune the number of layers (2 to 4), the number of nodes in each layer (2 to 512), the dropout rate (0 to 0.8), the starting learning rate (1e-2 to 1e-4), the weight decay rate (1e-3 to 1e-7), and the number of epochs after which the learning rate was decreased (10 to 200).

  • For BrainNetCNN, we used HORD to tune the number of filters in the E2E (2 to 48), E2N (2 to 96) and N2G (2 to 128) layers, the dropout rate (0 to 0.8), the learning rate (1e-2 to 1e-4), the weight decay rate (1e-3 to 1e-7), and the number of epochs after which the learning rate was decreased (10 to 200).

  • For GCNN, we used HORD to tune the number of filters in the GCNN layer (2 to 128), the method used to generate the graph adjacency matrix, the dropout rate (0 to 0.8), the L2 regularization rate (1e-3 to 1e-7), and the learning rate (1e-2 to 1e-4).

  • For all DNNs, the model was tuned separately for each behavioral or demographic measure. Tables 3, 4 and 5 show the final DNN structures and hyperparameters.

  • The final FNN structures are shown in Table 3. Dropout rates of 0.00275/0.309/0.285/0.526 (for sex/age/pairs matching/fluid intelligence, respectively) were applied before each fully-connected layer. L2 regularization of 0.02 was applied to layer 2. Weight decay rates of 2.662e-4/2.799e-5/1.141e-6/1.425e-4 (respectively) were applied to the weights of all fully connected layers.

  • The final BrainNetCNN structures are shown in Table 4. Dropout rates of 0.463/0.573/0.264/0.776 (for sex/age/pairs matching/fluid intelligence, respectively) were applied after the E2E, E2N and N2G layers. LeakyReLU was replaced by linear activation in all four models.

  • The final GCNN structures are shown in Table 5. Dropout rates of 0.0150/0.316/0.308/0.555 (for sex/age/pairs matching/fluid intelligence, respectively) were applied before the first and second hidden layers. L2 regularization of 3.344e-4/9.181e-7/4.716e-7/7.183e-4 (respectively) was applied to layer 1. The nodes of the graph corresponded to subjects, and edges were constructed based on Pearson's correlation between subjects' vectorized RSFC. The graph thresholding was tuned separately for each behavioral or demographic measure. For pairs matching prediction, only the top five correlated edges of each node were retained. For sex, age and fluid intelligence prediction, the graph was thresholded by retaining the edges in the top 5% of correlations (across the entire graph), and the top five correlated edges of each node were also retained (even if they were not among the top 5%). The graph convolution filters for all four GCNNs were estimated using a degree-1 Chebyshev polynomial (Defferrard et al., 2016).
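As a concrete illustration of the five-model ensembling described in the first bullet of this list, the following is a minimal sketch: the same architecture is trained from five random initializations and the test predictions are averaged. Here, train_one_model is a hypothetical helper standing in for the full training loop and is not part of the released code.

```python
import numpy as np
import torch

def ensemble_predict(train_one_model, x_test, n_models=5):
    # Train n_models copies of the same architecture from different random
    # initializations, then average their test predictions.
    preds = []
    for seed in range(n_models):
        torch.manual_seed(seed)            # different random initialization
        model = train_one_model()          # hypothetical full training loop
        model.eval()
        with torch.no_grad():
            preds.append(model(x_test).cpu().numpy())
    return np.mean(preds, axis=0)          # final ensembled prediction
```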

Footnotes

1. The pairs matching task requires participants to memorize the positions of matching pairs of cards.

References

  1. Alfaro-Almagro F, Jenkinson M, Bangerter NK, Andersson JLR, Griffanti L, Douaud G, Sotiropoulos SN, Jbabdi S, Hernandez-Fernandez M, Vallee E, Vidaurre D, Webster M, McCarthy P, Rorden C, Daducci A, Alexander DC, Zhang H, Dragonu I, Matthews PM, Miller KL, Smith SM, 2018. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage 166, 400–424. 10.1016/j.neuroimage.2017.10.034
  2. Amico E, Goñi J, 2018. The quest for identifiability in human functional connectomes. Sci. Rep. 8, 1–14. 10.1038/s41598-018-25089-1
  3. Arbabshirani MR, Plis S, Sui J, Calhoun VD, 2017. Single subject prediction of brain disorders in neuroimaging: Promises and pitfalls. Neuroimage 145, 137–165. 10.1016/j.neuroimage.2016.02.079
  4. Assaf M, Jagannathan K, Calhoun VD, Miller L, Stevens MC, Sahl R, O’Boyle JG, Schultz RT, Pearlson GD, 2010. Abnormal functional connectivity of default mode sub-networks in autism spectrum disorder patients. Neuroimage 53, 247–256. 10.1016/j.neuroimage.2010.05.067
  5. Bahrami K, Shi F, Rekik I, Shen D, 2016. Convolutional Neural Network for Reconstruction of 7T-like Images from 3T MRI Using Appearance and Anatomical Features, in: MICCAI 2016 DL Workshop. pp. 39–47. 10.1007/978-3-319-46976-8_5
  6. Barch DM, Burgess GC, Harms MP, Petersen SE, Schlaggar BL, Corbetta M, Glasser MF, Curtiss S, Dixit S, Feldt C, Nolan D, Bryant E, Hartley T, Footer O, Bjork JM, Poldrack R, Smith S, Johansen-Berg H, Snyder AZ, Van Essen DC, 2013. Function in the human connectome: Task-fMRI and individual differences in behavior. Neuroimage 80, 169–189. 10.1016/j.neuroimage.2013.05.033
  7. Beckmann CF, Smith SM, 2004. Probabilistic Independent Component Analysis for Functional Magnetic Resonance Imaging. IEEE Trans. Med. Imaging 23, 137–152. 10.1109/TMI.2003.822821
  8. Bertolero MA, Yeo BTT, D’Esposito M, 2017. The diverse club. Nat. Commun. 8, 1–10. 10.1038/s41467-017-01189-w
  9. Biswal B, Yetkin FZ, Haughton VM, Hyde JS, 1995. Functional connectivity in the motor cortex of resting human brain using echo-planar MRI. Magn. Reson. Med. 34, 537–541. 10.1002/mrm.1910340409
  10. Blumberg SB, Tanno R, Kokkinos I, Alexander DC, 2018. Deeper image quality transfer: Training low-memory neural networks for 3D images. Int. Conf. Med. Image Comput. Comput. Interv. 118–125. 10.1007/978-3-030-00928-1_14
  11. Bouckaert RR, Frank E, 2004. Evaluating the Replicability of Significance Tests for Comparing Learning Algorithms. Adv. Knowl. Discov. Data Min. 3–12. 10.1007/978-3-540-24775-3
  12. Breakspear M, 2017. Dynamic models of large-scale brain activity. Nat. Neurosci. 20, 340–352. 10.1038/nn.4497
  13. Bronstein MM, Bruna J, Lecun Y, Szlam A, Vandergheynst P, 2017. Geometric Deep Learning: Going beyond Euclidean data. IEEE Signal Process. Mag. 34, 18–42. 10.1109/MSP.2017.2693418
  14. Buckner RL, Krienen FM, Yeo BTT, 2013. Opportunities and limitations of intrinsic functional connectivity MRI. Nat. Neurosci. 16, 832–837. 10.1038/nn.3423
  15. Burgess GC, Kandala S, Nolan D, Laumann TO, Power JD, Adeyemo B, Harms MP, Petersen SE, Barch DM, 2016. Evaluation of Denoising Strategies to Address Motion-Correlated Artifacts in Resting-State Functional Magnetic Resonance Imaging Data from the Human Connectome Project. Brain Connect. 6, 669–680. 10.1089/brain.2016.0435
  16. Bzdok D, Yeo BTT, 2017. Inference in the age of big data: Future perspectives on neuroscience. Neuroimage 155, 549–564. 10.1016/j.neuroimage.2017.04.061
  17. Calhoun VD, Miller R, Pearlson G, Adali T, 2014. The Chronnectome: Time-Varying Connectivity Networks as the Next Frontier in fMRI Data Discovery. Neuron 84, 262–274. 10.1016/j.neuron.2014.10.015
  18. Chekroud AM, Ward EJ, Rosenberg MD, Holmes AJ, 2016. Patterns in the human brain mosaic discriminate males from females. Proc. Natl. Acad. Sci. U. S. A. 113, E1968. 10.1073/pnas.1523888113
  19. Chelba C, Mikolov T, Schuster M, Ge Q, Brants T, Koehn P, Robinson T, 2014. One billion word benchmark for measuring progress in statistical language modeling. Proc. Annu. Conf. Int. Speech Commun. Assoc. INTERSPEECH, 2635–2639. 10.1016/j.csl.2015.07.001
  20. Choi Y, Kwon Y, Lee H, Kim BJ, Paik MC, Won J-H, 2016. Ensemble of Deep Convolutional Neural Networks for Prognosis of Ischemic Stroke, in: Crimi A, Menze B, Maier O, Reyes M, Winzeck S, Handels H (Eds.), International MICCAI Brainlesion Workshop BrainLes 2016: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. Springer International Publishing, Cham, pp. 231–243.
  21. Chollet F, 2015. Keras [WWW Document]. GitHub. URL https://github.com/fchollet/keras
  22. Cole JH, Poudel RPK, Tsagkrasoulis D, Caan MWA, Steves C, Spector TD, Montana G, 2017. Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. Neuroimage 163, 115–124. 10.1016/j.neuroimage.2017.07.059
  23. Cole JH, Ritchie SJ, Bastin ME, Valdés Hernández MC, Muñoz Maniega S, Royle N, Corley J, Pattie A, Harris SE, Zhang Q, Wray NR, Redmond P, Marioni RE, Starr JM, Cox SR, Wardlaw JM, Sharp DJ, Deary IJ, 2018. Brain age predicts mortality. Mol. Psychiatry 23, 1385–1392. 10.1038/mp.2017.62
  24. Dalca AV, Balakrishnan G, Guttag J, Sabuncu MR, 2018. Unsupervised learning for fast probabilistic diffeomorphic registration. Int. Conf. Med. Image Comput. Comput. Interv. 729–738. 10.1007/978-3-030-00928-1_82
  25. Deco G, Jirsa VK, McIntosh AR, 2011. Emerging concepts for the dynamical organization of resting-state activity in the brain. Nat. Rev. Neurosci. 12, 43–56. 10.1038/nrn2961
  26. Defferrard M, Bresson X, Vandergheynst P, 2016. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering, in: Advances in Neural Information Processing Systems. pp. 3844–3852.
  27. Di Martino A, Yan CG, Li Q, Denio E, Castellanos FX, Alaerts K, Anderson JS, Assaf M, Bookheimer SY, Dapretto M, Deen B, Delmonte S, Dinstein I, Ertl-Wagner B, Fair DA, Gallagher L, Kennedy DP, Keown CL, Keysers C, Lainhart JE, Lord C, Luna B, Menon V, Minshew NJ, Monk CS, Mueller S, Müller RA, Nebel MB, Nigg JT, O’Hearn K, Pelphrey KA, Peltier SJ, Rudie JD, Sunaert S, Thioux M, Tyszka JM, Uddin LQ, Verhoeven JS, Wenderoth N, Wiggins JL, Mostofsky SH, Milham MP, 2014. The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry 19, 659–667. 10.1038/mp.2013.78
  28. Dosenbach NUF, Nardos B, Cohen AL, Fair DA, Power JD, Church JA, Nelson SM, Wig GS, Vogel AC, Lessov-Schlaggar CN, Barnes KA, Dubis JW, Feczko E, Coalson RS, Pruett JR, Barch DM, Petersen SE, Schlaggar BL, 2010. Prediction of individual brain maturity using fMRI. Science 329, 1358–1361. 10.1126/science.1194144
  29. Dubois J, Galdi P, Han Y, Paul LK, Adolphs R, 2018. Resting-State Functional Brain Connectivity Best Predicts the Personality Dimension of Openness to Experience. Personal. Neurosci. 1, e6. 10.1017/pen.2018.8
  30. Eickenberg M, Gramfort A, Varoquaux G, Thirion B, 2017. Seeing it all: Convolutional network layers map the function of the human visual system. Neuroimage 152, 184–194. 10.1016/j.neuroimage.2016.10.001
  31. Eickhoff SB, Yeo BTT, Genon S, 2018. Imaging-based parcellations of the human brain. Nat. Rev. Neurosci. 19, 672–686. 10.1038/s41583-018-0071-7
  32. Elliott P, Peakman TC, 2008. The UK Biobank sample handling and storage protocol for the collection, processing and archiving of human blood and urine. Int. J. Epidemiol. 37, 234–244. 10.1093/ije/dym276
  33. Finn ES, Shen X, Scheinost D, Rosenberg MD, Huang J, Chun MM, Papademetris X, Constable RT, 2015. Functional connectome fingerprinting: Identifying individuals using patterns of brain connectivity. Nat. Neurosci. 18, 1664–1671. 10.1038/nn.4135
  34. Fischl B, Salat DH, Busa E, Albert M, Dieterich M, Haselgrove C, Van Der Kouwe A, Killiany R, Kennedy D, Klaveness S, Montillo A, Makris N, Rosen B, Dale AM, 2002. Whole brain segmentation: Automated labeling of neuroanatomical structures in the human brain. Neuron 33, 341–355. 10.1016/S0896-6273(02)00569-X
  35. Fox MD, Raichle ME, 2007. Spontaneous fluctuations in brain activity observed with functional magnetic resonance imaging. Nat. Rev. Neurosci. 8, 700–711. 10.1038/nrn2201
  36. Glasser MF, Coalson TS, Robinson EC, Hacker CD, Yacoub E, Ugurbil K, Andersson JL, Beckmann CF, Jenkinson M, Smith SM, Van Essen DC, 2017. A Multi-Modal Parcellation of Human Cerebral Cortex. Nature 536, 171–178. 10.1038/nature18933
  37. Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, Xu J, Jbabdi S, Webster M, Polimeni JR, Van Essen DC, Jenkinson M, 2013. The minimal preprocessing pipelines for the Human Connectome Project. Neuroimage 80, 105–124. 10.1016/j.neuroimage.2013.04.127
  38. Gordon EM, Laumann TO, Adeyemo B, Huckins JF, Kelley WM, Petersen SE, 2016. Generation and Evaluation of a Cortical Area Parcellation from Resting-State Correlations. Cereb. Cortex 26, 288–303. 10.1093/cercor/bhu239
  39. Greene AS, Gao S, Scheinost D, Constable RT, 2018. Task-induced brain state manipulation improves prediction of individual traits. Nat. Commun. 9. 10.1038/s41467-018-04920-3
  40. Griffanti L, Salimi-Khorshidi G, Beckmann CF, Auerbach EJ, Douaud G, Sexton CE, Zsoldos E, Ebmeier KP, Filippini N, Mackay CE, Moeller S, Xu J, Yacoub E, Baselli G, Ugurbil K, Miller KL, Smith SM, 2014. ICA-based artefact removal and accelerated fMRI acquisition for improved resting state network imaging. Neuroimage 95, 232–247. 10.1016/j.neuroimage.2014.03.034
  41. Havaei M, Davy A, Warde-Farley D, Biard A, Courville A, Bengio Y, Pal C, Jodoin PM, Larochelle H, 2017. Brain tumor segmentation with Deep Neural Networks. Med. Image Anal. 35, 18–31. 10.1016/j.media.2016.05.004
  42. He T, Kong R, Holmes AJ, Sabuncu MR, Eickhoff SB, Bzdok D, Feng J, Yeo BTT, 2018. Is deep learning better than kernel regression for functional connectivity prediction of fluid intelligence? 2018 Int. Work. Pattern Recognit. Neuroimaging, PRNI 2018, 6–9. 10.1109/PRNI.2018.8423958
  43. Holmes AJ, Hollinshead MO, O’Keefe TM, Petrov VI, Fariello GR, Wald LL, Fischl B, Rosen BR, Mair RW, Roffman JL, Smoller JW, Buckner RL, 2015. Brain Genomics Superstruct Project initial data release with structural, functional, and behavioral measures. Sci. Data 2, 1–16. 10.1038/sdata.2015.31
  44. Kamnitsas K, Bai W, Ferrante E, McDonagh S, Sinclair M, Pawlowski N, Rajchl M, Lee M, Kainz B, Rueckert D, Glocker B, 2017a. Ensembles of Multiple Models and Architectures for Robust Brain Tumour Segmentation, in: International MICCAI Brainlesion Workshop BrainLes 2017: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. pp. 450–462.
  45. Kamnitsas K, Ledig C, Newcombe VFJ, Simpson JP, Kane AD, Menon DK, Rueckert D, Glocker B, 2017b. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med. Image Anal. 36, 61–78. 10.1016/j.media.2016.10.004
  46. Kashyap R, Kong R, Bhattacharjee S, Li J, Zhou J, Thomas Yeo BT, 2019. Individual-specific fMRI-Subspaces improve functional connectivity prediction of behavior. Neuroimage 189, 804–812. 10.1016/j.neuroimage.2019.01.069
  47. Kawahara J, Brown CJ, Miller SP, Booth BG, Chau V, Grunau RE, Zwicker JG, Hamarneh G, 2017. BrainNetCNN: Convolutional neural networks for brain networks; towards predicting neurodevelopment. Neuroimage 146, 1038–1049. 10.1016/j.neuroimage.2016.09.046
  48. Kebets V, Holmes AJ, Orban C, Tang S, Li J, Sun N, Kong R, Poldrack RA, Yeo BTT, 2019. Somatosensory-Motor Dysconnectivity Spans Multiple Transdiagnostic Dimensions of Psychopathology. Biol. Psychiatry. 10.1016/j.biopsych.2019.06.013
  49. Kell AJE, Yamins DLK, Shook EN, Norman-Haignere SV, McDermott JH, 2018. A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy. Neuron 98, 630–644.e16. 10.1016/j.neuron.2018.03.044
  50. Khaligh-Razavi SM, Kriegeskorte N, 2014. Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation. PLoS Comput. Biol. 10. 10.1371/journal.pcbi.1003915
  51. Khosla M, Jamison K, Kuceyeski A, Sabuncu MR, 2019. Ensemble learning with 3D convolutional neural networks for functional connectome-based prediction. Neuroimage 199, 651–662. 10.1016/j.neuroimage.2019.06.012
  52. Kingma DP, Ba JL, 2015. Adam: A Method for Stochastic Optimization. Int. Conf. Learn. Represent. 2015, 1–15.
  53. Kipf TN, Welling M, 2017. Semi-Supervised Classification with Graph Convolutional Networks. Int. Conf. Learn. Represent. 1–14.
  54. Kong R, Li J, Orban C, Sabuncu MR, Liu H, Schaefer A, Sun N, Zuo X-N, Holmes AJ, Eickhoff SB, Yeo BTT, 2019. Spatial Topography of Individual-Specific Cortical Networks Predicts Human Cognition, Personality, and Emotion. Cereb. Cortex 29, 2533–2551. 10.1093/cercor/bhy123
  55. Ktena SI, Parisot S, Ferrante E, Rajchl M, Lee M, Glocker B, Rueckert D, 2018. Metric learning with spectral graph convolutions on brain connectivity networks. Neuroimage 169, 431–442. 10.1016/j.neuroimage.2017.12.052
  56. Lecun Y, Bengio Y, Hinton G, 2015. Deep learning. Nature 521, 436–444. 10.1038/nature14539
  57. Li Hongwei, Jiang G, Zhang J, Wang R, Wang Z, Zheng W, Menze B, 2018. Fully convolutional network ensembles for white matter hyperintensities segmentation in MR images. Neuroimage 183, 650–665. 10.1016/j.neuroimage.2018.07.005
  58. Li Hongming, Satterthwaite TD, Fan Y, 2018. Brain Age Prediction Based on Resting-State Functional Connectivity Patterns Using Convolutional Neural Networks, in: IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018). pp. 101–104.
  59. Li J, Kong R, Liégeois R, Orban C, Tan Y, Sun N, Holmes AJ, Sabuncu MR, Ge T, Yeo BTT, 2019. Global signal regression strengthens association between resting-state functional connectivity and behavior. Neuroimage 196, 126–141. 10.1016/j.neuroimage.2019.04.016
  60. Liégeois R, Li J, Kong R, Orban C, Van De Ville D, Ge T, Sabuncu MR, Yeo BTT, 2019. Resting brain dynamics at different timescales capture distinct aspects of human behavior. Nat. Commun. 10. 10.1038/s41467-019-10317-7
  61. Liem F, Varoquaux G, Kynast J, Beyer F, Kharabian Masouleh S, Huntenburg JM, Lampe L, Rahim M, Abraham A, Craddock RC, Riedel-Heller S, Luck T, Loeffler M, Schroeter ML, Witte AV, Villringer A, Margulies DS, 2017. Predicting brain-age from multimodal imaging data captures cognitive impairment. Neuroimage 148, 179–188. 10.1016/j.neuroimage.2016.11.005
  62. Liu D, Lin X, Ghosh D, 2007. Semiparametric Regression of Multidimensional Genetic Pathway Data: Least-Squares Kernel Machines and Linear Mixed Models. Biometrics 1079–1088. 10.1111/j.1541-0420.2007.00799.x
  63. Maas AL, Hannun AY, Ng AY, 2013. Rectifier Nonlinearities Improve Neural Network Acoustic Models. Proc. 30th Int. Conf. Mach. Learn. 28, 6.
  64. Marinescu RV, Oxtoby NP, Young AL, Bron EE, Toga AW, Weiner MW, Barkhof F, Fox NC, Klein S, Alexander DC, the EuroPOND Consortium, the Alzheimer’s Disease Neuroimaging Initiative, 2018. TADPOLE Challenge: Prediction of Longitudinal Evolution in Alzheimer’s Disease. arXiv Prepr. arXiv:1805.03909.
  65. McNemar Q, 1947. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12, 153–157. 10.1007/BF02295996
  66. Mihalik A, Brudfors M, Robu M, Ferreira FS, Lin H, Rau A, Wu T, Blumberg SB, Kanber B, Tariq M, Garcia MDME, Zor C, Nikitichev DI, Mourao-Miranda J, Oxtoby NP, 2019. ABCD Neurocognitive Prediction Challenge 2019: Predicting individual fluid intelligence scores from structural MRI using probabilistic segmentation and kernel ridge regression. arXiv Prepr. arXiv:1905.10831, 1–11.
  67. Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, Bartsch AJ, Jbabdi S, Sotiropoulos SN, Andersson JLR, Griffanti L, Douaud G, Okell TW, Weale P, Dragonu I, Garratt S, Hudson S, Collins R, Jenkinson M, Matthews PM, Smith SM, 2016. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci. 19, 1523–1536. 10.1038/nn.4393
  68. Murphy KP, 2012. Machine Learning: A Probabilistic Perspective. MIT Press.
  69. Nadeau C, Bengio Y, 2003. Inference for the generalization error. Mach. Learn. 52, 239–281. 10.1023/A:1024068626366
  70. Nguyen M, Sun N, Alexander DC, Feng J, Thomas Yeo BT, 2018. Modeling Alzheimer’s disease progression using deep recurrent neural networks. 2018 Int. Work. Pattern Recognit. Neuroimaging, PRNI 2018, 1–4. 10.1109/PRNI.2018.8423955
  71. Nie D, Trullo R, Petitjean C, Ruan S, Shen D, 2017. Medical Image Synthesis with Context-Aware Generative Adversarial Networks, in: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 417–425. 10.1007/978-3-319-66179-7_48
  72. Nielsen AN, Greene DJ, Gratton C, Dosenbach NUF, Petersen SE, Schlaggar BL, 2019. Evaluating the Prediction of Brain Maturity from Functional Connectivity after Motion Artifact Denoising. Cereb. Cortex 29, 2455–2469. 10.1093/cercor/bhy117
  73. Nostro AD, Müller VI, Varikuti DP, Pläschke RN, Hoffstaedter F, Langner R, Patil KR, Eickhoff SB, 2018. Predicting personality from network-based resting-state functional connectivity. Brain Struct. Funct. 223, 2699–2719. 10.1007/s00429-018-1651-z
  74. Parisot S, Ktena SI, Ferrante E, Lee M, Guerrero R, Glocker B, Rueckert D, 2018. Disease Prediction using Graph Convolutional Networks: Application to Autism Spectrum Disorder and Alzheimer’s Disease. Med. Image Anal. 1–26. 10.1016/j.media.2018.06.001
  75. Parisot S, Ktena SI, Ferrante E, Lee M, Moreno RG, Glocker B, Rueckert D, 2017. Spectral Graph Convolutions for Population-Based Disease Prediction, in: Descoteaux M, Maier-Hein L, Franz A, Jannin P, Collins DL, Duchesne S (Eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2017. Springer International Publishing, Cham, pp. 177–185. 10.1007/978-3-319-66179-7_21
  76. Paszke A, Chanan G, Lin Z, Gross S, Yang E, Antiga L, Devito Z, 2017. Automatic differentiation in PyTorch. Adv. Neural Inf. Process. Syst. 30, 1–4.
  77. Pinto A, Alves V, Silva CA, 2016. Brain Tumor Segmentation using Convolutional Neural Networks in MRI Images. IEEE Trans. Med. Imaging 35, 1240–1251. 10.1109/TMI.2016.2538465
  78. Plis SM, Hjelm DR, Salakhutdinov R, Allen EA, Bockholt HJ, Long JD, Johnson H, Paulsen J, Turner J, Calhoun VD, 2014. Deep learning for neuroimaging: A validation study. Front. Neurosci. 8, 1–11. 10.3389/fnins.2014.00229
  79. Power JD, Cohen AL, Nelson SM, Wig GS, Barnes KA, Church JA, Vogel AC, Laumann TO, Miezin FM, Schlaggar BL, Petersen SE, 2011. Functional Network Organization of the Human Brain. Neuron 72, 665–678. 10.1016/j.neuron.2011.09.006
  80. Preti MG, Bolton TA, Van De Ville D, 2017. The dynamic functional connectome: State-of-the-art and perspectives. Neuroimage 160, 41–54. 10.1016/j.neuroimage.2016.12.061
  81. Pruett JR, Kandala S, Hoertel S, Snyder AZ, Elison JT, Nishino T, Feczko E, Dosenbach NUF, Nardos B, Power JD, Adeyemo B, Botteron KN, McKinstry RC, Evans AC, Hazlett HC, Dager SR, Paterson S, Schultz RT, Collins DL, Fonov VS, Styner M, Gerig G, Das S, Kostopoulos P, Constantino JN, Estes AM, Petersen SE, Schlaggar BL, Piven J, 2015. Accurate age classification of 6 and 12 month-old infants based on resting-state functional connectivity magnetic resonance imaging data. Dev. Cogn. Neurosci. 12, 123–133. 10.1016/j.dcn.2015.01.003
  82. Rahim M, Thirion B, Bzdok D, Buvat I, Varoquaux G, 2017. Joint prediction of multiple scores captures better individual traits from brain images. Neuroimage 158, 145–154. 10.1016/j.neuroimage.2017.06.072
  83. Raz G, Svanera M, Singer N, Gilam G, Bleich M, Lin T, Admon R, Gonen T, Thaler A, Granot RY, Goebel R, Benini S, Valente G, 2017. Robust inter-subject audiovisual decoding in functional magnetic resonance imaging using high-dimensional regression. Neuroimage 163, 244–263. 10.1016/j.neuroimage.2017.09.032
  84. Reinen JM, Chén OY, Hutchison RM, Yeo BTT, Anderson KM, Sabuncu MR, Öngür D, Roffman JL, Smoller JW, Baker JT, Holmes AJ, 2018. The human cortex possesses a reconfigurable dynamic network architecture that is disrupted in psychosis. Nat. Commun. 9, 1–15. 10.1038/s41467-018-03462-y
  85. Rosenberg MD, Finn ES, Scheinost D, Papademetris X, Shen X, Constable RT, Chun MM, 2016. A neuromarker of sustained attention from whole-brain functional connectivity. Nat. Neurosci. 19, 165–171. 10.1038/nn.4179
  86. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L, 2015. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 115, 211–252. 10.1007/s11263-015-0816-y
  87. Salimi-Khorshidi G, Douaud G, Beckmann CF, Glasser MF, Griffanti L, Smith SM, 2014. Automatic denoising of functional MRI data: Combining independent component analysis and hierarchical fusion of classifiers. Neuroimage 90, 449–468. 10.1016/j.neuroimage.2013.11.046
  88. Schaefer A, Kong R, Gordon EM, Laumann TO, Zuo X, Holmes AJ, Eickhoff SB, Yeo BTT, 2018. Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRI. Cereb. Cortex 28, 3095–3114. 10.1093/cercor/bhx179
  89. Shen X, Tokoglu F, Papademetris X, Constable RT, 2013. Groupwise whole-brain parcellation from resting-state fMRI data for network node identification. Neuroimage 82, 403–415. 10.1016/j.neuroimage.2013.05.081
  90. Siegel JS, Mitra A, Laumann TO, Seitzman BA, Raichle M, Corbetta M, Snyder AZ, 2017. Data quality influences observed links between functional connectivity and behavior. Cereb. Cortex 27, 4492–4502. 10.1093/cercor/bhw253
  91. Smith SM, Beckmann CF, Andersson J, Auerbach EJ, Bijsterbosch J, Douaud G, Duff E, Feinberg DA, Griffanti L, Harms MP, Kelly M, Laumann T, Miller KL, Moeller S, Petersen S, Power J, Salimi-Khorshidi G, Snyder AZ, Vu AT, Woolrich MW, Xu J, Yacoub E, Uǧurbil K, Van Essen DC, Glasser MF, 2013. Resting-state fMRI in the Human Connectome Project. Neuroimage 80, 144–168. 10.1016/j.neuroimage.2013.05.039
  92. Smith SM, Fox PT, Miller KL, Glahn DC, Fox PM, Mackay CE, Filippini N, Watkins KE, Toro R, Laird AR, Beckmann CF, 2009. Correspondence of the brain’s functional architecture during activation and rest. Proc. Natl. Acad. Sci. 106, 13040–13045. 10.1073/pnas.0905267106
  93. Smith SM, Nichols TE, Vidaurre D, Winkler AM, Behrens TEJ, Glasser MF, Ugurbil K, Barch DM, Van Essen DC, Miller KL, 2015. A positive-negative mode of population covariation links brain connectivity, demographics and behavior. Nat. Neurosci. 18, 1565–1567. 10.1038/nn.4125
  94. Smyser CD, Snyder AZ, Shimony JS, Mitra A, Inder TE, Neil JJ, 2016. Resting-State Network Complexity and Magnitude Are Reduced in Prematurely Born Infants. Cereb. Cortex 26, 322–333. 10.1093/cercor/bhu251
  95. Steiger JH, 1980. Tests for comparing elements of a correlation matrix. Psychol. Bull. 87, 245–251. 10.1037/0033-2909.87.2.245
  96. Sturmfels P, Rutherford S, Angstadt M, Peterson M, Sripada C, Wiens J, 2018. A Domain Guided CNN Architecture for Predicting Age from Structural Brain Images, in: Doshi-Velez F, Fackler J, Jung K, Kale D, Ranganath R, Wallace B, Wiens J (Eds.), Proceedings of the 3rd Machine Learning for Healthcare Conference, Proceedings of Machine Learning Research. PMLR, Palo Alto, California, pp. 295–311.
  97. Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, Liu B, Matthews P, Ong G, Pell J, Silman A, Young A, Sprosen T, Peakman T, Collins R, 2015. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLoS Med. 12, 1–10. 10.1371/journal.pmed.1001779
  98. van der Burgh HK, Schmidt R, Westeneng HJ, de Reus MA, van den Berg LH, van den Heuvel MP, 2017. Deep learning predictions of survival based on MRI in amyotrophic lateral sclerosis. NeuroImage Clin. 13, 361–369. 10.1016/j.nicl.2016.10.008
  99. Van Essen DC, Smith SM, Barch DM, Behrens TEJ, Yacoub E, Ugurbil K, 2013. The WU-Minn Human Connectome Project: An overview. Neuroimage 80, 62–79. 10.1016/j.neuroimage.2013.05.041
  100. Van Essen DC, Ugurbil K, Auerbach E, Barch D, Behrens TEJ, Bucholz R, Chang A, Chen L, Corbetta M, Curtiss SW, Della Penna S, Feinberg D, Glasser MF, Harel N, Heath AC, Larson-Prior L, Marcus D, Michalareas G, Moeller S, Oostenveld R, Petersen SE, Prior F, Schlaggar BL, Smith SM, Snyder AZ, Xu J, Yacoub E, 2012. The Human Connectome Project: A data acquisition perspective. Neuroimage 62, 2222–2231. 10.1016/j.neuroimage.2012.02.018
  101. Varikuti DP, Genon S, Sotiras A, Schwender H, Hoffstaedter F, Patil KR, Jockwitz C, Caspers S, Moebus S, Amunts K, Davatzikos C, Eickhoff SB, 2018. Evaluation of non-negative matrix factorization of grey matter in age prediction. Neuroimage 173, 394–410. 10.1016/j.neuroimage.2018.03.007
  102. Vieira S, Pinaya WHL, Mechelli A, 2017. Using deep learning to investigate the neuroimaging correlates of psychiatric and neurological disorders: Methods and applications. Neurosci. Biobehav. Rev. 74, 58–75. 10.1016/j.neubiorev.2017.01.002
  103. Wachinger C, Reuter M, Klein T, 2018. DeepNAT: Deep convolutional neural network for segmenting neuroanatomy. Neuroimage 170, 434–445. 10.1016/j.neuroimage.2017.02.035
  104. Wang P, Kong R, Kong X, Liégeois R, Orban C, Deco G, Van Den Heuvel MP, Yeo BTT, 2019. Inversion of a large-scale circuit model reveals a cortical hierarchy in the dynamic resting human brain. Sci. Adv. 5, eaat7854. 10.1126/sciadv.aat7854
  105. Weis S, Patil KR, Hoffstaedter F, Nostro A, Yeo BTT, Eickhoff SB, 2019. Sex Classification by Resting State Brain Connectivity. Cereb. Cortex 1–12. 10.1093/cercor/bhz129
  106. Woo CW, Chang LJ, Lindquist MA, Wager TD, 2017. Building better biomarkers: Brain models in translational neuroimaging. Nat. Neurosci. 20, 365–377. 10.1038/nn.4478
  107. Yamins DLK, Hong H, Cadieu CF, Solomon EA, Seibert D, DiCarlo JJ, 2014. Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proc. Natl. Acad. Sci. 111, 8619–8624. 10.1073/pnas.1403112111
  108. Yang X, Kwitt R, Styner M, Niethammer M, 2017. Quicksilver: Fast predictive image registration – A deep learning approach. Neuroimage 158, 378–396. 10.1016/j.neuroimage.2017.07.008
  109. Yeo BTT, Krienen FM, Sepulcre J, Sabuncu MR, Lashkari D, Hollinshead M, Roffman JL, Smoller JW, Zollei L, Polimeni JR, Fischl B, Liu H, Buckner RL, 2011. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J. Neurophysiol. 106, 1125–1165. 10.1152/jn.00338.2011
  110. Yoo K, Rosenberg MD, Noble S, Scheinost D, Constable RT, Chun MM, 2019. Multivariate approaches improve the reliability and validity of functional connectivity and prediction of individual behaviors. Neuroimage 197, 212–223. 10.1016/j.neuroimage.2019.04.060
  111. Zhang X, He L, Chen K, Luo Y, Zhou J, Wang F, 2018. Multi-View Graph Convolutional Network and Its Applications on Neuroimage Analysis for Parkinson’s Disease. arXiv Prepr. arXiv:1805.08801.
  112. Zhao G, Liu F, Oler JA, Meyerand ME, Kalin NH, Birn RM, 2018. Bayesian convolutional neural network based MRI brain extraction on nonhuman primates. Neuroimage 175, 32–44. 10.1016/j.neuroimage.2018.03.065
  113. Zhao X, Wu Y, Song G, Li Z, Zhang Y, Fan Y, 2018. A deep learning model integrating FCNNs and CRFs for brain tumor segmentation. Med. Image Anal. 43, 98–111. 10.1016/j.media.2017.10.002
  114. Zhu X, Thung K-H, Adeli E, Zhang Y, Shen D, 2017. Maximum Mean Discrepancy Based Multiple Kernel Learning for Incomplete Multimodality Neuroimaging Data, in: Descoteaux M, Maier-Hein L, Franz A, Jannin P, Collins DL, Duchesne S (Eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2017. Springer International Publishing, Cham, pp. 72–80.

Associated Data


Supplementary Materials


Data Availability Statement

This study utilized publicly available data from the HCP (https://www.humanconnectome.org/) and UK Biobank (https://www.ukbiobank.ac.uk/). The 400 cortical ROIs (Schaefer et al., 2018) can be found here (https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/brain_parcellation/Schaefer2018_LocalGlobal). The kernel regression and DNNs code utilized in this study can be found here (https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/predict_phenotypes/He2019_KRDNN). The trained models for the UK Biobank dataset can also be found in the above GitHub link. The code was reviewed by one of the co-authors (MN) before merging into the GitHub repository to reduce the chance of coding errors.
