Abstract
In the literature, there are substantial machine learning attempts to classify schizophrenia based on alterations in resting-state (RS) brain patterns using functional magnetic resonance imaging (fMRI). Most earlier studies modelled patients undergoing treatment, entailing confounding with drug effects on brain activity, and making them less applicable to real-world diagnosis at the point of first medical contact. Further, most studies with classification accuracies >80% are based on small sample datasets, which may be insufficient to capture the heterogeneity of schizophrenia, limiting generalization to unseen cases. In this study, we used RS fMRI data collected from a cohort of antipsychotic drug treatment-naive patients meeting DSM IV criteria for schizophrenia (N = 81) as well as age- and sex-matched healthy controls (N = 93). We present an ensemble model -- EMPaSchiz (read as ‘Emphasis’; standing for ‘Ensemble algorithm with Multiple Parcellations for Schizophrenia prediction’) that stacks predictions from several ‘single-source’ models, each based on features of regional activity and functional connectivity, over a range of different a priori parcellation schemes. EMPaSchiz yielded a classification accuracy of 87% (vs. chance accuracy of 53%), which out-performs earlier machine learning models built for diagnosing schizophrenia using RS fMRI measures modelled on large samples (N > 100). To our knowledge, EMPaSchiz is first to be reported that has been trained and validated exclusively on data from drug-naive patients diagnosed with schizophrenia. The method relies on a single modality of MRI acquisition and can be readily scaled-up without needing to rebuild parcellation maps from incoming training images.
Introduction
Despite decades of research, there are no precise and reliable etiopathophysiological markers for major psychiatric conditions.1 Impeding factors range from inherent challenges in studying complex genetic disorders2 to weakly established neural bases for cognition, experience and behaviour.3,4 However, a part of the problem is a mismatch between current diagnostic standards for psychiatric illnesses and observations emerging from basic systems and behavioural neuroscience research.5 Recognized biological heterogeneity, also adds to the difficulty of identifying reliable biological markers associated with these conditions.6 Treatments for psychiatric disorders have emerged largely as a result of serendipitous observations7 with an unfortunate range of side-effects8 and this may be why mortality and prevalence rates associated with psychiatric illnesses have not decreased in past years,9 as compared to other medical conditions such as certain types of cancer10 or heart diseases.11
In particular, the underlying pathophysiology of schizophrenia, a severe and debilitating psychotic illness, still remains elusive, with few established consistent findings.12 Currently objectively measurable diagnostic tests for schizophrenia13 are lacking, and the reliability of diagnoses based on observable signs and symptoms leaves room for improvement.5 Further, there is marked heterogeneity within clinical manifestations of ‘schizophrenia’ as well as considerable overlap with other psychiatric diagnoses, leading many to question the validity of a singular disease entity.14
In this context, applying machine learning techniques to MRI data has the potential to provide an objective and evidence-based approach for identification and management of schizophrenia.15,16 Machine-learned MRI models have the potential to identify biological markers and delineate symptom clusters. Recently, an increasing number of studies have attempted to classify schizophrenia (vs. healthy controls) based on functional alterations in resting-state brain patterns (Table 1, see supplementary materials for more description of these studies).
Table 1.
Study | Year | Total: Size of classes | Accuracy |
---|---|---|---|
Shen et al.83 | 2010 | 52: 32 SCZ, 20 HC | 86.50% |
Fan et al.84 | 2011 | 62: 31 SCZ, 31 HC | 87.1%a |
Yu et al.85 | 2013 | 89: 32 SCZ, 38 HC (+19 MDD) | 80.9% |
Anderson and Cohen86 | 2013 | 146: 74 SCZ, 72 HC (COBRE dataset) | 65% |
Arbabshirani et al.87 | 2013 | 56: 28 SCZ, 28 HC | 96%a |
Yu et al.88 | 2013 | 71: 24 SCZ, 25 healthy siblings of SCZ, 22 HC | 62% |
Guo et al.89 | 2014 | 131: 69 SCZ, 62 HC | 80% |
Brodersen et al.90 | 2014 | 83: 41 SCZ, 42 HC | 78%a |
Anticevic et al.91 | 2014 | 180: 90 SCZ, 90 HC | 73.9% |
Watanabe et al.92 | 2014 | 123: 54 SCZ, 67 HC | 73.50% |
Chyzhyk et al.93 | 2015 | 54: 26 SCZ with history of AH, 14 SCZ without a history of AH, 28 HC | 97.1%a |
Cheng et al.94 | 2015 | 48: 19 SCZ, 29 HC | 79% |
Peters et al.95 | 2016 | 36: 18 SCZ, 18 HC | 91%a |
Mikolas et al.96 | 2016 | 126: 63 SCZ with FE SCZ, 63 HC | 73% |
Cabral et al.34 | 2016 | 132: 66 SCZ, 66 HC (COBRE dataset) | 70.5% |
Yang et al.97 | 2016 | 86: 40 SCZ, 46 HC | 77.91% |
Iwabuchi and Palaniyappan98 | 2017 | 133: 62 SCZ, 71 HC | 78.04%a |
Lottman et al.99 | 2017 | 69: 34 unmedicated (17 drug-naive) SCZ + follow-up post treatment, 35 HC | 83.8%a |
Guo et al.100 | 2017 | 68: 28 FE drug-naive SCZ, 28 family-based controls, 40 HC | 92.86%a |
SCZ Schizophrenia, HC Healthy controls, AH Auditory hallucinations, FE First episode, MDD Major depression
aAccuracy of best model among several reported models
Most earlier studies assessed patients already undergoing treatment, which means their fMRI scans were confounded with antipsychotic drug effects17 – hence, those scans did not correspond to the point of first medical contact, and so may not lead to optimal diagnostic models. Further, diagnostic models obtained from larger datasets (more than 100 subjects) have classification accuracies well below 80% (Fig. 1). Many have observed this phenomenon: “smaller-N studies reach higher prediction accuracy of schizophrenia with neuroimaging data”.18 Even with higher cross-validated accuracy, the smaller samples likely do not capture the heterogeneity of the disease, which suggests that these models will not generalize well to unseen cases.
Many of these studies first parcellate the whole brain resting-state information into spatial regions that are considered homogeneous. However, with the increasing number of parcellation methods and atlases now available, the choice of which parcellation to use seems rather arbitrary. These methods can vary widely in principle and can be based on (a) pre-defined ontology of brain structures such as post-mortem cytoarchitecture,19,20 sulco-gyral anatomy,21,22 anatomical connectivity using diffusion imaging23,24 or (b) data-driven modelling of the functional features in the BOLD signal from resting-state25 or task-based fMRI26,27 or even meta-analyses28,29 using analytical techniques such as hierarchical clustering30 or independent components analysis.31 The quality of the brain network obtained and the downstream predictive model may be largely influenced by the selection of the atlas or parcellation used.32,33 Brain segmentations based on these parcellation schemes not only provide a way to reduce the dimensionality of fMRI data but can also provide an elegant way to incorporate prior neurobiological knowledge to ‘refine’ the features. However, to date, there has been no investigation on whether combined learning from multiple predefined parcellation schemes can provide better performance for diagnostic prediction of schizophrenia.
In this study, we eliminated the potential confound of antipsychotic treatment by using resting state fMRI data collected from a cohort of antipsychotic-naive schizophrenia patients (N = 81) as well as age- and gender-matched healthy controls (N = 93). The aim of our study was to improve accuracy for diagnostic prediction, compared to results reported in the literature, by designing a feature creation and learning pipeline that incorporates prior knowledge of neuroanatomy and neurophysiology. Our overall model involves stacking predictions from several single-source models, each based on the specific set of features related to regional fMRI activity and functional connectivity, and a specific a priori parcellation scheme. We demonstrate that our ensemble model yields a classification accuracy of 87% (vs. 53% chance), which is better than any standard single-source model considered in the study. To the best of our knowledge, (1) the performance of our model, based on 174 subjects, outscores earlier machine learning models built for diagnosing schizophrenia using resting-state fMRI measures that have been learned from datasets of N > 100 subjects; and (2) this is the only such classification model that has been built and validated exclusively on never-treated schizophrenia cases.
Our method relies on a single modality of data acquisition for neuroimaging and is easily scalable as it uses a set of pre-defined atlases—i.e., it does not rely on data-driven brain parcellation methods, such as group-independent component analysis.
Results
We show below that (a) our EMPaSchiz ensemble learner, which learns a combination of learned classifiers, each trained on its own neuroimaging feature extractions and brain parcellation schemes, produces a classifier that can predict schizophrenia more accurately than any of the individual predictors (that used just a single feature/parcellation combination). (b) Within this ensemble prediction framework, even a very small fraction of features (as low as top 0.5% selected via univariate tests) can still provide high prediction accuracy (>80%). (c) This learning framework can also produce models that can distinguish clinically symptomatic versus non-symptomatic patients, with moderate accuracy.
Table 2 presents the 5 × 10-fold cross-validation prediction performance of the various learners in EMPaSchiz. Majority class baseline accuracy for schizophrenia prediction (declaring every subject to be control) was 53.4% (93 controls of 174 total subjects). These accuracy values are plotted in Fig. 2. Stacked models with neuroimaging features that are regional—viz., ALFF, fALFF and ReHO—had accuracies in the range of 74 to 76%, while the ones based on functional connectivity—viz., FC-Correlation, FC-partial correlation, FC-precision—showed better performance with 79 to 84% accuracy. The final ensemble model EMPaSchiz (stacked-multi) showed the best performance with accuracy of 87%, sensitivity of 80%, specificity of 93% and precision of 92%, each with standard errors of 1–2%. This accuracy of stacked-multi was significantly better than second best stacked model (stacked-FC-precision at 84%, t-test, p = 0.03).
Table 2.
Accuracy | Precision | Sensitivity | Specificity | True positive | True negative | False positive | False negative | |
---|---|---|---|---|---|---|---|---|
Stacked-multi | 86.9 (1.1) | 91.9 (1.4) | 79.8 (1.8) | 93.1 (1.2) | 65.0 (1.4) | 86.8 (1.2) | 6.2 (1.1) | 16.0 (1.4) |
Stacked-ALFF | 76.4 (1.4) | 76.3 (1.8) | 73.9 (2.2) | 78.7 (1.9) | 59.8 (1.7) | 73.0 (1.7) | 20.0 (1.9) | 21.2 (1.8) |
Stacked-ReHo | 74.1 (1.6) | 73.4 (2.0) | 74.6 (2.0) | 73.6 (2.5) | 60.4 (1.6) | 68.2 (2.3) | 24.8 (2.5) | 20.6 (1.6) |
Stacked-fALFF | 74.5 (1.5) | 73.8 (1.7) | 72.2 (1.8) | 76.6 (1.9) | 58.6 (1.6) | 72.0 (1.7) | 21.0 (1.7) | 22.4 (1.7) |
Stacked-FC-correlation | 82.4 (1.3) | 83.9 (1.9) | 79.7 (1.8) | 84.7 (2.0) | 64.6 (1.5) | 78.8 (2.0) | 14.2 (1.9) | 16.4 (1.4) |
Stacked-FC-partial correlation | 78.5 (1.4) | 93.7 (1.5) | 58.2 (2.8) | 96.2 (0.9) | 46.8 (2.4) | 89.8 (1.0) | 3.2 (0.8) | 34.2 (2.3) |
Stacked-FC-precision | 83.7 (1.2) | 90.2 (1.6) | 73.8 (2.0) | 92.3 (1.3) | 60.0 (1.9) | 86.8 (1.3) | 6.2 (1.2) | 21.0 (1.8) |
Baselinea | 51.2 (0.3) | 47.0 (0.5) | 40.7 (0.6) | 60.2 (0.5) | 33.0 (0.4) | 56.0 (0.5) | 37.0 (0.5) | 48.0 (0.5) |
aBaseline results are based on permutation test over the randomly shuffled labels (based on 100 repetitions of entire ‘learning with subsequent 10-fold CV evaluations’)
Figure 3 shows a comparative profile of accuracies for various SSM predictors along with EMPaSchiz stacked models. (Supplementary material provides results in tabular format as well as plots of comparisons limited to specific feature types. It also provides results for various ensemble learners that were stacked parcellation-wise.) Prediction accuracies for SSM ranged from 52% (FC-precision with harvard_sub_25) to 83% (FC-precision with basc_multiscale_444) and averaged overall at 73%. In general, basc_multiscale atlases showed better performance than the others. For instance, accuracies of EMPaSchiz stacked models were comparable to basc_multiscale_197 models for FC-correlation at 82% and for FC-partial correlation at 79%.
We examined the effect of feature selection using top-r percentage of total features based on a univariate test, of r percentile of the highest F-value scores, for r = 0.5%, 1%, 2%, 5%, 10%, 20%, 30%, as well as “all regional features +30% connectivity features” (we chose this combination as, for any given parcellation, the number of regional features was much less than that of connectivity features), and all features (no feature selection). Note that each “setting” is applied to all 84 SSMs. Figure 4 shows the comparative profile of model performances with varying levels of r for top-r percentage of features, along with original EMPaSchiz (stacked-multi) model where feature reduction was done using PCA. (Supplementary materials provide results in tabular format as well as additional plots of comparisons of feature selection methods for SSM and MSM models.) Using all features (r = 100%, i.e., no selection/reduction) showed accuracy of 85% (which was slightly poorer than PCA reduced features at 87% but was not a statistically significant difference) and accuracy declined only slightly when r was reduced gradually to as low as 0.5. It is noteworthy that with only 0.5% of top features, our ensemble prediction framework still showed a high prediction accuracy of 82%.
Patients with schizophrenia in our sample showed a range of psychopathological symptom severity, as measured using the clinical scales SANS for negative symptoms (integer values from 0 to 110) and SAPS for positive symptoms (integer values from 8 to 55). We used the first and last quartile of these scales to categorize the 20 least, and the 20 most, severely symptomatic patients. We then used our ensemble prediction framework in leave-one-out cross-validation setup to predict the high-symptomatic patients against non/low-symptomatic ones (majority class baseline accuracy of 50%). We used leave-one-out cross-validation (rather than 10-fold) to deal with low number of subjects (N = 40) that were available for this analysis. Prediction accuracy for stacked-multi model was 73.2% for SANS and 61.9% for SAPS of schizophrenia psychopathology.
To identify some of the key pathological alterations in our schizophrenia sample, we estimated the reliability of a feature’s importance for diagnostic prediction, similar to the approach used by an earlier neuroimaging study34 – sorting the features by their respective mean logistic regression weight divided by its standard error for each feature in a particular learned SSM generated during 50 folds of cross-validation. (This was performed with raw ROI data, without any PCA transformations.) Fig. 5 (respectively Fig. 6) highlight some of the top-most ( > 98 or 99th percentile) reliable features using representative atlases for regional resting state measures (respective connectivity).35 However, given the complexity of our ensemble model (which recall is based on 84 SSM), these depictions should be considered just representative in nature, and cannot be claimed as the ‘only’ important features in the model.
The pattern of functional connectivity changes (Fig. 6) indicates robust hypo-connectivity between the frontoparietal network (such as post parietal) and the sensorimotor network (such as frontal, parietal, precentral gyrus) with widespread hypo-connectivity in language (e.g.: Broca), attention (e.g.: frontal pole, parietal) and default mode network (e.g.: angular, fusiform gyrus). On other hand, the auditory network as well as the anterior insula, which is implicated in high-level cognitive control, attentional processes and saliency,36 show hyper-connectivity. Similarly, the overall picture (Fig. 5) shows increased regional low frequency activity in the superior temporal gyrus and basal ganglia structures - caudate, putamen, and reduced regional activity in cingulum.
Discussion
This study aimed to build a machine learned classifier for diagnosing schizophrenia that depends on a single neuroimaging modality of acquisition - resting state fMRI. Resting state fMRI is a popular imaging method and possibly better than task-based fMRI, since the latter depends on experimental parameters that require standardization. Further, resting state fMRI is not limited by participants’ attention or cognitive ability to perform a task and hence is applicable to patients with more pronounced disabilities.37
Several recent studies have built diagnostic models using data from patients receiving antipsychotic drug treatment (see Table 1). However, antipsychotics are known to affect brain activity and function,38,39 and a recent study cautions against the practice of interpreting brain changes in a medicated state, noting it might not be related to the pure pathology of schizophrenia.17 We developed the model presented in this study on a sample of never-treated schizophrenia patients, to make our results directly apply to realistic clinical scenarios of diagnosis at first clinical presentation. Further work will be necessary to examine how this may generalize to medicated patients, as well as other confounds, such as multi-site batch effects, remains to be examined.40 It is notable, however, that non-medicated patients are an important group for analysis and represent, perhaps, the most difficult sample for recruitment. In this way our study provides a very important sample to demonstrate the value of our approach.
With respect to diagnostic accuracy of schizophrenia, Schnack and others have observed that smaller sample studies may reach high prediction accuracy at the cost of lower generalizability to external samples -- an effect attributed to clinical heterogeneity, physiological variation, sampling noise and errors in diagnosis.18 In our outline of recent literature on machine learning studies with resting-state fMRI (see the Introduction section), we also observed this relation (see Fig. 1). Nevertheless, our ensemble model outscores earlier models built for diagnosing schizophrenia using resting state fMRI measures, even though it was learned from a large sample. We believe this may be because our feature creation process incorporates prior rich neurobiological knowledge with simultaneous use of regional and connectivity measures that are jointly extracted over various biologically-informed brain atlas schemes. We demonstrate that if we employ standard machine learning pipelines (called SSM here) on this dataset of untreated patients, we obtain a level of performance ( < 80% accuracy) that is similar to the results reported widely in earlier studies with comparable sample sizes. Hence, these drug-naive cases are unlikely to be ‘easier’ to model than standard treated cases. Our results provide encouraging progress toward deploying automated or semi-automated diagnostic systems based on neuroimaging and predictive models in psychiatric clinics. However, the performance of our model is favoured by the fact that the entire sample in this study comes from a single site, meaning it does not need to deal with the challenges of cross-site generalizability and site-specific effects. Future clinical studies with larger cohorts, preferably from multiple clinical sites, would be necessary to justify clinical deployment.
Our EMPaSchiz model used brain parcellations that were based on prior knowledge of anatomy / cytoarchitecture or statistical maps extracted from correlation structure in fMRI data collected and analysed in earlier studies. Hence, these maps might not perfectly adapt to signals in the individual subject images – which might not be an issue for data-driven parcellation or clustering techniques. Our study neither explored that option, nor compared model performance empirically, with features obtained with these two alternative methodologies. However, use of pre-existing parcellations reduces chances of overfitting, and possibly increases the robustness of the resulting model. Note also that these a priori ROIs incorporate nicely biological knowledge of fMRI data into the feature creation process, which can help interpretation of results, and provide an effective way to reduce dimensionality. Our model may be readily scaled-up with relatively little computation, as it does not need to build parcellation maps from incoming training images.
It is often challenging to provide a biological interpretation of complex machine learning models, as the goal of the learning process is to find a model that maximizes prediction performance, which may require (possibly non-linear) combinations of thousands of features. In this study, we produced an effective classifier by seeking the coefficients for the features that collectively optimize the predictive accuracy. In general, such coefficients need not correspond to the inherent correlation of each individual feature with the outcome. This is especially true in our approach of using multiple parcellations of the brain, as this means the “features” will overlap to a large degree. This can be seen as potential limitation for the interpretation of our model. We provide only a snapshot of some representative changes in patient’s brain, showing only the most reliable resting state features; features that, alone, may be neither necessary nor sufficient to obtain the prediction performance of the reported ensemble model. However, several of these brain networks and regions were observed to be altered consistently in schizophrenia.41–43
Functional connectivity aberrations observed in our study are consistent with the dysconnectivity hypothesis of schizophrenia.44 This theoretical framework describes schizophrenia as a dysconnection syndrome linking aberrations at the level of synapse with the abnormalities in the long-range connectivity of several brain networks.45 A vital component of the dysconnectivity hypothesis is proposed aberrant connectivity between prefrontal cortex and other brain regions, which is posited to give rise to key symptoms such as delusions and hallucinations.46 A systematic review of fMRI studies on functional connectivity supports reduction in brain region connectivity in subjects with schizophrenia, especially reductions involving prefrontal cortex,47 in agreement with our observations. Our findings of concurrent hyper-connectivity among some regions is also consistent with earlier reports of increased functional connectivity in schizophrenia.48 Another core postulate of the dysconnectivity hypothesis is that modulation of synaptic efficacy with resultant fronto-temporo-parietal aberrations leads to hallucinations / delusions in schizophrenia.49 The hypothesized synaptic efficacy aberrations may be linked to NMDA receptor abnormalities.49 In this context it is of interest that effects on temporoparietal-prefrontal circuitry through transcranial Direct Current Stimulation (possibly via NMDA-dependent mechanisms50) has been shown to ameliorate severity of auditory hallucinations,51,52 possibly through “correction” of functional dysconnectivity.53 It is likely that further systematic application of machine learning techniques to analysis of brain connectivity may be useful for developing prognostic markers for schizophrenia that might predict differential responses to clinical interventions.
A general conceptual limitation of machine learning studies in psychiatry is that the diagnostic labels might themselves be ill defined. Amidst an ever-expanding volume of research data, inconsistencies in neurobiological findings fuel doubts about the validity of the currently defined disease construct of schizophrenia. This might be an issue inherent in psychiatric practice, which contributes to low reliability of diagnosis with nosology such as the DSM criteria. The work reported here may indicate a useful step towards more biological informed diagnoses, as it involves developing algorithms to predict current psychiatric diagnoses based on objective neurobiological features. This approach could also provide us with a framework for evaluating the validity of clinical diagnoses. Lastly, our empirical results show that multi-parcellation ensemble learning models may effectively learn models for early diagnosis of schizophrenia; we anticipate that this approach may work for other psychoses, and for prediction of treatment responses.
Methods
Subjects
This study examined 92 patients attending the clinical services of the National Institute of Mental Health & Neurosciences (NIMHANS, India), who fulfilled DSM-IV criteria for schizophrenia and were never treated with any psychotropic medications including antipsychotics. The diagnosis of schizophrenia was established using the Mini International Neuropsychiatric Interview (MINI) Plus,54 which was confirmed by another psychiatrist through an independent clinical interview. The details related to illness onset and antipsychotic-naive status were carefully ascertained by reliable information obtained from at least one additional adult relative. The Scale for Assessment of Positive Symptoms (SAPS) and Scale for Assessment of Negative Symptoms (SANS) were used to measure psychotic symptoms.55 Clinical assessments and MRI were performed on the day before starting antipsychotics.
Controls were recruited from among the consenting healthy volunteers from the same locale to match for age and sex. We used 102 age- and sex-matched healthy volunteers, who were screened to rule out any psychiatric diagnosis using the MINI as well as a comprehensive mental status examination. For both cases and controls, we recruited only right-handed subjects to avoid the potential confounds of differential handedness. None of the study subjects had contraindications to MRI or medical illness that could significantly influence CNS function or structure, such as seizure disorder, cerebral palsy, or history suggestive of delayed developmental milestones. There was no history suggestive of DSM-IV psychoactive substance dependence or of head injury associated with loss of consciousness longer than 10 min. No subjects had abnormal movements as assessed by the Abnormal Involuntary Movements Scale. Pregnant or postpartum females were not included. The supplementary material provides a table with details of demographic and clinical profile of 174 subjects who qualified to be included in the study. (See details on excessive head movement in the ‘Image pre-processing’ section)
The catchment area for the subject recruitment involved the southern states of India. We obtained informed written consent after providing a complete description of the study to all the subjects. The NIMHANS ethics committee reviewed and approved the original research protocol. The Research Ethics Board at University of Alberta, Edmonton approved the secondary analysis of archived data.
Image acquisition
Magnetic Resonance Imaging (MRI) was done in a 3.0 Tesla scanner (Magnetom Skyra, Siemens). Resting State Functional MRI: BOLD (Blood Oxygen Level Dependent) sensitive echo-planar imaging was obtained using a 32-channel coil for a duration of 5 minutes 14 s, yielding 153 dynamic scans. The scan parameters were: TR = 2000ms; TE = 30ms; flip angle = 78°; Slice thickness = 3 mm; Slice order: Descending; Slice number = 37; Gap = 25%; Matrix = 64 × 64 × 64 mm3, FOV = 192 × 192, voxel size = 3.0 mm isotropic. Subjects were asked to keep their eyes open during the scan. For intra-subject co-registration, structural MRI: T1-weighted three-dimensional high-resolution MRI was performed (TR = 8.1 msec, TE = 3.7ms, nutation angle = 8°, FOV = 256 mm, slice thickness = 1 mm without inter-slice gap, NEX = 1, matrix = 256 × 256) yielding 165 sagittal slices.
Image pre-processing
We performed pre-processing and feature extraction using MATLAB (The MathWorks, Inc) toolboxes including Statistical parametric mapping (SPM8, http://www.fil.ion.ucl.ac.uk/spm), Data Processing Assistant for Resting-State fMRI (DPARSF)56 as well as Python toolboxes including the nilearn package57 based on scikit-learn, a Python machine learning library.58 We checked acquired images visually for artefacts such as incomplete brain coverage or ghosting; then re-orientated the origin to the anterior commissure in structural MRI and fMRI images. The first ten volumes of each functional time-series were discarded as they were before the time required for the scanner field to reach steady magnetization, and for the participants to adapt to scanning noise. Images were then pre-processed with slice-timing correction, image realignment to correct for motion, and intensity normalization. Since head movement may lead to group-related differences,59–61 we excluded images for 11 patients and 9 controls from the study based on excessive head movement (translational > 2.0 mm and/or rotational > 2°).62 This yielded a total of 174 subjects: 93 controls and 81 patients. Functional images were co-registered with the structural image and then normalized to MNI space resampled to 3×3×3 mm3. Nuisance regression was performed to remove noise in the signal induced by head motion using 24 regressors derived from the parameters estimated during motion realignment, scanner drift using a linear term, as well as global fMRI signals from white matter and cerebrospinal fluid segments using SPM’s new segment method.63 Normalized images were smoothed, detrended and band-pass filtered as appropriate—depending on the feature to be extracted, see details below.
Feature extraction
To obtain neurobiologically relevant features, we projected each resting brain information into 14 different parcellations, each based on a specific a priori defined atlas or set of regions of interest (ROIs). Our goal here was to jointly learn from this entire set of neuroimaging features extracted through several brain parcellation schemes to obtain an accurate model; n.b., we are neither trying to compare nor evaluate the influence of any single feature type or ROI definition on prediction accuracy. Our goal is to produce a predictive model whose validation is only its predictive accuracy.
We used the following 14 pre-defined brain parcellation schemes:
yeo: intrinsic functional connectivity of cerebral cortex25
smith20, smith70: functional networks during activation and rest (at two different resolutions)26
harvard_cort_25, harvard_sub_25: Harvard-Oxford cortical and subcortical parcellation (http://www.cma.mgh.harvard.edu/fsl_atlas.html)
msdl: multi-subject dictionary learning for functional parcellation64
aal: macroscopic anatomical parcellation of single-subject brain65
basc_multiscale_122, basc_multiscale_197, basc_multiscale_325 and basc_multiscale_444: multi-level bootstrap analysis of stable clusters in resting-state fMRI, at four different resolutions66
destrieux: sulcal depth-based anatomical parcellation of the cerebral cortex67
dosenbach: multivariate pattern analysis of functional connectivity28
power: graph measures of functional brain organization68
For each of these 14 parcellation schemes, we extracted 3 regional-based and 3 connectivity-based resting brain fMRI features. For regional features, we used:
ALFF: amplitude of frequency fluctuations
fALFF: fractional ALFF
ReHo: regional homogeneity
We smoothed each functional image using a 4 mm FWHM gaussian kernel (except for extraction of ReHo - to avoid overestimation of spatial homogeneity) and band-pass-filtered fMRI time-courses at 0.01–0.08 Hz to capture slow fluctuations that are believed to reflect spontaneous brain activity.69,70 ALFF was calculated as total power within the frequency range between 0.01 and 0.08 Hz to estimate the strength of low frequency oscillations.71 fALFF was calculated as power within the low-frequency range (0.01–0.08 Hz) divided by the total power in the entire detectable frequency range.69 Lastly, ReHo was calculated using Kendall’s coefficient of concordance,72 as a measure of the similarity between the time series of a given voxel and its nearest neighbours.73
We calculated each of these features at the voxel level using the DPARSF toolbox, standardized and then averaged over an ROI. For each ROI, we ran a nuisance regression across the features to remove the effects of confounding variables that are generally recommended and commonly reported in neuroimaging research—age, sex, and total intracranial volume.74 In addition, we also used average framewise displacement to (at least partially) counter systematic yet spurious correlations in functional connectivity that may arise from subject motion.59
We also computed connectivity features with each of the 14 parcellations, by extracting average time series per ROI and then estimating functional connectivity matrices between each pair of regions using one of three statistical measures
Pearson correlation
partial correlation
precision
In each case, the feature vectors were the flattened lower triangular part of these symmetric matrices.
We chose to study the above features as earlier literature established their relevance to schizophrenia pathology. Abnormalities in low-frequency oscillations70,75 and regional homogeneity of blood-oxygen-level-dependent signals76,77 have been well documented in schizophrenia. Further, patients diagnosed with schizophrenia have exhibited changes in functional brain connectivity, as revealed through distant correlations.77,78 In addition to simple Pearson correlation, we described the connectivity structure using partial correlation, which measures the interactions between two ROIs. We use a sparse precision matrix—i.e., the sparse inverse of the covariance matrix—which reveals the brain regions that appear conditionally independent given all other brain regions.79
So, in total, our approach ‘Ensemble algorithm with Multiple Parcellations for Schizophrenia prediction’, abbreviated as: EMPaSchiz (read as ‘Emphasis’) – modelled 84 sources of data (14 parcellation schemes×(3 + 3) feature types) per subject; these descriptions ranged in size from 17 to 98,346 values. We used appropriate masker classes57 to summarize brain signals from non-overlapping clusters (e.g.: basc_multiscale) or overlapping networks (e.g., smith) or spheres centred at seeds with fixed small radius (e.g.: power). Table 3 presents the total number of features per data source. (The supplementary material presents visualizations of a few representative parcellations, overlaid over an MRI slice.)
Table 3.
Parcellation | Regional | Connectivity |
---|---|---|
yeo | 17 | 136 |
smith20 | 20 | 190 |
harvard_sub_25 | 22 | 231 |
msdl | 39 | 741 |
smith70 | 70 | 2415 |
harvard_cort_25 | 96 | 4560 |
aal | 116 | 6670 |
basc_multiscale_122 | 122 | 7381 |
destrieux | 148 | 10,878 |
dosenbach | 160 | 12,720 |
basc_multiscale_197 | 197 | 19,306 |
power | 264 | 34,716 |
basc_multiscale_325 | 325 | 52,650 |
basc_multiscale_444 | 444 | 98,346 |
Prediction and evaluation framework
EMPaSchiz produced a classifier from our multi-source data, in two levels. For the first level, EMPaSchiz trained 84 different L2-regularized logistic regression classifiers, using the ‘liblinear’ solver80 – one for each individual data source to predict the diagnosis; we consider each to be a single-source model (SSM). For the second level, EMPaSchiz then trained a single L2-regularized logistic regression model to take the prediction probabilities computed by each SSM, to predict the schizophrenia-vs-normal label; hence, this is a multi-source model (MSM). Figures 7 and 8 show schematic representations of our prediction and evaluation framework. These computations were performed using the scikit-learn package36 and mlxtend extensions.81
Figure 7a shows performance of learned EMPaSchiz-Performance model. Given a resting state fMRI time series for a subject, the EMPaSchiz-Performance first extracts 6 different feature types (F1 to F6; coded here with different fill colours) over each of 14 brain parcellation schemes (P1 to P14; coded here with border colour) to obtain 84 feature sets (FS1,1 to FS6,14). Each is given to a “single-source model” (SSM), which is a learned logistic regression (LR) classifier of the PCA-projection of that data with learned parameter θi,j (i.e, θ1,1 to θ6,14 each correspond to a specific feature set) trained to predict schizophrenia. This produces a vector of the resulting 84 prediction probability values (P1,1 to P6,14)—one from each LR—which is given to a final trained LR classifier with learned parameter θ*,*. The final prediction probability P*,* is used to predict whether the given subject is “schizophrenia” or “normal”. We also considered 6 other multi-source models, with learned parameters θ1,* to θ6,*—one for each feature type.
Figure 7b, c shows the process for learning the EMPaSchiz-Performance model. The EMPaSchiz-Learner first learns 84 different single-source models SSMi,j: For the ith feature type (i = 1..6) and the jth parcellation (j = 1..14), EMPaSchiz-Learner computes the (i,j)-feature set for the resting state fMRI time series for each of the K labelled subjects in training set, to obtain the feature sets FS*i,j = { FSk i,j } over k = 1..K. It then trains a regularized logistic regression (LR) model θi,j to predict schizophrenia, from each feature set FS * i,j, where the regularization strength C is obtained using internal CV. For example, θ3,12 is learned by fitting LR on FS*3,12 (which corresponds to the 3rd feature type: ReHo with the 12th parcellation: destrieux). After learning all the 84 SSM parameters {θi,j} in this manner, EMPaSchiz-Learner as shown in Fig. 7c, then runs each of these 84 resulting SSMs on each of the K training instances; this produces a new training set P = {P k i,j }, where P ki,j is the probability produced by running the (i,j)-th SSM predictor, with learned parameter θi,j, on the k-th instance. It then learns the multi-source model (MSM) by training the regularized logistic regression (LR) on the set P to predict schizophrenia. This produces the parameter θ*,*. Similarly, six other MSMs θ1,* to θ6,* are learned by training LR with each set P *1,j = {P k 1,j } over k = 1..K, j = 1..14 to P * 6,j = {P k 6,j } over k = 1..K, j = 1..14.
In more detail: EMPaSchiz first used singular value decomposition of each data source to project it to a lower dimensional space. We extracted principal components from the training instances, then projected each instance onto the eigenvectors (PCA). We used all the components—i.e., set the number of principal components to the smaller of the number of original features or the number of instances. Note these components captured all the variance, but reduced the dimensionality by a huge factor, for most datasets, as the final number of features for each data source was at most the number of instances in training set (~157 subjects in our 10-fold cross-validation). For the few data sources that had fewer features than training instances (e.g., yeo-regional has 17 features), this transformation would not change the number of features, but changed the data to a new basis. The motivation for this procedure was to have a uniform pipeline of PCA transformations for all data sources, irrespective of the varying number of features.
For SSM, EMPaSchiz set the C parameter (inverse of regularization strength) by internal 10-fold cross-validation on the training split (5 shuffled iterations). We call the MSM that combined predictions from all 14 x 6 = 84 SSMs, ‘stacked-multi’. We also considered six other versions of MSM, each combining SSMs for a specific feature type (14 each): stacked-ALFF, stacked-fALFF, stacked-ReHo, stacked-FC-correlation, stacked-FC-partial correlation, stacked-FC-precision.
The EMPaSchiz model was evaluated in five shuffled iterations of a 10-fold balanced cross-validation approach (90% training set, 10% test set; for a total of 50 train-test splits). We evaluated the model’s generalization performance on the test set (in outer cross-validation), computing:
accuracy (Overall, how often is the classifier correct?)
sensitivity (When the actual label is ‘patient’, how often is the prediction correct?)
specificity (When the actual label is ‘control’, how often is the prediction correct?)
precision (When the predicted label is ‘patient’, how often is the prediction correct?)
For each variant, we report the mean and standard errors for these metrics over all 50 train-test splits. To compare MSM models, we used parametric statistical tests (two sided t-test) on the accuracy, using the SciPy package.82
We also performed two additional analyses. First, we explored the effect of feature selection with respect to SSM, using the top-r percentage of the total set of features, based on univariate testing (F-value score) on the model performance. For example, when r = 20%, the EMPaSchiz-Learner would use only 20% of the original features, in each of its 84 SSMs. Note this is instead of using PCA. (So, for the regional features of the ‘aal’ parcellation, instead of using all 116 features, it only considered the top 0.2 × 116 = 23 features, etc.) While computing the cross-validation scores, we ran the feature selection process ‘in fold’ using the ‘pipeline’ class of scikit-learn58 to avoid obtaining optimistically biased estimates. Second, we examined our ensemble prediction framework to distinguish the least symptomatic schizophrenia patients vs. the most symptomatic patients (based on SAPS and SANS); evaluated using leave-one out cross-validation.
Supplementary information
Acknowledgements
This study is supported by IBM Alberta Centre for Advanced Studies and MITACS (IT09558) funds to S.V.K.; Wellcome Trust/DBT India Alliance (500236/Z/11/Z) and DST (DST/SJF/LSA-02/2014-15) research grants to G.V.; Alberta Machine Intelligence Institute and NSERC grants to R.G. V.S. is supported by the ICMR.
Author contributions
G.V., J.C.N., V.S. collected the clinical and neuroimaging data. Clinical symptom ratings were done by V.S., J.C.N. under the supervision of G.V. Data were cleaned and processed by R.A. and S.V.K. S.V.K. designed and implemented the machine learning models, with supervision of R.G., A.J.G., M.R.G.B. and S.M.D. S.V.K. managed the literature search and wrote the first draft of manuscript along with R.G. All authors revised and optimized further versions of the manuscript. All the authors have contributed to and have approved the final manuscript.
DATA AVAILABILITY
The datasets generated during and/or analysed during the current study as well as relevant computer codes that were used to process the data and to generate the results are available from corresponding authors on a reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Sunil Vasu Kalmady, Phone: +1 780 862 2735, Email: kalmady@ualberta.ca.
Ganesan Venkatasubramanian, Phone: +91 (80) 26995256, Email: gvs@nimhans.ac.in.
Supplementary information
Supplementary information accompanies the paper on the npj Schizophrenia website (10.1038/s41537-018-0070-8).
References
- 1.Thomas RI. The NIMH Research Domain Criteria (RDoC) Project: precision medicine for psychiatry. Am. J. Psychiatry. 2014;171:395–397. doi: 10.1176/appi.ajp.2014.14020138. [DOI] [PubMed] [Google Scholar]
- 2.Gejman PV, Sanders AR, Kendler KS. Genetics of schizophrenia: new findings and challenges. Annu. Rev. Genom. Hum. Genet. 2011;12:121–144. doi: 10.1146/annurev-genom-082410-101459. [DOI] [PubMed] [Google Scholar]
- 3.Sass LA, Parnas J. Schizophrenia, consciousness, and the self. Schizophr. Bull. 2003;29:427–444. doi: 10.1093/oxfordjournals.schbul.a007017. [DOI] [PubMed] [Google Scholar]
- 4.Kapur S. Psychosis as a state of aberrant salience: a framework linking biology, phenomenology, and pharmacology in schizophrenia. Am. J. Psychiatry. 2003;160:13–23. doi: 10.1176/appi.ajp.160.1.13. [DOI] [PubMed] [Google Scholar]
- 5.Cuthbert BN, Insel TR. Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 2013;11:126. doi: 10.1186/1741-7015-11-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hyman SE. Can neuroscience be integrated into the DSM-V? Nat. Rev. Neurosci. 2007;8:725–732. doi: 10.1038/nrn2218. [DOI] [PubMed] [Google Scholar]
- 7.Pieper AA, Baraban JM. Moving beyond serendipity to mechanism-driven psychiatric therapeutics. Neurotherapeutics. 2017;14:533–536. doi: 10.1007/s13311-017-0547-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Goldberg JF, Ernst CL. Core concepts involving adverse psychotropic drug effects: assessment, implications, and management. Psychiatr. Clin. North Am. 2016;39:375–389. doi: 10.1016/j.psc.2016.04.001. [DOI] [PubMed] [Google Scholar]
- 9.Kessler RC, et al. Prevalence and treatment of mental disorders, 1990 to 2003. N. Engl. J. Med. 2005;352:2515–2523. doi: 10.1056/NEJMsa043266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hunger SP, et al. Improved survival for children and adolescents with acute lymphoblastic leukemia between 1990 and 2005: a report from the children’s oncology group. J. Clin. Oncol. 2012;30:1663–1669. doi: 10.1200/JCO.2011.37.8018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.National Heart, Lung, and Blood Institute. NHLBI Fact Book, Fiscal Year. Bethesda, MD: NHLBI; 2011. [Google Scholar]
- 12.Kahn RS, et al. Schizophrenia. Nat. Rev. Dis. Prim. 2015;1:15067. doi: 10.1038/nrdp.2015.67. [DOI] [PubMed] [Google Scholar]
- 13.Jablensky A. The diagnostic concept of schizophrenia: its history, evolution, and future prospects. Dialog. Clin. Neurosci. 2010;12:271–287. doi: 10.31887/DCNS.2010.12.3/ajablensky. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kendell R, Jablensky A. Distinguishing between the validity and utility of psychiatric diagnoses. Am. J. Psychiatry. 2003;160:4–12. doi: 10.1176/appi.ajp.160.1.4. [DOI] [PubMed] [Google Scholar]
- 15.Huys QJM, Maia TV, Frank MJ. Computational psychiatry as a bridge from neuroscience to clinical applications. Nat. Neurosci. 2016;19:404–413. doi: 10.1038/nn.4238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Orrù G, Pettersson-Yeo W, Marquand AF, Sartori G, Mechelli A. Using support vector machine to identify imaging biomarkers of neurological and psychiatric disease: a critical review. Neurosci. Biobehav. Rev. 2012;36:1140–1152. doi: 10.1016/j.neubiorev.2012.01.004. [DOI] [PubMed] [Google Scholar]
- 17.Lesh TA, et al. A multimodal analysis of antipsychotic effects on brain structure and function in first-episode schizophrenia. JAMA Psychiatry. 2015;72:226–234. doi: 10.1001/jamapsychiatry.2014.2178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Schnack HG, Kahn RS. Detecting neuroimaging biomarkers for psychiatric disorders: sample size matters. Front. Psychiatry. 2016;7:50. doi: 10.3389/fpsyt.2016.00050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zilles K, Amunts K. Receptor mapping: architecture of the human cerebral cortex. Curr. Opin. Neurol. 2009;22:331–339. doi: 10.1097/WCO.0b013e32832d95db. [DOI] [PubMed] [Google Scholar]
- 20.Eickhoff SB, Rottschy C, Kujovic M, Palomero-Gallagher N, Zilles K. Organizational principles of human visual cortex revealed by receptor mapping. Cereb. Cortex. 2008;18:2637–2645. doi: 10.1093/cercor/bhn024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Talairach, J. & Tournoux, P. Co-Planar Stereotaxic Atlas of the Human Brain: 3-Dimensional Proportional System: An Approach to Cerebral Imaging (G. Thieme, New York, 1988).
- 22.Desikan RS, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31:968–980. doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
- 23.Damoiseaux JS, Greicius MD. Greater than the sum of its parts: a review of studies combining structural connectivity and resting-state functional connectivity. Brain Struct. Funct. 2009;213:525–533. doi: 10.1007/s00429-009-0208-6. [DOI] [PubMed] [Google Scholar]
- 24.Roca P, et al. Inter-subject connectivity-based parcellation of a patch of cerebral cortex. Med Image Comput. Comput. Assist Interv. 2010;13:347–354. doi: 10.1007/978-3-642-15745-5_43. [DOI] [PubMed] [Google Scholar]
- 25.Yeo BT, et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J. Neurophysiol. 2011;106:1125–1165. doi: 10.1152/jn.00338.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Smith SM, et al. Correspondence of the brain’s functional architecture during activation and rest. Proc. Natl Acad. Sci. USA. 2009;106:13040–13045. doi: 10.1073/pnas.0905267106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lashkari D, et al. Search for patterns of functional specificity in the brain: a nonparametric hierarchical Bayesian model for group fMRI data. Neuroimage. 2012;59:1348–1368. doi: 10.1016/j.neuroimage.2011.08.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dosenbach NU, et al. Prediction of individual brain maturity using fMRI. Science. 2010;329:1358–1361. doi: 10.1126/science.1194144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Eickhoff SB, et al. Co-activation patterns distinguish cortical modules, their connectivity and functional differentiation. Neuroimage. 2011;57:938–949. doi: 10.1016/j.neuroimage.2011.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cordes D, Haughton V, Carew JD, Arfanakis K, Maravilla K. Hierarchical clustering to measure connectivity in fMRI resting-state data. Magn. Reson. Imaging. 2002;20:305–317. doi: 10.1016/S0730-725X(02)00503-9. [DOI] [PubMed] [Google Scholar]
- 31.McKeown MJ, et al. Analysis of fMRI data by blind separation into independent spatial components. Hum. Brain. Mapp. 1998;6:160–188. doi: 10.1002/(SICI)1097-0193(1998)6:3<160::AID-HBM5>3.0.CO;2-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yao Z, Hu B, Xie Y, Moore P, Zheng J. A review of structural and functional brain networks: small world and atlas. Brain Inform. 2015;2:45–52. doi: 10.1007/s40708-015-0009-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Thirion, B., Varoquaux, G., Dohmatob, E. & Poline, J.-B. Which fMRI clustering gives good brain parcellations? Front. Neurosci.8, 167 (2014). [DOI] [PMC free article] [PubMed]
- 34.Cabral C, et al. Classifying schizophrenia using multimodal multivariate pattern recognition analysis: evaluating the impact of individual clinical profiles on the neurodiagnostic performance. Schizophr. Bull. 2016;42(Suppl 1):S110–S117. doi: 10.1093/schbul/sbw053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Xia M, Wang J, He Y. BrainNet viewer: a network visualization tool for human brain connectomics. PLoS ONE. 2013;8:e68910. doi: 10.1371/journal.pone.0068910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Menon V, Uddin LQ. Saliency, switching, attention and control: a network model of insula function. Brain. Struct. Funct. 2010;214:655–667. doi: 10.1007/s00429-010-0262-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Takamura T, Hanakawa T. Clinical utility of resting-state functional connectivity magnetic resonance imaging for mood and cognitive disorders. J. Neural Transm. 2017;124:821–839. doi: 10.1007/s00702-017-1710-2. [DOI] [PubMed] [Google Scholar]
- 38.van Amelsvoort T, Hernaus D. Effect of pharmacological interventions on the fronto-cingulo-parietal cognitive control network in psychiatric disorders: a transdiagnostic systematic review of fMRI studies. Front. Psychiatry. 2016;7:82. doi: 10.3389/fpsyt.2016.00082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hu ML, et al. A review of the functional and anatomical default mode network in schizophrenia. Neurosci. Bull. 2017;33:73–84. doi: 10.1007/s12264-016-0090-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Vega Romero, R., Brown, M. & Greiner, R. The challenge of applying machine learning techniques to diagnose schizophrenia using multi-site fMRI data. MSc Thesis, University of Alberta (2017).
- 41.Alderson-Day B, McCarthy-Jones S, Fernyhough C. Hearing voices in the resting brain: A review of intrinsic functional connectivity research on auditory verbal hallucinations. Neurosci. Biobehav. Rev. 2015;55:78–87. doi: 10.1016/j.neubiorev.2015.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Curcic-Blake B, et al. Interaction of language, auditory and memory brain networks in auditory verbal hallucinations. Prog. Neurobiol. 2017;148:1–20. doi: 10.1016/j.pneurobio.2016.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Alderson-Day B, et al. Auditory hallucinations and the brain’s resting-state networks: findings and methodological observations. Schizophr. Bull. 2016;42:1110–1123. doi: 10.1093/schbul/sbw078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Damaraju E, et al. Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia. NeuroImage. 2014;5:298–308. doi: 10.1016/j.nicl.2014.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yu Q, et al. Brain connectivity networks in schizophrenia underlying resting state functional magnetic resonance imaging. Curr. Top. Med. Chem. 2012;12:2415–2425. doi: 10.2174/156802612805289890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zhou Y, Fan L, Qiu C, Jiang T. Prefrontal cortex and the dysconnectivity hypothesis of schizophrenia. Neurosci. Bull. 2015;31:207–219. doi: 10.1007/s12264-014-1502-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pettersson-Yeo W, Allen P, Benetti S, McGuire P, Mechelli A. Dysconnectivity in schizophrenia: where are we now? Neurosci. Biobehav. Rev. 2011;35:1110–1124. doi: 10.1016/j.neubiorev.2010.11.004. [DOI] [PubMed] [Google Scholar]
- 48.Fornito A, Bullmore ET. Reconciling abnormalities of brain network structure and function in schizophrenia. Curr. Opin. Neurobiol. 2015;30:44–50. doi: 10.1016/j.conb.2014.08.006. [DOI] [PubMed] [Google Scholar]
- 49.Friston K, Brown HR, Siemerkus J, Stephan KE. The dysconnection hypothesis. Schizophr. Res. 2016;176:83–94. doi: 10.1016/j.schres.2016.07.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Monte-Silva K, et al. Induction of late LTP-like plasticity in the human motor cortex by repeated non-invasive brain stimulation. Brain Stimul. 2013;6:424–432. doi: 10.1016/j.brs.2012.04.011. [DOI] [PubMed] [Google Scholar]
- 51.Moseley P, Alderson-Day B, Ellison A, Jardri R, Fernyhough C. Non-invasive brain stimulation and auditory verbal hallucinations: new techniques and future directions. Front. Neurosci. 2015;9:515. doi: 10.3389/fnins.2015.00515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Bose A, et al. Efficacy of fronto-temporal transcranial direct current stimulation for refractory auditory verbal hallucinations in schizophrenia: a randomized, double-blind, sham-controlled study. Schizophr. Res. 2018;195:475–480. doi: 10.1016/j.schres.2017.08.047. [DOI] [PubMed] [Google Scholar]
- 53.Mondino M, et al. Effects of fronto-temporal transcranial direct current stimulation on auditory verbal hallucinations and resting-state functional connectivity of the left temporo-parietal junction in patients with schizophrenia. Schizophr. Bull. 2016;42:318–326. doi: 10.1093/schbul/sbv114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Sheehan DV, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J. Clin. Psychiatry. 1998;59(Suppl 20):22–33. [PubMed] [Google Scholar]
- 55.Andreasen NC, Arndt S, Miller D, Flaum M, Nopoulos P. Correlational studies of the scale for the assessment of negative symptoms and the scale for the assessment of positive symptoms: an overview and update. Psychopathology. 1995;28:7–17. doi: 10.1159/000284894. [DOI] [PubMed] [Google Scholar]
- 56.Chao-Gan Y, Yu-Feng Z. DPARSF: a MATLAB toolbox for “Pipeline” data analysis of resting-state fMRI. Front. Syst. Neurosci. 2010;4:13. doi: 10.3389/fnsys.2010.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Abraham A, et al. Machine learning for neuroimaging with scikit-learn. Front. Neuroinform. 2014;8:14. doi: 10.3389/fninf.2014.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Pedregosa F, et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
- 59.Power JD, Barnes KA, Snyder AZ, Schlaggar BL, Petersen SE. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage. 2012;59:2142–2154. doi: 10.1016/j.neuroimage.2011.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Satterthwaite TD, et al. Impact of in-scanner head motion on multiple measures of functional connectivity: relevance for studies of neurodevelopment in youth. Neuroimage. 2012;60:623–632. doi: 10.1016/j.neuroimage.2011.12.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Van Dijk KR, Sabuncu MR, Buckner RL. The influence of head motion on intrinsic functional connectivity MRI. Neuroimage. 2012;59:431–438. doi: 10.1016/j.neuroimage.2011.07.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Chang X, et al. Distinct inter-hemispheric dysconnectivity in schizophrenia patients with and without auditory verbal hallucinations. Sci. Rep. 2015;5:11218. doi: 10.1038/srep11218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Friston KJ, et al. Statistical parametric maps in functional imaging: a general linear approach. Hum. Brain Mapp. 1994;2:189–210. doi: 10.1002/hbm.460020402. [DOI] [Google Scholar]
- 64.Varoquaux G, Gramfort A, Pedregosa F, Michel V, Thirion B. Multi-subject dictionary learning to segment an atlas of brain spontaneous activity. Inf. Process. Med. Imaging. 2011;22:562–573. doi: 10.1007/978-3-642-22092-0_46. [DOI] [PubMed] [Google Scholar]
- 65.Tzourio-Mazoyer N, et al. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage. 2002;15:273–289. doi: 10.1006/nimg.2001.0978. [DOI] [PubMed] [Google Scholar]
- 66.Bellec P, Rosa-Neto P, Lyttelton OC, Benali H, Evans AC. Multi-level bootstrap analysis of stable clusters in resting-state fMRI. Neuroimage. 2010;51:1126–1139. doi: 10.1016/j.neuroimage.2010.02.082. [DOI] [PubMed] [Google Scholar]
- 67.Destrieux C, Fischl B, Dale A, Halgren E. A sulcal depth-based anatomical parcellation of the cerebral cortex. Neuroimage. 2009;47:S151. doi: 10.1016/S1053-8119(09)71561-7. [DOI] [Google Scholar]
- 68.Power JD, et al. Functional network organization of the human brain. Neuron. 2011;72:665–678. doi: 10.1016/j.neuron.2011.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Zou QH, et al. An improved approach to detection of amplitude of low-frequency fluctuation (ALFF) for resting-state fMRI: fractional ALFF. J. Neurosci. Methods. 2008;172:137–141. doi: 10.1016/j.jneumeth.2008.04.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Hoptman MJ, et al. Amplitude of low-frequency oscillations in schizophrenia: a resting state fMRI study. Schizophr. Res. 2010;117:13–20. doi: 10.1016/j.schres.2009.09.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Zang YF, et al. Altered baseline brain activity in children with ADHD revealed by resting-state functional MRI. Brain Dev. 2007;29:83–91. doi: 10.1016/j.braindev.2006.10.001. [DOI] [PubMed] [Google Scholar]
- 72.Kendall MG. Rank Correlation Methods. Oxford: Griffin; 1948. [Google Scholar]
- 73.Zang Y, Jiang T, Lu Y, He Y, Tian L. Regional homogeneity approach to fMRI data analysis. Neuroimage. 2004;22:394–400. doi: 10.1016/j.neuroimage.2003.12.030. [DOI] [PubMed] [Google Scholar]
- 74.Crowley S, et al. Considering total intracranial volume and other nuisance variables in brain voxel based morphometry in idiopathic PD. Brain Imaging Behav. 2018;12:1–12. doi: 10.1007/s11682-016-9656-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Turner J, et al. A multi-site resting state fMRI study on the amplitude of low frequency fluctuations in schizophrenia. Front. Neurosci. 2013;7:137. doi: 10.3389/fnins.2013.00137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Liu H, et al. Decreased regional homogeneity in schizophrenia: a resting state functional magnetic resonance imaging study. Neuroreport. 2006;17:19–22. doi: 10.1097/01.wnr.0000195666.22714.35. [DOI] [PubMed] [Google Scholar]
- 77.Chen J, et al. Comparative study of regional homogeneity in schizophrenia and major depressive disorder. Am. J. Med. Genet. 2013;162B:36–43. doi: 10.1002/ajmg.b.32116. [DOI] [PubMed] [Google Scholar]
- 78.Lynall ME, et al. Functional connectivity and brain networks in schizophrenia. J. Neurosci. 2010;30:9477–9487. doi: 10.1523/JNEUROSCI.0333-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Das A, et al. Interpretation of the precision matrix and its application in estimating sparse brain connectivity during sleep spindles from human electrocorticography recordings. Neural Comput. 2017;29:603–642. doi: 10.1162/NECO_a_00936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ. LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res. 2008;9:1871–1874. [Google Scholar]
- 81.Raschka, S. MLxtend: Providing machine learning and data science utilities and extensions to Python’s scientific computing stack. J. Open Source Softw. 3, 24 10.21105/joss.00638 (2018). http://joss.theoj.org/papers/10.21105/joss.00638.
- 82.Jones, E., Oliphant, T. & Peterson, P. SciPy: Open source scientific tools for Python (2001). http://www.scipy.org/.
- 83.Shen H, Wang L, Liu Y, Hu D. Discriminative analysis of resting-state functional connectivity patterns of schizophrenia using low dimensional embedding of fMRI. Neuroimage. 2010;49:3110–3121. doi: 10.1016/j.neuroimage.2009.11.011. [DOI] [PubMed] [Google Scholar]
- 84.Fan Y, et al. Discriminant analysis of functional connectivity patterns on Grassmann manifold. Neuroimage. 2011;56:2058–2067. doi: 10.1016/j.neuroimage.2011.03.051. [DOI] [PubMed] [Google Scholar]
- 85.Yu Y, Shen H, Zeng LL, Ma Q, Hu D. Convergent and divergent functional connectivity patterns in schizophrenia and depression. PLoS ONE. 2013;8:e68250–e68250. doi: 10.1371/journal.pone.0068250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Anderson A, Cohen MS. Decreased small-world functional network connectivity and clustering across resting state networks in schizophrenia: an fMRI classification tutorial. Front. Hum. Neurosci. 2013;7:520–520. doi: 10.3389/fnhum.2013.00520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Arbabshirani M, Kiehl K, Pearlson G, Calhoun V. Classification of schizophrenia patients based on resting-state functional network connectivity. Front. Neurosci. 2013;7:133–133. doi: 10.3389/fnins.2013.00133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Yu Y, et al. Functional connectivity-based signatures of schizophrenia revealed by multiclass pattern analysis of resting-state fMRI from schizophrenic patients and their healthy siblings. Biomed. Eng. Online. 2013;12:10–10. doi: 10.1186/1475-925X-12-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Guo S, Kendrick KM, Yu R, Wang HLS, Feng J. Key functional circuitry altered in schizophrenia involves parietal regions associated with sense of self. Hum. Brain. Mapp. 2014;35:123–139. doi: 10.1002/hbm.22162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Brodersen KH, et al. Dissecting psychiatric spectrum disorders by generative embedding. NeuroImage. Clin. 2014;4:98–111. doi: 10.1016/j.nicl.2013.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Anticevic A, et al. Characterizing thalamo-cortical disturbances in schizophrenia and bipolar illness. Cereb. Cortex. 2014;24:3116–3130. doi: 10.1093/cercor/bht165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Watanabe T, Kessler D, Scott C, Angstadt M, Sripada C. Disease prediction based on functional connectomes using a scalable and spatially-informed support vector machine. Neuroimage. 2014;96:183–202. doi: 10.1016/j.neuroimage.2014.03.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Chyzhyk D, Grana M, Ongur D, Shinn AK. Discrimination of schizophrenia auditory hallucinators by machine learning of resting-state functional MRI. Int. J. Neural Syst. 2015;25:1550007. doi: 10.1142/S0129065715500070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Cheng W, et al. Voxel-based, brain-wide association study of aberrant functional connectivity in schizophrenia implicates thalamocortical circuitry. NPJ Schizophr. 2015;1:15016–15016. doi: 10.1038/npjschz.2015.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Peters H, et al. More consistently altered connectivity patterns for cerebellum and medial temporal lobes than for amygdala and striatum in schizophrenia. Front. Hum. Neurosci. 2016;10:55–55. doi: 10.3389/fnhum.2016.00055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Mikolas P, et al. Connectivity of the anterior insula differentiates participants with first-episode schizophrenia spectrum disorders from controls: a machine-learning study. Psychol. Med. 2016;46:2695–2704. doi: 10.1017/S0033291716000878. [DOI] [PubMed] [Google Scholar]
- 97.Yang H, He H, Zhong J. Multimodal MRI characterisation of schizophrenia: a discriminative analysis. Lancet. 2016;388(Suppl):S36–S36. doi: 10.1016/S0140-6736(16)31963-8. [DOI] [Google Scholar]
- 98.Iwabuchi SJ, Palaniyappan L. Abnormalities in the effective connectivity of visuothalamic circuitry in schizophrenia. Psychol. Med. 2017;47:1–11. doi: 10.1017/S0033291716003469. [DOI] [PubMed] [Google Scholar]
- 99.Lottman KK, et al. Risperidone effects on brain dynamic connectivity—a prospective resting-state fMRI study in schizophrenia. Front. Psychiatry. 2017;8:14–14. doi: 10.3389/fpsyt.2017.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Guo W, et al. Family-based case-control study of homotopic connectivity in first-episode, drug-naive schizophrenia at rest. Sci. Rep. 2017;7:43312–43312. doi: 10.1038/srep43312. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during and/or analysed during the current study as well as relevant computer codes that were used to process the data and to generate the results are available from corresponding authors on a reasonable request.