Skip to main content
NeuroImage : Clinical logoLink to NeuroImage : Clinical
. 2018 May 9;19:476–486. doi: 10.1016/j.nicl.2018.04.037

Classification of multi-site MR images in the presence of heterogeneity using multi-task learning

Qiongmin Ma a,b,c,, Tianhao Zhang b, Marcus V Zanetti d, Hui Shen a, Theodore D Satterthwaite e, Daniel H Wolf e, Raquel E Gur e, Yong Fan b, Dewen Hu a,1, Geraldo F Busatto d,1, Christos Davatzikos b,1
PMCID: PMC6029565  PMID: 29984156

Abstract

With the advent of Big Data Imaging Analytics applied to neuroimaging, datasets from multiple sites need to be pooled into larger samples. However, heterogeneity across different scanners, protocols and populations, renders the task of finding underlying disease signatures challenging. The current work investigates the value of multi-task learning in finding disease signatures that generalize across studies and populations. Herein, we present a multi-task learning type of formulation, in which different tasks are from different studies and populations being pooled together. We test this approach in an MRI study of the neuroanatomy of schizophrenia (SCZ) by pooling data from 3 different sites and populations: Philadelphia, Sao Paulo and Tianjin (50 controls and 50 patients from each site), which posed integration challenges due to variability in disease chronicity, treatment exposure, and data collection. Some existing methods are also tested for comparison purposes. Experiments show that classification accuracy of multi-site data outperformed that of single-site data and pooled data using multi-task feature learning, and also outperformed other comparison methods. Several anatomical regions were identified to be common discriminant features across sites. These included prefrontal, superior temporal, insular, anterior cingulate cortex, temporo-limbic and striatal regions consistently implicated in the pathophysiology of schizophrenia, as well as the cerebellum, precuneus, and fusiform, middle temporal, inferior parietal, postcentral, angular, lingual and middle occipital gyri. These results indicate that the proposed multi-task learning method is robust in finding consistent and reliable structural brain abnormalities associated with SCZ across different sites, in the presence of multiple sources of heterogeneity.

Keywords: Multi-task learning, Multi-site classification, Imaging heterogeneity, Sparsity, Schizophrenia, MRI

Highlights

  • Multi-task learning is tested to classify MRI data of 3 sites (50 controls and 50 schizophrenia patients from each site).

  • Classification accuracy of multi-site data outperformed that of single-site and pooled data.

  • Several cortical and subcortical anatomical regions were found to be common discriminant features across different sites.

  • Multi-task learning is robust in finding reliable structural brain abnormalities of schizophrenia across different sites.

1. Introduction

Neuroimaging studies have widely explored the clinical value of machine learning methods for differentiating psychiatric patients from healthy controls at the individual level. In addition to providing individualized indices for diagnostic purposes, machine learning methods may ultimately help identify brain regions affected by disease in subtle ways that can only be elucidated using multi-variate analysis. While most of these neuroimaging studies to date have been performed using single-site datasets, it is essential to integrate multi-site data for two reasons. First, multi-site data provide sufficient statistical power for detecting subtle, but informative patterns of brain structure and function (Brown et al., 2011; Friedman et al., 2006; Schnack et al., 2010), which may be difficult to unravel with the relative small sample sizes usually acquired in single centers (Pearlson, 2009; Segall et al., 2009). Second, large sample sizes enhance sample generalizability by pooling large patient populations with diverse demographic features and clinical characteristics including disease onset, symptom severity, and types and duration of treatment (Brown et al., 2011; Friedman et al., 2006; Glover et al., 2012; Pearlson, 2009; Sutton et al., 2008; Van Horn and Toga, 2009). Multi-site studies are therefore becoming increasingly the norm in neuroimaging research (Casey et al., 1998; Van Horn and Toga, 2009).

Multi-site data reflect a more comprehensive abnormal pattern of disease, and therefore may provide a richer understanding of disease signatures than single-site data. However, two recent studies found that simple pooling of multi-site data did not outperform single-site disease classification. Colby (Colby et al., 2012) and Nielsen (Nielsen et al., 2013) pooled multi-site data and trained a common classifier for all data, to identify attention deficit hyperactivity disorder (ADHD) and autism, respectively. Results in these two studies showed that the pooled dataset exhibited lower accuracy than each single-site datasets. Here we seek a new approach to synergistically integrating multi-site data, by emphasizing two points. First, since datasets collected in multiple imaging centers have a common disorder of interest (e.g. schizophrenia (SCZ), in our experiments herein), the abnormal patterns in each dataset are strongly related and thus, to some extent, may share a common imaging signature. The pattern reproducibility among multi-site data has been repeatedly demonstrated in several multi-site studies on functional MRI (fMRI) (Casey et al., 1998; Costafreda et al., 2007; Gee et al., 2015; Jovicich et al., 2015), morphometric MRI (Cannon et al., 2014; Schnack et al., 2010), and diffusion-tensor imaging (DTI) (Jovicich et al., 2014; Pfefferbaum et al., 2003). Most of these studies arrived at similar conclusions, namely that with appropriate multi-site data collection, different data sites shared highly consistent feature patterns. With this, the site-shared features reveal consistent brain abnormalities in multi-site data, which can lead to a more accurate neurobiological understanding of the psychiatric disorder under investigation. On the other hand, though the integration of multiple single-site data is advantageous, the unavoidable presence of site-specific features might decrease the accuracy of a classifier that merely pools data together across studies. Heterogeneity can emanate from multiple sources including scanner differences, differences in image acquisition protocols, or ethnic and treatment differences among participating patient populations (Jovicich et al., 2015; Schnack et al., 2010; Van Horn and Toga, 2009). Given such site-related heterogeneity, multi-site datasets should not simply be merged into larger cohorts for further machine learning investigation (Pan and Yang, 2010). In an attempt to eliminate or reduce the site-specific variability, studies have suggested same scanning protocols, consistent scanner parameters and etc. (Brown et al., 2011; Calhoun and Adali, 2009; Pearlson, 2009; You et al., 2011) in data collection, as well as the utilization of smoothness equalization (Friedman et al., 2006) and independent component analysis (Kim et al., 2009; Meda et al., 2008) in data preprocessing. Despite these efforts, site-specific heterogeneities still exist due to their complex causes (Pearlson, 2009; Segall et al., 2009).

The above considerations highlight the need for a feature-learning framework in multi-site disease classification that can extract the site-shared features, while also accounting for the site-specific features; this approach generally seeks an overarching signature of disease, whereas it accommodates potential sub-cohort and other differences to be taken into consideration. In recent years, multi-task learning has been successful in learning task-shared and task-specific features simultaneously, which effectively improves generalization compared with traditional machine learning methods. For example, support vector machine (SVM) with single-task learning (Caruana, 1997) learns a distinct feature pattern and finds a maximum margin hyperplane to classify two groups, which extracts information within a single learning task. In contrast, multi-task learning extracts a subset of task-shared features to generate more accurate models on multiple tasks, with the task-specific features learned simultaneously (Marquand et al., 2014). The basic assumption of multi-task learning is that the feature weights of different tasks share similar sparse patterns (Chen et al., 2012), which can be learned by imposing sparsity regularization penalties on the task weight matrix (Kumar and Daume III, 2012). l1-norm and l2,1-norm are two commonly used sparsity regulating terms in multi-task learning, which enforces the weight matrix of different tasks to be sparse across all tasks. Particularly, an l1-norm term highlights task-specific features by encouraging the weights of irrelevant features to be very small (Wang et al., 2015), while l2,1-norm introduces group sparsity and enforces task-shared features to have larger weights (Watanabe et al., 2014; Yan et al., 2015).

The advantage of multi-task learning makes it suitable for multi-site data learning, considering the site as task, and the site-shared and site-specific features as task-shared and task-specific features. Neuroimaging studies have shown the effectiveness of performing multi-task learning in the brain decoding and disease classification (Marquand et al., 2014; Obozinski et al., 2010; Rao et al., 2013; Wang et al., 2015; Watanabe et al., 2014). Specifically, multi-site fMRI data of ADHD was demonstrated better than single-site classification by learning site-shared and site-specific features using multi-task scheme (Watanabe et al., 2014). In this work, though the multi-task learning scheme successfully extracted site-shared and site-specific features in multi-site data, the form of the objective function was rather complex and specific as it included an l2,1-norm group sparsity regularization term and a 6-D spatial structure penalty (generated by the GraphNet, fused Lasso, or the isotropic total variation). In order to make the multi-task learning scheme more simple and applicable, an objective function including l1-norm and l2,1-norm penalty terms was used in the current study (Wang et al., 2011; Wang et al., 2015).

Building upon this emerging literature, we aim to distinguish SCZ patients from healthy controls across multiple-site MRI data using multi-task learning. We hypothesized that using multi-task learning framework on multi-site classification would not only have better performance than single-site data classification, but would also identify the abnormalities shared by all sites, and also specific to each site. These site-shared brain structural alterations should be consistent with the previously reported altered regions in SCZ, such as the brain regions involving prefrontal, superior temporal, insular, temporo-limbic regions, among others.

2. Materials and methods

2.1. Participants and MRI acquisitions

MRI data were collected by three academic centers, including locations in the United States (University of Pennsylvania; site A) (Davatzikos et al., 2005), Brazil (University of Sao Paulo; site B) (Schaufelberger et al., 2007; Zanetti et al., 2013), and China (Tianjin; site C). From each site, a balanced dataset was obtained with 50 normal controls (NCs) and 50 SCZ patients randomly selected from a larger pool of available subjects. In total, we had 150 NCs and 150 SCZs, which didn't differ by age and gender significantly (p > 0.05; see Table 1).

Table 1.

Characteristics of the participants in this study.

Variable Sample size Gender (male/female) Age (years) Mean ± SD (range)
Site A SCZ 50 28/22 35.38 ± 11.78(19–60)
NC 50 25/25 32.50 ± 12.96(15–65)
p value 0.69a 0.25b
Site B SCZ 50 34/16 27.48 ± 7.89(18–50)
NC 50 29/21 30.60 ± 8.17(18–50)
p value 0.30a 0.055b
Site C SCZ 50 25/25 34.10 ± 8.44(16–56)
NC 50 22/28 32.24 ± 11.42(21–57)
p value 0.55a 0.36b
Site A + B + C SCZ 150 87/63 32.32 ± 10.08(16–60)
NC 150 76/74 31.78 ± 10.99(15–65)
p value 0.20a 0.66b

Note: SCZ: schizophrenia; NC: normal controls.

a

Pearson Chi-square test.

b

Two-sample t-test.

All SCZ patients met DSM-IV criteria. Written informed consent was obtained from all participants before MRI scanning. In site A, the 50 SCZ patients had chronic symptoms and were receiving treatment with antipsychotics (mean duration of illness 16.2 ± 12.3 years). In site B, all SCZ subjects were recruited shortly after they made their first contact with mental health services due to psychotic symptoms, and their duration of illness was 1.0 ± 1.3 years; 31 patients had been on antipsychotic treatment within 3 weeks of MRI, while the remaining 19 patients were free of antipsychotics at the time of MRI scanning. Site C contributed 5 first-episode, never-treated SCZ patients and 45 chronic SCZ patients under antipsychotic treatment (mean duration of illness 10.5 ± 7.2 years).

In site A, the imaging data were acquired using a Siemens Trio 3-T scanner (Siemens Medical Systems, Erlangen Germany), with the following protocol: slice thickness = 1 mm, TE = 3.51 ms, TR = 18.1 ms, flip angle = 9°, acquisition matrix = 240 × 180, and slice number = 160, no gaps, 1-mm isotropic voxels. In site B, the T1 images were acquired using two identical 1.5-T GE Signa scanners (GE Medical Systems, Milwaukee WI, USA) with the following protocol: T1-SPGR sequence providing 124 contiguous slices, TE = 5.2 ms, TR = 21.7 ms, flip angle = 20°, FOV = 22 cm, acquisition matrix = 256 × 192. Details of the MRI scanning in site B can be found in Zanetti et al. (2013). In site C, all scans were obtained using a GE 3-T Signa scanner (GE Medical Systems, Milwaukee WI, USA) with the following protocol: slice thickness = 1 mm, TE = 3.2 ms, TR = 8.2 ms, flip angle = 12°, acquisition matrix = 256 × 256, FOV = 25.6 cm.

2.2. Preprocessing

The imaging data were preprocessed using the following steps as previously described (Davatzikos, 1998; Davatzikos et al., 2001; Goldszal et al., 1998; Shen and Davatzikos, 2003): (1) skull stripping, (2) bias field correction, (3) brain tissue segmentation into gray matter (GM), white matter (WM), cerebrospinal fluid (CSF), (4) spatial registration to Montreal Neurological Institute (MNI) template, (5) generation of the Regional Analysis of Volumes Examined in Normalized Space (RAVENS) (Davatzikos et al., 2001; Goldszal et al., 1998) maps of GM, WM, and CSF using deformable registration (DRAMMS) (Ou et al., 2011), and (6) spatial smoothing of RAVENS maps using a 6-mm Full Width at Half Maximum (FWHM) Gaussian filter. In our study, we investigated the RAVENS value changes in the GM images, which indicated regional GM volumetric alterations in the brain. Note that the RAVENS method has been demonstrated to provide more accurate and reliable quantification of volumes of different brain regions (Goldszal et al., 1998). In order to reduce computational requirements, and since SCZ is not known to be associated with very localized volumetric reductions, the GM RAVENS maps were resampled to have a spatial resolution of 3 × 3 × 3 mm3. After applying a GM mask, 66,732 voxel-wise features for each subject were extracted. These images yielded a feature matrix of dimensionality of 100 × 66,732 for each site dataset. In order to ensure that features in each site are within the same level and contribute equally in the process of feature weight learning, the feature matrices were then standardized to zero mean and unit variance across all subjects within each site, respectively.

2.3. Multi-site classification using multi-task learning framework

In our study, the multi-task learning framework was used to learn the site-shared and site-specific features in the three data sites. These selected features were then used to classify the different data sites respectively. The workflow of multi-site learning is shown in Fig. 1.

Fig. 1.

Fig. 1

Workflow of each iteration in multi-site classification (with total 20 repeated experiments). (1) There were three sites in our study (sites A, B, and C). After preprocessing, the N samples in each data site were randomly divided into 5 folds and designated to three sets: one fold for testing, 80% and 20% of the other 4 folds for training and validation, respectively. Each sample had the feature dimension of D. (2) Feature learning. The training sets from three sites were used in the multi-task learning framework, which generated the feature weights of the three sites W. The three column vectors in the weight matrix W were then sorted according to their absolute values respectively, while the feature weights of the site-shared features obtained according to the K top ranked feature weights were strengthened. The new feature weights matrix W was generated. (3) Parameter tuning. Using the feature weights learned in the last step, the K features and their corresponding weights in the validation samples from three sites were used to classify the validation sets. The parameter set which contributed to the best classification accuracy was selected. (4) Testing. The best parameter set was used in testing and the classification accuracy was obtained.

We model the classification between SCZs and NCs of different sites as a multi-task learning problem with t learning tasks (one task for each site). In the ith learning task, Xi = [xi1xi2, …, xin] ∈ Rd×n is features of training samples, where d is the number of features and n is number of training samples, and their class labels are Yi = [yi1yi2, …, yin] ∈ Rn, where yj ∈ {+1, −3} stands for NCs and SCZs, respectively. Let wi ∈ Rd denotes a feature weight vector for the ith learning task, and W = [w1w2, …, wt] ∈ Rd×t denotes the feature weight matrix of all tasks. The columns of W (wi ∈ Rd) are the feature weights in each task, and the rows of W (wp ∈ Rt, where p = 1, 2,⋯, d) hold the weight vectors of each feature across all learning tasks. Our goal is to learn the feature weight matrix W by minimizing the following cost function:

argmini=1tj=1nmax01yijwiTxij+αW1+βW2,1, (1)

where wiTxij is the dot product of the feature weights and feature values in each training sample, and the first term in the objective function is the hinge loss cost of all training samples in all tasks, the second term ‖W1 = ∑i=1t|wi| is l1-norm of W, the last term ‖W2, 1 = ∑k=1dwk2 is a l2,1-norm penalty. The l1-norm term encourages task-specific sparsity, while l2,1-norm term encourages group sparsity among all tasks to identify task-shared features. The l2,1 + l1 regularization terms consider both the task-specific and task-shared features, which make multi-task learning more suitable in classification of the heterogeneous data. The optimal stochastic alternating direction method of multipliers (SADMM) was adopted to solve the optimization function (Azadi and Sra, 2014; Wang et al., 2015). Details in the solution of the optimal problem using SADMM, and the converge prove can be found in Wang et al. (2015).

In our study, the learning of feature weights at each data site was considered as a different task. Due to the sparse penalty constrains in the object function of multi-task learning, many elements in the feature weight matrix W were forced to zero, or approximately equal to zero. Weight values reflect the importance levels of different features in learning the classification boundaries between two groups, while weight values that are positive or negative tend to classify the sample into either positive or negative classes, respectively. We assume that features with larger weight coefficients (in terms of absolute value) contribute more in discriminating SCZ from NCs. Moreover, features with zero and near-zero weights make small contributions in the classification. Thus, in order to select the more important features for further classification, we sorted the weight vector in each data site according to their absolute values after the weights W were learned.

The consensus features in a small portion of the top-ranked features-which were set to 5% across three data sites, were considered as the site-shared features. The threshold of 5% was chosen because in terms of classification performance, it outperformed others in a range of 5%, 10%, and 20% (Tan et al., 2013). For a single data site, the site-specific and site-shared features were mixed together, as well as some irrelevant features. Using the multi-task feature learning scheme, the feature weights of the site-specific and site-shared features were obviously greater than the irrelevant features. Though the site-shared and site-specific features were both very important, the site-shared features were considered underlying the pathology of SCZ and more critical in characterizing disease alterations. Thus, to further emphasize the influence of site-shared features in the classification, we increased the weights of site-shared features using the following equation,

W=WspecificγWshared, (2)

where γ ≥ 1.

Note that the γ was not set to be a free parameter, which was also tuned using the nested-loop cross-validation in our study.

2.4. Classification with top ranked features

The K top ranked features with the new feature weights were used to classify each dataset. The classifier equation is shown as follows.

y=signwipxp, (3)

where wip denotes the new weight value of the pth feature in the ith data site, xp denotes the pth feature value, and y denotes the sample label. Given one testing sample in the ith data site, the dot product of the feature weights, and the feature values of the K selected features were summed up to get the sample label.

2.5. Validation experiments

To evaluate the proposed method, ten comparison experiments were designed in the consideration of three aspects. (1) To show the effectiveness of gathering multi-site data, we reported the classification results of single-site, pooled, and multi-site data, respectively. Single-site classification learns features of each single-site data separately for classification; pooled classification pools the three data sites together as a larger dataset regardless of the site differences, which contributed to a simple binary classification of 150 controls and 150 patients; and multi-site classification uses multi-task learning to learn the site-specific and site-shared features simultaneously, in the three data sites, and combine the two types of features to classify each dataset separately. (2) To show the effectiveness of using multi-task feature learning framework, two widely used feature learning methods including principal component analysis (PCA) and two-sample t-test (ttest2) were also tested. (3) The proposed classifier based on the learned feature weights was compared to the SVM classifier with a linear kernel. Differences between these two classifiers exist as the SVM focuses on the selected features and calculates new feature weights for classification, while our proposed classifier takes advantage of the feature weights obtained in the feature learning step, and then generate the classification label. Table 2 listed the ten comparison methods in the current study.

Table 2.

Summary of the comparison methods.

Method Description
l2,1 + l1 (MS) The proposed multi-task classification framework on multi-site data, which contains l2,1 + l1 regularization terms in the feature learning step, and the classifier based on the learned feature weights.
l1 (SS) Single-task classification on each single site data: the single-task feature learning step uses the l1 regularization term, and the classifier uses the same classifier formula as the multi-task classification framework.
l1 (PO) Data from all the datasets are pooled together as a larger dataset. The classification framework is the same as the single-task classification.
l2,1 + l1 + SVM (MS) Using multi-site data, the feature learning step is multi-task learning, and uses SVM classifier with the linear kernel.
l1 + SVM (SS) Use single-task feature learning framework and a SVM classifier to classify each single site data.
l1 + SVM (PO) Use single-task feature learning framework and a SVM classifier to classify the pooled data.
PCA + SVM (SS) Use principal component analysis (PCA) to learn features in each single site data and a SVM classifier with the linear kernel.
PCA + SVM (PO) Use principal component analysis (PCA) to learn features on the pooled data and a SVM classifier with the linear kernel.
ttest2 + SVM (SS) Use two-sample t-test (ttest2) to learn features in each single site data and a SVM classifier with the linear kernel.
ttest2 + SVM (PO) Using two-sample t-test (ttest2) to learn features on the pooled data and a SVM classifier with the linear kernel.

In the first three experiments, the same proposed feature learning and classifier framework was applied in the multi-site, single-site and pooled data classification, respectively. In the single-site data classification, for each data site, we used the l1-norm feature selection (Wang et al., 2015), which included the hinge loss function and the l1-norm sparsity regularization term in the cost function. In the pooled data classification, the feature learning cost function was similar to the single-site classification, which considered the pooled data as a single cohort. Note that the feature learning framework applied in the single-site data classification and the pooled data classification was a special case of the multi-task feature learning framework, with the l2,1 sparsity regularization term disabled. The site-specific features were learned in the same way in all three classification experiments, while the site-shared features were only emphasized in multi-site learning, which could show the influence of site-shared features more clearly. The classifier for single-site data and pooled data classification was the same as Eq. (3), however, increasing weights of site-shared features in Eq. (2) was not applied as we did in multi-site learning, since there were no site-shared features in the single-site and pooled data classification.

In other seven experiments, the SVM classifier with a linear kernel was used after different feature learning methods. As shown in Table 2, the 4th to 6th experiments used the proposed multi-task feature learning technique, taking the feature learning method in single-site and pooled data as a special case of multi-site feature learning. In the 7th and 8th experiments, the PCA method was used, while in the 9th and 10th experiments, the ttest2 method was used in the single-site and pooled data classification, respectively.

The nested loop 5-fold cross-validation was used in the ten experiments to evaluate the classification performance. In specific, the samples were randomly separated into five subsets: one (20% of all samples) was used for testing, and the other four (80% of all samples) were divided further into two parts with the proportion rates of 80% (64% of all samples) and 20% (16% of all samples), respectively. The larger of the two parts was used as a training set, and the smaller part was used as a validation set, as described previously (Koutsouleris et al., 2014). The training samples were used for feature learning and classifier training, while the validation samples were used to tune all parameters with the trained classifier, and the best parameter set which contributed to the best classification accuracy of the validation samples was selected. Finally, the best parameters in the validation step were used to classify the testing samples and obtain the classification result. The whole process was repeated for 20 repeated experiments. The average accuracies of the 20 iterations for each site was considered as the final classification result.

The parameters of the multi-task feature learning framework were tuned by grid searching the regularization factors in the range of α, β ∈ {10−8, 10−6, 10−4, 10−2, 10−1, 0.3, 0.5, 1, 5, 10, 102, 103}, while in the special case of using multi-task feature learning in the single-site and pooled data, the l2,1-norm term was disabled and β was set to 0. In the proposed classifier in the multi-site classification (the first experiment), the ratio to increase the site-shared feature weights γ was tuned in the range of {1, 2, …, 100}. The number of selected features K ranged from {500, …, 20,000}, with the step-size of 100. In the experiments with the SVM classifier, the linear kernel was used and the cost value C was set to 1.

The site-shared and site-specific features were obtained in the multi-site classification, which used the proposed multi-task learning framework. After the parameter tuning, the best parameter set of the regulation terms α, β, the number of selected features K, and the site-shared feature increasing ratio γ were obtained. In each iteration, the best classification accuracy in each site corresponded to a specific W vector. We first calculated the mean W vectors of the 20 iterations that were contributed to the best accuracy in all three sites respectively, then the mean W vector of sites A, B, and C was obtained. The consensus features with weights that ranked the top 5% of the three sites were shown as the site-shared features, and the remaining features were considered as the site-specific features for each site, respectively. The feature weights of the site-shared and site-specific features were displayed using the Caret package (Van Essen, 2005), in which the clusters comprising of >50 voxels were shown.

3. Results

The classification results of multi-site, single-site, and pooled data were listed in Table 3. Using the proposed multi-task feature learning framework and the classifier based on the learned feature weights, the average classification accuracies of sites A, B, and C, based on the multi-site data learning, were 78.5%, 64.5%, and 84.5%, which outperformed the accuracies of 76%, 56%, and 82.5% on single-site learning, as well as the accuracies of 70%, 57.5%, and 75.3% on the pooled classification.

Table 3.

The average accuracy values of multi-site, single-site and pooling classification with 20 repetitions in 10 experiments.

Method Site A Site B Site C Average of 3 sites
l2,1 + l1 (MS) 0.785 0.645 0.845 0.758
l1 (SS) 0.76 0.56 0.825 0.713
l1 (PO) 0.70 0.575 0.753 0.676
l2,1 + l1 + SVM (MS) 0.713 0.648 0.658 0.673
l1 + SVM (SS) 0.728 0.598 0.825 0.690
l1 + SVM (PO) 0.610 0.573 0.668 0.617
PCA + SVM (SS) 0.668 0.474 0.724 0.622
PCA + SVM (PO) 0.646 0.494 0.732 0.624
ttest2 + SVM (SS) 0.562 0.468 0.680 0.570
ttest2 + SVM (PO) 0.672 0.530 0.646 0.616

MS: multi-site data; SS: single-site data; PO: pooled data.

Using the multi-task feature learning framework and the SVM classifier, the average accuracies in multi-site (67.3%), single-site (69.0%) and pooled (61.7%) data classification were lower than the results using the proposed classifier (75.8%, 71.3%, and 67.6%), respectively. However, the average accuracy of multi-site classification was poorer than the single-site classification using multi-task feature learning framework and SVM classifiers. In the experiments using PCA feature learning method and SVM classifiers, the average results of single-site and pooled data classification were 62.2% and 62.4%. In the experiments using ttest2 feature learning method, the classification accuracies were the lowest comparing to all other methods, which were 57.0% and 61.6%, respectively in single-site and pooled data classification.

The best average accuracy was 75.8% with our proposed multi-task feature learning classification method. The results indicated that multi-site learning captured more precise feature patterns in revealing the differences between SCZ patients and controls, than using only one single-site data. Simply pooling all data sites together did not improve classification performance compared with using single-site data.

The site-shared and site-specific features with the largest weights were mapped onto a brain image template. As shown in the left panel of Fig. 2, multi-site learning extracted a series of feature patterns as site-shared features, which revealed gray matter abnormalities in a variety of regions, including the fusiform, middle temporal, superior temporal, inferior parietal, postcentral, angular, inferior frontal, middle occipital and lingual gyri, insular, anterior cingulate cortex, precuneus, the dorsolateral prefrontal cortex, and the cerebellum. The most discriminative site-shared regions are all listed in Table 4. Detailed information including the BA area, side, cluster size and peak MNI coordinates is also shown in Table 4.

Fig. 2.

Fig. 2

Left panel: locations of the site-shared features Right panel: locations of the site-specific features corresponding to each site. These site-shared and site-specific features were shown with cluster size of >50 voxels obtained by multi-task learning on multi-site schizophrenia classification. The colorbar represents the weight values of features. The warm and hot colors corresponded to negative and positive weight values, respectively.

Table 4.

The site-shared gray matter alteration features of brain regions.

Regions Side BA Cluster size (voxels) MNI coordinates
Cerebellum crus I, fusiform gyrus, lingual gyrus L 18,19 146 −32,−86,−17/−29,−74,−3
Middle temporal gyrus L 38,21,22 77 −44,12,−35/−59,−14,−8/−56,−59,−13/−65,−35,−1
Superior temporal gyrus, middle temporal gyrus, insula R 22,41,13 77 52,−41,−10/49,−17,1
Anterior cingulate, superior medial frontal gyrus L 32 51 −12,34,1/−12,34,25
Middle occipital gyrus, angular L/R 39,40 111 45,−55,20/33,−76,32/−51,67,39
Insula R 13 27 33,16,12
Middle frontal gyrus, inferior triangular frontal gyrus L/R 9,10,44,45 162 −51,26,28/−33,52,16/−42,22,13/39,35,24
Postcentral L 2 18 −49,−31,50
Precuneus L/R 31 25 −6,−43,36
Inferior parietal gyrus L 40 15 −40,−49,51
Superior parietal gyrus L 7,5 42 −28,−51,75
Superior frontal gyrus, middle frontal gyrus R 9,6 56 27,47,32/33,41,35/35,8,42
Superior frontal gyrus, middle frontal gyrus L 32,6 63 −16,23,47/−34,11,52
dlPFC L/R 9,6,32 119 −16,23,47/−34,11,52/27,47,32/33,41,35/35,8,42

Note: BA = Brodmann area, R = right, L = left.

The site-specific features were displayed in the right panel of Fig. 2. The site-specific features of site A included regions of the fusiform, lingual, inferior temporal, middle temporal, superior temporal, middle frontal, superior frontal, superior medial frontal, postcentral, superior parietal, precentral gyri, precuneus, putamen, supplemental motor area, and the cerebellum. The site-specific features of site B included regions of the fusiform, anterior cingulate cortex, posterior cingulate cortex, hippocampus, parahippocampus, angular, inferior frontal, middle frontal, postcentral, precentral, inferior parietal, middle occipital gyri, precuneus, and the cerebellum. The site-specific features of site C included regions of the fusiform, lingual, anterior cingulate cortex, angular, middle temporal, inferior temporal, superior temporal, middle frontal, postcentral, supramarginal, superior medial frontal gyri, precuneus, insular, and the cerebellum. As shown, the site-specific patterns in the three sites involved several regions that were located adjacent to the same site-shared regions, including the fusiform, lingual, postcentral, precentral, prefrontal gyri, and the cerebellum. Detailed information of the site-specific features are shown in Table 5, Table 6, Table 7.

Table 5.

The site-specific gray matter alteration features of brain regions in site A.

Regions Side BA Cluster size (voxels) MNI coordinates
Fusiform gyrus, cerebellum VI, lingual gyrus L 37,19,18 135 −20,−59,−15
Inferior temporal gyrus, middle temporal gyrus L 37,20,19,21 50 −56,−56,−10
Superior temporal gyrus, middle temporal gyrus L/R 38,21,41 148 −47,9,−32/55,−53,−4
Putamen R 69 30,8,15
Middle frontal gyrus/superior frontal gyrus R 9,8,10 114 29,38,38
Postcentral gyrus, superior parietal gyrus R 3,2,5,40 236 −43,−37,65/26,−46,49
Superior medial frontal gyrus, supplemental motor area, superior frontal gyrus L 8,6 203 −19,35,58
Superior parietal gyrus, postcentral gyrus, precuneus gyrus L 7,5 92 −28,−54,75
Precentral gyrus L 6 54 −34,−12,77

Note: BA = Brodmann area, R = right, L = left.

Table 6.

The site-specific gray matter alteration features of brain regions in site B.

Regions Side BA Cluster size (voxels) MNI coordinates
Cerebellum IV, V, fusiform gyrus R 19,36 51 22,−35,−20
Hippocampus, parahippocampus, fusiform gyrus R 36 75 34,−35,−11
Anterior cingulate cortex L 32,9,24,10 294 −3,38,23
Inferior parietal gyrus, middle occipital gyrus, angular R 40,39,7 105 −46,−49,47
Posterior cingulum L 65 −9,−43,36
Inferior frontal gyrus L 13,44,47 38 −39,22,13
Middle frontal gyrus L 8,9 53 −34,35,37
Precuneus L 7 120 −13,−67,37
Postcentral gyrus, precentral gyrus L/R 6,4,3,40 180 −52,−31,53

Note: BA = Brodmann area, R = right, L = left, dlPFC = dorsolateral prefrontal cortex.

Table 7.

The site-specific gray matter alteration features of brain regions in site C.

Regions Side BA Cluster size (voxels) MNI coordinates
Cerebellum crus I, crus II, fusiform gyrus, lingual gyrus L 18,19 209 −28,−84,−38
Middle temporal gyrus, inferior temporal gyrus L 21,20,22 274 −59,−12,−26
Superior temporal gyrus, middle temporal gyrus, insula R 22,13,41,21 112 46,−14,−2
Anterior cingulate cortex, superior medial frontal gyrus L 10,32,9 122 0,46,14
Angular R 39,22,13 62 45,−55,23
Middle frontal gyrus L 9,10 81 −28,56,37
Supramarginal gyrus L 40 80 −54,−37,29
Postcentral gyrus R 2,40,3 62 44,−31,55
Precuneus L/R 7 72 2,−66,67

Note: BA = Brodmann area, R = right, L = left.

4. Discussion

We present a method utilizing multi-task l2,1 + l1-norm learning to simultaneously learn the site-shared and site-specific features, for multi-site classification problems. The method was tested to differentiate SCZ patients from healthy controls in three data sites. Compared to single-site and pooled data classification, the multi-site classification performance was significantly higher, which demonstrated that multi-site data can enhance accuracy in identifying brain alterations in SCZ if multi-task learning is applied. The multi-task learning framework thus provides a new way to take advantage of multi-site data to better classify and understand brain diseases.

4.1. Reliable identification of multi-site schizophrenia data using multi-task learning

Single-site data contains information about pathophysiology, but may simultaneously include information related to different scanners or imaging protocols, as well as demographic differences that may influence the pattern investigations across individual subject samples. Using single-site data is limited by smaller sample sizes and smaller diversity in imaging features, both of which often lead to poor generalization performance of classifiers on new patient populations. Data pooled from multiple sites alleviate both of these problems at the expense of introducing confounding heterogeneity in the imaging data, which renders the pooled analysis challenging. In our study, using the proposed multi-task feature learning scheme, the classification accuracy of using pooled data was no better than using each single site and, in the case of site C data, pooled learning performed worse than single-site classification. This result might demonstrate that the site-specific features in each data site may not always benefit for the pooled classification. The advantage of the multi-task learning frame in our study is that it can simultaneously learn site-shared and site-specific features, which is suitable when combining multi-site data for disease classification. The l2,1 + l1-norm penalty terms focuses on the group sparsity, and sparsity to maximize weights of site-shared or site-specific features, while setting all other irrelevant features to be very small. In addition, the two penalty terms are very common and simple in sparsity regularization and can easily be generalized. The multi-task learning framework has attracted increasingly attention in other fields, such as in data mining, bioinformatics and computer vision (Heisele et al., 2001; Ji and Ye, 2009; Xue et al., 2007).

In the current study, the performance was poor in using the PCA and ttest2 feature learning methods in the single-site and pooled classification. PCA is a linear method for dimensionality reduction, which may not consider the nonlinear correlations of different features, while ttest2 is a univariate feature selection method, neglecting the relationship of multiple variables. Thus, the accuracies of using PCA and ttest2 were no better than using the proposed multi-task feature learning framework. Results also showed that accuracies using the proposed classifier were much better than using SVM. In the multi-task feature learning step, the feature weights were calculated based on all features and contained valuable information. Those features exhibited the largest weights were selected and used in the classification step. SVM classifier only used the sequencing information of the feature weights, which had lost some valuable information of these selected features. Therefore, better results using our proposed classifier showed that the feature weights learned by the multi-task feature learning method were accurate.

4.2. Site-shared and site-specific regions of gray matter alterations in schizophrenia

A pattern of gray matter intensity alterations implicating several brain regions in the SCZ sample was revealed by the site-shared feature analysis of multi-site MRI data in our study. This is consistent with a vast number of large-scale structural MRI investigations of SCZ, including both meta-analyses of MRI results (Chan et al., 2011; Ellison-Wright et al., 2008; Honea et al., 2005; Palaniyappan et al., 2012; Steen et al., 2006; Vita et al., 2012; Wright et al., 2000), and mega-group comparisons of mean indices combining data from multiple sites (Gupta et al., 2015; Segall et al., 2009; Torres et al., 2016). These previous studies have repeatedly highlighted the prefrontal cortex, lateral temporal cortex, insular, striatum, anterior cingulate cortex, hippocampus and other temporo-limbic structures as key components of the network of brain regions showing volumetric deficits relative to healthy controls from early stages of SCZ onwards. Most of those brain regions were implicated in our study, indicating a high degree of consistency between the spatial distribution of findings in our site-shared feature analysis and the previous MRI literature on SCZ. The brain regions involved in this network are critical to emotion processing and several cognitive domains known to be affected in schizophrenia, including language processing, working memory and other executive functioning operations, episodic memory, attention, self-monitoring, error detection, and attribution of salience to emotionally relevant stimuli, among others (Backasch et al., 2014; Belin et al., 2000; Cabeza and Nyberg, 2000; Chao et al., 1999; Jeong et al., 2009; Kim et al., 2003; Noppeney and Price, 2002; Ojemann et al., 2002; Palaniyappan and Liddle, 2012a; Palaniyappan and Liddle, 2012b; Potkin et al., 2009; Price, 2010; Tranel et al., 1997; Wright et al., 2003). Thus, the findings presented herein provide evidence that our method robustly identifies consistent and reliable structural brain abnormalities associated with SCZ across different sites.

Additional brain regions often implicated in the pathophysiology of SCZ were also highlighted in our site-shared analysis, such as the cerebellum (Andreasen and Pierson, 2008). Though the cerebellum is traditionally believed to participate in motor function, neuroimaging studies, in both nonhuman primates and humans, have demonstrated that it is also involved in brain cognitive and affective processing (Bernard and Mittal, 2015; Dum and Strick, 2003; Hu et al., 2008; Kelly and Strick, 2003). Several structural and functional neuroimaging studies have shown cerebellar abnormalities in SCZ subjects relative to controls (Bernard and Mittal, 2015; Gupta et al., 2015; Ha et al., 2004; Kim et al., 2014; Mothersill et al., 2016; Rusch et al., 2007; Shen et al., 2010; Shenton et al., 2001; Wagshal et al., 2015). Abnormalities in the cerebellum, through its connections to the prefrontal cortex via thalamus, have been proposed as critical to the state of “cognitive dysmetria” put forward by some authors to explain the behavior manifestations and cognitive deficits of schizophrenia (Andreasen et al., 1999; Rusch et al., 2007; Shen et al., 2010). However cerebellar findings should be interpreted cautiously, since segmentation between GM and WM in that region is difficult and prone to errors, due to poor contrast at this imaging resolution. Finally, other brain sites showing volume abnormalities in our study, including the precuneus and lateral parietal cortical regions, fusiform, lingual and middle occipital gyri, have been variably implicated in previous neuroimaging studies of SCZ (Borgwardt et al., 2010; Brunet-Gouet and Decety, 2006; Chou et al., 2014; Davatzikos et al., 2005; Di Rosa et al., 2009; Dong et al., 2006; Faget-Agius et al., 2012; Gaser et al., 1999; Herrmann et al., 2006; Johnston et al., 2005; Lee et al., 2002; Mashal et al., 2014; Nierenberg et al., 2005; Niznikiewicz et al., 2000; Onitsuka et al., 2005; Onitsuka et al., 2003; Quarantelli et al., 2014; Schultz et al., 2010; Seiferth et al., 2008; Shenton et al., 2001; Singh et al., 2014; Takahashi et al., 2011; Tanskanen et al., 2010; Walther et al., 2009; Whalley et al., 2006; Yoon et al., 2006). These brain regions are involved in a variety of cognitive operations of relevance to schizophrenia, including language, visual attention and emotional processing (Cohen et al., 1996; Dong et al., 2006; Herrmann et al., 2006; Johnston et al., 2005; Kaas, 1995; Mashal et al., 2014; Nierenberg et al., 2005; Niznikiewicz et al., 2000; Seiferth et al., 2008; Walther et al., 2009; Whalley et al., 2006; Yoon et al., 2006).

Findings of structural brain abnormalities in SCZ may be influenced by clinical variables not specifically related to the pathophysiology of the disorder, such as the use of antipsychotic medications. Several recent studies have shown that the continued use of antipsychotics may be significantly associated with some of the neuroanatomical changes detected in MRI studies of SCZ, even after taking into account the influence of other potential moderators such as illness severity and substance abuse (Fusar-Poli et al., 2013; Ho et al., 2011; Torres et al., 2016). In the present study, participants with SCZ from site B were all in their first psychotic episode and had a relatively shorter duration of illness in comparison to the samples contributed by sites A and C (both of which included a substantial proportion of chronic, medicated SCZ patients). Thus the overall SCZ sample from site B was less exposed to antipsychotic treatment in comparison to the two other sites. The selection of first-episode SCZ patients in site B may help to explain the more modest diagnostic accuracy figures obtained in the single-site analysis for that center in comparison to the two other sites, given the fact that medicated SCZ patients in chronic disease stages display more widespread brain volumetric abnormalities relative to healthy controls than first-episode SCZ patients (Meisenzahl et al., 2008; Torres et al., 2016). Since the three imaging sites contributed each an equal number of SCZ patients and controls to the present investigation, disease chronicity and antipsychotic exposure should be taken as clinical variables that increased inter-site variability in our study, instead of representing unifying features present in all sites. Therefore, the results of the site-shared analysis presented herein can be more safely assumed to reflect the commonality of SCZ neuropathology across the three sites, rather than the potential influence of confounding clinical variables ubiquitously present in the three centers. Disentangling the specific neurobiological features underlying psychiatric disorders from clinically relevant confounders is challenging in neuroimaging research, given the highly prevalent influence of factors such as use of nicotine and other drugs of abuse, cardiovascular risk factors, and exposure to pharmacological treatment, all of which can affect brain structure (Weinberger and Radulescu, 2016).

The site-specific regions of the three sites revealed in our study consisted of two parts. The larger part of the site-specific features was located in the same regions of the site-shared features, though the coordinates were slightly different. These regions involved most of the site-shared features, including the fusiform, lingual, postcentral, precuneus, prefrontal gyri and the cerebellum. The findings further implicated that these site-shared and also the site-specific regions revealed the abnormal feature patterns of the SCZ accurately, and multi-task feature learning framework provided a promising method in detecting these features. However, though these site-specific features were located in the same regions with the site-shared features, the feature distributed in adjacent areas. This was reasonable due to the data heterogeneity resulted from each site. Another part of the site-specific regions included some site-specific regions found exclusively in each site, such as putamen and supplemental motor area in site A, hippocampus, parahippocampus, posterior cingulate cortex in site B, and supramarginal gyrus in site C.

Studies have shown that supplemental motor area dysfunction was related to motor disturbances in SCZ patients (Schroder et al., 1995). The putamen was proposed to be related to the treatment response (Buchsbaum et al., 2003; Li et al., 2012; Mitelman et al., 2009), and also reflected the deficits of auditory verbal hallucinations of SCZ patients (Cui et al., 2016). The gyrus of hippocampus and parahippocampus were reduced (Razi et al., 1999), and hippocampus was demonstrated to be associated with the positive symptoms of SCZ patients (Duan et al., 2015). Meanwhile, the posterior cingulate cortex has recently been implicated in the pathophysiology of schizophrenia (Newell et al., 2006). These site-specific regions may provide potential evidences on interpreting the underlying multisite data heterogeneity introduced by different scanners, subjects, treatment exposure, or disease symptoms or etc.

4.3. Limitations and future directions

Although the classification result of multi-site MRI data using multi-task learning was encouraging, there are limitations in our work. First, due to the limited sample sizes, small site numbers, and the specific psychiatric disease type and data modality in our study, the effectiveness of using multi-task learning scheme on multi-site classification needs to be further validated in larger datasets. Moreover, an important challenge in multi-task learning is that when new data is analyzed, the multi-task learner would need to be recalculated. This would obviously limit clinical utility. One solution is to apply our approach to obtain a set of t discriminants, each associated with one of our training sites. It is possible that as new data and sites are added to our analyses, one would not need to solve the multi-task learning problem again, but rather could apply all these t vectors and pool the classification results via weighted consensus approaches.

5. Conclusion

This work proposes a multi-task learning framework for high-dimensional classification, which simultaneously learns the site-specific and site-shared features in multi-site MRI data of brain morphology. This method was tested on multi-site anatomical images of SCZ patients. The classification results of using multi-task learning outperformed single-task learning and the pooled data. The most significant site-shared features revealed brain structural alterations in SCZ consistent with those previously reported in the literature. In summary, multi-task learning provides a promising technique to learn the site-shared features in multi-site data study without neglecting site-specific features, which gives a new angle on integrating multiple single-site data together for big data studies.

Acknowledgement

This work was supported by NIH Grants (R01 MH112070); the National Science Foundation of China (61420106001, 91420302, 61773391, 61722313); and Fok Ying Tung Education Foundation (161057).

Footnotes

There are no conflicts of interest.

References

  1. Andreasen N.C., Pierson R. The role of the cerebellum in schizophrenia. Biol. Psychiatry. 2008;64:81–88. doi: 10.1016/j.biopsych.2008.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andreasen N.C., Nopoulos P., O'Leary D.S., Miller D.D., Wassink T., Flaum M. Defining the phenotype of schizophrenia: cognitive dysmetria and its neural mechanisms. Biol. Psychiatry. 1999;46:908–920. doi: 10.1016/s0006-3223(99)00152-3. [DOI] [PubMed] [Google Scholar]
  3. Azadi S., Sra S. Proceedings of the 31st International Conference on Machine Learning (ICML) 2014. Towards an optimal stochastic alternating direction method of multipliers. [Google Scholar]
  4. Backasch B., Sommer J., Klohn-Saghatolislam F., Muller M.J., Kircher T.T., Leube D.T. Dysconnectivity of the inferior frontal gyrus: implications for an impaired self-other distinction in patients with schizophrenia. Psychiatry Res. 2014;223:202–209. doi: 10.1016/j.pscychresns.2014.05.007. [DOI] [PubMed] [Google Scholar]
  5. Belin P., Zatorre R.J., Lafaille P., Ahad P., Pike B. Voice-selective areas in human auditory cortex. Nature. 2000;403:309–312. doi: 10.1038/35002078. [DOI] [PubMed] [Google Scholar]
  6. Bernard J.A., Mittal V.A. Dysfunctional activation of the cerebellum in schizophrenia: a functional neuroimaging meta-analysis. Clin. Psychol. Sci. 2015;3:545–566. doi: 10.1177/2167702614542463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Borgwardt S.J., Picchioni M.M., Ettinger U., Toulopoulou T., Murray R., McGuire P.K. Regional gray matter volume in monozygotic twins concordant and discordant for schizophrenia. Biol. Psychiatry. 2010;67:956–964. doi: 10.1016/j.biopsych.2009.10.026. [DOI] [PubMed] [Google Scholar]
  8. Brown G.G., Mathalon D.H., Stern H., Ford J., Mueller B., Greve D.N., McCarthy G., Voyvodic J., Glover G., Diaz M., Yetter E., Ozyurt I.B., Jorgensen K.W., Wible C.G., Turner J.A., Thompson W.K., Potkin S.G., Function Biomedical Informatics Research Network Multisite reliability of cognitive BOLD data. NeuroImage. 2011;54:2163–2175. doi: 10.1016/j.neuroimage.2010.09.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Brunet-Gouet E., Decety J. Social brain dysfunctions in schizophrenia: a review of neuroimaging studies. Psychiatry Res. 2006;148:75–92. doi: 10.1016/j.pscychresns.2006.05.001. [DOI] [PubMed] [Google Scholar]
  10. Buchsbaum M.S., Shihabuddin L., Brickman A.M., Miozzo R., Prikryl R., Shaw R., Davis K. Caudate and putamen volumes in good and poor outcome patients with schizophrenia. Schizophr. Res. 2003;64:53–62. doi: 10.1016/s0920-9964(02)00526-1. [DOI] [PubMed] [Google Scholar]
  11. Cabeza R., Nyberg L. Imaging cognition II: an empirical review of 275 PET and fMRI studies. J. Cogn. Neurosci. 2000;12:1–47. doi: 10.1162/08989290051137585. [DOI] [PubMed] [Google Scholar]
  12. Calhoun V.D., Adali T. Feature-based fusion of medical imaging data. IEEE Trans. Inf. Technol. Biomed. 2009;13:711–720. doi: 10.1109/TITB.2008.923773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cannon T.D., Sun F., McEwen S.J., Papademetris X., He G., van Erp T.G., Jacobson A., Bearden C.E., Walker E., Hu X., Zhou L., Seidman L.J., Thermenos H.W., Cornblatt B., Olvet D.M., Perkins D., Belger A., Cadenhead K., Tsuang M., Mirzakhanian H., Addington J., Frayne R., Woods S.W., McGlashan T.H., Constable R.T., Qiu M., Mathalon D.H., Thompson P., Toga A.W. Reliability of neuroanatomical measurements in a multisite longitudinal study of youth at risk for psychosis. Hum. Brain Mapp. 2014;35:2424–2434. doi: 10.1002/hbm.22338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Caruana R. Multitask learning. Mach. Learn. 1997;28:41–75. [Google Scholar]
  15. Casey B.J., Cohen J.D., O'Craven K., Davidson R.J., Irwin W., Nelson C.A., Noll D.C., Hu X., Lowe M.J., Rosen B.R., Truwitt C.L., Turski P.A. Reproducibility of fMRI results across four institutions using a spatial working memory task. NeuroImage. 1998;8:249–261. doi: 10.1006/nimg.1998.0360. [DOI] [PubMed] [Google Scholar]
  16. Chan R.C., Di X., McAlonan G.M., Gong Q.Y. Brain anatomical abnormalities in high-risk individuals, first-episode, and chronic schizophrenia: an activation likelihood estimation meta-analysis of illness progression. Schizophr. Bull. 2011;37:177–188. doi: 10.1093/schbul/sbp073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chao L.L., Haxby J.V., Martin A. Attribute-based neural substrates in temporal cortex for perceiving and knowing about objects. Nat. Neurosci. 1999;2:913–919. doi: 10.1038/13217. [DOI] [PubMed] [Google Scholar]
  18. Chen J., Liu J., Ye J. Learning incoherent sparse and low-rank patterns from multiple tasks. ACM Trans. Knowl. Discov. Data. 2012;5:22. doi: 10.1145/2086737.2086742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chou P.H., Koike S., Nishimura Y., Kawasaki S., Satomura Y., Kinoshita A., Takizawa R., Kasai K. Distinct effects of duration of untreated psychosis on brain cortical activities in different treatment phases of schizophrenia: a multi-channel near-infrared spectroscopy study. Prog. Neuro-Psychopharmacol. Biol. Psychiatry. 2014;49:63–69. doi: 10.1016/j.pnpbp.2013.11.009. [DOI] [PubMed] [Google Scholar]
  20. Cohen M., Kosslyn S., Breiter H., DiGirolamo G., Thompson W., Anderson A., Brookheimer S., Rosen B., Belliveau J. Changes in cortical activity during mental rotation. Brain. 1996;119:89–100. doi: 10.1093/brain/119.1.89. [DOI] [PubMed] [Google Scholar]
  21. Colby J.B., Rudie J.D., Brown J.A., Douglas P.K., Cohen M.S., Shehzad Z. Insights into multimodal imaging classification of ADHD. Front. Syst. Neurosci. 2012;6:59. doi: 10.3389/fnsys.2012.00059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Costafreda S.G., Brammer M.J., Vencio R.Z., Mourao M.L., Portela L.A., de Castro C.C., Giampietro V.P., Amaro E., Jr. Multisite fMRI reproducibility of a motor task using identical MR systems. J. Magn. Reson. Imaging. 2007;26:1122–1126. doi: 10.1002/jmri.21118. [DOI] [PubMed] [Google Scholar]
  23. Cui L.B., Liu K., Li C., Wang L.X., Guo F., Tian P., Wu Y.J., Guo L., Liu W.M., Xi Y.B., Wang H.N., Yin H. Putamen-related regional and network functional deficits in first-episode schizophrenia with auditory verbal hallucinations. Schizophr. Res. 2016;173:13–22. doi: 10.1016/j.schres.2016.02.039. [DOI] [PubMed] [Google Scholar]
  24. Davatzikos C. Mapping image data to stereotaxic spaces: applications to brain mapping. Hum. Brain Mapp. 1998;6:334–338. doi: 10.1002/(SICI)1097-0193(1998)6:5/6<334::AID-HBM2>3.0.CO;2-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Davatzikos C., Genc A., Xu D., Resnick S.M. Voxel-based morphometry using the RAVENS maps: methods and validation using simulated longitudinal atrophy. NeuroImage. 2001;14:1361–1369. doi: 10.1006/nimg.2001.0937. [DOI] [PubMed] [Google Scholar]
  26. Davatzikos C., Shen D., Gur R.C., Wu X., Liu D., Fan Y., Hughett P., Turetsky B.I., Gur R.E. Whole-brain morphometric study of schizophrenia revealing a spatially complex set of focal abnormalities. Arch. Gen. Psychiatry. 2005;62:1218–1227. doi: 10.1001/archpsyc.62.11.1218. [DOI] [PubMed] [Google Scholar]
  27. Di Rosa E., Crow T.J., Walker M.A., Black G., Chance S.A. Reduced neuron density, enlarged minicolumn spacing and altered ageing effects in fusiform cortex in schizophrenia. Psychiatry Res. 2009;166:102–115. doi: 10.1016/j.psychres.2008.04.007. [DOI] [PubMed] [Google Scholar]
  28. Dong W., Liu L., Zou L. Face perception in schizophrenia: a functional magnetic resonance imaging study. Chin. Ment. Health J. 2006;20:775. [Google Scholar]
  29. Duan H.F., Gan J.L., Yang J.M., Cheng Z.X., Gao C.Y., Shi Z.J., Zhu X.Q., Liang X.J., Zhao L.M. A longitudinal study on intrinsic connectivity of hippocampus associated with positive symptom in first-episode schizophrenia. Behav. Brain Res. 2015;283:78–86. doi: 10.1016/j.bbr.2015.01.022. [DOI] [PubMed] [Google Scholar]
  30. Dum R.P., Strick P.L. An unfolded map of the cerebellar dentate nucleus and its projections to the cerebral cortex. J. Neurophysiol. 2003;89:634–639. doi: 10.1152/jn.00626.2002. [DOI] [PubMed] [Google Scholar]
  31. Ellison-Wright I., Glahn D.C., Laird A.R., Thelen S.M., Bullmore E. The anatomy of first-episode and chronic schizophrenia: an anatomical likelihood estimation meta-analysis. Am. J. Psychiatry. 2008;165:1015–1023. doi: 10.1176/appi.ajp.2008.07101562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Faget-Agius C., Boyer L., Padovani R., Richieri R., Mundler O., Lancon C., Guedj E. Schizophrenia with preserved insight is associated with increased perfusion of the precuneus. J. Psychiatry Neurosci. 2012;37:297–304. doi: 10.1503/jpn.110125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Friedman L., Glover G.H., Krenz D., Magnotta V., First B. Reducing inter-scanner variability of activation in a multicenter fMRI study: role of smoothness equalization. NeuroImage. 2006;32:1656–1668. doi: 10.1016/j.neuroimage.2006.03.062. [DOI] [PubMed] [Google Scholar]
  34. Fusar-Poli P., Smieskova R., Kempton M.J., Ho B.C., Andreasen N.C., Borgwardt S. Progressive brain changes in schizophrenia related to antipsychotic treatment? A meta-analysis of longitudinal MRI studies. Neurosci. Biobehav. Rev. 2013;37:1680–1691. doi: 10.1016/j.neubiorev.2013.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gaser C., Volz H.-P., Kiebel S., Riehemann S., Sauer H. Detecting structural changes in whole brain based on nonlinear deformations—application to schizophrenia research. NeuroImage. 1999;10:107–113. doi: 10.1006/nimg.1999.0458. [DOI] [PubMed] [Google Scholar]
  36. Gee D.G., McEwen S.C., Forsyth J.K., Haut K.M., Bearden C.E., Addington J., Goodyear B., Cadenhead K.S., Mirzakhanian H., Cornblatt B.A., Olvet D., Mathalon D.H., McGlashan T.H., Perkins D.O., Belger A., Seidman L.J., Thermenos H., Tsuang M.T., van Erp T.G., Walker E.F., Hamann S., Woods S.W., Constable T., Cannon T.D. Reliability of an fMRI paradigm for emotional processing in a multisite longitudinal study. Hum. Brain Mapp. 2015;36:2558–2579. doi: 10.1002/hbm.22791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Glover G.H., Mueller B.A., Turner J.A., van Erp T.G., Liu T.T., Greve D.N., Voyvodic J.T., Rasmussen J., Brown G.G., Keator D.B., Calhoun V.D., Lee H.J., Ford J.M., Mathalon D.H., Diaz M., O'Leary D.S., Gadde S., Preda A., Lim K.O., Wible C.G., Stern H.S., Belger A., McCarthy G., Ozyurt B., Potkin S.G. Function biomedical informatics research network recommendations for prospective multicenter functional MRI studies. J. Magn. Reson. Imaging. 2012;36:39–54. doi: 10.1002/jmri.23572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Goldszal A.F., Davatzikos C., Pham D.L., Yan M.X., Bryan R.N., Resnick S.M. An image-processing system for qualitative and quantitative volumetric analysis of brain images. J. Comput. Assist. Tomogr. 1998;22:827–837. doi: 10.1097/00004728-199809000-00030. [DOI] [PubMed] [Google Scholar]
  39. Gupta C.N., Calhoun V.D., Rachakonda S., Chen J., Patel V., Liu J., Segall J., Franke B., Zwiers M.P., Arias-Vasquez A., Buitelaar J., Fisher S.E., Fernandez G., van Erp T.G., Potkin S., Ford J., Mathalon D., McEwen S., Lee H.J., Mueller B.A., Greve D.N., Andreassen O., Agartz I., Gollub R.L., Sponheim S.R., Ehrlich S., Wang L., Pearlson G., Glahn D.C., Sprooten E., Mayer A.R., Stephen J., Jung R.E., Canive J., Bustillo J., Turner J.A. Patterns of gray matter abnormalities in schizophrenia based on an international mega-analysis. Schizophr. Bull. 2015;41:1133–1142. doi: 10.1093/schbul/sbu177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ha T.H., Youn T., Ha K.S., Rho K.S., Lee J.M., Kim I.Y., Kim S.I., Kwon J.S. Gray matter abnormalities in paranoid schizophrenia and their clinical correlations. Psychiatry Res. 2004;132:251–260. doi: 10.1016/j.pscychresns.2004.05.001. [DOI] [PubMed] [Google Scholar]
  41. Heisele B., Serre T., Pontil M., Vetter T., Poggio T. Neural Information Processing Systems. 2001. Categorization by learning and combining object parts; pp. 1239–1245. [Google Scholar]
  42. Herrmann M.J., Reif A., Jabs B.E., Jacob C., Fallgatter A.J. Facial affect decoding in schizophrenic disorders: a study using event-related potentials. Psychiatry Res. 2006;141:247–252. doi: 10.1016/j.psychres.2005.09.015. [DOI] [PubMed] [Google Scholar]
  43. Ho B.C., Andreasen N.C., Ziebell S., Pierson R., Magnotta V. Long-term antipsychotic treatment and brain volumes: a longitudinal study of first-episode schizophrenia. Arch. Gen. Psychiatry. 2011;68:128–137. doi: 10.1001/archgenpsychiatry.2010.199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Honea R., Crow T.J., Passingham D., Mackay C.E. Regional deficits in brain volume in schizophrenia: a meta-analysis of voxel-based morphometry studies. Am. J. Psychiatry. 2005;162:2233–2245. doi: 10.1176/appi.ajp.162.12.2233. [DOI] [PubMed] [Google Scholar]
  45. Jeong B., Wible C.G., Hashimoto R., Kubicki M. Functional and anatomical connectivity abnormalities in left inferior frontal gyrus in schizophrenia. Hum. Brain Mapp. 2009;30:4138–4151. doi: 10.1002/hbm.20835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Ji S., Ye J. Proceedings of the 26th Annual International Conference on Machine Learning. ACM; 2009. An accelerated gradient method for trace norm minimization; pp. 457–464. [Google Scholar]
  47. Johnston P.J., Stojanov W., Devir H., Schall U. Functional MRI of facial emotion recognition deficits in schizophrenia and their electrophysiological correlates. Eur. J. Neurosci. 2005;22:1221–1232. doi: 10.1111/j.1460-9568.2005.04294.x. [DOI] [PubMed] [Google Scholar]
  48. Jovicich J., Marizzoni M., Bosch B., Bartres-Faz D., Arnold J., Benninghoff J., Wiltfang J., Roccatagliata L., Picco A., Nobili F., Blin O., Bombois S., Lopes R., Bordet R., Chanoine V., Ranjeva J.P., Didic M., Gros-Dagnac H., Payoux P., Zoccatelli G., Alessandrini F., Beltramello A., Bargallo N., Ferretti A., Caulo M., Aiello M., Ragucci M., Soricelli A., Salvadori N., Tarducci R., Floridi P., Tsolaki M., Constantinidis M., Drevelegas A., Rossini P.M., Marra C., Otto J., Reiss-Zimmermann M., Hoffmann K.T., Galluzzi S., Frisoni G.B., PharmaCog C. Multisite longitudinal reliability of tract-based spatial statistics in diffusion tensor imaging of healthy elderly subjects. NeuroImage. 2014;101:390–403. doi: 10.1016/j.neuroimage.2014.06.075. [DOI] [PubMed] [Google Scholar]
  49. Jovicich J., Minati L., Marizzoni M., Marchitelli R., Sala-Llonch R., Bartres-Faz D., Arnold J., Benninghoff J., Fiedler U., Roccatagliata L., Picco A., Nobili F., Blin O., Bombois S., Lopes R., Bordet R., Sein J., Ranjeva J.P., Didic M., Gros-Dagnac H., Payoux P., Zoccatelli G., Alessandrini F., Beltramello A., Bargallo N., Ferretti A., Caulo M., Aiello M., Cavaliere C., Soricelli A., Parnetti L., Tarducci R., Floridi P., Tsolaki M., Constantinidis M., Drevelegas A., Rossini P.M., Marra C., Schonknecht P., Hensch T., Hoffmann K.T., Kuijer J.P., Visser P.J., Barkhof F., Frisoni G.B., PharmaCog C. Longitudinal reproducibility of default-mode network connectivity in healthy elderly participants: a multicentric resting-state fMRI study. NeuroImage. 2015;124:442–454. doi: 10.1016/j.neuroimage.2015.07.010. [DOI] [PubMed] [Google Scholar]
  50. Kaas J.H. Theories of visual cortex organization in primates: areas of the third level. Prog. Brain Res. 1995;112:213–221. doi: 10.1016/s0079-6123(08)63331-2. [DOI] [PubMed] [Google Scholar]
  51. Kelly R.M., Strick P.L. Cerebellar loops with motor cortex and prefrontal cortex of a nonhuman primate. J. Neurosci. 2003;23:8432–8444. doi: 10.1523/JNEUROSCI.23-23-08432.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kim J.J., Youn T., Lee J.M., Kim I.Y., Kim S.I., Kwon J.S. Morphometric abnormality of the insula in schizophrenia: a comparison with obsessive-compulsive disorder and normal control using MRI. Schizophr. Res. 2003;60:191–198. doi: 10.1016/s0920-9964(02)00306-7. [DOI] [PubMed] [Google Scholar]
  53. Kim D.I., Manoach D.S., Mathalon D.H., Turner J.A., Mannell M., Brown G.G., Ford J.M., Gollub R.L., White T., Wible C., Belger A., Bockholt H.J., Clark V.P., Lauriello J., O'Leary D., Mueller B.A., Lim K.O., Andreasen N., Potkin S.G., Calhoun V.D. Dysregulation of working memory and default-mode networks in schizophrenia using independent component analysis, an fBIRN and MCIC study. Hum. Brain Mapp. 2009;30:3795–3811. doi: 10.1002/hbm.20807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kim D.J., Kent J.S., Bolbecker A.R., Sporns O., Cheng H., Newman S.D., Puce A., O'Donnell B.F., Hetrick W.P. Disrupted modular architecture of cerebellum in schizophrenia: a graph theoretic analysis. Schizophr. Bull. 2014;40:1216–1226. doi: 10.1093/schbul/sbu059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Koutsouleris N., Davatzikos C., Borgwardt S., Gaser C., Bottlender R., Frodl T., Falkai P., Riecher-Rossler A., Moller H.J., Reiser M., Pantelis C., Meisenzahl E. Accelerated brain aging in schizophrenia and beyond: a neuroanatomical marker of psychiatric disorders. Schizophr. Bull. 2014;40:1140–1153. doi: 10.1093/schbul/sbt142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Kumar A., Daume H., III Learning task grouping and overlap in multi-task learning. arXiv Preprint arXiv. 2012;1206:6417. [Google Scholar]
  57. Lee C.U., Shenton M.E., Salisbury D.F., Kasai K., Onitsuka T., Dickey C.C., Yurgelun-Todd D., Kikinis R., Jolesz F.A., McCarley R.W. Fusiform gyrus volume reduction in first-episode schizophrenia: a magnetic resonance imaging study. Arch. Gen. Psychiatry. 2002;59:775–781. doi: 10.1001/archpsyc.59.9.775. [DOI] [PubMed] [Google Scholar]
  58. Li M., Chen Z., Deng W., He Z., Wang Q., Jiang L., Ma X., Wang Y., Chua S.E., Cheung C., McAlonan G.M., Sham P.C., Collier D.A., Gong Q., Li T. Volume increases in putamen associated with positive symptom reduction in previously drug-naive schizophrenia after 6 weeks antipsychotic treatment. Psychol. Med. 2012;42:1475–1483. doi: 10.1017/S0033291711002157. [DOI] [PubMed] [Google Scholar]
  59. Marquand A.F., Brammer M., Williams S.C., Doyle O.M. Bayesian multi-task learning for decoding multi-subject neuroimaging data. NeuroImage. 2014;92:298–311. doi: 10.1016/j.neuroimage.2014.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Mashal N., Vishne T., Laor N. The role of the precuneus in metaphor comprehension: evidence from an fMRI study in people with schizophrenia and healthy participants. Front. Hum. Neurosci. 2014;8:818. doi: 10.3389/fnhum.2014.00818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Meda S.A., Bhattarai M., Morris N.A., Astur R.S., Calhoun V.D., Mathalon D.H., Kiehl K.A., Pearlson G.D. An fMRI study of working memory in first-degree unaffected relatives of schizophrenia patients. Schizophr. Res. 2008;104:85–95. doi: 10.1016/j.schres.2008.06.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Meisenzahl E.M., Koutsouleris N., Bottlender R., Scheuerecker J., Jager M., Teipel S.J., Holzinger S., Frodl T., Preuss U., Schmitt G., Burgermeister B., Reiser M., Born C., Moller H.J. Structural brain alterations at different stages of schizophrenia: a voxel-based morphometric study. Schizophr. Res. 2008;104:44–60. doi: 10.1016/j.schres.2008.06.023. [DOI] [PubMed] [Google Scholar]
  63. Mitelman S.A., Canfield E.L., Chu K.W., Brickman A.M., Shihabuddin L., Hazlett E.A., Buchsbaum M.S. Poor outcome in chronic schizophrenia is associated with progressive loss of volume of the putamen. Schizophr. Res. 2009;113:241–245. doi: 10.1016/j.schres.2009.06.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Mothersill O., Knee-Zaska C., Donohoe G. Emotion and theory of mind in schizophrenia-investigating the role of the cerebellum. Cerebellum. 2016;15:357–368. doi: 10.1007/s12311-015-0696-2. [DOI] [PubMed] [Google Scholar]
  65. Newell K.A., Deng C., Huang X.F. Increased cannabinoid receptor density in the posterior cingulate cortex in schizophrenia. Exp. Brain Res. 2006;172:556–560. doi: 10.1007/s00221-006-0503-x. [DOI] [PubMed] [Google Scholar]
  66. Nielsen J.A., Zielinski B.A., Fletcher P.T., Alexander A.L., Lange N., Bigler E.D., Lainhart J.E., Anderson J.S. Multisite functional connectivity MRI classification of autism: ABIDE results. Front. Hum. Neurosci. 2013;7:599. doi: 10.3389/fnhum.2013.00599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Nierenberg J., Salisbury D.F., Levitt J.J., David E.A., McCarley R.W., Shenton M.E. Reduced left angular gyrus volume in first-episode schizophrenia. Am. J. Psychiatry. 2005;162:1539–1541. doi: 10.1176/appi.ajp.162.8.1539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Niznikiewicz M., Donnino R., McCarley R.W., Nestor P.G., Iosifescu D.V., O'Donnell B., Levitt J., Shenton M.E. Abnormal angular gyrus asymmetry in schizophrenia. Am. J. Psychiatry. 2000;157:428–437. doi: 10.1176/appi.ajp.157.3.428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Noppeney U., Price C. Retrieval of visual, auditory, and abstract semantics. NeuroImage. 2002;15:917–926. doi: 10.1006/nimg.2001.1016. [DOI] [PubMed] [Google Scholar]
  70. Obozinski G., Taskar B., Jordan M.I. Joint covariate selection and joint subspace selection for multiple classification problems. Stat. Comput. 2010;20:231–252. [Google Scholar]
  71. Ojemann G., Schoenfield-McNeill J., Corina D. Anatomic subdivisions in human temporal cortical neuronal activity related to recent verbal memory. Nat. Neurosci. 2002;5:64–71. doi: 10.1038/nn785. [DOI] [PubMed] [Google Scholar]
  72. Onitsuka T., Shenton M.E., Kasai K., Nestor P.G., Toner S.K., Kikinis R., Jolesz F.A., McCarley R.W. Fusiform gyrus volume reduction and facial recognition in chronic schizophrenia. Arch. Gen. Psychiatry. 2003;60:349–355. doi: 10.1001/archpsyc.60.4.349. [DOI] [PubMed] [Google Scholar]
  73. Onitsuka T., Nestor P.G., Gurrera R.J., Shenton M.E., Kasai K., Frumin M., Niznikiewicz M.A., McCarley R.W. Association between reduced extraversion and right posterior fusiform gyrus gray matter reduction in chronic schizophrenia. Am. J. Psychiatry. 2005;162:599–601. doi: 10.1176/appi.ajp.162.3.599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Palaniyappan L., Liddle P.F. Aberrant cortical gyrification in schizophrenia: a surface-based morphometry study. J. Psychiatry Neurosci. 2012;37:399–406. doi: 10.1503/jpn.110119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Palaniyappan L., Liddle P.F. Does the salience network play a cardinal role in psychosis? An emerging hypothesis of insular dysfunction. J. Psychiatry Neurosci. 2012;37:17–27. doi: 10.1503/jpn.100176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Palaniyappan L., Balain V., Radua J., Liddle P.F. Structural correlates of auditory hallucinations in schizophrenia: a meta-analysis. Schizophr. Res. 2012;137:169–173. doi: 10.1016/j.schres.2012.01.038. [DOI] [PubMed] [Google Scholar]
  77. Pan S.J., Yang Q.A. A survey on transfer learning. IEEE Trans. Knowl. Data. 2010;22:1345–1359. [Google Scholar]
  78. Pearlson G. Multisite collaborations and large databases in psychiatric neuroimaging: advantages, problems, and challenges. Schizophr. Bull. 2009;35:1–2. doi: 10.1093/schbul/sbn166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Pfefferbaum A., Adalsteinsson E., Sullivan E.V. Replicability of diffusion tensor imaging measurements of fractional anisotropy and trace in brain. J. Magn. Reson. Imaging. 2003;18:427–433. doi: 10.1002/jmri.10377. [DOI] [PubMed] [Google Scholar]
  80. Potkin S.G., Turner J.A., Brown G.G., McCarthy G., Greve D.N., Glover G.H., Manoach D.S., Belger A., Diaz M., Wible C.G., Ford J.M., Mathalon D.H., Gollub R., Lauriello J., O'Leary D., van Erp T.G., Toga A.W., Preda A., Lim K.O., FBIRN Working memory and DLPFC inefficiency in schizophrenia: the FBIRN study. Schizophr. Bull. 2009;35:19–31. doi: 10.1093/schbul/sbn162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Price C.J. The anatomy of language: a review of 100 fMRI studies published in 2009. Ann. N. Y. Acad. Sci. 2010;1191:62–88. doi: 10.1111/j.1749-6632.2010.05444.x. [DOI] [PubMed] [Google Scholar]
  82. Quarantelli M., Palladino O., Prinster A., Schiavone V., Carotenuto B., Brunetti A., Marsili A., Casiello M., Muscettola G., Salvatore M., de Bartolomeis A. Patients with poor response to antipsychotics have a more severe pattern of frontal atrophy: a voxel-based morphometry study of treatment resistance in schizophrenia. Biomed. Res. Int. 2014;2014 doi: 10.1155/2014/325052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Rao N.S., Cox C.R., Nowak R.D., Rogers T.T. Conference on Neural Information Processing Systems. Vol. 2013. 2013. Sparse overlapping sets Lasso for multitask learning and its application to fMRI analysis. [Google Scholar]
  84. Razi K., Greene K.P., Sakuma M., Ge S., Kushner M., DeLisi L.E. Reduction of the parahippocampal gyrus and the hippocampus in patients with chronic schizophrenia. Br. J. Psychiatry. 1999;174:512–519. doi: 10.1192/bjp.174.6.512. [DOI] [PubMed] [Google Scholar]
  85. Rusch N., Spoletini I., Wilke M., Bria P., Di Paola M., Di Iulio F., Martinotti G., Caltagirone C., Spalletta G. Prefrontal-thalamic-cerebellar gray matter networks and executive functioning in schizophrenia. Schizophr. Res. 2007;93:79–89. doi: 10.1016/j.schres.2007.01.029. [DOI] [PubMed] [Google Scholar]
  86. Schaufelberger M.S., Duran F.L., Lappin J.M., Scazufca M., Amaro E., Jr., Leite C.C., de Castro C.C., Murray R.M., McGuire P.K., Menezes P.R., Busatto G.F. Grey matter abnormalities in Brazilians with first-episode psychosis. Br. J. Psychiatry Suppl. 2007;51:s117–22. doi: 10.1192/bjp.191.51.s117. [DOI] [PubMed] [Google Scholar]
  87. Schnack H.G., van Haren N.E., Brouwer R.M., van Baal G.C., Picchioni M., Weisbrod M., Sauer H., Cannon T.D., Huttunen M., Lepage C., Collins D.L., Evans A., Murray R.M., Kahn R.S., Hulshoff Pol H.E. Mapping reliability in multicenter MRI: voxel-based morphometry and cortical thickness. Hum. Brain Mapp. 2010;31:1967–1982. doi: 10.1002/hbm.20991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Schroder J., Wenz F., Schad L.R., Baudendistel K., Knopp M.V. Sensorimotor cortex and supplementary motor area changes in schizophrenia. A study with functional magnetic resonance imaging. Br. J. Psychiatry. 1995;167:197–201. doi: 10.1192/bjp.167.2.197. [DOI] [PubMed] [Google Scholar]
  89. Schultz C.C., Koch K., Wagner G., Roebel M., Nenadic I., Gaser C., Schachtzabel C., Reichenbach J.R., Sauer H., Schlosser R.G. Increased parahippocampal and lingual gyrification in first-episode schizophrenia. Schizophr. Res. 2010;123:137–144. doi: 10.1016/j.schres.2010.08.033. [DOI] [PubMed] [Google Scholar]
  90. Segall J.M., Turner J.A., van Erp T.G., White T., Bockholt H.J., Gollub R.L., Ho B.C., Magnotta V., Jung R.E., McCarley R.W., Schulz S.C., Lauriello J., Clark V.P., Voyvodic J.T., Diaz M.T., Calhoun V.D. Voxel-based morphometric multisite collaborative study on schizophrenia. Schizophr. Bull. 2009;35:82–95. doi: 10.1093/schbul/sbn150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Seiferth N.Y., Pauly K., Habel U., Kellermann T., Shah N.J., Ruhrmann S., Klosterkötter J., Schneider F., Kircher T. Increased neural response related to neutral faces in individuals at risk for psychosis. NeuroImage. 2008;40:289–297. doi: 10.1016/j.neuroimage.2007.11.020. [DOI] [PubMed] [Google Scholar]
  92. Shen D., Davatzikos C. Very high-resolution morphometry using mass-preserving deformations and HAMMER elastic registration. NeuroImage. 2003;18:28–41. doi: 10.1006/nimg.2002.1301. [DOI] [PubMed] [Google Scholar]
  93. Hu D., Shen H., Zhou Z. Functional asymmetry in the cerebellum: a brief review. Cerebellum. 2008;7:304–313. doi: 10.1007/s12311-008-0031-2. [DOI] [PubMed] [Google Scholar]
  94. Shen H., Wang L., Liu Y., Hu D. Discriminative analysis of resting-state functional connectivity patterns of schizophrenia using low dimensional embedding of fMRI. NeuroImage. 2010;49:3110–3121. doi: 10.1016/j.neuroimage.2009.11.011. [DOI] [PubMed] [Google Scholar]
  95. Shenton M.E., Dickey C.C., Frumin M., McCarley R.W. A review of MRI findings in schizophrenia. Schizophr. Res. 2001;49:1–52. doi: 10.1016/s0920-9964(01)00163-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Singh S., Goyal S., Modi S., Kumar P., Singh N., Bhatia T., Deshpande S.N., Khushu S. Motor function deficits in schizophrenia: an fMRI and VBM study. Neuroradiology. 2014;56:413–422. doi: 10.1007/s00234-014-1325-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Ou Y., Sotiras A., Paragios N., Davatzikos C. DRAMMS: deformable registration via attribute matching and mutual-saliency weighting. Med. Image Anal. 2011;15:622–639. doi: 10.1016/j.media.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Steen R.G., Mull C., McClure R., Hamer R.M., Lieberman J.A. Brain volume in first-episode schizophrenia: systematic review and meta-analysis of magnetic resonance imaging studies. Br. J. Psychiatry. 2006;188:510–518. doi: 10.1192/bjp.188.6.510. [DOI] [PubMed] [Google Scholar]
  99. Sutton B.P., Goh J., Hebrank A., Welsh R.C., Chee M.W., Park D.C. Investigation and validation of intersite fMRI studies using the same imaging hardware. J. Magn. Reson. Imaging. 2008;28:21–28. doi: 10.1002/jmri.21419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Takahashi T., Zhou S.Y., Nakamura K., Tanino R., Furuichi A., Kido M., Kawasaki Y., Noguchi K., Seto H., Kurachi M., Suzuki M. A follow-up MRI study of the fusiform gyrus and middle and inferior temporal gyri in schizophrenia spectrum. Prog. Neuro-Psychopharmacol. Biol. Psychiatry. 2011;35:1957–1964. doi: 10.1016/j.pnpbp.2011.07.009. [DOI] [PubMed] [Google Scholar]
  101. Tan L., Chen Y., Maloney T.C., Caré M.M., Holland S.K., Lu L.J. Combined analysis of sMRI and fMRI imaging data provides accurate disease markers for hearing impairment. Neuroimage Clin. 2013;3:416–428. doi: 10.1016/j.nicl.2013.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Tanskanen P., Ridler K., Murray G.K., Haapea M., Veijola J.M., Jaaskelainen E., Miettunen J., Jones P.B., Bullmore E.T., Isohanni M.K. Morphometric brain abnormalities in schizophrenia in a population-based sample: relationship to duration of illness. Schizophr. Bull. 2010;36:766–777. doi: 10.1093/schbul/sbn141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Torres U.S., Duran F.L., Schaufelberger M.S., Crippa J.A., Louza M.R., Sallet P.C., Kanegusuku C.Y., Elkis H., Gattaz W.F., Bassitt D.P., Zuardi A.W., Hallak J.E., Leite C.C., Castro C.C., Santos A.C., Murray R.M., Busatto G.F. Patterns of regional gray matter loss at different stages of schizophrenia: a multisite, cross-sectional VBM study in first-episode and chronic illness. Neuroimage Clin. 2016;12:1–15. doi: 10.1016/j.nicl.2016.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Tranel D., Damasio H., Damasio A.R. A neural basis for the retrieval of conceptual knowledge. Neuropsychologia. 1997;35:1319–1327. doi: 10.1016/s0028-3932(97)00085-7. [DOI] [PubMed] [Google Scholar]
  105. Van Essen D.C. A population-average, landmark- and surface-based (PALS) atlas of human cerebral cortex. NeuroImage. 2005;28:635–662. doi: 10.1016/j.neuroimage.2005.06.058. [DOI] [PubMed] [Google Scholar]
  106. Van Horn J.D., Toga A.W. Multisite neuroimaging trials. Curr. Opin. Neurol. 2009;22:370–378. doi: 10.1097/WCO.0b013e32832d92de. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Vita A., De Peri L., Deste G., Sacchetti E. Progressive loss of cortical gray matter in schizophrenia: a meta-analysis and meta-regression of longitudinal MRI studies. Transl. Psychiatry. 2012;2 doi: 10.1038/tp.2012.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Wagshal D., Knowlton B.J., Cohen J.R., Bookheimer S.Y., Bilder R.M., Fernandez V.G., Asarnow R.F. Cognitive correlates of gray matter abnormalities in adolescent siblings of patients with childhood-onset schizophrenia. Schizophr. Res. 2015;161:345–350. doi: 10.1016/j.schres.2014.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Walther S., Federspiel A., Horn H., Bianchi P., Wiest R., Wirth M., Strik W., Muller T.J. Encoding deficit during face processing within the right fusiform face area in schizophrenia. Psychiatry Res. 2009;172:184–191. doi: 10.1016/j.pscychresns.2008.07.009. [DOI] [PubMed] [Google Scholar]
  110. Wang, H., Nie, F., Huang, H., Risacher, S., Ding, C., Saykin, A.J., Shen, L. (Sparse multi-task regression and feature selection to identify brain imaging predictors for memory performance). In; 2011. p 557–562. [DOI] [PMC free article] [PubMed]
  111. Wang X., Zhang T., Chaim T., Zanetti M., Davatzikos C. Proceeding of the 18th Annual International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2015. Classification of MRI under the presence of disease heterogeneity using multi-task learning: application to bipolar disorder. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Watanabe T., Kessler D., Scott C., Sripada C. 2014. Multisite Disease Classification with Second International Workshop on Sparsity Techniques in Medical Imaging. Boston. [Google Scholar]
  113. Weinberger D.R., Radulescu E. Finding the elusive psychiatric “lesion” with 21st-century neuroanatomy: a note of caution. Am. J. Psychiatry. 2016;173:27–33. doi: 10.1176/appi.ajp.2015.15060753. [DOI] [PubMed] [Google Scholar]
  114. Whalley H.C., Simonotto E., Moorhead W., McIntosh A., Marshall I., Ebmeier K.P., Owens D.G., Goddard N.H., Johnstone E.C., Lawrie S.M. Functional imaging as a predictor of schizophrenia. Biol. Psychiatry. 2006;60:454–462. doi: 10.1016/j.biopsych.2005.11.013. [DOI] [PubMed] [Google Scholar]
  115. Wright I.C., Rabe-Hesketh S., Woodruff P.W., David A.S., Murray R.M., Bullmore E.T. Meta-analysis of regional brain volumes in schizophrenia. Am. J. Psychiatry. 2000;157:16–25. doi: 10.1176/ajp.157.1.16. [DOI] [PubMed] [Google Scholar]
  116. Wright T.M., Pelphrey K.A., Allison T., McKeown M.J., McCarthy G. Polysensory interactions along lateral temporal regions evoked by audiovisual speech. Cereb. Cortex. 2003;13:1034–1043. doi: 10.1093/cercor/13.10.1034. [DOI] [PubMed] [Google Scholar]
  117. Xue Y., Liao X., Carin L., Krishnapuram B. Multi-task learning for classification with Dirichlet process priors. J. Mach. Learn. Res. 2007;8:35–63. [Google Scholar]
  118. Yan J., Li T., Wang H., Huang H., Wan J., Nho K., Kim S., Risacher S.L., Saykin A.J., Shen L., Alzheimer's Disease Neuroimaging I. Cortical surface biomarkers for predicting cognitive outcomes using group l2,1 norm. Neurobiol. Aging. 2015;36(Suppl. 1):S185–93. doi: 10.1016/j.neurobiolaging.2014.07.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Yoon J.H., D'Esposito M., Carter C.S. Preserved function of the fusiform face area in schizophrenia as revealed by fMRI. Psychiatry Res. 2006;148:205–216. doi: 10.1016/j.pscychresns.2006.06.002. [DOI] [PubMed] [Google Scholar]
  120. You X., Adjouadi M., Guillen M.R., Ayala M., Barreto A., Rishe N., Sullivan J., Dlugos D., Vanmeter J., Morris D., Donner E., Bjornson B., Smith M.L., Bernal B., Berl M., Gaillard W.D. Sub-patterns of language network reorganization in pediatric localization related epilepsy: a multisite study. Hum. Brain Mapp. 2011;32:784–799. doi: 10.1002/hbm.21066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Zanetti M.V., Schaufelberger M.S., Doshi J., Ou Y., Ferreira L.K., Menezes P.R., Scazufca M., Davatzikos C., Busatto G.F. Neuroanatomical pattern classification in a population-based sample of first-episode schizophrenia. Prog. Neuro-Psychopharmacol. Biol. Psychiatry. 2013;43:116–125. doi: 10.1016/j.pnpbp.2012.12.005. [DOI] [PubMed] [Google Scholar]

Articles from NeuroImage : Clinical are provided here courtesy of Elsevier

RESOURCES