Skip to main content
Frontiers in Computational Neuroscience logoLink to Frontiers in Computational Neuroscience
. 2014 Jun 6;8:60. doi: 10.3389/fncom.2014.00060

Identifying endophenotypes of autism: a multivariate approach

Fermín Segovia 1,2,*, Rosemary Holt 2, Michael Spencer 2, Juan M Górriz 3, Javier Ramírez 3, Carlos G Puntonet 4, Christophe Phillips 1, Lindsay Chura 2, Simon Baron-Cohen 2, John Suckling 2
PMCID: PMC4047979  PMID: 24936183

Abstract

The existence of an endophenotype of autism spectrum condition (ASC) has been recently suggested by several commentators. It can be estimated by finding differences between controls and people with ASC that are also present when comparing controls and the unaffected siblings of ASC individuals. In this work, we used a multivariate methodology applied on magnetic resonance images to look for such differences. The proposed procedure consists of combining a searchlight approach and a support vector machine classifier to identify the differences between three groups of participants in pairwise comparisons: controls, people with ASC and their unaffected siblings. Then we compared those differences selecting spatially collocated as candidate endophenotypes of ASC.

Keywords: autism spectrum condition, MRI, support vector machine, searchlight, endophenotype

1. Introduction

Autism Spectrum Condition (ASC) is a lifelong neurodevelopmental condition affecting approximately 1% of population (Baron-Cohen et al., 2009), that causes impairments of social communication alongside unusually repetitive behavior, narrow interests and resistance to change. Despite research efforts carried out over last few years, ASC is one of the more difficult conditions to characterize (Brambilla et al., 2003) and its aetiology is still largely unknown.

Historically postmortem studies have revealed neuroanatomic abnormalities (Bailey et al., 1998) associated with ASC that can also be analyzed by means of in vivo neuroimaging techniques. Neuroimaging technology is advancing at an impressive pace and is having a huge impact in both research and clinical environments. Magnetic Resonance Imaging (MRI) is a medical imaging technology that allows detailed visualization of the internal structures of the body. It has been widely used in the neurosciences, including in many studies of ASC. For example, in Sears et al. (1999) and Courchesne et al. (2001), the authors compared the size of several brain regions in controls and ASC participants from structural MRI. A longitudinal study focusing on the volume of the brainstem of controls and ASC participants was presented in Jou et al. (2013). Additionally, in Hashimoto et al. (1992) the brain volume and head circumference of participants with ASC were examined and compared with control individuals.

Recent studies suggest the existence of an endophenotype [a heritable biomarker associated with a pathology that individuals may have regardless of whether they have developed the pathology or not (Gottesman and Gould, 2003)] of ASC (Nydn et al., 2011; Spencer et al., 2012) identifiable through the analysis of unaffected relatives, typically parents or siblings of autistic individuals. In fact, according to (Constantino et al., 2010), siblings of individuals with ASC are at increased risk of developing the condition, with a prevalence estimated to be >20 times that of the general population.

Recently developed machine learning techniques provide an opportunity to analyse the differences between unaffected siblings of autistic individuals and controls with no family history of ASC. Differences co-located with phenotypic differences identified in comparisons of controls and ASC participants are candidate endophenotypes of ASC and may be attributed to effects at the level of the genome or epigenome that confer familial risk for the condition.

The analysis techniques used in neuroimaging are usually divided into two groups. On the one hand, univariate techniques perform a statistical analysis at each voxel separately and do not take into account relationships amongst distant voxels, although statistical inference of voxel clusters has improved sensitivity. Despite this disadvantage, univariate methods are widely used, in part because of the interpretability of results and access to the methodology through standard software packages such as Statistical Parametric Mapping (SPM) (Friston et al., 2011). In fact, univariate methods have been recently used to study the endophenotype of autism. For example in Dalton et al. (2007), the authors used a t-test to analyse functional Magnetic Resonance Imaging (fMRI) from controls and unaffected siblings of ASC individuals. Another univariate approach was used in Salmond et al. (2003) and Peterson et al. (2006). In these works, MRI data from ASC children were analyzed by means of Voxel Based Morphometry (VBM) (Ashburner and Friston, 2000). This approach allows investigation of focal differences in brain anatomy by performing a voxel-wise comparison of the local concentration or volume of gray matter between two or more groups of participants. Indeed, VBM has been widely used in the study of ASC and several meta-analyses have been conducted (Cauda et al., 2011; Radua et al., 2011; Via et al., 2011; Nickl-Jockschat et al., 2012). Whilst there appears to be a consistency in the reports of case-control differences in occipital, temporal, and parietal lobes as well as the precentral gyrus, generally the extant literature is characterized by a significant level of between-study variability. Such discrepancies between studies are variously ascribed to methodological differences, variations in the ages and IQs of participants, diagnostic criteria (Nickl-Jockschat et al., 2012) and, most frequently, to the heterogeneity of the signs and symptoms of ASC (Volkmar and Pauls, 2003). With these issues in mind, techniques that increase statistical sensitivity have added value in structural MRI studies of ASC and particularly in study designs for endophenotype discovery where effect sizes are potentially reduced by the introduction of a group (siblings) hypothesized to lie intermediate between cases and controls.

On the other hand, multivariate approaches, including machine learning, analyse each image as a whole and explicitly consider the inter-relationships across voxels. Here, effects due to brain structure or function as well as confounding and error effects are assessed statistically both at each voxel and as interactions among voxels (Friston et al., 2011). The main drawback to be addressed is the so-called small sample size problem (Duin, 2000), which occurs when the number of variables (voxels in case of neuroimaging) to be analyzed is significantly greater than the sample size (i.e., number of images) used in the study.

Searchlight analysis (or information mapping) is a recently described methodology based on multivariate pattern analysis (MVPA) that address the small sample size problem by dividing the brain volume into small regions of just a few voxels (Kriegeskorte et al., 2006; Kriegeskorte and Bandettini, 2007). It has been successfully used for classification problems [see for example (Illán et al., 2011)]. However, in this work it is used to highlight the locations where participants from two groups have differing patterns of brain tissue volume.

Here we demonstrate an original methodology to analyse structural MR images based on a multivariate exploration carried out with a searchlight approach. The primary goal is to discover evidence for the existence of an endophenotype of ASC. This condition has been intensively studied through univariate approaches, such as the ones mentioned above, and by directly measuring the size of several brain regions (Rojas et al., 2004; Webb et al., 2009). However, in our opinion, there is still room for the application of multivariate strategies that provide higher statistical sensitivity and allow corroborating or refuting the results reported in univariate studies. In addition, we hypothesized that multivariate approaches like the one proposed in this work provide a better way of analysing neuroimaging data by looking for differences between groups. Therefore, multivariate methods can realize differences even when univariate methods find no differences. Our hypothesis is based in the fact that multivariate approaches analyse the data not only at voxel level but also taking into account the relations between voxels. Thus, differences in those relations can be only found with multivariate methods.

We studied 132 whole-brain structural MRI from adolescents divided into three groups: ASC participants, unaffected siblings of ASC participants, and individuals with no family history of ASC (Table 1), referred to hereafter as “ASC,” “siblings,” and “controls” respectively. After normalizing the MRI volumes to a standard stereotactic space, we performed a searchlight analysis comparing “ASC” vs. “controls” and “siblings” vs. “controls.” The resulting information maps identified overlapping regions of between-group difference that may be considered as elements of an endophenotype of ASC. It is worth noting that this cohort was previously analyzed with VBM and no significant differences were found.

Table 1.

Demographic details of participants.

Number Sex Age IQ
M F μ σ range μ σ range
ASC 52 35 17 14.49 1.74 12–18 104.6 15.89 73–146
Siblings 40 12 28 14.86 2.12 12–18 113.1 10.06 88–133
Controls 40 20 20 15.06 1.61 12–18 112.4 11.12 83–136

μ and σ are the average and the standard deviation, respectively.

2. Materials and methods

2.1. Study description

The study cohort comprised 132 participants aged between 12 and 18 years. Fifty-two (52) had a diagnosis of ASC, either classic autism or Asperger syndrome, 40 participants were unaffected siblings of individuals with ASC. Forty (40) controls with no family history of ASC were also recruited. Demographic details are gathered in Table 1.

All ASC participants met Diagnostic and Statistical Manual of Mental Disorders, fourth edition criteria (American Psychiatric Association et al., 1994) for autism or Asperger syndrome and were positive on the Autism Diagnostic Interview-Revised (Lord et al., 1994) and the ADOS-G (Losh et al., 2009). Further details on recruitment are given in references (Spencer et al., 2011, 2012).

The study was given ethical approval by the Cambridgeshire 1 Research Ethics Committee, and all participants and their parents provided written informed consent.

Data collection took place at the Medical Research Council, Cognition and Brain sciences unit, on a Siemens Tim Trio (Siemens Medical Solutions, AG, Erlangen, Germany) operating at 3T. High-resolution T1-weighted three-dimensional magnetization-prepared rapid acquisition gradient-echo (MP-RAGE) structural images were acquired with the following parameters: slice thickness = 1 mm; TR = 2300 ms; TE = 2.98 ms; field of view = 256 × 240 × 176 mm; flip angle = 9°; voxel size = 1 × 1 × 1 mm.

All MRI datasets were first segmented into their component tissues (gray and white matter) and then normalized to a standard stereotactic space of the Montreal Neurological Institute (MNI) using SPM software (Friston et al., 2011). Specifically, we used the Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) (Ashburner, 2007). This algorithm normalizes both segmented gray and white matter images from all participants in an integrated, iterative procedure by computing a flow field which can then be expressed as both forward and backward image deformations. The MNI-normalized gray matter volume maps were used in the subsequent analyses.

2.2. Multivariate analysis based on machine learning

During the last decade, many research efforts have been focused on MVPA as a promising way of analysing high dimensional data. The growth of multivariate approaches is partly due to the recent advances in machine learning which provide more accurate statistical classifiers with the benefits of generalization. In that sense, support vector machines (SVM) have recently attracted the attention of the pattern recognition community because of the merits derived from statistical learning theory (Vapnik, 1995, 1998) developed by Vladimir Vapnik in late 1990s.

A SVM classifier builds a function f: RN → {±1} using the training data (N-dimensional patterns xi and their class labels yi) so that f is able to predict the label y of a new example x. The computation of function f is based on the calculation of a hyperplane, called the maximal margin hyperplane, that has the largest distance to the closest training data point of any class. Then, this hyperplane is used to classify new data points of unknown class. Formally, the decision hyperplane is defined as:

g(x)=wTx+w0=0, (1)

where w is the weight vector, orthogonal to the decision hyperplane, and w0 is the threshold. When no linear separation of the training data is possible, SVM can work effectively in combination with kernel techniques so that the hyperplane defining the SVM corresponds to a non-linear decision boundary in the input space (Müller et al., 2001).

SVM has been successfully used in a number of problems in different fields and is one of the most prevalent classifiers in neuroimaging-based classification tasks (Shen and Ji, 2009; Swiderski et al., 2009; Cuingnet et al., 2011; Zhang et al., 2011), such as the development of computer aided diagnosis systems for neurodegenerative disorders (López et al., 2009; Segovia et al., 2010, 2012). As with other statistical classifiers, SVM may be also used to highlight the differences between two groups of images by means of a searchlight approach. Searchlight (Kriegeskorte et al., 2006) divides the brain volumes into small regions that are independently analyzed. All the regions are of equal size and shape and cover the entire parenchyma of the brain. They are defined by following a systematic procedure with no prior knowledge about the groups that are being compared. As a result of applying this technique, a map is obtained derived from the discriminating power of different brain regions.

In this work, we utilized a searchlight approach and support vector machine classification to look for anatomical differences between controls and ASC participants, and between controls and unaffected siblings of ASC participants. Then, by comparing both maps of classification accuracies we estimated potential endophenotypes of ASC. Thus, the exploration implemented here makes two comparisons of two groups and creates a map that assigns a discrimination power to each voxel. However, the analysis is not carried out in a univariate way (voxel by voxel), but it is performed over small regions and then the result obtained for a region is assigned to all the voxels in that region. We defined as many regions as voxels resulting in a high rate of overlapping and allowing for exhaustive exploration of the image space. Cubic regions with a side of 5 voxels were used, leading to regions containing 125 voxels.

For each region we used a k-fold cross-validation (CV) scheme along with a SVM classifier to estimate the accuracy of classifying the images in the two groups. A high accuracy rate indicates that the analyzed region contains large differences between the two groups. On the other hand, a low accuracy rate, about 50%, suggests only small differences. The classification procedure used a linear kernel along with the SVM classifier (with parameter C fixed to the commonly accepted value of C = 1) and estimated the accuracy rates by means of CV, which is an effective method for estimating the risk of a classifier (Cortes and Vapnik, 1995; Müller et al., 2001). Specifically, we used a 10-fold scheme that provides similar estimation errors than leave-one-out (Varma and Simon, 2006) with a smaller computational load (an important issue to be taken into account in our experiments since we performed many thousands of CV loops).

Subsequently, the significance of the accuracy obtained with each region was assessed with a non-parametric test. This additional test consisted of repeating 1000 times the same classification procedure but using random labels. In this way we were able to estimate the probability that an accuracy rate was obtained by chance (Good, 2000). A p-value was then computed as the number of repeats where the accuracy obtained with the random labels was larger than that obtained with the true labels, divided by 1000. In order to address the multiple comparison problem (Miller, 1966), a false discovery rate control based on the Benjamini-Hochberg procedure (Benjamini and Hochberg, 1995) was applied. Accordingly, the tests with high p-values (i.e., p-values higher than an ad-hoc threshold related to the number of simultaneous tests) were discarded.

Finally, we assigned to each voxel the highest accuracy rate obtained for all the regions that included that voxel. The pseudocode for the searchlight procedure is shown in Algorithm 1.

Algorithm 1.

Implementation of the SVM-based searchlight approach.

graphic file with name fncom-08-00060-i0001.jpg

3. Experiments and results

The experiments were performed in three steps: First, we estimated the locations with different patterns in controls and ASC participants using the multivariate approach described in section 2.2. Secondly, we followed the same procedure to differentiate between controls and unaffected siblings of ASC participants. Finally, we estimated the similarities between the two maps computed in previous steps by finding the regions of significant difference common for both. The result of the first two steps is shown in Figure 1 and the final comparison (step 3) is shown in Figure 2. In order to analyse the locations of the regions highlighted by the algorithm, we calculated the brain regions involved by means of the Anatomical Labeling Atlas (AAL) (Tzourio-Mazoyer et al., 2002). Table 2 shows the structural regions containing areas highlighted in Figure 1 and the percentage of brain region they cover. Regions covered by a relatively large area in both classifications (shown in the last column) are candidate endophenotypes of ASC. This analysis facilitates the comparison of results obtained in this work with the extant literature.

Figure 1.

Figure 1

Location of the significant regions (estimated through the Benjamini-Hochberg procedure with α = 0.05) when classifying controls vs. ASC (top) and controls vs. siblings (bottom). The intensity of the white color indicates the accuracy achieved for that region. The accuracy rate varies from 65.22 to 77.17% for the first classification and from 66.25 to 82.50% for the second one.

Figure 2.

Figure 2

Comparison of between-group difference maps shown in Figure 1. Red: significant regions for the controls vs. ASC participant's classification; blue: significant regions for the controls vs. siblings classification; white: the common regions for both classifications.

Table 2.

Size of the significant regions for classifications of control vs. ASC (second column) and controls vs. siblings (third column), grouped by anatomical region.

Anatomical region (hemisphere) Controls vs ASC (%). Controls vs siblings (%). Common area (%)
Cerebelum (left) 394 (9.71) 821 (20.24) 146 (3.60)
Parietal lobule (right) 123 (9.33) 162 (12.28) 100 (7.58)
Occipital (left) 239 (11.86) 289 (14.34) 74 (3.67)
Angular gyrus (left) 75 (18.25) 141 (34.31) 71 (17.27)
Inferior frontal gyrus (left) 89 (8.64) 265 (25.73) 67 (6.50)
Temporal gyrus (right) 426 (10.71) 377 (9.48) 62 (1.56)
Cingulate gyrus (left) 152 (11.27) 197 (14.60) 59 (4.37)
Parietal lobule (left) 111 (6.95) 79 (4.94) 41 (2.57)
Parahippocampal gyrus (right) 51 (13.35) 75 (19.63) 31 (8.12)
Hippocampus (left) 36 (11.92) 89 (29.47) 30 (9.93)
Thalamus (right) 64 (19.22) 38 (11.41) 27 (8.11)
Cerebelum (right) 262 (6.05) 273 (6.31) 24 (0.55)
Vermis 192 (26.82) 44 (6.15) 24 (3.35)
Cingulate gyrus (right) 394 (29.76) 62 (4.68) 23 (1.74)
Area triangularis (left) 101 (10.64) 91 (9.59) 21 (2.21)
Precuneus (right) 296 (24.79) 74 (6.20) 18 (1.51)
Calcarine sulcus (left) 31 (3.54) 178 (20.32) 17 (1.94)
Precuneus (left) 33 (2.52) 230 (17.60) 15 (1.15)
Putamen (left) 21 (5.98) 79 (22.51) 15 (4.27)
Thalamus (left) 65 (18.90) 139 (40.41) 11 (3.20)
Globus pallidus (left) 38 (43.18) 11 (12.50) 8 (9.09)
Middle temporal pole (right) 43 (9.35) 91 (19.78) 7 (1.52)
Superior frontal gyrus (right) 120 (3.94) 146 (4.79) 7 (0.23)
Supplementary motor area (left) 24 (3.38) 22 (3.09) 7 (0.98)
Amygdala (left) 8 (10.53) 6 (7.89) 6 (7.89)
Inferior frontal gyrus (right) 125 (10.48) 160 (13.41) 6 (0.50)
Fusiform gyrus (right) 10 (1.12) 65 (7.30) 5 (0.56)
Middle frontal gyrus (left) 142 (6.59) 252 (11.69) 5 (0.23)
Calcarine sulcus (right) 109 (15.55) 74 (10.56) 4 (0.57)
Caudate nucleus (left) 42 (11.54) 5 (1.37) 4 (1.10)
Lingual gyrus (right) 139 (16.22) 32 (3.73) 4 (0.47)
Angular gyrus (right) 13 (2.02) 22 (3.42) 3 (0.47)
Parahippocampal gyrus (left) 34 (10.12) 38 (11.31) 2 (0.60)
Superior temporal pole (right) 63 (11.71) 5 (0.93) 1 (0.19)
Area triangularis (right) 89 (10.29) 23 (2.66)
Caudate nucleus (right) 1 (0.27) 42 (11.29)
Cuneus (left) 23 (4.09)
Cuneus (right) 2 (0.40) 71 (14.12)
Fusiform gyrus (left) 72 (9.13) 48 (6.08)
Gyrus rectus (left) 5 (1.51)
Gyrus rectus (right) 1 (0.37)
Hippocampus (right) 3 (0.94) 16 (5.02)
Insula (left) 230 (33.82)
Insula (right) 7 (1.07)
Lingual gyrus (left) 10 (1.27) 343 (43.69)
Middle frontal gyrus (right) 130 (5.66) 167 (7.27)
Middle temporal pole (left) 62 (21.23)
Occipital (right) 21 (1.30) 220 (13.58)
Olfactory cortex (left) 6 (6.00)
Olfactory cortex (right) 11 (9.48)
Paracentral lobule (left) 9 (1.96) 67 (14.60)
Paracentral lobule (right) 2 (0.70) 4 (1.40)
Postcentral gyrus (left) 48 (3.63) 81 (6.12)
Postcentral gyrus (right) 35 (2.55)
Precentral gyrus (left) 8 (0.71) 17 (1.51)
Precentral gyrus (right) 124 (11.10)
Rolandic operculum (left) 52 (15.57)
Rolandic operculum (right) 10 (2.16)
Superior frontal gyrus (left) 161 (5.09) 398 (12.57)
Superior temporal pole (left) 12 (2.43)
Supplementary motor area (right) 114 (14.21) 1 (0.12)
Supramarginal gyrus (right) 10 (1.48)
Temporal gyrus (left) 256 (6.99) 205 (5.60)
Transverse temporal gyri (left) 24 (28.92)
Transverse temporal gyri (right) 1 (1.16)

The last column (fourth) contains the size of the common area for both classifications. The size is given in mm3 and as the percentage of the total volume of the anatomical region.

In order to test if regions marked as common for both classifications indeed have the same pattern, an additional experiment was performed. For each significant region in the first classification, a SVM classifier was trained using controls and ASC participants, and the positive rate (percentage of ASC correctly classified) was then estimated through CV. Subsequently, the sibling data was evaluated with that trained classifier. For about 20% of the regions, the number of ASC individuals correctly classified as such was approximately the same (equal or in the 10% range) as the number of siblings classified as ASC. This reveals the existence of a common pattern for ASC and siblings in those regions and strengthens the suggestion that they are endophenotypes of ASC. Figure 3 plots the positive rate vs. the percentage of siblings classified as ASC. Note that 20% of regions mentioned above are located close to the blue line.

Figure 3.

Figure 3

Percentage of ASC individuals correctly classified as ASC by a classifier trained with controls and ASC data (abscissa) vs. the percentage of siblings classified as ASC by the same trained classifier (ordinate axis). Regions near to the blue line provide approximately the same accuracy in both classifications. This suggests they have similar patterns and corroborates the suggestion that they are endophenotypes of ASC.

Regardless of the regional analysis it is reasonable to expect that classification of controls vs. ASC participants would achieve higher accuracy rates than classification of controls vs. siblings of ASC participants. After all, the siblings have not developed the condition and should be more similar to controls and thus separating these groups should be expected to be relatively more challenging. However, our experiments indicate the converse. Figure 4 shows the histograms with the number of regions for which the two classifications (controls vs. ASC and controls vs. siblings) achieved significant accuracies. Note the large differences in the intervals of 70–75 and 75–80% of accuracy.

Figure 4.

Figure 4

Histograms of the number of regions for which the SVM classifier achieved significant accuracy rates when classifying controls vs. ASC participants (left) and controls vs. siblings of ASC participants (right).

4. Discussion

The results presented above demonstrate that structural differences exist in gray matter maps derived from neuroimaging data comparing controls and ASC participants, as well as between controls and unaffected siblings of ASC individuals. In the first case the regions containing differences are mainly located in temporal gyrus, cerebellum, right cingulate gyrus, right precuneus, left occipital and vermis, whereas differences for the second comparison are mostly located in cerebellum, left frontal gyrus, right temporal gyrus, left lingual gyrus, left occipital, and insula (a complete list of the regions is shown in Table 2). A large number of common regions are located in the cerebellum, the importance of which has been emphasized in previous studies of ASC (Salmond et al., 2003). Specifically, left cerebellum was suggested as a neuroendophenotype for ASC in Peterson et al. (2006), where unaffected parents of autistic participants and controls were compared. These authors found abnormalities in the left cerebellar volume of the parents indicating the existence of similarities in altered brain structures of autistic individuals and their unaffected relatives.

The vermis is also a candidate region as a neuroendophenotype of ASC (group differences in both ASC compared to controls and siblings compared to controls covering 23 mm3 were found in this region). Smaller vermis volume in ASC participants than in controls was found in Kilman et al. (1997) and, more recently in Webb et al. (2009). Figure 2 and Table 2 show that differences in this region are also extant in the comparison between controls and siblings of ASC participants discovered in the present study.

Other previous works that looked for neuroendophenotypes of ASC reported that the amygdala was smaller in adults with ASC relative to the unaffected parents of children with ASC and age-matched controls (Rojas et al., 2004). In our analysis we found small differences in the left amygdala when comparing ASC participants vs. controls and siblings vs. controls. In addition the regions highlighted in both comparisons are of similar size. These results concur with those described in Dalton et al. (2007), where the authors compared adolescents diagnosed with ASC, unaffected siblings of ASC individuals and participants with no personal or familial history of ASC. They found that amygdala volume of ASC participants was similar to their unaffected siblings and significantly smaller when compared with a control group, which may have implications for social interaction and communication difficulties in autism and potentially subtle traits in siblings.

In general, it worth noting that an important part of the regions highlighted in our analysis and enumerated in Table 2 also appear relevant to differentiating between control and ASC individuals in a number of univariate studies (Cauda et al., 2011; Radua et al., 2011; Via et al., 2011; Nickl-Jockschat et al., 2012).

The results presented also suggest that our initial hypothesis on the enhanced ability of multivariate approaches to finding differences between groups of neuroimaging data is substantiated. The proposed method found significant differences in a dataset in which VBM had previously found no significant differences at levels of significance controlling for multiple comparisons. Furthermore, the locations of the observed differences match those reported by univariate studies using other, larger datasets.

Finally, some considerations about the size and shape of the regions, and the manner of exploration associated with the searchlight approach can be drawn. The original searchlight approach proposed by Kriegeskorte et al. (2006) used spherical regions. However, here we used cubes for computational simplicity, reducing the computation time of the procedure without producing adverse effects. In fact, a cube-based searchlight approach was successfully used in Illán et al. (2011) for classification purposes. In general, the size of the regions is a key control parameter for this approach since it controls the trade-off between localization and globalization. If the regions are small, the results (highlighted regions) will be more accurate with enhanced spatial localization, but the approach tends to the univariate approach with its attendant disadvantages. In addition, using smaller regions may result in an increase of the computational burden if region overlapping is limited. Conversely, if the regions are large, finding small areas of interest is more difficult or, if found, will be included in regions much larger than the area of interest, decreasing the localization power. As described in section 3, we used regions of 125 voxels that, in our opinion, represent a good trade-off between accuracy and computational burden. Furthermore, the exploration of the brain space may be undertaken with varying amounts of overlap of the regions. Clearly, overlapping allows a more exhaustive exploration and is particularly important when regions are large.

5. Conclusions

We have presented an original analysis of MRI images by means of a multivariate approach in order to identify candidate neuroendophenotypes of ASC. To this end, we first looked for differences between controls and ASC participants, and then compared these differences to those resulting from comparing controls and unaffected siblings of ASC participants.

The main novelty presented in this study is the algorithm used in the discovery of those differences: a multivariate approach that consists of performing an exhaustive examination of the brain space by means of a searchlight methodology combined with a support vector machine classifier.

The results reported here indicate that differences in cerebellum, parietal lobule, left occipital, left angular gyrus and, to a lesser extent, other regions listed in Table 2 can be considered neuroendophenotypes of ASC. Additionally, we corroborated the existence of separate, significant differences between controls and siblings of ASC participants that have not developed the condition. These findings, if replicated by other studies, may go some way to explaining the increased likelihood of unaffected siblings of ASC individuals developing the condition in later life.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was partly supported by the University of Granada under the Genil PYR 2012-10 project (CEI BioTIC GENIL CEB09-0010) and the University of Liège. The study was also funded by a Clinical Scientist Fellowship from the UK Medical Research Council (MRC (G0701919)) to Michael Spencer and by the UK National Institute for Health Research Cambridge Biomedical Research Centre. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Simon Baron-Cohen was supported by the Medical Research Council, the Wellcome Trust, and the Autism Research Trust during the period of this work.

References

  1. American Psychiatric Association., American Psychiatric Association., and Task Force on DSM-IV (1994). Diagnostic and Statistical Manual of Mental Disorders: DSM-IV. Washington, DC: American Psychiatric Association [Google Scholar]
  2. Ashburner J. (2007). A fast diffeomorphic image registration algorithm. Neuroimage 38, 95–113 10.1016/j.neuroimage.2007.07.007 [DOI] [PubMed] [Google Scholar]
  3. Ashburner J., Friston K. J. (2000). Voxel-based morphometry the methods. Neuroimage 11, 805–821 10.1006/nimg.2000.0582 [DOI] [PubMed] [Google Scholar]
  4. Bailey A., Luthert P., Dean A., Harding B., Janota I., Montgomery M., et al. (1998). A clinicopathological study of autism. Brain J. Neurol. 121(pt 5), 889–905 10.1093/brain/121.5.889 [DOI] [PubMed] [Google Scholar]
  5. Baron-Cohen S., Scott F. J., Allison C., Williams J., Bolton P., Matthews F. E., et al. (2009). Prevalence of autism-spectrum conditions: UK school-based population study. Br. J. Psychiatry 194, 500–509 10.1192/bjp.bp.108.059345 [DOI] [PubMed] [Google Scholar]
  6. Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B Methodol. 57, 289–300 [Google Scholar]
  7. Brambilla P., Hardan A., di Nemi S. U., Perez J., Soares J. C., Barale F. (2003). Brain anatomy and development in autism: review of structural MRI studies. Brain Res. Bull. 61, 557–569 10.1016/j.brainresbull.2003.06.001 [DOI] [PubMed] [Google Scholar]
  8. Cauda F., Geda E., Sacco K., D'Agata F., Duca S., Geminiani G., et al. (2011). Grey matter abnormality in autism spectrum disorder: an activation likelihood estimation meta-analysis study. J. Neurol. Neurosurg. Psychiatry 82, 1304–1313 10.1136/jnnp.2010.239111 [DOI] [PubMed] [Google Scholar]
  9. Constantino J. N., Zhang Y., Frazier T., Abbacchi A. M., Law P. (2010). Sibling recurrence and the genetic epidemiology of autism. Am. J. Psychiatry 167, 1349–1356 10.1176/appi.ajp.2010.09101470 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cortes C., Vapnik V. (1995). Support-vector networks. Machine Learn. 20, 273–297 10.1016/j.neunet.2006.12.003 [DOI] [PubMed] [Google Scholar]
  11. Courchesne E., Karns C. M., Davis H. R., Ziccardi R., Carper R. A., Tigue Z. D., et al. (2001). Unusual brain growth patterns in early life in patients with autistic disorder: an MRI study. Neurology 57, 245–254 10.1212/WNL.57.2.245 [DOI] [PubMed] [Google Scholar]
  12. Cuingnet R., Rosso C., Chupin M., Lehricy S., Dormont D., Benali H., et al. (2011). Spatial regularization of SVM for the detection of diffusion alterations associated with stroke outcome. Med. Image Anal. 15, 729–737 10.1016/j.media.2011.05.007 [DOI] [PubMed] [Google Scholar]
  13. Dalton K. M., Nacewicz B. M., Alexander A. L., Davidson R. J. (2007). Gaze-fixation, brain activation, and amygdala volume in unaffected siblings of individuals with autism. Biol. Psychiatry 61, 512–520 10.1016/j.biopsych.2006.05.019 [DOI] [PubMed] [Google Scholar]
  14. Duin R. P. W. (2000). Classifiers in almost empty spaces, in Proceedings 15th International Conference on Pattern Recognition, Vol. 2 (Barcelona: ), 1–7 [Google Scholar]
  15. Friston K. J., Ashburner J. T., Kiebel S. J., Nichols T. E., Penny W. D. (2011). Statistical Parametric Mapping: The Analysis of Functional Brain Images. Amsterdam: Academic Press [Google Scholar]
  16. Good P. I. (2000). Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses. New York, NY: Springer; 10.1007/978-1-4757-3235-1 [DOI] [Google Scholar]
  17. Gottesman I. I., Gould T. D. (2003). The endophenotype concept in psychiatry: etymology and strategic intentions. Am. J. Psychiatry 160, 636–645 10.1176/appi.ajp.160.4.636 [DOI] [PubMed] [Google Scholar]
  18. Hashimoto T., Tayama M., Miyazaki M., Sakurama N., Yoshimoto T., Murakawa K., et al. (1992). Reduced brainstem size in children with autism. Brain Dev. 14, 94–97 10.1016/S0387-7604(12)80093-3 [DOI] [PubMed] [Google Scholar]
  19. Illáan I., Górriz J., López M., Ramírez J., Salas-Gonzalez D., Segovia F., et al. (2011). Computer aided diagnosis of Alzheimers disease using component based SVM. Appl. Soft Comput. 11, 2376–2382 10.1016/j.asoc.2010.08.019 [DOI] [Google Scholar]
  20. Jou R. J., Frazier T. W., Keshavan M. S., Minshew N. J., Hardan A. Y. (2013). A two-year longitudinal pilot MRI study of the brainstem in autism. Behav. Brain Res. 251, 163–167 10.1016/j.bbr.2013.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kilman A., Reeve A., Cooperman R. B., Lesnik P. G., Ciesielski K. T. (1997). 418 - morphometry of cerebellar vermis in chronic schizophrenia and autism: MRI evidence of developmental etiology. Schizophr. Res. 24, 149 10.1016/S0920-9964(97)82426-7 [DOI] [Google Scholar]
  22. Kriegeskorte N., Bandettini P. (2007). Analyzing for information, not activation, to exploit high-resolution fMRI. Neuroimage 38, 649–662 10.1016/j.neuroimage.2007.02.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kriegeskorte N., Goebel R., Bandettini P. (2006). Information-based functional brain mapping. Proc. Natl. Acad. Sci. U.S.A. 103, 3863–3868 10.1073/pnas.0600244103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. López M., Ramírez J., Górriz J. M., Álvarez I., Salas-Gonzalez D., Segovia F., et al. (2009). SVM-based CAD system for early detection of the Alzheimer's disease using kernel PCA and LDA. Neurosci. Lett. 464, 233–238 10.1016/j.neulet.2009.08.061 [DOI] [PubMed] [Google Scholar]
  25. Lord C., Rutter M., Le Couteur A. (1994). Autism diagnostic interview-revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J. Autism Dev. Disord. 24, 659–685 10.1007/BF02172145 [DOI] [PubMed] [Google Scholar]
  26. Losh M., Adolphs R., Poe M. D., Couture S., Penn D., Baranek G. T., et al. (2009). Neuropsychological profile of autism and the broad autism phenotype. Arch. Gen. Psychiatry 66, 518–526 10.1001/archgenpsychiatry.2009.34 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Miller R. G. (1966). Simultaneous Statistical Inference. New York, NY: McGraw-Hill [Google Scholar]
  28. Müller K., Mika S., Rätsch G., Tsuda K., Schölkopf B. (2001). An introduction to kernel-based learning algorithms. IEEE Trans. Neural Netw. 12, 181–201 10.1109/72.914517 [DOI] [PubMed] [Google Scholar]
  29. Nickl-Jockschat T., Habel U., Michel T. M., Manning J., Laird A. R., Fox P. T., et al. (2012). Brain structure anomalies in autism spectrum disorder–a meta-analysis of VBM studies using anatomic likelihood estimation. Hum. Brain Mapp. 33, 1470–1489 10.1002/hbm.21299 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Nydn A., Hagberg B., Gouss V., Rastam M. (2011). A cognitive endophenotype of autism in families with multiple incidence. Res. Autism Spect. Disord. 5, 191–200 10.1016/j.rasd.2010.03.010 [DOI] [Google Scholar]
  31. Peterson E., Schmidt G. L., Tregellas J. R., Winterrowd E., Kopelioff L., Hepburn S., et al. (2006). A voxel-based morphometry study of gray matter in parents of children with autism. Neuroreport 17, 1289–1292 10.1097/01.wnr.0000233087.15710.87 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Radua J., Via E., Catani M., Mataix-Cols D. (2011). Voxel-based meta-analysis of regional white-matter volume differences in autism spectrum disorder versus healthy controls. Psychol. Med. 41, 1539–1550 10.1017/S0033291710002187 [DOI] [PubMed] [Google Scholar]
  33. Rojas D. C., Smith J. A., Benkers T. L., Camou S. L., Reite M. L., Rogers S. J. (2004). Hippocampus and amygdala volumes in parents of children with autistic disorder. Am. J. Psychiatry 161, 2038–2044 10.1176/appi.ajp.161.11.2038 [DOI] [PubMed] [Google Scholar]
  34. Salmond C. H., de Haan M., Friston K. J., Gadian D. G., Vargha-Khadem F. (2003). Investigating individual differences in brain abnormalities in autism. Philos. Trans. R. Soc. Lond. B Biol. Sci. 358, 405–413 10.1098/rstb.2002.1210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Sears L. L., Vest C., Mohamed S., Bailey J., Ranson B. J., Piven J. (1999). An MRI study of the basal ganglia in autism. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 23, 613–624 10.1016/S0278-5846(99)00020-2 [DOI] [PubMed] [Google Scholar]
  36. Segovia F., Górriz J. M., Ramírez J., Álvarez I., Jiméenez-Hoyuela J. M., Ortega S. J. (2012). Improved parkinsonism diagnosis using a partial least squares based approach. Med. phys. 39, 4395–4403 10.1118/1.4730289 [DOI] [PubMed] [Google Scholar]
  37. Segovia F., Górriz J. M., Ramírez J., Salas-Gonzalez D., Álvarez I., López M., et al. (2010). Classification of functional brain images using a GMM-based multi-variate approach. Neurosci. Lett. 474, 58–62 10.1016/j.neulet.2010.03.010 [DOI] [PubMed] [Google Scholar]
  38. Shen L.-L., Ji Z. (2009). Gabor wavelet selection and SVM classification for object recognition. Acta Autom. Sin. 35, 350–355 10.1016/S1874-1029(08)60082-80 [DOI] [Google Scholar]
  39. Spencer M. D., Holt R. J., Chura L. R., Calder A. J., Suckling J., Bullmore E. T., et al. (2012). Atypical activation during the embedded figures task as a functional magnetic resonance imaging endophenotype of autism. Brain J. Neurol. 135(pt 11), 3469–3480 10.1093/brain/aws229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Spencer M. D., Holt R. J., Chura L. R., Suckling J., Calder A. J., Bullmore E. T., et al. (2011). A novel functional brain imaging endophenotype of autism: the neural response to facial expression of emotion. Transl. Psychiatry 1, e19 10.1038/tp.2011.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Swiderski B., Osowski S., Cichocki A., Rysz A. (2009). Single-class SVM and directed transfer function approach to the localization of the region containing epileptic focus. Neurocomputing 72, 1575–1583 10.1016/j.neucom.2008.09.00423065480 [DOI] [Google Scholar]
  42. Tzourio-Mazoyer N., Landeau B., Papathanassiou D., Crivello F., Etard O., Delcroix N., et al. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 15, 273–289 10.1006/nimg.2001.0978 [DOI] [PubMed] [Google Scholar]
  43. Vapnik V. N. (1995). The Nature of Statistical Learning Theory. Berlin: Springer-Verlag; 10.1007/978-1-4757-2440-0 [DOI] [Google Scholar]
  44. Vapnik V. N. (1998). Statistical Learning Theory. New York, NY: John Wiley and Sons Inc [Google Scholar]
  45. Varma S., Simon R. (2006). Bias in error estimation when using cross-validation for model selection. BMC Bioinform. 7:91 10.1186/1471-2105-7-91 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Via E., Radua J., Cardoner N., Happ F., Mataix-Cols D. (2011). Meta-analysis of gray matter abnormalities in autism spectrum disorder: should asperger disorder be subsumed under a broader umbrella of autistic spectrum disorder? Arch. Gen. Psychiatry 68, 409–418 10.1001/archgenpsychiatry.2011.27 [DOI] [PubMed] [Google Scholar]
  47. Volkmar F. R., Pauls D. (2003). Autism. Lancet 362, 1133–1141 10.1016/S0140-6736(03)14471-6 [DOI] [PubMed] [Google Scholar]
  48. Webb S. J., Sparks B.-F., Friedman S. D., Shaw D. W., Giedd J., Dawson G., et al. (2009). Cerebellar vermal volumes and behavioral correlates in children with autism spectrum disorder. Psychiatry Res. Neuroimag. 172, 61–67 10.1016/j.pscychresns.2008.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Zhang N., Ruan S., Lebonvallet S., Liao Q., Zhu Y. (2011). Kernel feature selection to fuse multi-spectral MRI images for brain tumor segmentation. Comput. Vis. Image Understand. 115, 256–269 10.1016/j.cviu.2010.09.007 [DOI] [Google Scholar]

Articles from Frontiers in Computational Neuroscience are provided here courtesy of Frontiers Media SA

RESOURCES