Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Nov 1.
Published in final edited form as: Int J Dev Neurosci. 2015 Aug 24;46:125–131. doi: 10.1016/j.ijdevneu.2015.07.007

Cortical Thickness Predicts the First Onset of Major Depression in Adolescence

Lara C Foland-Ross 1,*, Matthew D Sacchet 1,2,*, Gautam Prasad 3, Brooke Gilbert 1, Paul M Thompson 3, Ian H Gotlib 1,2
PMCID: PMC4604750  NIHMSID: NIHMS718979  PMID: 26315399

Abstract

Given the increasing prevalence of Major Depressive Disorder and recent advances in preventative treatments for this disorder, an important challenge in pediatric neuroimaging is the early identification of individuals at risk for depression. We examined whether machine learning can be used to predict the onset of depression at the individual level. Thirty-three never-disordered adolescents (10–15 years old) underwent structural MRI. Participants were followed for 5 years to monitor the emergence of clinically significant depressive symptoms. We used support vector machines (SVMs) to test whether baseline cortical thickness could reliably distinguish adolescents who develop depression from adolescents who remained free of any Axis I disorder. Accuracies from subsampled cross-validated classification were used to assess classifier performance. Baseline cortical thickness correctly predicted the future onset of depression with an overall accuracy of 70% (69% sensitivity, 70% specificity; p = 0.021). Examination of SVM feature weights indicated that the right medial orbitofrontal, right precentral, left anterior cingulate, and bilateral insular cortex contributed most strongly to this classification. These findings indicate that cortical gray matter structure can predict the subsequent onset of depression. An important direction for future research is to elucidate mechanisms by which these anomalies in gray matter structure increase risk for developing this disorder.

Keywords: depression, cortical thickness, emotion regulation, machine learning

Introduction

Major Depressive Disorder (MDD) is the most common psychiatric illness in the United States (Kessler et al., 2003) and accounts for almost half of all disability-adjusted life years worldwide (Whiteford et al., 2013). Although the onset of major depression can occur at any age, most adults living with the disorder experienced their first episode during their teenage years (Lewinsohn et al., 1998). Late adolescence is a period of considerable vulnerability for depression: one-quarter of all youth will experience an episode of MDD by the end of their teenage years, a figure that is increasing over time (Kessler et al., 2001). Despite over two decades of research examining the neural and molecular bases of depression, fewer than one-quarter of depressed adolescents respond to an initial course of antidepressant medication. Half of those who do respond to treatment continue to have disturbances in sleep and mood, fatigue, and concentration difficulties (Tao et al., 2010).

Given the enormous personal and societal costs and the high prevalence of MDD, and the difficulty in treating depression once it has developed, early detection and prevention is crucial. Prevention programs that are implemented before the first onset of disorder should significantly reduce the societal burden of adolescent and adult depression. Such programs, however, require that we accurately identify children at greater risk for the disorder (Gladstone et al., 2011), a goal that continues to be elusive as the etiology of major depression is poorly understood. Therefore, an important challenge in pediatric neuroimaging is to identify brain measures that are useful for predicting the onset of MDD. In this study, we used machine learning methods to test the hypothesis that structural patterns in cortical gray matter can distinguish adolescents who go on to develop clinically significant depression within five years of their initial assessment from adolescents who remain free of any Axis I disorder. As in previous studies (Grotegerd et al., 2013; Redlich et al., 2014), our SVM analysis was conducted using a mask that included areas of the cortex that have been widely implicated in neuroanatomical models of depression and of emotion regulation (e.g., ventral and superior frontal cortex, anterior cingulate cortex, insula).

Methods

Participants

The study was approved by Stanford University’s institutional review board. All adolescent participants provided written assent, and their mothers provided written informed consent. Participants in the current study sample included 33 adolescent girls, ages 10 to 15 years. We restricted our sampling to young adolescent girls to avoid the confound of sex differences on brain structure and depression vulnerability (Nolen-Hoeksema & Hilt, 2010) and because of previous findings indicating that the average age for first onset of adolescent depression is 15 years (Lewinsohn et al., 1994). Participants were recruited through advertisements posted in numerous locations (e.g., internet bulletin boards, university kiosks, supermarkets, etc.) describing a research study on depression in mothers. The mothers’ responses to a telephone interview provided initial selection information. This phone screen established that both mothers and daughters were fluent in English and that daughters were between 10 and 15 years of age. Participants were excluded if they had experienced severe head trauma, learning disabilities, current or past depression or other Axis I disorders, current or past substance abuse, or if they were taking psychotropic medications. The telephone interview was also used to identify mothers who were likely either to have no psychiatric history or to meet criteria for recurrent depression during their daughter’s lifetime. Those mother-daughter pairs who were considered likely to be eligible for participation were invited to come to the laboratory.

Baseline diagnostic status of participants was assessed by administering the structured Kiddie Schedule for Affective Disorders and Schizophrenia for School-Age Children–Present and Lifetime version (K-SADS-PL; Kaufman et al., 1997) to both the girls and their mothers (regarding the girls). In addition, the Structured Clinical Interview for the DSM-IV was administered to participants’ mothers to assess history of maternal depression (for more information about the use of these measures in this study, see Joormann et al., 2007; Gotlib et al., 2010; Chen et al., 2012). To monitor the course of depressive symptoms and the possible onset of depression, girls and their mothers returned to the laboratory for diagnostic reassessment every 18 months for up to 5 years, or until the onset of depression. Importantly, although investigators have estimated that between 12% (Merikangas et al., 2010) and 35% (Weissman et al., 1997) of adolescents will experience an initial onset of depression by the end of their teenage years, just over 50% of the participants in the current study had developed a depressive disorder by follow-up (see Results). This high incidence rate was due to our sampling approach: many of the adolescents were at familial risk for depression by virtue of having a mother with repeated episodes of depression during her lifetime (Table 1). More importantly, however, to qualify for inclusion in the converted group, participants needed to be followed only until the onset of a depressive episode; this took considerably less time from the baseline scan date than the time needed to qualify as for inclusion in the non-converted group (i.e., had not developed a depressive episode in the five years following the baseline scan date).

Table 1.

Demographic variables

Demographic variable Converted (N=18) Non-converted (N=15) p
Age at baseline (SD), years 13.0 (1.8) 13.7 (1.2) 0.25a
Baseline CDI-S score (SD) 0.83 (1.5) 0.73 (0.8) 0.81a
Baseline MASC score (SD) 39.3 (9.8) 40.5 (13.8) 0.77a
Baseline WISC-III vocabulary score (SD) 46.8 (5.1) 50.4 (6.7) 0.11a
Baseline SES 3.8 (1.5) 4.2 (0.7) 0.11b
Maternal history of MDD (%) 9 (50.0) 11 (73.3) 0.28 b
Number of traumatic events experienced before scanning 1.2 (0.9) 0.9 (1.1) 0.36a
Number of traumatic events experienced between baseline and follow-up assessment 1.6 (1.0) 1.9 (1.6) 0.84a

Values indicate the Mean ± SD unless otherwise noted. Abbreviations: CDI-S, Child Depression Inventory–Short Form; MASC, Multidimensional Anxiety Scale for Children; WISC-III, Wechsler Intelligence Scale for Children–III; SES, socioeconomic status; MDD, Major Depressive Disorder.

a

Statistic computed using two-sample t-tests;

b

statistic computed using χ2 test, Trauma was assessed using Post Traumatic Stress Disorder (PTSD) section of the K-SADS-PL.

SES was coded using annual household income (US dollars) as follows: 0 = less than 10,000, 1 = 10,000–25,000, 2 = 25,000–50,000, 3 = 50,000–75,000, 4 = 75,000–100,000, 5 = more than 100,000.

Magnetic Resonance Imaging Data Acquisition

Within one week of the baseline interview assessments, participants were scanned on a 1.5T GE scanner (GE Healthcare Systems, Milwaukee, Wisconsin). Anatomic images were obtained using a T1-weighted spoiled gradient-recalled (SPGR) echo sequence using the following parameters: repetition time (TR) = 8.924 ms, echo time (TE) = 1.792 ms, flip angle = 15°, in-plane resolution = 0.859 × 0.859 mm2, and slice thickness = 1.5 mm.

Magnetic Resonance Imaging Data Preprocessing

Scans were processed using FreeSurfer to produce measures of cortical gray matter thickness (Fischl & Dale, 2000; version 5.0, http://surfer.nmr.mgh.harvard.edu; Huang & Du, 2005; Salat et al., 2004). Processing streams included the removal of nonbrain tissue, intensity normalization, the segmentation of gray/white matter, and the alignment of each image volume to a standardized space. To estimate cortical gray matter thickness, a deformable surface algorithm was applied to segmented images to extract the pial and gray/white cortical surfaces (Dale et al., 1999). To ensure the accuracy of gray/white matter segmentation, exclusion of scalp and other non-brain tissue, and the inclusion of brain tissue, cortical surfaces were visually inspected by one rater (B.G.) who was blind to group membership. Manual corrections were performed by this rater where appropriate, in accordance with previously established procedures (Black et al., 2012; Yang et al., 2009). After spatially normalizing the data to an average template space, local cortical thickness was measured by estimating the shortest distance between spatially equivalent surface points on the pial surface and the gray–white matter boundary and vice versa and averaging these 2 values (Fischl & Dale, 2000). The cortical surface was then parcellated into 34 regions per hemisphere, as in Desikan et al. (Desikan et al., 2006). We chose to use cortical gray matter thickness as our primary measure in the current study, given prior work by our group and others documenting cortical thickness abnormalities in the context of depression risk (Foland-Ross et al., in press; Peterson et al., 2009; Papmeyer et al., 2014) and given evidence that this measure is less strongly influenced by individual variations in surface area than is volume (Winkler et al., 2010). However, investigations that use other aspects of brain structure (e.g., cortical surface area, gyrification index, cortical and subcortical volume) to predict subsequent onset of depression would also be informative in the broader understanding of a neural risk for this disorder.

Neuroanatomical Classification

SVM is a tool from the field of machine learning that classifies novel data with a classifier that has been trained on data labeled with true class labels (Braver et al., 1997). SVM has been successful when applied to psychiatric and neurological diagnosis, identifying prodromal states, and in predicting treatment prognosis (for a review, see Orrù et al., 2012). In the current study, class labels included two categorical groups: girls who developed a diagnosable episode of depression within five years (N=18; i.e., “converted”), and girls who did not develop any Axis I disorder within five years (N=15; i.e., “non-converted”). The feature set, or the data used for classification, was composed of regional thickness averages for a subset of brain areas previously implicated in neuroanatomical models of emotion and depression (Grotegerd et al., 2013; Redlich et al., 2014). The mask was created using the Desikan atlas according to the automated anatomical labeling definitions (Desikan et al., 2006). As with previous SVM studies of mood disorders (Grotegerd et al., 2013; Redlich et al., 2014). This large region of interest, which included nearly the entire frontal lobe and some parts of the temporal lobe was selected with the goal of including those neural regions most likely to substantially contribute to discriminative patterns in order to increase the performance of the classifiers. The complete list of subregions included in the mask is noted in the Supplement. Converted and non-converted girls in the current study did not differ with respect to age or intracranial volume (ts < 1.17, ps > 0.25). Moreover, exploratory analyses, conducted at the vertex level using a general linear model in QDEC (part of Freesurfer; http://surfer.nmr.mgh.harvard.edu/fswiki/Qdec) and carried out using regional thickness averages and a general linear model in SPSS (http://www-01.ibm.com/software/analytics/spss) indicated no significant or trend-level associations among cortical thickness, age, and total intracranial volume. Thus, the thickness averages used in our SVM were not adjusted for these factors. While the absence of an association between thickness and age may be surprising, at least one report has also noted a lack of such a relation (e.g., Pakkenberg & Gundersen, 1997). It is also possible, however, that the small sample size limited our power to detect such an association.

SVMs, by definition, find the hyperplane, or high-dimensional surface, that best differentiates two classes in a training data feature set. This differentiation is achieved by maximizing the “margin” between data points from the two classes (Cortes & Vapnik, 1995). The selected model can then be used to classify novel, or “test,” data points. Because the converted and non-converted groups did not contain equal sample sizes, which can bias classification estimates (Japkowicz, 2002; Hastie et al., 2005), we repeatedly randomly subsampled the larger (converted, N=18) group, such that it was the same size as the smaller group (non-converted, N=15). For each subsampled dataset, we used stratified 10-fold cross-validation (Kohavi, 1995) to estimate generalized accuracy, sensitivity, and specificity of classification for each (Hastie et al., 2005). More specifically, for each of 10 folds, disjoint subsets were created for training (N=27) and testing (N=3) the classifier such that no individual was ever in both sets. Such an approach has been shown to improve bias and variance estimates relative to other cross-validation schemes (Kohavi, 1995). To arrive at a stable estimate of classifier performance with this combined cross-validation and subsampling approach, we conducted 10-fold cross-validation 50 times with randomly identified subsamples and report average performance metrics across these subsamples. As described and implemented in Fu et al. (2008) and Redlich et al. (2014), we used the binomial test to assess whether classification performed above chance. Because the calculation of performance across subsamples could yield a non-integer value for performance (i.e., average number of successful classifications of 30 tests across subsamples), and the binomial distribution is discrete and defined for integers, we conducted binomial testing using rounded average performance across subsamples. To examine the features (i.e., brain regions) that contributed most strongly to classification, feature weights were computed according to the method described by De Martino et al. (2008). Briefly, this approach quantifies feature weights based on the weight of a given feature in the SVM-defined model (i.e., hyperplane), with greater weights indicating greater contribution to the model. We computed feature weight ranks using all individuals in a given subsample and then averaged these ranks across subsamples. Details of the derivation of the SVM-defined optimal hyperplane are presented in the Supplement.

Results

Participant Characteristics

Demographic and clinical characteristics of the participants are presented in Table 1. Eighteen of the 33 girls in this study (54.5%) subsequently developed depression (i.e., converted) within 5 years of the baseline scan (average age of onset = 16.2±1.8 years). Converted girls did not differ in age at baseline from non-converted girls, in their baseline scores on the Children’s Depression Inventory–Short Form (CDI-S; Kovacs, 1992), the Multidimensional Anxiety Scale for Children (MASC; March et al., 1997), or the vocabulary subscale of the Wechsler Intelligence Scale for Children–III (WISC-III; Weschler, 1991). There were also no group differences in baseline socioeconomic status, maternal history of depression, assessed using the Structured Clinical Interview for DSM-IV (American Psychiatric Association, 2000) or in the number of traumatic events experienced either before baseline scanning or between baseline and follow-up assessments, as measured with the Post Traumatic Stress Disorder (PTSD) section of the K-SADS-PL.

Of the 18 participants who converted, 11 (61.1%) endorsed 5 or more symptoms of depression on the K-SADS-PL and met diagnostic criteria for MDD. The remaining 7 girls endorsed 3 (16.7%) or 4 (22.2%) symptoms of depression and met criteria for a diagnosis of Depressive Disorder Not Otherwise Specified (DD-NOS). With respect to comorbidities, of the 18 girls who converted, 10 (55.6%) developed one or more anxiety disorders. No girls in the non-converted group developed an anxiety disorder, and no other psychiatric comorbidities were documented in either group.

Neuroanatomic Classification

Initial testing of classifier performance revealed an average generalized accuracy of 69.7% (69.3% sensitivity, 70.0% specificity) across cross-validated subsampled datasets. Statistical testing using the binomial test showed that classification accuracy was significantly better than chance (p = 0.021). Feature weight ranks indicated that the gray matter thickness of the right medial orbitofrontal cortex contributed most strongly to classification (i.e., exhibited the highest feature weight rank). Exploratory post-hoc two-tailed t-tests indicated that girls who subsequently developed depression had significantly thinner gray matter in this region than did girls who did not develop depression (average thickness: 2.45 versus 2.64 mm, respectively; t(31) = 3.15, uncorrected p = 0.004). The 10 regions that contributed the most information to the classifier are presented in Table 2 (see Supplement for a complete listing of all 32 regions). As shown in this table, gray matter of the right precentral gyrus, the left rostral anterior cingulate cortex, and the left and right insula were among the top five ranked features (Figure 1, Table 2). Post-hoc two-tailed t-tests of these 5 areas indicated that gray matter of the left insula was significantly thicker in girls who converted than in girls who did not convert (average thickness: 3.09 mm versus 2.94 mm, respectively; t(31) = −2.56, uncorrected p = 0.015); the two groups of girls did not differ significantly with respect to thickness in any other of the top 10 ranked regions (ts < 1.53, ps > 0.14).

Table 2.

Ten features having the greatest contribution to classification accuracy in the SVM

Overall Rank Average Rank Hemisphere Average regional thickness (SD), mm p Region
Non-converted Converted
1 1.00 right 2.64 (0.18) 2.45 (0.15) 0.004 Medial orbitofrontal cortex
2 2.52 right 2.58 (0.22) 2.64 (0.10) 0.351 Precentral gyrus
3 3.66 left 2.84 (0.20) 2.95 (0.19) 0.137 Rostral anterior cingulate cortex
4 5.22 left 2.95 (0.12) 3.09 (0.19) 0.015 Insula
5 6.10 right 2.97 (0.19) 2.95 (0.22) 0.770 Insula
6 8.38 left 2.53 (0.13) 2.57 (0.13) 0.312 Lateral orbital frontal cortex
7 8.52 right 2.75 (0.25) 2.87 (0.20) 0.156 Rostral anterior cingulate cortex
8 9.80 right 2.72 (0.13) 2.74 (0.18) 0.648 Inferior temporal gyrus
9 10.94 right 2.64 (0.18) 2.66 (0.16) 0.717 Superior temporal gyrus
10 11.14 right 2.60 (0.22) 2.48 (0.12) 0.051 Lateral orbital frontal cortex

Laterality, region name, and average feature weight rank of the highest ranked features are displayed. Regions having a lower overall rank contributed a greater amount of information to the classifier. Significance values indicate uncorrected p values from post-hoc comparisons of regional thickness averages between groups using two-tailed independent samples t-tests.

Figure 1.

Figure 1

Average feature weight ranks across support vector machines (SVMs) from all individuals from each of 50 subsamples for all 32 cortical regions included in our anatomical mask. Features with the greatest contributions toward classifier accuracy (and thus the highest average feature weight ranks) are coded in yellow and orange.

Discussion

Using prospective analyses and machine learning methods, we identified patterns of brain structure in adolescent girls that differentially predicted the subsequent onset of depression within five years. Specifically, the thickness of the orbitofrontal cortex, precentral gyrus, anterior cingulate, and insula were among the regions that provided the most information in distinguishing adolescents who went on to develop clinically significant depression from those who remained free of any Axis I disorder. Given the high costs of depression (Murray & Lobez, 1996), the difficulty in treating adolescent depression once it occurs (March et al., 2004), and the initial promise of early interventions (Gladstone et al., 2011), the set of biomarkers for the onset of depression identified here has the potential to make significant clinical implications. Indeed, distinguishing to determine which high-risk adolescents are at greatest risk for developing MDD and offering them prevention-focused intervention may significantly reduce the enormous negative personal and societal impact of this debilitating disorder.

The greatest contribution from any single brain region to classification came from the right medial orbitofrontal cortex. Therefore, structure in this area alone may serve as an important biomarker of the initial prodrome of depression. A role for the right medial orbitofrontal cortex in depression is well established. Alterations - primarily reductions - in medial orbitofrontal volume have been documented in several studies of chronically depressed adults (Bremner et al., 2002; Grieve et al., 2013; Koolschijn et al., 2009; Lacerda et al., 2004; Nakano et al., 2014). This direction of effects is consistent with post-hoc analyses of this region in the current study. Specifically, we found that girls who went on to develop depression at follow-up exhibited significant reductions in the thickness of this region at baseline relative to girls who remained well (uncorrected p = 0.004). Alterations of the medial orbitofrontal cortex, therefore, likely precede the emergence of depression and may be involved in the genesis of the disorder. It is interesting to note that the single largest neuroimaging study of familial risk for depression found that this same area of the medial orbitofrontal cortex showed an opposite direction of effects. That is, in this prior study, adults at familial risk for depression showed abnormally thick right medial orbitofrontal gray matter (Peterson et al., 2009). How this prior pattern of findings is associated with our own is unclear but may be related to differential neural factors underlying risk versus resilience. Because the mean age of participants in this prior study of risk were significantly older than the age of peak incidence for depression (Weissman et al., 1997), cortical thickening of this region could have signified some form of resilience to the disorder. Certainly, future studies that scan high- and low-risk individuals both before and following the onset (or the non-onset) of depression are needed to test this interpretation.

The medial orbitofrontal cortex plays a central role in monitoring the rewarding value of stimuli and in anticipating upcoming emotional states or reactions (Amodio & Frith, 2006). Thus, structural anomalies of this region could contribute to the onset of depression by disrupting the neural processing of reward. Consistent with this formulation, anomalous structure and function of the orbitofrontal cortex is one of the most frequently reported findings in neuroimaging studies of anhedonia (Gorwood, 2008), which has been conceptualized both as a core symptom of depression (Pizzagalli, 2014) and as a risk factor for the disorder (Pizzagalli, 2014; Rawal et al., 2013). Indeed, low reward-seeking behavior in adolescents at familial risk for depression has been found to predict depressive symptoms and the onset of MDD one year later, even when controlling for baseline symptom levels (Rawal et al., 2013). We do not know whether the thinner gray matter in the right medial orbitofrontal region of the converted girls in this study is related to anomalous reward-seeking behavior; future neuroimaging studies of developing adolescents that incorporate assessments of all of these constructs will be necessary to elucidate the behavioral correlates of the medial orbitofrontal abnormalities that we found in this study.

Exploratory analyses of the remaining nine features that contributed most strongly to classification accuracy in the SVM indicated that the converted and non-converted girls differed from each other in the thickness of only one region other than right medial orbitofrontal cortex: the left insula. This region, particularly the anterior segment, is central to the detection and processing of subjectively salient stimuli (Rolls, 1996) and to integrating this information for the initiation of appropriate control signals (Mennon & Uddin, 2010). The anterior insula is also involved in the experience of social emotions, such as guilt (Lamm & Singer, 2010) - a core symptom of depression in pediatric samples (Pagliaccio et al., 2014). Thus, it is possible that structural alterations in this area contribute to the onset of depression by influencing a disproportionate allocation of resources to the internal experience of emotion.

In contrast to our findings of thinner gray matter in the right medial orbitofrontal cortex, girls who subsequently developed a depressive disorder had greater thickness of the left insula than did girls who remained well. The direction of this effect is in contrast to a recent neuroimaging investigation of preschool children that reported smaller insula volumes to be associated with subsequent depression (Belden et al., 2015). Our finding that increased insula thickness predicted the onset of depression also contrasts with neuroimaging studies of depressed adults that report MDD-associated reductions in insula volume (Koolschijn et al., 2009). It is important to note, however, that several of the studies that have assessed brain structure in individuals scanned early in the course of their disorder have reported fronto-limbic increases in cortical gray matter associated with the initial onset of a depressive disorder (Qiu et al., 2014; Reynolds et al., 2014; van Eijndhoven et al., 2013). Thus, although speculative given our prospective study design, when taken in context with prior neuroimaging studies of first-episode and chronically ill individuals, our findings in the insula suggest a pathological process whereby cortical gray matter of this region is higher prior to the onset of the disorder, but ultimately becomes thinner as individuals experience repeated episodes of depression. While longitudinal studies are clearly required to test this formulation, one recent longitudinal study of young adults at familial risk for bipolar disorder reported that the onset of mood disorder at follow-up was associated with anomalous increases over time in the thickness of neighboring gray matter of the inferior frontal gyrus (Papmeyer et al., 2014). Adding support to a potential reversal in the direction of the association of insula anomalies and depression are studies showing that later in the course of the disorder, repeated episodes of depression are associated with a decline in gray matter in frontal lobe subregions. For example, Reynolds et al (2014) found that depressed adolescents showed a stronger inverse relation between age and middle frontal thickness than did never-depressed controls. Similarly, Frodl et al (2008) reported that adults who experienced one or more episodes of depression over a three-year time period exhibited greater gray matter reductions between baseline and follow-up scans than did adults who had remitted from depression. Investigations in which adolescents or young adults are followed longitudinally starting before the appearance of symptoms and continuing through the onset and course of depression, and in which other aspects of psychobiological functioning are assessed, will be critical in elucidating the temporal patterns of structural neural anomalies and their underlying causes.

A unique strength of the present study is the careful selection of the study sample. Although the onset of depression can occur at any age, the peak age of incidence ranges from 15 to 20 years (Weissman et al., 1997), a range immediately following the age at which the current sample was scanned (10–15 years). It is also noteworthy that the converted and non-converted girls in this study did not differ with respect to either maternal history of depression or the number of traumatic events reported prior to scanning or between the baseline and follow-up assessments. A primary limitation of the current study, however, is the sample size. The number of participants included in the groups of converted and non-converted girls was relatively small, and our findings should be replicated with larger samples. Importantly however, the feature-to-individual ratio in the current study was relatively large (1.07), which, combined with our random subsampling approach, was helpful for modeling these groups and reducing bias in our classification estimates (Raudys & Jain, 1991; Redlich et al., 2014). We should also note that 10 of the 18 girls who developed a depressive disorder by follow-up also developed a comorbid anxiety disorder, and 7 girls endorsed symptoms of depression and met criteria for a diagnosis of Depressive Disorder Not Otherwise Specified (DD-NOS). Future studies with larger samples that include participants who subsequently develop non-comorbid depression and depression-NOS, non-comorbid anxiety, and comorbid depression and anxiety are needed to accurately assess the specificity of our classifier in predicting MDD, MDD-NOS, comorbid MDD/anxiety and non-comorbid anxiety. We can also not exclude the possibilities that girls in our non-converted sample developed depression after 5 years from their scan and that including such girls in our non-converted group affected classifier performance. Future studies that test whether classification accuracy is influenced by the length of time between scan and the subsequent first onset of depression would be helpful in addressing this issue. Although we did not conduct immediate repeated scans of the participants in the same session, which would be required to formally test the reliability of our measurement of cortical thickness, we did use established procedures to maximize the accuracy of cortical thickness measurements in Freesurfer,; in addition, we ensured that raters were blind to group membership, and we used an identical scanner and similar acquisition parameters to those used by Dickerson et al. (2008), who found absolute measurements of cortical thickness to be highly reliable. Finally, because we focused in this study on structural neuroimaging predictors of subsequent depression, we cannot determine how variations in brain functioning might relate to a future onset of MDD. Given the growing literature regarding abnormalities in functional activations in adolescent depression (for review see Kerestes et al., 2014; Miller et al., 2015), it will be important in future research to examine whether patterns of neural activation predict onset of the first episode of depression.

In conclusion, using machine-learning methods we demonstrated that patterns of brain structure in adolescence can help in predicting the subsequent onset of depression. Our findings represent an important first step towards understanding the pathological neural processes that underlie the emergence of this disorder and highlight several important avenues for future research. First and foremost, given our relatively small sample size, it is important that future studies, using larger samples and alternate cross-validation designs, replicate the anomalies in medial orbitofrontal and insular gray matter thickness found here. It is also important that future studies attempt to clarify the pathophysiological processes at play in the development of disorder. Such studies could, for example, assess whether alterations in cortical thickness influence the onset of depression directly or indirectly by mediating the emergence of risk-related biases in cognitive functioning. It is imperative that we continue to address this and other questions concerning the prediction of MDD in pediatric populations using multivariate approaches. This effort will elucidate biomarkers that can better identify individuals at high risk for developing depression in order to reduce the economic, societal, and personal costs of this recurrent and debilitating disorder.

Supplementary Material

1
2
3

Highlights.

  • We examined whether cortical thickness predicts the onset of depression in youth

  • We used machine learning to identify neural structures involved in this prediction

  • Overall classification accuracy was 70% (69% sensitivity, 70% specificity)

  • The right medial orbitofrontal cortex contributed most strongly to classification

  • Thus, cortical gray matter structure can predict the subsequent onset of depression

Acknowledgments

We thank all the participants and their families for taking part in this study and the funding agencies that supported this work, including grants from the National Institute of Mental Health (MH74849 to IHG, MH090617 to LCFR and MH020016 to MDS), the National Alliance for Research in Schizophrenia and Affective Disorders (NARSAD; Distinguished Investigator Award to IHG and Young Investigator Award 19018 to LCFR), the National Science Foundation (Integrative Graduate Education and Research Traineeship (IGERT) 0801700 and Graduate Research Fellowship Program (GRFP) DGE-1147470 to MDS) and from the Hope for Depression Research Foundation (to IHG and LCFR).

Footnotes

Conflict of Interest

None declared.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Amodio DM, Frith CD. Meeting of minds: the medial frontal cortex and social cognition. Nature Reviews Neuroscience. 2006;7:268–277. doi: 10.1038/nrn1884. [DOI] [PubMed] [Google Scholar]
  2. Belden AC, Barch DM, Oakberg TJ, April LM, Harms MP, Botteron KN, Luby JL. Anterior insula volume and guilt: neurobehavioral markers of recurrence after early childhood major depressive disorder. JAMA Psychiatry. 2015;72:40–48. doi: 10.1001/jamapsychiatry.2014.1604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Black JM, Tanaka H, Stanley L, Nagamine M, Zakerani N, Thurston A, Kesler S, Hulme C, Lyytinen H, Glover GH, Serrone C, Raman MM, Reiss AL, Hoeft F. Maternal history of reading difficulty is associated with reduced language-related gray matter in beginning readers. NeuroImage. 2012;59:3021–3032. doi: 10.1016/j.neuroimage.2011.10.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Braver TS, Cohen JD, Nystrom LE, Jonides J, Smith EE, Noll DC. A parametric study of prefrontal cortex involvement in human working memory. NeuroImage. 1997;5:49–62. doi: 10.1006/nimg.1996.0247. [DOI] [PubMed] [Google Scholar]
  5. Bremner JD, Vythilingam M, Vermetten E, Nazeer A, Adil J, Khan S, Staib LH, Charney DS. Reduced volume of orbitofrontal cortex in major depression. Biological Psychiatry. 2002;51:273–279. doi: 10.1016/s0006-3223(01)01336-1. [DOI] [PubMed] [Google Scholar]
  6. Chen MC, Burley HW, Gotlib IH. Reduced sleep quality in healthy girls at risk for depression. Journal of Sleep Research. 2012;21:68–72. doi: 10.1111/j.1365-2869.2011.00934.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cortes C, Vapnik V. Support-vector networks. Machine Learning. 1995;20(3):273–297. [Google Scholar]
  8. Dale AM, Fischl B, Sereno MI. Cortical surface-based analysis. I. Segmentation and surface reconstruction. NeuroImage. 1999;9:179–194. doi: 10.1006/nimg.1998.0395. [DOI] [PubMed] [Google Scholar]
  9. De Martino F, Valente G, Staeren G, Ashburner J, Goebel R, Formisano E. Combining multivariate voxel selection and support vector machines for mapping and classification of functional MRI spatial patterns. Neuroimage. 2008;43:44–58. doi: 10.1016/j.neuroimage.2008.06.037. [DOI] [PubMed] [Google Scholar]
  10. Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, Buckner RL, Dale AM, Maguire RP, Hyman BT, Albert MS, Killiany RJ. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage. 2006;31:968–980. doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
  11. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 4. Washington, DC: American Psychiatric Association; 2000. Text Revision. [Google Scholar]
  12. Dickerson BC, Fenstermacher E, Salat DH, Wolk DA, Maguire RP, Desikan R, Pacheco J, Quinn BT, Van der Kouwe A, Greve DN, Blacker D, Albert MS, Killiany RJ, Fischl B. Detection of cortical thickness correlates of cognitive performance: Reliability across MRI scan sessions, scanners, and field strengths. Neuroimage. 2008;39:10–18. doi: 10.1016/j.neuroimage.2007.08.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fischl B, Dale AM. Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:11050–11055. doi: 10.1073/pnas.200033797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Foland-Ross LC, Gilbert BL, Joormann J, Gotlib IH. Neural markers of familial risk for depression: an investigation of cortical thickness abnormalities in healthy adolescent daughters of mothers with recurrent depression. Journal of Abnormal Psychology. doi: 10.1037/abn0000050. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Frodl TS, Koutsouleris N, Bottlender R, Born C, Jäger M, Scupin I, Reiser M, Möller HJ, Meisenzahl EM. Depression-related variation in brain morphology over 3 years: effects of stress? Archives of General Psychiatry. 2008;65:1156–1165. doi: 10.1001/archpsyc.65.10.1156. [DOI] [PubMed] [Google Scholar]
  16. Fu CH, Mourao-Miranda J, Costafreda SG, Khanna A, Marquand AF, Williams SC, Brammer MJ. Pattern classification of sad facial processing: toward the development of neurobiological markers in depression. Biological Psychiatry. 2008;63:656–662. doi: 10.1016/j.biopsych.2007.08.020. [DOI] [PubMed] [Google Scholar]
  17. Gladstone TRG, Beardslee WR, O’Connor EE. The prevention of adolescent depression. Psychiatric Clinics of North America. 2011;34:35–52. doi: 10.1016/j.psc.2010.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gorwood P. Neurobiological mechanisms of anhedonia. Dialogues in Clinical Neuroscience. 2008;10:291–299. doi: 10.31887/DCNS.2008.10.3/pgorwood. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gotlib IH, Hamilton JP, Cooney RE, Singh MK, Henry ML, Joormann J. Neural processing of reward and loss in girls at risk for major depression. Archives of General Psychiatry. 2010;67:380–387. doi: 10.1001/archgenpsychiatry.2010.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Grieve SM, Korgaonkar MS, Koslow SH, Gordon E, Williams LM. Widespread reductions in gray matter volume in depression. NeuroImage : Clinical. 2013;3:332–339. doi: 10.1016/j.nicl.2013.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Grotegerd D, Suslow T, Bauer J, Ohrmann P, Arolt V, Stuhrmann A, Heindel W, Kugel H, Dannlowski U. Discriminating unipolar and bipolar depression by means of fMRI and pattern classification: a pilot study. European Archives of Psychiatry and Clinical Neuroscience. 2013;263:119–131. doi: 10.1007/s00406-012-0329-4. [DOI] [PubMed] [Google Scholar]
  22. Hastie T, Tibshirani R, Friedman J, Franklin J. The elements of statistical learning: data mining, inference and prediction. The Mathematical Intelligencer. 2005;27:83–85. [Google Scholar]
  23. Huang YM, Du S. Weighted support vector machine for classification with uneven training class sizes. Proceedings of 2005 International Conference on Machine Learning and Cybernetics; 2005; 2005. pp. 4365–4369. [Google Scholar]
  24. Japkowicz N, Stephen S. The class imbalance problem: A systematic study. Intelligent Data Analysis. 2002;6:429–449. [Google Scholar]
  25. Joormann J, Talbot L, Gotlib IH. Biased processing of emotional information in girls at risk for depression. Journal of Abnormal Psychology. 2007;116:135. doi: 10.1037/0021-843X.116.1.135. [DOI] [PubMed] [Google Scholar]
  26. Kaufman J, Birmaher B, Brent D, Rao U, Flynn C, Moreci P, Williamson D, Ryan N. Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version (K-SADS-PL): initial reliability and validity data. Journal of the American Academy of Child and Adolescent Psychiatry. 1997;36:980–988. doi: 10.1097/00004583-199707000-00021. [DOI] [PubMed] [Google Scholar]
  27. Kerestes R, Davey CG, Stephanou K, Whittle S, Harrison BJ. Functional brain imaging studies of youth depression: a systematic review. Neuroimage Clinical. 2014;4:209–231. doi: 10.1016/j.nicl.2013.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kessler RC, Avenevoli S, Ries Merikangas K. Mood disorders in children and adolescents: an epidemiologic perspective. Biological Psychiatry. 2001;49:1002–1014. doi: 10.1016/s0006-3223(01)01129-5. [DOI] [PubMed] [Google Scholar]
  29. Kessler RC, Berglund P, Demler O, Jin R, Koretz D, Merikangas KR, Rush AJ, Walters EE, Wang PS National Comorbidity Survey Replication. The epidemiology of major depressive disorder: Results from the National Comorbidity Survey Replication (NCS-R) JAMA. 2003;289:3095–3105. doi: 10.1001/jama.289.23.3095. [DOI] [PubMed] [Google Scholar]
  30. Kohavi R. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. International Joint Conference on Artificial Intelligence; 1995. pp. 1137–1145. [Google Scholar]
  31. Koolschijn PC, van Haren NE, Lensvelt-Mulders GJ, Hulshoff Pol HE, Kahn RS. Brain volume abnormalities in major depressive disorder: a meta-analysis of magnetic resonance imaging studies. Human Brain Mapping. 2009;30:3719–3735. doi: 10.1002/hbm.20801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kovacs M. Children’s Depression Inventory. 2. Multi-Health Systems; 1992. [Google Scholar]
  33. Lacerda AL, Keshavan MS, Hardan AY, Yorbik O, Brambilla P, Sassi RB, Nicoletti M, Mallinger AG, Frank E, Kupfer DJ, Soares JC. Anatomic evaluation of the orbitofrontal cortex in major depressive disorder. Biological Psychiatry. 2004;55:353–358. doi: 10.1016/j.biopsych.2003.08.021. [DOI] [PubMed] [Google Scholar]
  34. Lamm C, Singer T. The role of anterior insular cortex in social emotions. Brain Structure & Function. 2010;214:579–591. doi: 10.1007/s00429-010-0251-3. [DOI] [PubMed] [Google Scholar]
  35. Lewinsohn PM, Clarke GN, Seeley JR, Rohde P. Major depression in community adolescents: Age at onset, episode duration, and time to recurrence. Journal of the American Academy of Child and Adolescent Psychiatry. 1994;33:809–818. doi: 10.1097/00004583-199407000-00006. [DOI] [PubMed] [Google Scholar]
  36. Lewinsohn PM, Rohde P, Seeley JR. Treatment of adolescent depression: frequency of services and impact on functioning in young adulthood. Depression and Anxiety. 1998;7:47–52. doi: 10.1002/(sici)1520-6394(1998)7:1<47::aid-da6>3.0.co;2-2. [DOI] [PubMed] [Google Scholar]
  37. March JS, Parker JDA, Sullivan K, Stallings P, Conners CK. The Multidimensional Anxiety Scale for Children (MASC): Factor Structure, Reliability, and Validity. Journal of the American Academy of Child & Adolescent Psychiatry. 1997;36:554–565. doi: 10.1097/00004583-199704000-00019. [DOI] [PubMed] [Google Scholar]
  38. March JS, Silva S, Petrycki S. Treatment for Adolescents with Depression Study Team. Fluoxetine, cognitive-behavioral therapy, and their combination for adolescents with depression. JAMA. 2004;292:807–820. doi: 10.1001/jama.292.7.807. [DOI] [PubMed] [Google Scholar]
  39. Mennon V, Uddin LQ. Saliency, switching, attention and control: a network model of insula function. Brain Structure and Function. 2010;214:655–667. doi: 10.1007/s00429-010-0262-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Merikangas KR, He JP, Burstein M, Swanson SA, Avenevoli S, Cui L, Benjet C, Georgiades K, Swendsen J. Lifetime prevalence of mental disorders in U.S. adolescents: results from the National Comorbidity Survey Replication--Adolescent Supplement (NCS-A) Journal of the American Academy of Child and Adolescent Psychiatry. 2010;49:980–989. doi: 10.1016/j.jaac.2010.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Miller C, Hamilton JP, Sacchet MD, Gotlib IH. A quantitative whole-brain meta-analysis of emotion processing in adolescent Major Depressive Disorder. JAMA Psychiatry (in press) [Google Scholar]
  42. Murray C, Lobez A. The global burden of disease. Cambridge: Harvard School of Public Health; 1996. [Google Scholar]
  43. Nakano M, Matsuo K, Nakashima M, Matsubara T, Harada K, Egashira K, Masaki H, Takahashi K, Watanabe Y. Gray matter volume and rapid decision-making in major depressive disorder. Progress in Neuro-Psychopharmacology & Biological Psychiatry. 2014;48:51–56. doi: 10.1016/j.pnpbp.2013.09.011. [DOI] [PubMed] [Google Scholar]
  44. Nolen-Hoeksema S, Hilt LM. Handbook of Depression. 2. Guilford Press; 2010. Gender differences in depression; pp. 386–404. [Google Scholar]
  45. Orrù G, Pettersson-Yeo W, Marquand AF, Sartori G, Mechelli A. Using Support Vector Machine to identify imaging biomarkers of neurological and psychiatric disease: a critical review. Neuroscience and Biobehavioral Reviews. 2012;36:1140–1152. doi: 10.1016/j.neubiorev.2012.01.004. [DOI] [PubMed] [Google Scholar]
  46. Pagliaccio D, Luby JL, Luking KR, Belden AC, Barch DM. Brain-behavior relationships in the experience and regulation of negative emotion in healthy children: implications for risk for childhood depression. Development and Psychopathology. 2014;26:1289–1303. doi: 10.1017/S0954579414001035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Pakkenberg B, Gundersen HJ. Neocortical neuron number in humans: effect of sex and age. The Journal of Comparative Neurology. 1997;384:312–320. [PubMed] [Google Scholar]
  48. Papmeyer M, Giles S, Sussmann JE, Kielty S, Stewart T, Lawrie SM, Whalley HC, McIntosh AM. Cortical Thickness in Individuals at High Familial Risk of Mood Disorders as They Develop Major Depressive Disorder. Biological Psychiatry. 2014 doi: 10.1016/j.biopsych.2014.10.018. [DOI] [PubMed] [Google Scholar]
  49. Peterson B, Warner V, Bansal R, Zhu H, Hao X, Liu J, Durkin K, Adams PB, Wickramaratne P, Weissman M. Cortical thinning in persons at increased familial risk for major depression. Proceedings of the National Academy of Sciences of the United States of America. 2009;106:6273–6278. doi: 10.1073/pnas.0805311106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Pizzagalli DA. Depression, stress, and anhedonia: toward a synthesis and integrated model. Annual Review of Clinical Psychology. 2014;10:393–423. doi: 10.1146/annurev-clinpsy-050212-185606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Qiu L, Lui S, Kuang W, Huang X, Li J, Li J, Zhang J, Chen H, Sweeney JA, Gong Q. Regional increases of cortical thickness in untreated, first-episode major depressive disorder. Translational Psychiatry. 2014;4:e378. doi: 10.1038/tp.2014.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Raudys SJ, Jain AK. Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1991;13:252–264. [Google Scholar]
  53. Rawal A, Collishaw S, Thapar A, Rice F. “The risks of playing it safe”: a prospective longitudinal study of response to reward in the adolescent offspring of depressed parents. Psychological Medicine. 2013;43:27–38. doi: 10.1017/S0033291712001158. [DOI] [PubMed] [Google Scholar]
  54. Redlich R, Almeida JJR, Grotegerd D, Opel N, Kugel H, Heindel W, Arolt V, Phillips ML, Dannlowski U. Brain morphometric biomarkers distinguishing unipolar and bipolar depression. A voxel-based morphometry-pattern classification approach. JAMA Psychiatry. 2014;71:1222–1230. doi: 10.1001/jamapsychiatry.2014.1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Reynolds S, Carrey N, Jaworska N, Langevin LM, Yang XR, Macmaster FP. Cortical thickness in youth with major depressive disorder. BMC Psychiatry. 2014;14:83. doi: 10.1186/1471-244X-14-83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Rolls E. The orbitofrontal cortex. Philosophical Transactions of the Royal Society of London. 1996;351:1433–1444. doi: 10.1098/rstb.1996.0128. [DOI] [PubMed] [Google Scholar]
  57. Salat D, Buckner R, Snyder A, Greve D, Desikan R, Busa E, Morris JC, Dale AM, Fischl B. Thinning of the cerebral cortex in aging. Cerebral Cortex. 2004;14:721–730. doi: 10.1093/cercor/bhh032. [DOI] [PubMed] [Google Scholar]
  58. Tao R, Emslie GJ, Mayes TL, Nakonezny PA, Kennard BD. Symptom improvement and residual symptoms during acute antidepressant treatment in pediatric major depressive disorder. Journal of Child and Adolescent Psychopharmacology. 2010;20:423–430. doi: 10.1089/cap.2009.0116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Van Eijndhoven P, van Wingen G, Katzenbauer M, Groen W, Tepest R, Fernández G, Buitelaar J, Tendolkar I. Paralimbic cortical thickness in first-episode depression: evidence for trait-related differences in mood regulation. The American Journal of Psychiatry. 2013;170:1477–1486. doi: 10.1176/appi.ajp.2013.12121504. [DOI] [PubMed] [Google Scholar]
  60. Weissman MM, Warner V, Wickramaratne P, Moreau D, Olfson M. Offspring of depressed parents. 10 Years later. Archives of General Psychiatry. 1997;54:932–940. doi: 10.1001/archpsyc.1997.01830220054009. [DOI] [PubMed] [Google Scholar]
  61. Weschler D. The Weschler Intelligence Scale for Children. Psychological Corporation; 1991. [Google Scholar]
  62. Winkler AM, Kochunov P, Blangero J, Almasy L, Zilles K, Fox PT, Duggirala R, Glahn DC. Cortical thickness or grey matter volume? The importance of selecting the phenotype for imaging genetics studies. Neuroimage. 2010;53:1135–1146. doi: 10.1016/j.neuroimage.2009.12.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Whiteford HA, Degenhardt L, Rehm J, Baxter AJ, Ferrari AJ, Erskine HE, Charlson FJ, Norman RE, Flaxman AD, Johns N, Burstein R, Murray CJ, Vos T. Global burden of disease attributable to mental and substance use disorders: findings from the Global Burden of Disease Study 2010. Lancet. 2013;382:1575–1586. doi: 10.1016/S0140-6736(13)61611-6. [DOI] [PubMed] [Google Scholar]
  64. Yang Y, Raine A, Colletti P, Toga AW, Narr KL. Abnormal temporal and prefrontal cortical gray matter thinning in psychopaths. Molecular Psychiatry. 2009;14:561–562. 555. doi: 10.1038/mp.2009.12. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2
3

RESOURCES