Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Jul 1.
Published in final edited form as: Biol Psychiatry. 2019 Dec 23;88(1):51–62. doi: 10.1016/j.biopsych.2019.12.015

Approaches to Defining Common and Dissociable Neurobiological Deficits Associated with Psychopathology in Youth

Antonia N Kaczkurkin a, Tyler M Moore b, Aristeidis Sotiras c, Cedric Huchuan Xia b, Russell T Shinohara d, Theodore D Satterthwaite b
PMCID: PMC7305976  NIHMSID: NIHMS1569528  PMID: 32087950

Abstract

Psychiatric disorders show high rates of co-morbidity and non-specificity of presenting clinical symptoms, while at the same time demonstrating substantial heterogeneity within diagnostic categories. Notably, many of these psychiatric disorders first manifest in youth. Here we review progress and next steps in efforts to parse heterogeneity in psychiatric symptoms in youth by identifying abnormalities within neural circuits. To address this fundamental challenge in psychiatry, a number of methods have been proposed. We provide an overview of these methods, broadly organized into dimensional vs. categorical approaches and single-view vs. multi-view approaches. Dimensional approaches including factor analysis and canonical correlation analysis aim to capture dimensional associations between psychopathology and brain measures across a continuous spectrum from health to disease. In contrast, categorical approaches such as clustering and community detection aim to identify subtypes of individuals within a class of symptoms or brain features. We highlight several studies that apply these methods to samples of youth, and discuss issues to consider when using these approaches. Finally, we end by highlighting avenues for future research.

Keywords: heterogeneity, neurobiology, imaging, psychopathology, Youth, adolescents

1. Introduction

Youth, which we define broadly as childhood, adolescence, and young adulthood, is a period during which many psychiatric disorders first manifest (1, 2). It is also a time of marked development in brain structure and function (3, 4). Despite dramatic advances in neuroimaging methodology that have made it possible to measure the structure and function of neural circuits, attempts to isolate neurobiological substrates of psychopathology have been hampered by the simultaneous comorbidity among and heterogeneity within psychiatric diagnoses. For example, psychiatric disorders share many presenting clinical symptoms (5) which likely contributes to the apparent non-specificity of neural mechanisms associated with psychopathology (613). At the same time, there is considerable heterogeneity in the presentation of clinical symptoms (1417). There is increasing interest in understanding clinical heterogeneity in psychopathology in terms of underlying biological mechanisms (18), which may provide the basis for a biologically-based nosology for mental disorders. However, thus far, the majority of this work has been restricted to adult samples.

The goal of this review is to introduce different approaches to understanding neurobiological heterogeneity within psychiatric disorders, with a focus on studies using these approaches in samples of youth. Specifically, we concentrate on methods for defining common and dissociable neurobiological deficits associated with psychopathology. Notably, the focus of this review is not on trajectories of developmental change or theoretical nosological debates. For an overview of patterns of brain development using longitudinal studies, we refer to the review by Becht and Mills in this special issue. For current nosological debates, Bornovalova et al. in this special issue provide a comprehensive review of issues concerning the bifactor model. Instead, our goal is to introduce the reader to a range of approaches for parsing common and dissociable neurobiological heterogeneity, while highlighting the similarities and differences between the methods. Of note, we focus only on methods that have been applied in samples of youth. While we do not provide a comprehensive tutorial on any particular method, we refer the reader to more detailed treatments of these methods when available. We organize this review according to two axes that aim to understand the complex mapping between neural deficits and clinical symptoms: dimensional vs. categorical approaches and single-view vs. multi-view approaches. We begin by defining the four broad approaches, and then highlight studies that apply these methods to samples of youth. We follow this with a discussion of the issues to consider when using these approaches, and end by discussing considerations for future research.

2. Dimensional vs. categorical and single-view vs. multi-view approaches

When reviewing methods for parsing heterogeneity, one important distinction is between dimensional vs. categorical approaches. Here. we are referring to whether the approach produces dimensions (continuous variables) or categories (clusters or subtypes) of the measure of interest. Specifically, dimensions represent the loadings onto symptoms and each individual receives a dimensional score, while categories represent subtypes of people who share features in common and each individual is classified into a category. Research has shown that many psychiatric symptoms exist on a continuum, with diagnosable psychopathology being an extreme phenotype of variation that is present in the general population (19). Dimensional approaches are able to account for this continuous spectrum from health to disease including subthreshold levels of psychopathology. However, multiple mechanisms may drive the extreme phenotypes of psychopathology and one can also use categorical approaches to identify subtypes of psychopathology.

A second important distinction is between single-view and multi-view methodological approaches. This refers to the nature of the input data, which can be symptoms, brain features, or both. Any data type can be considered, but here we limit our review to clinical symptoms and neuroimaging measures given our focus on neurobiological heterogeneity in psychopathology. Single-view approaches consider data from a single feature set (e.g., symptoms). The output can be dimensional (a spectrum of symptoms) or categorical (subtypes of people), but the input reflects a single data type. In contrast, multi-view approaches use input data from multiple feature sets, such as integrating both symptoms and brain features (20). Again, the output can be dimensional (continuous dimensions representing combinations of symptoms and brain features) or categorical (subtypes characterized by different combinations of symptoms and brain features). Collectively, we can organize the approaches discussed into this dimensional vs. categorical and single-view vs. multi-view framework.

2.1. Single-view dimensional approaches

Single-view dimensional approaches take as input a single feature set (clinical symptoms or brain measures) and produce dimensional output. One use of this approach is to reduce a large feature set into a smaller number of latent summary variables for discovery of hidden relationships and/or for data reduction. For example, approaches such as independent component analysis (ICA), principal component analysis (PCA), and non-negative matrix factorization (NMF) can be used to reduce high-dimensional symptoms or brain features into a smaller number of components (2131). Another common approach involves factor analysis, which summarizes a large number of psychiatric symptoms into latent dimensions that can then be related to various neurobiological measures (3239). Two models for this purpose include correlated traits and bifactor models (40). Correlated traits models (e.g., factors from an exploratory factor analysis) produce correlated symptom factors. In contrast, bifactor models reveal a hierarchical structure of symptoms including a general psychopathology (p) factor that represents the overall burden of psychopathology across disorders (Figure 1) (41). Akin to the “g” factor in general intelligence, the p factor represents the symptoms that psychiatric disorders share in common (42). In addition to p, a bifactor model identifies uncorrelated subfactors of psychopathology, such as factors for internalizing/fear, anxious-misery/distress, externalizing/behavioral, and psychosis/thought disorder (3239).

Figure 1. Examples of representative approaches: single-view vs. multi-view and dimensional vs. categorical.

Figure 1.

Schematic illustrating the four broad approaches surveyed in this review. A) The input data for each approach may include clinical symptoms, imaging features, or a combination of the two. B) Each approach is illustrated with an exemplar technique (note that many other methods are available as well). Single-view dimensional approaches (e.g., bifactor model) take as input a single data type, such as clinical symptoms, and output latent dimensions that summarize the data. Multi-view dimensional approaches (e.g., CCA) take as input two data types and identify linear combinations of the two. Single-view categorical approaches (e.g., clustering) find subtypes based on a single feature set. Multi-view categorical approaches (e.g., multi-view biclustering) find subtypes based on multiple views of the data, where the input is two or more feature sets.

The p factor has been associated with a number of neurobiological measures in youth, including reduced gray matter volume (22, 43), reduced activity in executive regions (44), elevated resting-state CBF (45), reduced fractional anisotropy (46), and delay in connectome distinctiveness (47) (Figure 2). In addition, there also exist dissociable deficits specific to the symptom domains of fear, anxious-misery/distress, behavioral/externalizing, and psychosis/thought disorder. For example, factors related to internalizing symptoms are associated with reduced gray matter volume in specific regions (43), widespread hyperactivation of the executive network (44), reduced cortical thickness (22), and specific abnormalities in CBF (45). The bifactor model is useful for summarizing symptoms into dimensions that capture common variance across disorders (p factor) and unique variance within specific classes of symptoms. The studies applying this method to samples of youth and then relating these factors to neuroimaging measures suggest that there may exist both common and dissociable neurobiological substrates of psychopathology in youth.

Figure 2. General psychopathology is associated with common neurobiological deficits in youth.

Figure 2.

Single-view dimensional approaches that identify a general factor common across disorders in youth reveal: A) globally reduced gray matter volume (22); B) bilateral gray matter volume reductions in regions chosen a priori including dorsal prefrontal cortex (DPFC), ventrolateral prefrontal cortex (VLPFC), and orbitofrontal cortex (OFC) (43); C) reduced activation in regions within the cingulo-opercular control network during an N-back working memory task (44); D) reduced fractional anisotropy in the genu and body of the corpus callosum (46); E) elevated cerebral blood flow in the dorsal and rostral anterior cingulate (45); and F) delay in connectome distinctiveness compared to healthy controls across the whole brain (47). All figures reprinted with permission (Copyright ©2017, Nature Neuroscience, Springer Nature; Copyright ©2019, Behavioural Brain Research, Elsevier; Copyright ©2017, Clinical Psychological Science, SAGE Publications; Copyright ©2016 and ©2019, American Journal of Psychiatry, American Psychiatric Association. All Rights Reserved).

2.2. Multi-view dimensional approaches

While single-view dimensional approaches only consider a single data type as input, multi-view dimensional approaches take as input two or more feature sets (symptoms and brain) and produce dimensional summaries of the interrelationships between the feature sets. Two commonly used multi-view dimensional methods are partial least squares (PLS) regression and canonical correlation analysis (CCA) (4851). Both methods seek to find linear combinations of brain features that predict linear combinations of clinical symptoms (Figure 1). A growing number of studies have used PLS regression and CCA to link neurobiological measures to psychopathology in adults (5266).

In youth, CCA has been used to link functional connectivity patterns to behavioral measures including demographics, IQ, and a variety of self-report measures (67). However, the participants of this study included healthy youth (n=281) and a much smaller number with major depression (n=25); as a result, there was relatively limited psychopathology present. It should also be noted that the assumptions of CCA often do not hold in high-dimensional imaging data (68). Alternative methods such as sparse CCA (sCCA) are able to overcome these limitations in high-dimensional data by simplifying the model to avoid overfitting and increase interpretability (6971). Our group has used sCCA to link dimensional psychopathology symptoms to functional connectivity measures in a large sample of 663 youth (72). The results revealed that mood, psychosis, fear, and externalizing behavior were associated with distinct patterns of connectivity, while loss of network segregation between the default mode and executive networks was common across these dimensions (72) (Figure 3). Methods such as PLS regression and CCA are useful for measuring brain-behavior relationships. However, compared to adults, there have been relatively few studies applying these methods in younger populations, suggesting the need for additional studies in samples of youth in order to replicate these results.

Figure 3. Sparse CCA links dimensions of psychopathology to functional connectivity patterns in youth.

Figure 3.

A) sCCA reveals three symptom dimensions (mood, psychosis, and fear) that were statistically significant, with the fourth dimension (externalizing behavior) showing an effect at uncorrected thresholds. B) Scatter plots showing linear combinations of functional connectivity and psychiatric symptoms demonstrate the correlated multivariate patterns of connectomic and clinical features. C) Connectivity-informed dimensions of psychopathology cross clinical diagnostic categories. Specifically, the mood dimension was composed of a mixture of depressive symptoms, suicidality, irritability, and recurrent thoughts of self-harm. The psychotic dimension was composed of psychosis-spectrum symptoms, as well as two manic symptoms. The fear dimension was comprised of social phobia and agoraphobia symptoms. The externalizing behavior dimension showed a mixture of symptoms from attention-deficit/hyperactivity and oppositional defiant disorders, as well as irritability from the depression section (72). Reprinted with permission (http://creativecommons.org/licenses/by/4.0/).

2.3. Single-view categorical approaches

Single-view categorical approaches take a single modality as input (symptoms or brain features) and produce clusters or subtypes of individuals. Notably, these techniques may also be used to discover clusters of variables; however, here we focus on the use of these methods to reveal clusters of individuals with common features. While many studies have clustered on psychiatric symptoms, behavioral measures, or neuropsychological performance in both adults and youth (7393), here we focus on studies that cluster based on imaging measures. A number of studies have identified subtypes based on imaging features in adults (60, 94108).

Neurobiological subtypes of internalizing/externalizing symptoms have been identified in samples as young as infants (109). Additionally, community detection has been used to identify subtypes of youth with ADHD using measures of intrinsic functional connectivity (110, 111). Our group has used a recently developed machine learning method, HYDRA (94), in 1,141 youth to identify two subtypes of internalizing youth differentiated by abnormalities in brain structure, function, and white matter integrity, with one subtype showing poorer functioning across multiple domains (112) (Figure 4). Unlike many clustering approaches, HYDRA accounts for variability in the controls while uncovering subtypes in the symptomatic group. These approaches are potentially useful because they seek to identify subtypes that “carve nature at its joints” based on underlying neurobiology, rather than relying on symptom measures. Studies using this approach in samples of youth are just beginning to emerge.

Figure 4. HYDRA identifies two neurostructural subtypes of internalizing youth.

Figure 4.

A) A linear support vector machine (SVM) is a discriminative classifier defined by a separating hyperplane, shown here as a gray line. Linear SVMs fail to capture the heterogeneity that exists in the patients, indicated by blue, green, and purple subtypes within the cases. B) Conversely, methods such as HYDRA (94) can estimate multiple linear hyperplanes (green lines) whose segments separate the clusters of cases from the controls. This approach makes HYDRA more flexible than SVMs, facilitating the identification of heterogeneous subtypes of patients. C) HYDRA identified two subtypes of internalizing youth with a high degree of reliability. Subtype 1 and 2 both showed significant psychopathology compared to typically developing youth. D) Subtype 1 showed significantly worse performance than the other two groups on cognitive measures, especially executive functioning tasks. E) In terms of structural measures, subtype 1 showed smaller volumes than the other two groups consistently across the brain. F) Subtype 1 also showed reduced cortical thickness in most regions. G) Subtype 1 demonstrated reduced resting-state ALFF (amplitude of low-frequency fluctuations) in frontal regions, the right amygdala, and the right hippocampus. H) Finally, subtype 1 showed reduced fractional anisotropy in a number of white matter tracts. Taken together, this study showed that clustering approaches can be used to identify reliable subtypes of internalizing youth, with subtype 1 showing greater deficits across symptoms, cognition, and brain structure (112).

2.4. Multi-view categorical approaches

Multi-view categorical approaches take as input two or more feature sets (e.g., symptoms and brain measures) and produce clusters or subtypes characterized by different combinations of those features (20). Biclustering clusters on both rows (subjects) and columns (features) simultaneously to generate clusters representing subsets of subjects related to subsets of features (113). While still a single-view approach, this method can be adapted to be multi-view by including more than one feature set. Multi-view biclustering has been used to simultaneously cluster symptoms and brain features in adults with a range of psychopathology (114116). There are currently no studies applying multi-view biclustering to samples of youth, suggesting a promising area for future work.

A method conceptually to related multi-view biclustering is Similarity Network Fusion (SNF). SNF is a multi-view approach that creates networks of individuals based on each feature set separately and then integrates these into a single network (117). SNF has identified clusters of youth with schizophrenia-spectrum, autism-spectrum, or bipolar disorder using demographics, brain imaging, and behavioral data (118). Methods that consider multiple features sets will likely better represent the complex interactions that exist between clinical symptoms and biological data; as such, this approach may have great relevance to studies of brain development in youth.

3. Methodological considerations in studies of heterogeneity

3.1. Data and subject inclusion

Single-view and multi-view approaches share several issues in common across both dimensional and categorical methods. First are the issues of which subjects and what data (imaging features or symptoms) to include. This is especially important since the results will inevitably depend heavily on the input data. For example, bifactor studies that do not measure psychosis-spectrum symptoms will not find a thought disorder factor. Any variance associated with psychosis-spectrum symptoms may be aliased into the remaining factors, possibly impacting the dimensions or clusters discovered. Likewise, many clustering methods require defining a patient group for clustering; thus, how the sample is defined (e.g., transdiagnostic or only a particular disorder) will have a large impact the clusters found. Specific to clustering, there is also debate as to whether healthy participants should be clustered separately or in combination with patients (19, 79). Despite evidence of neurobiological heterogeneity within healthy controls (79, 110, 111, 119), they are often treated as a homogeneous group (120). Taken together, it is important to carefully consider which individuals and measures will be included for all methods discussed.

3.2. Choice of approach

A second common issue across approaches is the existence of multiple ways to parse heterogeneity in clinical groups depending on the approach chosen (19). While this was noted by Marquand et al (19) in regard to clustering methods, this issue also applies to dimensional approaches as well. There are many supervised, semi-supervised, and unsupervised algorithms for clustering, and while beyond the scope of this review, each has its own strengths and limitations (121). Different algorithms may suggest different clustering solutions, and there is currently no strong consensus on the best method for choosing the optimal number of clusters (19). Similarly, there is considerable debate regarding the appropriate method to delineate psychopathology factors using dimensional approaches. These methods differ to the degree to which they are theory-driven (confirmatory) or data-driven (exploratory), which will impact the interpretation of the resulting factors. Bornovalova et al. provide a comprehensive review of these issues as they relate to the bifactor model in this special issue. As Feczko and colleagues (119) point out, the goal or question at hand should drive methodological choices, as different approaches may be valid for different purposes (e.g., differentiating subtypes of patients, investigating symptom latent structure, predicting treatment response, etc.).

3.3. Consideration of covariates

Also related to the approach used is the issue of whether or not to control for covariates. This is not commonly discussed in single-view dimensional approaches such as factor analyses of symptom data, and it remains an open question as to whether it is necessary or even desirable to consider covariates in approaches that only examine symptoms. However, controlling for covariates in multi-view approaches that include both symptoms and neurobiological measures is important, as there are known confounds associated neuroimaging measures. In cross-sectional data spanning a large age range in youth, both age and sex are commonly controlled for (see Future Directions for a discussion on the need for longitudinal designs), as well as motion or data quality. Controlling for covariates may be especially important for methods that cluster based on neurobiological measures, as clustering algorithms may inadvertently produce clusters based on irrelevant variables with known relationships with brain features (e.g., clusters separated by younger and older individuals, or by females and males). For many approaches, covariates can simply be regressed out of the features of interest prior to clustering or subject-level weighting can be used to address confounds (122). Additionally, multi-view factor analytic approaches that consider covariates have been developed (123).

3.4. Issues regarding sample size

An additional consideration that applies to all approaches is the issue of adequate sample size. The methods reviewed typically require very large sample sizes to increase the stability of the results. Some methods are constrained by sample size. For example, factor analytic approaches and CCA are limited by the ratio of observations (subjects) to model features (variables), with more observations than variables required (51). While clustering methods and multi-view techniques like PLS may produce results in smaller samples that have many features, the results may have poor stability, generalizability, or both. Cross-validation and replication in independent samples can increase our confidence in the results, but may require even greater sample sizes and access to independently collected datasets with comparable measures. Leveraging large, publicly available neuroimaging studies will be especially helpful moving forward. Available resources include the NKI-Rockland Sample (124), the Philadelphia Neurodevelopmental Cohort (125), the Pediatric, Imaging, Neurocognition, and Genetics study (126), the Human Connectome Project-Lifespan studies (127), the Healthy Brain Network (128), and the Adolescent Brain Cognitive Development Study (129).

3.5. Interpretation issues

There are a number of interpretation issues to take into consideration for the methods discussed. For single-view dimensional approaches such as factor analysis, methodological choices for addressing correlations between factors will impact the interpretation of the results. For example, while correlated traits models allow resultant factors to be highly correlated with one another resulting in a high degree of overlap between these factors (40), bifactor models produce orthogonal (uncorrelated) factors (40). The symptom factors in a bifactor model represent the specific variance not accounted for by general psychopathology (e.g., the fear factor represents the unique fear symptoms not shared across disorders). Thus, the presence or lack of orthogonality needs to be taken into consideration when interpreting these symptom factors. Likewise, interpretability is also an important issue in categorical approaches. The progress in developing and implementing different clustering algorithms has out-paced the research on evaluating the validity of the results (130). Notably, some clusters may be so smallas to no longer be meaningful and some individuals may not fit into any cluster (19). Finally, for both dimensional and categorical approaches, it is also unclear whether data reduction for high-dimensional data will help or hinder interpretability. Data reduction is widely used, but few studies address its impact on the results. Validation on an independent dataset or feature type will be useful for evaluating whether the results are biologically meaningful.

3.6. Reproducibility and generalizability

Finally, it is important to consider the reproducibility and generalizability of results in studies that attempt to parse heterogeneity. While there are varying opinions in the field as to the definition of reproducibility as opposed to replicability (131), here we take Plesser’s recommendation to adapt Goodman’s definitions of: 1) methods reproducibility - providing sufficient detail about procedures and data so that the same procedures could be exactly repeated; 2) results reproducibility - obtaining the same results from an independent study with procedures as closely matched to the original study as possible; and 3) inferential reproducibility - drawing the same interpretive conclusions (132). Results reproducibility and inferential reproducibility are both related to the generalizability of the results. Importantly, the stability of subgroups derived from clustering methods over time has been brought into question (19), and generalizability to new samples remains challenging for all methods discussed (133). Reproducibility and generalizability may be improved when out-of-sample validation methods are employed using best practices (119, 134). While the reproducibility crisis in the field is not exclusive to the methods covered in this review (135), the high-dimensional, multivariate nature of the data in conjunction with relatively small sample sizes inevitably reduces reproducibility. Thus, it will be important for researchers to provide transparent documentation of the decisions made at each step and to validate their results using appropriate methods.

4. Considerations for future research

4.1. The importance of considering brain maturation in youth

As should be apparent from this review, there are only a limited number of studies that have applied these approaches to samples of youth, suggesting a potential area growth for the field. Additionally, the studies reviewed have relied primarily on cross-sectional data spanning wide age ranges, which has clear limitations for studying development. Currently, most cross-sectional studies simply control for age effects, either by including age as a covariate in the model, or by removing age effects before clustering. Treating age as a confound – rather than the primary effect of interest – is not ideal for approaching primary developmental questions of interest. Longitudinal designs have long been considered the gold standard in developmental research (136) and the dynamic nature of longitudinal trajectories of brain development are well illustrated in the review by Becht and Mills in this special issue. Thus, it will be important for future work to apply the approaches reviewed to longitudinal samples to determine whether neurobiological patterns change throughout development. Future work that capitalizes on large, longitudinal studies such as the ABCD study (129) will be extremely useful in this regard.

4.2. Considering circularity in data-driven approaches

One of the primary goals of data-driven approaches that aim to reconceptualize psychopathology is to provide an alternative to clinically-defined DSM categorical diagnoses. However, many of the methods reviewed still rely to varying degrees on DSM defined symptoms, introducing potential circularity into the data-driven discovery of brain-behavior relationships. For example, the clinical symptoms used in a bifactor analysis or CCA often come from clinical interviews or self-report measures based on symptoms established by the DSM. Likewise, many clustering methods require defining a patient group to cluster on beforehand, which is typically defined using DSM criteria. If our input symptom measures or patient groups are heavily influenced by DSM defined diagnoses, then it is possible that our output may broadly align with these DSM categories. Such circularity may conflict with the goal of identifying underlying heterogeneity in order to redefine traditional diagnostic categories. Importantly, it is possible the that symptoms defined by the DSM through years of observation and scientific study could be the most relevant for data-driven exploration. Future work may be able to mitigate this circularity to some degree by applying these methods to representative samples that 1) do not exclude comorbidity, 2) include symptoms spanning the continuum from health to disorder, 3) assess the full range of psychiatric symptoms, and 4) include atypical or less common symptoms. Additionally, alternatives to using DSM-defined constructs should be explored, including studies framed around longitudinal functional outcomes (137) and treatment response across disorders (60).

4.3. The utility of modeling symptoms and neurobiology simultaneously

Studies that employ data-driven approaches to drive discovery of neural circuits associated with psychopathology represent a potential advance over the traditional case-control approach that dominated the field for years. The approaches discussed have propelled research beyond simple group differences to consider common circuit-level deficits which drive co-morbidity, and heterogenous biological mechanisms within clinical syndromes. Methods that take into account more than one feature set at the same time may be particularly valuable. While single-view approaches that consider a single set of features remain useful for specific goals (e.g., redefining the classification of psychopathology symptoms), modeling more complex relationships between symptoms and biological measures will necessitate the development and further refinement of methods that can consider multiple features sets simultaneously. Moving forward, it will be important to assess for convergence using multimodal data including clinical symptoms, neuroimaging features, neuropsychological measures, genetics, as well as cellular and molecular measures. Thus, future work would benefit from the application of advanced methods such as multi-kernel learning, generalized CCA, and multi-view biclustering to integrate these increasing numbers of data types.

5. Conclusion

Taken together, the approaches reviewed here can be useful for redefining our understanding of heterogeneity in psychopathology. A reconceptualization of psychiatric disorders using these data-driven methods may move the field forward beyond traditional symptom-defined categories. Critically, additional research is needed using these methods in younger samples with longitudinal designs. Parsing heterogeneity in youth is a critical first step to advancing interventions that target the pathophysiological mechanisms underlying psychopathology.

ACKNOWLEDGEMENTS

This work was supported by grants from the National Institute of Mental Health (K99MH117274 to ANK; R01MH107703, R01MH113550, and R01MH120482 to TDS; R01NS085211 to RTS; R01MH112847 to RTS and TDS), the Lifespan Brain Institute at the Children’s Hospital of Philadelphia and Penn Medicine, and the NARSAD Young Investigator Award (ANK).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

DISCLOSURES

Dr. Shinohara has received legal consulting and advisory board income from Genentech/Roche. All other authors (Dr. Kaczkurkin, Dr. Moore, Dr. Sotiras, Dr. Xia, and Dr. Satterthwaite) report no biomedical financial interests or potential conflicts of interest.

References

  • 1.Kessler RC, Amminger GP, Aguilar-Gaxiola S, Alonso J, Lee S, Üstün TB (2007): Age of onset of mental disorders: A review of recent literature. Curr Opin Psychiatry. 20: 359–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Roza SJ, Hofstra MB, Van Der Ende J, Verhulst FC (2003): Stable prediction of mood and anxiety disorders based on behavioral and emotional problems in childhood: A 14-year follow-up during childhood, adolescence, and young adulthood. Am J Psychiatry. 160: 2116–2121. [DOI] [PubMed] [Google Scholar]
  • 3.Luciana M (2013): Adolescent brain development in normality and psychopathology. Dev Psychopathol. 25: 1325–1345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stiles J, Jernigan TL (2010): The basics of brain development. Neuropsychol Rev. 20: 327–348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Kessler RC, Chiu WT, Demler O, Walters EE (2005): Prevalence, Severity, and Comorbidity of 12-Month DSM-IV Disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 62: 617–627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.McTeague LM, Huemer J, Carreon DM, Jiang Y, Eickhoff SB, Etkin A (2017): Identification of common neural circuit disruptions in cognitive control across psychiatric disorders. Am J Psychiatry. 174: 676–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sprooten E, Rasgon A, Goodman M, Carlin A, Leibu E, Lee WH, Frangou S (2017): Addressing reverse inference in psychiatric neuroimaging: Meta-analyses of task-related brain activation in common mental disorders. Hum Brain Mapp. 38: 1846–1864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kotkowski E, Price LR, Fox PM, Vanasse TJ, Fox PT (2018): The hippocampal network model: A transdiagnostic metaconnectomic approach. NeuroImage Clin. 18: 115–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Cauda F, Costa T, Nani A, Fava L, Palermo S, Bianco F, et al. (2017): Are schizophrenia, autistic, and obsessive spectrum disorders dissociable on the basis of neuroimaging morphological findings?: A voxel-based meta-analysis. Autism Res. 10: 1079–1095. [DOI] [PubMed] [Google Scholar]
  • 10.Cauda F, Nani A, Manuello J, Premi E, Palermo S, Tatu K, et al. (2018): Brain structural alterations are distributed following functional, anatomic and genetic connectivity. Brain A J Neurol. 141: 3211–3232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Etkin A, Wager TD (2007): Functional neuroimaging of anxiety: A meta-analysis of emotional processing in PTSD, social anxiety disorder, and specific phobia. Am J Psychiatry. 164: 1476–1488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhao Y, Chen L, Zhang W, Xiao Y, Shah C, Zhu H, et al. (2017): Gray Matter Abnormalities in Non-comorbid Medication-naive Patients with Major Depressive Disorder or Social Anxiety Disorder. EBioMedicine. 21: 228–235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Goodkind M, Eickhoff SB, Oathes DJ, Jiang Y, Chang A, Jones-Hagata LB, et al. (2015): Identification of a common neurobiological substrate for mental illness. JAMA Psychiatry. 72: 305–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fried EI (2015): Problematic assumptions have slowed down depression research: Why symptoms, not syndromes are the way forward. Front Psychol. 6: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Fried EI (2017): The 52 symptoms of major depression: Lack of content overlap among seven common depression scales. J Affect Disord. 208: 191–197. [DOI] [PubMed] [Google Scholar]
  • 16.Luo Y, Weibman D, Halperin JM, Li X (2019): A Review of Heterogeneity in Attention Deficit/Hyperactivity Disorder (ADHD). Front Hum Neurosci. 13: 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nandi A, Beard JR, Galea S (2009): Epidemiologic heterogeneity of common mood and anxiety disorders over the lifecourse in the general population: a systematic review. BMC Psychiatry. 9: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lahey BB, Krueger RF, Rathouz PJ, Waldman ID, Zald DH (2017): A hierarchical causal taxonomy of psychopathology across the life span. Psychol Bull. 143: 142–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Marquand AF, Rezek I, Buitelaar J, Beckmann CF (2016): Understanding Heterogeneity in Clinical Cohorts Using Normative Models: Beyond Case-Control Studies. Biol Psychiatry. 80: 552–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhao J, Xie X, Xu X, Sun S (2017): Multi-view learning overview: Recent progress and new challenges. Inf Fusion. 38: 43–54. [Google Scholar]
  • 21.Nassar R, Kaczkurkin AN, Xia CH, Sotiras A, Pehlivanova M, Moore TM, et al. (2018): Gestational Age is Dimensionally Associated with Structural Brain Network Abnormalities Across Development. Cereb Cortex. 29: 2102–2114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kaczkurkin AN, Park SS, Sotiras A, Moore TM, Calkins ME, Cieslak M, et al. (2019): Evidence for Dissociable Linkage of Dimensions of Psychopathology to Brain Structure in Youths. Am J Psychiatry. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Calhoun VD, Adali T, Pearlson GD, Pekar JJ (2001): Spatial and temporal independent component analysis of functional MRI data containing a pair of task-related waveforms. Hum Brain Mapp. 13: 43–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Pehlivanova M, Wolf DH, Sotiras A, Kaczkurkin AN, Moore TM, Ciric R, et al. (2018): Diminished cortical thickness is associated with impulsive choice in adolescence. J Neurosci. 38: 2471–2481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Smith SM, Hyvärinen A, Varoquaux G, Miller KL, Beckmann CF (2014): Group-PCA for very large fMRI datasets. Neuroimage. 101: 738–749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Beckmann CF, Smith SM (2004): Probabilistic Independent Component Analysis for Functional Magnetic Resonance Imaging. IEEE Trans Med Imaging. 23: 137–152. [DOI] [PubMed] [Google Scholar]
  • 27.Sotiras A, Resnick SM, Davatzikos C (2015): Finding imaging patterns of structural covariance via Non-Negative Matrix Factorization. Neuroimage. 108: 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sotiras A, Toledo JB, Gur RE, Gur RC, Satterthwaite TD, Davatzikos C (2017): Patterns of coordinated cortical remodeling during adolescence and their associations with functional specialization and evolutionary expansion. Proc Natl Acad Sci. 114: 3527–3532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.McKeown MJ, Jung T-P, Makeig S, Brown G, Kindermann SS, Lee T-W, Sejnowski TJ (1998): Spatially independent activity patterns in functional MRI data during the Stroop color-naming task. Proc Natl Acad Sci. 95: 803–810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Mckeown MJ, Makeig S, Brown GG, Jung T, Kindermann SS, Bell AJ, Sejnowski TJ (1998): Analysis of fMRI Data by Blind Separation Into Independent Spatial Components. Hum Brain Mapp. 6: 160–188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Calhoun VD, Adali T, Pearlson G, Pekar J (2001): A Method for Making Group Inferences from Functional MRI Data Using Independent Component Analysis. Hum Brain Mapp. 14: 140–151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Moore TM, Calkins ME, Satterthwaite TD, Roalf DR, Port AM, Jackson CT, et al. (2019): Development, validation, and public release of a computerized adaptive (CAT) screener for overall mental illness burden. under review. [Google Scholar]
  • 33.Lahey BB, Applegate B, Hakes JK, Zald DH, Hariri AR, Rathouz PJ (2012): Is There a general factor of prevalent psychopathology during adulthood? J Abnorm Psychol. 121: 971–977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Blanco C, Wall MM, He JP, Krueger RF, Olfson M, Jin CJ, et al. (2015): The Space of Common Psychiatric Disorders in Adolescents: Comorbidity Structure and Individual Latent Liabilities. J Am Acad Child Adolesc Psychiatry. 54: 45–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Krueger RF (1999): The Structure of Common Mental Disorders. Arch Gen Psychiatry. 56: 921. [DOI] [PubMed] [Google Scholar]
  • 36.Krueger RF, Markon KE (2005): Reinterpreting Comorbidity: A Model-Based Approach to Understanding and Classifying Psychopathology. Annu Rev Clin Psychol. 2: 111–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Slade T, Watson D (2006): The structure of common DSM-IV and ICD-10 mental disorders in the Australian general population. Psychol Med. 36: 1593–1600. [DOI] [PubMed] [Google Scholar]
  • 38.Wright AGC, Krueger RF, Hobbs MJ, Markon KE, Eaton NR, Slade T (2013): The structure of psychopathology: Toward an expanded quantitative empirical model. J Abnorm Psychol. 122: 281–294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lahey BB, Rathouz PJ, Van Hulle C, Urbano RC, Krueger RF, Applegate B, et al. (2008): Testing structural models of DSM-IV symptoms of common forms of child and adolescent psychopathology. J Abnorm Child Psychol. 36: 187–206. [DOI] [PubMed] [Google Scholar]
  • 40.Reise SP, Moore TM, Haviland MG (2010): Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores. J Pers Assess. 92: 544–559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Conway CC, Forbes MK, Forbush KT, Fried EI, Hallquist MN, Kotov R, et al. (2019): A hierarchical taxonomy of psychopathology can transform mental health research. Perspect Psychol Sci. 14: 419–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. (2014): The p factor: One general psychopathology factor in the structure of psychiatric disorders? Clin Psychol Sci. 2: 119–137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Snyder HR, Hankin BL, Sandman CA, Head K, Davis EP (2017): Distinct Patterns of Reduced Prefrontal and Limbic Gray Matter Volume in Childhood General and Internalizing Psychopathology. Clin Psychol Sci. 5: 1001–1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Shanmugan S, Wolf DH, Calkins ME, Moore TM, Ruparel K, Hopson RD, et al. (2016): Common and dissociable mechanisms of executive system dysfunction across psychiatric disorders in youth. Am J Psychiatry. 173: 517–526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kaczkurkin AN, Moore TM, Calkins ME, Ciric R, Detre JA, Elliott MA, et al. (2018): Common and dissociable regional cerebral blood flow differences associate with dimensions of psychopathology across categorical diagnoses. Mol Psychiatry. 23: 1981–1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Riem MME, van Hoof MJ, Garrett AS, Rombouts SARB, van der Wee NJA, van IJzendoorn MH, Vermeiren RRJM (2019): General psychopathology factor and unresolved-disorganized attachment uniquely correlated to white matter integrity using diffusion tensor imaging. Behav Brain Res. 359: 1–8. [DOI] [PubMed] [Google Scholar]
  • 47.Kaufmann T, Alnæs D, Doan NT, Brandt CL, Andreassen OA, Westlye LT (2017): Delayed stabilization and individualization in connectome development are related to psychiatric disorders. Nat Neurosci. 20: 513–515. [DOI] [PubMed] [Google Scholar]
  • 48.McIntosh AR, Lobaugh NJ (2004): Partial least squares analysis of neuroimaging data: Applications and advances. Neuroimage. 23: S250–S263. [DOI] [PubMed] [Google Scholar]
  • 49.Krishnan A, Williams LJ, McIntosh AR, Abdi H (2011): Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review. Neuroimage. 56: 455–475. [DOI] [PubMed] [Google Scholar]
  • 50.Sui J, Adali T, Yu Q, Chen J, Calhoun VD (2012): A review of multivariate methods for multimodal fusion of brain imaging data. J Neurosci Methods. 204: 68–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wang H-T, Smallwood J, Mourao-Miranda J, Xia CH, Satterthwaite TD, Bassett DS, Bzdok D (2018): Finding the needle in high-dimensional haystack: A tutorial on canonical correlation analysis. arXiv. 1812.02598. [DOI] [PubMed] [Google Scholar]
  • 52.Sui J, He H, Pearlson GD, Adali T, Kiehl KA, Yu Q, et al. (2013): Three-way (N-way) fusion of brain imaging data based on mCCA+jICA and its application to discriminating schizophrenia. Neuroimage. 66: 119–132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Moser DA, Doucet GE, Lee WH, Rasgon A, Krinsky H, Leibu E, et al. (2018): Multivariate associations among behavioral, clinical, and multimodal imaging phenotypes in patients with psychosis. JAMA Psychiatry. 75: 386–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Marquand AF, Haak KV., Beckmann CF (2017): Functional corticostriatal connection topographies predict goal-directed behaviour in humans. Nat Hum Behav. 1: 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Vatansever D, Bzdok D, Wang HT, Mollo G, Sormaz M, Murphy C, et al. (2017): Varieties of semantic cognition revealed through simultaneous decomposition of intrinsic brain connectivity and behaviour. Neuroimage. 158: 1–11. [DOI] [PubMed] [Google Scholar]
  • 56.Tsvetanov KA, Henson RNA, Tyler LK, Razi A, Geerligs L, Ham TE, Rowe JB (2016): Extrinsic and Intrinsic Brain Network Connectivity Maintains Cognition across the Lifespan Despite Accelerated Decay of Regional Brain Activation. J Neurosci. 36: 3115–3126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Wang HT, Bzdok D, Margulies D, Craddock C, Milham M, Jefferies E, Smallwood J (2018): Patterns of thought: Population variation in the associations between large-scale network organisation and self-reported experiences at rest. Neuroimage. 176: 518–527. [DOI] [PubMed] [Google Scholar]
  • 58.Wang HT, Poerio G, Murphy C, Bzdok D, Jefferies E, Smallwood J (2018): Dimensions of Experience: Exploring the Heterogeneity of the Wandering Mind. Psychol Sci. 29: 56–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Supekar K, Cai W, Krishnadas R, Palaniyappan L, Menon V (2019): Dysregulated Brain Dynamics in a Triple-Network Saliency Model of Schizophrenia and Its Relation to Psychosis. Biol Psychiatry. 85: 60–69. [DOI] [PubMed] [Google Scholar]
  • 60.Drysdale AT, Grosenick L, Downar J, Dunlop K, Mansouri F, Meng Y, et al. (2017): Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nat Med. 23: 28–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lin H-Y, Cocchi L, Zalesky A, Lv J, Perry A, Tseng W-YI, et al. (2018): Brain-behavior patterns define a dimensional biotype in medication-naïve adults with attention-deficit hyperactivity disorder. Psychol Med. 48: 2399–2408. [DOI] [PubMed] [Google Scholar]
  • 62.Stout DM, Buchsbaum MS, Spadoni AD, Risbrough VB, Strigo IA, Matthews SC, Simmons AN (2018): Multimodal canonical correlation reveals converging neural circuitry across trauma-related disorders of affect and cognition. Neurobiol Stress. 9: 241–250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Avants BB, Libon DJ, Rascovsky K, Boller A, McMillan CT, Massimo L, et al. (2014): Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population. Neuroimage. 84: 698–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Kebets V, Holmes AJ, Orban C, Tang S, Li J, Sun N, et al. (2019): Somatosensory-Motor Dysconnectivity Spans Multiple Transdiagnostic Dimensions of Psychopathology. bioRxiv. 637827. [DOI] [PubMed] [Google Scholar]
  • 65.Rodrigue AL, McDowell JE, Tandon N, Keshavan MS, Tamminga CA, Pearlson GD, et al. (2018): Multivariate Relationships Between Cognition and Brain Anatomy Across the Psychosis Spectrum. Biol Psychiatry Cogn Neurosci Neuroimaging. 3: 992–1002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Smith SM, Nichols TE, Vidaurre D, Winkler AM, Behrens TEJ, Glasser MF, et al. (2015): A positive-negative mode of population covariation links brain connectivity, demographics and behavior. Nat Neurosci. 18: 1565–1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Mihalik A, Ferreira FS, Rosa MJ, Moutoussis M, Ziegler G, Monteiro JM, et al. (2018): Brain-behaviour modes of covariation in healthy and clinically depressed young people. bioRxiv. 487421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Parkhomenko E, Tritchler D, Beyene J (2009): Sparse canonical correlation analysis with application to genomic data integration. Stat Appl Genet Mol Biol. 8: 1–34. [DOI] [PubMed] [Google Scholar]
  • 69.Witten DM, Tibshirani R, Hastie T (2009): A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics. 10: 515–534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Long Z, Wang Y, Liu X, Yao L (2019): Two-step paretial least square regression classifiers in brain-state decoding using functional magnetic resonance imaging. PLoS One. 14: 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Vounou M, Nichols TE, Montana G (2010): Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach. Neuroimage. 53: 1147–1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Xia CH, Ma Z, Ciric R, Gu S, Betzel RF, Kaczkurkin AN, et al. (2018): Linked dimensions of psychopathology and connectivity in functional brain networks. Nat Commun. 9: 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Doshi-Velez F, Ge Y, Kohane I (2013): Comorbidity Clusters in Autism Spectrum Disorders: An Electronic Health Record Time-Series Analysis. Pediatrics. 133: e54–e63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Geisler D, Walton E, Naylor M, Roessner V, Lim KO, Charles Schulz S, et al. (2015): Brain structure and function correlates of cognitive subtypes in schizophrenia. Psychiatry Res - Neuroimaging. 234: 74–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Rhebergen D, Lamers F, Spijker J, De Graaf R, Beekman ATF, Penninx BWJH (2012): Course trajectories of unipolar depressive disorders identified by latent class growth analysis. Psychol Med. 42: 1383–1396. [DOI] [PubMed] [Google Scholar]
  • 76.van Loo HM, Cai T, Gruber MJ, Li J, De Jonge P, Petukhova M, et al. (2014): Major depressive disorder subtypes to predict long-term course. Depress Anxiety. 31: 765–777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Veatch OJ, Veenstra-Vanderweele J, Potter M, Pericak-Vance MA, Haines JL (2014): Genetically meaningful phenotypic subgroups in autism spectrum disorders. Genes, Brain Behav. 13: 276–285. [DOI] [PubMed] [Google Scholar]
  • 78.Bell MD, Corbera S, Johannesen JK, Fiszdon JM, Wexler BE (2013): Social cognitive impairments and negative symptoms in schizophrenia: Are there subtypes with distinct functional correlates? Schizophr Bull. 39: 186–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Fair D, Bathula D, Nikolas M, Nigg J (2012): Distinct neuropsychological subgroups in typically developing youth inform heterogeneity in children with ADHD Author(s): PNAS. 109: 6769–6774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Lamers F, de Jonge P, Nolen WA, Smit JH, Zitman FG, Beekman ATF, Penninx BWJH (2010): Identifying Depressive Subtypes in a Large Cohort Study. J Clin Psychiatry. 71: 1582–1589. [DOI] [PubMed] [Google Scholar]
  • 81.Van Dam NT, O’Connor D, Marcelle ET, Ho EJ, Cameron Craddock R, Tobe RH, et al. (2017): Data-Driven Phenotypic Categorization for Neurobiological Analyses: Beyond DSM-5 Labels. Biol Psychiatry. 81: 484–494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Van Hulst BM, De Zeeuw P, Durston S (2015): Distinct neuropsychological profiles within ADHD: A latent class analysis of cognitive control, reward sensitivity and timing. Psychol Med. 45: 735–745. [DOI] [PubMed] [Google Scholar]
  • 83.Nenadic I, Gaser C, Sauer H (2012): Heterogeneity of brain structural variation and the structural imaging endophenotypes in schizophrenia. Neuropsychobiology. 66: 44–49. [DOI] [PubMed] [Google Scholar]
  • 84.Karalunas SL, Fair D, Musser ED, Aykes K, Iyer SP, Nigg JT (2014): Subtyping attention-deficit/hyperactivity disorder using temperament dimensions: Toward biologically based nosologic criteria. JAMA Psychiatry. 71: 1015–1024. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 85.Georgiades S, Szatmari P, Boyle M, Hanna S, Duku E, Zwaigenbaum L, et al. (2013): Investigating phenotypic heterogeneity in children with autism spectrum disorder: A factor mixture modeling approach. J Child Psychol Psychiatry. 54: 206–215. [DOI] [PubMed] [Google Scholar]
  • 86.Wiggins LD, Tian LH, Levy SE, Rice C, Lee LC, Schieve L, et al. (2017): Homogeneous Subgroups of Young Children with Autism Improve Phenotypic Characterization in the Study to Explore Early Development. J Autism Dev Disord. 47: 3634–3645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Grisanzio KA, Goldstein-Piekarski AN, Wang MY, Ahmed APR, Samara Z, Williams LM (2018): Transdiagnostic symptom clusters and associations with brain, behavior, and daily function in mood, anxiety, and trauma disorders. JAMA Psychiatry. 75: 201–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Kleinman A, Caetano SC, Brentani H, Rocca CCDA, Dos Santos B, Andrade ER, et al. (2015): Attention-based classification pattern, a research domain criteria framework, in youths with bipolar disorder and attention-deficit/hyperactivity disorder. Aust N Z J Psychiatry. 49: 255–265. [DOI] [PubMed] [Google Scholar]
  • 89.Lamers F, Burstein M, He JP, Avenevoli S, Angst J, Merikangas KR (2012): Structure of major depressive disorder in adolescents and adults in the US general population. Br J Psychiatry. 201: 143–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Lewandowski KE, Sperry SH, Cohen BM, Öngür D (2014): Cognitive variability in psychotic disorders: A cross-diagnostic cluster analysis. Psychol Med. 44: 3239–3248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Milaneshi Y, Lamers F, Peyrot W, Abdellaoui A, Willemsen G, Hottenga J-J, et al. (2016): Polygenic dissection of major depression clinical heterogeneity. Mol Psychiatry. 21: 516–522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Mostert JC, Hoogman M, Onnink AMH, van Rooij D, von Rhein D, van Hulzen KJE, et al. (2018): Similar Subgroups Based on Cognitive Performance Parse Heterogeneity in Adults With ADHD and Healthy Controls. J Atten Disord. 22: 281–292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Olino TM, Klein DN, Lewinsohn PM, Rohde P, Seeley JR (2010): Latent trajectory classes of depressive and anxiety disorders from adolescence to adulthood: descriptions of classes and associations with risk factors. Compr Psychiatry. 51: 224–235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Varol E, Sotiras A, Davatzikos C (2017): HYDRA: Revealing heterogeneity of imaging and genetic patterns through a multiple max-margin discriminative analysis framework. Neuroimage. 145: 346–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Gupta CN, Castro E, Rachkonda S, van Erp TGM, Potkin S, Ford JM, et al. (2017): Biclustered independent component analysis for complex biomarker and subtype identification from structural magnetic resonance images in schizophrenia. Front Psychiatry. 8: 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Rahaman MA, Turner JA, Gupta CN, Rachakonda S, Chen J, Liu JY, et al. (2019): N-BiC: A method for multi-component and symptom biclustering of structural MRI data: Application to schizophrenia. IEEE Trans Biomed Eng. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Arnedo J, Mamah D, Baranger DA, Harms MP, Barch DM, Svrakic DM, et al. (2015): Decomposition of brain diffusion imaging data uncovers latent schizophrenias with distinct patterns of white matter anisotropy. Neuroimage. 120: 43–54. [DOI] [PubMed] [Google Scholar]
  • 98.Segman RH, Shefi N, Goltser-Dubner T, Friedman N, Kaminski N, Shalev AY (2005): Peripheral blood mononuclear cell gene expression profiles identify emergent post-traumatic stress disorder among trauma survivors. Mol Psychiatry. 10: 500–513. [DOI] [PubMed] [Google Scholar]
  • 99.Cha K, Hwang T, Oh K, Yi GS (2015): Discovering transnosological molecular basis of human brain diseases using biclustering analysis of integrated gene expression data. BMC Med Inform Decis Mak. 15: 1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Rangan AV, McGrouther CC, Kelsoe J, Schork N, Stahl E, Zhu Q, et al. (2018): A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data. PLOS Comput Biol. 14: e1006105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Dong A, Toledo JB, Honnorat N, Doshi J, Varol E, Sotiras A, et al. (2017): Heterogeneity of neuroanatomical patterns in prodromal Alzheimer’s disease: links to cognition, progression and biomarkers. Brain. 140: 735–747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Dong A, Honnorat N, Gaonkar B, Davatzikos C (2016): CHIMERA: Clustering of heterogeneous disease effects via distribution matching of imaging patterns. IEEE Trans Med Imaging. 35: 612–621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Tokuda T, Yoshimoto J, Shimizu Y, Okada G, Takamura M, Okamoto Y, et al. (2018): Identification of depression subtypes and relevant brain regions using a data-driven approach. Sci Rep. 8: 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Brodersen KH, Deserno L, Schlagenhauf F, Lin Z, Penny WD, Buhmann JM, Stephan KE (2014): Dissecting psychiatric spectrum disorders by generative embedding. NeuroImage Clin. 4: 98–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Clementz BA, Sweeney JA, Hamm JP, Ivleva EI, Ethridge LE, Pearlson GD, et al. (2016): Identification of distinct psychosis biotypes using brain-based biomarkers. Am J Psychiatry. 173: 373–384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Sun H, Lui S, Yao L, Deng W, Xiao Y, Zhang W, et al. (2015): Two patterns of white matter abnormalities in medication-naive patients with first-episode schizophrenia revealed by diffusion tensor imaging and cluster analysis. JAMA Psychiatry. 72: 678–686. [DOI] [PubMed] [Google Scholar]
  • 107.Viviano JD, Buchanan RW, Calarco N, Gold JM, Foussias G, Bhagwat N, et al. (2018): Resting-State Connectivity Biomarkers of Cognitive Performance and Social Function in Individuals With Schizophrenia Spectrum Disorder and Healthy Control Subjects. Biol Psychiatry. 84: 665–674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Hawco C, Buchanan RW, Calarco N, Mulsant BH, Viviano JD, Dickie EW, et al. (2019): Separable and Replicable Neural Strategies During Social Brain Function in People With and Without Severe Mental Illness. Am J Psychiatry. 176: 521–530. [DOI] [PubMed] [Google Scholar]
  • 109.Wee CY, Tuan TA, Broekman BFP, Ong MY, Chong YS, Kwek K, et al. (2017): Neonatal neural networks predict children behavioral profiles later in life. Hum Brain Mapp. 38: 1362–1373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Costa Dias TG, Iyer SP, Carpenter SD, Cary RP, Wilson VB, Mitchel SH, et al. (2015): Characterizing heterogeneity in children with and without ADHD based on reward system connectivity. Dev Cogn Neurosci. 11: 155–174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Gates KM, Molenaar PCM, Iyer SP, Nigg JT, Fair DA (2014): Organizing heterogeneous samples using community detection of GIMME-Derived resting state functional networks. PLoS One. 9: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Kaczkurkin AN, Sotiras A, Baller E, Calkins M, Chand G, Cui Z, et al. (2019): Neurostructural Heterogeneity in Youth with Internalizing Symptoms. bioRxiv. 614438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Padilha VA, Campello RJGB (2017): A systematic comparative evaluation of biclustering techniques. BMC Bioinformatics. 18: 1–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Yin L, Chau CKL, Sham P-C, So H-C (2019): Uncovering complex disease subtypes by integrating clinical data and imputed transcriptome from genome-wide association studies: Applications in psychiatry and cardiovascular medicine. bioRxiv. 595488. [Google Scholar]
  • 115.Yin L, Cheung EFC, Chen RYL, Wong EHM, Sham PC, So HC (2018): Leveraging genome-wide association and clinical data in revealing schizophrenia subgroups. J Psychiatr Res. 106: 106–117. [DOI] [PubMed] [Google Scholar]
  • 116.Sun J, Bi J, Kranzler HR (2014): Multi-view singular value decomposition for disease subtyping and genetic associations. BMC Genet. 15: 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Stefanik L, Erdman L, Ameis SH, Foussias G, Mulsant BH, Behdinan T, et al. (2018): Brain-Behavior Participant Similarity Networks among Youth and Emerging Adults with Schizophrenia Spectrum, Autism Spectrum, or Bipolar Disorder and Matched Controls. Neuropsychopharmacology. 43: 1180–1188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Wang B, Mezlini A, Demir F, Fiume M, Tu Z, Brudno M, et al. (2014): Similarity network fusion for aggregating data types on a genomic scale. Nat Methods. 11: 333–340. [DOI] [PubMed] [Google Scholar]
  • 119.Feczko E, Miranda-Dominguez O, Marr M, Graham AM, Nigg JT, Fair DA (2019): The Heterogeneity Problem: Approaches to Identify Psychiatric Subtypes. Trends Cogn Sci. 23: 584–601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Holmes AJ, Patrick LM (2018): The Myth of Optimality in Clinical Neuroscience. Trends Cogn Sci. 22: 241–257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Vong WK, Navarro DJ, Perfors A (2016): The helpfulness of category labels in semi-supervised learning depends on category structure. Psychon Bull Rev. 23: 230–238. [DOI] [PubMed] [Google Scholar]
  • 122.Linn KA, Gaonkar B, Doshi J, Davatzikos C, Shinohara RT (2016): Addressing Confounding in Predictive Models with an Application to Neuroimaging. Int J Biostat. 12: 31–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Li G, Jung S (2017): Incorporating covariates into integrated factor analysis of multi-view data. Biometrics. 73: 1433–1442. [DOI] [PubMed] [Google Scholar]
  • 124.Nooner KB, Colcombe SJ, Tobe RH, Mennes M, Benedict MM, Moreno AL, et al. (2012): The NKI-Rockland sample: A model for accelerating the pace of discovery science in psychiatry. Front Neurosci. 6: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Satterthwaite TD, Connolly JJ, Ruparel K, Calkins ME, Jackson C, Elliott MA, et al. (2016): The Philadelphia Neurodevelopmental Cohort: A publicly available resource for the study of normal and abnormal brain development in youth. Neuroimage. 124: 1115–1119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Jernigan TL, Brown TT, Hagler DJ, Akshoomoff N, Bartsch H, Newman E, et al. (2016): The Pediatric Imaging, Neurocognition, and Genetics (PING) Data Repository. Neuroimage. 124: 1149–1154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Ances B, Bookheimer S, Buckner R, Salat D, Smith S, Terpstra M, et al. (2018): Human Connectome Project-Lifespan studies. Retrieved March 6, 2018, from https://www.humanconnectome.org/lifespan-studies.
  • 128.Alexander LM, Escalera J, Ai L, Andreotti C, Febre K, Mangone A, et al. (2017): Data Descriptor: An open resource for transdiagnostic research in pediatric mental health and learning disorders. Sci Data. 4: 1–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.National Institutes of Health (2018): Adolescent Brain Cogntive Development Study (ABCD). Retrieved March 6, 2018, from https://addictionresearch.nih.gov/abcd-study.
  • 130.Halkidi M, Batistakis Y, Vazirgiannis M (2001): On clustering validation techniques. J Intell Inf Syst. 17: 107–145. [Google Scholar]
  • 131.Plesser HE (2018): Reproducibility vs. Replicability: A Brief History of a Confused Terminology. Front Neuroinform. 11: 1–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Goodman & Fanelli (2016): What does research reproducibility mean? Sci Transl Med. 8: 1–6. [DOI] [PubMed] [Google Scholar]
  • 133.Dinga R, Schmaal L, Penninx B, van Tol MJ, Veltman D, van Velzen L, et al. (2019): Evaluating the evidence for biotypes of depression: Methodological replication and extension of Drysdale et.al. 2017. NeuroImage Clin. 22: in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Chicco D (2017): Ten quick tips for machine learning in computational biology. BioData Min. 10: 1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Nichols TE, Das S, Eickhoff SB, Evans AC, Glatard T, Hanke M, et al. (2017): Best practices in data analysis and sharing in neuroimaging using MRI HHS Public Access Author manuscript. Nat Neurosci. 20: 299–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Miller SA (2017): Developmental Research Methods, 5th Ed. Thousand Oaks, CA: SAGE Publications. [Google Scholar]
  • 137.Koutsouleris N, Kambeitz-Ilankovic L, Ruhrmann S, Rosen M, Ruef A, Dwyer DB, et al. (2018): Prediction Models of Functional Outcomes for Individuals in the Clinical High-Risk State for Psychosis or with Recent-Onset Depression: A Multimodal, Multisite Machine Learning Analysis. JAMA Psychiatry. 75: 1156–1172. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES