Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Mar 15.
Published in final edited form as: Biol Psychiatry. 2016 Jul 19;81(6):484–494. doi: 10.1016/j.biopsych.2016.06.027

Data-Driven Phenotypic Categorization for Neurobiological Analyses: Beyond DSM-5 Labels

Nicholas T Van Dam 1,2,†,*, David O’Connor 1,2, Enitan T Marcelle 1,, Erica J Ho 1, R Cameron Craddock 1,2, Russell H Tobe 2, Vilma Gabbay 3, James J Hudziak 5, F Xavier Castellanos 2,5, Bennett L Leventhal 2,6, Michael P Milham 1,2
PMCID: PMC5402759  NIHMSID: NIHMS818778  PMID: 27667698

Abstract

Background

Data-driven approaches can capture behavioral and biological variation currently unaccounted for by contemporary diagnostic categories, thereby enhancing the ability of neurobiological studies to characterize brain-behavior relationships.

Methods

A community-ascertained sample of individuals (N=347, ages 18–59) completed a battery of behavioral measures, psychiatric assessment, and resting state functional magnetic resonance imaging (R-fMRI) in a cross-sectional design. Bootstrap-based exploratory factor analysis was applied to 49 phenotypic subscales from 10 measures. Hybrid Hierarchical Clustering was applied to resultant factor scores to identify nested groups. Adjacent groups were compared via independent samples t-tests and chi-square tests of factor scores, syndrome scores, and psychiatric prevalence. Multivariate Distance Matrix Regression examined functional connectome differences between adjacent groups.

Results

Reduction yielded six factors, which explained 77.8% and 65.4% of the variance in exploratory and constrained exploratory models, respectively. Hybrid Hierarchical Clustering of these 6 factors identified 2, 4, and 8 nested groups (i.e., phenotypic communities). At the highest clustering level, the algorithm differentiated functionally adaptive and maladaptive groups. At the middle clustering level, groups were separated by problem type (maladaptive groups; internalizing vs. externalizing problems) and behavioral type (adaptive groups; sensation-seeking vs. extraverted/emotionally stable). Unique phenotypic profiles were also evident at the lowest clustering level. Group comparisons exhibited significant differences in intrinsic functional connectivity at the highest clustering level in somatomotor, thalamic, basal ganglia, and limbic networks.

Conclusions

Data-driven approaches for identifying homogenous subgroups, spanning typical function to dysfunction not only yielded clinically meaningful groups, but captured behavioral and neurobiological variation among healthy individuals as well.

Keywords: Phenotypes, RDoC, Psychopathology, resting state fMRI, Hierarchical Clustering, Multivariate Distance Matrix Regression

Introduction

The limitations of categorical definitions of psychiatric illness for clinical practice (1) and psychiatric research (2) are increasingly apparent. While diagnostic labels defined in nosological systems such as the Diagnostic and Statistical Manual of Mental Disorders (DSM (3)) are needed for clinical practice, these systems impede the search for pathophysiological markers using epidemiologic, genetic, and neuroimaging approaches (4). Given growing recognition of these limitations, the Research Domain Criteria (RDoC) Project has called for the development of a new nosology (5). In response, empirical data are being used to identify target phenotypic domains and constructs to characterize psychopathology and guide psychological and neurobiological investigations.

Not surprisingly, how to best delineate phenotypic domains/constructs to guide a non-syndromal research framework is uncertain (6). Inherent to this pursuit is the varying utility of categorical and dimensional frameworks. While dimensional models of psychopathology are widely supported (79), fully dimensional perspectives have limitations with respect to clinical decision-making (10, 11). Further, it is unclear whether it is more expedient to derive phenotypic targets from existing models (based on existing data and theory), data-driven analytic approaches (12, 13), or some combination. Psychiatric classification systems have variable derivations spanning clinical/research observations (e.g., DSM) to empirical assessment (e.g., Achenbach System), with other entities exhibiting a combination (e.g., RDoC). Consensus-driven methods (e.g., DSM; also arguably RDoC – see (14)) can certainly provide valuable insights; however, data-driven approaches may be crucial for identifying more behaviorally refined biological phenotypes (15) to address the profound heterogeneity evident in health and illness (16, 17). Fair, Nigg, and colleagues recently demonstrated the potential value of delineating groups by similarity/dissimilarity of individual phenotypic profiles (e.g., neuropsychological profiles, temperament profiles). Adopting community detection methodologies from graph theory, they successfully identified: i) six distinct neuropsychological profiles that capture normal variation and are modified by Attention-Deficit/Hyperactivity Disorder (18) and ii) three temperamental phenotypes that showed intriguing biological differences as well as differential clinical outcomes (19).

Here, we used the Nathan Kline Institute Rockland Sample (NKI-RS)(20), a deeply phenotyped, community-ascertained multimodal imaging sample. Using data from adult participants (ages 18–59) in the NKI-RS, we aimed to identify data-driven phenotypes, based on core behavioral features representing several domains of function (including personality/temperament, symptom features, interpersonal functioning, and behavioral tendencies). Our first aim is to identify phenotypic dimensions that accurately represent meaningful variation across multiple domains of behavior. Accordingly, we conducted a bootstrap-based exploratory factor analysis on 49 subscales derived from 10 measures obtained for 347 participants. The second aim is to identify a nested hierarchy of homogenous participant groups via hybrid hierarchical clustering of participants, based on the factor profiles that we previously identified. To provide a phenotypic characterization of the participant groupings identified, we used DSM-IV labels and Achenbach’s Adult Self-Report (ASR (21)) – neither of which were included in the factor analysis. The third aim is to examine multivariate intrinsic brain functional connectivity differences among adjacent clusters/groups (derived from the first two aims).

Materials and Methods

Participants

Participants were recruited as part of the NKI-RS (20), a community-based sample of approximately 1,000 participants, ages 6–85 years. To ascertain a cohort approximating a representative sample, exclusion criteria were minimal. Notably, comorbid medical conditions and medications (including psychotropics) were permitted. Written informed consent was obtained from all participants in accordance with local Institutional Review Board oversight. The following inclusion criteria were applied: (i) ages 18–59; (ii) absence of serious head injury and/or major neurological disorder; (iii) negative history of bipolar disorder or psychosis; (iv) negative drug test for commonly used illicit drugs with no therapeutic analogs/applications; (v) at least 95% completion of each self-report measure examined.

Subject Phenotyping

All participants completed the Structured Clinical Interview for the DSM (SCID (22)), and the Edinburgh Handedness Inventory (EHI (23)), in addition to measures reflecting clinical symptom domains, personality/temperament, and broad behavioral characteristics (see Table S1; also (20)). We selected subscales rather than individual items or full-scale scores to balance depth of assessment and amount of data per subject.

MRI Acquisition

Imaging data were collected on a 3T Siemens TIM Trio system equipped with 32-channel head coil. Both structural and resting state functional magnetic imaging (R-fMRI) data were acquired. The structural image was a T1-weighted magnetization prepared gradient echo sequence (MPRAGE): TR=1900 ms, TE=2.52 ms, flip angle=9°, 176 slices, 1 mm3 isotropic voxels. The R-fMRI data were acquired via multiband echo-planar imaging (MB-EPI(24)) with the following parameters: volumes=900, TR=645 ms, TE=30 ms, flip angle=60°, 3mm3 isotropic voxels.

Phenotypic Analysis

Data Screening

All self-report data were checked for univariate and multivariate outliers. We also tested the assumption of missingness at random (MAR(25)). Missing data were imputed using an expectation-maximization algorithm (26).

Dimension Reduction

For each participant, we created a multidimensional phenotypic profile using 49 subscale scores obtained from 10 questionnaires. Given modest intercorrelations among the questionnaires, we next performed an exploratory factor analysis to obtain a reduced set of dimensions. Analyses were done on age- and gender-regressed residuals of the 49 subscale scores to minimize the impact of these demographic variables on clustering (27). Parallel analysis (28) of 10,000 permutations of the raw data (29), was used to determine number of factors, comparing eigenvalues for each factor from the raw data to the 95th percentile of eigenvalues from the permutations. Maximum likelihood factor estimation with varimax rotation was used to estimate six factor loadings for each subscale score. Confidence intervals for factor loadings were estimated from 10,000 bootstrapped re-samplings (30). To minimize factor intercorrelations, factor loadings overlapping zero (95% CI) and/or exhibiting values < |0.25| were set to 0 in a restricted model. Factor scores (i.e., equivalent of latent value per factor) were computed using regression estimation (31).

Clustering Analysis

Categorical approaches (e.g., DSM) confer the ability to delineate unique combinations of above-threshold impairments in subsets of individuals. Building on this strength, a central goal of the present work is to identify phenotypically distinct groupings of participants among the larger sample (akin to categories), based on their dimensional 6-factor phenotypic profiles. Specifically, we implemented hybrid hierarchical clustering (HHC)(32) using tree-structured vector quantization (33) to identify nested participant groups based on Euclidean distances between participant factor score profiles. HHC combines agglomerative and divisive clustering by (i) identifying mutual clusters (i.e., groups of data that are exceptionally close to one another and as a group, distant from all others) via agglomerative clustering, (ii) implementing constrained divisive clustering (retaining mutual clusters), and (iii) applying additional divisive clustering, which explores the division of mutual clusters. In combination with visual examination of the dendrogram, we used the Calinski-Harabasz criterion (CHC)(34) at each cluster number to inform cut decisions (i.e., where to divide into subgroups).

Cluster comparisons

Pair-wise comparisons were made among adjacent clusters at each level using EFA, ASR (21), and SCID profiles. EFA scores were fully dimensional, while the ASR and SCID diagnoses were categorical. A categorical version of ASR scores was achieved by mimicking common clinical strategies for its use, which apply cut-scores to identify meaningful psychopathology. For each grouping of participants identified via cluster analysis, we calculated the percentage of individuals exhibiting standardized scores (T-scores) ≥ 60 within each ASR domain. The cutoff of 60 was chosen to increase sensitivity to subthreshold symptoms of potential relevance. EFA scores were compared using independent sample t-tests, while ASR proportions and Psychiatric Diagnoses were compared using chi-square tests. We also conducted t-tests on continuous ASR scores, reported in Supplemental Information.

MRI Data Processing

R-fMRI Preprocessing

Data were preprocessed using the Configurable Pipeline for Analysis of Connectomes (C-PAC; http://fcp-indi.github.io), which combines tools from AFNI (http://afni.nimh.nih.gov/afni), FSL (http://www.fmrib.ox.ac.uk), and Advanced Normalization Tools (ANTs; http://stnava.github.io/ANTs), using Nipype (35). Preprocessing included: (i) motion correction, (ii) mean-based intensity normalization, (iii) nuisance signal regression, (iv) temporal band-pass filtering (0.01–0.1 Hz), (v) co-registration of functional to structural images using boundary-based registration (36) using FSL’s FLIRT (37), (vi) normalization of functional to MNI152 template (38) by applying a nonlinear transform from ANTs, and (vii) smoothing with a full-width-at-half-maximum 6mm Gaussian kernel. Nuisance regression removed linear and quadratic trends to account for scanner drift, 24 motion parameters, and 5 “nuisance” signals, identified via the component correction approach (CompCor (39)).

Multivariate Distance Matrix Regression (MDMR)

At each level of the hierarchy identified via HHC, we used MDMR to compare voxel-wise functional connectivity profiles between adjacent phenotypic groups (e.g., C1 vs. C2, C1a vs. C1b, C2a vs. C2b).

MDMR was performed on a voxel-by-voxel basis. At each voxel, the following three steps were carried out: (i) for each participant, Pearson’s correlations were computed between the target voxel and all other voxels within a specified brain mask; this step generated, for each participant, a whole-brain functional connectivity for the target voxel; (ii) a between-participant distance matrix was computed, in which each entry is the distance between the connectivity maps obtained for the target voxel in two different participants. Distance is defined as √(2 * (1 – r)), where r is the spatial correlation of the connectivity maps obtained at the target voxel in two different individuals; the range of this distance metric is 0 (perfectly correlated) to 2 (perfectly negatively correlated), where 1 reflects no correlation. Importantly, these distances are calculated independent of any phenotypic relationships; (iii) a pseudo-F statistic was computed to provide mathematical evaluation of the relationship between the variability in the distance matrix (40) computed in step ii and the variable of interest (i.e., group membership). Thus the pseudo-F value at each voxel tells us whether the functional connectivity profiles for that voxel varied among individuals as a function of the phenotypic group membership (e.g., C1 vs. C2, C1a vs. C1b, C2a vs. C2b).

MDMR was applied using the ‘Connectir’ package in R (http://czarrar.github.io/connectir) on resampled, 4mm3 isotropic voxels. Computations were constrained to a study-specific group mask, including only voxels present across all participants and contained in a 25% probability gray-matter MNI mask. The MDMR model (at each level of the dendrogram) specified cluster membership (categorical) and age, sex, hand laterality, and mean framewise displacement (FD) as covariates.

Consistent with prior work, voxel-wise significance of the pseudo-F statistic was determined via estimation of the null distribution with random permutation (n=10,000) for each cluster/community comparison (40). Recent work has raised concerns about potentially inflated type I error rates with Random Field Theory (RFT) cluster thresholding approaches (cf. (41, 42)). These concerns are primarily applicable to parametric tests; non-parametric approaches are likely less prone to such inflations. However, we have chosen a more conservative thresholding approach than those used in prior studies implementing MDMR. Specifically, we corrected for multiple comparisons using cluster-based permutation (n=5000) with a height threshold of Z ≥ 2.33 (p < 0.01) and cluster extent probability of p < 0.05.

Given that lower levels of the hierarchy had few participants per group, statistical power in these comparisons would be expected to be notably lower. For illustrative purposes, we repeated analyses using less stringent criteria (voxel-wise p < 0.05; RFT-corrected p < 0.05) and included these results in the Supplementary Information.

Results

Demographic and diagnostic information is provided in Table 1. Details about data screening are provided in the Supplementary Information.

Table 1.

Demographic and Diagnostic Characteristics

Full Sample (n = 347)
Mean Age (SD) 37.5 (13.6)
% Female (n) 66.0 (229)
% Right handed (n) 91.9 (319)

Ethnicity
 % Hispanic/Latino (n) 13.8 (47)

Racial Background
 % White/Caucasian (n) 67.6 (230)
 % Black/African American (n) 20.3 (69)
 % Asian (n) 7.6 (26)
 % Native American/Pacific Islander (n) 1.2 (4)
 % “Other” (n) 3.2 (11)

Lifetime Psychiatric Historya
 % Any Disorder (n) 49.1 (167)
 % Depression (n) 21.2 (72)
 % Anxietyb(n) 9.4 (32)
 % Substance Use Disorder (n) 26.2 (89)
 % ADHD (n) 1.8 (6)

Current Psychiatric Historya
 % Any Disorder (n) 10.9 (37)
 % Depression (n) 2.6 (9)
 % Anxietyb(n) 4.7 (16)
 % Substance Use Disorder (n) 4.1 (14)
 % ADHD (n) 0.9 (3)

Number of Lifetime Psychiatry Diagnoses
 % 0 (n) 52.3 (178)
 % 1 (n) 25.3 (86)
 % 2 (n) 12.4 (42)
 % 3 or more (n) 10.0 (34)
a

Total N = 340; diagnostic information missing for 7

b

Excluding Obsessive Compulsive Disorder and Post Traumatic Stress Disorder

ADHD: Attention-Deficit/Hyperactivity Disorder

Dimension Reduction

Parallel analysis suggested 7 factors (see Figure S1), though the 7th factor exhibited factor loadings whose bootstrap-based 95% confidence intervals overlapped 0. Thus a 6-factor solution was estimated with maximum likelihood-based exploratory factor analysis, accounting for 77.8% of the variance. The constrained model (eliminating low-loading items) accounted for 65.4% of the total variance; standardized factor loadings are presented in Table S3. Correlations between latent factor scores and regression estimates are displayed in Table S4. Primary factor loadings and example items for each subscale are provided in Figure 1. The 6 retained factors were interpreted as follows (see Figure 1): (i) General Distress & Impairment, (ii) Conscientiousness, (iii) Sensation & Risk Seeking, (iv) Frustration Intolerance, (v) Contextual Sensitivity, and (vi) Neuroticism & Negative Affect. Details about each of the factors are provided in Supplementary Information.

Figure 1. Factors identified by bootstrap-based maximum likelihood exploratory factor analysis.

Figure 1

Factors and their corresponding subscales (along with example items) identified by exploratory analysis with 10,000 bootstrap resamplings. Subscale names are provided in the middle column. Sample items for each subscale are provided in the far right column. Color bar on the left provides an index to the shading of each subscale relative to its standardized loading on the factor. ATQ = Adult Temperament Questionnaire; BDI = Beck Depression Inventory, 2nd Edition; CAARS = Conners’ Adult ADHD Rating Scale; DOSP = Domain-Specific Risk-Taking Scale; NEO = NEO - Five Factor Inventory; ICU = Inventory of Callous and Unemotional Traits; IRI = Interpersonal Reactivity Index; STAI = Spielberger State Trait Anxiety Inventory; TSC40 = Trauma Symptom Checklist; UPPS = Impulsive Behavior Scale. Notes: (R) = reverse-scored item in scale. aThese sample items were selected from the response option corresponding to 2 on a 0–3 scale.

Cluster Analysis

Visual inspection of the dendrogram suggested three clear cut-points, yielding 2, 4, and 8 groups, respectively. The largest CHC value was observed at k=2 and stable subgroups (operationalized as values of k wherein CHC did not change appreciably from the prior solution (i.e., local minima)) at k=4 and k=8. Visual examination of participant-by-participant squared Euclidean distance matrices supported the face validity of these cut-points (see Figure 2).

Figure 2. Results of hierarchical clustering.

Figure 2

Panels in this figure depict the decision-criteria used to ascertain clustering levels and resultant groups, as well as the similarity and dissimilarity of the groups as a function of correlation and squared Euclidean distance between factor scores (subject-by-subject). Panel A shows the dendrogram resulting from Hybrid Hierarchical Clustering. It also shows the various levels at which the dendrogram was cut and the resultant groups. Panel B shows the Calinski-Harabasz Criterion (CHC; black line), used as a decision aid for dendrogram cutting, as a function of cluster number. The red line depicts percent change in the CHC value from one cluster to the next. Since CHC did not exhibit a typical pattern (i.e., elevation at some cluster level), we defined stability (i.e., minimal change from one cluster number to the next) as our goal in deciding where to cut the dendrogram. Panel C again depicts the dendrogram, but relative to the squared Euclidean distance matrix. Groups and subgroups are outlined with dashed lines to help visualize group membership and increased similarity/decreased dissimilarity.

Phenotypic Cluster Differences

At all 3 levels of the dendrogram (i.e., 2-, 4- and 8- cluster solutions), the participant groups differed from one another with respect to their phenotypic profiles. Due to space limitations, we limit reported findings beyond the first level (C1 vs. C2) to those along the C2 arm (which exhibited more psychopathology-like patterns). Phenotypic results along the C1 arm, as well as for both groups together are provided in Supplementary Information.

Level 1

Results at level 1 were robust, reflecting broad-reaching group differences that spanned nearly all domains. More specifically, cluster 1 (C1) participants exhibited higher levels of adaptive functionality and cluster 2 (C2) higher levels of maladaptive functionality; significant differences were noted in nearly all measures included in the three phenotypic profiles examined (Exploratory Factor Analysis [EFA], Achenbach Adult Self-Report [ASR], Psychiatric Diagnosis [SCID]) (see Figures 3, S2). To facilitate comparison, we also depicted phenotypic findings as heatmaps (and using a continuous score on the ASR) in Figure S3.

Fig. 3. Factor, clinical symptom, and lifetime psychiatric profiles visualized as radar plots by cluster/group at three levels of hierarchical clustering, showing expansion within C2.

Fig. 3

All panels in this figure represent different measures pertaining to the same clusters (i.e., C2 is the same group of individuals, showing variation in factor profiles – Panel A; ASR clinical symptom profiles – Panel B; and lifetime psychiatric diagnosis – Panel C). Panel A represents mean values by cluster for each of the 6 factors from the exploratory factor analysis. Plots represent a standard loading of −1.5 at the origin and 1.5 at the maximum for each of the 6 factors. Panel B represents percent of individuals within a cluster exhibiting T-scores ≥ 60 (1 standard deviation above the mean; approaching clinical importance) for 8 domains from the Achenbach Adult Self-Report. Plots represent 0 at the center and 25% at the periphery (unless otherwise denoted) for each of the 8 domains. Panel C represents percent of individuals within a cluster exhibiting a lifetime (i.e., past or current) psychiatric diagnosis (ANY = any diagnosis; DEPR = depressive disorder; ANX = anxiety disorder, excluding OCD and PTSD; SUD = substance use disorder; ADHD = attention-deficit/hyperactivity disorder). Where significant differences in current psychiatric diagnosis were observed, group percentages and significance is represented next to the diagnosis in italics. Plots represent 0 at the center and 60% at the periphery (unless otherwise denoted) for each of the 5 diagnostic categories. Note that diagnoses are not mutually exclusive. Significant group differences are represented by asterisks; * p < .05, ** p < .01, ***p < .001.

Level 2

The second level (k=4) subdivided C1 (C1a and C1b) and C2 (C2a and C2b). C2 (functionally maladaptive group) was further divided into internalizing (C2a) and externalizing (C2b) problem characteristics. C2a had significantly higher ASR scores across all internalizing domains (see Fig S3). C2a also exhibited significantly higher rates of any lifetime psychiatric diagnosis and lifetime depression. C2b exhibited significantly higher levels of Sensation and Risk Seeking on the EFA Factor Profile and significantly higher levels of ASR externalizing problems (see Figure 3).

Level 3

The third level (k=8) divided the 4 clusters from level 2 into 8 total sub-clusters (two clusters at level 3 for each cluster at level 2). Significant pairwise differences in ASR domains were largely a difference of magnitude (see Figures 3, S2, S3). Overall there were few significant pairwise differences between sub-clusters in DSM diagnoses, though notably C2a2 exhibited more current psychopathology than C2a1.

Multivariate Intrinsic Connectivity Differences Among Clusters

With permutation-based cluster correction, only MDMR findings from the first level (C1 vs. C2) survived multiple comparison correction (see Figure 4; see Table S8 for functional peaks). This is not surprising given the larger number of participants in each group at the first level (C1-functionally adaptive: n = 165; C2-functionally maladaptive: n =115), compared to the lower levels, which subdivided the sample into 4 and 8 groups, respectively.

Figure 4. Results from multivariate distance matrix analysis of the functional connectome between C1 and C2.

Figure 4

Adjacent groups at the highest level of hierarchical clustering (level 1: nC1=165, nC2=115) are displayed. Rendered brains and axial slices reflect multivariate distance matrix regression comparing intrinsic connectivity between groups; findings represent conversion of pseudo-F test results to Z values via permutation testing (10,000 resamplings of data) and permutation-based cluster correction (5,000 resamplings of data) with cluster formation set at p < .01 and extent threshold set at p < .05. Note that images are presented in neurological convention (L=R, R=L). The bottom row of axial slices represents significant findings at MNI axis Z values of −30, −20, −10, 0, and 10. From left to right, top to bottom, the top 2/3 of the figure depicts the lateral and medial surface of the right hemisphere, the dorsal and ventral surface of the right, then left, hemisphere, and finally, the dorsal and medial surface of the left hemisphere. Only Level 1 results survived cluster permutation testing correction for multiple comparisons.

Three clusters were identified for the first level comparison of C1 vs. C2. The largest cluster (k=133,248mm3, p=.0005) included the bilateral primary and secondary somatosensory cortices, as well as premotor, motor, and supplementary motor regions and was approximately centered on the midline near the supplementary motor area (X=0, Y=−20, Z=54); it also extended bilaterally to the lateral temporal lobes. These regions have been implicated in bodily self-awareness (43) and interoception (44), and identified as comprising a major hub (45) and critical network (46) of the functional brain. Additionally, emerging evidence suggests an important role for the somatosensory/somatomotor hub in high prevalence psychopathology (e.g., (47, 48)).

The second largest cluster (k=27,323mm3, p=.0061) was approximately centered on the left thalamus (X=−16, Y=−20, Z=14). In addition to a large thalamic contribution, the cluster included limbic regions (e.g., hippocampus, amygdala), decision-making regions (e.g., caudate, putamen), and various language (e.g., lingual gyrus) and vision regions (e.g., fusiform gyrus). It also extended from the left to right thalamus into the right caudate and putamen. The cluster comprised thalamic and basal ganglia regions commonly implicated in models of mental illness that emphasize thalamocortical and frontostriatal contributions (e.g., (48, 49))

Finally, the third cluster (k=14,528mm3, p=.0167) was approximately centered on the right hippocampus (X=32, Y=−24, Z=−18) and was largely comprised of right limbic regions (e.g., hippocampus and amygdala), as well as the caudate/putamen, fusiform gyrus, and middle and posterior insula. The regions implicated in this cluster, especially the right hippocampus and amygdala, are commonly associated with automated emotional processing (e.g., (50)), particularly of the kind related to high prevalence psychopathological alterations (e.g., (51)). Regions herein have also been identified as part of the medial temporal lobe subsystem of the default mode network (DMN)(52).

See supplementary materials for MDMR findings obtained at the 2nd and 3rd level of the dendrogram using a more liberal thresholding strategy.

Discussion

Traditional psychiatric nosology comprises heterogeneous categories with few meaningful neurobiological correlates. The present findings illustrate the utility of data-driven approaches to: (i) derive relevant phenotypic dimensions from diverse measures, (ii) identify interpretable groups from hierarchical clustering of dimensional variables, and (iii) use nested groups to examine potential neurobiological differences.

Our findings suggest that inclusion of instruments with normal distributions (i.e., not truncated due to an assessment floor, such as an absence of symptoms; e.g., the NEO-FFI, ATQ) can be critical to defining relevant groups in population-based classification and among high-prevalence conditions (e.g., anxiety, depression, substance use). While most individuals in the functionally adaptive group had truncated scores on syndrome-focused or problem-focused measures, our inclusion of bipolar scales allowed us to delineate further subgroups. The present work also highlights the value of including assessment tools that capture positive or protective factors rather focusing solely on syndrome characteristics or problems. For example, conscientiousness, a personality characteristic associated with health and well-being (53), was a large contributor to differentiating groups at the highest and lowest levels of the nested hierarchical classification. Protective factors are rarely assessed in syndrome- or problem-focused assessments, though they can have important implications for presentation and prognosis among neuropsychiatric conditions (54).

Within the limitations of the current sample size, the present work demonstrated the value of pursuing nested subgroups within both the functionally maladaptive (C2) and functionally adaptive (C1) participant groupings. Of concern, it is possible that the groups merely recapitulated the theories on which some of the measures were predicated. It is important to note, however, that no single scale had its sub-scores distributed in a manner that could explain all components derived from the factor analysis (arguably, the NEO-FFI came the closest). We observed novel combinations of subscales in the factors derived, reflecting a range of psychiatric symptoms, personality features, and protective/risk factors.

The present data also show the value of using broad behavioral characteristics to identify groups to examine potential neurobiological differences. Multivariate comparisons of intrinsic connectivity were implemented at three different clustering levels, though the findings only passed stringent multiple comparisons correction at the first level, where power was the largest. The connectome differences observed between the functionally maladaptive and adaptive groups (C1 vs. C2) at the first level were evident within (i) the somatomotor network, (ii) thalamic and basal ganglia regions, and (iii) the amygdala and extended hippocampal complex.

The somatomotor network is a somewhat novel functional target in the context of identifying potential imaging biomarkers of the tendency towards psychiatric illness. While known as a key hub (45, 46) of the functional brain, the potential role in psychopathology of the somatomotor network is only beginning to be recognized (47, 48). The somato-sensory/motor network is intimately involved in bodily self-consciousness and interoception (43, 44), processes that are increasingly implicated in predictive outcome models (55), especially for neuropsychiatric illness (56). At a coarse level of group differentiation (e.g., functionally maladaptive vs. functionally adaptive), these results underscore the potential importance of subjective valuation and bodily states in making interactional predictions that may be fundamentally altered as part of the pathophysiology of psychiatric illness.

Somewhat less novel, connectomic alterations in the thalamus and basal ganglia, as well as the amygdala and extended hippocampal complex, underscore the importance of basic cognitive functions (e.g., attention, working memory), as well as reward and emotion/saliency in delineating adaptive from maladaptive function. This latter finding lends support to the RDoC-style approach to examining domains rather than syndromes or diseases. On the whole, the intrinsic connectivity findings indicated potential new targets associated with adaptive and maladaptive function, while affirming existing targets, which suggests neurobiological validity of our data-driven group assignments.

Limitations

A number of steps were taken to protect against over- or under-fitting the number of factors and provide robust estimates of which subscales loaded on which dimension. Factor number was determined via parallel analysis, factor structure was based on bootstrapping the raw data, and bootstrap-based confidence intervals were computed for estimates of standardized factor loadings. Despite all this, a separate confirmatory sample will be necessary to validate the replicability of the factor structure and the groups identified.

The different response formats of the questionnaires could potentially have lead to clustering of items by comparable response format and hence, recapitulation of the original scale factor structure. There was little evidence of such a result, though. To overcome clustering as a function of similar response formats, all questions would have to be administered with the same response format. However, such an approach would require extremely large samples and known psychometric properties of the scales would be forfeited.

Significant connectome-wide differences decreased dramatically beyond the first level of hierarchical classification. Results at lower levels were only significant with less stringent, RFT multiple comparisons correction (see Supplementary Information). There are at least three plausible reasons for this decrease in significant findings: sample size, group homogeneity, and demographic differences. Average sample size was halved for each incremental level of the hierarchy. Thus, one contributor may merely be decreased power. Another potential factor is increasing subgroup homogeneity: group factor profiles became progressively more similar to one another at lower levels of the hierarchy. Demographic differences may also contribute to smaller connectome-wide differences at more refined levels of subgroup detection; groups may become more demographically similar as they become more behaviorally homogenous. However, cluster analyses were conducted on age and gender regression-residuals, mitigating some concerns regarding the influence of these demographic variables.

While biomarker differences may diminish as subgroups become more similar, it is exactly these kinds of comparisons that will ultimately permit differential classification based on differences in neurobiology (57). Larger samples within specific diagnostic categories and/or problem domains will likely provide more power to detect differences among more similar groups (e.g., (18, 19, 58)) and can also remedy some of the problems that arise in exploratory analyses requiring complex multiple comparisons correction. By combining population-based, data-driven categorization and diagnostically focused pattern assessments, we can begin to compare the value of different classification methods. Currently, diagnostic heterogeneity is so marked that only extremely large samples (58) or time-consuming separation into individual diagnostic criteria permit head-to-head comparison of both methods (59).

Finally, while the present work focused on phenotypic information alone for group classification/detection, alternative approaches might permit classification based entirely on neurobiology or on a combination of neurobiology and behavioral features. We opted to avoid classification purely on neurobiology as such approaches are complex (60), and interpretation of behavioral characteristics can be tricky when groups are clustered entirely on neurobiology. A potentially valuable alternative is analytic approaches that simultaneously consider phenotypic and neurobiological information (61).

Conclusions

Psychiatry is limited by a lack of validated biobehavioral tests and extensive heterogeneity within diagnostic categories. In addition to consensus-based approaches to reframing the current nosology (e.g., RDoC), data-driven approaches to delineating homogenous subgroups, spanning adaptive to maladaptive function can yield clinically meaningful groups with potentially important neurobiological differences. In doing so, it will be important to consider not just symptoms of psychiatric illness (i.e., deficits or problems), but also features of psychiatric health (i.e., strengths or protective factors). Examination of a broad array of phenotypic characteristics, in combination with neurobiological differences may improve our understanding of the pathophysiology of mental illness and provide new preventative and treatment strategies. While these analyses will require large samples and advanced analytic approaches, the present study is evidence that such efforts can yield new hypotheses, as well as support existing theories, helping to focus biological psychiatry on those areas that may yield the highest return on investment.

Supplementary Material

supplement

Acknowledgments

This work was funded by NIMH R01MH094639, R01MH081218, R01MH083246, R21MH084126, with additional funding for personnel and administrative support provided by grants from the Child Mind Institute, the New York State Office of Mental Health, the Research Foundation for Mental Hygiene, the Brain Research Foundation, and the Stavros Niarchos Foundation. Additional funds were provided by gifts from Phyllis Green, Randolph Cowen, and Joseph P. Healey. All data is publicly available via the International Neuroimaging Data-Sharing Initiative (INDI). Data can be accessed at http://fcon_1000.projects.nitrc.org/ A copyright transfer agreement is required for access to detailed phenotypic information.

Footnotes

Financial Disclosures

All authors report no biomedical financial interests or potential conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Wakefield JC. Misdiagnosing normality: Psychiatry’s failure to address the problem of false positive diagnoses of mental disorder in a changing professional environment. J Ment Health. 2010;19:337–351. doi: 10.3109/09638237.2010.492418. [DOI] [PubMed] [Google Scholar]
  • 2.Cuthbert B, Insel T. Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 2013;11:126. doi: 10.1186/1741-7015-11-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.American Psychiatric Association A. Diagnostic and Statistical Manual of Mental Disorders. 5. Washington, DC: American Psychiatric Publishing; 2013. (DSM-5) [Google Scholar]
  • 4.Hyman SE. The diagnosis of mental disorders: the problem of reification. Annu Rev Clin Psychol. 2010;6:155–179. doi: 10.1146/annurev.clinpsy.3.022806.091532. [DOI] [PubMed] [Google Scholar]
  • 5.Insel TR. The NIMH Research Domain Criteria (RDoC) Project: precision medicine for psychiatry. Am J Psychiatry. 2014;171:395–397. doi: 10.1176/appi.ajp.2014.14020138. [DOI] [PubMed] [Google Scholar]
  • 6.Casey BJ, Craddock N, Cuthbert BN, Hyman SE, Lee FS, Ressler KJ. DSM-5 and RDoC: progress in psychiatry research? Nat Rev Neurosci. 2013;14:810–814. doi: 10.1038/nrn3621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kraemer HC, Noda A, O’Hara R. Categorical versus dimensional approaches to diagnosis: methodological challenges. J Psychiatr Res. 2004;38:17–25. doi: 10.1016/s0022-3956(03)00097-9. [DOI] [PubMed] [Google Scholar]
  • 8.Widiger TA, Samuel DB. Diagnostic categories or dimensions? A question for the Diagnostic And Statistical Manual Of Mental Disorders--fifth edition. J Abnorm Psychol. 2005;114:494–504. doi: 10.1037/0021-843X.114.4.494. [DOI] [PubMed] [Google Scholar]
  • 9.Yee CM, Javitt DC, Miller GA. Replacing dsm categorical analyses with dimensional analyses in psychiatry research: The research domain criteria initiative. JAMA psychiatry. 2015:1159–1160. doi: 10.1001/jamapsychiatry.2015.1900. [DOI] [PubMed] [Google Scholar]
  • 10.First MB. Clinical utility: a prerequisite for the adoption of a dimensional approach in DSM. J Abnorm Psychol. 2005;114:560–564. doi: 10.1037/0021-843X.114.4.560. [DOI] [PubMed] [Google Scholar]
  • 11.Weinberger DR, Glick ID, Klein DF. Whither research domain criteria (rdoc)?: The good, the bad, and the ugly. JAMA psychiatry. 2015:1161–1162. doi: 10.1001/jamapsychiatry.2015.1743. [DOI] [PubMed] [Google Scholar]
  • 12.Biswal BB, Mennes M, Zuo X-N, Gohel S, Kelly C, Smith SM, et al. Toward discovery science of human brain function. Proceedings of the National Academy of Sciences. 2010;107:4734–4739. doi: 10.1073/pnas.0911855107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Akil H, Martone ME, Van Essen DC. Challenges and Opportunities in Mining Neuroscience Data. Science (New York, NY) 2011;331:708–712. doi: 10.1126/science.1199305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kraemer H. REsearch domain criteria (rdoc) and the dsm—two methodological approaches to mental health diagnosis. JAMA Psychiatry. 2015;72:1163–1164. doi: 10.1001/jamapsychiatry.2015.2134. [DOI] [PubMed] [Google Scholar]
  • 15.Insel TR, Cuthbert BN. Brain disorders? Precisely. Science. 2015;348:499–500. doi: 10.1126/science.aab2358. [DOI] [PubMed] [Google Scholar]
  • 16.McGorry P, Keshavan M, Goldstone S, Amminger P, Allott K, Berk M, et al. Biomarkers and clinical staging in psychiatry. World Psychiatry. 2014;13:211–223. doi: 10.1002/wps.20144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wardenaar KJ, de Jonge P. Diagnostic heterogeneity in psychiatry: towards an empirical solution. BMC Med. 2013;11:201. doi: 10.1186/1741-7015-11-201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Fair DA, Bathula D, Nikolas MA, Nigg JT. Distinct neuropsychological subgroups in typically developing youth inform heterogeneity in children with ADHD. Proceedings of the National Academy of Sciences. 2012;109:6769–6774. doi: 10.1073/pnas.1115365109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Karalunas SL, Fair D, Musser ED, Aykes K, Iyer SP, Nigg JT. Subtyping attention-deficit/hyperactivity disorder using temperament dimensions: Toward biologically based nosologic criteria. JAMA Psychiatry. 2014;71:1015–1024. doi: 10.1001/jamapsychiatry.2014.763. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 20.Nooner KB, Colcombe S, Tobe R, Mennes M, Benedict M, Moreno A, et al. The NKI-Rockland Sample: A Model for Accelerating the Pace of Discovery Science in Psychiatry. Front Neurosci. 2012:6. doi: 10.3389/fnins.2012.00152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Achenbach TM, Rescorla LA. Manual for the ASEBA adult forms & profiles. Burlington, VT, USA: Research Center for Children, Youth, & Familes, University of Vermont; 2003. [Google Scholar]
  • 22.First MB, Spitzer RL, Gibbon M, Williams JBW. Structured Clinical Interview for DSM-IV Axis I Disorders. New York: Biometrics Research, New York State Psychiatric Institute; 1996. [Google Scholar]
  • 23.Oldfield RC. The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia. 1971;9:97–113. doi: 10.1016/0028-3932(71)90067-4. [DOI] [PubMed] [Google Scholar]
  • 24.Xu J, Moeller S, Auerbach EJ, Strupp J, Smith SM, Feinberg DA, et al. Evaluation of slice accelerations using multiband echo planar imaging at 3 T. Neuroimage. 2013;83:991–1001. doi: 10.1016/j.neuroimage.2013.07.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Potthoff RF, Tudor GE, Pieper KS, Hasselblad V. Can one assess whether missing data are missing at random in medical studies? Stat Methods Med Res. 2006;15:213–234. doi: 10.1191/0962280206sm448oa. [DOI] [PubMed] [Google Scholar]
  • 26.Gold MS, Bentler PM. Treatments of Missing Data: A Monte Carlo Comparison of RBHDI, Iterative Stochastic Regression Imputation, and Expectation-Maximization. Structural Equation Modeling: A Multidisciplinary Journal. 2000;7:319–355. [Google Scholar]
  • 27.Hudziak JJ, Achenbach TM, Althoff RR, Pine DS. A dimensional approach to developmental psychopathology. Int J Methods Psychiatr Res. 2007;16(Suppl 1):S16–23. doi: 10.1002/mpr.217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Horn JL. A rationale and test for the number of factors in factor analysis. Psychometrika. 1965;30:179–185. doi: 10.1007/BF02289447. [DOI] [PubMed] [Google Scholar]
  • 29.Castellan NJ. Shuffling arrays: Appearances may be deceiving. Behav Res Methods Instrum Comput. 1992;24:72–77. [Google Scholar]
  • 30.Zientek L, Thompson B. Applying the bootstrap to the multivariate case: Bootstrap component/factor analysis. Behav Res Methods. 2007;39:318–325. doi: 10.3758/bf03193163. [DOI] [PubMed] [Google Scholar]
  • 31.Grice JW. Computing and evaluating factor scores. Psychol Methods. 2001;6:430–450. [PubMed] [Google Scholar]
  • 32.Chipman H, Tibshirani R. Hybrid hierarchical clustering with applications to microarray data. Biostatistics. 2006;7:286–301. doi: 10.1093/biostatistics/kxj007. [DOI] [PubMed] [Google Scholar]
  • 33.Gersho A, Gray RM. Vector quantization and signal compression. Kluwer Academic Publishers; 1992. [Google Scholar]
  • 34.Caliński T, Harabasz J. A dendrite method for cluster analysis. Communications in Statistics. 1974;3:1–27. [Google Scholar]
  • 35.Gorgolewski K, Burns CD, Madison C, Clark D, Halchenko YO, Waskom ML, et al. Nipype: A flexible, lightweight and extensible neuroimaging data processing framework. Front Neuroinform. 2011:5. doi: 10.3389/fninf.2011.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Greve DN, Fischl B. Accurate and Robust Brain Image Alignment using Boundary-based Registration. Neuroimage. 2009;48:63–72. doi: 10.1016/j.neuroimage.2009.06.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Jenkinson M, Bannister P, Brady M, Smith S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage. 2002;17:825–841. doi: 10.1016/s1053-8119(02)91132-8. [DOI] [PubMed] [Google Scholar]
  • 38.Avants BB, Epstein CL, Grossman M, Gee JC. Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med Image Anal. 2008;12:26–41. doi: 10.1016/j.media.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Behzadi Y, Restom K, Liau J, Liu TT. A Component Based Noise Correction Method (CompCor) for BOLD and Perfusion Based fMRI. Neuroimage. 2007;37:90–101. doi: 10.1016/j.neuroimage.2007.04.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Reiss PT, Stevens MHH, Shehzad Z, Petkova E, Milham MP. On Distance-Based Permutation Tests for Between-Group Comparisons. Biometrics. 2010;66:636–643. doi: 10.1111/j.1541-0420.2009.01300.x. [DOI] [PubMed] [Google Scholar]
  • 41.Woo C-W, Krishnan A, Wager TD. Cluster-extent based thresholding in fMRI analyses: Pitfalls and recommendations. Neuroimage. 2014;91:412–419. doi: 10.1016/j.neuroimage.2013.12.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Eklund A, Nichols T, Knutsson H. Can parametric statistical methods be trusted for fMRI based group studies? arXiv preprint arXiv:151101863 2015 [Google Scholar]
  • 43.Blanke O. Multisensory brain mechanisms of bodily self-consciousness. Nat Rev Neurosci. 2012;13:556–571. doi: 10.1038/nrn3292. [DOI] [PubMed] [Google Scholar]
  • 44.Craig AD. How do you feel? Interoception: the sense of the physiological condition of the body. Nat Rev Neurosci. 2002;3:655–666. doi: 10.1038/nrn894. [DOI] [PubMed] [Google Scholar]
  • 45.Tomasi D, Shokri-Kojori E, Volkow ND. High-Resolution Functional Connectivity Density: Hub Locations, Sensitivity, Specificity, Reproducibility, and Reliability. Cereb Cortex. 2015 doi: 10.1093/cercor/bhv171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Smith SM, Fox PT, Miller KL, Glahn DC, Fox PM, Mackay CE, et al. Correspondence of the brain’s functional architecture during activation and rest. Proc Natl Acad Sci U S A. 2009;106:13040–13045. doi: 10.1073/pnas.0905267106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Gabbay V, Ely BA, Li Q, Bangaru SD, Panzer AM, Alonso CM, et al. Striatum-based circuitry of adolescent depression and anhedonia. J Am Acad Child Adolesc Psychiatry. 2013;52:628–641.e613. doi: 10.1016/j.jaac.2013.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Cui H, Zhang J, Liu Y, Li Q, Li H, Zhang L, et al. Differential alterations of resting-state functional connectivity in generalized anxiety disorder and panic disorder. Hum Brain Mapp. 2016;37:1459–1473. doi: 10.1002/hbm.23113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Buckholtz Joshua W, Meyer-Lindenberg A. Psychopathology and the Human Connectome: Toward a Transdiagnostic Model of Risk For Mental Illness. Neuron. 2012;74:990–1004. doi: 10.1016/j.neuron.2012.06.002. [DOI] [PubMed] [Google Scholar]
  • 50.Dyck M, Loughead J, Kellermann T, Boers F, Gur RC, Mathiak K. Cognitive versus automatic mechanisms of mood induction differentially activate left and right amygdala. Neuroimage. 2011;54:2503–2513. doi: 10.1016/j.neuroimage.2010.10.013. [DOI] [PubMed] [Google Scholar]
  • 51.Fairhall SL, Sharma S, Magnusson J, Murphy B. Memory related dysregulation of hippocampal function in major depressive disorder. Biol Psychol. 2010;85:499–503. doi: 10.1016/j.biopsycho.2010.09.002. [DOI] [PubMed] [Google Scholar]
  • 52.Andrews-Hanna JR, Reidler JS, Sepulcre J, Poulin R, Buckner RL. Functional-anatomic fractionation of the brain’s default network. Neuron. 2010;65:550–562. doi: 10.1016/j.neuron.2010.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Bogg T, Roberts BW. Conscientiousness and health-related behaviors: a meta-analysis of the leading behavioral contributors to mortality. Psychol Bull. 2004;130:887–919. doi: 10.1037/0033-2909.130.6.887. [DOI] [PubMed] [Google Scholar]
  • 54.Belcher AM, Volkow ND, Moeller FG, Ferré S. Personality traits and vulnerability or resilience to substance use disorders. Trends Cogn Sci. 2014;18:211–217. doi: 10.1016/j.tics.2014.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Barrett LF, Simmons WK. Interoceptive predictions in the brain. Nat Rev Neurosci. 2015;16:419–429. doi: 10.1038/nrn3950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Seth AK. Interoceptive inference, emotion, and the embodied self. Trends Cogn Sci. 2013;17:565–573. doi: 10.1016/j.tics.2013.09.007. [DOI] [PubMed] [Google Scholar]
  • 57.Kapur S, Phillips AG, Insel TR. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol Psychiatry. 2012;17:1174–1179. doi: 10.1038/mp.2012.105. [DOI] [PubMed] [Google Scholar]
  • 58.Clementz BA, Sweeney JA, Hamm JP, Ivleva EI, Ethridge LE, Pearlson GD, et al. Identification of Distinct Psychosis Biotypes Using Brain-Based Biomarkers. Am J Psychiatry. 2016;173:373–384. doi: 10.1176/appi.ajp.2015.14091200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Borsboom D, Cramer AOJ. Network Analysis: An Integrative Approach to the Structure of Psychopathology. Annu Rev Clin Psychol. 2013;9:91–121. doi: 10.1146/annurev-clinpsy-050212-185608. [DOI] [PubMed] [Google Scholar]
  • 60.Yang Z, Xu Y, Xu T, Hoy CW, Handwerker DA, Chen G, et al. Brain Network Informed Subject Community Detection In Early-Onset Schizophrenia. Sci Rep. 2014:4. doi: 10.1038/srep05549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Chen G, Sullivan PF, Kosorok MR. Biclustering with heterogeneous variance. Proceedings of the National Academy of Sciences. 2013;110:12253–12258. doi: 10.1073/pnas.1304376110. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

RESOURCES