Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Apr 1.
Published in final edited form as: Psychiatry Res. 2021 Feb 13;298:113805. doi: 10.1016/j.psychres.2021.113805

Multifactorial Prediction of Depression Diagnosis and Symptom Dimensions

Mary E McNamara a, Jason Shumake a, Rochelle A Stewart a, Jocelyn Labrada a, Alexandra Alario a, John JB Allen b, Rohan Palmer c, David M Schnyer a, John McGeary d, Christopher G Beevers a
PMCID: PMC8042639  NIHMSID: NIHMS1679727  PMID: 33647705

Abstract

While depression is a leading cause of disability, prior investigations of depression have been limited by studying correlates in isolation. A data-driven approach was applied to identify out-of-sample predictors of current depression from adults (N = 217) sampled on a continuum of no depression to clinical levels. The current study used elastic net regularized regression and predictors from sociodemographic, self-report, polygenic scores, resting electroencephalography, pupillometry, actigraphy, and cognitive tasks to classify individuals into currently depressed (MDE), psychiatric control (PC), and no current psychopathology (NP) groups, as well as predicting symptom severity and lifetime MDE. Cross-validated models explained 20.6% of the out-of-fold deviance for the classification of MDEs versus PC, 33.2% of the deviance for MDE versus NP, but −0.6% of the deviance between PC and NP. Additionally, predictors accounted for 25.7% of the out-of-fold variance in anhedonia severity, 65.7% of the variance in depression severity, and 12.9% of the deviance in lifetime depression (yes/no). Self-referent processing, anhedonia, and psychosocial functioning emerged as important differentiators of MDE and PC groups. Findings highlight the advantages of using psychiatric control groups to isolate factors specific to depression.

Keywords: statistical learning, psychiatric control, classification

1. Introduction

Globally, depression is estimated to affect 300 million individuals and is the leading cause of disability worldwide (World Health Organization & Others, 2017). Even more concerning, prevalence rates have remained static, despite increased accessibility of treatment over the last four decades (Ormel et al., 2019). Even the best available treatments are largely unsuccessful at producing lasting outcomes, as approximately 40–50% of patients relapse within 1–2 years of receiving treatment (Dobson et al., 2008; Paykel, 2007; Vittengl et al., 2007). This disparity highlights a clear need to better understand the critical processes associated with the maintenance of depression.

Several factors may have hampered progress in this area. Traditionally, depression has been studied using one or two units of analysis (e.g., self-report, neurocognitive assessments, neuroimaging), and rarely do studies examine multiple mechanisms simultaneously. For instance, few studies have included objective measures of biological risk factors in combination with behavioral or self-report data (Kaplan, Roberts, Camacho, & Coyne, 1987). Such work often requires interdisciplinary collaboration, which until relatively recently was somewhat uncommon in psychiatric research. Additionally, psychology has emphasized monocausal theories, undoubtedly in a hopeful attempt to mirror general medicine’s success at discovering specific etiologies (e.g. pathogens engendering infectious disease; Kendler, 2019). As a result, truly integrative cross-level research in psychiatric science has been rare (Kendler, 2014), even though it is necessary to move the field forward (Fried & Robinaugh, 2020).

Work in this area has also focused primarily on developing explanatory models at the expense of models that can predict outcomes in a new sample (Yarkoni & Westfall, 2017). It is important to highlight prediction in this context refers not to prediction over time (e.g. longitudinal prediction), but expected performance in out-of-sample data the model was not trained on (e.g. predictive R2). Models built for these distinct purposes often differ in important ways—while an explanatory model may be more elegant and offer more satisfying explanations, these models may perform poorly with out-of-sample data and fail to generalize, due to overfitting of the algorithm to statistical noise specific to the dataset. On the other hand, prediction-based models may generalize with better accuracy to new data but may lack clarity about how the mechanisms give rise to the outcomes. Applied to the problem at hand, an emphasis on prediction would allow for multiple types of data from a variety of theories to predict the presence of depression, without getting bogged down by a lack of theory regarding how these components may entangle and interact with each other.

Increasingly popular machine learning techniques have made testing multifactorial models of depression (i.e. models incorporating multiple factors including biological, genetic, psychosocial, and cognitive factors) a possibility. In machine learning, the prediction of “unseen” data (data not used to train the model) is treated as the gold standard of success, an additional reason the use of machine learning algorithms is an attractive choice (Yarkoni & Westfall, 2017). Data-driven analyses of multifactorial datasets show great promise towards improving and testing models of depression and unveiling prospective treatment targets. Preliminary multifactorial work has identified correlates of depression outcomes, but to date, similar research has not distinguished the best variables that differentiate depression (Dinga et al., 2018).

As a first step to identify the most important predictors, it is necessary to tease apart specificity to depression (Fried & Kievit, 2016). To this end, psychiatric control groups should be used in place of (or at least in addition to) healthy controls. The lack of a psychiatric control group fails to tease apart which factors are specific to depression versus the presence of negative affect, impairment, and other nonspecific factors associated with having a mental health condition (Fritzsche et al., 2010).

Moreover, comparison groups with stricter inclusion and exclusion criteria than the case group have the potential to be sampled from an entirely different population. To use language from case-control studies, the only difference between a ‘case’ and a ‘control’ should be the development of the condition; both groups should have been “exposed” to the same conditions. A better alternative is the addition of a psychiatric control group in which the sample meets for the exact same inclusion and exclusion criteria as the mental illness in question, except for the focal feature (Schwartz & Susser, 2011).

To address these issues, we applied elastic net regularized regression to identify variables that predicted depression both categorically and continuously in a secondary analysis of a large, multifactorial dataset. Elastic net was used to predict three main outcomes: categorical grouping of MDE vs. psychiatric control vs. no current psychopathology, continuous measures of symptom severity, and lifetime depression status. Predictors in the model included a large range of demographic (e.g., sex, age) and self-report (e.g., rumination, psychosocial functioning) data, resting state EEG (i.e., frontal alpha asymmetry), actigraphy (i.e., daily physical activity, sleep consistency), polygenic risk scores (i.e., derived using a broad depression phenotype from the UK Biobank (Howard et al., 2018), and cognitive-affective task (e.g., self-referent negative processing, negative attention bias) data.

2. Methods and Materials

2.1. Participants

Data were collected from 218 unrelated adults; however, 1 participant was missing 25% or more of their data and was excluded from the analyses (further detail on handling of missing data can be found in the supplementary materials). The present study included a sample of 217 adults recruited from the Austin, Texas community. Of these, 60 were included in the MDE group, 54 in the Psychiatric Control, and 101 in the No Current Psychopathology group (we were unable to obtain diagnostic data from two individuals, so they were not used in the analyses, but were used in the models predicting anhedonia and depression continuously). Since there was a genetic component to the primary aim of the study, the sample was limited to individuals of European ancestry. Consistent with dimensional approaches to psychopathology (Gibb et al., 2004), participants were recruited along a continuum of no symptomology to clinical levels, to approximate a normally distributed sample of depression. This required over-sampling the depressed end of that continuum, so our sample was not random in that regard.

Data were collected at two timepoints separated by approximately one week. Participants came into the lab for an initial four-hour appointment, during which written consent was obtained and timepoint one data was collected. Actigraphy data was collected over the 5–7 day interim period between study visits. Apart from the actigraphy data, all other data in the present analysis (e.g. self-report, cognitive task, EEG, etc.) were obtained from the first timepoint in order to utilize the largest sample size possible. Additional information on test-retest reliability of the metrics over the one-week period are documented in the Supplementary Materials. The Institutional Review Board at the University of Texas at Austin approved all study procedures.

Participants were eligible if they met the following inclusion criteria: 1) 18 to 35 years of age; 2) of European descent; 3) able to speak and read proficiently in English, and 4) either normal or corrected to normal vision. The exclusion criteria were: 1) current use of steroidal or psychotropic medications; 2) serious medical conditions; 3) heavy tobacco use defined as 20 cigarettes per day or greater than 20 pack years (Kamholz, 2004; World Health Organization, 2011); 4) score of two or higher on the drug subscale of the Psychiatric Diagnostic Screening Questionnaire (Zimmerman & Mattia, 2001); 5) a score of two or higher on the alcohol subscale of the Psychiatric Diagnostic Screening Questionnaire; 6) a score of one or higher on the psychosis subscale of Psychiatric Diagnostic Screening Questionnaire; or 7) being in imminent danger to others or self, or any recent suicidal behavior (suicidal ideation at level 4 on the Columbia-Suicide Severity Rating Scale in the past two months, or any suicidal behavior in the past two months).

Given that several of the predictors in the parent project were cognitive in nature, the sample was restricted to those between the ages of 18–35 years in order to reduce the impact of cognitive aging on those assessments. Further, anyone with heavy tobacco, drug, or alcohol usage were also excluded, given the effect substances can have on cognitive functioning (Hall et al., 2015).

Participants were of European-ancestry and mostly non-Hispanic, mostly female, in their mid 20s, and never-married. The majority of the sample had experienced a past episode of depression (60.9%). Full participant demographics are reported in Table 1.

Table 1:

Demographics and Clinical Characteristics of the Sample

Characteristic Group

MDE (n= 60) PC (n= 54) NP (n= 101) Total (n = 217)

Age in years, mean (SD) 23.6 (4.4) 24.9 (5.6) 22.3 (4.5) 23.3 (4.9)
Female gender (%) 41 (71.7%) 30 (55.6%) 69 (68.3%) 142 (66.0%)
Hispanic ethnicity (%) 17 (28.3%) 13 (24.1%) 36 (35.6%) 66 (30.7%)
Caucasian (%) 58 (96.7%) 54 (100%) 101 (100%) 213 (99.1%)
Single (%) 51 (85.0%) 43 (79.6%) 90 (89.1%) 184 (85.6%)
Years in school (SD) 14.5 (2.4) 14.6 (2.2) 14.2 (2.5) 14.4 (2.4)
Private Health Insurance (%) 29 (48.3%) 32 (59.3%) 63 (62.4%) 124 (57.7%)
Household Income (%)
 $0 – $24,999 32 (53.3%) 23 (43.6%) 28 (27.7%) 83 (38.6%)
 $25,000 – $74,999 12 (20.0%) 20 (37.0%) 38 (37.6%) 70 (32.6%)
 $75,000 – $100,000 + 16 (26.7%) 11 (20.4%) 35 (34.7%) 62 (28.8%)
Current MDD 60 (100%) 0 (0%) 0 (0%) 60 (27.9%)*
Past MDD 51 (85.0%) 35 (64.8%) 45 (44.6%) 131 (60.9%)*
Number of Depressive Episodes (median, SD)** 4.0 (10.0) 2.5 (5.4) 0 (7.0) 2 (7.8)
Any Current Anxiety Disorder 30 (50.0%) 25 (46.3%) 0 (0%) 55 (%)*
Any Current Psychiatric Disorder 60 (100%) 54 (100%) 0 (0%) 114 (53.0%)*
BDI-II (SD) 29.7 (10.3) 16.8 (8.6) 11.8 (7.8) 18.0 (11.5)
*

Two individuals did not have diagnostic data and were not used in the categorical analyses, but were included for the other continuous outcomes. Therefore, total percentages for variables using diagnostic data are out of a sample of n = 215

**

This variable was rank-transformed before being included in the main analyses; these numbers reflect the raw data

Anxiety disorders assessed included panic disorder, agoraphobia, social anxiety disorder, and generalized anxiety disorder

Despite our best efforts to ensure the highest level of similarity between groups, the MDE group did experience higher levels of impairment than the psychiatric control (Table 1). However, including impairment as a predictor in the model allows us to control for any observed differences in impairment.

2.2. Measures

2.2.1. Outcomes

Participants were separated into one of three diagnostic groups based on their current diagnoses assessed by version 7.2 of the Mini International Neuropsychiatric Interview for DSM- 5 (MINI; (Sheehan et al., 1998). Participants were grouped into no current psychopathology (NP), current major depressive episode (MDE), or a psychiatric control (PC) group. Consistent with the criteria for psychiatric controls to have the same inclusion and exclusion criteria as the focal group, the PC group included anyone with a current diagnosis of mania, hypomania, generalized anxiety disorder, social anxiety disorder, agoraphobia, substance or alcohol use disorders, psychosis, post-traumatic stress disorder, obsessive-compulsive disorder, but not a current major depressive episode. Individuals in the MDE group could also have comorbid psychiatric conditions. In the MDE group, 50.0% of participants also met for a comorbid anxiety disorder, and 46.3% of participants in the PC group met for an anxiety disorder. Full descriptions of comorbidities can be found in Table 2. Additionally, both the PC and NP groups included participants who had past episodes of depression. We controlled for this by entering a rank-ordered variable denoting the total number of episodes into the models as a predictor. Further detail about the diagnostic groupings can be found in the accompanying Supplementary Materials, ‘Outcome Measures.

Table 2:

Diagnostic Comorbidity by Group

Diagnosis Group

MDE (n = 60) PC (n = 54) NP (n = 101)

Major Depressive Episode (current) 60 (100.0%) 0 (0.0%) 0 (0.0%)
Major Depressive Episode (past) 51 (85.0%) 35 (64.8%) 45 (44.6%)
Mania (current) 0 (0.0%) 0 (0.0%) 0 (0.0%)
Hypomania (current) 0 (0.0%) 0 (0.0%) 0 (0.0%)
Hypomanic symptoms (current) 0 (0.0%) 1 (1.9%) 0 (0.0%)
Mania (past) 4 (7.0%) 4 (7.4%) 0 (0.0%)
Hypomania (past) 1 (1.7%) 0 (0.0%) 1 (1.0%)
Hypomanic symptoms (past) 0 (0.0%) 1 (1.9%) 0 (0.0%)
Panic disorder (current) 10 (17.5%) 8 (17.0%) 0 (0.0%)
Panic Disorder (past) 23 (38.3%) 14 (25.9%) 9 (9.4%)
Agoraphobia (current) 11 (19.3%) 6 (12.8%) 0 (0.0%)
Generalized Anxiety Disorder (current) 18 (23.0%) 9 (16.7%) 0 (0.0%)
Social Anxiety Disorder 17 (29.8%) 11 (20.4%) 0 (0.0%)
Obsessive Compulsive Disorder (current) 9 (15.8%) 7 (13.0%) 0 (0.0%)
Post-Traumatic Stress Disorder 9 (15.8%) 3 (5.6%) 0 (0.0%)
Alcohol Use Disorder (current) 13 (22.8%) 21(38.9%) 0 (0.0%)
Substance Use Disorder 13 (21.7%) 22 (40.7%) 0 (0.0%)
Psychosis (current) 1 (1.7%) 1 (2.1%) 0 (0.0%)
Psychosis (past) 1 (1.7%) 1 (2.1%) 1 (1.0%)
Anorexia Nervosa (current) 0 (0.0%) 1 (2.1%) 0 (0.0%)
Bulimia Nervosa (current) 2 (3.5%) 2 (4.3%) 0 (0.0%)
Binge-Eating Disorder (current) 3 (5.2%) 2 (4.3%) 0 (0.0%)

In our first set of analyses, we ran classification models for MDE vs PC, MDE vs NP, and PC vs NP. Next, we looked at two continuous measures of symptomatology, the Mood and Anxiety Symptom Questionnaire - Short Form (Wardenaar et al., 2010) general distress subscale (MASQ GD), a measure of depression severity, and the Snaith-Hamilton Pleasure Scale (SHAPS; Snaith et al., 1995), a measure of anhedonia. Finally, we ran a model to examine which, if any, factors differentiated those who experienced any lifetime episodes of depression, current or past, from those with no history of depression (dichotomized as yes/no). For more information on the psychometrics of each outcome, see the accompanying Supplementary Materials, ‘Psychometrics and Description of Predictor Variables.’

2.2.2. Predictors

Each model included 137–139 candidate predictors. A variable denoting the number of lifetime episodes of depression was included in 5 of the 6 models in order to control for past history of depression. Predictors were excluded for certain models if they conveyed information that was highly similar to the outcome (for example, the number of lifetime episodes was removed as a predictor from the lifetime history model only, as any number of episodes would by definition indicate whether or not someone had experienced a major depressive episode). The number of observations and predictors included in each model is documented in the Supplementary Materials, Table 2. One of the appealing features of the elastic net is that it can accommodate the inclusion of a large number of predictors without needing to worry about the ratio of predictors to observations. This is because the elastic net penalty, not the total number of predictors, determines the model degrees of freedom (Hastie et al., 2015). We also removed five variables from the original dataset prior to beginning analyses that had near-zero variance and were unlikely to be useful predictors. A comprehensive list and descriptions of all predictors can be found in the Supplementary Materials, in Table 1.

Sociodemographic variables included age, sex, race, ethnicity, education level, years of school, marital status, income level, insurance coverage, and a rank-ordered count of lifetime depressive episodes. For the self-report questionnaires, total score and subscales (where applicable) were used. This included scores from the following measures: Ruminative Response Scale, Brooding Subscale (RRS; Treynor et al., 2003), Perseverative Thinking Questionnaire (PTQ; Ehring et al., 2011), Satisfaction With Life Scale (SWLS; Diener et al., 1985), Emotion Regulation Questionnaire (ERQ; Gross & John, 2003), Behavioral Inhibition Scale/ Behavioral Activation Scale (BIS/BAS; Carver & White, 1994), International Physical Activity Questionnaire (IPAQ; Lee et al., 2011), Pittsburgh Sleep Quality Index (PSQI; Buysse et al., 1989), and the Sheehan Disability Scale (SDS; Leon et al., 1997).

From the biobehavioral assessments, we used variables from digit span (largest number of digits recalled, forward and backward, and response time; Richardson, 2007), an affective pupil dilation task (index of reactivity to emotional stimuli; Steidtmann et al., 2010), resting state frontal alpha asymmetry and midfrontal mean alpha from EEG (Tomarken et al., 1990; Coan & Allen, 2004; Lopez-Duran et al., 2012), the dot probe for affective stimuli (attention bias for positive and dysphoric stimuli; Beevers et al., 2019) where reaction time, gaze, and trial-level bias variables were used (Zvielli et al., 2015), and an affective visual short-term working memory task designed to measure the ability to clear working memory of irrelevant affective material (Gonzalez & Schnyer, 2018). Self-referent negative processing was also assessed with the Self-Referential Encoding Task (SRET; Derry & Kuiper, 1981).

Finally, we also included actigraphy data (rest and activity cycles) and a polygenic risk score. Polygenic risk scores are the sum of the weighted liabilities conferred by multiple single-nucleotide polymorphisms associated with one outcome (Euesden et al., 2015). The parent project of the study calculated multiple polygenic risk scores for different outcomes, each of which was then Bonferroni-corrected. Given our outcomes of interest included classifying the presence and absence of depression categorically and continuously, we used a polygenic risk score calculated for current MDE and current BDI (Beck Depression Inventory-II). More detail regarding the assessments used in the current study can be found in Supplementary Materials, ‘Methods: Task Descriptions and Data Collection.

2.3. Machine Learning Analyses

2.3.1. Learning Algorithms and Tuning Parameters

For classification, we selected the elastic net machine learning algorithm, which utilizes both lasso and ridge regression penalties to prevent overfitting of the model. Elastic net regularization α parameter finds a compromise between the lasso penalty (sum of absolute values of all coefficients) and ridge penalty (sum of squared values of all coefficients) by weighting the proportion of ridge and lasso penalties. In addition to α, there is also an additional parameter λ, the magnitude of the shrinkage penalty to be tuned by cross-validation. The beset package (https://github.com/jashu) developed in-house by co-author (JS) tries over 100 possible values of λ (autogenerated by the package), as well as three values of α: 0.01 (weighting the ridge penalty more heavily and therefore including more variables, 0.99 (weighting the lasso penalty more heavily and including fewer variables), and 0.50 (equal weighting of both penalties). The values of both of these parameters were chosen to be as large as possible while remaining within one standard error of the values that resulted in the minimum cross-validation error. For classification, we compared two groups per model; MDE vs PC, then MDE vs. NP, and NP vs. PC.

2.3.2. Prediction Metrics and Cross-Validation

Cross-validation was used to estimate the predictive R2, or the proportion of variance the model is expected to explain in unseen data (i.e. cases the model was not trained on). 10-fold cross-validation was repeated 10 times using different partitions of the data. In each repetition, 9 of the 10 “folds” or partitions of data are used to train the model and the 10th holdout data is used to test the prediction model. This is repeated 9 additional times where a model is trained on 90% of the data and tested on the remaining 10% of cases. Predictive R2 is calculated based on the average residual error of holdout prediction across the 10 repetitions. Nested cross-validation was also conducted within each partition of training data in order to determine the optimal values of the tuning parameters. The standard error statistic reported for each model is the standard error of the prediction metric between the cross-validation folds, or the average error between the train and test partitions within each repetition. Additionally, the minimum and maximum estimates reported reflect the observed range across the 10 repetitions of cross-validation. For the classification models, the AUC statistic is reported in the supplementary materials in Table 3.

2.3.3. Calculation of Variable Importance Scores

Variable importance was operationalized as the absolute magnitude of the standardized regression coefficient for each predictor (Wei et al., 2015). While variable importance is not a well-defined concept (Grömping, 2009), using standardized regression coefficients is a valid and widely-accepted choice (for more rationale on the choice of standardized regression coefficients, see Supplementary MaterialsVariable Importance Estimates’). The variable importance estimates derived for each model are averaged across the 10 repetitions of cross-validation.

All analyses were implemented in R version 4.0.0 unless otherwise noted. Our code made extensive use of the tidyverse (Wickham, n.d.), particularly the dplyr and tidyr packages for data manipulation. The glmnet package was also used to run the elastic net models (Friedman et al., 2010). We also used the beset package (https://github.com/jashu) to run the models with cross-validation and produce variable importance plots. If the number of predictors retained by the model exceeded ten, we plotted only the top ten predictors.

2.4. Missing Data

Some participants in our study were unable to complete all the assessments. Since the algorithms require there is no missing data, we imputed missing data for the predictor variables using the missForest package (Stekhoven & Bühlmann, 2012). missForest uses all other variables for each observation to predict the value of the missing variable. We used a cutoff of 25% for both variables and observations, so if any variable was missing for 25% or more of participants, it was dropped, and if any participant was missing 25% or more of their data, they were dropped. A detailed description of the treatment of missing data can be found in the supplementary materials section ‘Missing Data.’ Outcome variables were not included in the imputation so as to prevent any contamination between outcome and predictor values, which would optimistically bias prediction. Variables with missing data and percentage of cases missing can be found in the Supplementary Materials in Table 4. All of the code (and the associated output) used to generate the results reported in this article can be found in a PDF report titled, “Prediction-of-Depression-Main-and-Supplemental-Analyses.pdf.” All of the datasets generated and/or analyzed during the current study, as well as the code used to analyze them, are available in the Texas Data Repository, https://doi.org/10.18738/T8/DEFPNZ.

3. Results

3.1. Diagnostic Grouping

Using 10-fold cross-validation repeated 10 times with all predictors, out-of-sample predicted R2 = .206 when classifying MDE versus PC groups (SE= 0.058, Min= 0.199, Max= 0.216). The variable importance plot is shown in Figure 1a. Predictors that increased the likelihood of MDE membership were: greater anhedonia (SHAPS total score) and more work and school-related disability (SDS) and global disability (SDS global score). The number of positive words endorsed (SRET) decreased the likelihood the participants belonged to the MDE group.

Figure 1:

Figure 1:

Variable Importance Scores for MDE vs PC (1a) and MDE vs NP (1b)

Note: No variable importance plots are generated for the NP vs PC model, as the model added uncertainty to prediction instead of reducing it.

For classifying MDE and NP, out-of-sample predicted R2 = .332 (SE= 0.057, Min= 0.287, Max= 0.367). The variable importance plot is shown in Figure 1b. Among the most important predictors positively associated with MDE status were: poor self-reported sleep quality, greater perceived dysfunction in daily activities as a result of sleep disturbance, and more global functional impairment related to sleep (PSQI composite 1, composite 7, and global scores); greater work and school-related disability and global disability (SDS work/school and global); higher levels of anhedonia (SHAPS); the number of negative words endorsed (SRET), and the number of negative, self-referential words recalled (SRET), and greater brooding (RRS brooding subscale). Additionally, the inter-trial variability of the starting point of the SRET and the rank-order-transformed number of episodes variable were associated with increased likelihood of MDE membership (e.g. higher values on the variables indicated increase likelihood of belonging to the MDE group).

For PC and NP groups, out-of-sample predictive R2 was −0.006 (SE = 0.015, Min = −0.022, Max = 0.007), indicating the model added uncertainty to the prediction rather than reducing it.

3.2. Anhedonia

Out-of-sample predictive R2 = .257 (SE= 0.037, Min = 0.231, Max= 0.272) for total anhedonia score in the full sample. As seen in Figure 2a, the variable importance score for the number of positive words endorsed as self-descriptive on the SRET was one of the most important predictors of anhedonia (fewer positive words endorsed as self-descriptive was associated with higher levels of anhedonia), as was the rank ordered number of depressive episodes (more episodes of depression predicted higher levels of anhedonia). Other variables with inverse relationships to anhedonia were greater satisfaction with life (SWLS total); more motivation to pursue rewards (BIS/BAS reward responsiveness subscale), and total score for the behavioral approach system (BAS total). Additional important predictors of anhedonia were: greater perceived dysfunction in daytime activities as a result of sleep disturbance, higher global functional impairment related to sleep, and more sleep disturbances (PSQI composite score 7, global, and disturbances scores); a greater tendency to engage in brooding (RRS brooding subscale); greater global impairment (SDS global), and a greater number of negative, self-referential words recalled (SRET).

Figure 2:

Figure 2:

Variable Importance Scores for the Snaith-Hamilton Pleasure Scale (2a) and the Mood and Anxiety Sensitivity Questionnaire, General Distress Subscale (2b)

3.3. MASQ general distress

For depression severity, out-of-sample predictive R2 = .652 (SE= 0.035, Min= 0.633, Max= 0.660) for the MASQ general distress subscale in the full sample. The variable importance plot is shown in Figure 2b. The most important predictors of greater depression severity were anhedonia (SHAPS); brooding (RRS brooding subscale); number of negative words endorsed (SRET); global disability and impairment (SDS); unproductiveness of negative repetitive thought and difficulty disengaging from negative repetitive thought subscales, and total score on the perseverative thinking questionnaire (PTQ); and. Higher levels of satisfaction with life (SWLS total); greater tendency to use reappraisal strategies (ERQ reappraisal score), and greater number of positive words endorsed (SRET) were associated with lower depression severity.

3.4. Lifetime history of depression

Out-of sample predictive R2 = .129 (SE= 0.033, Min= 0.109, Max= 0.150) for classification of lifetime episode of depression (yes/no). The strongest predictors of the likelihood of having had a depressive episode was anhedonia (SHAPS). Other important predictors included a greater number of negative words endorsed (SRET), and greater global functional impairment related to sleep (PSQI global score); endorsement of suicidal ideation; greater difficulty disengaging from negative repetitive thinking and total rumination score (PTQ; Figure 3).. Greater satisfaction with life (SWLS) was associated with an increased likelihood of never having had an episode of depression.

Figure 3:

Figure 3:

Variable Importance Plot for Lifetime History of Depression (3)

Sample Size Justification

Learning curve analyses were performed to confirm the sample sizes were adequate for the elastic net models and can be found in the Supplementary Materials.

4. Discussion

The present study used a large multi-dimensional dataset to determine predictors of depression status, both categorically and dimensionally, as well as predictors of lifetime risk for an MDE. Additionally, this study is among the first to include a psychiatric control while trying to identify the best, specific correlates of depression using a data-driven approach. Potential predictor variables included demographics, biobehavioral assessments (e.g. pupil dilation, eye-tracking), cognitive tasks (e.g. SRET), resting state EEG, a polygenic risk score for the broad depression phenotype derived from the UK Biobank, actigraphy, and self-report questionnaire data. When classifying the groups, the cross-validated deviance explained was 20.6% for MDE vs PC and 33.2% for MDE vs NP, and −0.006% for NP vs PC. When examining depression continuously, cross-validated models predicted 25.7% of the variance in anhedonia and 65.2% of general distress. Finally, the expected predictive deviance for lifetime history of depression was 12.9%. The cross-validated predicted R2 indicates the variance (or deviance, for classification models) the model explained in “unseen” data that the model was not trained on, giving a good estimate of out-of-sample generalizability.

Several candidate predictors were identified as important across the categorical and continuous prediction models. Measurements of negative self-referent processing via the SRET, rumination, self-reported sleep quality, functional impairment, and satisfaction with life emerged as important predictors across analyses. Self-referential schemas have long been studied as important correlates of depression (LeMoult et al., 2017) and interventions targeting self-referential schemas are associated with a decrease in depression symptoms (Dainer-Best, Shumake, et al., 2018). Emerging evidence suggests targeting rumination may enhance cognitive-based therapies (Watkins, 2015). Interestingly, self-reported measures of sleep quality and disturbance were considered to be important predictors. The actigraphy variables measuring daily activity and circadian rhythms made marginal to no contribution to the models.

The use of a psychiatric control in comparison to the MDE group revealed important insights as well. Two of the best differentiators of people with depressive episodes from individuals with other psychiatric diagnoses were the number of positive words endorsed on the SRET and total anhedonia score. This is in line with mounting evidence of the importance of the role of low positive emotionality in depression (Khazanov & Ruscio, 2016; Vazquez, 2017). The importance of the number of positive words endorsed as self-descriptive in differentiating depressed individuals from psychiatric controls is also noteworthy. Previous research has highlighted the number of negative words endorsed as a strong, reliable predictor of depression (Dainer-Best, Lee, et al., 2018), which is consistent with our prediction models of MDE vs NP, general distress, and lifetime depression, where the number of negative words endorsed was a strong predictor. However, the lack of positive words endorsed highlights a bias that may be specific to MDE, perhaps being driven by low positive affect.

In fact, one theme that emerged across analyses was the importance of low positive emotionality and biased information processing away from positive stimuli in predicting depression. Typically, individuals show a bias towards positive information and away from negative information, but in depression, this pattern is reversed such that depressed individuals take in more negative information and filter out positive information(Peckham et al., 2010); (Duque & Vázquez, 2015); (Disner et al., 2011). The number of positive words endorsed on the SRET was one of the most important variables across the models predicting MDE vs PC, anhedonia, and general distress.

It is worthwhile to consider which predictors arose as valuable markers of depression, and which did not. Kendler set forth five criteria by which to gauge whether a variable meets the criteria of a marker for a psychiatric disorder: 1) strength of association; 2) specificity of association; 3) non-contingent on the presence of other factors; 4) causal proximity; and 5) an appropriately plausible level of explanation(Kendler, 2005)1. Particularly when considering the MDE vs PC model because of the specificity it afforded, number of positive words endorsed on the SRET, impairment and disability, and anhedonia all arose as strong predictors of MDE. These variables repeatedly were selected as top predictors in the other models as well, along with number of negative words endorsed (SRET), rumination, perceived sleep impairment, and satisfaction with life. Interactions among variables were not examined, consistent with the idea that these predictors are not contingent on the presence of other factors. Indeed, given the sample size, it is very unlikely that reliable interactions among predictors would have been identified in cross-validated models. This is partly why we selected elastic net regression rather than other more complex machine learning models that search for interactions among predictor variables.

Notably, all of the top markers were psychosocial in nature. Consistent with Dinga et al’s (2018) findings, the biological variables (e.g., frontal alpha asymmetry, polygenic risk score for the broad depression phenotype in the UK biobank, pupil dilation) added very little classification value to the model. As a direct test, we computed additional models without the biological variables, documented in the Supplementary Materials, Table 5, and found that the models with and without the biological variables were not significantly different in predicted R2. Further, none of them correlated very strongly with the outcome measures when bivariate correlations were examined.

There are several limitations to the present study. First, all our participants were of European ancestry because of the genetic component of the parent study. Future research would need to evaluate whether the predictive models generalize to populations of non-European ancestry. Second, this sample was restricted to ages 18–35 years old to reduce any effects of cognitive aging on the heterogeneity of the sample, so we cannot speculate whether these findings extend to a wider range of ages. In our NP group, participants did not meet for any current psychopathology but were allowed to have lifetime diagnoses of various mental illnesses. Additionally, individuals in both the NP and PC groups were included even if they had experienced past episodes of depression, and we controlled for lifetime history of depression. However, we may have suppressed the predictive utility of dispositional factors because lifetime history of depression was present across groups. Considering the younger age of our sample and the range of median age of onset for depression can span from 25 to 45 (Kessler et al., 2007), it is likely that individuals in the other groups may have predisposing factors to depression but have not yet experienced their first episode. An additional limitation might be the criteria of the main study that participants could not be taking psychotropic or steroidal medications, which may limit generalizability. Finally, even in the best model (general distress), approximately 34% of the variance remained unexplained, meaning there are very likely other unmeasured variables that could also further contribute to prediction.

While these classification models reveal important correlates of depression, future research should examine the classification value of these predictions. Longitudinal studies should look at outcomes including the future onset of depression and relapse. Finally, while we took a supervised approach to identify important predictors of depression, future research should apply unsupervised approaches to multidimensional datasets to explore possible subtypes of depression. With these caveats in mind, the current study suggests that multiple mechanisms measured across units of analyses are associated with depression, with psychosocial functioning, negative self-referent processing, and perceived sleep quality among the most important predictors. Further, dysfunction with regard to positive affect (e.g., anhedonia, low endorsement of positive adjectives as self-referent) may be specifically impaired in depression and warrants additional investigation.

Supplementary Material

1
2

Highlights:

  • Psychiatric control groups can help isolate factors specific to the diagnosis of interest

  • Low positive emotionality is a strong differentiator of depression

  • Additionally, perceived sleep quality and impairment are also important predictors

Acknowledgment:

Funding for this study was provided by National Institute of Health (awards R56MH108650, R21MH110758, R33MH109600, R01DA042742). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Footnotes

1For an interesting discussion of the issues raised here, see: Fried, Eiko. (2019, August 22). “All mental disorders are brain disorders … not.” [Blog Post] Retrieved from: https://eiko-fried.com/all-mental-disorders-are-brain-disorders-not/

Data Availability:

The datasets generated and/or analyzed during the current study are available in the Texas Data Repository, https://doi.org/10.18738/T8/DEFPNZ.

Conflicts of Interest:

None of the authors declare any conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Beevers CG, Mullarkey MC, Dainer-Best J, Stewart RA, Labrada J, Allen JJB, McGeary JE, & Shumake J. (2019). Association between negative cognitive bias and depression: A symptom-level approach. Journal of Abnormal Psychology, 128(3), 212–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Buysse DJ, Reynolds CF 3rd, Monk TH, Berman SR, & Kupfer DJ (1989). The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research. Psychiatry Research, 28(2), 193–213. [DOI] [PubMed] [Google Scholar]
  3. Carver CS, & White TL (1994). Behavioral Inhibition, Behavioral Activation, and Affective Responses to Impending Reward and Punishment: The BIS/BAS Scales. Subjectpool.com. http://www.subjectpool.com/ed_teach/y4person/2_goals/Carver_BISBAS_JPSP-67_319-333.pdf
  4. Coan JA, & Allen JJB (2004). Frontal EEG asymmetry as a moderator and mediator of emotion. Biological Psychology, 67(1–2), 7–49. [DOI] [PubMed] [Google Scholar]
  5. Dainer-Best J, Lee HY, Shumake JD, Yeager DS, & Beevers CG (2018). Determining optimal parameters of the self-referent encoding task: A large-scale examination of self-referent cognition and depression. Psychological Assessment, 30(11), 1527–1540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Dainer-Best J, Shumake JD, & Beevers CG (2018). Positive imagery training increases positive self-referent cognition in depression. Behaviour Research and Therapy, 111, 72–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Derry PA, & Kuiper NA (1981). Schematic processing and self-reference in clinical depression. Journal of Abnormal Psychology, 90(4), 286–297. [DOI] [PubMed] [Google Scholar]
  8. Diener E, Emmons RA, Larsen RJ, & Griffin S. (1985). The Satisfaction With Life Scale. Journal of Personality Assessment, 49(1), 71–75. [DOI] [PubMed] [Google Scholar]
  9. Dinga R, Marquand AF, Veltman DJ, Beekman ATF, Schoevers RA, van Hemert AM, Penninx BWJH, & Schmaal L. (2018). Predicting the naturalistic course of depression from a wide range of clinical, psychological, and biological data: a machine learning approach. Translational Psychiatry, 8(1), 241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Disner SG, Beevers CG, Haigh EAP, & Beck AT (2011). Neural mechanisms of the cognitive model of depression. Nature Reviews. Neuroscience, 12(8), 467–477. [DOI] [PubMed] [Google Scholar]
  11. Dobson KS, Hollon SD, Dimidjian S, Schmaling KB, Kohlenberg RJ, Gallop RJ, Rizvi SL, Gollan JK, Dunner DL, & Jacobson NS (2008). Randomized trial of behavioral activation, cognitive therapy, and antidepressant medication in the prevention of relapse and recurrence in major depression. Journal of Consulting and Clinical Psychology, 76(3), 468–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Duque A, & Vázquez C. (2015). Double attention bias for positive and negative emotional faces in clinical depression: evidence from an eye-tracking study. Journal of Behavior Therapy and Experimental Psychiatry, 46, 107–114. [DOI] [PubMed] [Google Scholar]
  13. Ehring T, Zetsche U, Weidacker K, Wahl K, Schönfeld S, & Ehlers A. (2011). The Perseverative Thinking Questionnaire (PTQ): validation of a content-independent measure of repetitive negative thinking. Journal of Behavior Therapy and Experimental Psychiatry, 42(2), 225–232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Euesden J, Lewis CM, & O’Reilly PF (2015). PRSice: Polygenic Risk Score software. Bioinformatics, 31(9), 1466–1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fried EI, & Kievit RA (2016). The volumes of subcortical regions in depressed and healthy individuals are strikingly similar: a reinterpretation of the results by Schmaal et al. [Review of The volumes of subcortical regions in depressed and healthy individuals are strikingly similar: a reinterpretation of the results by Schmaal et al]. Molecular Psychiatry, 21(6), 724–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fried EI, & Robinaugh DJ (2020). Systems all the way down: embracing complexity in mental health research. BMC Medicine, 18(1), 205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Friedman J, Hastie T, & Tibshirani R. (2010). Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software, 33(1), 1–22. [PMC free article] [PubMed] [Google Scholar]
  18. Fritzsche A, Dahme B, Gotlib IH, Joormann J, Magnussen H, Watz H, Nutzinger DO, & von Leupoldt A. (2010). Specificity of cognitive biases in patients with current depression and remitted depression and in patients with asthma. Psychological Medicine, 40(5), 815–826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gibb BE, Alloy LB, Abramson LY, Beevers CG, & Miller IW (2004). Cognitive vulnerability to depression: a taxometric analysis. Journal of Abnormal Psychology, 113(1), 81–89. [DOI] [PubMed] [Google Scholar]
  20. Gonzalez GDS, & Schnyer DM (2018). Attention and Working Memory Biases to Black and Asian Faces During Intergroup Contexts. Frontiers in Psychology, 9, 2743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Grömping U. (2009). Variable Importance Assessment in Regression: Linear Regression versus Random Forest. The American Statistician, 63(4), 308–319. [Google Scholar]
  22. Gross JJ, & John OP (2003). Individual differences in two emotion regulation processes: implications for affect, relationships, and well-being. Journal of Personality and Social Psychology, 85(2), 348–362. [DOI] [PubMed] [Google Scholar]
  23. Hastie T, Tibshirani R, & Wainwright M. (2015). Statistical learning with sparsity: the lasso and generalizations. Chapman and Hall/CRC. [Google Scholar]
  24. Howard DM, Adams MJ, Shirali M, Clarke T-K, Marioni RE, Davies G, Coleman JRI, Alloza C, Shen X, Barbu MC, Wigmore EM, Gibson J, 23andMe Research Team, Hagenaars SP, Lewis CM, Ward J, Smith DJ, Sullivan PF, Haley CS, … McIntosh AM (2018). Genome-wide association study of depression phenotypes in UK Biobank identifies variants in excitatory synaptic pathways. Nature Communications, 9(1), 1470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kendler KS (2005). “A Gene for…”: The Nature of Gene Action in Psychiatric Disorders. American Journal of Psychiatry, 162(7), 1243–1252. [DOI] [PubMed] [Google Scholar]
  26. Kendler KS (2014). The structure of psychiatric science. The American Journal of Psychiatry, 171(9), 931–938. [DOI] [PubMed] [Google Scholar]
  27. Kendler KS (2019). From Many to One to Many-the Search for Causes of Psychiatric Illness. JAMA Psychiatry. 10.1001/jamapsychiatry.2019.1200 [DOI] [PubMed] [Google Scholar]
  28. Kessler RC, Amminger GP, Aguilar-Gaxiola S, Alonso J, Lee S, & Ustün TB (2007). Age of onset of mental disorders: a review of recent literature. Current Opinion in Psychiatry, 20(4), 359–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Khazanov GK, & Ruscio AM (2016). Is low positive emotionality a specific risk factor for depression? A meta-analysis of longitudinal studies. Psychological Bulletin, 142(9), 991–1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lee PH, Macfarlane DJ, Lam TH, & Stewart SM (2011). Validity of the International Physical Activity Questionnaire Short Form (IPAQ-SF): a systematic review. The International Journal of Behavioral Nutrition and Physical Activity, 8, 115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. LeMoult J, Kircanski K, Prasad G, & Gotlib IH (2017). Negative Self-Referential Processing Predicts the Recurrence of Major Depressive Episodes. Clinical Psychological Science, 5(1), 174–181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Leon AC, Olfson M, Portera L, Farber L, & Sheehan DV (1997). Assessing psychiatric impairment in primary care with the Sheehan Disability Scale. International Journal of Psychiatry in Medicine, 27(2), 93–105. [DOI] [PubMed] [Google Scholar]
  33. Lopez-Duran NL, Nusslock R, George C, & Kovacs M. (2012). Frontal EEG asymmetry moderates the effects of stressful life events on internalizing symptoms in children at familial risk for depression. Psychophysiology, 49(4), 510–521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Ormel J, Kessler RC, & Schoevers R. (2019). Depression: more treatment but no drop in prevalence: how effective is treatment? And can we do better? Current Opinion in Psychiatry. 10.1097/YCO.0000000000000505 [DOI] [PubMed] [Google Scholar]
  35. Paykel ES (2007). Cognitive therapy in relapse prevention in depression. The International Journal of Neuropsychopharmacology / Official Scientific Journal of the Collegium Internationale Neuropsychopharmacologicum, 10(1), 131–136. [DOI] [PubMed] [Google Scholar]
  36. Peckham AD, McHugh RK, & Otto MW (2010). A meta-analysis of the magnitude of biased attention in depression. Depression and Anxiety, 27(12), 1135–1142. [DOI] [PubMed] [Google Scholar]
  37. Richardson JTE (2007). Measures of short-term memory: a historical review. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior, 43(5), 635–650. [DOI] [PubMed] [Google Scholar]
  38. Schwartz S, & Susser E. (2011). The use of well controls: an unhealthy practice in psychiatric research. Psychological Medicine, 41(6), 1127–1131. [DOI] [PubMed] [Google Scholar]
  39. Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, Hergueta T, Baker R, & Dunbar GC (1998). The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. The Journal of Clinical Psychiatry, 59 Suppl 20, 22–33;quiz 34–57. [PubMed] [Google Scholar]
  40. Snaith RP, Hamilton M, Morley S, Humayan A, Hargreaves D, & Trigwell P. (1995). A scale for the assessment of hedonic tone the Snaith-Hamilton Pleasure Scale. The British Journal of Psychiatry: The Journal of Mental Science, 167(1), 99–103. [DOI] [PubMed] [Google Scholar]
  41. Steidtmann D, Ingram RE, & Siegle GJ (2010). Pupil response to negative emotional information in individuals at risk for depression. Cognition and Emotion, 24(3), 480–496. [Google Scholar]
  42. Stekhoven DJ, & Bühlmann P. (2012). MissForest--non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118. [DOI] [PubMed] [Google Scholar]
  43. Tomarken AJ, Davidson RJ, & Henriques JB (1990). Resting frontal brain asymmetry predicts affective responses to films. Journal of Personality and Social Psychology, 59(4), 791–801. [DOI] [PubMed] [Google Scholar]
  44. Treynor W, Gonzalez R, & Nolen-Hoeksema S. (2003). Rumination Reconsidered: A Psychometric Analysis. Cognitive Therapy and Research, 27(3), 247–259. [Google Scholar]
  45. Vazquez C. (2017). What does Positivity Add to Psychopathology? An Introduction to the Special Issue on “Positive Emotions and Cognitions in Clinical Psychology.” Cognitive Therapy and Research, 41(3), 325–334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Vittengl JR, Clark LA, Dunn TW, & Jarrett RB (2007). Reducing relapse and recurrence in unipolar depression: a comparative meta-analysis of cognitive-behavioral therapy’s effects. Journal of Consulting and Clinical Psychology, 75(3), 475–488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wardenaar KJ, van Veen T, Giltay EJ, de Beurs E, Penninx BWJH, & Zitman FG (2010). Development and validation of a 30-item short adaptation of the Mood and Anxiety Symptoms Questionnaire (MASQ). Psychiatry Research, 179(1), 101–106. [DOI] [PubMed] [Google Scholar]
  48. Watkins E. (2015). Psychological treatment of depressive rumination. Current Opinion in Psychology, 4, 32–36. [Google Scholar]
  49. Wei P, Lu Z, & Song J. (2015). Variable importance analysis: A comprehensive review. Reliability Engineering & System Safety, 142, 399–432. [Google Scholar]
  50. Wickham H. (n.d.). Easily Install and Load the “Tidyverse” [R package tidyverse version 1.2.1]. Retrieved April 8, 2019, from https://CRAN.R-project.org/package=tidyverse
  51. World Health Organization, & Others. (2017). Depression and other common mental disorders: global health estimates. World Health Organization. http://www.who.int/iris/handle/10665/254610 [Google Scholar]
  52. Yarkoni T, & Westfall J. (2017). Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning. Perspectives on Psychological Science: A Journal of the Association for Psychological Science, 12(6), 1100–1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zvielli A, Bernstein A, & Koster EHW (2015). Temporal Dynamics of Attentional Bias. Clinical Psychological Science, 3(5), 772–788. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2

RESOURCES