Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Feb 1.
Published in final edited form as: J Consult Clin Psychol. 2018 Dec 20;87(2):184–197. doi: 10.1037/ccp0000364

A Multi-Method Screening Approach for Pediatric Depression Onset: An Incremental Validity Study

Joseph R Cohen 1, Hena Thakur 1, Katie L Burkhouse 2, Brandon E Gibb 3
PMCID: PMC6336512  NIHMSID: NIHMS994495  PMID: 30570310

Abstract

Objective:

Screening protocols that rely on a single informant are inadequate in predicting pediatric depression. Multi-informant and risk factor screening approaches are potentially more sensitive methods for identifying depression risk, but the incremental validity of these protocols has not been adequately tested. Using a translational analytic approach and multi-method, longitudinal study design, we simultaneously tested several multi-indicator approaches to depression screening in order to identify an optimal algorithm for predicting depression onset in youth.

Method:

222 never depressed children and adolescents (Mage =10.75 years old, SDage =1.85; Female=50.45%; 82.88% White) completed baseline questionnaires for depressive symptoms and cognitive vulnerabilities, in addition to a morphed face task to assess pupil dilation. Mothers, meanwhile, completed baseline questionnaires and a semi-structured interview to assess maternal and pediatric depression. Follow-up depression diagnostic assessments with both the mother and youth occurred every 6 months for 2 years. Receiver operating characteristics and reclassification analyses were used to test our aims.

Results:

Overall, we found moderate support for a multi-informant approach, and convincing evidence that individual differences in pupil dilation uniquely predicted depression onset. Youth with subthreshold depressive symptoms and elevated pupil dilation were over twice as likely to develop a first lifetime episode of depression compared to one’s risk rate based on sex and age.

Conclusions:

Our study provides one of the first screening batteries for detecting first lifetime episodes of depression in youth. The unique and incremental validity provided by pupil dilation suggests feasible biological indicators of depression risk can improve primary prevention efforts that target depression, such as universal pediatric depression screening.

Public Health Significance:

A multi-method assessment approach, paired with a developmental psychopathology perspective, can strengthen pediatric depression screening initiatives. Compared to existing and recommended screening protocols, we found the use of a brief multi-informant diagnostic interview and pupil dilation assessment dramatically improved our ability to identify those youth at-risk for depression prior to an initial episode.

Keywords: Pediatric Depression, Multi-Method Screening, Developmental Psychopathology, Pupil Dilation, Incremental Validity


Depression remains a significant pediatric public health concern within the United States. Approximately 3.1 million adolescents (12.8%) experience a depressive episode annually with over 70% of these episodes causing severe functional impairment (Center for Behavioral Health Statistics and Quality, 2016). In addition, pre-adolescent samples are experiencing depressive symptoms at an increasing rate (Luntamo, Sourander, Santalahti, Aromaa, & Helenius, 2012). Given these alarming trends and the significant burden linked with depression (Avenevoli, Swendsen, He, Burstein, & Merikangas, 2015; Garber & Rao 2014), it is important to evaluate protocols aimed at identifying children and adolescents at risk for the onset of depression.

Universal mental health screening protocols can play a role in reducing depression rates by identifying vulnerable youth in pediatric or school settings. Depression, along with substance use, is the only psychiatric condition that the United States Preventative Services Task Force (USPSTF) recommends routine screening for in youth. Yet, there is underwhelming evidence that these screening initiatives are effective (Wissow et al., 2013). To improve upon depression screening protocols, the USPSTF recently put forth two research agendas (Siu et al., 2016). First, the USPSTF found that, especially among children younger than 12, current tools have low and variable positive predictive values (i.e., the ability to identify positive cases correctly). Second, the USPSTF emphasized the importance of assessing risk factors to prevent the onset of depression in youth. By addressing these research limitations, the USPSTF suggests that universal screening initiatives for depression can reach their considerable public health potential.

Multi-informant protocols, in which parent-youth dyads complete pediatric depression screens, is the gold standard for depression assessments (Klein, Dougherty, & Olino, 2005) and represent an improvement over traditional single-informant approaches. Only recently have theoretical, methodological, and analytical advancements paved the way for integration of these multiple (and often discrepant) sources of data into clinical decision making processes (Martel, Markon, & Smith, 2017). Multi-informant screening is now used in applied settings and is more sensitively able to identify concurrent depression than single-informant approaches (e.g., Johnson, Hollis, Marlow, Simms, & Wolke, 2014). To date, however, there is little systematic research examining the incremental validity of a multi-informant approach among children and adolescents (Johnston & Murray, 2003), especially for prospective outcomes such as the onset of depression. As multi-informant methods largely focus on current depressive symptoms, they may be limited in predicting the emergence of symptoms.

Another possible method for improving current screening algorithms for depression onset is through the inclusion of known risk factors. The developmental psychopathology literature is rich with examples of vulnerabilities that can be targeted by screening protocols (Garber, Korelitz, & Samanez-Larkin, 2012; Rice & Rawal, 2011). Yet, it is unclear which risk factors should be prioritized in translational efforts. Importantly, investigators differentiate between vulnerabilities for first and recurrent depressive episodes (Monroe & Harkness, 2011), suggesting specific risk factors may be useful for detecting episode onset. Focusing on these risk factors may provide the best opportunity for targeting risk before a potentially chronic and severe depression course emerges. One of the more well-documented risk factors for depression onset is exposure to maternal depression. Across a variety of adolescent populations (e.g., Murray et al., 2011; Pearson et al., 2013), maternal depression uniquely forecasted depression onset, specifically early onset (i.e., prior to age 15; Hammen, Brennan, & Keenan-Miller, 2008). These findings have led some to call for a greater focus on maternal depression in screening initiatives across the developmental spectrum (Halligan, Murray, Martins, & Cooper, 2007). However, despite maternal depression’s significant effect on early adolescent depression onset (Hammen & Brennan, 2003), most screening protocols using maternal depression inventories focus on young children (e.g., Earls, 2010). It is therefore unclear whether screening for maternal depression during late childhood and early adolescence is an incrementally valid approach compared to other strategies for operationalizing depression risk.

An alternative risk factor screening approach targets cognitive vulnerabilities for depression in youth. Cognitive vulnerability for depression is conceptualized as a linking mechanism between genetic risk and adolescent depression (Gibb, Beevers, & McGeary, 2013). Therefore, cognitive vulnerabilities may confer a stronger signal in screening protocols compared to maternal depression due to its developmental proximity to depression onset. To date, translational efforts have predominately used self-reported cognitive vulnerabilities. For instance, self-report forms of depressogenic inferential styles (e.g., Garber et al., 2012) and rumination (e.g., Young & Dietrich, 2015) have been included in depression screening protocols. However, self-report is just one unit of analysis that can be used to assess cognitive vulnerability (LeMoult, Yoon, & Joormann, 2016; Nejad, Fossati, & Lemogne, 2012). Overall, there is increasing awareness of the ability of biological methods to confer unique insight into prospective risk at the screening stage (Bylsma, Mauss, & Rottenberg, 2016), particularly efficient, cost-effective, psychophysiological methods (De Los Reyes & Aldao, 2015). Utilizing psychophysiological indicators is consistent with recommendations to use multi-method mental health assessments (Hunsley & Mash, 2007), and the National Institute of Mental Health Research Domain Criteria’s (RDoC) emphasis on objective units of analysis in clinical protocols (Insel et. al, 2010). Simultaneously using psychophysiological and self-reported measures can clarify whether using one, or both, methods of assessment provide the best approach for predicting depression onset. As youth may not become aware of certain depressive cognitive styles until after a depressive episode (Sheppard & Teasdale, 2004), objective assessments of cognitive vulnerability may be especially useful in identifying risk for depression onset.

A psychophysiological marker of cognitive vulnerability previously recommended for translational protocols is pupil dilation (Burkhouse, Siegle, Woody, Kudinova, & Gibb, 2015; Silk et al., 2009). Greater pupil dilation in response to emotionally salient stimuli is a marker of cognitive-affective processing (for reviews, see Bradley, Miccoli, Escrig, & Lang, 2008; Laeng, Sirois, & Gredeback, 2012) and self-reported cognitive vulnerabilities (e.g., rumination; Siegle, Steinhauer, Carter, Ramel, & Thase, 2003). Recently, Burkhouse and colleagues (2015) found that elevated pupil dilation to sad (but not happy/angry) faces predicted depressive episodes in offspring of depressed mothers, demonstrating its external validity as a predictor of prospective depression status in at-risk youth. These findings are similar to other studies that demonstrate an association between depression symptoms/status and adolescent pupil dilation (e.g., Price et al., 2016; Silk et al., 2007). We sought to extend these collective findings by examining if peak pupil dilation to sad faces a) predicted depression onset in an unselected youth sample and b) provided incremental validity over existing (e.g., multi-informant and maternal depression) and more affordable (e.g., self-reported risk factors) methods for depression screening.

The Present Study

The present study examined the incremental validity of multi-informant (youth and parent report) and risk factor (maternal depression, self-reported cognitive vulnerability, pupil dilation) screening approaches for pediatric depression onset. For the present study, depression onset was defined as having a first lifetime major or minor (referred to as Other Specified Depressive Disorder, with the specifier Depressive Episode with Insufficient Symptoms in DSM-V) depressive episode. Our decision to focus on both minor and major depression was informed by the literature. Past research shows that minor and major depressive episodes both lead to functional impairment in youth (Lewinsohn, Rohde, Seeley, Klein, & Gotlib, 2000) and including both forms of depression in our criterion reflects current dimensional models for depression status (Balazs et al., 2013). Focusing of both types of depressive episodes is also consistent with the purpose of screening, which is to identify current depressive impairment and calibrate future depression risk (Siu, 2016). Ultimately, while discriminating between mood diagnoses is critical for comprehensive treatment planning, it is a complex process best reserved for the assessment setting (De Los Reyes et al., 2015). Thus, using a dimensional depression criterion at the screening stage can cast a wider net for those at risk for depressive impairment.

To develop our screening algorithms, we used an empirically-based medicine (EBM) approach (Youngstrom et al., 2017) via receiver operating characteristics (ROC). ROC approaches are ideally suited for translating basic research into screening protocols by generating empirically-informed cutoffs that can facilitate clinical decision making (Youngstrom, 2014; Youngstrom et al., 2017). Pairing data-driven cutoffs with diagnostic likelihood ratios (DLRs; Straus, Richardson, Glasziou, & Haynes, 2011) allows protocols to estimate prospective risk and tailor their screening response based on the objectives and resources of the screening initiative. For the present study, we utilize a “stoplight model” (Youngstrom, Choukas-Bradley, Calhoun & Jensen-Doss, 2015) to illustrate how to make feasible, EBM-informed decisions across multiple, dimensional predictors from the clinical setting.

Although ROC approaches are “best practice” for validating pediatric mental health index tests (Youngstrom et al., 2017), biostatisticians have warned that ROC approaches may be overly conservative with regard to incremental validity (Wang et al., 2006). Thus, we used reclassification analyses as a complementary approach to assess incremental validity (Pencina, D’Agostino, & Steyerberg, 2011). Across medicine, reclassification analyses are used to quantify the costs and benefits of including new prognostic markers in screening protocols (Leening, Vedder, Witteman, Pencina, & Steyerberg, 2014); however, it has sparingly been used in psychosocial research. Pairing ROC with reclassification analyses provides a unique opportunity to optimize a screening protocol for depression onset, and introduces an analytic framework for evaluating multi-indicator screening protocols for other pediatric mental health outcomes.

Method

Participants

Data from 222 children-mother dyads drawn from a larger study of intergenerational transmission of depression (Burkhouse, Siegle, & Gibb, 2014) were used in the current study. An unselected child and adolescent sample was used as the major aim of the study was to inform universal depression screening protocols. Exclusion criteria for the larger study included maternal substance abuse within the last six months, history of bipolar disorder, or symptoms of schizophrenia. For the present study, youth were excluded if they had a current or past diagnosis of a major or minor depressive episode as we were focused on predictors for depression onset. Thirty-three youth reported at baseline either a current or past depressive episode and were excluded from analyses for the present study. The racial/ethnic composition of the pediatric sample was 82.88% White (not Hispanic), 10.36% Biracial, 3.6% Black (not Hispanic), and 1.35% Asian/Pacific Islander. The average age of the youth was 10.75 years old (SD = 1.85, Range=8–14) at baseline, and 50.45% were girls. Appendix A provides a narrative on other studies drawn from this sample using shared methods. Of note, a subsample of the present study previously examined the relation between elevated pupil dilation and prospective depressive episodes (approximately 16% of the participants in the current study were represented in this prior publication; Burkhouse et al., 2015).

Measures

Youth Depression Diagnostic Status.

The Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version (K-SADS; Kaufman et al., 1997) was used to assess current and lifetime history of depression at baseline, and the onset of depressive episodes every 6 months for 2 years. The K-SADS is a widely used diagnostic interview with well-established psychometric properties (Klein et al., 2005). Two trained interviewers administered the K-SADS separately to mother-child dyads. Consistent with research diagnostic criteria (Spitzer, Endicott, & Robins, 1978), as well as past research studies on pediatric depression screening (e.g., Cohen, So, et al., 2018), criteria for minor depression included the presence of a criterion A symptom plus at least one symptom from criterion B, which lasted for at least 2 weeks and resulted in clinically significant impairment. In the current study, 21 youth reported a first lifetime episode over the course of the study (11 major and 10 minor episodes). Diagnostic status for youth was determined using “best estimate” procedures (Klein et al., 2005). During each follow-up assessment, interviewers asked about the onset of a depressive episode in the past 6-months. A subset of 20 K-SADS interviews from this project were coded by a second interviewer via video recording and kappa coefficients for depressive diagnoses were excellent (K=1.00).

Self-Reported Depressive Symptoms.

The Children’s Depression Inventory (CDI) was used to assess pediatric depressive symptoms in the current study. The CDI was chosen because it is the most commonly used measure of youth depression (Myers & Winters, 2002), a recommended measure for assessing depression in applied settings (Klein et al., 2005), and has comparable screening properties to other depression inventories (e.g., the CES-D; Stockings et al., 2015). For the present study, the CDI ranged from 0–27 (M = 5.31; SD = 5.03) and demonstrated good reliability (α = 0.82).

Self-Reported Cognitive Vulnerability.

Rumination was assessed via the Children’s Response Style Scale (CRRS; Ziegert & Kistner, 2002). In the present study both the total rumination score and the brooding subscale were used to predict depression onset. The CRRS total score and brooding subscale have exhibited good reliability and validity in youth (Muris, Fokke, & Kwik, 2009; Orue, Calvete, & Padilla, 2014). In the present study, total and brooding scores ranged between 3–90 (M = 51.80; SD = 18.71) and 0–47 (M = 28.86; SD = 18.71), respectively. Both the overall scale and brooding subscale were adequately reliable (Rumination: α = .79; Brooding: α = .68). Depressogenic Inferential Styles (DISs) were assessed with the Children’s Cognitive Style Questionnaire (CCSQ; Abela, 2001). The questionnaire is comprised of hypothetical negative events to assess the tendency to a) catastrophize the consequences of a negative event (DIS-Consequences) and b) make attributions towards oneself following a negative event (DIS-Self). DIS-Consequences and DIS-Self, as opposed to DISs about causes, have been found to be reliable and predictive of prospective depression in both children and adolescents (Cohen, Young, & Abela, 2012). Scores on the DIS-Consequences and DIS-Self ranged between 0–30 (M=12.52; SD=5.19) and 0–21 (M=9.06; SD=4.21) respectively, with good reliabilities on both scales in the present study (DIS-Consequences: α = .78; DIS-Self: α = .75).

Parent-Reported Youth Depression Symptoms.

We used the Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001), specifically the anxious/depressed, withdrawn/depressed, somatic complaints, and internalizing subscales, to query parent perspectives on youth depressive symptoms. These four subscales are valid indicators of depression diagnostic status in youth (Kaufman et al., 1997), and are better index tests than the DSM-oriented CBCL subscales (Ebesutani et al., 2010). All four CBCL subscales demonstrated adequate reliability in past research (see Achenbach & Rescorla, 2001). The range (CBCL-Anxious/Depressed: 0–16; CBCL-Withdrawn: 0–14; CBCL-Somatic: 0–12; CBCL-Internalizing: 0–29) and average raw scores (CBCL-Anxious/Depressed: M=3.39; SD=3.35; CBCL-Withdrawn/Depressed: M=1.80; SD=2.34; CBCL-Somatic: M=1.93; SD=2.31; CBCL-Internalizing: M=6.81; SD=6.16) for the subscales in the current study were comparable to those in past research (Achenbach & Rescorla, 2001). In the present study the anxious/depressed (α = .79), withdrawn/depressed (α = .79), somatic (α = .71) and internalizing subscales (α = .86) demonstrated good internal reliability.

Maternal Depression.

The Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I; First, Spitzer, Gibbon, & Williams, 1995) was used to assess maternal depression. The SCID-I is a widely used diagnostic interview with well-established psychometric properties (Lobbestael, Leurgans, & Arntz, 2011; First et al., 1995). Trained interviewers administered the SCID-I to all participants. In the present study, 45.9% (n = 102) of mothers reported either a current or past depressive episode. In addition, the Beck Depressive Inventory-II (BDI-II; Beck, Steer, & Brown, 1996) was used to assess mothers’ current symptoms of depression. The BDI-II has exhibited excellent reliability and validity in previous research (Storch, Roberti, & Roth, 2004). In the current study, the BDI-II ranged from 0–49 (M = 8.07; SD = 9.29) and exhibited good internal consistency (α = 0.94).

Multi-Informant Depression Inventory.

The present study also used the Children’s Depression Rating Scale-Revised (CDRS-R; Poznanski & Mokros, 1996). The CDRS-R is an interviewer administered measure that separately queries the parent and child and then forms a combined depression score. The CDRS-R has demonstrated good reliability and validity in both child and adolescent samples (e.g., Canals, Marti-Henneberg, Fernandez-Ballart, & Domenech, 1995; Poznanski & Mokros, 1996). Recently, an “unfiltered” version of the CDRS-R was shown to adequately distinguish between depression diagnoses in a large at-risk and unselected pediatric sample (Yee et al., 2015). The “unfiltered” version of the CDRS-R differs from standard diagnostic interviews because it does not contextualize the symptoms across other disorders or within a patient’s individual history (e.g., concentration symptoms are coded as a symptom of depression regardless of ADHD diagnosis). The use of “unfiltered” administrations significantly reduces the amount of time and training burden for diagnostic interviews (Fristad et al., 2016; Yee et al., 2015), making it more amenable to screening protocols outside the context of outpatient mental health services. Scores on the CDRS-R in the present study ranged between 17 and 42 (M=20.40; SD=4.00). In the present study the CDRS-R had good reliability (α=0.78).

Pupil Dilation.

Pupil dilation in response to sad faces was assessed in a moderately lit room using Tobii T60 & T60XL eye-trackers within the context of a morphed faces computer task (see Burkhouse et al., 2014). The stimulus set consists of full-color pictures of actors taken from a standardized stimulus set (Matsumoto & Ekman, 1988). Sad and neutral photographs from each actor were morphed to form a continuum of 10% increments between the two photographs. During this task, pupil size was recorded using the eye trackers at 60 Hz (every 16.7 ms) for 3s following the onset of each facial stimulus. Total time for the task was approximately 5 minutes. Data were cleaned using Siegle, Ichikawa, and Steinhauer’s (2008) standard procedures. Linear interpolations replaced blinks throughout the data set and data were smoothed using a 10-point weighted average filter. Data were resampled to 3 Hz (one sample every 333 ms). The average pupil diameter over the 333 ms preceding the onset of the stimulus was subtracted from pupil diameter after stimulus onset to produce stimulus-related pupil dilation waveforms. Consistent with prior studies highlighting the predictive validity of pupil reactivity to emotional faces at the highest, but not medium or low, levels of morphed intensity (Burkhouse et al., 2014; Burkhouse et al., 2015), we focused our analyses on peak pupil dilation to faces during the highest level of sadness intensity (70–90% morph). Peak stimulus-related pupil dilation (i.e., SadPeak) was calculated by taking the maximum pupil response on average across all trials for sad faces. Levels of pupil dilation ranged between 0–0.17 (M = .06; SD = .04). Of note, similar pieces of apparatus to assess pupil dilation are currently being used in applied pediatric settings (Boev et al., 2005; Connelly et al., 2014), albeit without the use of specific stimuli (e.g., sad faces).

Procedure

Participants were recruited from the community through a variety of means (e.g., bus ads, flyers). Mothers responding to these ads were first screened over the phone to determine eligibility. Upon arrival at the laboratory, mothers provided informed consent and children provided assent to be in the study. Next, the SCID, CBCL, and CDRS-R were administered to the mother by a research assistant. During this time, the child completed the emotional faces paradigm to assess pupil dilation. An interviewer then administered the K-SADS-PL, self-report measures of depressive symptoms (CDI, CDRS-R) and cognitive vulnerabilities (CRRS, CCSQ). Follow-up depression diagnostic assessments with both the mother and youth occurred 6, 12, 18, and 24 months after the first assessment to identify if the youth experienced depression onset. All procedures for this study were approved by the last author’s Institutional Review Board (IRB).

Data Analytic Strategy

First, logistic regression analyses tested if the relation between predictors and depression onset varied as a function of age or sex. If positive, the area under the curve (AUC) was computed separately for girls and boys and/or children (ages 8–11) and early adolescents (ages 12–14). These AUCs were then compared using the Delong test for paired ROC curves (DeLong, DeLong, & Clarke-Pearson, 1988). If significantly different, incremental validity was examined separately across the sociodemographic variable.

Following recommendations for our sample size (Obuchowski & Lieber, 1998), a bias-corrected accelerated (BCa) bootstrap method was used to generate the confidence interval (CI) for our AUCs. In the present study, an AUC greater than 0.64 conferred a medium, significant effect (Rice & Harris, 2005) while an AUC of 0.70 was prioritized as a clinically significant effect (see Swets, 1988). We next examined if index tests achieving an AUC of at least 0.64 differed from baseline CDI scores by using the saved residuals from a linear regression model. The AUC of these residual scores represent the unique variance of the predictor (Edens, Marcus, Lilienfeld, &, Poythress, 2006; Hastings, Krishnan, Tangney, & Stuewig, 2011). When testing the residuals, an index test was considered unique if the CI of the AUC did not include 0.50.

To examine the incremental validity of adding index tests to baseline symptoms, a ROC approach was first applied in which predictive probabilities from logistic regression models where predictors were simultaneously entered were plotted within an AUC curve. This multivariate AUC statistic is commonly referred to as the C statistic. After identifying the highest C statistic between two predictors, we entered a third predictor. This step-wise approach continued until no significant predictors were left or the C statistic remained unchanged.

Next, we used reclassification analyses to evaluate the incremental validity of our predictors. Reclassification tables are derived by examining how many additional cases are accurately identified by positive scores on an additional risk indicator (Pencina et al., 2011). The Net Reclassification Improvement (NRI) index, the most common statistic derived from these analyses, is the sum of the false negatives misclassified by the baseline algorithm subtracted by the number of false positives included in the new algorithm (see Pencina et al., 2008 for further details). We used the continuous model as described by Pencina and colleagues (2011) as it provides less biased results compared to using categorical approaches. At each stage, novel index tests were simultaneously added to a model. If multiple predictors were significant, the indicator with the highest NRI was retained and re-entered into a new baseline model and reclassification analyses were conducted with subsequent predictors. This process was repeated until the NRI of any of the predictors was no longer significant (see Cohen, Shorey, et al., 2018 for a demonstration). The NRI, a bootstrapped C statistic, the Hosmer-Lemeshow goodness of fit statistic, and the integrated discrimination index (IDI) were calculated to provide complementary perspectives on our final models (Pencina, D’Agostino, Pencina, Janssens, & Greenland, 2012).

Finally, diagnostic likelihood ratios (DLRs) were calculated to facilitate translation of our findings into applied settings (Straus et al., 2011). DLRs are the proportion of cases with a diagnosis within a certain scoring range on the index test divided by the proportion of cases without the diagnosis in that same range of scores. Ideally, DLRs contain at least three cutoffs so that risk can be operationalized dimensionally across “low”, “intermediate” and “high” risk categories. Selection for cutoffs can either be informative (based on established cutoffs) or equal (cutoffs that form three equivalent groups with regard to sample size).

We utilized an informative tertile approach for our baseline predictor, the CDI. Original cutoffs for the CDI were proposed to range between raw scores of 12 and 19 (Kovacs, 1992); however, these cutoffs may not be sensitive within an unselected population (Matthey & Petrovski, 2002). Recently, Cohen, So, Hankin, and Young (2018) used CDI cutoffs of 7 (subthreshold) and 15 (threshold) based on levels of sensitivity and specificity congruent with current pediatric mental health screening initiatives (70% sensitivity; 90% specificity; Lavigne, Feldman, & Meyers, 2016). For the current study, we formed three groups (i.e., tertiles) based on these cutoffs, but also tested cutoffs that corresponded to 70% sensitivity (subthreshold) and 90% specificity (threshold) for predicting onset in our sample. For all other index tests, equal tertiles were formed. After DLRs for each index test were computed, posterior probabilities were calculated based on the CDI, and then we examined the incremental impact on probability once adding additional risk factors (see Youngstrom, Halverson, Youngstrom, Lindhiem, & Findling, 2018 for an example). DLR scores of 1.0 suggest that one’s odds of developing the disorder remain relatively unchanged (i.e., the assessment result was neutral). Higher DLRs suggest that the odds of developing the target diagnosis increase. All analyses were conducted in R (3.4.4), with the exception of DLRs which were calculated in SPSS (24.0). A table in the supplemental materials available online summarizes each step of our multi-analytic plan. Interested readers are encouraged to consult this table to more clearly understand the function of each step.

Results

The number of youth with depression onset exceeded the minimum number of 20 cases needed for ROC analyses (n = 21; Kraemer, 1992). Correlations for our baseline predictors are presented in Table 1. Relations between our index tests ranged from non-significant to medium effect sizes. We then examined whether any of our indicators varied as a function of sex or age. Logistic regression analyses were non-significant, leading us to calculate AUCs across the whole sample. AUCs for all index tests are displayed in Table 2. Overall, CDI scores, CBCL withdrawn-depressed symptoms, CBCL somatic complaints, CBCL internalizing symptoms, and maternal depression diagnostic status, exhibited a significant, medium effect (AUC ≥ .64). Meanwhile, only CDRS-R scores and peak pupil dilation while viewing sad stimuli (SadPeak) were clinically significant (AUC ≥ .70). Of note, our self-reported cognitive vulnerabilities did not predict onset (ps > .05). None of the predictors with AUCs between 0.64 and 0.70 remained significant once covarying out CDI scores (p > .05). Alternatively, SadPeak (AUC = .75, p < .01) and CDRS-R scores (AUC = .63, p =.05) remained significant. The DeLong test suggested that the CDI, CDRS-R, and SadPeak equivalently forecasted depression onset (p > .10).

Table 1.

Correlations between Baseline Predictors of Youth Depression Status

Youth-Report Symptoms and Risk (Cognitive Vulnerability) Parent-Report Symptoms and Risk (Maternal Depression) Multi-Informant Symptoms
CDI CRRS-T CRRS-B CCSQ-C CCSQ-S CBCL-A/D CBCL-W/D CBCL-SOM CBCL-INT SCID BDI-II CDRS-R
Youth-Report Symptoms and Risk CDI
CRRS-T .15*
CRRS-B 19** .88**
CCSQ-C .22** .15* .21**
CCSQ-S .24** .10 .09 .42**
Parent-Report Symptoms and Risk (Maternal Depress) CBCL-A/D .21** .06 .09 .00 .11
CBCL-W/D .28** .10 .07 −.01 −.02 .55**
CBCL-SOM .16* .08 .07 −.03 .03 .42** .54**
CBCL-INT .24** .03 .05 −.02 .08 .80** .80** .77**
SCID .16* .05 .06 .10 −.03 .24** .28** .25** .30**
BDI-II .16* .00 .05 .03 −.09 .22** .39** .31** .26** .54**
Multi-Informant Symptoms
CDRS-R .29** .15* .13* .04 .01 .30** .36** .29** .33** .38** .32**
Pupil Dilation SadPeak .08 −.02 .04 .03 −.08 −.08 .13 .04 .02 .22* .06 .23*

Note: Table presents Pearson correlation coefficients for baseline predictors of youth depression status. CDI= Child self-report score on the Children’s Depression Inventory (CDI; Kovacs, 1992); CRRS-T=Child self-report score on the Children’s Response Style Scale (CRRS; Ziegert & Kistner, 2002).; CRRS-B=Child self-report score on the Brooding subscale of the Children’s Response Style Scale (CRRS; Ziegert & Kistner, 2002); CCSQ-C=Child self-report on tendency to catastrophize consequences of a negative event on the Children’s Cognitive Style Questionnaire (CCSQ; Abela, 2001);CCSQ-S=Child self-report on self-attributions following a negative event on the Children’s Cognitive Style Questionnaire (CCSQ; Abela, 2001); CBCL-A/D= Anxious/Depressed subscale score on the parent reported Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001); CBCL-W/D=Withdrawn/Depressed subscale score on the parent reported Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001); CBCL-SOM=Somatic subscale on the parent reported Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001); CBCL-INT=Internalizing subscale score on the parent reported Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001); SCID=The Structured Clinical Interview for DSM-IV Axis I Disorders (No lifetime maternal depression diagnosis=0; Lifetime maternal depression diagnosis=1); BDI-II=Parental self-report score on the Beck’s Depressive Inventory-II (BDI-II; Beck, Steer, & Brown, 1996); (SCID-I; First, Spitzer, Gibbon, & Williams, 1995); CDRS-R=Combined child self-report and parent reported scores on the Children’s Depression Rating Scale-Revised (CDRS-R; Poznanski & Mokros, 1996); SadPeak=Child’s peak pupil dilation to faces during the highest level of sadness intensity.

*

p<0.05;

**

p<0.01.

Table 2.

Summary of Individual Predictors and Multi-Indicator Algorithms

Area Under the Curve and Effect Sizes for Individual Predictors
Predictor AUC BCa CI Cohen’s d effect size
CDI 0.64* 0.51–0.76 0.51–M
CRRS-T 0.52 0.410.63 0.07
CRRS-B 0.58 0.480.68 0.29S
CCSQ-C 0.59 0.490.69 0.32S
CCSQ-S 0.56 0.450.67 0.21S
CBCL-A/D 0.59 0.440.74 0.32S
CBCL-W/D 0.67* 0.57–0.77 0.62–M
CBCL-SOM 0.68* 0.55–0.80 0.66–M
CBCL-INT 0.65* 0.52–0.78 0.55–M
SCID-Diagnosis 0.64* 0.54–0.75 0.51–M
BDI-II 0.62 0.48–0.76 0.43–S
CDRS-R 0.72** 0.61–0.83 0.82–L
SadPeak 0.76** 0.55–0.94 1.00–L
Discrimination and Calibration Statistics for All Multi-Indicator Algorithms
Baseline Model: CDI
NRI Lower Bound Upper Bound IDI Lower Bound Upper Bound C Lower Bound Upper Bound H -L X2 p
CDRS-R .76* .09 1.43 .08 −.01 .17 0.75 0.65 0.84 12.69 0.12
SadPeak 96*** .38 1.53 .21** .03 .38 0.77 0.60 0.94 8.85 0.36
Baseline Model: CDI and Sad Peak
NRI Lower Bound Upper Bound IDI Lower Bound Upper Bound C Lower Bound Upper Bound H-L X2 p
CDRS-R .63 −.04 1.31 .05 −.06 .16 0.84 0.71 0.96 7.15 0.52

Note: CDI= Child self-report score on the Children’s Depression Inventory (CDI; Kovacs, 1992); CRRST=Child self-report score on the Children’s Response Style Scale (CRRS; Ziegert & Kistner, 2002).; CRRSB=Child self-report score on the Brooding subscale of the Children’s Response Style Scale (CRRS; Ziegert & Kistner, 2002); CCSQ-C=Child self-report on tendency to catastrophize consequences of a negative event on the Children’s Cognitive Style Questionnaire (CCSQ; Abela, 2001);CCSQ-S=Child self-report on self-attributions following a negative event on the Children’s Cognitive Style Questionnaire (CCSQ; Abela, 2001); CBCL-A/D= Anxious/Depressed subscale score on the parent reported Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001); CBCL-W/D=Withdrawn/Depressed subscale score on the parent reported CBCL; CBCL-SOM=Somatic symptoms subscale on the parent reported CBCL; CBCL-INT=Internalizing subscale score on the parent reported CBCL; SCID=The Structured Clinical Interview for DSM-IV Axis I Disorders (No lifetime maternal depression diagnosis=0; Lifetime maternal depression diagnosis=1); BDIII=Parental self-report score on the Beck’s Depressive Inventory-II (BDI-II; Beck, Steer, & Brown, 1996); (SCID-I; First, Spitzer, Gibbon, & Williams, 1995); CDRS-R=Combined child self-report and parent reported scores on the Children’s Depression Rating Scale-Revised (CDRS-R; Poznanski & Mokros, 1996); SadPeak=Child’s peak pupil dilation to faces during the highest level of sadness intensity. AUC=Area Under the Curve. BCa CI=Bias-corrected accelerated confidence intervals. S=Small effect (d > .20); M=Medium effect (d> .50); L=Large Effect (d>.80). NRI=Net Reclassification Improvement Index (Pencina et al., 2012), IDI=Integrated Discrimination Index (Pencina et al., 2012); C=Multivariate AUC; H-L X2 = Hosmer-Lemeshow Test which tests whether expected rates of the outcome are significantly different than the observed rates. Non-significant p suggests the model is well-calibrated.

*

p<0.05;

**

p<0.01.

Next, we examined the C statistics for CDRS-R and SadPeak added to CDI scores. Overall, the C statistic for CDRS-R together with CDI (0.75) represented a 17% increase from the CDI’s AUC, while SadPeak and CDI’s C (0.78) demonstrated a 22% increase above the CDI alone. Meanwhile, the C statistic for all three index tests was 0.85, a 33% increase above our baseline CDI score. Although the DeLong test for paired ROC suggested using two index tests did not incrementally predict onset better than CDI alone (p > .05), using three index tests together (CDI, CDRS-R, and SadPeak) better forecasted onset compared to only using CDI scores (DeLong test= −3.17, p < .01). As for reclassification analyses, the NRI for CDRS-R scores and SadPeak were both significant compared to our baseline CDI model (p < .05). Because SadPeak had the higher NRI it was entered into a new baseline model that included CDI scores. For this model, CDRS-R scores were non-significant (p > .05). Summary statistics for the ROC-algorithm (CDI, CDRS-R, SadPeak) and reclassification-algorithm (CDI, SadPeak) are presented in Table 2.

Finally, the top panel in Table 3 displays the DLRs for our algorithms. Within our sample, a score of 4 on the CDI corresponded to 70% sensitivity, and a score of 10 conferred 90% specificity. However, DLRs between these cutoffs, and our a-priori cutoffs (7, 15) were similar (e.g., DLRs differed by less than 10% for high scores), prompting us to use the a-priori cutoffs to facilitate comparisons across the literature. Subthreshold (i.e., the medium tertile) and threshold (i.e., high tertile) scores were 18 and 20 for CDRS-R, and 0.45 and 0.72 for SadPeak. When examining the CDI alone, individuals with scores above threshold (i.e., ≥ 15) were approximately 2.5 times as likely to experience a first lifetime episode of depression compared to the average prevalence rate for depression onset within the sample (see the first post-test probability in the middle panel). The middle panel of Table 3 demonstrates how including SadPeak, our most robust novel indicator, into the decision algorithm influences probabilities. These estimates were drawn using a naïve approach (see Youngstrom et al., 2018), in which the posterior probability, as estimated by the CDI, becomes the new pre-test probability. The incremental impact of SadPeak is then demonstrated by multiplying the new pre-test odds by the DLR derived from SadPeak scores to calculate the new posterior probability. Similar estimates can be generated using the DLRs for CDRS-R both with and without SadPeak. Of note for our reclassification algorithm (CDI, SadPeak), scores above threshold significantly increase risk (see example cases 2 and 3), while scores below threshold on the SadPeak attenuate risk (see example cases 1 and 4). These findings suggest that our multi-indicator approach can be used to both “rule in” and “rule out” screening cases that are at or near cutoff scores on the CDI.

Table 3.

Diagnostic Likelihood Ratios for Significant Predictors and Stoplight Model for Examples of Screening Cases.

Diagnostic Likelihood Ratios
Model Low DLR Medium DLR High DLR
CDI 0.77 1.40 2.51
SadPeak 0.27 0.59 2.39
CDRS-R 0.19 0.73 1.70
Examples of Screening Cases
Pre-Test Probability Scoring Profile Post-Test Probability
(1) 8 year-old female 10.78% CDI: 15
SadPeak: 0.38
CDI Alone: 23.27%
CDI; SadPeak: 7.46%
(2) 8 year-old male 6.25% CDI: 14
SadPeak: 0.79
CDI Alone: 8.53%
CDI; SadPeak: 18.03%
(3) 14 year-old female 15.38% CDI: 14
SadPeak: 0.79
CDI Alone: 20.28%
CDI; SadPeak: 37.83%
(4) 14 year-old male 11.77% CDI: 14
SadPeak: 0.38
CDI Alone: 15.74%
CDI; SadPeak: 4.80%
Interpretation
(1) 8 year-old female Yellow: Despite an above threshold CDI score, a low score for pupil dilation leads to a slightly lower likelihood of experiencing depression onset than someone in the general population. As the CDI score is elevated, continued monitoring is warranted but others with above threshold CDI scores and pupil dilation should be prioritized for services.
(2) 8 year-old male Red: Just relying on CDI scores one would potentially would be missing out on someone who is nearly 3-times more likely to experience depression onset. This child should be immediately referred for a mental health assessment, and any preventative services should be made readily available.
(3) 14 year-old female Red: The subthreshold CDI score confers that this individual is slightly more at risk to develop depression. The above threshold pupil dilation score together with the subthreshold CDI score, however, suggests that this person is nearly three times as likely to experience depression onset.
(4) 14 year-old male Green: Despite presenting with a CDI score near the threshold, this individual is actually at little risk for experiencing a depressive episode over the next two years. Specifically, low scores on pupil dilation together with subthreshold scores on the CDI suggest that this individual is approximately 300% less likely to experience a first lifetime episode of depression compared to his pre-test probability.

Note: CDI= Child self-report score on the Children’s Depression Inventory (CDI; Kovacs, 1992); CDRS-R=Combined child self-report and parent reported scores on the Children’s Depression Rating Scale-Revised (CDRS-R; Poznanski & Mokros, 1996); SadPeak=Child’s peak pupil dilation to faces during the highest level of sadness intensity; DLR = Diagnostic Likelihood Ratio; Low DLR: CDI = < 7; SadPeak= < .45; CDRS-R= < 18; Medium DLR: CDI= 7–14; SadPeak=.45-.71; CDRS-R=18–19; High DLR: CDI= ≥15; SadPeak= ≥ .72; CDRS-R = ≥ 20. Pre-Test Probability = The percentage chance of each depression outcome based on sex and age; Post-Test Probability = (prevalence/(1-prevelance)* DLR)/((prevalence/(1-prevelance)) + 1) (Straus et al., 2011); Green Zone=standard screening approach; Yellow Zone = increased monitoring; Red Zone = implement brief intervention (or prioritize for mental health services in settings now equipped to implement mental health services).

Discussion

It is now well-established that multiple indicators are needed for an adequate mental health assessment (Hunsley & Mash, 2007). Yet, due to a paucity of incremental validity studies (Johnston & Murray, 2003), it is unclear which indicators should be targeted. Overall, we found moderate support for the incremental validity of multi-informant approaches, and convincing evidence that a specific psychophysiological index, pupil dilation, improves our ability to predict depression onset in youth even when compared to other informant reports or well-documented risk factors. The translational, clinical significance of these findings is discussed below.

As efficiency is prioritized within a screening setting, there is a significant need to demonstrate the incremental validity of a potential index test. Although a multi-informant approach is commonly used for discriminating between concurrent diagnostic statuses within the assessment context (De Los Reyes et al., 2015), research is mixed about whether it may improve forecasts for prospective functioning (Johnson et al., 2014). Overall, our findings suggest that a clinician-administered, multi-informant depression screen (CDRS-R) added incremental validity, while a multi-informant questionnaire for internalizing symptoms (CBCL) did not. Past research suggests that parental perspectives may be especially helpful in identifying behavioral manifestations of psychological distress in youth (De Los Reyes et al., 2015). As the CDRS-R represents a more focused assessment of depressive symptoms and specifically incorporates behavioral observations into its score (Poznanski & Mokros, 1996), it is well-positioned to identify early behavioral manifestations of depression. As depression related behavioral symptoms uniquely predict onset in adolescence (Ong, Bergeman, Bisconti, & Wallace, 2006), the inclusion of multi-informant behavioral screening initiatives may be especially important for detecting first lifetime episodes of depression in youth.

While multi-informant depression assessments are a reasonable approach to improving depression screening initiatives, assessing pupil dilation contributed to optimal models for predicting depression onset. Our study extended past research (e.g., Burkhouse et al., 2015) by (i) showing its association with first lifetime episodes within an unselected sample and (ii) its incremental validity compared to other index tests. This finding is also consistent with recent work suggesting that psychophysiological and neurobiological tools can improve prediction models for psychiatric illnesses (De Los Reyes & Aldao, 2015; Gabrieli, Ghosh, & Whitfield-Gabrieli, 2015). It is noteworthy that neither self-reported cognitive vulnerabilities nor indicators of maternal depression were predictive of our outcome. One reason for this may be is that self-reports of cognitive vulnerabilities, and to a lesser extent subjective reports of maternal depression, shared a significant amount of variance with our baseline CDI model (see Table 1). However, within the univariate models, pupil dilation still achieved a higher AUC than these other risk factors, suggesting shared method variance alone did not explain why pupil dilation’s incremental validity was unique in this study.

Pupil dilation may have emerged as a superior candidate for multi-indicator screening protocols due to its objective nature. With regard to rumination and depressogenic inferential styles, past research concerning their ability to forecast prospective episodes of depression compared to other indicators is mixed (Cohen, So, et al., 2018; Seeley, Stice, & Rohde, 2009). One possibility is that self-reported cognitive vulnerabilities are more influential in predicting recurrent depressive episodes, as opposed to first lifetime episodes, due to increased meta-cognitive awareness (Teasdale, Moore, Hayhurst, Pope, Williams, & Segal, 2002). Specifically, individuals who have previously experienced a depressive episode may be more likely to be aware of and report on negative cognitive styles (Sheppard & Teasdale, 2004). In response, autonomic indices of cognitive vulnerability, such as pupil dilation, may be needed for primary depression screening initiatives (Rawal, Collishaw, Thapar, & Rice, 2013).

As for maternal depression, we found a small-medium effect for predicting depression onset in our univariate models. This finding is consistent with past research on community adolescent samples (e.g., Hammen & Brennan, 2003; Hammen et al., 2008) and supports connect adolescent depression screening initiatives to primary care sites that routinely screen for maternal depression (Garber et al., 2009; Halligan et al., 2007). Yet, when determining a youth’s need for mental health services, more proximal risk factors should be prioritized. For instance, youth within different age cohorts who are exposed to maternal depression early in life demonstrate dysregulation across a wide array of systems, including the autonomic nervous system (Propper & Holochwost, 2013). Thus, indicators of the pathways stemming from maternal depression to adolescent depression onset (e.g., pupil dilation) may provide better insight into depression risk.

The identification of pupil dilation as a biomarker for depression onset is consistent with RDoC’s objective to identify markers of dysfunctional neural circuitry that have translational promise (Insel et. al, 2010). More direct assessments of neural activity (e.g., via imaging), may not be as easily integrated into screening protocols from a resource perspective. Therefore, cheaper and briefer psychophysiological assessments linked to neural activity can be a more feasible approach for achieving this goal (De Los Reyes & Aldao, 2015). In the past, pupillary responses to emotionally salient images have been linked with increased dorsolateral prefrontal function (Siegle, Steinhauer, Friedman, Thompson, & Thase, 2011), which is associated with deficits in executive control and emotion regulation in depressed patients (Koenigs & Grafman, 2009). Thus, pupil dilation may not only represent an objective indicator of depression onset, but also a translational window into the neural circuitry underlying depressogenic risk.

A significant challenge to the translation of novel index tests, biological and otherwise, into routine clinical practice is the absence of a standard analytic approach to assess incremental validity. Compared to reliability and other forms of validity (e.g., internal, external), incremental validity has garnered less attention from methodologists (Hunsley & Meyer, 2003), and as a result is rarely adequately tested (Cohen, So, et al., 2018; Garb, 2003). With the introduction of RDoC and the proliferation of developmental psychopathology research (Franklin, Jamieson, Glenn, & Nock, 2015), there is a need to test which risk factors should be prioritized for translational research. By using a ROC analytic plan (Youngstrom et al., 2017), together with reclassification analyses (Pencina et al., 2011), we were able to provide a multi-method perspective on incremental validity. Adopting a similar analytic approach for other potential index tests for pediatric depression, as well as for other pediatric mental health conditions, can help bring the promise of RDoC and developmental psychopathology to the clinical setting.

The present study had several strengths (e.g., multi-method risk assessment, diagnostic depression interview, prospective design); however, there are also some notable limitations. First, we only assessed depression as an outcome. While the USPTF calls for preventative screening research specifically for depression (Siu et al., 2016), others have suggested screening protocols need to capture a broader range of pediatric mental health concerns (Lavigne et al., 2016). Second, we were only able to operationalize pupillary reactivity within the first three seconds of the stimulus presentation. While this brief assessment is ideal for the screening setting, pupillary differences well after the stimulus presentation are also indicative of cognitive vulnerability for emotionally salient information (Siegle et al., 2003; Silk et al., 2009). Third, as is common with multi-wave, longitudinal designs, the data in the current study was censored. Specifically, youth who already reported depression were excluded from the study (left-censored) and others will certainly develop a depressive episode following the 2-year follow-up period (right-censored). Thus, our findings are relatively conservative in nature. Fourth, while we did not find any significant demographic differences for age and/or sex, our sample size may have been too limited to test these exploratory hypotheses. Given well-documented sex and age differences in the risk profile for depression (Avenevoli et al., 2015), future studies will want to investigate whether algorithms vary as a function of demographics within pediatric subsamples.

Finally, despite the statistical prowess concerning pupil dilation, and to a lesser extent multi-informant approaches, challenges to integrating these methods into clinical practice remain. Most notably, the indicators that demonstrated incremental validity were also the most expensive methods. Therefore, initial costs of set-up with purchasing apparatus, and the cost of training and employing people to administer a diagnostic interview, may make it challenging to integrate multi-method screening protocols into low-resourced settings. Additionally, the use of these screening measures will likely extend appointments (approximately 5 minutes for pupil dilation; 10–15 minutes for the CDRS-R) during an era when pediatricians are increasingly burdened with screening directives and shrinking appointment times (Lavigne et al., 2016). Thus, while we achieved our first aim of introducing a novel indicator into a screening/assessment protocol by demonstrating its incremental validity (Johnston & Murray, 2003), future studies must now replicate these findings in an applied setting (Youngstrom et al., 2017), ideally using publicly available depression measures (e.g., the PHQ-A; Richardson et al., 2010) in an effort to help reduce the cost of a multi-indicator approach. These studies are necessary to conduct prior to the implementation of our findings into screening protocols, to ensure that we can replicate our findings, and cross-validate our proposed algorithm.

Clinical Implications

Despite some translational barriers, there are trends in the literature that make the integration of pupillometry and diagnostic interviews into universal mental health screening promising. Similar to other psychophysiological assessments (see De Los Reyes & Aldao, 2015), innovations have lowered the financial and temporal burden associated with pupillometry (Nowak, Żarowska, Szul-Pietrzak, & Misiuk-Hojło, 2014). Across pediatric contexts (e.g., pain clinics) pupil dilation is now regularly measured as an indicator of ANS functioning (Boev et al., 2005; Connelly et al., 2014). Advantages of using pupil dilation compared to other biological measures is that it is non-invasive, can be collected across settings via mobile technology, and is easily administered and interpreted by practitioners without significant expertise. The only meaningful difference in how pupil dilation is measured in clinical versus research settings is the use of stimuli. As the most robust findings for pupil dilation is within the context of viewing depressogenic stimuli (e.g., Silk et al., 2007; Silk et al., 2009), it is important for future studies to test whether integrating this component into clinical settings is problematic in any way.

Similar to pupillometry, advancements in the assessment literature make the use of diagnostic interviews more feasible within a screening setting. The current study used an “unfiltered” version of the CDRS-R as our multi-informant indicator. The focused nature of these interviews, in which comorbid diagnoses and chronicity of symptoms are not assessed, reduces the administration time and the training burden (Firstad et al., 2016; Yee et al., 2015). Thus, while future studies should replicate these findings in applied, clinical settings (Youngstrom et al., 2017), there is reason to believe that pupillomtery and diagnostic interviews can be used in cost-effective manners.

To illustrate how our study’s translational approach may be useful with regard to clinical decision making, Table 3 includes four example screening profiles from our study. Below we briefly guide the reader on the stepped approach to creating and interpreting Table 3 (see Cohen, So, et al., 2018; Youngstrom et al., 2015 for further guidance). The first step towards making evidence-based referrals is calculating the probability of developing a first lifetime episode of depression in the overall sample. Pre-test probabilities (Middle Panel, Column 2) in the present study were calculated separately based on age and sex. Example profiles of scores at or approaching the cut-off (15) for the CDI are presented in the next column (middle panel), as these scores are especially challenging from a referral perspective (Sheldrick et al., 2015). Corresponding DLRs (top panel of Table 3) for both a traditional approach (CDI alone) and our reclassification algorithm (CDI and SadPeak) were then used to calculate the posterior prevalence for onset. Referral decisions are ultimately based on the post-test prevalence for developing the target disorder within an EBM approach (Youngstrom, 2014).

Youngstrom and colleagues’ (2015) stoplight model was used to provide a framework to interpret the posterior probabilities (see bottom panel of Table 3).1 The value of multiple indicators is best exemplified when comparing two adolescents in our study (Examples 3 and 4). Despite having the same subthreshold CDI score (14), implementation of a brief intervention (Red) is warranted for the girl as opposed to the boy where no behavioral response from the provider is required (Green). The girl’s profile, subthreshold CDI and threshold pupil dilation, corresponds to more than a two-fold increase in likelihood for experiencing a first lifetime episode. Meanwhile, the boy’s pupil dilation score was below subthreshold, suggesting minimal risk for depression onset in the context of a subthreshold CDI score. By merely examining CDI scores, differentiating between these two cases and tailoring the clinical response would not be possible. Thus, consistent with recent calls for more research on pediatric depression onset (Siu et al., 2016), our findings represent a step towards screening protocols that can target vulnerability for depression onset in youth via a risk factor approach. Continued focus on the incremental validity of novel index tests for primary prevention mental health screening carries the promise of bringing personalized prevention solutions to primary care and school settings.

Supplementary Material

1

Acknowledgements:

This project was supported by National Institute of Child Health and Human Development grant HD057066 and National Institute of Mental Health grant MH098060 awarded to B.E. Gibb. We would like to thank Ashley Johnson, Lindsey Stone, Andrea Hanley, Sydney Meadows, Michael Van Wie, Devra Alper, Cope Feurer, Erik Funk, and Effua Sosoo for their help in conducting assessments for this project.

Appendix A: Narrative of Related Studies

There have been two previously published reports that involve findings concerning pupil dilation in youth previously published from the larger study from which we drew our sample. The first study (Manuscript 1) examined only baseline data and showed individual differences in pupil dilation of children of depressed and anxious mothers. In the second study (Manuscript 2) the authors found that in a subsample of youth with a depressed mother, greater pupillary reactivity predicted prospective depressive symptoms and episodes. Approximately 16% of the current study was also used in Manuscript 2. The current study is distinct from these studies by examining pupil dilation a) within a larger, never depressed subsample of youth, b) examining first lifetime episodes of depression as the criterion, and c) examining the incremental validity and subsequent clinical utility of pupil dilation within the context of other informant and risk factor screening approaches.

Footnotes

1

The original “stoplight” model was designed for the assessment setting with “red” conferring immediate intervention. However, it may not be possible for some settings to implement an intervention immediately. Therefore, “red” may confer prioritizing for referrals to specialty care, in settings that do not have the capacity for intervention.

References

  1. Abela JRZ (2001). The hopelessness theory of depression: a test of the diathesis–stress and causal mediation components in third and seventh grade children. Journal of Abnormal Child Psychology, 29(3), 241–254. doi: 10.1023/A:1010333815728. [DOI] [PubMed] [Google Scholar]
  2. Achenbach TM, & Rescorla LA (2001). Manual for the Achenbach system of empirically based assessment school-age forms profiles. Burlington, VT: Aseba. [Google Scholar]
  3. Alloy LB, Abramson LY, Whitehouse WG, Hogan ME, Panzarella C, & Rose DT (2006). Prospective incidence of first onsets and recurrences of depression in individuals at high and low cognitive risk for depression. Journal of Abnormal Psychology, 115(1), 145. doi: 10.1037/0021-843X.115.1.145. [DOI] [PubMed] [Google Scholar]
  4. Avenevoli S, Swendsen J, He JP, Burstein M, & Merikangas KR (2015). Major depression in the National Comorbidity Survey–Adolescent Supplement: prevalence, correlates, and treatment. Journal of the American Academy of Child & Adolescent Psychiatry, 54(1), 37–44. doi: 10.1016/j.jaac.2014.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Balazs J, Miklósi M, Keresztény Á, Hoven CW, Carli V, Wasserman C, … & Cotter P (2013). Adolescent subthreshold-depression and anxiety: Psychopathology, functional impairment and increased suicide risk. Journal of Child Psychology and Psychiatry, 54(6), 670–677. doi: 10.1111/jcpp.12016. [DOI] [PubMed] [Google Scholar]
  6. Beck AT, Steer RA, & Brown GK (1996). Manual for the Beck Depression Inventory-II. San Antonio, TX: Psychological Corporation. [Google Scholar]
  7. Belamarich PF, Gandica R, Stein RE, & Racine AD (2006). Drowning in a sea of advice: pediatricians and American Academy of Pediatrics policy statements. Pediatrics, 118(4), e964–e978. [DOI] [PubMed] [Google Scholar]
  8. Boev AN, Fountas KN, Karampelas I, Boev C, Machinis TG, Feltes C, … & Troup C (2005). Quantitative pupillometry: normative data in healthy pediatric volunteers. Journal of Neurosurgery: Pediatrics, 103(6), 496–500. [DOI] [PubMed] [Google Scholar]
  9. Bradley MM, Miccoli L, Escrig MA, & Lang PJ (2008). The pupil as a measure of emotional arousal and autonomic activation. Psychophysiology, 45(4), 602–607. doi: 10.1111/j.1469-8986.2008.00654.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Burkhouse KL, Siegle GJ, & Gibb BE (2014). Pupillary reactivity to emotional stimuli in children of depressed and anxious mothers. Journal of Child Psychology and Psychiatry, 55(9), 1009–1016. doi: 10.1111/jcpp.12225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Burkhouse KL, Siegle G, Woody M, Kudinova A, & Gibb B (2015). Pupillary reactivity to sad stimuli as a biomarker of depression risk: Evidence from a prospective study of children. Journal of Abnormal Psychology, 124(3), 498–506. doi: 10.1037/abn0000072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bylsma LM, Mauss IB, & Rottenberg J (2016). Is the divide a chasm?: Bridging affective science with clinical practice. Journal of Psychopathology and Behavioral Assessment, 38(1), 42–47. doi: 10.1007/s10862-015-9525-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Canals J, Marti-Henneberg C, Fernandez-Ballart J, & Domenech E (1995). A longitudinal study of depression in an urban Spanish pubertal population. European Child & Adolescent Psychiatry, 4(2), 102–111. doi: 10.1007/BF01977738. [DOI] [PubMed] [Google Scholar]
  14. Center for Behavioral Health Statistics and Quality. (2016). Key substance use and mental health indicators in the United States: Results from the 2015 National Survey on Drug Use and Health (HHS Publication No. SMA 16–4984, NSDUH Series H-51). Retrieved from http://www.samhsa.gov/data/.
  15. Cohen JR, Young JF, & Abela JR (2012). Cognitive vulnerability to depression in children: An idiographic, longitudinal examination of inferential styles. Cognitive Therapy and Research, 36(6), 643–654. doi: 10.1007/s10608-011-9431-6. [DOI] [Google Scholar]
  16. Cohen JR, Adams ZW, Menon SV, Youngstrom EA, Bunnell BE, Acierno R, … & Danielson CK (2016). How should we screen for depression following a natural disaster? An ROC approach to post-disaster screening in adolescents and adults. Journal of Affective Disorders, 202, 102–109. doi: 10.1016/j.jad.2016.05.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cohen JR, So FK, Hankin BL, & Young JF (2018). Translating cognitive vulnerability into improved adolescent depression screening: A receiver operating characteristic approach. Journal of Clinical Child and Adolescent Psychology. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Connelly MA, Brown JT, Kearns GL, Anderson RA, St Peter SD, & Neville KA (2014). Pupillometry: a non-invasive technique for pain assessment in pediatric patients. Archives of disease in childhood, archdischild-2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. De Los Reyes A, & Aldao A (2015). Introduction to the special issue: Toward implementing physiological measures in clinical child and adolescent assessments. Journal of Clinical Child & Adolescent Psychology, 44(2), 221–237. doi: 10.1080/15374416.2014.891227. [DOI] [PubMed] [Google Scholar]
  20. De Los Reyes A, Augenstein TM, Wang M, Thomas SA, Drabick DAG, Burgers DE, & Rabinowitz J (2015). The validity of the multi-informant approach to assessing child and adolescent mental health. Psychological Bulletin, 141(4), 858–900. doi: 10.1037/a0038498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. DeLong ER, DeLong DM, & Clarke-Pearson DL (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics, 44, 837–845. [PubMed] [Google Scholar]
  22. Earls MF (2010). Committee on Psychosocial Aspects of Child and Family Health American Academy of Pediatrics: Incorporating recognition and management of perinatal and postpartum depression into pediatric practice. Pediatrics, 126(5), 1032–1039. doi: 10.1542/peds.2010-2348. [DOI] [PubMed] [Google Scholar]
  23. Ebesutani C, Chorpita BF, Higa-Mcmillan CK, Nakamura BJ, Regan J, & Lynch RE (2010). A psychometric analysis of the revised child anxiety and depression scales—parent version in a school sample. Journal of Abnormal Child Psychology, 39(2), 173–185. doi: 10.1007/s10802-010-9460-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Edens J, Marcus D, Lilienfeld S, & Poythress N (2006). Psychopathic, not psychopath: Taxometric evidence for the dimensional structure of psychopathy. Journal of Abnormal Psychology, 115(1), 131–144. doi: 10.1037/0021-843X.115.1.131. [DOI] [PubMed] [Google Scholar]
  25. First MB, Spitzer RL, Gibbon MWJB, & Williams JB (1995). Structured clinical interview for DSM-IV axis I disorders. New York: New York State Psychiatric Institute. [Google Scholar]
  26. Franklin JC, Jamieson JP, Glenn CR, & Nock MK (2015). How developmental psychopathology theory and research can inform the research domain criteria (RDoC) project. Journal of Clinical Child & Adolescent Psychology, 42(2), 280–290. doi: 10.1080/15374416.2013.873981. [DOI] [PubMed] [Google Scholar]
  27. Fristad MA, Emery BL, & Beck SJ (1997). Use and abuse of the Children’s Depression Inventory. Journal of Consulting and Clinical Psychology, 65(4), 699. doi: 10.1037/0022-006X.65.4.699. [DOI] [PubMed] [Google Scholar]
  28. Fristad MA, Wolfson H, Algorta GP, Youngstrom EA, Arnold LE, Birmaher B, … & LAMS Group. (2016). Disruptive mood dysregulation disorder and bipolar disorder not otherwise specified: fraternal or identical twins? Journal of Child and Adolescent Psychopharmacology, 26(2), 138–146. 10.1089/cap.2015.0062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Gabrieli JD, Ghosh SS, & Whitfield-Gabrieli S (2015). Prediction as a humanitarian and pragmatic contribution from human cognitive neuroscience. Neuron, 85(1), 11–26. doi: 10.1016/j.neuron.2014.10.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Garb HN (2003). Incremental validity and the assessment of psychopathology in adults. Psychological Assessment, 15(4), 508. doi: 10.1037/1040-3590.15.4.508. [DOI] [PubMed] [Google Scholar]
  31. Garber J, Clarke GN, Weersing VR, Beardslee WR, Brent DA, Gladstone TR, … & Shamseddeen W (2009). Prevention of depression in at-risk adolescents: a randomized controlled trial. JAMA, 301(21), 2215–2224. doi: 10.1001/jama.2009.788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Garber J, Korelitz K, & Samanez-Larkin S (2012). Translating basic psychopathology research to preventive interventions: a tribute to John R.Z. Abela. Journal of Clinical Child & Adolescent Psychology, 41(5), 666–681. doi: 10.1080/15374416.2012.710161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Garber J, & Rao U (2014). Depression in children and adolescents In: Lewis M & Rudolph K (Eds.), Handbook of Developmental Psychopathology (pp. 489–520). New York: Springer. [Google Scholar]
  34. Gibb BE, Beevers CG, & McGeary JE (2013). Toward an integration of cognitive and genetic models of risk for depression. Cognition & emotion, 27(2), 193–216. 10.1080/02699931.2012.712950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gibb BE, Grassia M, Stone L, Uhrlass D, & McGeary J (2011). Brooding rumination and risk for depressive disorders in children of depressed mothers. Journal of Abnormal Child Psychology, 40(2), 317–326. doi: 10.1007/s10802-011-9554-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Halligan SL, Murray L, Martins C, & Cooper PJ (2007). Maternal depression and psychiatric outcomes in adolescent offspring: a 13-year longitudinal study. Journal of Affective Disorders, 97(1), 145–154. doi: 10.1016/j.jad.2006.06.010. [DOI] [PubMed] [Google Scholar]
  37. Hammen C, & Brennan PA (2003). Severity, chronicity, and timing of maternal depression and risk for adolescent offspring diagnoses in a community sample. Archives of General Psychiatry, 60(3), 253–258. doi: 10.1001/archpsyc.60.3.253. [DOI] [PubMed] [Google Scholar]
  38. Hammen C, Brennan PA, & Keenan-Miller D (2008). Patterns of adolescent depression to age 20: The role of maternal depression and youth interpersonal dysfunction. Journal of Abnormal Child Psychology, 36(8), 1189–1198. doi: 10.1007/s10802-008-9241-9. [DOI] [PubMed] [Google Scholar]
  39. Hastings M, Krishnan S, Tangney J, & Stuewig J (2011). Predictive and incremental validity of the Violence Risk Appraisal Guide scores with male and female jail inmates. Psychological Assessment, 23(1), 174–183. doi: 10.1037/a0021290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hilden J & Gerds TA (2014). A note on the evaluation of novel biomarkers: Do not rely on integrated discrimination improvement and net reclassification index. Statistics in Medicine, 33(19), 3405–3414. [DOI] [PubMed] [Google Scholar]
  41. Hunsley J, & Meyer GJ (2003). The incremental validity of psychological testing and assessment: conceptual, methodological, and statistical issues. Psychological Assessment, 15(4), 446. doi: 10.1037/1040-3590.15.4.446. [DOI] [PubMed] [Google Scholar]
  42. Hunsley J, & Mash EJ (2007). Evidence-based assessment. Annual Review of Clinical Psychology, 3(1), 29–51. doi: 10.1146/annurev.clinpsy.3.022806.091419. [DOI] [PubMed] [Google Scholar]
  43. Insel T, Cuthbert B, Garvey M, Heinssen R, Pine DS, Quinn K, … & Wang P (2010). Research domain criteria (RDoC): toward a new classification framework for research on mental disorders. American Journal of Psychiatry, 167(7), 748–751. doi: 10.1176/appi.ajp.2010.09091379. [DOI] [PubMed] [Google Scholar]
  44. Johnson S, Hollis C, Marlow N, Simms V, & Wolke D (2014). Screening for childhood mental health disorders using the Strengths and Difficulties Questionnaire: the validity of multi-informant reports. Developmental Medicine & Child Neurology, 56(5), 453–459. doi: 10.1111/dmcn.12360. [DOI] [PubMed] [Google Scholar]
  45. Johnston C, & Murray C (2003). Incremental validity in the psychological assessment of children and adolescents. Psychological Assessment, 15(4), 496–507. doi: 10.1037/1040-3590.15.4.496. [DOI] [PubMed] [Google Scholar]
  46. Kaufman J, Birmaher B, Brent D, Rao UMA, Flynn C, Moreci P, … & Ryan N (1997). Schedule for affective disorders and schizophrenia for school-age children-present and lifetime version (K-SADS-PL): initial reliability and validity data. Journal of the American Academy of Child & Adolescent Psychiatry, 36(7), 980–988. doi: 10.1097/00004583-199707000-00021. [DOI] [PubMed] [Google Scholar]
  47. Klein DN, Dougherty LR, & Olino TM (2005). Toward guidelines for evidence-based assessment of depression in children and adolescents. Journal of Clinical Child and Adolescent Psychology, 34(3), 412–432. doi: 10.1207/s15374424jccp3403_3. [DOI] [PubMed] [Google Scholar]
  48. Koenigs M, & Grafman J (2009). The functional neuroanatomy of depression: distinct roles for ventromedial and dorsolateral prefrontal cortex. Behavioural Brain Research, 201(2), 239–243. doi: 10.1016/j.bbr.2009.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kovacs M (1992). Manual for the Children’s Depression Inventory. North Tonawanda, NJ: Multi-Health Systems. [Google Scholar]
  50. Kraemer HC (1992). Evaluating medical tests: Objective and quantitative guidelines. Newbury Park, CA: Sage Publications. [Google Scholar]
  51. Laeng B, Sirois S, & Gredeback G (2012). Pupillometry: A window to the preconscious? Perspectives on Psychological Science, 7(1), 18–27. doi: 10.1177/1745691611427305. [DOI] [PubMed] [Google Scholar]
  52. Lavigne JV, Feldman M, & Meyers KM (2016). Screening for mental health problems: addressing the base rate fallacy for a sustainable screening program in integrated primary care. Journal of Pediatric Psychology, 41(10), 1081–1090. doi: 10.1093/jpepsy/jsw048. [DOI] [PubMed] [Google Scholar]
  53. Leening MJ, Vedder MM, Witteman JC, Pencina MJ, & Steyerberg EW (2014). Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician’s guide. Annals of Internal Medicine, 160(2), 122–131. doi: 10.7326/M13-1522. [DOI] [PubMed] [Google Scholar]
  54. LeMoult J, Yoon KL, & Joormann J (2016). Rumination and cognitive distraction in major depressive disorder: An examination of respiratory sinus arrhythmia. Journal of Psychopathology and Behavioral Assessment, 38(1), 20–29. doi: 10.1007/s162-015-9510-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lewinsohn PM, Clarke GN, Seeley JR, & Rohde P (1994). Major depression in community adolescents: age at onset, episode duration, and time to recurrence. Journal of the American Academy of Child & Adolescent Psychiatry, 33(6), 809–818. doi: 10.1097/00004583-199407000-00006. [DOI] [PubMed] [Google Scholar]
  56. Lewinsohn PM, Rohde P, Seeley JR, Klein DN, & Gotlib IH (2000). Natural Course of Adolescent Major Depressive Disorder in a Community Sample: Predictors of Recurrence in Young Adults. American Journal of Psychiatry, 157(10), 1584–1591. doi: 10.1176/appi.ajp.157.10.1584. [DOI] [PubMed] [Google Scholar]
  57. Lobbestael J, Leurgans M, & Arntz A (2011). Inter-rater reliability of the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID I) and Axis II Disorders (SCID II). Clinical Psychology & Psychotherapy, 18(1), 75–79. doi: 10.1002/cpp.693. [DOI] [PubMed] [Google Scholar]
  58. Luntamo T, Sourander A, Santalahti P, Aromaa M, & Helenius H (2012). Prevalence changes of pain, sleep problems and fatigue among 8-year-old children: Years 1989, 1999, and 2005. Journal of Pediatric Psychology, 37(3). doi: 10.1093/jpepsy/jsr091. [DOI] [PubMed] [Google Scholar]
  59. Martel M, Markon K, & Smith G (2017). Research Review: Multi-informant integration in child and adolescent psychopathology diagnosis. Journal of Child Psychology and Psychiatry, 58(2), 116–128. doi: 10.1111/jcpp.12611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Matsumoto D, & Ekman P (1988). Japanese and Caucasian facial expressions of emotion (JACFEE) [Slides]. San Francisco, CA: Intercultural and Emotion Research Laboratory, Department of Psychology, San Francisco State University. [Google Scholar]
  61. Matthey S, & Petrovski P (2002). The Children’s Depression Inventory: error in cutoff scores for screening purposes. Psychological Assessment, 14(2), 146–149. doi: 10.1037/1040-3590.14.2.146. [DOI] [PubMed] [Google Scholar]
  62. Monroe S, & Harkness K (2011). Recurrence in major depression: A conceptual analysis. Psychological Review, 118(4), 655–674. doi: 10.1037/a0025190. [DOI] [PubMed] [Google Scholar]
  63. Muris P, Fokke M, & Kwik D (2009). The ruminative response style in adolescents: An examination of its specific link to symptoms of depression. Cognitive Therapy and Research, 33(1), 21–32. doi: 10.1007/s10608-007-9120-7. [DOI] [Google Scholar]
  64. Murray L, Arteche A, Fearon P, Halligan S, Goodyer I, & Cooper P (2011). Maternal postnatal depression and the development of depression in offspring up to 16 years of age. Journal of the American Academy of Child & Adolescent Psychiatry, 50(5), 460–470. doi: 10.1016/j.jaac.2011.02.001. [DOI] [PubMed] [Google Scholar]
  65. Myers K, & Winters NC (2002). Ten-Year Review of Rating Scales. II: scales for internalizing disorders. Journal of the American Academy of Child & Adolescent Psychiatry, 41(6), 634–659. doi: 10.1097/00004583-200206000-00004. [DOI] [PubMed] [Google Scholar]
  66. Nejad AB, Fossati P, & Lemogne C (2012). Self-referential processing, rumination, and cortical midline structures in major depression. Frontiers in Human Neuroscience, 7, 666. doi: 10.3389/fnhum.2013.00666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Nowak W, Żarowska A, Szul-Pietrzak E, & Misiuk-Hojło M (2014). System and measurement method for binocular pupillometry to study pupil sizevariability. Biomedical Engineering Online, 13(1), 69. doi: 10.1186/1475-925X-13-69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Obuchowski NA, & Lieber ML (1998). Confidence intervals for the receiver operating characteristic area in studies with small samples. Academic Radiology, 5(8), 561–571. doi: 10.1016/S1076-6332(98)80208-0. [DOI] [PubMed] [Google Scholar]
  69. Ong AD, Bergeman CS, Bisconti TL, & Wallace KA (2006). Psychological resilience, positive emotions, and successful adaptation to stress in later life. Journal of Personality and Social Psychology, 91(4), 730–749. doi: 10.1037/0022-3514.91.4.730. [DOI] [PubMed] [Google Scholar]
  70. Orue I, Calvete E, & Padilla P (2014). Brooding rumination as a mediator in the relation between early maladaptive schemas and symptoms of depression and social anxiety in adolescents. Journal of Adolescence, 8(37), 1281–1291. doi: 10.1016/j.adolescence.2014.09.004. [DOI] [PubMed] [Google Scholar]
  71. Pearson RM, Evans J, Kounali D, Lewis G, Heron J, Ramchandani PG, … & Stein A (2013). Maternal depression during pregnancy and the postnatal period: risks and possible mechanisms for offspring depression at age 18 years. JAMA Psychiatry, 70(12), 1312–1319. doi: 10.1001/jamapsychiatry.2013.2163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Pencina MJ, D’Agostino RB, & Steyerberg EW (2011). Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers. Statistics in Medicine, 30(1), 11–21. doi: 10.1002/sim.4085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Pencina MJ, D’Agostino RB, Pencina KM, Janssens AC, & Greenland P (2012). Interpreting incremental value of markers added to risk prediction models. American Journal of Epidemiology, 176(6), 473–481. doi: 10.1093/aje/kws207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Poznanski EO, & Mokros HB (1996). Children’s Depression Rating Scale, Revised (CDRSR). Los Angeles: Western Psychological Services. [Google Scholar]
  75. Price RB, Rosen D, Siegle GJ, Ladouceur CD, Tang K, Allen KB, … & Silk JS (2016). From anxious youth to depressed adolescents: Prospective prediction of 2-year depression symptoms via attentional bias measures. Journal of Abnormal Psychology, 125, 267–278. 10.1037/abn0000127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Propper CB, & Holochwost SJ (2013). The influence of proximal risk on the early development of the autonomic nervous system. Developmental Review, 33(3), 151–167. 10.1016/j.dr.2013.05.001. [DOI] [Google Scholar]
  77. Rawal A, Collishaw S, Thapar A, & Rice F (2013). A direct method of assessing underlying cognitive risk for adolescent depression. Journal of Abnormal Child Psychology, 41, 1279–1288. doi: 10.1007/s10802-013-9760-x. [DOI] [PubMed] [Google Scholar]
  78. Rice ME, & Harris GT (2005). Comparing effect sizes in follow-up studies: ROC Area, Cohen’s d, and r. Law and Human Behavior, 29(5), 615–620. doi: 10.1007/s10979-005-6832-7. [DOI] [PubMed] [Google Scholar]
  79. Rice F, & Rawal A (2011). Can basic risk research help in the prevention of childhood and adolescent depression? Examining a cognitive and emotional regulation approach. Depression Research and Treatment, 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Richardson LP, McCauley E, Grossman DC, McCarty CA, Richards J, Russo JE, … & Katon W (2010). Evaluation of the Patient Health Questionnaire-9 Item for detecting major depression among adolescents. Pediatrics, 126(6), 1117–1123. doi: 10.1542/peds.2010-0852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Seeley J, Stice E, & Rohde P (2009). Screening for depression prevention: Identifying adolescent girls at high risk for future depression. Journal of Abnormal Psychology, 118(1), 161–170. doi: 10.1037/a0014741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Sheldrick RC, Benneyan JC, Kiss IG, Briggs-Gowan MJ, Copeland W, & Carter AS (2015). Thresholds and accuracy in screening tools for early detection of psychopathology. Journal of Child Psychology and Psychiatry, 56(9), 936–948. doi: 10.1111/jcpp.12442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Sheppard LC, & Teasdale JD (2004). How does dysfunctional thinking decrease during recovery from major depression?. Journal of Abnormal Psychology, 113(1), 64–71. doi: 10.1037/0021-843X.113.1.64. [DOI] [PubMed] [Google Scholar]
  84. Siegle GJ, Steinhauer SR, Carter CS, Ramel W, & Thase ME (2003). Do the seconds turn into hours? Relationships between sustained pupil dilation in response to emotional information and self-reported rumination. Cognitive Therapy and Research, 27(3), 365–382. doi: 10.1023/A:1023974602357. [DOI] [Google Scholar]
  85. Siegle GJ, Ichikawa N, & Steinhauer SR (2008). Blink before and after you think: blinks occur prior to and following cognitive load indexed by pupillary responses. Psychophysiology, 45(5), 679–687. doi: 10.1111/j.1469-8986.2008.00681.x. [DOI] [PubMed] [Google Scholar]
  86. Siegle GJ, Steinhauer SR, Friedman ES, Thompson WS, & Thase ME (2011). Remission prognosis for cognitive therapy for recurrent depression using the pupil: utility and neural correlates. Biological Psychiatry, 69(8), 726–733. doi: 10.1016/j.biopsych.2010.12.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Silk JS, Siegle GJ, Whalen DJ, Ostapenko LJ, Ladouceur CD, & Dahl RE (2009). Pubertal changes in emotional information processing: Pupillary, behavioral, and subjective evidence during emotional word identification. Development and psychopathology, 21(1), 7–26. 10.1017/S0954579409000029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Siu AL, Bibbins-Domingo K, Grossman DC, Baumann LC, Davidson KW, Ebell M, … & Krist AH (2016). Screening for depression in adults: US Preventive Services Task Force recommendation statement. JAMA, 315(4), 380–387. [DOI] [PubMed] [Google Scholar]
  89. Spitzer RL, Endicott J, & Robins E (1978). Research Diagnostic Criteria: Rationale and Reliability. Archives of General Psychiatry, 35(6), 773–782. [DOI] [PubMed] [Google Scholar]
  90. Storch EA, Roberti JW, & Roth DA (2004). Factor structure, concurrent validity, and internal consistency of the Beck Depression Inventory—Second Edition in a sample of college students. Depression and Anxiety, 19(3), 187–189. doi: 10.1002/da.20002. [DOI] [PubMed] [Google Scholar]
  91. Straus SE, Richardson WS, Glasziou P, & Haynes RB (2011). Acquiring the Evidence: How to find current best evidence and have current best evidence find us Evidence-based medicine: how to practice and teach EBM. 4th ed. London: Elsevier. [Google Scholar]
  92. Swets JA (1988). Measuring the accuracy of diagnostic systems. Science, 240(4857), 1285–1293. doi: 10.1126/science.3287615. [DOI] [PubMed] [Google Scholar]
  93. Teasdale JD, Moore RG, Hayhurst H, Pope M, Williams S, & Segal ZV (2002). Metacognitive awareness and prevention of relapse in depression: Empirical evidence. Journal of Consulting and Clinical Psychology, 70(2), 275–287. doi: 10.1037/0022-006X.70.2.275. [DOI] [PubMed] [Google Scholar]
  94. Wang TJ, Gona P, Larson MG, Tofler GH, Levy D, Newton-Cheh C, … & Benjamin EJ (2006). Multiple biomarkers for the prediction of first major cardiovascular events and death. New England Journal of Medicine, 355(25), 2631–2639. doi: 10.1056/NEJMoa055373. [DOI] [PubMed] [Google Scholar]
  95. Wissow LS, Brown J, Fothergill KE, Gadomski A, Hacker K, Salmon P, & Zelkowitz R (2013). Universal mental health screening in pediatric primary care: a systematic review. Journal of the American Academy of Child & Adolescent Psychiatry, 52(11), 1134–1147. doi: 10.1016/j.jaac.2013.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Yee AM, Algorta GP, Youngstrom EA, Findling RL, Birmaher B, Fristad MA, & LAMS Group. (2015). Unfiltered Administration of the YMRS and CDRS-R in a Clinical Sample of Children. Journal of Clinical Child & Adolescent Psychology, 44(6), 992–1007. 10.1080/15374416.2014.915548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Young CC, & Dietrich MS (2015). Stressful life events, worry, and rumination predict depressive and anxiety symptoms in young adolescents. Journal of Child and Adolescent Psychiatric Nursing, 28(1), 35–42. doi: 10.1111/jcap.12102. [DOI] [PubMed] [Google Scholar]
  98. Youngstrom EA (2014). A primer on receiver operating characteristic analysis and diagnostic efficiency statistics for pediatric psychology: We are ready to ROC. Journal of Pediatric Psychology, 39(2), 204–221. doi: 10.1093/jpepsy/jst062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Youngstrom EA, & De Los Reyes A (2015). Commentary: Moving toward cost-effectiveness in using psychophysiological measures in clinical assessment: Validity, decision making, and adding value. Journal of Clinical Child & Adolescent Psychology, 44(2), 352–361. [DOI] [PubMed] [Google Scholar]
  100. Youngstrom EA, Choukas-Bradley S, Calhoun CD, & Jensen-Doss A (2015). Clinical guide to the evidence-based assessment approach to diagnosis and treatment. Cognitive and Behavioral Practice, 22(1), 20–35. doi: 10.1016/j.cbpra.2013.12.005. [DOI] [Google Scholar]
  101. Youngstrom EA, Halverson TF, Youngstrom JK, Lindhiem O, & Findling RL (2018). Evidence-Based Assessment From Simple Clinical Judgments to Statistical Learning: Evaluating a Range of Options Using Pediatric Bipolar Disorder as a Diagnostic Challenge. Clinical Psychological Science, 2167702617741845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Youngstrom EA, Van Meter A, Frazier TW, Hunsley J, Prinstein MJ, Ong ML, & Youngstrom JK (2017). Evidence-based assessment as an integrative model for applying psychological science to guide the voyage of treatment. Clinical Psychology: Science and Practice. doi: 10.1111/cpsp.12207. [DOI] [Google Scholar]
  103. Ziegert DI, & Kistner JA (2002). Response styles theory: Downward extension to children. Journal of Clinical Child and Adolescent Psychology, 31(3), 325–334. doi: 10.1207/S15374424JCCP3103_04. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES