Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Aug 1.
Published in final edited form as: J Am Acad Child Adolesc Psychiatry. 2015 Jun 14;54(8):626–634. doi: 10.1016/j.jaac.2015.05.016

Clinical Implications of a Dimensional Approach: The Normal:Abnormal Spectrum of Early Irritability

Lauren S Wakschlag 1, Ryne Estabrook 2, Amelie Petitclerc 3, David Henry 4, James L Burns 5, Susan B Perlman 6, Joel L Voss 7, Daniel S Pine 8, Ellen Leibenluft 9, Margaret Briggs-Gowan 10
PMCID: PMC4515952  NIHMSID: NIHMS707436  PMID: 26210331

Abstract

Objective

The importance of dimensional approaches is widely recognized, but an empirical base for clinical application is lacking. This is particularly true for irritability, a dimensional phenotype that cuts across many areas of psychopathology and manifests early in life. We examine longitudinal, dimensional patterns of irritability and their clinical import in early childhood.

Method

Irritability was assessed longitudinally over an average of 16 months in a clinically enriched diverse community sample of preschoolers (N=497; M=4.2 years; SD=0.8). Using the Temper Loss scale of the Multidimensional Assessment Profile of Disruptive Behavior (MAP-DB) as a developmentally sensitive indicator of early childhood irritability, we examined its convergent/divergent, clinical and incremental predictive validity, and modeled its linear and nonlinear associations with clinical risk.

Results

The Temper Loss scale demonstrated convergent and divergent validity to child and maternal factors. In multivariate analyses, Temper Loss predicted mood (separation anxiety disorder [SAD], generalized anxiety disorder [GAD], and depression/dysthymia) and disruptive (oppositional defiant disorder [ODD], attention-deficit/hyperactivity disorder [ADHD], and conduct disorder [CD]) symptoms. Preschoolers with even mildly elevated Temper Loss scale scores showed substantially increased risk of symptoms and disorders. For ODD, GAD, SAD, and depression, increases in Temper Loss scale scores at the higher end of the dimension had a greater impact on symptoms relative to increases at the lower end. Temper Loss scale scores also showed incremental validity over DSM-IV disorders in predicting subsequent impairment. Finally, accounting for the substantial heterogeneity in longitudinal patterns of Temper Loss significantly improved prediction of mood and disruptive symptoms.

Conclusion

Dimensional, longitudinal characterization of irritability informs clinical prediction. A vital next step will be empirically generating parameters for incorporation of dimensional information into clinical decision-making with reasonable certainty.

Keywords: irritability, dimensional, developmental psychopathology, normal:abnormal spectrum, longitudinal modeling

INTRODUCTION

Irritability is present in diverse forms of mental illness.13 Prior research has generally focused on extreme irritability. However, because irritability falls along a spectrum and is an early-life precursor to psychopathology,4,5 dimensional, developmentally specified approaches are needed. Here, we characterize the normal:abnormal spectrum of irritability in early childhood using developmentally informed quantitative methods. Specifically, we: (I) model how progression along the dimensional spectrum of irritability relates to subsequent clinical risk and impairment; and (II) characterize the variability of irritability over time and test the value of this longitudinal variation for prediction.

Early identification of abnormal irritability would be of great value for the prevention of mental health disorders. However, irritable behavior is normative in early childhood, and its clinical significance varies based on its context, modulation, and pervasiveness.57 Recent work lays the foundation for making such normal to abnormal differentiations in early childhood.8,9 For example, we have defined a developmentally based irritability spectrum using the Temper Loss scale of the Multidimensional Assessment Profile of Disruptive Behavior (MAP-DB) questionnaire (this scale was originally titled the “Multidimensional Assessment of Preschool Disruptive Behavior” but has since been renamed to reflect its use and validation across a broader age range) in a prior unselected sample.10 This psychometric work lays the foundation for the present clinical validation study.

Currently, empirical approaches for extracting clinically useful information from a dimensionally defined irritability spectrum are underdeveloped. Dimensions are based on the assumption that risk cannot be defined by a single, extreme threshold but instead manifests probabilistically.4 Thus, a dimensional approach may enhance developmental sensitivity to prodromal phases of risk. Dimensional risk may increase linearly or nonlinearly, with different implications for clinical decision-making. Little is also known about the clinical informativeness of longitudinal variation in dimensional patterns. This is of particular importance in early childhood, when the capacity for self-control improves dramatically across relatively short time intervals.11,12 Dimensional, longitudinal approaches also hold promise for elucidating the substantial heterogeneity in outcome amongst young children exhibiting early high irritability (i.e. which early irritable children will go on to develop clinical problems and which will not). For example, recent trajectory modeling suggests that more than 25% of young children with high early irritability develop normally when followed longitudinally (Hawes S, Perlman S, Byrd A, Raine A, Loeber R, Pardini D, unpublished data, 2014).13

Here we draw on a large, clinically enriched sample of preschoolers to establish the validity of the dimensional Temper Loss scale for clinical prediction and explicate the shape of its relation to clinical outcomes. Goals of the paper are to:

  1. Establish the validity of the Temper Loss scale, including (a) convergent/divergent; (b) clinical; and (c) incremental validity; and

  2. Characterize the short-term longitudinal variation in Temper Loss scale score and test the incremental validity of this variation for clinical prediction.

METHOD

Participants

Multidimensional Assessment of Preschoolers (MAPS) Study includes a large, diverse sample of preschoolers recruited from the waiting rooms of multiple pediatric clinics in a large US urban area. This unselected sample (N=1,857) was seen only at baseline and is the sample on which the psychometric modeling of the Temper Loss scale is based.14 The primary analytic sample for the present study is an intensive subsample of this MAPS pediatric cohort (n=497), which was clinically enriched by oversampling for child disruptive behavior and parental intimate partner violence. The mean age of the sample at baseline was 4.2 years (T0: Mean =4.2 yrs. [range 2.9–6.0 yrs.]; T1: M=4.8 yrs. [3.1–7.7 yrs.]; T2: M=5.54 [3.8–8.5]). Approximately half of the sample were boys and were living in poverty. Participants were predominantly African-American, Hispanic, and Non-Hispanic White. (For additional sample details, see Supplement 1 and Table S1, available online, and prior published descriptions.10) All clinical validity analyses employed sampling weights that accounted for both unequal probabilities of selection and differential non-response rates in this subsample.

Procedures

Procedures were approved by institutional review boards, and parental informed consent was obtained. The clinical subsample participated in three longitudinal assessments over an average period of 15.8 months (SD=5.7 months; for overview, see Figure S1, available online). At baseline (T0), mothers completed the Temper Loss scale. At T1 (~ 6 months later), they took part in an intensive clinical and neurocognitive assessment. At T1, 80% also completed the Temper Loss scale again (20% were missing because the MAP-DB was added to the T1 assessment after this phase was underway). At T2 (~9 months later), participants completed the Temper Loss scale and survey measures of clinical symptoms and impairment (94% response rate).

Measures

Irritability was assessed via the MAP-DB Temper Loss scale at T0, T1, and T2. The Temper Loss scale measures key features of irritability including mood and tantrums.15 The 22 Temper Loss scale items capture variations in quality, intensity, and context along an objective frequency scale (ranging from never during the past month to many times each day). There were no significant differences in the structure of the Temper Loss scale from the prior independent sample16 based on differential item function (DIF) estimations using a weighted least squares approach (Χ2[109]=128.95, p = .09). Confirmatory factor analyses also indicated a unidimensional factor (Comparative Fit Index [CFI]=0.96,17 Tucker Lewis Index [TLI]=0.95,18 and Root Mean Square Error of Approximation [RMSEA]=0.09).19 Unidimensionality was evident across the 3 time points (α range = .96–.98). Scores were derived using item response theory (IRT).20 IRT is useful for dimensional modeling because it maps the locations of both items and respondents along an underlying latent continuum, scaled from mild, commonly occurring behaviors to severe, rarely occurring behaviors. Baseline (T0) Temper Loss was used as the primary predictor of all T1 and T2 outcomes.

Correlates

Convergent and divergent validity measures were derived from T1. Convergent validity was assessed in relation to two survey and one neurocognitive measure. Survey measures: We used (a) a composite child Irritability Symptom Index ( α=.73; index is detailed in 9), derived from the seven irritability symptoms of the conduct and depression sections of the Preschool Age Psychiatric Assessment (PAPA)21 (e.g., “is easily frustrated,” “has tantrums”); and (b) maternal irritability assessed with the Patient-Reported Outcomes Measurement Information System (PROMIS) Anger scale (α=.93).22 Neurocognitive measure: (c) A developmentally sensitive test of response reversal, the “Candy Game” task,23 was employed. Response reversal deficits have been demonstrated in older youth with clinical levels of irritability,24 and we have shown that performance on the Candy Game is associated with prefrontal cortex function at preschool age in a small subsample of the MAPS cohort (n=28) (Demir OE, Voss J, O’Neill J, Briggs-Gowan M, Wakschlag L, Booth J, unpublished data). The Candy Game is a computerized task in which two sets of boxes are presented. One box is designated as “winning,” and one box is designated as “losing” in each set. This designation is reversed in the second half of the trials. Learning occurs via trial and error. Of the 397 children who participated in the neurocognitive assessment, 76% (n=302) successfully completed the task. (Those children who completed less than 50 trials or had less than 50% accuracy were excluded; see Supplement 1 and Table S2, available online, for breakdown of task completion.) Preschoolers with and without complete Candy Game data did not differ on level of temper loss (t =−.15, not significant). For the present analyses, reversal learning was computed as post-switch minus pre-switch percent accuracy, indicating the decrement in children’s performance after the rule switch (M pre-switch accuracy=85% [SD=22.2%]; M post-switch accuracy=79% [SD=25.8]).

Divergent validity was assessed in relation to maternal report of children’s self-regulation, including the Self Control scale of the Social Skills Improvement System (SSIS),25 a 10-item scale measuring the child’s capacity to regulate emotions and behavior (α=.84), and the Initiative Scale of the Devereux Early Childhood Assessment (DECA),26 an 11-item scale measuring problem-solving and persistence (α=.88).

Covariates

Models controlled for: (a) child age and sex; (b) poverty derived from an income-to-needs ratio; (c) use of harsh physical discipline via the 8-item Parent-Child Physical Assault Scale of the Conflict Tactics Scale (CTS) (α=.65);26 (d) Aggressive Behavior assessed with the MAP-DB Aggression Scale. This 25-item scale encompasses mild to severe aggressive behaviors (α=.95).10 In addition, the model predicting Candy Game performance controlled for child nonverbal reasoning, assessed via the Picture Similarities Scale of the Differential Abilities Scales (DAS).27

Clinical outcomes

Clinical symptoms of DSM-IV disruptive (ODD, ADHD, and CD) and mood (SAD, GAD and depression/dysthymia) disorders were assessed at T1 and T2. At T1, these symptom counts were derived from the Preschool Age Psychiatric Assessment (PAPA). We also created a composite categorical indicator across these disorders, i.e., meets full criteria for any of these mood or disruptive disorders from T1 PAPA data. PAPA inter-rater reliability was monitored for 20% of all interviews (with 83% to 100% agreement on symptom scores). At T2, symptom counts were derived from the Stonybrook Early Childhood Inventory (ECI).28 There were high rates of psychopathology in this enriched sample, and all disruptive and mood disorders were associated with higher Temper Loss scores in bivariate analyses (Table 1).

Table 1.

Differences in Temper Loss Scorea by DSM-IV Disorder

Disorder Prevalence (%)b Mean (SE) Significance

Meets Criteria Does not Meet Criteria
Disruptive Disorders
 ODD 14.24 1.00 (0.11) −.07 (0.05) t(394)=8.32****
 CD 5.15 .89 (0.28) .04 (0.05) t(393)=2.91**
 ADHD 6.18 .90 (0.22) .02 (0.05) t(390)=3.80***
 Any Disruptive Disorder 17.94 .87 (0.11) −.10 (0.05) t(389)=7.81****
Mood Disorders
 GAD 21.22 .63 (0.10) −.07 (0.06) t(378)=5.85****
 SAD 10.55 .64 (0.20) .01 (0.05) t(384)=2.96**
 Depressive Disordersc 2.41 1.23 (0.38) .06 (0.05) t(399)=3.06**
 Any Mood Disorder 26.59 .53 (0.10) −.09 (0.06) t(377)=5.09****
Both Disruptive and Mood Disorders 33.88 .56 (0.08) −.17 (0.06) T(377)=6.66****

Note: ADHD = attention-deficit/hyperactivity disorder; CD = conduct disorder; GAD = generalized anxiety disorder; ODD = oppositional defiant disorder; SAD = separation anxiety disorder.

a

Calibrated item response theory scores (Mean=0, SD=1).

b

Adjusted for clinical enrichment via sampling and response weights.

c

Combines depression and dysthymia.

*

p<.05

**

p<.01

***

p<.001

****

p<.0001

Impairment outcomes

The Family Life Impairment Scale (FLIS; Briggs-Gowan M, Horowitz S, Carter A. The Family Life Impairment Scale [unpublished rating scale]. New Haven CT: Yale University; 1997) was administered at T1 and T2 to assess the extent to which children’s emotions and behavior interfered with daily functioning, family functioning, and functioning at preschool/daycare on a 3-point scale (M α=.75, range=.63–.81).

Analytic Plan Overview

Convergent and divergent validity analyses were conducted via multiple regressions controlling for child age, sex, race/ethnicity, and poverty status. The task-based model also controlled for non-verbal reasoning, task version, and pre-switch accuracy. Predictive clinical and incremental validity models added control for harsh physical discipline, child aggression, and the time interval between assessments. Incremental analyses also controlled for having any mood or disruptive disorders at T1. Predictive clinical and incremental validity analyses tested the shape (linear and quadratic) of the relation of T0 temper loss in relation to symptoms and impairment at subsequent time points (T1 and T2). A linear association indicates that increases in the clinical outcome occur equally across the full spectrum of Temper Loss scale scores, while a quadratic effect indicates that the association increases or decreases as Temper Loss scores increase. Linear and curvilinear patterns are tested simultaneously. When both are present, the linear pattern must be interpreted as conditional on the curvilinear. In the incremental analyses predicting impairment, T1 mood and disruptive disorders were controlled. We utilized structural equation models (SEM) to characterize longitudinal variability in temper loss across T0–T2, and to test its incremental clinical validity. These longitudinal incremental analyses predicted T2 symptoms from T0–T1 temper loss. Ns varied somewhat across analyses due to differential missing data across waves.

RESULTS

Establish the validity of the Temper Loss scale

Convergent/divergent validity

Temper loss was associated in expected directions with convergent and divergent measures. Temper loss correlated with preschoolers’ composite irritability symptoms scores (β=.52, p<.0001) and maternal PROMIS anger scores (β=.37, p<.0001). It was also associated with poorer response reversal on the Candy Game (β −.11, p<.05). Specifically, higher Temper Loss scores were associated with greater decrements in Candy Game performance after the rule switch, signifying poorer capacity to flexibly shift behavior in response to contextual cues. In contrast, Temper Loss was negatively associated with indicators of self-regulation and competence, i.e., the SSIS Self-Control scale (β =−.32, p<.0001) and the DECA Initiative Scale (β=−.20, p<.01).

Clinical validity

In multivariate models, Temper Loss scale scores added significant variance to prediction of mood and disruptive symptoms. Temper Loss scale scores predicted ODD, ADHD, and SAD symptoms at both T1 and T2, and GAD and depression at T1. As shown in Table 2a, the shape of the relationship between temper loss and symptoms varied by type of symptoms (for full models, see Table S3a, available online). Specifically, for T1 outcomes, there were significant curvilinear patterns for ODD, depression, and SAD symptoms, whereas ADHD and GAD symptoms showed a linear association with temper loss (Figure 1). Figure 1a illustrates the quadratic effect for ODD symptoms; increases in ODD symptoms accelerated as scores increased on the Temper Loss scale. In contrast, ADHD symptoms increased relatively evenly regardless of Temper Loss values (Figure 1b). While Temper Loss scale scores did not predict CD symptoms, aggression was highly predictive (β =.34, p<.0001). In addition to Temper Loss, the most consistent predictors of mood and disruptive symptoms were demographic risks (i.e. minority race/ethnicity and poverty status). Temper Loss scale scores predicted T2 ODD, ADHD and SAD symptoms.

Table 2.

Clinical and Incremental Validity of Temper Loss: Predicting DSM-IV Symptoms (T1–T2) and Disorders (T1) and Impairment (T1–T2) from T0 Temper Lossa

Clinical Validity Models
T1 Prediction Estimate (SE)[β] T2 Prediction Estimate (SE)[β]
ODD Symptoms
 T0 Temper Loss (linear) .56 (0.15) [.29]*** .14 (0.07) [.12]*
 T0 Temper Loss (quadratic) .19 (0.07) [.14]** .14 (0.07) [.17]*
CD Symptoms
 T0 Temper Loss (linear) −.03 (0.09) [−.03] .03 (0.04) [.04]
 T0 Temper Loss (quadratic) .09 (0.06) [.12] .11 (0.06) [.21]
ADHD Symptoms
 T0 Temper Loss (linear) .73 (0.34) [.17]* .52 (0.23) [.15]*
 T0 Temper Loss (quadratic) .23 (0.18) [.08] .29 (0.16) [.12]
SAD Symptoms
 T0 Temper Loss (linear) .31 (0.16) [.21] .12 (0.14) [.14]*
 T0 Temper Loss (quadratic) .17 (0.07) [.16]* .14 (0.23) [.23]
GAD Symptoms
 T0 Temper Loss (linear) .47 (0.13) [.32]*** .06 (0.06) [.06]
 T0 Temper Loss (quadratic) .13 (0.07) [.13] .16 (0.08) [.22]
Depression/Dysthymia Symptoms
 T0 Temper Loss (linear) .27 (0.11) [.19]* .13 (0.08) [.11]
 T0 Temper Loss (quadratic) .19 (0.06) [.19]*** .14 (0.23) [.17]
Any Mood/Disruptive Disorder
 T0 Temper Loss (linear) 1.20 (0.31) [1.05]*** N/A

Incremental Validity Models
Functional Impairment
 T0 Temper Loss (linear) .05 (0.04) [.13] −.01 (0.02) [−.03]
 T0 Temper Loss (quadratic) .07 (0.03) [.27]** .02 (0.01) [.16]**
Impairment in Preschool/Day Care
 T0 Temper Loss (linear) .00 (0.02) [.02] −.05 (0.02) [−.22]*
 T0 Temper Loss (quadratic) .01 (0.01) [.09] .02 (0.01) [.08]
Impairment in Family Context
 T0 Temper Loss (linear) .00 (0.04) [.00] .00 (0.02) [.00]
 T0 Temper Loss (quadratic) .06 (0.03) [.23]* .01 (0.01) [.08]

Note: ADHD = attention-deficit/hyperactivity disorder; CD = conduct disorder; GAD = general anxiety disorder; N/A = not available; ODD = oppositional defiant disorder; SAD = separation anxiety disorder.

a

Symptom and impairment models derived from hierarchical linear regression and DSM disorder model derived from logistic regression model. All models controlled for child age, sex, race/ethnicity, poverty status, harsh physical discipline, child aggression, and time lag between visits. Incremental models predicting impairment also controlled for any mood or disruptive disorder (for full models, see Table S3, available online). T1–T2 symptom model ns ranged from 370–446, based on missing data. Impairment in preschool/day care ns were smaller due to some children not being in an out-of-home setting (320, 297 respectively).

*

p<.05

**

p<.01

***

p<.001

****

p<.0001

Figure 1.

Figure 1

Relation of Temper Loss dimensional location to clinical symptoms. 1a shows oppositional defiant disorder (ODD) symptoms. 1b shows attention-deficit/hyperactivity disorder (ADHD) symptoms. Note: IRT = item response theory.

Next, we also examined the shape of the relation between Temper Loss scores and the composite DSM-IV mood and disruptive disorders outcome at T1 (Table 2a). The probability of DSM-IV disorders increased substantially as Temper Loss scores increased above the mean (Figure 2). Importantly, this increase in risk for subsequent DSM-IV disorders even occurred across levels of Temper Loss that are considered to be “normative.” For example, the probability of having a DSM-IV mood or disruptive disorder at T1 was 67% for children who were 1 SD from the T0 Temper Loss population mean, a level typically viewed at the upper bound of normal. This risk increased linearly across the dimension, with those who fell 2 SDs above the mean at T0 having an 83% probability of a T1 DSM-IV disorder.

Figure 2.

Figure 2

Relation of Temper Loss dimensional location to probability of DSM disorders. Note: Numbers below the red line indicate percentage of children at each value of Temper Loss predicted to have a DSM-IV mood or disruptive disorder, based on Temper Loss score alone. Percentages above the red line indicate the predicted percentage for every value of T0 Temper Loss, controlling for all covariates. IRT = item response theory.

Incremental validity

T0 Temper Loss scores predicted higher FLIS impairment in all domains, above and beyond the presence of DSM disorders. The specific shape of the relation between Temper Loss scores and impairment varied by domain and time point (Table 2b; see Table S3b, available online, for full models). For example, child functional impairment demonstrated a curvilinear relationship to T0 Temper Loss scores at both T1 and T2.

Patterns of longitudinal variation

Across the three time points, short-term longitudinal correlations of Temper Loss scores were approximately .70 (r_T0–T1= .70, r_T0–T2=.69, r_T1–T2=.71). Although this is typically considered “very stable,” we underscore that this coefficient indicates that only about half (.702=.49) of the variance in Temper Loss at any measurement occasion is shared by Temper Loss at a previous occasion. Thus, approximately 50% of the variance in Temper Loss scores may be due to development, measurement error, or other factors.

Longitudinal patterns for each individual child with data on at least two time points (n=493) show considerable between- and within-subject variability around a small downward trend (0.2/SD per year), with a drop in variance at older ages (Figure 3a). Figure 3b shows the range (maximum–minimum score) for all children. Approximately one-third (n=160, 32.5%) of preschoolers have a set of scores that change more than 1 population SD across consecutive time points. (Note that this reflects magnitude, not direction, of change.) Much of that change spans common thresholds for determining abnormality. For example, 6.5% (n=32) of the sample exhibited temper loss levels >2 SD above the mean at least once, but nearly ¾ of those children (23/32) were above this threshold at only 1 of 3 measurement occasions.

Figure 3.

Figure 3

Longitudinal variation in Temper Loss. 3a shows pattern of intra-individual change in Temper Loss by child age. 3b shows variation in magnitude of intra-individual change in Temper Loss.

Contribution of longitudinal variation to clinical prediction

To test improvement in prediction when two, rather than a single, time points of Temper Loss scale were used, we tested the variance explained with only T0 temper loss in the model (as presented in Table 2a), vs. only T1 temper loss, vs. both. In multivariate models, adding T1 temper loss to the models had significant effects on ODD, CD, and ADHD symptoms, above and beyond T0 effects (Table 3; for full model parameters, see Table S4, available online). Additionally, T0 and T1 temper loss did not predict T2 GAD and SAD symptoms when considered individually, but were predictive when considered simultaneously. These findings indicate that accounting for change over time may enhance clinical prediction, perhaps by identifying periods with stronger effects and/or finding effects that cannot be detected at a single occasion because change per se is predictive.

Table 3.

Incremental Validity of Longitudinal Temper Loss Assessments (T0–T1) for Predicting T2 Symptoms and Impairment

T0 Temper Loss (F) T1 Temper Loss (F) T0 and T1 Temper Loss (F)
T2 Clinical Symptoms
 ODD 0.51 9.27*** 9.80****
 CD 0.99 3.40* 2.90*
 ADHD 0.87 3.37* 5.82***
Depression/Dysthymia 1.35 1.75 1.36
 GAD 1.89 2.53 2.70*
 SAD 1.94 2.10 2.81*

Note: From three independent tables controlling for child age, sex, race/ethnicity, poverty status, harsh physical discipline, child aggression, and time lag between visits. F values reflect a Wald test with 2, 2, and 4 numerator degrees of freedom for tests of total T0, T1, and joint T0 and T1 tests, respectively. These Wald tests assess the total effect of Temper Loss (linear and quadratic) at any given time point (or set of time points), controlling for all other variables. For the T0 and T1 tests, each of these Temper Loss time points is also controlled. ADHD = attention-deficit/hyperactivity disorder; CD = conduct disorder; GAD = generalized anxiety disorder; ODD = oppositional defiant disorder; SAD = separation anxiety disorder.

*

p<.05

**

p<.01

***

p<.001

****

p<.0001

DISCUSSION

Dimensional measurement of irritability has added value for short-term longitudinal prediction of clinical outcomes and impairment in early childhood and is associated with theorized correlates. It is evident from these data that clinical risk is not an “either/or” phenomenon, and that children well below traditional clinical cut-points are at substantial clinical risk. Extreme (and sometimes arbitrary) cut-points may sacrifice important information by lumping together children below the cut-point who manifest substantial risk for clinical problems with children who are unlikely to develop problems. Consistent with the cross-cutting nature of irritability in psychopathology,29,30 the Temper Loss scale and its longitudinal variation was strongly and uniquely predictive of symptoms of both mood and disruptive disorders.

What is the added value of assessing the dimensional spectrum of irritability in young children? First, Temper Loss provides unique information about emergent irritability relevant to cross-cutting syndromes. In addition, the use of continuous severity scores rather than symptoms provides unique information about variability. DSM-IV symptoms are designed to capture extreme manifestations of behavior that clearly demarcate clinical problems. This may be adequate for identifying children with severe problems, and for ruling out problems for children who are emotionally very well regulated. However, there is increased consensus about the importance of identifying not just symptoms, but also prodromal patterns.31 The Temper Loss scale identifies abnormality within a narrow developmental age band in terms of (a) unusual frequency (i.e., rare occurrence and high frequency of commonly occurring behaviors), as well as (b) qualitatively atypical expression of behavior and emotions. In this way, dimensional patterns can identify children who have irritable tendencies that are not yet severely impairing but who have a significant probability of becoming impaired over time. Developmental specification of the boundaries between normal and abnormal is key for prodromal identification, particularly during early childhood when the core behaviors that define irritability also occur normatively.32

These findings also highlight for the clinician the dynamic nature of irritability. Over only an average period of 16 months, approximately 2/3 of children exhibited fairly stable irritability patterns, whereas even extreme irritability was transient in the other third. Characterizing such short-term variability lies at the heart of clinical prediction, and dimensional approaches provide a vital tool for addressing this need. The importance of this is further shown in models with two relatively closely spaced occasions of measurement, where effects varied across occasions or required multiple occasions to manifest. Even within a relatively short period of time, a substantial minority of children showed meaningful variation, and accounting for this variation enhanced clinical prediction. In some ways, examining changes over such relatively short periods is as important as examining longer time periods, because the relatively short timeframe covered by the study establishes a short-term benchmark for the clinician. This short-term benchmark might define the boundaries of a meaningful “watch and wait” period for a clinician.

Findings suggest that a focus solely on the extreme end of the continuum will underestimate prodromal risk. However, a clinical challenge raised by identifying a dimensional spectrum of risk is that it brings to the fore a clinical gray area. This raises the question, When is intervention warranted for young children at the boundaries of risk? This murkiness is perhaps best highlighted by individual differences in the probability of having a DSM-IV disorder for preschoolers just a bit higher than average on the Temper Loss scale (1 SD); 2/3 had a clinically significant disorder but 1/3 did not. Clearly, irritability is a complex trait that may or may not become impairing: What determines which way the clinical wind will blow for young children at this normal:abnormal boundary? This requires going beyond a single dimensional score for clinical decision-making.

Empirically determining the factors that determine probabilistic risk for children at the mid-range of the irritability dimensional spectrum is the critical next step in research designed to advance clinical applications. This will require a neurodevelopmental profile approach33 with key elements of (1) dimensional clusters of behavior; (2) executive function; and (3) longitudinal patterning. Clustering the covariation of irritability with other salient dimensions of behavior (e.g. aggression, impulsivity, anxiety) and developmental competencies (e.g. social skills, language) is important for generating an integrated profile of developmental risk.34,35 Atypicalities in prefrontal regions sub-serving executive function have been demonstrated in clinical populations of irritable youth,29 and cognitive flexibility has been shown to buffer high irritable older youth from progressing to severe antisocial behavior (Hawes S, Perlman S, Byrd A, Raine A, Loeber R, Pardini D, unpublished data, 2014). In young children, conjoint consideration of irritable behavior and delays in maturation of executive function are theorized as a key explanatory factor for the clinical escalation of early irritability (Perlman S, Jones B, Wakschlag L, Axelson D, Birmaher B, Phillips M, unpublished data, 2014).36 With regard to longitudinal variation, assessing irritability at a single time point is likely to contribute to both over- and under-identification. Empirical investigation is needed to determine the optimal number and spacing of time-points to provide an adequate level of certainty in a manner that is also clinically feasible. This type of neurodevelopmental probabilistic assessment approach holds promise for differentiating children with moderate levels of temper loss at highest risk of clinical progression.

Because dimensional approaches do not provide clear thresholds for clinical-decision making, a stepped framework may be needed to incorporate dimensional information clinically. We have previously suggested the following heuristic:11 Level 1: Well-regulated emotions and behavior (annual assessments); Level 2: Watch (longitudinally) and wait: clinically at risk. This level would be targeted to children with mildly elevated scores (e.g., .5–1 SD from the mean) to assess whether patterns escalate, remain stable, or diminish over the course of a year. Level 3: Low intensity intervention: clinically prodromal. A level targeted for children with moderately elevated (e.g., >1 SD above the mean) scores at least 3 (e.g. bimonthly) time-points. Our data suggest that this subgroup of children have more than a 2/3 chance of having a clinical disorder. Thus, developmentally promoting, low-intensity interventions (e.g., improving self-regulation skills) may be warranted, as the benefits of preventing frank disorder are likely to far outweigh the costs. Level 4: Treatment: clinically significant. Highly elevated scores (e.g. ≥2 SD above the mean) at two time-points indicates clearly abnormal patterns that warrant in-depth assessment and treatment. The use of computer adaptive test (CAT) approaches may provide a brief, efficient method of longitudinal monitoring for this purpose.37,38 The importance of proximal family context as a buffer against clinical progression is well documented. 39,40 Although beyond the scope of the present paper, this will also be another important future direction to incorporate into a multifaceted framework for probabilistic clinical decision making.

Our findings must be considered within the limitations of the present dataset. First, shared method variance may have inflated associations, as mothers were reporters for most outcomes. However, this is representative of what typically happens in clinical assessments of preschoolers, where the mother’s report is often the only source of information. Second, in order to explicate the “shape” of dimensional patterns and clinical risk, we focused centrally on a single dimension, i.e., irritability, although clearly such behaviors cannot be considered in isolation for clinical purposes. Third, our longitudinal follow-up was over a relatively short period (~16 months on average), and children’s age at baseline and intervals across time points were not uniform. All of these limitations in longitudinal measurement impeded our ability to specify when change was due to development versus other factors (e.g. contextual changes).

The present findings speak to the need for an empirically validated multi-level dimensional assessment toolkit, aligned with the framework articulated by the National Institute of Mental Health Research Domain Criteria (RDoC).41 With this in hand, clinicians will have an integrated way to assess and interpret dimensional patterns in conjunction with directly observed behavior and neurocognition. Studies conducted within this type of clinical–developmental framework will provide crucial data for determining how to meaningfully incorporate information on variations across the normal:abnormal spectrum to enhance early identification and clinical decision-making. Such an approach holds promise for advancing a truly developmental understanding of clinical phenomenology, ontogeny, and course.

Supplementary Material

1. Figure S1.

Overview of Multidimensional Assessment of Preschoolers Study longitudinal assessment waves.

2

Table S1. Baseline Demographic Characteristics of Multidimensional Assessment of Preschoolers Study (MAPS) Intensive Sample (n=497)

Table S2: Breakdown of Children With and Without Useable Candy Game Data

Table S3: Full Regression Models for Clinical and Incremental Validity of Temper Loss: Predicting DSM-IV Symptoms (T1–T2) and Disorders (T1) and Impairment (T1–T2) from T0 Temper Loss

Table S4: Full Regression Models for Incremental Validity of Longitudinal Temper Loss Assessments (T0–T1) for Predicting T2 Symptoms

Acknowledgments

Drs. Wakschlag, Estabrook, Henry, and Briggs-Gowan were supported by National Institute of Mental Health grants R01MH082830, UO1MH090301, and 2U01MH082830, and Dr. Perlman was supported by K01MH094467. Dr. Wakschlag was also supported by the Walden and Jean Young Shaw Foundation.

The authors thank Alice Carter, PhD, University of Massachusetts-Boston, Patrick Tolan, PhD, University of Virginia, Carri Hill, PhD, Rush University, and Barbara Danis, PhD, Northwestern University, for their contributions to the Multidimensional Assessment Profile of Disruptive Behavior (MAP-DB). This study would not have been possible without the outstanding efforts of Jacqueline Kestler, MPH, Erica Anderson, PhD, of Northwestern University, and their dedicated team, and the generous participation of the Multidimensional Assessment of Preschoolers (MAPS) Study families. The authors also thank the pediatric clinics and participants from Rush University, the University of Illinois at Chicago, NorthShore University HealthSystem, North Suburban Pediatrics, and the following pediatric research group practices for their participation: Healthlinc in Valparaiso, IN, Healthlinc in Michigan City, IN, and Associated Pediatricians in Valparaiso, IN.

Footnotes

Supplemental material cited in this article is available online.

Dr. Estabrook served as the statistical expert for this research.

Disclosure: Drs. Wakschlag, Estabrook, Petitclerc, Henry, Perlman, Voss, Pine, Leibenluft, Briggs-Gowan, and Mr. Burns report no biomedical financial interests or potential conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Dr. Lauren S. Wakschlag, Northwestern University, Evanston, IL and Institute for Policy Research, Northwestern University.

Dr. Ryne Estabrook, Northwestern University, Chicago.

Dr. Amelie Petitclerc, Northwestern University, Chicago.

Dr. David Henry, Institute for Health Research and Policy, University of Illinois at Chicago.

Mr. James L. Burns, University of Pittsburgh, Pittsburgh, PA.

Dr. Susan B. Perlman, University of Pittsburgh, Pittsburgh, PA.

Dr. Joel L. Voss, Northwestern University, Chicago.

Dr. Daniel S. Pine, National Institute of Mental Health, Bethesda, MD.

Dr. Ellen Leibenluft, National Institute of Mental Health, Bethesda, MD.

Dr. Margaret Briggs-Gowan, University of Connecticut, Farmington.

References

  • 1.Leibenluft E. Severe mood dysregulation, irritability, and the diagnostic boundaries of bipolar disorder in youths. Am J Psychiatry. 2011;168:129–142. doi: 10.1176/appi.ajp.2010.10050766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Stringaris A, Goodman R. Longitudinal outcome of youth oppositionality: Irritable, headstrong, and hurtful behaviors have distinctive predictions. J Am Acad Child Adolesc Psychiatry. 2009;48:404–412. doi: 10.1097/CHI.0b013e3181984f30. [DOI] [PubMed] [Google Scholar]
  • 3.Stringaris A, Zavoa H, Leibenluft E, Maughan B, Eley T. Adolescent irritability: Phenotypic associations and genetic links with depressed mood. Am J Psychiatry. 2012;169:47–54. doi: 10.1176/appi.ajp.2011.10101549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Markon K, Krueger R, Watson D. Delineating the structure of normal and abnormal personality: An integrative hierarchical approach. J Pers Soc Psychol. 2005;88:137–157. doi: 10.1037/0022-3514.88.1.139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Deater-Deckard K, Wang Z. Anger and irritability. In: Zentner M, Shiner RL, editors. Handbook of Temperament. New York: Guilford Press; 2012. pp. 124–144. [Google Scholar]
  • 6.Wakschlag L, Tolan P, Leventhal B. “Ain’t misbehavin”: Towards a developmentally-specified nosology for preschool disruptive behavior. J child psychol psychiatry. 2010;51:3–22. doi: 10.1111/j.1469-7610.2009.02184.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wakschlag L, Henry D, Tolan P, Carter A, Burns J, Briggs-Gowan M. Putting theory to the test: Added value of a multidimensional, developmentally-sensitive approach to disruptive behavior in early childhood. J Am Acad Child Adolesc Psychiatry. 2012;51:593–604. doi: 10.1016/j.jaac.2012.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Belden A, Thompson N, Luby J. Temper tantrums in healthy versus depressed and disruptive preschoolers: Defining tantrum behaviors associated with clinical problems. Journal of Pediatrics. 2008;152:117–122. doi: 10.1016/j.jpeds.2007.06.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dougherty LR, Smith VC, Bufferd SJ, et al. Preschool irritability: Longitudinal associations with psychiatric disorders at age 6 and parental psychopathology. J Am Acad Child Adolesc Psychiatry. 2013;52:1304–1313. doi: 10.1016/j.jaac.2013.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wakschlag L, Choi S, Carter AS, et al. Defining the developmental parameters of temper loss in early childhood: Implications for developmental psychopathology. Journal of Child Psychology and Psychiatry. 2012;53:1099–1108. doi: 10.1111/j.1469-7610.2012.02595.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bufferd S, Dyson M, Hernandez I, Wakschlag L. Explicating the “developmental” in preschool psychopathology. In: Cicchetti D, editor. Developmental Psychopathology. 3. Hoboken, NJ: Wiley; in press. [Google Scholar]
  • 12.Kochanska G, Coy K, Murray K. The development of self-regulation in the first four years of life. Child Development. 2001;72:1091–1111. doi: 10.1111/1467-8624.00336. [DOI] [PubMed] [Google Scholar]
  • 13.Wiggins J, Mitchell C, Stringaris A, Leibenluft E. Developmental trajectories of irritability and bidirectional associations with maternal depression. J Am Acad Child Adolesc Psychiatry. 2014;53:1191–1205. doi: 10.1016/j.jaac.2014.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Nichols S, Briggs-Gowan M, Estabrook R, et al. Punishment insensitivity in early childhood: A developmental, dimensional approach [epub ahead of print] Journal of Abnormal Child Psychology. 2014 doi: 10.1007/s10802-014-9950-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wakschlag L, Choi S, Carter AS, et al. Defining the developmental parameters of temper loss in early childhood: Implications for developmental psychopathology. Journal of Child Psychology and Psychiatry. 2012;53:1099–1108. doi: 10.1111/j.1469-7610.2012.02595.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wakschlag L, Briggs-Gowan M, Choi S, et al. Advancing a multidimensional, developmental spectrum approach to preschool disruptive behavior. J Am Acad Child Adolesc Psychiatry. 2014;53:82–96.e3. doi: 10.1016/j.jaac.2013.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bentler PM. Comparative fit indexes in structural models. Psychology. 1990;107(2):238–246. doi: 10.1037/0033-2909.107.2.238. [DOI] [PubMed] [Google Scholar]
  • 18.Tucker L, Lewis C. A reliability coefficient for maximum likelihood factor analysis. Pschometrika. 1973;23:111–136. [Google Scholar]
  • 19.Browne MW, Cudeck R. In: Alternative ways of assessing model fit. Bollen KA, Long JS, editors. Beverly Hills, CA: Sage; 1993. [Google Scholar]
  • 20.Reise S, Waller N. Item response theory and clinical measurement. Annu Rev Clin Psychol. 2009;5:25–46. doi: 10.1146/annurev.clinpsy.032408.153553. [DOI] [PubMed] [Google Scholar]
  • 21.Egger HL, Erkanli A, Keeler G, Potts E, Walter B, Angold A. The test-retest reliability of the Preschool Age Psychiatric Assessment (PAPA) J Am Acad Child Adolesc Psychiatry. 2006;45:538–549. doi: 10.1097/01.chi.0000205705.71194.b8. [DOI] [PubMed] [Google Scholar]
  • 22.Cella D, Yount S, Rothrock N, et al. The Patient-Reported Outcomes Measurement Information System (PROMIS): Progress of an NIH Roadmap Cooperative Group during its first two years. Medical Care. 2007;45(5):S3–S11. doi: 10.1097/01.mlr.0000258615.42478.55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Crowley M, Mays LC. The Candy Reversal Learning Task. New Haven, CT: Yale Child Study Center; 2004. [Google Scholar]
  • 24.Gorrindo T, Blair R, Budhani S, Dickstein D, Pine D, Leibenluft E. Deficits on a probabilistic response-reversal task in patients with pediatric bipolar disorder. Am J Psychiatry. 2005;(162):1975–1977. doi: 10.1176/appi.ajp.162.10.1975. [DOI] [PubMed] [Google Scholar]
  • 25.Gresham F, Elliot S. Social Skills Improvement System: Rating Scales Manual. Minneapolis: Pearson; 2008. [Google Scholar]
  • 26.LeBuffe PA, Nagalieri JA. The Devereux Early Childhood Assessment. Lewisville, NC: Kaplan Press Publishing; 1999. [Google Scholar]
  • 27.Elliott C. Differential Abilities Scales: Introductory and technical handbook. New York: Psychological Corporation; 1983. [Google Scholar]
  • 28.Gadow KD, Sprafkin J, Nolan EE. DSM-IV symptoms in community and clinic preschool children. J Am Acad Child Adolesc Psychiatry. 2001;40(12):1383–1392. doi: 10.1097/00004583-200112000-00008. [DOI] [PubMed] [Google Scholar]
  • 29.Stringaris A, Rowe R, Maughan B. Mood dysregulation across developmental psychopathology - general concepts and disorder specific expressions. J child psychol psychiatry. 2012;53:1095–1097. doi: 10.1111/jcpp.12003. [DOI] [PubMed] [Google Scholar]
  • 30.Leibenluft E, Stoddard D. The developmental psychopathology of irritabiltiy. Development and Psychopathology. 2013;25:1473–1487. doi: 10.1017/S0954579413000722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Insel T. Disruptive insights in Psychiatry: Transforming a clinical discipline. Journal of Clinical Investigation. 2009;119:700–705. doi: 10.1172/JCI38832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wakschlag LS, Choi S, Carter A, et al. Defining the developmental parameters of temper loss in young children: Implications for developmental psychopathology. J Child Psychol Psychiatry. 2012;53:1099–1108. doi: 10.1111/j.1469-7610.2012.02595.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Karalunas S, Fair DA, Musser E, Aykes K, Iyer S, Nigg J. Subtyping Attention-Deficit/Hyperactivity Disorder using temperament dimensions: Towards biologically based nosologic criteria. JAMA Psychiatry. 2014;71:1015–24. doi: 10.1001/jamapsychiatry.2014.763. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 34.Beitchman JH, Wilson B, Brownlie EB, Walters H, Inglis A, Lancee W. Long-Term Consistency in Speech/Language Profiles: II. Behavioral, Emotional, and Social Outcomes. J Am Acad Child Adolesc Psychiatry. 1996;35(6):815–825. doi: 10.1097/00004583-199606000-00022. [DOI] [PubMed] [Google Scholar]
  • 35.Cote S, Vaillancourt T, Barker E, Nagin D, Tremblay R. The joint development of physical and indirect aggression: Predictors of continuity and change during childhood. Dev Psychopathol. 2007;19:37–55. doi: 10.1017/S0954579407070034. [DOI] [PubMed] [Google Scholar]
  • 36.Healey DM, Brodzinsky L, Bernstein B, Rabinovitz B, Halperin J. Moderating effects of neurocognitive abilities on the relationship between temperament and global functioning. Child NeuroPsychology. 2009;16:20–31. doi: 10.1080/09297040902984490. [DOI] [PubMed] [Google Scholar]
  • 37.Choi SW, Reise SP, Pilkonis PA, Hays RD, Cella D. Efficiency of static and computer adaptive short forms compared to full-length measures of depressive symptoms. Qual Life Res. 2010;19:125–36. doi: 10.1007/s11136-009-9560-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wakschlag LS, Briggs-Gowan M, Choi S, et al. Advancing a multidimensional, developmental spectrum approach to preschool disruptive behavior. J Am Acad Child Adolesc Psychiatry. 2014;53:82–96. doi: 10.1016/j.jaac.2013.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Wakschlag L, Hans S. Relation of maternal responsiveness during infancy to the development of behavior problems in high risk youths. Developmental Psychology. 1999;37:569–579. doi: 10.1037//0012-1649.35.2.569. [DOI] [PubMed] [Google Scholar]
  • 40.Hill C, Maskowitz K, Danis B, Keenan K, Burns J, Wakschlag L. Validation of a clinically sensitive observational coding system for parenting behaviors: The Parenting Clinical Observation Schedule (P-COS) Parent Sci Pract. 2008;8:53–85. doi: 10.1080/15295190802045469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cuthbert BN. The RDoC framework: Faciltiating transition from ICD/DSM to dimensional approaches that integrate neuroscience and psychopathology. World Psychiatry. 2014;13:28–35. doi: 10.1002/wps.20087. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1. Figure S1.

Overview of Multidimensional Assessment of Preschoolers Study longitudinal assessment waves.

2

Table S1. Baseline Demographic Characteristics of Multidimensional Assessment of Preschoolers Study (MAPS) Intensive Sample (n=497)

Table S2: Breakdown of Children With and Without Useable Candy Game Data

Table S3: Full Regression Models for Clinical and Incremental Validity of Temper Loss: Predicting DSM-IV Symptoms (T1–T2) and Disorders (T1) and Impairment (T1–T2) from T0 Temper Loss

Table S4: Full Regression Models for Incremental Validity of Longitudinal Temper Loss Assessments (T0–T1) for Predicting T2 Symptoms

RESOURCES