Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Apr 12.
Published in final edited form as: Eur J Pers. 2010 Dec;24(8):656–674. doi: 10.1002/per.767

Context specificity in stability of hyperactivity-impulsivity

J Kim, K Deater-Deckard, P Y Mullineaux, C R Beekman
PMCID: PMC3074171  NIHMSID: NIHMS250174  PMID: 21494576

Abstract

This paper exemplifies a secondary data analysis of context-specific differences in children’s hyperactivity-impulsivity while controlling for informant-specific effects. Participants were boys and girls from the NICHD Study of Early Child Care and Youth Development who were measured in 1st, 3rd, and 5th grades. Latent factor models were structured using multi-informant reports including mothers, fathers, teachers, and observers. Temporal stability within a context was stronger than cross-context consistency, and the magnitude of longitudinal stability was higher in the home context compared to the school context. Controlling for informant-specific effects resulted in a significantly improved model fit and increased within-context stability. Our findings highlight the importance of considering both context and informant effects when studying longitudinal stability and change in personality development.

Keywords: Longitudinal Stability, Context Specificity, Informant Variation, Hyperactivity-Impulsivity


Many developmental theorists agree that temperament represents underlying dispositions, and possibly traits. Individual differences in temperament are evident as behaviors that vary widely across individuals, are readily observed from early in life, are somewhat stable over time and across settings, and include a biological foundation (Buss & Plomin, 1984; Goldsmith, Buss, & Lemery, 1997; Rothbart & Bates, 1998; Sanson, Hemphill, & Smart, 2004; Strelau, Zawadzki, & Piotrowska, 2001). Understanding how individual differences in temperament—and related personality factors—unfold over time is greatly enhanced by its accurate assessment aided by using multiple measures that span at least two contexts and that involve observations by multiple informants. In the current quantitative longitudinal study, we used factor analytic models with such data to examine the potential effects of contexts and informants on longitudinal stability and change in two indicators of child behavior that are related to individual differences in temperament, personality and psychopathology: hyperactivity and impulsivity.

Temperament is one of the bases of personality, the latter being a much broader array of attributes including perceptions and interests (Keogh, 2004; Rothbart, Ahadi, & Evans, 2000). Typically, developmental theorists state that temperament represents underlying dispositions or traits. However, some theorists have focused on individual differences in temperament reflect mainly characteristic styles of behavioral response (Buss & Plomin, 1975; Thomas & Chess, 1977) whereas other theorists view temperament as including individuals’ affective qualities (Rothbart, Ahadi, Hershey, & Fisher, 2001; Rothbart & Bates, 1998). In the further development of the operationalization of child temperament, Rothbart and her colleagues suggested the importance of individual differences in attentional self-regulation as a basic dimension of temperament (Posner & Rothbart, 1998; Rothbart et al., 2000) and demonstrated evidence for three distinct dimensions of child temperament – surgency/extraversion, negative affectivity, and effortful control (Rothbart et al., 2001). Surgency/extraversion captures “approach” tendencies and includes activity level, impulsivity, high intensity pleasure, and low shyness. Negative affectivity includes sadness, anger, discomfort, and low soothability. Effortful control includes enjoyment of low-intensity stimulation, perceptual sensitivity, and self-control of impulses and attention.

There is evidence of context and method/informant effects in child temperament (Sanson et al., 2004). Agreement between different informants on measures of temperament is far less than unity, with correlations ranging from .15 to .51 between maternal and observer ratings, and between mothers’ and fathers’ ratings (Bornstein, Gaughran, & Segui, 1991; Carnicero, Perez-Lopez, Salinas, & Martinez-Fuentes, 2000; Goldsmith, 1996; Goldsmith & Campos, 1990; Hayden, Klein, & Durbin, 2005; Rothbart et al., 2001; Seifer, Sameroff, Barrett, & Krafchuk, 1994). This modest to moderate agreement between informants could arise from informants using different bases of knowledge when evaluating child behavior (De Los Reyes & Kadzin, 2005), but it also could indicate that they are reliably reporting children’s context-specific behaviors (Mangelsdorf, Schoppe, & Buur, 2000; Beenstra et al., 2008). It might also arise from informant-specific “rater bias”. For example, depression is associated with systematic bias in maternal ratings of their children’s hyperactivity-impulsivity (Boyle & Pickles, 1997).

In early childhood through middle childhood, exploratory and confirmatory factor analyses indicate that impulsivity and activity level together provide the highest loadings on the surgency/extraversion factor, forming the foundation for the surgency/extraversion temperament dimension (e.g., Mullineaux, Deater-Deckard, Petrill, Thompson, & DeThorne, 2009; Rothbart et al., 2001).Hyperactivity-impulsivity seems to be one of the most phenotypically stable temperament dimensions although longitudinal stability of childhood hyperactivity-impulsivity varies widely depending on the assessment method utilized (see Pelham, Fabiano, & Massetti, 2005 for a review). Olson, Schilling and Bates (1999) reported stability in impulsivity between 1st and 3rd grade of r = .44. A study by Rothbart and her colleagues (2001) further demonstrated that the two components of surgency/extraversion – impulsivity and activity level – were two of the most stable facets over time (i.e., two-year stability from 5 to 7 years of age was in .7 range for mothers and fathers), compared to all other facets of temperament measured by the Children’s Behavior Questionnaire (CBQ). More recently, Majdandzic and van den Boom (2007) used latent factor models to examine consistency across situations and time in early childhood and reported higher temporal stability for the parent perceptions of activity level based on mothers’ and fathers’ reports on the CBQ (.83), compared to the observed activity level measured by ratings on the Laboratory Temperament Assessment Battery (Lab-TAB) (.70). In regard to inter-rater agreement, impulsivity and activity level both have shown moderate to substantial (r = .4 to .7 range) mother-father agreement, as shown in multiple samples (e.g., Majdandzic & van den Boom, 2007; Mullineaux et al., 2009; Rothbart et al., 2001).

There is a substantial literature supporting the heritability of temperament (e.g., Auerbach, Faroy, & Ebstein, 2001; Goldsmith, Lemery, & Essex, 2004). In particular, prior behavioral genetic studies have demonstrated that hyperactivity-impulsivity is highly heritable (see Levy, McStephen, & Hay, 2001). However, the high end of such heritability estimates (i.e., .8 and above) have been found in investigations in which parents were the informants who provided information on the twins’ zygosity and rated their children’s behaviors (Nigg, Hinshaw, & Huang-Pollock, 2006). It is plausible that those estimates were inflated by rater contrast effects (Eaves et al., 2000).

According to a trait/state view of temperament behaviors like hyperactivity-impulsivity, genetic and environmental factors work together to cause variation in neuronal and biochemical factors that in turn cause variation in an abstract trait, which itself is observed directly as transient (state-like) and stable (trait-like) behaviors (Strelau, 2001). These behaviors also are influenced by immediately contingent situational, physiological, and psychological setting conditions. In the current study, we considered variance that is stable across time as well as variance that is time/context/informant-specific, in an effort to test directly the veracity of the disposition concept. By developing a multi-informant measurement model and testing its sensitivity to detecting time and context variants, we attempted to derive accurate estimates of the most likely ‘trait’ and ‘state’ components of variance in some behavioral indicators of dispositional hyperactivity-impulsivity—as a conceptual and statistical model that could be applied to all dimensions of temperament and personality.

There remains a longstanding debate about which methods and informants are most valid and reliable to operationalize temperament. One approach that has the potential to resolve this debate is the use of multi-informant and method constructs with good internal and external validity (Karp, Serbin, Stack, & Schwartzman, 2004). The use of multi-informant, multi-context temperament constructs not only can elucidate systematic method/rater biases if they exist, but also can promote the development of new measurement models and assessment tools with improved psychometric properties, including enhanced predictive validity and replicability (Nunnally & Bernstein, 1994; Rushton, Brainerd, & Pressley, 1983).

One common approach that has been used involves computation of multi-informant composites that are calculated as a simple average score across different informants. Such composites constrain the data by obscuring the examination of the variance in behavior that may or may not overlap across different contexts and informants. Another approach utilizes latent variables to combine information from multiple informants and to isolate informant-specific effects as measurement error that is not examined. In the current study, we offer an alternative that integrates these common approaches without removing or obscuring context and informant effects. We examined longitudinal factor models of temperament that utilizes multi-informant scores, in an effort to describe and predict longitudinal stability and change from 4.5 to 11 years of age.

Many studies of child temperament share the same data structure: parents (and sometimes observers) report on child behavior in the home context, and teachers (and sometimes different observers) report on child behavior in the childcare or school context. The development of models that can address potential context and informant effects is important, especially since context and informant effects are always confounded in the longitudinal data in this common design. Our goal was to identify systematic context effects— for home and school settings—on stability and change in behavioral indicators of hyperactivity-impulsivity in the transition to and through middle childhood, while also considering mother, father, teacher, and observer informant variance. We chose to examine hyperactivity-impulsivity because it is a temperament trait that is highly heritable, highly stable, and easily observable by others. Furthermore, hyperactivity-impulsivity plays a central role in the conceptualization of child psychopathology (Quay, 1993).

Although secondary analysis offers opportunities for increasing the informational value as well as a relatively low cost way to ask original research questions (Bullock, 2007), we were limited as to the available variables in constructing the measurement models of hyperactivity-impulsivity. One consequence was that the items used in the current analysis were narrowly defined behavioral indicators of the temperamental dimension of surgency/extraversion. Although it is beyond the scope of this study, it should be noted that researchers of child temperament and psychopathology generally agree that temperament incorporates the normative range of responding, whereas behavior problems are symptom clusters that are dysfunctional, excessive, maladaptive, or debilitating (Achenbach, 1995). Measurement confounding (i.e., item-content overlap) has been an issue in the studies of child temperament and behavior problems, because most temperament and symptom measures tap behaviors, and similar behaviors may stem from either temperament or behavior problems (Lemery, Essex, & Smider, 2002). Empirical evidence, however, has been accumulated suggesting that measurement confounding does not account for the association between temperament and behavior problems (Lemery et al., 2002; Lengua, West, & Sander, 1998). The items used in this study were adopted from the CBCL and the TRF that rated hyperactive behaviors (rather than activity) and impulsive behaviors. Thus, these item scan be viewed as reflecting the distribution in the range of behavior problems rather than normal range behaviors. Nevertheless, we believe that these items could capture individual differences in the temperamentally based behaviors that were part of a surgency/extraversion dimension.

The existing research on temperament described above (though almost all of it based on mono-method/informant assessments of constructs) suggests moderate to substantial short-term (1–2 years) stability of child hyperactivity-impulsivity, with the estimates of stability declining over longer time periods(e.g., Green, Loeber, & Lahey, 1991). In light of this literature, we hypothesized that individual differences in the multi-informant construct of hyperactivity-impulsivity would show moderate to high stability within the same context across time (2 years).Based on previous findings of stronger stability with particular contexts compared to consistency across contexts (e.g., Majdandzic & van den Boom, 2007; Ruff, Capozzoli, & Weissberg, 1998), we hypothesized that within-context longitudinal stability would be comparable or exceed the magnitude of correlations between contexts within a concurrent time point. We also expected that longitudinal stability for the measurement construct would be stronger for the home context than for the school context, in line with previous research that suggest higher consistency for parent reports compared to observational ratings. In addition, it was important for us to consider informant effects in the longitudinal factor model in order to identify and distinguish stable informant effects (i.e., systematic effects of informants due to the unique aspects of the relationship with the child and/or due to informant-specific rater bias) from reliable cross-informant variance when estimating context-specific hyperactivity-impulsivity. Overall, current theory in temperament emphasizes the role of social influences (e.g., demands of the situation such as home or school, or cultural expectations) in the development of individual differences (e.g., Sanson et al., 2004). The examination of reliable estimates of the ‘trait’ and ‘state’ components of temperamentally-based hyperactive-impulsive behaviors among children has significant implications for understanding of the developmental continuity in child temperament.

Method

Participants

In the current study, we examined the public datasets of the NICHD (National Institute of Child Health and Development) Study of Early Child Care and Youth Development (http://www.nichd.nih.gov/research/supported/seccyd/datasets.cfm). Data collection began in 1991 in 9 states (Arkansas, California, Kansas, New Hampshire, North Carolina, Pennsylvania, Virginia, Washington, and Wisconsin) and included 1364 children (52% male) and their families when the children were one month of age. The current sample included 978 children (482 boys and 496 girls) who had mothers’ ratings of hyperactivity-impulsivity measures for at least two time points over the three assessments in 1st, 3rd, and 5th grades. Participating children were from five racial categories: White (83%); Black (11%); Asian or pacific islander (1%); American Indian, Inuit, or Aleutian (1%); and Other (4%). At the first time point (1st grade) these 978 children did not differ from the 102 children who were not included in the current sample because they had data only in 1st grade but not in 3rd or 5th grade (i.e., ‘non-participants’) with respect to gender (49% boys among participants vs. 57 % boys among non-participants), race (83% whites among participants vs. 77 % whites among non-participants), maternal education (an average of 14.5 years among participants vs. 14.2 years among non-participants), and the presence of a husband or partner for the mothers (82% among participants vs. 83 % among non-participants).Mothers of those 978 children, however, had significantly higher family incomes compared to the 102 non-participants (income-to-need ratio; M = 4.06 vs. 3.01, t (980) = 4.29, p< .05). The analysis sample did not differ from the non-participants with respect to mothers’, fathers’, and teachers’ reports on hyperactivity/impulsivity in 1st grade, but showed slightly lower levels of hyperactivity/impulsivity according to observers’ ratings (M = 4.35 vs. 4.14, t (949) = 2.03, p = .04).

Measures

We used mothers’, fathers’, teachers’, and observers’ (trained paid research staff) ratings on items pertaining to several key indicators of hyperactivity and impulsivity. The items were selected based on 1) face validity from a variety of instruments that are described here (see Table 1 for an overview), and 2) availability across the three measurement points (i.e., measured consistently in 1st, 3rd, and 5th grade.

Table 1.

Measures used to construct hyperactivity-impulsivity, by time point, and N with valid data for mothers’ (M), fathers’ (F), teachers’ (T), and observers’ (O) reports

1st Grade
3rd Grade
5th Grade
M F TO M F TO M F TO
(Number)
Child Behavior Checklist 906 603 -- -- 953 698 -- -- 939 696 -- --
Teacher Report Form -- -- 921 -- -- -- 925 -- -- -- 886 --
Classroom Observation System -- -- -- 881 -- -- -- 910 -- -- -- 900

Parent and teacher ratings of children’s hyperactive-impulsive behavior were gathered at each time point. The Child Behavior Checklist or CBCL and the closely related Teacher Report Form or TRF (Achenbach, 1991) are scored using a three-point Likert-type scale: 0 = not true, 1 = somewhat or sometimes true, 2 = very true or often true. From the CBCL and the TRF, we included the same two items from each (can’t sit still, restless, hyperactive; impulsive or acts without thinking).

The Classroom Observation System or COS was developed by the SECC Steering Committee for the National Institute of Child Health and Development Study of Early Child Care and Youth Development (2006). The COS captured discrete child behaviors and interactions with others in the classroom over the course of two 44-minute observation cycles. Each cycles consisted of three ten-minute time sampled periods (30-second observe and 30-second record intervals). In addition to the discrete behavioral recordings completed, a qualitative rating was completed for each these cycles using a global 7-point scale (1 = uncharacteristic to 7 = extremely characteristic). These three global ratings were then averaged to yield an overall child rating across the observational cycles. For hyperactivity, we used the activity level scale which reflected an overall rating of how physically active, restless, and fidgety the child was during the observation period. Inter-rater reliability estimates for the activity level based upon repeated measures ANOVA (i.e., the unbiased estimate of the reliability of the mean of k = 2 measurements after taking into account differences in the raters, k= number of raters, described by Winer, 1971) were .70 for 1st grade ratings, .67 for 3rd grade ratings, and .61 for 5th grade ratings.

Statistical Analysis

We used confirmatory factor analyses (CFA) models via structural equation modeling (SEM; Bollen, 1989) to estimate measurement models of the behavioral dimensions of hyperactivity-impulsivity. We used two criteria to determine empirical indices of a certain latent psychological construct (Nunnally & Bernstein, 1994). First we considered content validity by examining the adequacy with which a specified domain of content was sampled. Each item/or scale that is said to comprise the latent construct must stand on its own as an adequate representation of that construct. Next we conducted a series of factor analyses to test internal validity—demonstrated empirically as showing that correlations among the manifest variables can be patterned according to the mathematical expectations of a single latent construct (McArdle, 1996). Only the observed variables that have significant factor loadings on the latent factors were maintained using a cutoff of .3 (accounting for 9% of the variance; Pedhazur & Schmelkin, 1991) for what we consider minimally acceptable factor loadings (with one exception of .22 for the COS rating on the school context latent factor in 5th grade).

We focused on testing measurement models for the latent context factors (i.e., home and school) based on the scores reported by multiple informants to study the factorial composition of hyperactivity-impulsivity while differentiating context effects. The latent factors of the home context consisted of the scores reported by mothers and fathers, and the latent factors of the school context consisted of the scores reported by teachers and observers. Items were reverse scored if necessary so that higher scores indicated higher levels of hyperactivity-impulsivity.

In order to examine longitudinal stability and changes in these behavioral indicators of temperament, we estimated longitudinal factor analysis models that specified within-occasion correlations between home and school context factors (i.e., cross-context correlations) as well as between-occasion auto-regressive effects for the same context factor (i.e., stability of contexts). We used the Amos 7.0 program (Arbuckle, 2006) that estimated parameters incorporating full information maximum likelihood (FIML) methods. The FIML methods allowed data from all individuals to be included regardless of their pattern of missing data and are more appropriate than other commonly used methods such as mean substitution. In evaluating the overall goodness of fit of each model, we report the chi-square goodness-of-fit statistic (χ2), degrees of freedom (df), and corresponding p value; the Comparative Fit Index (CFI), and the Root Mean Square Error of Approximation (RMSEA). CFI values greater than .90 and RMSEA values of .08 or lower are indicative of good-fitting models (Browne & Cudeck, 1993). In order to the significance of cross-context correlations and informant effects, equality constraints were hierarchically imposed to test the adequacy of the constraints using nested χ2 difference tests (Bollen, 1989). We also examined changes in CFI, because the chi-square difference test for nested model comparisons may be too sensitive with relatively large samples (Cheung & Rensvold, 2002). We tested if a value of CFI is greater than the recommended .01 (Cheung & Rensvold, 2002)to reflect a meaningful difference in model fit.

Results

Table 2 presents descriptive statistics (Means and SDs) of hyperactive-impulsivity scores based on mother, father, teacher, and observer reports in 1st, 3rd, and 5th grades – for the whole sample and separately for boys and girls. Two indicators for mother, father, and teacher ratings were averaged within informant and time. We performed a series of t-tests to examine differences between boys and girls, using a Bonferroni-adjusted alpha level (i.e., dividing the per comparison alpha level by the number of outcome variables, α = .004) to control for inflated Type I error rates (Jaccard & Guilamo-Ramos, 2002). Boys were rated as showing higher levels of hyperactive-impulsive behaviors regardless of informants and measurement time points (t = 3.51 to 9.51, p < .001).

Table 2.

Means and Standard Deviations for Mothers’, Fathers’, Teachers’, and Observers’ Ratings of Hyperactivity-Impulsivity

Full Sample
Mean (SD)
Boys
Mean (SD)
Girls
Means (SD)
Grade 1
 Mothers’ Ratings .46 (.50) .55 (.53) .39 (.45)
 Fathers’ Ratings .47 (.46) .57 (.48) .38 (.43)
 Teachers’ Ratings .36 (.56) .51 (.62) .23 (.40)
 Observers’ Ratings 4.15 (.85) 4.24 (.83) 4.04 (.85)
Grade 3
 Mothers’ Ratings .43 (.50) .51 (.51) .34 (.45)
 Fathers’ Ratings .45 (.48) .51 (.50) .38 (.45)
 Teachers’ Ratings .39 (.59) .53 (.64) .26 (.50)
 Observers’ Ratings 4.18 (.47) 4.24 (.50) 4.11 (.44)
Grade 5
 Mothers’ Ratings .39 (.47) .44 (.52) .28 (.41)
 Fathers’ Ratings .35 (.46) .41 (.48) .29 (.42)
 Teachers’ Ratings .31 (.51) .47 (.58) .16 (38)
 Observers’ Ratings 4.14 (.46) 4.20 (.45) 4.08 (.44)

We conducted repeated-measures general linear modeling (GLM) analysis to examine whether there were significant developmental changes in the levels of hyperactive-impulsive behaviors across 1st, 3rd, and 5th grades, and whether the changes differed by gender (i.e., time by gender interactions). The results indicated that reports by mothers, fathers, and teachers showed significant declines in children’s hyperactive-impulsive behaviors regardless of gender, with F (1, 844) = 54.06, p< .05 for mother ratings, F (1, 472) = 61.67, p< .05 for father ratings, and F (1, 779) = 7.13, p < .05 for teacher ratings. In contrast, there were no significant changes across 1st, 3rd, and 5th grades based on observer ratings, F (1, 779) = .38, p = .53.

We estimated zero-order bivariate Pearson correlations between all of the study variables which included 4 items for the home context and 3 items for the school context at each measurement time (available upon request).Inter-item correlations ranged from .20 to .41 for the home context, from .22 to .64 for the school context in 1st grade; from .27 to .49 for the home context and from .21 to .64 for the school context in 3rd grade; and from .33 to .52 for the home context and from .16 to .58 for the school context in 5th grade. Correlations between parent ratings ranged from .20 to .41 in 1st grade, from .27 to .49 in 3rd grade, and from .33 to .52 in 5th grade. Inter-item correlations for teacher ratings were .64 in 1st grade, .64 in 3rd grade, and .58 in 5th grade.

Figure 1 presents the longitudinal factor model of context and informant effects. Different informants represent possible systematic errors that are referred to as method factors (Campbell & Fiske, 1959). Informant effects were measured by the degree of covariation between the unique factors for the variables reported by the same informant at a given measurement time (Kenny & Kashy, 1992).Stabilities within the same informants were estimated through autocorrelations over time (between the adjacent times) for unique factors for the same variable provided by the same informant (see McArdle & Nesselroade, 1994). First, we tested the significance of informant effects in studying temporalstability and cross-context consistency of hyperactivity-impulsivity by comparing two models — one with and one without informant effects. If there are no systematic informant variances, then the fit of the simpler model (the one without estimating informant effects) should not be significantly worse than the one estimating informant effects. Second, we compared the models with and without within-time cross-context correlations in order to appreciate the significance of cross-context consistency. If the home and the school contexts were completely independent, then the model fit should not significantly degrade for the simpler model (the one without estimating cross-context correlations) compared to the more complex model which estimates cross-context correlations.

Figure 1.

Figure 1

Longitudinal factor model of context and informant effects for hyperactivity-impulsivity from 1st grade to 5th grade. CBCL = Child Behavior Checklist; TRF = Teacher Report Form; COS = Classroom Observation System; M = Mother; F = Father; T = Teacher; O = Observer; G1 = 1st Grade; G3 = 3rd Grade; G5 = 5th Grade.

We fitted a model that specified both informant effects and cross-context correlations. The correlations between unique factors (within a measurement time)represented informant effects, representing variances due to different sources of informants: mother, father, and teacher in 1st, 3rd, and 5th grades. In addition, to consider the longitudinal stability of informant effects, correlations between unique factors of the corresponding variables for the same informant between two adjacent occasions were estimated. The longitudinal factor model with informant effects — “the informant effect model” —fit the data well, χ2 (165) = 505.77, p = .00, CFI = .94, RMSEA = .05. In a subsequent model, the informant effects were fixed to zero. This model without informant effects provided a mediocre fit, χ2 (182) = 946.35, p = .00, CFI = .87, RMSEA = .07. Compared to the model with informant effects, the model fit for the model without informant effects degraded significantly, Δχ2 = 440.58, Δ df = 17, p< .05, and ΔCFI = .07.

Next, we compared the model with both informant effects and cross-context correlations with two alternative models that fixed cross-context correlations to zero in order to test the context specificity —whether the home and the school contexts are independent of each other. In this series of models, if we fixed one of the cross-context correlations to zero, the variance for the home context in 3rd grade became small yet negative (e.g., -.005). In order to avoid such a Heywood case(i.e., the estimated error term for an indicator for a latent variable is negative), the variance of the home context factor in 3rd grade and the covariance between the home and the school context factors in 3rd grade had to be fixed to zero. Consequently, we were not able to test the significance of the cross-context correlations in 3rd grade. We first tested the significance of the zero-order correlation between the home and the school contexts at 1st grade. This model without the cross-context correlation in 1st grade provided a mediocre fit, χ2 (168) = 830.76, p = .00, CFI = .88, RMSEA = .06. Compared to the model that estimates 1st grade correlation between the home and the school contexts – with the variance of the 3rd grade home context and the 3rd grade cross-context covariance fixed to zero, χ2 (167) = 508.06, p = .00, CFI = .94, RMSEA = .05– the model fit without the 1st grade cross-context correlation had a significantly worse fit, Δχ2 = 322.70, Δ df = 1, p< .05, and ΔCFI = .06.

We examined the significance of cross-context consistency between the home and the school contexts in 5th grade by fixing the correlation between the two context factors to zero. In doing so, within-context stability was kept in the model order to control for possible confounding effects of stability when examining cross-context consistency. This model without the cross-context correlation in 5th grade yielded an acceptable fit, χ2 (168) = 516.88, p = .00, CFI = .94, RMSEA = .05. Compared to the model with the cross-context correlation in 5th grade, the model fit did not significantly degrade by assuming a correlation of zero between the home and the school contexts in 5th grade, as indicated by a trivial change in CFI, Δχ2 = 8.82, Δ df = 1, p< .05, and ΔCFI = .002.

As shown in Table 3 and Figure 2, a closer examination of the significant coefficients in the best-fitting model – the informant effect model – indicated that all of the within-context stability coefficients were significant and substantial. There was a strong stability in the home context between 1st grade and 3rd grade (β = .97, p< .05), and between 3rd grade and 5th grade (β = .90, p< .05). Similarly, there was significant stability, with weaker magnitude compared to the home context, in hyperactivity-impulsivity measured in the school context between 1st grade and 3rd grade (β = .95, p< .05), and between 3rd grade and 5th grade (β = .69, p< .05). With respect to cross-context consistency, the concurrent correlation between the home and the school contexts in 1st grade was high and significant (r = .71, p< .05). In contrast, the association between hyperactivity-impulsivity measured in the home context and hyperactivity-impulsivity measured in the school context was significant but low in 5th grade (r = .25, p< .05), after controlling for the context-specific stabilities (i.e., autoregressive effects of 3rd grade context on 5th grade context). It should be noted that the variances of home and school context factors as well as the covariance between them (r = .55, p = .20) in 3rd grade became non-significant after controlling for informant effects and context-specific stabilities. This may be in part due to extremely high percentages of variances that were explained by the auto-regressive effects of the 1st grade factors (i.e., 94% for the home context and 90% for the school context).

Table 3.

Unstandardized parameter estimates, standard errors, and critical ratios for longitudinal factor models of hyperactivity-impulsivity from 1st Grade to 5th Grade

Model without informant effects
Model with informant effects
Variable Estimate Std. Error Critical Ratio Estimate Std. Error Critical Ratio
Factor loadings
 CBCL M1→Home G1 1= 1=
 CBCL M2→Home G1 1.29* .08 15.34 1.32* .09 14.47
 CBCL F1→Home G1 1.21* .09 13.37 1.25* .10 12.33
 CBCL F2→Home G1 .74* .07 10.07 .71* .08 8.92
 TRF T1 →School G1 1= 1=
 TRF T2 →School G1 1.09* .05 20.00 1.12* .06 19.01
 COS O1 →School G1 .55* .07 8.04 .74* .09 7.94
 CBCL M1→Home G3 1= 1=
 CBCL M2→Home G3 1.22* .07 18.33 1.27* .08 16.76
 CBCL F1→Home G3 .76* .06 11.92 .77* .07 10.62
 CBCL F2→Home G3 1.13* .07 15.31 1.19* .09 13.64
 TRF T1 →School G3 1= 1=
 TRF T2 →School G3 1.13* .06 20.20 1.15* .06 19.47
 COS O1 →School G3 .32* .04 8.84 .40* .05 8.52
 CBCL M1→Home G5 1= 1=
 CBCL M2→Home G5 1.21* .07 16.98 1.25* .08 16.12
 CBCL F1→Home G5 .80* .07 12.17 .77* .07 11.00
 CBCL F2→Home G5 1.10* .08 14.61 1.09* .08 13.38
 TRF T1 →School G5 1= 1=
 TRF T2 →School G5 .86* .06 15.06 .85* .06 15.53
 COS O1 →School G5 .21* .04 5.77 .20* .05 4.00
Factor stabilities
 Home G1→Home G3 1.01* .08 13.96 1.01* .08 13.33
 Home G3→Home G5 .88* .06 15.11 1.02* .09 11.63
 School G1→School G3 .75* .05 15.47 .94* .06 15.57
 School G3→School G5 .64* .05 13.56 .82* .07 12.33
Factor covariances
 Home G1 School G1 .09* .01 10.74 .08* .01 9.80
 Home G3 School G3 .02* .00 4.55 .01 .00 1.28
 Home G5 School G5 .02* .01 3.62 .01* .01 2.82
Factor variances
 Home G1 .11* .01 8.84 .10* .01 8.00
 School G1 .22* .02 12.05 .14* .02 8.34
 Home G3 .01* .00 2.71 .01 .01 1.33
 School G3 .12* .01 9.30 .02 .01 1.38
 Home G5 .02* .00 3.82 .02* .01 4.07
 School G5 .13* .02 8.96 .12* .05 2.39

Note. CBCL = Child Behavior Checklist; TRF = Teacher Report Form; COS = Classroom Observation System; M = Mother; F = Father; T = Teacher; O = Observer; G1 = 1st Grade; G3 = 3rd Grade; G5 = 5th Grade. The “=” symbol means a parameter is fixed.

*

p< .05.

Figure 2.

Figure 2

Structural equation model showing standardized estimates of stability coefficients and cross-context correlations for hyperactivity-impulsivity.

We observed a general tendency for the context-specific stability to increase when considering informant effects that may have been confounded with context effects (see Table 3). Specifically, when we compared the longitudinal factor models with and without informant effects, stabilities for the home context increased from β = .95 to β = .97 for 1st grade–3rd grade but slightly decreased from β = .92 to β = .90 for 3rd grade–5th grade. Stabilities for the school context increased from β = .73 to β = .95 for 1st grade–3rd grade, and from β = .64 to β = .69 for 3rd grade–5th grade.

An interesting trend was found for changes in the cross-context consistency (i.e., correlations between the home context and the school context latent factors) between models with and without informant effects. It appears that the cross-context correlations decrease over time while at the same time informant effects become less responsible for context-specificity as children age. Specifically, the correlation between the home and the school contexts increased from r = .62 to r = .71 in 1st grade and decreased from r = .37 to r = .25 in 5th grade. By estimating informant effects, the overlapping variances between the home and the school contexts notably increased (from r2= .38 to r2= .50) in 1st grade, whereas the common variance between the two contexts decreased in 5th grade (from r2 = .14 to r2= .06).Finally, as for longitudinal stability of informant effects, autocorrelations over time for the unique factors ranged from .21 to .28 for mother reports and from .13 to .24 for father reports.

Although we did not expect any specific gender differences in the structural relationships among the longitudinal measures of hyperactivity-impulsivity scores measured for the home and the school contexts, we explored possible differences between boys and girls in terms of the magnitude of within-context stabilities and cross-context consistencies using two-group structural equation models. In general, boys and girls did not seem to differ significantly with respect to within-context stabilities and cross-context consistencies.1

Discussion

We used multi-method multi-informant scores based on a large, representative national sample to model context-specific stability (i.e., home vs. school) and cross-context consistency in hyperactivity-impulsivity while considering informant effects that may confound both context-specific stability and cross-context consistency. Our findings suggested (1) stronger longitudinal stability of hyperactivity-impulsivity within a context compared to cross-context consistency, (2) higher within-context stability in the home context than the school context, and (3) systematic variances that were introduced by sharing a common measurement method (e.g., parent vs. teacher rating) and influenced the estimates of within-context stability and cross-context consistency.

Several findings of this study are notable as they add to the current knowledge base regarding context specificity and informant/method effects in child temperament. First, it is noteworthy that the magnitude of the estimates of cross-context consistency of hyperactive-impulsive behavior was exceeded by the magnitude of the estimates of temporal stability within each context (regardless of whether or not informant effects were considered). Research examining genetic and environmental sources of variance in hyperactivity-impulsivity commonly reports significant genetic influences. High heritability for hyperactivity-impulsivity also has been indicated regardless of measurement instrument, informant, and context (Derks et al., 2008; Kuntsi & Stevenson, 2001; Silberg et al., 1996). Particularly, high heritability has been indicated for hyperactivity-impulsivity across middle childhood (Derks et al., 2008). For example, heritability estimate of 88% was indicated in a recent study of 8-year-old twins for hyperactivity-impulsivity (McLoughlin, Ronald, Kuntsi, Asherson, & Plomin, 2007). These genetic influences are thought to represent, at least in part, a biological basis for temperament traits. However, our findings demonstrated that context-specific variance in behavior is systematic and not trivial even for the stable, trait-like temperament behavior of hyperactivity-impulsivity. Just as important, these context effects may only be modestly confounded with systematic effects of informants. The correlations between hyperactive-impulsive behavior in the home and the school contexts were significant and substantial (average r = .50) yet far from unity. In comparison to prior analyses (see Kim, Mullineaux, Allen, & Deater-Deckard, in press) that examined other temperament behaviors such as attention regulation (part of a broader effortful control dimension) and frustration/anger (part of a negative affectivity dimension), hyperactivity-impulsivity (part of a surgency/extraversion dimension) seems to have lower levels of consistency across diverse contexts than attention (average r = .68) but higher levels of consistency across contexts than anger (average r = .38). Taken together, the findings lend support for prior research suggesting that the particular pattern of context stability may depend on the particular aspect of temperament or personality in question (e.g., Majdandzic & van den Boom, 2007).

Second, greater temporal stability would be expected for the home-context because it is defined using the same informants over time (mothers and fathers) and represents the same stable home environment for the vast majority of children (e.g., Hanson, 1975). Furthermore, parental perceptions are based on a larger sample of daily experiences and their views of the child may be less likely to change over time (e.g., Majdandzic & van den Boom, 2007; Rothbart et al., 2001). In contrast, the school context is defined using different informants in the same “role” (i.e., teacher or observer) across the time points, and the classrooms obviously are different at each time point. From a methodological point view, lower levels of temporal stability for the school context factor compared to the home context factor may be in part due to the fact that the school context factor included observations of temperament that have shown to hold lower stability than questionnaires. Our data were consistent with such an expectation showing that the within-context stability was higher for the home context (average stability coefficient = .94) than for the school context (average stability coefficient = .69) before controlling for informant effects. The same pattern was found after controlling for informant effects (average stability coefficient = .94 for the home context and average stability coefficient = .82 for the school context). More generally, within-context stability estimates were higher than the stability estimates based on simple zero-order correlations, as we used the latent factor SEM approach that increased reliabilities of the variables by controlling for measurement errors as well as method variances due to informants. Therefore, these findings suggest that even after taking into consideration any systematic effects that result from a particular informant, children’s expression of hyperactivity/impulsivity was influenced by context-specific effects due to factors related to environmental settings (e.g., Kraemer et al., 2003).

Interestingly, there was evidence for differences in developmental changes in the stability of hyperactivity-impulsivity across the home and the school contexts. Specifically, the variances explained by autoregressive effects (e.g., regression effects of 1st grade home context factor on 3rd grade home context factor) were high and fairly comparable between the two contexts in 3rd grade (94% for home context and 90% for school context). However, in 5th grade, variances explained by autoregressive effects were fairly stable and still high for the home context (81%), whereas such variances dropped by almost half in magnitude for the school context (47%). This notable reduction may be due in part to modest decreases in measurement reliability especially in fifth grade, as seen in inter-rater reliabilities for the observation measures as well as inter-item correlations for the teacher ratings that composed the school context factor. Furthermore, when the variance attributable to potential informant effects were taken into consideration, the within-context stability increased notably for the school context behaviors (the average change in the stability coefficient = .13), but not for the home context (the average change in the stability coefficient = 0). Such findings seem to support the commonly assumed but rarely tested assumption that the magnitude of informant effects on estimates of stability of behavior will be stronger as a function of the number of informants being used. In other words, variances in stability estimates that can be attributed to informant bias are expected to be greater when larger numbers of informants are involved.

Third, there were significant informant effects that influenced stability within a context. The overall fit of the longitudinal factor model for hyperactivity-impulsivity significantly improved when the informant effects were estimated. Longitudinal stability within a context increased in general once the informant effects were taken into account, suggesting that children’s hyperactive-impulsive behaviors were substantially stable over time. Our results are in support of the recent findings of Cole and his colleagues who demonstrated that failure to include residual correlations that reflect shared method variance can change the meaning of the extracted latent variables and generate potentially misleading results (Cole, Ciesla, & Steiger, 2007).Our finding further implies that if we ignore informant effects in testing temporal stability of a psychological construct, we may obtain biased estimates of stability as they would beattenuated.

Fourth, the cross-context consistency between the home and the school contexts within a measurement time changed when we controlled for confounding informant effects. Specifically, the variances that overlapped between the two contexts increased from 38% to 50% in 1st grade. The common variances decreased from 14% to 6% in 5th grade after controlling for informant effects as well as context-specific autoregressive effects. These findings suggest that context-specific effects may become over- or under-estimated thus making children’s behaviors look less or more consistent across different contexts (e.g., Zeman & Shipman, 1996) when informant effects are not appropriately addressed. Our findings lend support for the argument by prior researchers who emphasized the importance of the psychological assessment of children being a multi-informant and multi-method process (Frazier & Youngstrom, 2006; Kendall & Morris, 1991). By collecting data from multiple informants and modeling explicitly the informant/method variances, researchers can better control the potential bias in estimating context-specific effects arising from confounds with informant effects.

One of the limitations of our secondary data analysis was that we were constrained by the available data—the items that were consistently available across all four occasions. This limited our ability to construct more comprehensive home and school context factors of hyperactivity-impulsivity. As the estimates of stability and consistency are expected to be dependent upon the method of assessment, it is important to note that the CBCL/TRF items are rather narrow descriptions of the child’s behavior, compared to measures of temperament impulsivity and activity level (e.g., CBQ). The limitation with data availability also prevented us from constructing latent factors that represent informant effects, because we had fewer than three indicators for each informant.

This limitation aside, the present study illustrates an approach for deriving multi-method composite scores while also considering context-specific behavior and controlling for systematic and transient informant effects. Our longitudinal factor model has broad applicability and is appropriate for use in other existing studies that measure child behavior across parent, teacher/caregiver, tester, observer, and self ratings. The findings demonstrate how “perspective” (i.e., characteristics of the informant) and “context” (i.e., factors related to circumstances that might influence the subject’s expression of the trait) interface with the other to influence testing of the stability of the “trait” (Kraemer et al., 2003).It would be informative to apply the approach we used to answer questions about personality stability later in life when self-reports are available. Researchers are encouraged to consider both context and informant effects in studying the stability of personality traits among individuals using self-reports and others’ ratings (such as family member and peer ratings).

Furthermore, the current findings illuminate potentially different patterns of stability and change for key indicators of Rothbart et al.’s (2001) three child temperament dimensions. Specifically, temperament behaviors of a negative affectivity dimension (represented by frustration/anger) seem to be the most specific to the influences and changes in the environmental settings (e.g., Kim et al., in press), followed by temperament behaviors of a surgency/extraversion dimension (represented by hyperactivity-impulsivity behaviors). Temperament behaviors of an effortful control dimension (represented by attention regulation) seem to be the least specific to the influences and changes in the child’s environment (e.g., Kim et al., in press).More broadly, our results affirm the significant context effects on individual differences in the development of temperament and personality, by illustrating significantly differential behaviors across different contexts that were not simply attributable to systematic informant bias with respect to hyperactivity-impulsivity – an aspect of temperament that is highly stable and easily observable by others. Our understanding of longitudinal stability in personality and behavioral development will be greatly enhanced by considering factors that contribute to context-specificity and context-consistency as well as informant-specific effects.

Acknowledgments

This work was supported by NICHD HD54481. The Study of Early Child Care and Youth Development was conducted by the NICHD Early Child Care Research Network, and was supported by NICHD through a cooperative agreement that calls for scientific collaboration between the grantees and the NICHD staff. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Child Health And Human Development or the National Institutes of Health.

Footnotes

1

We first tested measurement invariance by imposing equality constraints on the factor loadings across the two gender groups. The model fit did not degrade significantly compared to the model without the equality constraints, Δχ2 = 11.56, Δ df = 15, p< .05, and ΔCFI = .00, indicating measurement invariance of the latent factors of hyperactivity-impulsivity between the two gender groups. The next two models testing numeric invariance of within-context stabilities and cross-context consistencies between the two gender models failed to converge, indicating problems due to a non-positive definite variance-covariance matrix for the 3rd grade home and school context latent factors among girls. We examined 3rd grade scores among girls for possible causes of the non-positive definite matrix including linear dependency among the variables and non-normality of the variables (Wothke, 1993), yet we were not able to identify a source of the problem. Given the impression that these two-group latent factors models had over parameterization problems (Wothke, 1993), we tested two-group path analytic models (which involves manifest variables rather than latent factors) based on composite scores for the home and the school contexts. Three composite scores were calculated by averaging standardized z-scores of mother ratings and father ratings for the home context variables in 1st, 3rd, and 5th grades, and three composite scores were calculated by averaging standardized z-scores of teacher ratings and observer ratings for the school context variables at 1st, 3rd, and 5th grades. The results of the nested model comparisons indicated that imposing equality constraints on within-context stabilities did not significantly degrade the model fit compared to the unconstrained model (i.e., no equality constraints between the two gender groups), ΔCFI = .005. The difference in the model fit between the model with equality constraints on cross-context consistencies and the unconstrained model was barely significant with ΔCFI = .011.

References

  1. Achenbach TM. Manual for the Teacher’s Report Form and 1991 Profile. Burlington, VT: University of Vermont, Department of Psychiatry; 1991. [Google Scholar]
  2. Achenbach TM. Empirically based assessment and taxonomy: Applications to clinical research. Psychological Assessment. 1995;7:261–274. [Google Scholar]
  3. Arbuckle JL. Amos 7.0 Updated to the Amos User’s Guide. Chicago: Small Waters Corporation; 2006. [Google Scholar]
  4. Auerbach JG, Faroy M, Ebstein R, Kahana M, Levine J. The association of the dopamine D4 receptor gene (DRD4) and the serotonin transporter promotor gene (5-HTTLPR) with temperament in 12-month-old infants. Journal of Child Psychology and Psychiatry. 2001;42:777–783. doi: 10.1111/1469-7610.00774. [DOI] [PubMed] [Google Scholar]
  5. Bollen KA. Structural Equations with Latent Variables. New York: Wiley; 1989. [Google Scholar]
  6. Bornstein MH, Gaughran JM, Segui I. Multimethod assessment of infant temperament: Mother questionnaire and mother and observer reports evaluated and compared at five months using the Infant Temperament Measures. International Journal of Behavioral Development. 1991;14:131–151. [Google Scholar]
  7. Boyle MH, Pickles AR. Influence of maternal depressive symptoms on ratings of childhood behavior. Journal of Abnormal Child Psychology. 1997;25:399–412. doi: 10.1023/a:1025737124888. [DOI] [PubMed] [Google Scholar]
  8. Browne MW, Cudeck R. Alternative ways of assessing model fit. In: Bollen K, Long S, editors. Testing structural equation models. Beverly Hills, CA: Sage; 1993. pp. 136–162. [Google Scholar]
  9. Bullock M. Secondary analysis - extending the value of data. Journal of Applied Developmental Psychology. 2007;28:383. [Google Scholar]
  10. Buss A, Plomin R. A temperament theory of personality development. New York: Wiley; 1975. [Google Scholar]
  11. Buss A, Plomin R. Temperament: Early developing personality traits. Hillsdale, N.J: Erlbaum; 1984. [Google Scholar]
  12. Campbell DT, Fiske DW. Convergent and discriminant validation by multitrait-multimethod matrix. Psychological Bulletin. 1959;56:81–105. [PubMed] [Google Scholar]
  13. Carnicero JAC, Perez-Lopez J, Salinas MDCG, Martinez-Fuentes MT. A longitudinal study of temperament in infancy: Stability and convergence of measures. European Journal of Personality. 2000;14:21–37. [Google Scholar]
  14. Cheung GW, Rensvold RB. Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling. 2002;9:233–255. [Google Scholar]
  15. Cole DA, Ciesla JA, Steiger JH. The insidious effects of failing to include design-driven correlated residuals in latent-variable covariance structure analysis. Psychological Methods. 2007;12:381–398. doi: 10.1037/1082-989X.12.4.381. [DOI] [PubMed] [Google Scholar]
  16. De Los Reyes A, Kazdin AE. Informant discrepancies in the assessment of childhood psychopathology: A critical review, theoretical framework, and recommendations for further study. Psychological Bulletin. 2005;131:483–509. doi: 10.1037/0033-2909.131.4.483. [DOI] [PubMed] [Google Scholar]
  17. Derks EM, Hudziak JJ, Dolan CV, van Beijsterveldt TCEM, verhulst FC, Boomsma DI. Genetic and environmental influences on the relation between attention problems and attention deficit hyperactivity disorder. Behavior Genetics. 2008;38:11–23. doi: 10.1007/s10519-007-9178-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Eaves L, Rutter M, Silber JL, Shillady L, Maes H, Pickles A. Genetic and environmental causes of covariation in interview assessments of disruptive behavior in child and adolescent twins. Behavior Genetics. 2000;30:321–334. doi: 10.1023/a:1026553518272. [DOI] [PubMed] [Google Scholar]
  19. Frazier TW, Youngstrom EA. Evidence-Based Assessment of Attention-Deficit/Hyperactivity Disorder: Using Multiple Sources of Information. Journal of the American Academy of Child & Adolescent Psychiatry. 2006;45:614–620. doi: 10.1097/01.chi.0000196597.09103.25. [DOI] [PubMed] [Google Scholar]
  20. Goldsmith HH. Studying temperament via construction of the Toddler Behavior Assessment Questionnaire. Child Development. 1996;67:218–235. [PubMed] [Google Scholar]
  21. Goldsmith HH, Campos JJ. The structure of temperamental fear and pleasure in infants: A psychometric perspective. Child Development. 1990;61:1944–1964. [PubMed] [Google Scholar]
  22. Goldsmith HH, Buss KA, Lemery KS. Toddler and childhood temperament: Expanded content, stronger genetic evidence, new evidence for importance of environment. Developmental Psychology. 1997;33:891–905. doi: 10.1037//0012-1649.33.6.891. [DOI] [PubMed] [Google Scholar]
  23. Goldsmith HH, Lemery KS, Essex MJ. Temperament as a liability factor for childhood behavioral disorders: The concept of liability. In: DiLalla L, editor. Behavior genetics principles: Perspectives in development, personality, and psychopathology. Washington, DC: American Psychological Association; 2004. pp. 19–39. [Google Scholar]
  24. Green SM, Loeber R, Lahey BB. Stability of mothers' recall of the age of onset of their child's attention and hyperactivity problems. Journal of the AmericanAcademy of Child & Adolescent Psychiatry. 1991;30:135–137. doi: 10.1097/00004583-199101000-00020. [DOI] [PubMed] [Google Scholar]
  25. Hanson RA. Consistency and stability of home environmental measures related to IQ. Child Development. 1975;46:470–480. [Google Scholar]
  26. Hayden EP, Klein DN, Durbin CE. Parent reports and laboratory assessments of child temperament: A comparison of their associations with risk for depression and externalizing disorders. Journal of Psychopathology and Behavioral Assessment. 2005;27:89–100. [Google Scholar]
  27. Jaccard J, Guilamo-Ramos V. Analysis of variance frameworks in clinical child and adolescent psychology: Issues and recommendations. Journal of Clinical Child and Adolescent Psychology. 2002;31:130–146. doi: 10.1207/S15374424JCCP3101_15. [DOI] [PubMed] [Google Scholar]
  28. Karp J, Serbin LA, Stack DM, Schwartzman AE. An observational measure of children’s behavioural style: Evidence supporting a multi-method approach to studying temperament. Infant and Child Development. 2004;13:135–158. [Google Scholar]
  29. Kendall PC, Morris RJ. Child therapy: Issues and recommendations. Journal of Consulting and Clinical Psychology. 1991;59:777–784. doi: 10.1037//0022-006x.59.6.777. [DOI] [PubMed] [Google Scholar]
  30. Kenny DA, Kashy DA. Analysis of the multitrait-multimethod matrix by confirmatory factor analysis. Psychological Bulletin. 1992;112:165–172. [Google Scholar]
  31. Keogh BK. Temperament in the classroom: Understanding individual differences. Baltimore: Paul H. Brookes; 2004. [Google Scholar]
  32. Kim J, Mullineaux PY, Allen B, Deater-Deckard K. Longitudinal studies of stability in attention span and anger: Context and informant effects. Journal of Personality. doi: 10.1111/j.1467-6494.2010.00621.x. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kraemer HC, Measelle JR, Ablow JC, Essex MJ, Boyce WT, Kupfer DJ. A new approach to integrating data from multiple informants in psychiatric assessment and research: Mixing and matching contexts and perspectives. American Journal of Psychiatry. 2003;160:1566–1577. doi: 10.1176/appi.ajp.160.9.1566. [DOI] [PubMed] [Google Scholar]
  34. Kuntsi J, Stevenson J. Psychological mechanisms in hyperactivity: II the role of genetic factors. Journal of Child Psychology and Psychiatry. 2001;42:211–219. [PubMed] [Google Scholar]
  35. Lemery KS, Essex MJ, Smider NA. Revealing the relation between temperament and behavior problem symptoms by eliminating measurement confounding: Expert ratings and factor analyses. Child Development. 2002;73:867–882. doi: 10.1111/1467-8624.00444. [DOI] [PubMed] [Google Scholar]
  36. Lengua LJ, West SG, Sanderl IN. Temperament as a predictor of symptomatology in children: Addressing contamination of measures. Child Development. 1998;69:164–181. [PubMed] [Google Scholar]
  37. Levy F, McStephen M, Hay DA. The diagnostic genetics of ADHD symptoms and subtypes. In: Levy F, Hay DA, editors. Attention, genes, and ADHD. Philadelphia: Taylor & Francis; 2001. pp. 35–57. [Google Scholar]
  38. Majdandzic M, van den Boom D. Multimethod longitudinal assessment of temperament in early childhood. Journal of Personality. 2007;75:121–168. doi: 10.1111/j.1467-6494.2006.00435.x. [DOI] [PubMed] [Google Scholar]
  39. Mangelsdorf SC, Schoppe SJ, Buur H. The meaning of parent reports: A contextual approach to the study of temperament and behaviour problems in childhood. In: Molfese VJ, Molfese DL, editors. Temperament and Personality Development Across the Life Span. London: Erlbaum; 2000. pp. 121–140. [Google Scholar]
  40. McArdle JJ. Current directions in structural factor analysis. Current Directions in Psychological Science. 1996;5:11–18. [Google Scholar]
  41. McArdle JJ, Nesselroade JR. Using multivariate data to structure developmental change. In: Cohen SH, Reese HW, editors. Life-Span Developmental Psychology: Methodological Contributions. Hillsdale, NJ: Erlbaum; 1994. pp. 223–267. [Google Scholar]
  42. McLoughlin G, Ronald A, Kuntsi J, Asherson P, Plomin R. Genetic support for the dual nature of attention deficit hyperactivity disorder: Substantial genetic overlap between the inattentive and hyperactive-impulsive components. Journal of Abnormal Psychology. 2007;35:999–1008. doi: 10.1007/s10802-007-9149-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Mullineaux PY, Deater-Deckard K, Petrill SA, Thompson LA, DeThorne LS. Temperament in middle childhood: A behavioral genetic analysis of fathers' and mothers' reports. Journal of Research in Personality. 2009;43:737–746. doi: 10.1016/j.jrp.2009.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. NICHD SECCYD. Manuals of Operation. 2006 http://secc.rti.org/
  45. Nigg JT, Hinshaw SP, Huang-Pollock C. Disorders of attention and impulse regulation. In: Cicchetti D, Cohen DJ, editors. Developmental psychopathology, Vol 3: Risk, disorder, and adaptation. 2. Hoboken, NJUS: John Wiley & Sons Inc; 2006. pp. 358–403. [Google Scholar]
  46. Nunnally JC, Bernstein IH. Psychometric theory. 3. New York: McGraw-Hill; 1994. [Google Scholar]
  47. Olson SL, Schilling EM, Bates JE. Measurement of impulsivity: Construct coherence, longitudinal stability, and relationship with externalizing problems in middle childhood and adolescence. Journal of Abnormal Child Psychology. 1999;27:151–165. doi: 10.1023/a:1021915615677. [DOI] [PubMed] [Google Scholar]
  48. Pedhazur EJ, Schmelkin LP. Measurement, design, and analysis: An integrated approach. Hillsdale, N.J: Erlbaum; 1991. [Google Scholar]
  49. Pelham WEJ, Fabiano GA, Massetti GM. Evidence-Based Assessment of Attention Deficit Hyperactivity Disorder in Children and Adolescents. Journal of Clinical Child and Adolescent Psychology. 2005;34:449–476. doi: 10.1207/s15374424jccp3403_5. [DOI] [PubMed] [Google Scholar]
  50. Quay HC. The psychobiology of undersocialized aggressive conduct disorder. Development and Psychopathology. 1993;5:165–180. [Google Scholar]
  51. Rothbart MK, Ahadi SA, Evans DE. Temperament and personality: Origins and outcomes. Journal of Personality & Social Psychology. 2000;78:122–135. doi: 10.1037//0022-3514.78.1.122. [DOI] [PubMed] [Google Scholar]
  52. Rothbart MK, Ahadi SA, Hershey KL, Fisher P. Investigations of temperament to three to seven years: The children's behavior questionnaire. Child Development. 2001;75:1394–1408. doi: 10.1111/1467-8624.00355. [DOI] [PubMed] [Google Scholar]
  53. Rothbart MK, Bates JE. Temperament. In: Damon W, Eisenberg N, editors. Handbook of Child Psychology: Vol. 3, Social, Emotional and Personality Development. 5. New York: Wiley; 1998. pp. 105–176. [Google Scholar]
  54. Ruff HA, Capozzoli M, Weissberg R. Age, individuality, and context as factors in sustained visual attention during the preschool years. Developmental Psychology. 1998;34:454–464. doi: 10.1037//0012-1649.34.3.454. [DOI] [PubMed] [Google Scholar]
  55. Rushton JP, Brainerd CJ, Pressley M. Behavioral development and construct validity: The principle of aggregation. Psychological Bulletin. 1983;94:18–38. [Google Scholar]
  56. Sanson A, Hemphill SA, Smart D. Connections between temperament and social development: A review. Social Development. 2004;13:142–170. [Google Scholar]
  57. Seifer R, Sameroff AJ, Barrett LC, Krafchuk E. Infant temperament measured by multiple observations and mother report. Child Development. 1994;65:1478–1490. doi: 10.1111/j.1467-8624.1994.tb00830.x. [DOI] [PubMed] [Google Scholar]
  58. Silberg JL, Rutter ML, Meyer J, Maes H, Hewitt J, Simonoff E, et al. Genetic and environmental influences on the covariation between hyperactivity and conduct disturbance in juvenile twins. Journal of Child Psychology and Psychiatry. 1996;37:803–816. doi: 10.1111/j.1469-7610.1996.tb01476.x. [DOI] [PubMed] [Google Scholar]
  59. Strelau J. The concept and status of trait in research on temperament. European Journal of Personality. 2001;15:311–325. [Google Scholar]
  60. Strelau J, Zawadzki B, Piotrowska A. Temperament and intelligence: A psychometric approach to the links between both phenomena. In: Collis JM, Messick S, editors. Intelligence and personality: Bridging the gap in theory and measurement. Mahwah, NJ: Erlbaum; 2001. pp. 61–78. [Google Scholar]
  61. Thomas A, Chess S. Temperament and development. New York: Brunner/Mazel; 1977. [Google Scholar]
  62. Veenstra R, Lindenberg S, Oldehinkel AJ, De Winter AF, Verhulst FC, Ormel J. Prosocial and antisocial behavior in preadolescence: Teachers’ and parents’ perceptions of the behavior of girls and boys. International Journal of Behavioral Development. 2008;32:243–251. [Google Scholar]
  63. Winer BJ. Statistical principles in experimental design. 2. McGraw-Hill; New York: 1971. [Google Scholar]
  64. Wothke W. Nonpositive definite matrices in structural modeling. In: Bollen KA, Long JS, editors. Testing structural equation models. Newbury Park, CA: Sage; 1993. pp. 256–93. [Google Scholar]
  65. Zeman J, Shipman K. Children’s expression of negative affect: Reasons and Methods. Developmental Psychology. 1996;32:842–849. [Google Scholar]

RESOURCES