Abstract
Children vary widely in the rate at which they acquire words—some start slow and speed up, others start fast and continue at a steady pace. Do early developmental variations of this sort help predict vocabulary skill just prior to kindergarten entry? This longitudinal study starts by examining important predictors (SES, parent input, child gesture) of vocabulary growth between 14 and 46 months (n=62), and then uses growth estimates to predict children's vocabulary at 54 months. Velocity and acceleration in vocabulary development at 30 months predicted later vocabulary, particularly for children from low socioeconomic backgrounds. Understanding the pace of early vocabulary growth thus improves our ability to predict school readiness, and may help identify children at risk for starting behind.
Children vary widely in the rate at which their vocabulary grows during early childhood (Fenson, Dale, Reznick, Bates, Thal & Pethick, 1994). It is plausible that children's early environments and social interactions influence the course of language acquisition (Braine, 1994). This theoretical perspective has led to many studies exploring the relation between parent and family factors (including parental talk to children and family socioeconomic status) and child vocabulary growth, and positive relations have been found between the two (see Hoff, 2006 for a review). Thus, previous research strongly suggests that variations in early social-interactive environments are associated with variations in vocabulary growth rates, and highlights the importance of understanding the role that parent and family factors play in child language development.
It is important to understand not only the causes of variation in vocabulary growth rates, but also the consequences of acquiring vocabulary at different rates. Children's oral language skills when they enter kindergarten predict their later literacy skills and school success (Dickinson & Tabors, 2001; Duncan, Dowsett, Claessens et al., 2007; Snow, Burns & Griffin, 1998)––those who start behind tend to stay behind (Stanovich, 1986). The question we ask is whether differences in the rate at which vocabulary is acquired in the earliest stages predict children's vocabulary skills at school entry.
The obvious way to approach this question is to look to the period prior to school entry. We could, for example, focus on background factors and child data gathered at one moment in time during the pre-school period and use this information to predict academic skills in the early elementary years. But information gathered at a single point in time can be misleading. Take, for example, two children who have the same vocabulary size at 30 months. Using observed vocabulary at 30 months to predict later language skill would lead us to predict identical vocabularies for the two children at school entry. However, if we knew that one of the two children was increasing her vocabulary at a faster rate than the other (i.e., had a greater slope at 30 months), we might predict that the first child would have a larger vocabulary at school entry than the second. In this case, looking at rate of change and/or rate of acceleration in vocabulary growth would be useful in predicting vocabulary size at school entry.
The goals of our study are threefold. First, we aim to understand the role that parent, family and child factors play in explaining variation in child vocabulary acquisition across the early childhood period. Second, we aim to make use of our detailed longitudinal data to examine whether these early vocabulary growth trajectories can help us predict the language skills children bring with them to school. Third, we address the more practical issue of how useful early language data can be in predicting later vocabulary skill. Thus, this study extends the current body of research by investigating the role of early vocabulary growth in predicting later vocabulary skill.
Why study growth trajectories? Developmental phenomena, by definition, involve change. Questions about how and why abilities develop are therefore often of central interest. Not surprisingly, attempts to understand change over time in vocabulary growth have been part of the language acquisition literature for decades. The previous work in this area has taken two primary approaches. First, a large body of work has asked whether and when children experience a vocabulary spurt early in development. Researchers have hypothesized that around the age of 18 months (or around a vocabulary size of 50 words) children experience a marked increase in rate of vocabulary learning (Bloom, 1973; Nelson, 1973). Potential underlying mechanisms behind this phenomenon have been extensively investigated (e.g., Gopnik & Meltzoff, 1987; Reznick & Goldfield, 1992), as have individual differences in the vocabulary spurt (Goldfield & Reznick, 1990). More recently, however, longitudinal methods for developmental research have progressed, and research highlighting the importance of sampling procedures (Adolph, Tobinson, Young & Gill-Alvarez, 2008) and measurement in studying change over time has led to sophisticated work that challenges the existence of a vocabulary growth spurt (Ganger & Brent, 2004; McMurray, 2007).
A second approach has been to more broadly model growth in vocabulary skill using frequent, intensive observation of parent-child interactions. This work has advanced our understanding of the course of vocabulary acquisition over discrete periods of development and has uncovered important factors related to vocabulary growth. For example, Huttenlocher and colleagues (Huttenlocher, Haight, Bryk, Seltzer, & Lyons, 1991) emphasized the important positive relation between parental talk to children and child vocabulary growth, a finding replicated for children who have experienced unilateral pre- or peri-natal brain injury (Rowe, Levine, Fisher & Goldin-Meadow, 2009). Hart and Risley (1992) and Hoff (2003) showed that children of high socioeconomic status (SES) displayed more rapid vocabulary growth than children from lower SES families. Recent work within a low-income sample found relationships between other parental factors (such as language and literacy skills and maternal depression) and child vocabulary growth (Pan, Rowe, Singer & Snow, 2005). Many of these studies make use of individual growth modeling (Rogosa, Brandt, & Zimowski, 1982), often through hierarchical linear modeling (Raudenbush & Bryk, 2002), a powerful statistical method for addressing questions about change over time using a continuous outcome. This approach allows researchers to model individual, within-person change over time in vocabulary development, and to study predictors of between-person differences in change over time. Growth modeling is appropriate for constructs such as vocabulary, where the unit of measurement remains stable over time (McCartney, Burchinal, & Bub, 2006; Singer & Willett, 2003).
In the current study, we are not concerned with the vocabulary spurt per se. Our goal is to understand whether and to what extent aspects of changes in early language are linked to vocabulary skills at entry to kindergarten. To that end, our first aim is to fit the best model of early vocabulary growth to our longitudinal data of children's cumulative vocabulary production across the 14-46 month period, using predictors of individual differences in early growth. In this way, we follow the second body of research mentioned above. However, we then take this work one important step further by using the growth estimates from the early longitudinal model to predict later vocabulary skills at 54 months, just prior to kindergarten entry. This strategy allows us to explore the consequences of variation in early vocabulary growth rates for later vocabulary achievement. According to our theoretical model, children's personal parameters of early vocabulary growth carry important information about their status at school entry. We cannot directly observe these child-specific parameters and must instead use sample estimates of the parameters as predictors. Although this approach has the potential to lead to biased estimates of the association between the parameters of early growth and later outcomes, we introduce a strategy designed to eliminate bias of this sort (see the methods section). Our approach thus allows us to address an important developmental and methodological question: Does having information about the trajectory that children follow in their vocabulary acquisition (as opposed to having an isolated snapshot of their pre-school performance) help predict where children will end up several years later?
We hypothesize that the path children have followed to arrive at a particular level of vocabulary acquisition at a given age may be telling about their future linguistic abilities. Even if there is no true “vocabulary spurt,” there clearly is substantial growth in vocabulary size during the second and third years of life (e.g., Fenson et al., 1994). Although the shape of growth often depends on how vocabulary is measured (Pan et al., 2005), on average, early growth tends to be quadratic in nature, with children acquiring more words at a faster rate over time during the second year of life (Ganger & Brent, 2004; Huttenlocher et al., 1991). Importantly, there is wide variation in children's early vocabulary growth rates (e.g., Fenson et al., 1994). For example, some children start out very slow and then increase rapidly later on, while others start out strong and increase at a relatively constant rate over time. Our hypothesis is that using individual children's growth rates will help us predict later vocabulary size over and above their status at one point in time. This is a plausible hypothesis because the rates of growth likely contain more information about the child's language acquisition potential than their ability at one point in time. If so, we can then explore factors that are responsible for that growth, with an eye toward eventually understanding how to manipulate those factors to improve vocabulary, particularly in children from low socioeconomic backgrounds. We test this hypothesis by first modeling growth in cumulative vocabulary in a sample of 62 children visited on nine occasions (every 4 months) between 14 and 46 months. We examine predictors of growth during this period, using our knowledge of the literature and previous work with the sample. We include three primary background predictor variables gathered at the first (14-month) visit.
The first predictor is socioeconomic status (SES), measured as parent education and family income. Studies within and across socioeconomic groups have consistently found strong relations between socioeconomic measures and child vocabulary skill. Children from families with higher incomes and/or whose parents are more educated have larger vocabularies, on average, than do their peers from lower income homes with less-educated parents (Hart & Risley, 1992; 1995; Hoff, 2003a; Hoff, 2003b; Pan et.al., 2005). Importantly, some portion of this relation between SES and child vocabulary skill is mediated by the speech that parents offer children on a day-to-day basis (Hoff, 2003b). Thus, SES relates not only to child language outcomes, but also to parent input. For example, on average, parents from higher SES backgrounds use more words, more diverse vocabulary, and more complex syntax than do parents from lower SES backgrounds (Hoff, 2003b; Hoff-Ginsberg, 1991; Huttenlocher, Vasilyeva, Vevea, & Hedges, 2007; Lawerence & Shipley, 1996; Pan et. al., 2005; Rowe, 2008).
The second predictor in the current study is parent input, particularly vocabulary input. To acquire specific vocabulary words children need to be exposed to those words. Of course, word learning is not as simple as mere exposure, yet exposure is a necessary component of the process (Bloom, 2002). The number of different words children hear is thus a reasonable proxy for the vocabulary exposure children experience in their home environments. Note that, in measuring vocabulary input, it is impossible to separate the quantity of talk from the diversity of talk, as parents who talk more are likely to use more different vocabulary words (Hoff & Naigles, 2002). Thus, measuring the number of different vocabulary words a child hears provides information on not only the amount of input the child is getting but also the diversity of the input. Indeed studies show that children who experience a more diverse vocabulary in their input develop larger vocabularies themselves (Weizman & Snow, 2001). Moreover, when child vocabulary growth is examined as the outcome measure of interest, the number of different words parents use with their children (parent vocabulary input) is a stronger input predictor than overall talk or syntactic complexity (Huttenlocher, Waterfall, Vasilyeva, Vevea & Hedges, in press; Pan et al., 2005). We thus use parent vocabulary input as a predictor in the current study.
The final predictor is the child's early gesture use (14 months) which has also been found to be a strong predictor of vocabulary skill. For example, we can predict the specific lexical items that will enter a child's vocabulary by looking at the objects to which a child gestured during earlier interactions (Iverson & Goldin-Meadow, 2005). As another example, ‘gesture vocabulary’ (as measured by the number of items the child gestures to or about) early in development (14-18 months) predicts later vocabulary skill, measured either as child production (i.e.,word types used during longitudinal parent-child interactions, Sauer, Levine & Goldin-Meadow, 2010) or as vocabulary comprehension (the Peabody Picture Vocabulary Test, Rowe, Ozcaliskan & Goldin-Meadow, 2008; Rowe & Goldin-Meadow, 2009a,b). Importantly, even though child gesture is obviously a child characteristic, children are not necessarily born high or low gesturers. In fact, SES also relates to early child gesture, a relation that is partially mediated by the gestures parents use in interaction with their children (Rowe & Goldin-Meadow, 2009b). Thus, children's early gesture use likely encompasses some combination of child ability or motivation to communicate and exposure to a gesture-rich communicative environment.
SES, parent vocabulary input, and early child gesture vocabulary are therefore three important early predictors to consider in our analysis of child vocabulary growth. We also consider the role of child gender, as studies have found an early advantage in language development for girls (Huttenlocher et al., 1991). After presenting our growth models with incorporated predictors, we use the growth estimates from these models to predict children's vocabulary skill at 54 months, just prior to kindergarten entry. Our goal is to determine which aspects of vocabulary growth are related to later vocabulary skill, controlling for SES, parent input, and child gesture.
Finally, we turn to a more practical question. We explore the extent to which child vocabulary data collected early in life predicts vocabulary skill just prior to kindergarten entry. Because the oral language skills children bring with them to kindergarten have a strong relation to their later success in school (e.g., Snow, Burns & Griffin, 1998), it is essential to understand how early we can collect measures of child language skill that will reliably predict abilities at kindergarten entry. Thus, we use the child language data collected during the first half of our study (e.g., between 14-30 months) to examine how well various measures of early vocabulary skill predict vocabulary prior to kindergarten entry (PPVT scores at 54 months).
Specifically, we ask the following questions:
What is the best longitudinal model that describes child vocabulary growth between 14 and 46 months, and how much variation is there in growth rates?
What is the role of SES, parent input, and child gesture (our early 14 month predictors) in vocabulary growth?
Which aspects of vocabulary development from the longitudinal model best predict children's vocabulary skill at 54 months? In particular, do we improve our ability to predict later vocabulary skill by including a child's velocity and acceleration in early vocabulary acquisition in the model?
To what extent can vocabulary data collected very early in life (1-2.5 years) predict children's vocabulary skill at 54 months?
Method
Participants
The participants are 62 children and their primary caregivers. These families were recruited via direct mailings to roughly 5,000 families living in targeted zip codes and an advertisement in a free, monthly parent magazine. Interested parents were interviewed about background characteristics and a final sample of 64 families was selected to be representative of the greater Chicago area in terms of ethnicity and income. Annual income levels of the sample varied from less than $15,000 to over $100,000 (M = $59,476), and children come from more than five different ethnic groups. On average, parents had 15.7 years of education (16 years is equivalent to a college degree) when they entered the study; however, the range was from 10 years (less than high school degree) to 18 years (Master's degree or more). Two of the original 64 families dropped out of the study early on; thus the sample considered here contains 62 families. Eight of the families are single-parent families (all mothers). Thirty-two of the children are boys and 30 are girls. All families are raising their children as monolingual English speakers. Table 1 presents the distribution of families by race/ethnicity and income, and makes it clear that income and race/ethnicity are confounded in our sample. For this reason, and because we have very uneven distributions of race/ethnicity, we do not consider race/ethnicity as a variable in our analyses.
Table 1. Demographic characteristics of the parents.
Family Income | Parents' Ethnicity | Parents' Race | |||
---|---|---|---|---|---|
Hispanic* | Black | White | Two or more races | Total | |
Less than $15,000 | 1 | 3 | 1 | 5 | |
$15,000 - $34,999 | 4 | 4 | 4 | 1 | 13 |
$35,000 - $49,999 | 1 | 2 | 4 | 1 | 8 |
$50,000 - $74,999 | 1 | 1 | 8 | 2 | 12 |
$75,000 - $99,999 | 1 | 1 | 7 | 1 | 10 |
$100,000 or more | 1 | 1 | 11 | 1 | 14 |
| |||||
Total | 9 | 12 | 35 | 6 | 62 |
Those of Hispanic ethnicity are only included once in this table
Procedure
Our goal was to get a representative picture of what the typical home environment is like for these children. Thus, parents and children were visited in the home every four months between child ages 14 and 46 months, resulting in nine visits covering a 32 month period. At each home visit, we videotaped parents and children interacting for 90 minutes “as they normally would.” That is, we did not bring any toys with us and instead asked the families to do what they would typically do. We used this research strategy to capture a typical interaction for each family. In seven of the families, the mother and father share the primary care-giving role; as a result, these families sometimes participated in triadic rather than dyadic interactions and parents sometimes alternated interacting with their child. Siblings were present during some of the sessions, but neither their speech nor speech directed to them was included in analysis. The particular activities in which parents and children engaged varied, but a typical session included time playing with toys, reading a book, and eating a meal or a snack. On some occasions, the session was not quite 90 minutes (less than five percent of the visits); for these visits, we pro-rated the language data based on the number of minutes missing.
All speech and gestures on the videotapes were transcribed. The unit of transcription was the utterance, defined as any sequence of words and/or gestures preceded and followed by a pause, a change in conversational turn, or a change in intonational pattern. All dictionary words, as well as onomatopoeic sounds (e.g., woof-woof) and evaluative sounds (e.g., uh-oh), were counted as words. Children and parents produced gestures indicating objects, people, or locations in the surrounding context (e.g. point at dog), gestures depicting attributes or actions of concrete or abstract referents via hand or body movements (e.g., flapping arms to describe a flying bird), and gestures having pre-established meanings associated with particular gesture forms (e.g., shaking the head “no”). Other actions or hand movements that involved manipulating objects (e.g., turning the page of a book) or were part of a ritualized game (e.g., itsy-bitsy spider) were not considered gestures. Transcription reliability was established by having a second coder transcribe 20 percent of the videotapes; reliability was assessed at the utterance level and was achieved when coders agreed on 95 percent of transcription decisions.
The children were again visited in the home prior to kindergarten entry (age 54 months) and were given a receptive vocabulary assessment as described in the next section.
Measures
Vocabulary skill prior to kindergarten entry
At 54 months, children were administered the Peabody Picture Vocabulary Test (PPVT–III; Dunn & Dunn, 1997) at home. The PPVT is a widely used measure of receptive vocabulary. In this task, children hear a word and are asked to point to the picture (out of 4 options) that matches the word they heard. We were able to obtain PPVT scores for 55 of the 62 children at age 54 months. Thus, the regression analysis predicting PPVT is based on a sample of 55 children, as we chose not to impute data for our outcome measure. There were no differences between the children with PPVT scores (n=55) and the children without PPVT scores (n=7) on SES, parent input or child gesture measures. The mean standardized score for the sample was 112 (SD = 18.3).
Vocabulary growth during early childhood
Children's vocabulary growth across the nine sessions from 14 to 46 months was measured as their cumulative word types over time. Automated analyses of the transcripts yielded data on the number of different word types (i.e., number of different intelligible word roots) produced by each child at each session. Several decisions were made as to what constituted a word type. Morphologically inflected variants of words (e.g., run, running) were considered a single type. Words produced in imitation of the mother were included in the corpus of child word types, as were words that the parent or child produced while reading. The number of cumulative word types produced at each age served as our measure of vocabulary growth. That is, if a child said a specific word at age 14 months (i.e., “cat”), the child was given credit for producing “cat” from that point on, whether the word was produced again or not. This method of measuring vocabulary growth has been used previously in longitudinal analyses (Huttenlocher et al., 1991) and provides less noisy estimates of growth over time. As in most longitudinal studies, families occasionally missed a data collection period. The sample of 62 families visited over 9 time points resulted in 558 possible visits of which only 8 were missing. Thus, less than 2 percent of the visits were missed. Because we measured cumulative vocabulary, we imputed values of the “new word types added” for the missing sessions using regression equations from the full sample at the missing age, with age and age-squared as predictors. The imputed value for the missing session was then added to the child's previous cumulative word types to generate the new cumulative word type value for that child. We did not impute values if the missing data point was the final session(s) for that child. In those cases, we left the data as missing and only used the child's data up until that point in the growth models. Overall, only 1.4% of the vocabulary growth data were imputed.
Early (14 month) Predictors
Socioeconomic Status (SES)
SES was measured as the primary caregiver's education level and family income and was calculated categorically from parents' responses to a questionnaire at or before the 14 month visit. For purposes of analyses, the categories were transformed into a continuous scale of years of education and US Dollars. On this scale, parental education ranged from 10 to 18 years, with 12 years equivalent to receiving a high school degree or GED (M = 15.7, SD =2.3). The average family income level ranged from less than $15,000 per year to over $100,000 (M = $59,476, SD = $31,353). In the families where both parents were primary caregivers, we used the education level of the mother; we used this strategy because we were missing father's education level for two of the relevant eight families (and for some others), and because there was a strong positive correlation between mother's and father's education for families where we did have data on both parents (r = 0.62, p < 0.001).
For our full sample, education and income were positively related to one another (r = 0.47, p < 0.001). We consider them here separately at first and then combine them in a composite score to include as predictors in growth models and regression analyses. The composite was generated using Principal Components Analysis (PCA). The first principal component weighted education and income positively and equally. This component accounted for approximately 73 percent of the original variance. The mean score of the composite is 0 (SD = 1.0). A family that scores high on the SES composite thus has a high annual income and a primary caregiver with a high level of education.
Parent input
Parent word types produced at child age 14-months was used as the input measure predicting child language growth because previous work has shown that this measure is a strong input predictor of child vocabulary (Huttenlocher et al, in press; Pan et al., 2005; Weizman & Snow, 2001). We calculated parent word types in exactly the same way as child word types, with one exception. Since numerous studies have shown that variation in parent input remains fairly stable over time (Huttenlocher et al., 2007; Rowe, Pan & Ayoub, 2005), we used parent word types at the first visit as our input measure. Our parent input measure was therefore not cumulative and, in this sense, differed from our child measure. When two parents were present for the interaction, we used the total number of different words produced by both parents, as that was the talk to which the child was exposed. There were 6 families with two parents present. Input averaged 476 word types for these 6 families, compared to 367 word types for the remaining 56 families.
Child Gesture
The gesture measure of interest at 14 months is Gesture types, or the number of different meanings each child conveyed in gesture. We used this particular measure as an early predictor because it is the gesture measure found to be most related to later vocabulary skills (Rowe & Goldin-Meadow, 2009a; Rowe, Ozcaliskan & Goldin-Meadow, 2008). In calculating gesture types, we followed guidelines in Goldin-Meadow and Mylander (1984) and Iverson and Goldin-Meadow (2005). We counted each deictic gesture that indicated a different object as a distinct gesture type. For example, if a child pointed to his dog 10 times during an interaction, those 10 points would count as 1 gesture type (dog) and 10 gesture tokens. We also counted each conventional or representational gesture associated with a different meaning as a distinct gesture type (e.g., head shake conveying the meaning no; arm flap conveying the meaning flying bird).
Preliminary analyses determined that child gender did not relate to vocabulary growth and therefore was not included in further analyses except when checking for interaction effects. Descriptive statistics for all variables examined here are presented in Table 2.
Table 2. Descriptive Statistics.
Variable | n | Mean | SD | Min | Max |
---|---|---|---|---|---|
Child Age (months) | |||||
Visit 1 | 62 | 14.22 | 0.44 | 13.52 | 16.10 |
Visit 2 | 62 | 18.07 | 0.31 | 17.48 | 19.10 |
Visit 3 | 62 | 22.13 | 0.34 | 21.48 | 23.29 |
Visit 4 | 62 | 26.08 | 0.27 | 25.58 | 27.06 |
Visit 5 | 62 | 30.11 | 0.36 | 29.58 | 31.52 |
Visit 6 | 62 | 34.12 | 0.31 | 33.52 | 35.35 |
Visit 7 | 61 | 38.17 | 0.42 | 37.00 | 39.55 |
Visit 8 | 60 | 42.17 | 0.38 | 41.55 | 43.55 |
Visit 9 | 59 | 46.27 | 0.44 | 45.35 | 47.39 |
Child Cumulative Word Types | |||||
Visit 1 | 62 | 13.33 | 13.98 | 0 | 50 |
Visit 2 | 62 | 47.36 | 38.03 | 1 | 183 |
Visit 3 | 62 | 127.49 | 85.92 | 4 | 359 |
Visit 4 | 62 | 238.32 | 130.33 | 6 | 574 |
Visit 5 | 62 | 355.84 | 154.92 | 18 | 699 |
Visit 6 | 62 | 469.61 | 171.45 | 44 | 835 |
Visit 7 | 61 | 588.25 | 179.95 | 127 | 962 |
Visit 8 | 60 | 690.07 | 193.05 | 235 | 1121 |
Visit 9 | 59 | 789.65 | 206.32 | 328 | 1237 |
Parent Education (yrs) | 62 | 15.64 | 2.28 | 10.00 | 18.00 |
Family Income ($) | 62 | 59,476 | 31,353 | 7,500 | 100,000 |
Parent word types (Visit 1) | 62 | 377.24 | 133.31 | 60 | 708 |
Child gesture types (Visit 1) | 62 | 22.10 | 12.97 | 4 | 58 |
PPVT 54 months | 55 | 111.89 | 18.27 | 69 | 160 |
Methodological Approach
To address the questions at hand, we present our “theoretical” statistical model, which we then estimate empirically. Our theory is that, during early childhood, a child's trajectory of vocabulary growth predicts important aspects of later linguistic functioning. In this longitudinal data set ranging from 14 to 46 months, we use child exact age at each visit as our measure of time and center age at 2.5 years (30 months), which is the midpoint of the age range under investigation. We hypothesize that a child's status, velocity, and acceleration in expressive vocabulary at age 2.5 years can help us predict the size of the child's vocabulary 2 years later. The first part of our larger statistical model is a two-level model that defines child-specific parameters of early vocabulary growth. The second part is a prediction model in which these child-specific growth parameters are explanatory variables and the vocabulary skills prior to kindergarten entry are outcomes.
The two-level model for child-specific growth
The first part of our larger statistical model has two levels: a “level-1 model” that accounts for variation in repeated measures within each child; and a “level-2 model” that represents variation between children.
At level-1 (within children), we represent the vocabulary trajectory as a cubic model, so we have, for each child i at time t:
(1) |
where ati is the age of child i at time t, π0i is child i's status at age 2.5 years, π1i is child i's velocity at age 2.5 years, and π2i is child i's acceleration at age 2.5 years, and π3i is child i's cubic change at age 2.5 years. The residual eti represents that portion of child i's cumulative vocabulary production at age t that is not predicted by his or her age. We presume a heterogeneous level-q variance, meaning that the within-person variation, , varies across time as child age increases.
At level 2 (between children), we then hypothesize, based on the previous literature, that an individual child's status, velocity, acceleration, and cubic change may be predicted by our early (14 month) predictors; namely, SES, parent input, and child gesture. So we have level-2, or between-person, models in which there is a separate equation for each level-1 coefficient, πpi, where p = 0, 1, 2, 3:
(2) |
where πpi is the pth growth parameter from the level-1 model (Equation 1), βp0, βp1, βp2, βp3 are linear regression coefficients, and rpi is a random effect. We allow random effects to be correlated within children but not between children.
The prediction model
The second part of our larger statistical model represents the hypothesis that a child's status, velocity, acceleration, and cubic change at age 2.5 years help us predict later vocabulary skill Wi, controlling for other background characteristics, say Xi, where X = SES, Input, and Gesture. We might assume a linear prediction model, i.e.,
(3) |
Thus, our goal from Equation 3 is to learn about the γ's. In particular, γ0 tells us how child i's vocabulary status at age 2.5, that is, π0i, contributes to Wi, controlling for the other aspects of that child's growth curve, as well as background covariates Xi. Similarly, γ1, γ2, γ3 tell us about the unique contribution of the velocity, acceleration, and cubic parameters of child growth net the other predictors in the model.
Equation 3 represents our theoretical belief about the association between early vocabulary growth trajectories and vocabulary prior to kindergarten entry, controlling for child background. Unfortunately, we cannot directly estimate the γ's in Equation 3 because we cannot observe π0i, π1i, π2i, π3i. One possible option is to estimate these unknowns, and determine how our outcome (defined by (3)) changes as a function of these estimates. For example, we could estimate π0i, π1i, π2i, π3i via ordinary least squares regression, child-by-child, yielding π̂0i, π̂1i, π̂2i, π̂3i. However, simply substituting these values for the true values of π0i, π1i, π2i, π3i would lead to bias, as is well known from the literature on regression with error-prone predictors (cf., Fuller, 1987). However, from the theoretical model in Equation 3, we can derive the conditional mean of the outcome Wi given the error-prone estimates π̂0i, π̂1i, π̂2i, π̂3i. Thus:
(4) |
Equation 4 shows that if we substitute E (πpi | π̂0i, π̂1i, π̂2i, π̂3i, Xi), p=0,1,2,3 for the unknown π0i, π1i, π2i, π3i in Equation 3, we will obtain unbiased estimates of our γ's. These conditional expectations are, in fact, the “Bayes” or “empirical Bayes posterior means” (see Raudenbush & Bryk, 2002, Chapter 6) that we have used in our analysis. Previous work has shown these empirical Bayes estimates to be more precise than estimates from OLS models (Raudenbush & Bryk, 2002, Chapter 6). The key assumption is that the random effects in the level-2 model (Equation 2) are multivariate normal in distribution. These empirical Bayes estimates can be computed by estimating the two-level model specified in Equations 1 and 2 and then outputting the empirical Bayes (or fully Bayes) coefficients in a child-level file. This is a standard procedure in many software packages for hierarchical linear models. Note that it is essential to include in Equation 2 any covariate X that will also be used in Equation 3.
Results
The results that follow are presented in four steps. In the first step, we use individual growth modeling (employing HLM; Raudenbush, Bryk & Cheong, 2000) to best model vocabulary growth between 14 and 46 months. We then incorporate the role of early 14-month predictors (SES, input, gesture) in that growth (Equations 1 and 2 above). We then use each child's empirical Bayes estimated growth rates from the vocabulary growth models with early 14-month predictors as predictors of children's kindergarten language skills, using multiple regression analyses to estimate the parameters of our theoretical model (Equation 3 above). Thus, we are empirically testing a theoretical model based on the assumption that children's trajectories of growth in vocabulary during the toddler period will be good predictors of their later vocabulary skill prior to kindergarten entry. Finally, we turn to the more practical question of the utility of early vocabulary data in predicting later vocabulary skill. Here we use data from 14-30 months and examine relations between vocabulary measures during that early period and PPVT at 54 months.
Modeling child vocabulary growth
In finding the best model for vocabulary growth, we began with an empirical plot of children's cumulative word types between 14 and 46 months (Figure 1a). As is evident from this plot, children vary both in their rates of vocabulary growth and in their estimated vocabulary size at any given age. To obtain the best-fitting level-1, or within-person, model for these data, we fit linear, quadratic, and cubic growth models. All models were fit using heterogeneous sigma squared, meaning that the variation was allowed to vary over time, and age was centered at 2.5 years, the midpoint of the 32 month period. We determined that the cubic model best represented the empirical data because it had the lowest goodness of fit statistic (-2 Log Likelihood) and because the plot of the cubic model best mirrored the plot of the empirical data (see Figures 1a and 1b). A plot of this model is displayed in Figure 1b. This cubic model shows that, on average, children at age 2.5 years have an estimated cumulative vocabulary of approximately 343 word types, with an increase in cumulative word types at this age at a rate of 362 types per year. Further, the significant quadratic effect indicates that, over time, the yearly rate of increase itself increases, and the significant negative cubic terms suggest that acceleration is not constant and reduces at the later ages. It is not surprising that a cubic model fits the data best, as our videotaped sessions are fixed at 90-minutes and thus the potential for adding new words decreases as vocabularies increase and observation time is constrained. The average fitted growth trajectory based on this model is displayed in Figure 1c.
Figure 1.
Plots of empirical growth trajectories (a), predicted growth trajectories from cubic model (b), and average vocabulary growth from cubic model (c) (n=62). Figure 1: Empirical growth (b) Predicted cubic growth (c) Average growth (predicted)
Predictors of vocabulary growth
We next determine the role of the early (14-month) predictors in cumulative vocabulary growth. We begin by examining relations among the predictors themselves. Table 3 presents correlations between parent input (word types) at 14 months, child gesture types at 14 months, family yearly income, and parent education. As Table 3 shows, parent input is significantly positively related to child gesture, to family income, and to parent education. Thus, parents who use more vocabulary with their children, have larger yearly incomes, more years of formal schooling, and have children who gesture to or about more things during the 14-month interaction than parents who use fewer words with their children. In addition to being related to parent input, child gesture is also positively related to parent education (but not to family income), and as noted previously, family income and parent education are positively related to one another.
Table 3.
Simple correlations (Pearson's r) among early (14-month) predictors (n=62).
Input | Gesture | Income | Education | |
---|---|---|---|---|
Input | 1.0 | |||
Gesture | 0.31* | 1.0 | ||
Income | 0.30* | 0.06 | 1.0 | |
Education | 0.36** | 0.30* | 0.47*** | 1.0 |
p < 0.05;
p < 0.01;
p < 0.001
Table 4 presents a taxonomy of models investigating the relationship between early (14 month) predictors and growth in child cumulative vocabulary production. Model 1 in Table 4 is the unconditional cubic growth model. We continue by adding SES as a predictor. Preliminary analyses (not shown) indicated that when considered on their own, both family income and parent education were significant predictors of intercept and linear change. However, when considered together in the same model, there was no longer an effect of education or income on intercept (and income was a significant predictor of linear change but education was not). Thus, it appears that the predictors are too collinear to be considered separately in this analysis; we therefore use the composite created in our Principal Components Analysis as a measure of SES from this point on. Model 2 in Table 4 presents the effect of SES on growth in child cumulative vocabulary production. In this model, the SES composite is a significant predictor of intercept (p < 0.05) and linear growth (p < 0.01). That is, children whose parents had higher family incomes and were more educated had greater estimated vocabularies at 2.5 years, and increased in vocabulary at a faster rate, than children from lower SES families. There was no association between SES and quadratic or cubic change. Model 3 in Table 4 shows the effect of parent input, controlling for SES. Here we see only a marginal effect of input on intercept, and when input is included in the model, the significant effect of SES on intercept disappears. A test of mediation (the Sobel test) showed that the t-ratio of 1.51 (p = 0.12) is not sufficient to clearly establish that parental input mediates the association between SES and child vocabulary. However the effect is suggestive and should be investigated further with larger samples. Finally, Model 4 in Table 4 presents the effect of early child gesture types on growth in cumulative vocabulary, controlling for SES and input. In this model, controlling for SES and input, child gesture has a significant effect on intercept (p < 0.01) and a marginal effect on acceleration (p < 0.10).
Table 4.
Estimates of fixed effects, random effects, and goodness of fit For cubic growth models using SES, Parent input and Child gesture to predict Intercept and change in children's cumulative vocabulary (N=62).
Cubic time | Adding SES | Adding Input | Adding Gesture | |
---|---|---|---|---|
MODEL1 | MODEL 2 | MODEL 3 | MODEL 4 | |
FIXED EFFECTS: | ||||
| ||||
Intercept | 342.5*** (17.8) | 342.6*** (17.1) | 342.6*** (16.6) | 342.6*** (15.7) |
Linear change | 362.1*** (12.5) | 362.2*** (11.7) | 362.3*** (11.5) | 362.6*** (11.2) |
Quadratic change | 28.4*** (4.6) | 28.5*** (4.6) | 28.5*** (4.5) | 28.5*** (4.5) |
Cubic change | -42.8*** (3.3) | -42.8*** (3.3) | -42.8*** (3.3) | -43.1*** (3.2) |
| ||||
SES | 39.3* (17.2) | 26.7 (18.1) | 21.5 (17.2) | |
SES* AGE | 34.8** (11.8) | 28.1* (12.5) | 25.8* (12.3) | |
SES* AGE2 | -1.1 (4.6) | 0.2 (5.0) | 1.0 (4.9) | |
SES* AGE3 | -3.5 (3.3) | -3.9 (3.6) | -3.8 (3.5) | |
| ||||
INPUT | 0.25∼ (0.14) | 0.15 (0.13) | ||
INPUT* AGE | 0.13 (0.09) | 0.10 (0.10) | ||
INPUT* AGE2 | -0.03 (0.04) | -0.01 (0.04) | ||
INPUT* AGE3 | 0.01 (0.03) | 0.004 (0.04) | ||
| ||||
Gesture | 3.74** (1.3) | |||
Gesture* AGE | 1.45 (0.92) | |||
Gesture* AGE2 | -0.66∼ (0.36) | |||
Gesture* AGE3 | 0.04 (0.26) | |||
| ||||
RANDOM EFFECTS: | ||||
| ||||
Level-2: | ||||
Intercept | 19421.5*** (139.4) | 17898.5*** (133.8) | 16971.5*** (130.3) | 14995.6*** (122.5) |
Linear change | 9145.5*** (95.6) | 7938.2*** (89.1) | 7665.9*** (87.6) | 7199.3*** (84.8) |
Quadratic change | 1118.7*** (33.5) | 1113.3*** (33.4) | 1105.8*** (33.3) | 1054.0*** (32.5) |
Cubic change | 425.2*** (20.6) | 414.3*** (20.4) | 410.3*** (20.3) | 374.5 (19.4) |
| ||||
Goodness of fit: -2LL | 5588.1 (18) | 5572.9 (22) | 5566.7 (26) | 5547.3 (30) |
p<.01;
p<.05,
p<.01,
p<.001
The final model containing the effects of SES, input, and gesture (Model 4 in Table 4) is displayed in Figure 2. Here we examine the role of SES and gesture by plotting four hypothetical trajectories based on higher and lower (75th percentile vs. 25th percentile) SES and gesture levels, holding input constant at its mean. We see that, at the earliest ages, the trajectories of vocabulary growth separate, based on whether or not the child is a high or low gesturer. This pattern reflects the marginal effect of gesture on acceleration and the significant effect of gesture on intercept in our model. The two solid lines (low-gesturers) are lower than the two dashed lines (high-gesturers), emphasizing the role of gesture in early vocabulary growth. However, around 2 years of age, the lines start to separate and the effect of SES becomes more apparent. This pattern reflects the significant positive effect of SES on linear growth in our model. From this point on, estimated vocabulary growth is highest for high-gesturers in high SES families, and lowest for low-gesturers in low-SES families. Around 3 years of age, the two middle lines become virtually indistinguishable, indicating that a low-gesturer in a high SES family typically has a vocabulary by age 3 that has caught up to the vocabulary of a high-gesturer from a low SES family, controlling for input. These effects are main effects, as we did not allow for a Gesture X SES interaction in the model. By the final visit at age 3 years 10 months, the difference in estimated vocabulary size for a high-gesture/high-SES child compared to a low-gesture/low-SES child is approximately 200 word types.
Figure 2.
Effect of SES and child gesture on cumulative vocabulary growth, holding parent input constant (at its mean).
High SES and High Gesture are plotted at the 75th percentile Low SES and Low Gesture are plotted at the 25th percentile
Using growth to predict later language skills
To assess the relation between early vocabulary growth and vocabulary skill prior to kindergarten entry, we use the empirical Bayes intercept and growth estimates (See Equation 4) from Model 4 (Table 4) as predictors of children's 54 month PPVT scores. These values are the estimated growth rates derived from the model including SES, input, and gesture as predictors. Not surprisingly, the intercept and growth estimates from the vocabulary growth model are related to one another. For example, children with higher intercepts or cumulative vocabularies at 2.5 years (based on the full range of data in the model) also show faster linear growth and less acceleration at that time than children with lower cumulative vocabularies at 2.5 years. To determine which aspects of early vocabulary growth (e.g. intercept, linear growth, acceleration) are most related to later vocabulary skill, we fit a series of models comparing growth estimates as predictors. Collinearity was too high to allow for stable estimation in a model containing intercept, linear growth, and quadratic growth (plus controls). As a result, we fit several models containing one or two of the estimates at a time, and then determine the best model based on overall model fit. Note that cubic change was essentially invariant across people, thus we did not include it as a predictor of 54 month vocabulary.
Model 1 in Table 5 shows that, on their own, the controls explain 43.2 percent of the variation in child vocabulary skill at 54 months. In model 2, when we include the intercept estimates from the growth model, we see that the intercept is significantly and positively related to the outcome, and that the model explains an additional 6.5 percent of the variation. Model 3 includes the linear growth estimates from the vocabulary growth models (and not intercept) and shows that linear growth is a significant predictor of later vocabulary, explaining 11.2 percent more of the variation over the control only model (Model 1). Thus, a model including linear growth estimates appears to be a better fitting model than one including the intercept estimates.
Table 5. A series of regression models using vocabulary growth estimates from early childhood (and controls) to predict Pre-K Vocabulary comprehension (PPVT) (n=55).
PPVT (54 months) (Standardized Beta) | ||||||||
---|---|---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | Model 7 | Model 8 | |
SES | 0.30* | 0.27* | 0.22 | 0.17 | 0.14 | 0.18 | 0.15 | 0.15 |
Input | 0.24* | 0.19 | 0.18 | 0.19∼ | 0.18 | 0.16 | 0.18 | 0.24* |
Child Gesture | 0.35** | 0.25* | 0.28** | 0.35** | 0.31** | 0.23* | 0.30* | 0.23* |
Intercept | 0.29* | -0.44 | 0.73*** | 0.09 | ||||
Linear Growth | 0.37** | 0.77** | 0.56*** | 0.51 | 0.57*** | |||
Quadratic Growth | 0.27* | 0.49** | 0.30 | 0.22∼ | ||||
Linear* SES | -0.33** | |||||||
Quadratic* SES | -0.13 | |||||||
R-square | 0.432 | 0.497 | 0.544 | 0.565 | 0.582 | 0.563 | 0.582 | 0.651 |
p<.10;
p<.05;
p<.01,
p<.001
In model 4 (Table 5), intercept and linear growth are included together as predictors. The resulting parameter estimates indicate that the predictors are likely too collinear to include together in one model; nevertheless, linear growth remains significant while intercept is not. Model 5 investigates the combined effect of linear growth and quadratic growth. In this model, both linear growth and acceleration are significant predictors of vocabulary skill at 54 months. The addition of acceleration to this model (in comparison to Model 3 with just linear growth) resulted in an R-square increase of 3.8 percent. In Model 6, we include the estimates of intercept and acceleration as predictors. In this model, both estimates are significant and, with controls, explain a combined 56.3 percent of the variation in child vocabulary skill. Finally, in Model 7, we present the model with intercept, linear and quadratic parameters as predictors; none of the parameters is a significant predictor because of collinearity. We include this model for comparison purposes only.
In comparing models 4, 5, and 6 in Table 5, we conclude that Model 5, which includes linear growth and acceleration, is the best model we can fit to our data to receive unbiased estimates of the role of early vocabulary growth in later vocabulary skill. We cannot use standard hypothesis testing to compare the intercept plus quadratic model (Model 6) to the linear plus quadratic model (Model 5) because they are non-nested models. However, we can see that the latter has a somewhat higher R-squared statistic (58.2 percent for Model 5 compared to 56.3 percent for Model 6). Moreover, a useful and entirely appropriate model comparison procedure in this case is the Bayesian Information Criterion (BIC) (Schwarz, 1978). Applying this procedure, we found that the linear plus quadratic model (Model 5) has a slightly lower BIC (84.08) than does the intercept plus quadratic model (Model 6; BIC = 84.13), consistent with an interpretation that linear plus quadratic give the best account of the data. Further, when we compare Model 5 to nested model 7 containing intercept, linear and quadratic growth, we see that we do not explain any more of the variation in PPVT scores by adding the intercept to Model 5. Thus, we conclude that Model 5, containing the controls, linear and quadratic growth parameters, is the most parsimonious model that we can fit to the data.
Model 5 shows that, controlling for important early (14 month) predictors, children's rate and acceleration of vocabulary growth explains a large portion of later vocabulary skill (R-square = 58.2 percent). Thus, incorporating early vocabulary growth rates refines our prediction of later vocabulary skill. Specifically, children who have a faster rate of growth during toddlerhood, and who increase in vocabulary growth at a faster rate, have larger receptive vocabularies at 54 months (just before entering kindergarten) than children who have a slower rate of growth and increase in vocabulary growth at a slower rate.
As a final step, we included interaction terms in the model to determine whether the individual growth curve parameters predict PPVT scores differently as a function of control factors, including gender, SES, input, and gesture. In several different models, we added interaction terms to Model 5 in Table 5. For gender, we added the main effect as well because it was not in Model 5. We found no interactions between growth parameters and child gender. A model (not shown) containing the main effects of SES, input, gesture, linear growth, quadratic growth plus interactions between each growth parameter with each control parameter (linear*SES, quadratic*SES; linear*input, quadratic*input; linear*gesture, quadratic*gesture) resulted in a significant interaction between linear growth and SES. We then fit an additional model removing the non-significant interaction effects between growth parameters and input and gesture, and retaining the SES interactions. The results are shown in Model 8 in Table 5. Model comparisons revealed that Model 8 (with the interactions) was a significantly better fitting model than Model 5 (without the interactions) and than the model containing the additional non-significant interaction effects (not shown in the table). Thus, Model 8, containing a significant negative interaction between linear growth and SES, is our “final” model.
This interaction indicates that the effect of linear vocabulary growth rates on later PPVT scores is stronger for children from lower SES backgrounds. Specifically, controlling for input (p<.05) and gesture (p < 0.05), the standardized parameter estimate for linear growth in predicting PPVT is 0.892 (p < 0.001) for a child at the 25th percentile for SES, whereas it is 0.222 (ns) for a child at the 75th percentile for SES. Thus, the rate of vocabulary growth early on in development (30 months) is particularly telling for lower SES children.
To what extent can data collected early in life predict later vocabulary skill?
To address our final, more practical question regarding the extent to which early vocabulary production data can be used to predict later vocabulary skill, we take a closer look at the relation between PPVT at 54 months and the data we collected during the first half of our study between 14 and 30 months. We chose to use this period of time because, by 30 months, children already vary widely in the size of their productive vocabularies (Fenson, et al., 1994) and because this period is one during which there is much acceleration in vocabulary acquisition (McMurray, 2007). Table 6 presents partial correlations between PPVT at 54 months, and children's observed cumulative word types at 14, 18, 22, 26, and 30 months, as well as the growth estimates (intercept, linear and quadratic) from a quadratic growth model of vocabulary production across the 14-30 month time period with age centered at 30 months. The partial correlations control for the same variables as in our earlier analysis, namely, SES, parent vocabulary input (at 14 months), and child gesture (at 14 months); these controls were included in the growth model as described in the methods in equation 3.
Table 6.
Partial correlations between PPVT scores at 54 months and early measures of vocabulary controlling for SES, parent vocabulary input (14 months), and child gesture (14 months) (n=55).
PPVT (54 months) | |
---|---|
Observed Word Types (cumulative) | |
14 months | 0.14 |
18 months | 0.13 |
22 months | 0.09 |
26 months | 0.22 |
30 months | 0.27∼ |
Estimates from 14-30 month growth model (data centered at 30 months) | |
Intercept | 0.26∼ |
Linear growth | 0.37* |
Acceleration | 0.36* |
p<.10;
p<.05
NOTE: using non-cumulative observed word types results in the same pattern of correlations as cumulative word types.
As Table 6 shows, controlling for SES, parent input, and early child gesture, child observed cumulative vocabulary does not significantly predict PPVT scores at 54 months until children are 30 months old, at which point the relation is marginal (p < 0.10). However, both velocity (r = 0.37, p < 0.05) and acceleration (r = 0.36, p < 0.05) estimates from the 14-30 month quadratic growth model are related to PPVT scores at 54 months. Thus, having early longitudinal information on expressive vocabulary, including measures of velocity and acceleration, is indeed very useful in predicting later vocabulary skill. In addition, the results suggest that if it is not feasible to collect such detailed data throughout the longitudinal period, collecting data early (at 14 months to gather information on each of the background variables) and then again at another later time point (at around 30 months), while not providing measures of growth, might be a fruitful alternative.
Discussion
The size of the vocabulary that children bring with them when they enter school has been shown to be a good index of their readiness for school and ultimate school success (e.g., Snow, Burns & Griffin, 1998). Although research on the course of vocabulary development during the early language learning years has found that children vary a great deal in the rate at which they acquire vocabulary (e.g., Fenson et al., 1994), we have little understanding of the role that these differences in early growth trajectories play in predicting vocabulary skill at school entry. Our study fills this gap by showing that the pace of early vocabulary growth predicts children's vocabulary school readiness skills
We began by fitting the best longitudinal model to our detailed observational data of vocabulary growth between ages 14 and 46 months. We then incorporated background predictors measured at 14 months (e.g., SES, parent input, child gesture) into this growth model and examined the role that those predictors play in children's vocabulary development. Finally, using intercept and growth estimates from our vocabulary growth model, we showed that estimates of children's growth and acceleration in vocabulary during the early childhood period results in the best predictor of vocabulary skill prior to kindergarten entry, and that the relation between growth rates and later vocabulary skill differs depending on the socioeconomic status of the child. Thus, when predicting children's academic starting point in language skills, we gain the most from looking not only at background factors but also at the early trajectories children are following as their language skills develop during early childhood.
The role of background factors in child vocabulary development: SES, parent input, child gesture
The results of our longitudinal growth models support and add to previous literature on the role that background factors play in child vocabulary development. We find that socioeconomic status (SES), measured as a composite of parent education and family income, is a significant positive predictor of intercept and linear growth of child vocabulary development. This is not a new finding and replicates previous research by Hart and Risley (1992; 1995) and many others (see Hoff, 2006, for a review).
In this study, parent input did not explain the relation between SES and vocabulary growth, but it did reduce the SES effect to a certain degree. However, with SES and input in a growth model together as predictors, SES remains a significant predictor of linear growth and input is a marginal predictor of intercept. Thus, SES relates to children's vocabulary growth even with input controlled (and vice versa, although to a lesser extent). Note that, in this study, we used one relatively gross measure of parent input, namely, number of different words directed to the child. It is likely that different aspects of input are more or less influential at different points in development (e.g., Rowe, under review). Thus, our measure of “input” could be an underestimate, particularly since it does not capture differences in quality of input over time. Nonetheless, the positive relation between child vocabulary growth and both SES and parent input suggests that manipulating aspects of the environment may be one way to close the vocabulary gap between children in low vs. high socioeconomic families. For example, parents can be encouraged to increase the diversity of the words they speak to their children, or the gestures they produce along with that speech, which may then have beneficial effects on child vocabulary growth (cf. Rowe & Goldin-Meadow, 2009; Sauer & Goldin-Meadow, 2010).
Finally, when children's early gesture vocabulary at 14 months is included as an additional predictor, we see a very strong relation between early child gesture and estimated vocabulary skill at 2.5 years (intercept), and a weaker relation between early child gesture and acceleration (quadratic growth). The relation between early gesture and vocabulary growth can be interpreted in several non-mutually exclusive ways. First, it could be that child gesture is merely a good early indicator of underlying linguistic skills and is not playing a role in the process of language learning. Second, it could be that producing gestures for a range of ideas causes children to become good vocabulary learners, perhaps indirectly through the verbal feedback they receive from parents (Goldin-Meadow et al., 2007), or more directly by enabling them to practice producing meanings with their hands before they can produce them verbally (Goldin-Meadow, 2003; 2007). Finally, it is worth mentioning that child gesture at 14 months is related to SES and to parent gesture use at that same time period (Rowe & Goldin-Meadow, 2009b). Thus, children are not necessarily born high or low gesturers but, as early as 14 months, may have already been socialized to gesture more (or less) during interaction with their parents.
In terms of child gender, the absence of a strong relation between gender and vocabulary growth in our sample is not surprising given the inconsistent findings in the literature. Gender differences have been found either only very early on in language development (e.g., Huttenlocher et al., 1991) or in specific stylistic uses of language, but not in overall vocabulary growth (see Bates, Dale & Thal, 1995, for a summary).
To better understand our final growth model, we use Figure 2 to illustrate how the different predictors exert their effects at different points in development. SES and gesture are both significant predictors in our model (controlling for parent input). However, the effect of gesture is most pronounced during the earlier period of vocabulary learning (14 to 30 months), whereas the effect of SES is most pronounced during the later period (30 to 46 months). The overall pattern is an interesting one and illustrates the importance of considering both factors: Children who are from higher SES families (whose parents tend to use many different vocabulary words with them), and who themselves produce many gestures conveying different meanings early in development, have the greatest estimated vocabularies at 2.5 years and the fastest growth over the early childhood period; children who are from lower SES families and who produce fewer different gestures have the lowest estimated vocabularies at 2.5 years and the slowest growth over the early childhood period; and children with mixed profiles (high SES/low gesture and low SES/high gesture) fall somewhere in between and, by the end of the period, are indistinguishable from one another.
Using growth trajectories to predict vocabulary size at school entry
Our next step was to build an empirical mathematical model to test our theoretical hypotheses about the importance of early vocabulary growth trajectories in later language skills. We obtained estimators of the association between early growth parameters and kindergarten skill using the method of empirical Bayes. These estimators are unbiased under the assumption of the polynomial representation of growth and the assumption that the true growth parameters are multivariate normal in distribution. Both assumptions can be checked (Raudenbush & Bryk, 2002, Chapter 9) and, in our study, were checked using residual plots and alternative specifications. This model-driven approach (Rodgers, 2010) is particularly well suited to vocabulary acquisition, as vocabulary is a construct where the unit of measurement (number of word types) remains stable over the course of development, enabling precise measurement of trajectories. However, the paradigm could, in principle, be used to examine the association between early growth trajectories and later outcomes in other developmental phenomena.
Our findings are intuitively plausible––it is not surprising that vocabulary growth trajectories predict later language skills. After all, vocabulary is one area where we know that there are marked individual differences in rates of growth during early childhood (e.g., Fenson et al., 1994) and these differences are likely to have an impact on the eventual size of a child's vocabulary. Our focus on trajectories, however, highlights the importance of understanding the path of development (Adolph et. al., 2008) and the underlying mechanisms of change that create different trajectories of growth (Elman, 2003). The field of language development is approaching consensus on the general shape of vocabulary growth during early childhood (e.g., Ganger & Brent, 2004) and is making progress on understanding the factors or mechanisms that contribute to individual differences in that growth (see Hoff, 2006a, for a review). However, additional large-scale longitudinal studies that make use of the methods we have introduced here would deepen our understanding of individual differences in language learning, and would thus not only move the field forward theoretically, but also address the practical concerns involved in optimizing language development (e.g., Hoff, 2006b).
We also found an interesting and important interaction effect showing that the relation between early vocabulary growth rates and later language skill is stronger for children from lower SES families than for children from higher SES families. It is plausible that, for children from higher SES homes, the relation between growth and later skill is not as strong because these children have other ways to catch up in the event that they start out behind. Children from lower SES homes may not have these alternate routes, suggesting that it is even more important for children in lower SES homes to get a jump start in their vocabulary learning. Our findings thus speak to the larger issue of the SES achievement gap––it is not only the skills children have when they enter school that matter for later success, but also the path they followed in getting to these skills.
Our findings may also have practical significance. Identifying children who are likely to come to school with lagging vocabulary skills well before they arrive at school would be useful so that intervention programs can be made available to them in a timely fashion. In this vein, we examined data from the first half of our observations (from 14 to 30 months) and found that, controlling for SES, parental input, and gesture at the 14 month time point, it was the growth estimates during this period that provided the most useful information about later vocabulary skill, not the data at any one time point. Although it is rarely feasible to gather the kind of detailed longitudinal data that we have collected in our study, it may be possible to collect data at three time points early in development. Collecting background data (SES, parental input, gesture) and data on a child's productive vocabulary at 14 months, combined with child data at two other time points up to 30 months, would allow researchers to construct estimates of linear growth and acceleration and thus better predict vocabulary skill just prior to kindergarten entry. From a theoretical point of view, it is important to note that, whether we look at the entire developmental period (from 14 to 46 months) or the truncated period (from 14 to 30 months), including information about the pace of learning vocabulary early in development improves our ability to predict vocabulary skills several years later.
The work presented here does have some limitations, however. First, to get an idea of what each child's typical environment is like, we encouraged parents and children to engage in their normal activities at home. This procedure led naturally to a great deal of variability in the activities that the children and parents engaged in, and this variability may have been responsible, in part, for our results. To begin to address this concern, we isolated utterances involved in reading episodes and coded them separately. We found that parents who read more with their children did indeed use more vocabulary with the children than parents who read less. Importantly, however, the pattern of results we report here was the same whether or not the reading utterances were included in the database. Note also that one of our measures, the PPVT, did not involve spontaneous conversation and thus is not tied in any way to the activities that took place during our observations. It is also worth noting that the diversity of activities we saw on our videotapes may be central to the phenomenon we report here and not an artifact of our procedure. The different types of activities the children in our sample routinely experienced at home may have engendered the different types of talk they heard and produced, which, in turn, had an impact on the eventual size of their vocabularies.
A second limitation of the study is the fact that ethnicity/race and SES were confounded in our sample. We therefore were unable to consider how race might (or might not) fit into this picture. Future research with a larger sample is needed to tease these effects apart.
A final limitation of the study is that correlational data do not allow us to talk about causal relations. Studies that experimentally manipulate the variables that we hypothesize play a role in the language learning process are needed for us to take this important step (cf. Sauer & Goldin-Meadow, 2010).
Despite the correlational nature of the work we have presented, the findings do have the potential to influence how we conduct interventions designed to change the course of vocabulary growth. Our findings suggest that we should intervene early, and perhaps in different ways, at different points in development. More specifically, our growth models suggest that early interventions involving child gesture might lead to acceleration in vocabulary growth over time (cf. Rowe & Goldin-Meadow, 2009), assuming of course that child gesture is amenable to intervention (see, for example, Sauer & Goldin-Meadow, 2010). Interventions that accelerate vocabulary growth early in life have the potential to increase the size of a child's vocabulary at school entry and, in turn, improve school readiness. Our findings thus have particular relevance for populations likely to be behind at the start of kindergarten and thus at risk for reading difficulties and school failure.
Acknowledgments
The research was supported by grants from the National Institute of Child Health and Human Development: P01HD40605 to Susan Goldin-Meadow and K99/R00HD055522 to Meredith L. Rowe. We thank the other faculty members on the Program Project, Larry Hedges, Janellen Huttenlocher, Susan Levine, and Steve Small for their intellectual contributions to the project; Kristi Schonwald and Jason Voigt for administrative and technical support; and Karyn Brasky, Laura Chang, Elaine Croft, Kristin Duboc, Jennifer Griffin, Sarah Gripshover, Kelsey Harden, Lauren King, Max Masich, Carrie Meanwell, Erica Mellum, Molly Nikolas, Jana Oberholtzer, Lilia Rissman, Becky Seibel, Meredith Simone, Calla Trofatter, Kevin Uttich, Julie Wallman, and Kristin Walters for help in data collection and transcription. We are grateful to the participating children and families.
References
- Adolph KE, Robinson SR, Young JW, Gill-Alvarez F. What is the shape of developmental change? Psychological Review. 2008;115:527–543. doi: 10.1037/0033-295X.115.3.527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bates E, Dale PS, Thal D. Individual differences and their implications for theories of language development. In: Fletcher P, MacWhinney B, editors. The Handbook of Child Language. Oxford: Basil Blackwell; 1995. [Google Scholar]
- Bloom L. One word at a time: The use of single word utterances before syntax. The Hague: Mouton; 1973. [Google Scholar]
- Bloom P. How children learn the meanings of words. The MIT Press; Cambridge, MA: 2002. [Google Scholar]
- Braine MDS. Is nativism sufficient? Journal of Child Language. 1994;21:9–32. doi: 10.1017/s0305000900008655. [DOI] [PubMed] [Google Scholar]
- Dickinson DK, Tabors PO, editors. Beginning literacy with language: Young children learning at home and school. Baltimore: Paul Brookes Publishing; 2001. [Google Scholar]
- Duncan GJ, Dowsett CJ, Claessens A, Magnuson K, Huston AC, Klebanov P, et al. School readiness and later achievement. Developmental Psychology. 2007;43(6):1428–1446. doi: 10.1037/0012-1649.43.6.1428. [DOI] [PubMed] [Google Scholar]
- Dunn LM, Dunn LM. Peabody Picture Vocabulary Test. 3rd. Circle Pines, MN: American Guidance Service; 1997. [Google Scholar]
- Elman JL. Development: It's about time. Developmental Science. 2003;6:430–433. [Google Scholar]
- Fenson L, Dale P, Reznick JS, Bates E, Thal D, Pethick S. Variability in early communicative development. Monographs for the Society for Research in Child Development. 1994;59(5, Serial No. 242) [PubMed] [Google Scholar]
- Ganger J, Brent M. Reexamining the Vocabulary Spurt. Developmental Psychology. 2004;40:621–632. doi: 10.1037/0012-1649.40.4.621. [DOI] [PubMed] [Google Scholar]
- Goldfield BA, Reznick JS. Early lexical acquisition:Rate, content, and the vocabulary spurt. Journal of Child Language. 1990;17:171–183. doi: 10.1017/s0305000900013167. [DOI] [PubMed] [Google Scholar]
- Goldin-Meadow S. Hearing gesture: How our hands help us think. Cambridge, MA: Harvard University Press; 2003. [Google Scholar]
- Goldin-Meadow S. Pointing sets the stage for learning language –– and creating language. Child Development. 2007;78:741–745. doi: 10.1111/j.1467-8624.2007.01029.x. [DOI] [PubMed] [Google Scholar]
- Goldin-Meadow S, Goodrich W, Sauer E, Iverson J. Young children use their hands to tell their mothers what to say. Developmental Science. 2007;10:778–785. doi: 10.1111/j.1467-7687.2007.00636.x. [DOI] [PubMed] [Google Scholar]
- Gopnick A, Meltzoff A. The development of categorization in the second year and its relations to other cognitive and linguistic developments. Child Development. 1987;58:1523–1531. [Google Scholar]
- Hart B, Risley TR. American parenting of language-learning children: Persisting differences in family-child interactions observed in natural home environments. Developmental Psychology. 1992;28:1096–1105. [Google Scholar]
- Hart B, Risley T. Meaningful differences in the everyday experience of young American children. Baltimore; Brookes: 1995. [Google Scholar]
- Hoff E. Causes and consequences of SES-related differences in parent-to-child speech. In: Bornstein MH, Bradley RH, editors. Socioeconomic status, parenting, and child development. Mahwah, NJ: Lawrence Erlbaum Associates; 2003a. pp. 147–160. [Google Scholar]
- Hoff E. The specificity of environmental influence: Socioeconomic status affects early vocabulary development via maternal speech. Child Development. 2003b;74:1368–78. doi: 10.1111/1467-8624.00612. [DOI] [PubMed] [Google Scholar]
- Hoff E. How social contexts support and shape language development. Developmental Review. 2006a;26:55–88. [Google Scholar]
- Hoff E, Naigles L. How children use input to acquire a lexicon. Child Development. 2002;73:418–433. doi: 10.1111/1467-8624.00415. [DOI] [PubMed] [Google Scholar]
- Hoff E. The uses of longitudinal data and person-centered analyses in the study of cognitive and language development. Merrill-Palmer Quarterly. 2006b;52:633–644. [Google Scholar]
- Hoff-Ginsberg E. Mother-child conversation in different social classes and communicative settings. Child Development. 1991;62:782–96. doi: 10.1111/j.1467-8624.1991.tb01569.x. [DOI] [PubMed] [Google Scholar]
- Hoff-Ginsberg E. The relation of birth order and socioeconomic status to children's language experience and language development. Applied Psycholinguistics. 1998;19:603–629. [Google Scholar]
- Huttenlocher J, Haight W, Bryk A, Seltzer M, Lyons T. Early vocabulary growth: Relation to language input and gender. Developmental Psychology. 1991;27(2):236–248. [Google Scholar]
- Huttenlocher J, Vasilyeva M, Waterfall H, Vevea J, Hedges L. The varieties of speech to young children. Developmental Psychology. 2007;43:1062–1083. doi: 10.1037/0012-1649.43.5.1062. [DOI] [PubMed] [Google Scholar]
- Huttenlocher J, Waterfall H, Vasilyeva M, Vevea J, Hedges L. Sources of variability in children's language growth. Cognitive Psychology. doi: 10.1016/j.cogpsych.2010.08.002. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iverson JM, Goldin-Meadow S. Gesture paves the way for language development. Psychological Science. 2005;16:368–371. doi: 10.1111/j.0956-7976.2005.01542.x. [DOI] [PubMed] [Google Scholar]
- Lawrence V, Shipley EF. Parental speech to middle and working class children from two racial groups in three settings. Applied Psycholinguistics. 1996;17:233–56. [Google Scholar]
- McCartney K, Burchinal MR, Bub KL. Best practices in quantitative methods for developmentalists. Monographs of the Society for Research in Child Development. 2006;71(3, Serial No. 285) doi: 10.1111/j.1540-5834.2006.07103001.x. [DOI] [PubMed] [Google Scholar]
- McMurray B. Defusing the childhood vocabulary explosion. Science. 2007;317:631. doi: 10.1126/science.1144073. [DOI] [PubMed] [Google Scholar]
- Nelson K. Structure and strategy in learning to talk. Monographs of the Society for Research in Child Development. 1973;38(Serial No. 149) [Google Scholar]
- Oshima-Takane Y, Goodz E, Derevensky JL. Birth order effects on language development: Do second born children learn from overheard speech? Child Development. 2008;67:621–634. [Google Scholar]
- Pan BA, Rowe ML, Singer JD, Snow CE. Maternal correlates of growth in toddler vocabulary production in low-income families. Child Development. 2005;76:763–782. doi: 10.1111/j.1467-8624.2005.00876.x. [DOI] [PubMed] [Google Scholar]
- Raudenbush SW, Bryk AS. Hierarchical linear models: Applications and data analysis methods. 2nd. Thousand Oaks, CA: Sage Publications; 2002. [Google Scholar]
- Raudenbush SW, Sadoff S. Statistical inference when classroom quality is measured with error. Journal of Research on Educational Effectiveness. 2008;1:138–154. [Google Scholar]
- Reznick JS, Goldfield BA. Rapid change in lexical development in comprehension and production. Developmental Psychology. 1992;28:406–413. [Google Scholar]
- Rodgers JL. The Epistemology of Mathematical and Statistical Modeling: A Quiet Methodological Revolution. American Psychologist. 2010;65(1):1–12. doi: 10.1037/a0018326. [DOI] [PubMed] [Google Scholar]
- Rogosa DR, Brandt D, Zimowski M. A growth curve approach to the measurement of change. Psychological Bulletin. 1982;90:726–748. [Google Scholar]
- Rowe ML, Goldin-Meadow S. Early gesture selectively predicts later language learning. Developmental Science. 2009a;12:182–187. doi: 10.1111/j.1467-7687.2008.00764.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowe ML, Goldin-Meadow S. Differences in early gesture explain SES disparities in child vocabulary size at school entry. Science. 2009b;323:951–953. doi: 10.1126/science.1167025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowe ML, Levine SC, Fisher J, Goldin-Meadow S. Does linguistic input play the same role in language learning for children with and without early brain injury? Developmental Psychology. 2009;45:90–102. doi: 10.1037/a0012848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowe ML, Özçalýskan S, Goldin-Meadow S. Learning words by hand: Gesture's role in predicting vocabulary development. First Language. 2008;28:182–199. doi: 10.1177/0142723707088310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sauer E, Goldin-Meadow S. Experimental evidence that early gesturing can play a role in vocabulary development. 2010 under review. [Google Scholar]
- Sauer E, Levine SC, Goldin-Meadow S. Early gesture predicts language delay in children with pre- and perinatal brain lesions. Child Development. 2010;81:528–539. doi: 10.1111/j.1467-8624.2009.01413.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwarz Gideon E. Estimating the dimension of a model. Annals of Statistics. 1978;6(2):461–464. [Google Scholar]
- Singer JD, Willett JB. Applied longitudinal data analysis: Modeling change and event occurrence. New York, NY: Oxford University Press; 2003. [Google Scholar]
- Snow CE, Burns S, Griffin P. Preventing reading difficulties in young children. Washington, DC: National Academy Press; 1998. [Google Scholar]
- Stanovich KE. Matthew Effects in Reading: Some Consequences of Individual Differences in the Acquisition of Literacy. Reading Research Quarterly. 1986;21(4):360–407. [Google Scholar]
- Weizman ZO, Snow CE. Lexical input as related to children's vocabulary acquisition: Effects of sophisticated exposure and support for meaning. Developmental Psychology. 2001;3:7, 265–279. doi: 10.1037/0012-1649.37.2.265. [DOI] [PubMed] [Google Scholar]