Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jan 1.
Published in final edited form as: Sci Stud Read. 2012 Jan 9;16(5):457–474. doi: 10.1080/10888438.2011.599902

Evidence for Common Etiological Influences on Early Literacy Skills in Kindergarten

Brooke Soden-Hensler 1, Jeanette Taylor 1, Christopher Schatschneider 1
PMCID: PMC3507462  NIHMSID: NIHMS334440  PMID: 23204828

Abstract

Understanding how the etiology of print awareness and phonological awareness are related to the etiology of decoding can provide insights into the development of word reading. To address this issue, we examined the degree of overlap among etiological influences of pre-reading skills in 1,252 twin pairs in kindergarten. Genetic, shared environmental, and non-shared environmental factors were significant for all three literacy phenotypes. The majority of genetic and shared environmental influence on decoding was due to common factors that included print awareness and phonological awareness. Notably, only a single genetic factor contributed to all three literacy phenotypes but there was additional shared environmental influence common to phonological awareness and decoding. Findings suggest commonalities in the etiology of pre-reading literacy skills that could inform work on the development of reading skill.

Keywords: decoding, print awareness, phonological awareness, genetic, twins


Much research has focused on identifying struggling readers as early as possible in order to provide intervention before students lag behind (Torgesen, 2004). Several reading-related skills have been identified that are highly predictive of later reading achievement including print awareness and phonological awareness (Adams, 1990; Schatschneider, Fletcher, Francis, Carlson, & Foorman, 2004). The National Early Literacy Panel (NELP) found that early alphabet knowledge and rapid automatized naming of letters/digits predicted decoding (r = .50 and .40, respectively) as well as reading comprehension (r = .48 and .43, respectively; Lonigan, Schatschneider, Westberg, & National Early Literacy Panel, 2008). Thus, gaining a better understanding of pre-reading skills and the sources of variation common to these skills and reading could provide opportunities to recognize and identify prereaders who may become struggling readers.

Among the more widely examined pre-reading skills is letter naming, a measure of print awareness, that is one of the single best predictors of future reading achievement (Adams, 1990; Scarborough, 1998; Snow, Burns, & Griffin, 1998). Given its high predictive ability, letter knowledge is frequently assessed in preschool and kindergarten. By the end of kindergarten, children should be able to recognize and name upper- and lowercase letters for most of the letters in the alphabet (Snow et al., 1998). Knowing the names of letters is a prerequisite to learning the correspondence of letters to sounds (Whitehurst & Lonigan, 1998), making letter naming a foundational skill for later literacy.

In addition to being able to discriminate units of print (such as letters) children must be able to discriminate units of language, including phonemes (Whitehurst & Lonigan, 1998). By the end of kindergarten, children should have at least some phonemic awareness, including that words can be segmented into smaller units of sound (Snow et al., 1998). The role of phonological awareness in learning to read has received a fair amount of attention and it is now well established that phonological awareness has a causal influence on reading acquisition (Bradley & Bryant, 1983; Wagner & Torgesen, 1987; Wagner et al., 1997). Once a child has learned the alphabet and has developed phonological awareness, learning the alphabetic principle (the mapping of sounds to letters) enables him/her to decode words. Decoding of pseudowords is a direct measure of children’s mastery of the alphabetic principle since there is no exposure to such words and they must be sounded out.

Though there has been considerable research on emergent literacy and early reading acquisition, the etiology of these skills is less understood. Much of the research on genetic and environmental influences on reading has relied on older samples (e.g., age 13, Stevenson, 1987) or samples including broad age ranges (e.g., age range 8–20, Light & DeFries, 1995; age range 6–12, Thompson, Detterman, & Plomin, 1991). More recently, studies involving younger samples have been conducted (e.g., Byrne et al., 2007; Harlaar, Spinath, Dale, & Plomin, 2005; Hohnen & Stevenson, 1999; Petrill, Deater-Deckard, Thompson, DeThorne, & Schatschneider, 2006; Taylor, Roehrig, Soden Hensler, Connor, & Schatschneider, 2010; Taylor & Schatschneider, 2010).

Print awareness, phonological awareness, and decoding have each been shown to be influenced by genetic and environmental factors in pre- and early readers. Measures of print awareness tend to be modestly affected by genetic factors (heritability estimates less than 30%) and more substantially influenced by environmental factors. For example, a composite measure of print knowledge in preschoolers in the International Longitudinal Twin Study (ILTS; Byrne et al., 2002) showed a shared environmental influence that was roughly twice the magnitude of the genetic influence. A similar outcome was found for letter identification in kindergarten and first grade twins (mean age 6.1) in the Western Reserve Reading Project (WRRP; Petrill et al., 2006). In a sample of kindergarteners in the Florida Twin Project on Reading (FTP-R), Taylor and Schatschneider (2010) found that letter naming fluency had approximately equivalent influences of genetic, shared environmental, and non-shared environmental influences.

In contrast, moderately high heritability estimates (50–65%) have been found for phonological awareness across various studies of first and second grade children (Byrne et al., 2002; Hohnen & Stevenson, 1999) and for a latent phonological awareness factor in kindergarten (Byrne et al., 2005). Shared environmental influences on phonological awareness in these studies were moderately low but nonetheless larger than the estimate of non-shared environment. The WRRP sample showed similar estimates of genetic and shared environmental influences (.48 and .43, respectively; Petrill et al., 2006) as did the FTP-R although with estimates of genetic and shared environment both in the 20–40% range (Taylor & Schatschneider, 2010).

The genetic effects on decoding pseudowords and words are substantial in early readers as compared to environmental effects. High heritabilities were found in a sample of 7-year olds participating in the Twins Early Development Study (TEDS; Kovas, Haworth, Dale, & Plomin, 2007) on both nonword reading (.67) and word reading (.69), whereas shared environmental influences were minimal (.15 and .13, respectively). In the WRRP sample, about half of the variability in decoding pseudowords (.49) was due to genetic factors, with a moderate effect of shared environment (.31; Petrill et al., 2006). In contrast, genetic effects on reading words (.68) were substantial whereas shared environment was much less so (.22). Hohnen and Stevenson (1999) reported heritabilities of .60 and .59 on a literacy composite for 6 and 7-year old children, respectively. Conversely, shared environmental influences were more modest (.36 and .32, respectively). A latent factor of reading that included both words and nonwords was highly heritable (.70) with only modest shared environment effects (.22) in the ILTS kindergarten sample (Byrne et al., 2005).

While the univariate results speak to the relative impact of genetic and environmental factors on each of these skills, a multivariate approach is required to establish the degree of overlap in these etiological factors among the skills. Hohnen and Stevenson (1999) examined the link between phonological awareness and literacy in 6 and 7-year-old children and found common genetic influences on both phonological awareness and literacy; however, when performance IQ and general verbal ability were controlled, no independent genetic variance was observed between the two skills. This was not true for environmental influences: additional covariance between phonological awareness and literacy was environmentally mediated. This suggests that genetic and environmental influences for literacy and phonological awareness overlap in beginning readers and that environmental factors may be particularly important in the covariance of these skills. In a more developmental approach, Byrne and colleagues (2005) investigated print knowledge and phonological awareness in preschool and their relationship with kindergarten reading, finding that most of the variability in these skills was accounted for by common genetic and environmental influences, though some of the genetic variance in phonological awareness and reading was independent.

Understanding how the etiology of print awareness and phonological awareness are related to the etiology of decoding can provide insights into the development of word reading. The phenotypic literature on the acquisition of these skills is well-established and the behavioral genetic research indicates that the traits are influenced by both genes and environment and, furthermore, that there is some degree of overlap in genetic and environmental factors among these skills from preschool to kindergarten. However, it is not clear whether the same genetic and environmental factors are acting on these skills during the phase when students initially read. The current study filled this gap in the literature by investigating common versus distinct genetic and environmental influences on measures of print awareness, phonological awareness, and decoding ability in kindergarten.

Method

Participants

The Florida Twin Project on Reading (FTP-R) is a longitudinal study that is part of a Learning Disabilities Center at Florida State University and the Florida Center for Reading Research. Twins were ascertained from the Progress Monitoring and Reporting Network (PMRN), a state-wide database of achievement measures administered three or four times per school year. Potential twin pairs were identified in the PMRN based on a match of children with the same last name, birth date, and school. Parents of potential twins were contacted by mail to assess zygosity and obtain consent to allow the use of the children’s PMRN data for twin analyses. Zygosity was assessed using five questions about similarity of the twins that have been used in other twin studies and show an accuracy rate of over 95% when compared to DNA tests (Lykken, Bouchard, McGue, & Tellegen, 1990). This procedure was approved by the Florida State University IRB as well as the IRBs of the various counties in Florida from which twins have been ascertained thus far (representing northern, central, and southern parts of the state).

At the time of this report, 5,716 possible twin pairs/multiples had been identified in 14 Florida counties and a response was received from 43% of families. Data from the PMRN were only available for twins whose parent replied and consented and, therefore, analyses comparing responders and non-responders were not possible. Only 4% of responses indicated an ascertainment error (i.e., the children were not twins). The present study examined early literacy skills measured in kindergarten. From the initial sample of 2,620 individual twins with data on at least one of the study measures in kindergarten, a total of 1,252 pairs with complete data on all measures were available for analysis (116 twins from the initial sample were excluded due to missing data on one or two measures). The present sample included 427 monozygotic (MZ) pairs (197 male; 230 female) and 825 dizygotic (DZ) pairs (209 male; 210 female; 406 opposite-sex). Some of this sample was included in a previous report examining effects of socioeconomic status on these same measures (Taylor and Schatschneider, 2010).

The mean age of the sample at the beginning of kindergarten was 5.56 (SD = 0.40). Parents reported on the racial/ethnic status of the twins when they registered them for school and this information is contained in the PMRN. Parents could identify the children into only one of six categories and the percent of each category represented in the present sample was as follows: 1.5% Asian/Pacific Islander, 20.8% Black, 23.4% Hispanic, 0.2% American Indian, 5.1% Mixed, and 48.5% White. The rest (0.5%) had missing data.

Measures

Letter Naming Fluency (LNF)

Letter Naming Fluency (Elliott, Lee, & Tollefson, 2001; Ritchey, 2002; Speece & Case, 2001) is a measure of print awareness that measures the number of correct letter names provided in one minute. Upper and lower case letters are randomly arranged on a standard size page (five rows and five columns) in 30 point, Century Gothic font. Administration begins with three practice items and feedback. Reported alternate-forms reliability (r =. 82–.93; Elliott et al., 2001; Speece & Case, 2001) and predictive criterion-related validity with the Basic Reading Cluster score (WJ-R) is judged as adequate (r =. 58–.75; Elliott et al., 2001; Speece & Case, 2001).

Phoneme Segmentation Fluency (PSF)

In this measure of phonological awareness, a child is presented orally with a word containing three or four phonemes and asked to produce the individual phonemes. A child is given one point for each correctly produced phoneme. The directions are modified slightly from Kaminski and Good (2003) to include additional practice items, picture cues, and more explicit feedback (Ritchey, 2002). Reported alternate-forms reliability (r = .60 to .90; Kaminski & Good, 1996, 2003; Ritchey, 2002) and predictive and concurrent criterion-related validity with reading, spelling, and reading-related skills are adequate (r =.54 to .68; Kaminski & Good, 1996; 2003; Ritchey, 2002). There are 20 alternate forms. The score is the number of correct phonemes produced in one minute.

Nonsense Word Fluency (NWF)

This is a measure of print awareness that requires the child to read vowel-consonant and consonant-vowel-consonant, single syllable pseudowords all of which have the short vowel sound. After a practice trial, the examiner instructs the child to read the “make believe” words as quickly and accurately as possible. If the child does not respond within 3 seconds, the examiner prompts with “next?” The stimuli are presented in 10 rows of five words each. Reported alternate-forms reliability is strong (r =. 83 to .94; Kaminski & Good, 2003; Speece, Mills, Ritchey, & Hillman, 2003) and predictive and concurrent criterion-related validity coefficients with reading (r =. 36 to .91; Kaminski & Good, 2003; Speece et al., 2003) are adequate to strong. There are 20 alternate forms. The original scoring guidelines give credit for correctly producing individual phonemes or for producing the pseudoword as a blended unit. Thus, if the nonsense word is “vab,” 3 points are awarded if the child says “/v/ /a/ /b/” or “vab.” Partial credit is awarded when only some phonemes are produced. The NWF score is the total number of letter-sounds correctly produced in one minute.

Procedure

All measures were administered in students’ schools by district-employed testers who were trained using a state-approved training procedure. The state-wide PMRN database includes information from multiple assessments within the school year. For this study, the last assessment point (end of school year) was used. The scores from the first time through kindergarten were used in cases where one or both members of the twin pair repeated the grade.

Analyses

Descriptive statistics and phenotypic (Pearson) correlations among all measures were calculated for the entire sample and by zygosity. Intraclass correlations were also calculated by zygosity. Genetic influence is inferred when the magnitude of the MZ twin intraclass correlation exceeds that of the DZ twin correlation. Correlations of similar magnitude across zygosity are indicative of shared environmental influence. As an initial step in evaluating genetic and environmental influence on the covariation of the literacy measures, cross-twin cross-trait correlations were calculated in which a trait score from one twin (e.g., letter naming fluency) was correlated with a different trait score from the co-twin (e.g., phoneme segmentation fluency). If the MZ cross-twin cross-trait correlation is greater in magnitude than the corresponding DZ twin correlation, then genetic influence on the covariation between the two traits is inferred.

The intraclass correlations indicated that non-additive genetic factors (evidenced by an MZ twin correlation that is more than twice the magnitude of the DZ twin correlation) were not influencing any measure and, therefore, those effects were not estimated in the biometrical models. Thus, the total phenotypic variance (VP) in each decoding-related measure was decomposed into additive genetic effects (A), shared environmental effects (C), and non-shared environmental effects (E) by the following equation:

VP=VA+VC+VE. (1)

The expected covariances among twins were specified as follows:

MZ covariance=a2+c2 (2)
DZ covariance=.5a2+c2. (3)

In order to examine the extent to which genetic and environmental influences contribute to variability in each measure and to their covariation, a multivariate Cholesky decomposition model was utilized. Based on theories of the development of reading skill, the Cholesky model was specified with Letter Naming Fluency as the first phenotype followed by Phoneme Segmentation Fluency and then Nonsense Word Fluency. A Cholesky decomposition is akin to a factor analysis in which etiological influences (A, C, and E) are represented as latent factors onto which the observed variables can load. Variance in each observed variable can load onto one or more latent factors. For each source of etiological influence, the maximum number of factors equals the number of observed variables. Covariance paths in the model (e.g., a21, c32, etc.) allow for an assessment of common sources of etiological influence on different phenotypes.

Models were fit to raw data using full information maximum-likelihood in the Mx GUI software program (Neale, Xie, Boker, & Maes, 2003). The fit of a model is indicated by the -2LL estimate. Reduced models were fit by setting one or more parameters to zero. In this way, the significance of a covariance path could be tested. Reduced models were then compared to the full model using a -2LL difference test in which the difference in likelihood estimate values was tested for significance using the difference in df between the full and reduced models as the df for the test, which is distributed as a chi-square. A non-significant -2LL difference test indicated that the reduced model could be accepted over the full model. Reduced models that could be accepted over the full model were compared to each other using Akaike’s Information Criterion (AIC = χ2 - 2df; Akaike, 1987) with the lowest AIC indicating the best-fitting model.

Finally, the multivariate Cholesky model provided an estimate of the genetic correlation (raxy, the association of genetic factors contributing to pairs of measures) and environmental correlations (rcxy, the association of shared environmental factors contributing to pairs of measures; rexy, the association of unique environmental factors contributing to pairs of measures).

Results

Analyses were prefaced by an examination of the distribution of each variable for normality and none required transformation. Means and standard deviations for the literacy measures are presented in Table 1 for the whole sample and by zygosity. An important assumption in twin research is that MZ twins are more alike than DZ twins because of their greater genetic rather than environmental similarity. If MZ twins receive more similar treatment in their environment than DZ twins and that greater similarity has an effect on the phenotype of interest, then the MZ twin variance for that phenotype would be smaller than the DZ twin variance. None of the measures showed a significant variance difference. Table 1 presents the results of the tests for mean differences by zygosity. Means were evaluated with an alpha corrected for the number of tests (p = .016), and though effect sizes for all three literacy skills were small (d = .09 – .10), nonsense word fluency showed a significant difference by zygosity. The phenotypic (Pearson) correlations in the whole sample were as follows: letter naming fluency-phoneme segmentation fluency, r = .45; letter naming fluency-nonsense word fluency, r = .69; phoneme segmentation fluency-nonsense word fluency, r = .48. As expected, the different early literacy measures were significantly (p < .001) correlated with one another.

Table 1.

Means (and Standard Deviations) for Decoding-related Measures for the Total Sample and by Zygosity

Total Sample
N=2,504
MZ
N=854
DZ
N=1,650
MZ vs. DZ
t (2,502)
Letter Naming Fluency 52.39 (17.92) 51.28 (17.53) 52.97 (18.09) −2.24
Phoneme Segmentation Fluency 40.07 (17.03) 38.97 (17.47) 40.64 (16.78) −2.33
Nonsense Word Fluency 43.93 (24.91) 42.23 (23.95) 44.81 (25.35) −2.46*

Note: Ns are for individuals. Independent samples t-tests are reported for the comparison between MZ and DZ twins. Variances did not differ significantly between MZ and DZ twins for any measure.

*

p = .014.

Twin intraclass correlations by zygosity and by gender within zygosity are presented in the upper part of Table 2. The pattern of correlations was similar for males and females, but the opposite-sex DZ correlations did not approximate the geometric mean of the same-sex DZ correlations for any variable suggesting the possibility of sex limitation or sex differences in the magnitude (scalar) or type (nonscalar) of etiological influence. To investigate sex differences, multigroup multivariate Cholesky models were fit to the data to assess scalar and nonscalar effects following suggestions by Neal, Røysamb, and Jacobson (2006). In brief, a model was fit in which genetic and environmental correlations were constrained across sex to test scalar sex limitation. That model fit better than the one in which correlations were not constrained. Next, four models were fit that tested for genetic or shared environmental nonscalar effects on either males or females. The best fitting of those four models was the one with nonscalar genetic effects for males. That model was then compared to the model with no scalar sex limitation (genetic and environmental correlations constrained across sex) and the latter provided a non-significant difference in fit, indicating that the nonscalar sex effect for males was not significant. Finally, a model in which estimates of A, C, and E were constrained across sex was fit to the data and it provided a non-significant difference in fit over the model with no scalar sex limitation, indicating that there was no scalar or nonscalar sex limitation and that model-fitting could continue on combined data. Table 2 also provides the cross-twin cross-trait correlations and the phenotypic correlations. The pattern of intraclass and cross-twin cross-trait correlations were consistent with the expectation in suggesting the influence of both genetic and shared environmental influences on each measure and on their bivariate associations.

Table 2.

Correlations for Decoding-related Measures by Gender and Zygosity

MZ
N=427
MZ Male
N=197
MZ Female
N=230
DZ
N=825
DZ Male
N=209
DZ Female
N=210
DZ OS
N=406
Intraclass Correlation
 Letter Naming Fluency .64 .67 .61 .47 .47 .53 .43
 Phoneme Segmentation Fluency .61 .58 .63 .45 .47 .53 .39
 Nonsense Word Fluency .69 .72 .66 .46 .52 .48 .41
Cross-twin Cross-trait Correlation
 Letter Naming-Phoneme Segmentation .36 .38 .35 .29 .26 .39 .26
 Letter Naming-Nonsense Word .57 .59 .54 .40 .41 .44 .36
 Phoneme Segmentation -Nonsense Word .43 .45 .41 .30 .28 .42 .25
Phenotypic Correlation
 Letter Naming-Phoneme Segmentation .43 .43 .44 .46 .49 .48 .44
 Letter Naming-Nonsense Word .68 .69 .67 .70 .72 .71 .68
 Phoneme Segmentation -Nonsense Word .53 .55 .51 .46 .50 .47 .42

Note: Ns are pairs. OS = opposite-sex.

As foreshadowed by the pattern of intraclass correlations, the ACE model was the best-fitting base model as reduced models (e.g., AE) provided a significant decrement in fit. Next, covariance paths were tested in a series of reduced models. First, all covariance paths for a given source of variance (A, C, or E) were dropped. If that reduced model could not be accepted over the full ACE model, then the parameter estimates from the full model were used to identify specific paths to be tested for significance. The multivariate Cholesky model-fitting results are presented in Table 3. Reduced models in which all covariance paths associated with a particular source of variance were dropped provided a significantly worse fit than the full model. The reduced models that provided a non-significant change in fit over the full model were compared using the AIC fit statistic and the best-fitting model was the one in which the a32 and c33 paths were dropped. The best-fitting model is presented in Figure 1. As indicated in Figure 1, the first additive genetic factor (A1) accounted for variance in each of the three early literacy skills and that was the only additive genetic factor common to the second and third phenotypes in the model. Shared environmental variance in the three early literacy skills was accounted for by just two factors. Non-shared environmental variance was accounted for by three factors with the covariance paths yielding significant parameters that were nonetheless small in magnitude.

Table 3.

Multivariate Cholesky Model-fitting Results

Model -2LL df AIC -2LLΔ
ACE 17995.84 7488 3019.84
Drop all A covariance paths 18050.21 7491 3068.21 54.37 (3)*
Drop only a32 path 17996.03 7489 3018.03 0.19 (1)
Drop all C covariance paths 18026.58 7491 3044.58 30.74 (3)*
Drop only c33 path 17995.94 7489 3017.94 0.10 (1)
Drop all E covariance paths 18100.67 7491 3118.67 104.83 (3)*
Drop only e32 path 18015.22 7489 3037.22 19.38 (1)*
Drop only a32, c33, and e32 paths 18021.85 7491 3039.85 26.01 (3)*
Drop only a32 and c33 paths 17996.31 7490 3016.31 0.47 (2)

Note: The best-fitting model is presented in bold type. A = additive genetic effects; C = shared environmental effects; E = non-shared environmental effects; AIC = Akaike Information Criterion; -2LLΔ = difference in -2LL estimate between the ACE model and the reduced model.

*

p < .05.

Figure 1.

Figure 1

Best-fitting multivariate Cholesky decomposition model for decoding-related skills. A = additive genetic effects; C = shared environmental effects; E = non-shared environmental effects.

The Cholesky model produces univariate estimates of A, C, and E for each phenotype and, as expected, significant estimates of both A and C were found for each literacy skill. Univariate estimates of each source of variance can be derived from Figure 1 by squaring path estimates pointing to the phenotype from respective etiological factors. As illustrated in Figure 1, the first observed variable in a Cholesky decomposition can have loadings on just the initial factor for each source of variance. Thus, the univariate estimates of genetic, shared environmental, and non-shared environmental variance [and 95% CI] for letter naming fluency are derived by squaring the A1, C1, and E1 paths: a2 = .40 [.26 – .53], c2 = .26 [.15 – .36], and e2 = .34 [.30 – .39], respectively.

The amount of variance associated with A, C, and E for the other phenotypes in the model in Figure 1 can be further decomposed into that which is attributable to a factor that is common to other phenotypes and that which is unique to the phenotype. The additive genetic variance in phoneme segmentation fluency is the sum of squared paths from factors A1 and A2, or .28 [.14 – .43]. Most of the genetic variance in phoneme segmentation fluency (78%: .472/[.252 + .472]) was unique to that decoding-related skill and the remainder (22%) was associated with the A1 factor common to all of the decoding-related skills. The estimate of shared environment effect on phoneme segmentation fluency was c2 = .30 [.19 – .42], and just over half (57%) of that was common to letter naming fluency and nonsense word fluency. In contrast, nearly all (95%) of non-shared environmental variance (e2 = .41 [.36 – .46]) for phoneme segmentation fluency was unique to just that skill. For the third variable in the model, nonsense word fluency, the majority (75%) of the additive genetic variance (a2 = .52 [.41 – .62]) was attributable to the A1 factor common to all of the decoding-related skills and the remainder was unique to nonsense word fluency. There was no genetic factor common to just phoneme segmentation and nonsense word fluency. All of the shared environmental variance for nonsense word fluency (c2 = .19 [.10 – .28]) was common to the other reading skills and the bulk of that (77%) was common to both of the other phenotypes and the remainder was common to just phoneme segmentation fluency. The reverse was true for non-shared environmental variance (e2 = .29 [.25 – .33]) for which 85% was unique to only that phenotype.

Finally, the correlations among the genetic factors, the shared environmental factors, and the non-shared environmental factors associated with the phenotypic variables in the model were computed. There was a moderate correlation between the genetic factors that contribute to variability in letter naming fluency and phoneme segmentation fluency skills (raxy = .47). Similarly, the genetic correlation between phoneme segmentation fluency and nonsense word fluency was also moderate (raxy = .41), indicating common genetic influences as well as specificity. The genetic correlation between letter naming and nonsense word fluency was much stronger (raxy = .87), suggesting that the genetic factors that contribute to variability in nonsense word fluency are largely the same as those that contribute to variability in letter naming fluency. The shared environmental correlations were uniformly high among the factors contributing to variability in decoding-related skills (rcxy = .76, .87, and .98, for letter naming fluency-phoneme segmentation fluency, letter naming fluency-nonsense word fluency, and phoneme segmentation fluency-nonsense word fluency, respectively). Based on the high genetic and shared environmental correlations found between some factors, additional reduced Cholesky models were fit testing the significance of the a33 and c32 paths but none of the reduced models could be selected over the best-fitting model shown in Figure 1, indicating that the A2, A3 and C2 factors could not be dropped from the model. The non-shared environmental correlations were uniformly small in magnitude (rexy = .22, .34, and .26, for letter naming fluency-phoneme segmentation fluency, letter naming fluency-nonsense word fluency, and phoneme segmentation fluency-nonsense word fluency, respectively), indicating that largely different sets of non-shared environmental factors contributed to variability in the three early literacy skills.

Discussion

It is well established that print awareness and phonological awareness in early readers are predictive of future reading achievement. Genetically sensitive studies are beginning to indicate the extent to which these early reading-related skills are due to genetic and environmental factors. However, the extent to which the genetic and environmental factors that influence each of these skills is common or unique is less well known. The aim of the current study was to investigate the overlap of etiological influences on measures of print awareness, phonological awareness, and decoding in beginning readers.

The present findings showed that the bulk of genetic and environmental influences on decoding are common to print awareness and phonological awareness. About half of the variability in decoding in kindergarteners was due to genetic factors and the vast majority (75%) of those genetic influences were common to print awareness and phonological awareness. Similarly, 77% of the shared environmental influence on decoding was common to print awareness and phonological awareness. That is, the same factors that are acting on the pre-reading skills print awareness and phonological awareness are also acting on decoding. For phonological awareness this overlap was substantial for shared environment but only moderate for genetic influences. In contrast, for print awareness this overlap was substantial for both shared environment and genetic influences. Interestingly, interventions targeting phonological awareness have succeeded in improving reading achievement (e.g., Blachman, Tangel, Ball, Black, & McGraw, 1999; Mathes et al., 2005), whereas interventions that focus primarily on letter knowledge (the print awareness measure in the current study) have had non-significant or only modest effects on subsequent reading (Piasta & Wagner, 2010). This suggests that the reasons that phonological awareness interventions succeed could be due to overlap of common environmental factors impacting phonological awareness and reading more so than overlap of genetic factors. The lack of significant effects for interventions focused on letter knowledge despite substantial overlap of both shared environmental and genetic factors might be because letter knowledge seems to tap a process that is influenced by genetic factors that continue to have an effect on later decoding and this process may not be adequately affected by current interventions.

The environmental link between phonological awareness and reading observed in the current study was also seen in another study of early readers. After controlling for performance IQ and general verbal ability, Hohnen and Stevenson (1999) found that there was no additional genetic influence common to phonological awareness and a composite of literacy skills, but there was additional common environmental influence. In the current study, there was also no common genetic influence between just phonological awareness and decoding; all of the shared genetic influence between the two skills was via a genetic factor which also included print awareness. Also similar to Hohnen and Stevenson, current findings showed significant (though relatively small) additional common environmental influences on phonological awareness and decoding beyond that shared with print awareness. This further supports the idea that environmental factors may be of particular importance in the observed covariance of these two skills. As discussed above, the modest genetic correlation coupled with higher environmental correlations (especially for shared environment) offers further support that this may be the case.

The pattern of results from the Cholesky decomposition in the current report and those from two studies with ILTS twins (Byrne et al., 2005, 2006) show both similarities and differences. Similar to the present study, Byrne and colleagues found that beyond the genetic factor that included print awareness, phonological awareness, and reading, there was no additional significant shared genetic influence between phonological awareness and reading. Although phonological awareness and reading are known to be developmentally intertwined constructs, converging evidence suggests that they are nonetheless influenced to a degree by independent genetic influences. While the genetic architecture across the literacy skills examined was similar in the current study and in the ILTS studies (Byrne et al., 2005, 2006), there were differences in the magnitude of the overall effects. We found that about one quarter of the total genetic influence on phonological awareness was common to print awareness and reading, whereas Byrne et al. (2005, 2006) found that about half of the genetic influence on phonological awareness was common to print awareness and reading. Thus, we provide independent evidence that converges on the finding that there appears to be a single genetic factor that contributes to both pre-reading skills (print awareness and phonological awareness) and decoding with additional contributions from skill-specific factors on the more complex skills.

Our results are also consistent with the prior investigations (Byrne et al., 2005, 2006) in showing that decoding in kindergarten is not influenced by a unique shared environmental factor. However, we found two shared environmental factors: one common to all of the early literacy skills and one common to just the higher level skills of phoneme segmentation and decoding. Byrne et al. (2005, 2006) found just a single shared environmental factor common to all three reading related skills. This difference may be due to the disparate timing of data collection for the phonological awareness and reading measures. In the current study, both phonological awareness and decoding were taught in the same educational environments to kindergarteners and, therefore, the measures of these skills were concurrently collected. In the Byrne et al. (2005, 2006) studies, the phonological awareness measures were administered during preschool and the reading measures were administered during kindergarten. There is also quite a lot of environmental variability in the FTP-R twin sample as participants come from a wide range of SES backgrounds. Further, the sample is drawn from Florida where early education experience varies widely with some children participating in the state-funded Voluntary Pre-Kindergarten program and others not (the PMRN data set does not provide information on this and so participation in pre-K cannot be estimated in the present sample). The large environmental variability in the current sample may explain the greater proportion of variability due to environmental factors and potentially the additional shared environmental factor that was found.

The results of the current study should be considered in light of potential limitations. Though the PMRN affords the advantage of data collection from a large and diverse sample, it is limited in that only state-prescribed measures and very restricted environmental variables are available. At the time of data collection for this report, no environmental variables such as preschool attendance prior to kindergarten, quality of current literacy instruction, home literacy environment, or parental education were available. As such, it was not possible to test the influence of potential sources of environmental influence. With regard to the literacy measures investigated, we found somewhat higher non-shared environmental estimates than other studies (e.g., Byrne et al., 2005, 2006) that may be due to larger measurement error. If that is the case then the higher variability in non-shared environment would not fully reflect true non-shared environmental influences (to the extent that it includes measurement error) and would serve to reduce the true variability that could be captured in genetic and shared environmental estimates. Another limitation was that there are other early literacy skills that were not examined. Finally, a measure of general intelligence was not available for the present sample and it could be that some of the etiological factors that were identified are related primarily to overall intellectual ability. Prior research suggests that this is probably more of an issue with regard to the genetic factor contributing to early literacy skills rather than the environmental factors (Hohnen & Stevenson, 1999). Nonetheless, additional studies are needed to examine how the etiological factors associated with general intellectual functioning interface with the etiological factors associated with literacy skills.

Despite these limitations, the results of the present study provide independent converging evidence of great overlap in genetic and environmental influences on early literacy skills. Future studies can build on this growing knowledge base and investigate whether the observed trends of common etiological influences continue or whether new genetic and environmental influences come online as reading skills develop in later grades.

Acknowledgments

This work was supported in part by grants P50 HD052120, R305B090021, and R305B04074.

The authors would like to thank Dr. Kristen Jacobson for her assistance with the statistical models used to test the effects of sex limitation and Amy Mikolajewski, Sarah Kershaw, and Scott Pizzarello for assisting with twin ascertainment. The authors also thank the twins and their families for their participation in the Florida State Twin Registry.

References

  1. Adams MJ. Beginning to read: Thinking and learning about print. Cambridge, MA: MIT Press; 1990. [Google Scholar]
  2. Akaike H. Factor analysis and the AIC. Psychometrika. 1987;52:317–332. [Google Scholar]
  3. Blachman BA, Tangel DM, Ball EW, Black R, McGraw CK. Developing phonological awareness and word recognition skills: A two-year intervention with low-income, inner-city children. Reading and Writing: An Interdisciplinary Journal. 1999;11:239–273. [Google Scholar]
  4. Bradley L, Bryant PB. Categorizing sounds and learning to read: A causal connection. Nature. 1983;301:419–421. [Google Scholar]
  5. Byrne B, Delaland C, Fielding-Barnsley R, Quain P, Samuelsson S, Hoien T, Corley R, DeFires JC, Wadsworth S, Willcutt E, Olson RK. Longitudinal twin study of early reading development in three countries: Preliminary results. Annals of Dyslexia. 2002;52:49–73. [Google Scholar]
  6. Byrne B, Olson RK, Samuelsson S, Wadsworth S, Corley R, DeFries J, et al. Genetic and environmental influences on early literacy. Journal of Research in Reading. 2006;29:33–49. [Google Scholar]
  7. Byrne B, Samuelsson S, Wadsworth S, Hulslander J, Corely R, DeFires JC, Quain P, Willcutt E, Olson RK. Longitudinal twin study of early literacy development: Preschool through Grade 1. Reading and Writing. 2007;20:77–102. [Google Scholar]
  8. Byrne B, Wadsworth S, Corley R, Samuelsson S, Quain P, DeFries JC, et al. Longitudinal twin study of early literacy development: Preschool and kindergarten phases. Scientific Studies of Reading. 2005;9:219–236. [Google Scholar]
  9. Elliott J, Lee SW, Tollefson N. A reliability and validity study of the Dynamic Indicators of Basic Early Literacy Skills. School Psychology Review. 2001;30:33–50. [Google Scholar]
  10. Harlaar N, Spinath FM, Dale PS, Plomin R. Genetic influences on early word recognition abilities and disabilities: A study of 7-year-old twins. Journal of Child Psychology and Psychiatry. 2005;46:373–384. doi: 10.1111/j.1469-7610.2004.00358.x. [DOI] [PubMed] [Google Scholar]
  11. Hohnen B, Stevenson J. The structure of genetic influences on general cognitive, language, phonological, and reading abilities. Developmental Psychology. 1999;35:590–603. doi: 10.1037//0012-1649.35.2.590. [DOI] [PubMed] [Google Scholar]
  12. Kaminski RA, Good RH., III Toward a technology for assessing basic early literacy skills. School Psychology Review. 1996;25:215–227. [Google Scholar]
  13. Kaminski RA, Good RH. DIBELS: Dynamic Indicators of Basic Early Literacy Skills. Fredrick, CO: Sopris West; 2003. [Google Scholar]
  14. Kovas Y, Haworth CMA, Dale PS, Plomin R. The genetic and environmental origins of learning abilities and disabilities in the early school years. Monographs of the Society for Research in Child Development, Ser. 288. 2007;72 doi: 10.1111/j.1540-5834.2007.00439.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Light JG, DeFries JC. Comorbidity of reading and mathematics disabilities: Genetic and environmental etiologies. Journal of Learning Disabilities. 1995;28:96–106. doi: 10.1177/002221949502800204. [DOI] [PubMed] [Google Scholar]
  16. Lonigan CJ, Schatschneider C, Westberg L National Early Literacy Panel. National Early Literacy Panel, Developing early literacy, report of the National Early Literacy Panel: A scientific synthesis of early literacy development and implications for intervention. Washington, DC: National Center for Family Literacy; 2008. Identification of children’s skills and abilities linked to later outcomes in reading, writing, and spelling; pp. 55–106. [Google Scholar]
  17. Lykken DT, Bouchard TJ, Jr, McGue M, Tellegen A. The Minnesota Twin Registry: Some initial findings. Acta Geneticae Medicate at Gemellologiae: Twin Research. 1990;39:35–70. doi: 10.1017/s0001566000005572. [DOI] [PubMed] [Google Scholar]
  18. Mathes PG, Denton CA, Fletcher JM, Anthony JL, Francis DJ, Schatschneider C. The effects of theoretically different instruction and student characteristics on the skills of struggling readers. Reading Research Quarterly. 2005;40:148–182. [Google Scholar]
  19. Neale MC, Røysamb E, Jacobson K. Multivariate genetic analysis of sex limitation and G x E interaction. Twin Research and Human Genetics. 2006;9:481–489. doi: 10.1375/183242706778024937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Neale MC, Boker SM, Xie G, Maes HH. Mx: Statistical modeling. 6th ed. Richmond, VA: Virginia Commonwealth University, Department of Psychiatry; 2003. [Google Scholar]
  21. Petrill SA, Deater-Deckard K, Thompson LA, DeThorne LS, Schatschneider C. Reading skills in early readers: Genetic and shared environmental influences. Journal of Learning Disabilities. 2006;39:48–55. doi: 10.1177/00222194060390010501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Piasta SB, Wagner RK. Developing early literacy skills: A meta-analysis of alphabet learning and instruction. Reading Research Quarterly. 2010;45:8–38. doi: 10.1598/RRQ.45.1.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ritchey KD. Unpublished doctoral dissertation. University of Maryland, College Park; 2002. The prediction of growth in reading subskills and the relationship of growth to literacy outcomes in kindergarten. [Google Scholar]
  24. Scarborough HS. Early identification of children at risk for reading disabilities: Phonological awareness and some other promising predictors. In: Shapiro BK, Accardo PJ, Capute AJ, editors. Specific reading disabilities: A view of the spectrum. Timonium, MD: York Press; 1998. pp. 75–119. [Google Scholar]
  25. Schatschneider C, Fletcher JM, Francis DJ, Carlson C, Foorman BR. Kindergarten prediction of reading skills: A longitudinal comparative analysis. Journal of Educational Psychology. 2004;96:265–282. [Google Scholar]
  26. Snow CE, Burns S, Griffin P, editors. Preventing reading difficulties in young children: Report of the Committee on the Prevention of Reading Difficulties in Young Children. Washington, DC: National Academy Press; 1998. [Google Scholar]
  27. Speece DL, Case LP. Classification in context: An alternative approach to identifying early reading disability. Journal of Educational Psychology. 2001;93:735–749. [Google Scholar]
  28. Speece DL, Mills C, Ritchey KD, Hillman E. Initial evidence that letter fluency tasks are valid indicators of early reading skill. The Journal of Special Education. 2003;36:223–233. [Google Scholar]
  29. Stevenson J, Graham P, Fredman G, McLoughlin V. A twin study of genetic influences on reading and spelling ability and disability. Journal of Child Psychology & Psychiatry. 1987;28:229–247. doi: 10.1111/j.1469-7610.1987.tb00207.x. [DOI] [PubMed] [Google Scholar]
  30. Taylor J, Roehrig AD, Soden Hensler B, Connor CM, Schatschneider C. Teacher Quality Moderates the Genetic Effects on Early Reading. Science. 2010;328:512–514. doi: 10.1126/science.1186149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Taylor J, Schatschneider C. Genetic Influence on Literacy Constructs in Kindergarten and First Grade: Evidence from a Diverse Twin Sample. Behavior Genetics. 2010;40:591–602. doi: 10.1007/s10519-010-9368-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Thompson LA, Detterman DK, Plomin R. Associations between cognitive abilities and scholastic achievement: Genetic overlap but environmental differences. Psychological Science. 1991;2:158–165. [Google Scholar]
  33. Torgesen JK. Preventing early reading failure—and its devastating downward spiral: The evidence for early intervention. American Educator. 2004;28(3):6–19. 45–47. [Google Scholar]
  34. Wagner RK, Torgesen JK. The nature of phonological processing and its causal role in the acquisition of reading skills. Psychological Bulletin. 1987;101:192–212. [Google Scholar]
  35. Wagner RK, Torgesen JK, Rashotte CA, Hecht SA, Barker TA, Burgess SR, et al. Changing relations between phonological processing abilities and word-level reading as children develop from beginning to skilled readers: A 5-year longitudinal study. Developmental Psychology. 1997;33:468–479. doi: 10.1037//0012-1649.33.3.468. [DOI] [PubMed] [Google Scholar]
  36. Whitehurst GJ, Lonigan CJ. Child development and emergent literacy. Child Development. 1998;69:848–872. [PubMed] [Google Scholar]

RESOURCES