Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Sep 1.
Published in final edited form as: Behav Genet. 2011 Jan 23;41(5):660–667. doi: 10.1007/s10519-011-9445-6

DeFries-Fulker analysis of longitudinal reading performance data from twin pairs ascertained for reading difficulties and from their nontwin siblings

Raven L Astrom 1,2, Sally J Wadsworth 1, Richard K Olson 1,2, Erik G Willcutt 1,2, John C DeFries 1,2
PMCID: PMC3085049  NIHMSID: NIHMS262583  PMID: 21259040

Abstract

Reading difficulties are both heritable and stable; however, little is known about the etiology of this stability. Results from a preliminary analysis of data from 56 twin pairs who participated in the Colorado Longitudinal Twin Study of Reading Disability (Astrom et al., 2007) suggested that about two-thirds of the proband deficit at follow-up was due to genetic factors that also influenced deficits at their initial assessment. Although our proband sample is now nearly twice as large, it is still relatively small; thus, to increase power, we subjected data from probands, co-twins and their nontwin siblings to a novel extension of DeFries-Fulker analysis (DeFries & Fulker, 1985; 1988). In addition to providing estimates of univariate and bivariate heritability, this analysis facilitates a test of the difference between shared environmental influences for twins versus siblings. Longitudinal composite reading performance scores at 10.6 and 15.5 years of age, on average, were analyzed from 33 MZ and 64 DZ twin pairs in which at least one member of each pair had reading difficulties, and from 44 siblings of the probands. Scores were highly stable (.86 ± .03, across probands, cotwins and siblings) and heritability of the group deficit at initial assessment was .67 ± .22. Longitudinal bivariate heritability was .59 ± .21, suggesting that nearly 60% of the proband reading deficit at follow-up is due to genetic factors that influenced reading difficulties at the initial assessment. However, tests for special twin environmental influences were nonsignificant.

Keywords: DF analysis, twins, siblings, reading, longitudinal

Introduction

During the past few decades, much has been learned about the heritable nature of reading ability and disability. For example, twin and adoption studies have shown that individual differences in reading performance are highly heritable, although estimates of heritability have ranged from .18 to .81 for subjects 7 to 16 years of age (Alarcón et al., 1995; Harlaar et al., 2007; Stevenson et al., 1987; Wadsworth et al., 2006; Wadsworth et al., 2007). At comparable ages, heritability estimates for reading deficits range from .37 to .72 (e.g., DeFries & Alarcón, 1996; DeFries & Gillis, 1991; Harlaar et al., 2005). Moreover, results obtained from longitudinal studies have shown that stability correlations for reading performance are considerable, ranging from .23 to .96 over intervals of 1 to 8 years (e.g., Bast & Reitsma, 1998; DeFries & Baker, 1983; Shaywitz et al., 1992; Wadsworth et al., 2006; Wadsworth et al., 2007; Wagner et al., 1997). When reading performance was estimated by Hulslander et al. (2010) as a latent trait from multiple measures of word recognition, the stability correlation between age 10 and 15 years was .98.

Although studies have consistently yielded evidence that both individual differences in reading performance and reading deficits are heritable and stable, little is known about the genetic and environmental etiologies of this stability. Only a small number of studies have evaluated the etiology of stability utilizing genetically informative designs, and they have primarily assessed the etiology of individual differences in reading performance within the normal range. For example, in the Colorado Adoption Project, word recognition was examined at ages 7, 12, and 16 in a sample of adoptive and nonadoptive sibling pairs (Wadsworth et al., 2006). Results indicated substantial stability for reading performance with stability correlations of .62 between ages 7 and 12, .71 between ages 12 and 16, and .58 between ages 7 and 16. Moreover, the proportion of observed stability attributable to shared genetic influences was 53%, 62%, and 86%, respectively.

Analyzing data from a younger sample participating in the International Longitudinal Twin Study (ITLS), Byrne et al. (2007) found that genetic influences accounted for approximately 90% of the observed stability between word reading in kindergarten and first grade. However, Grade 1 reading was also influenced by a second genetic factor, suggesting both genetic stability and genetic change at this developmental milestone. Environmental contributions to stability were limited to nonshared environmental influences, which were also a significant source of change. In contrast, shared environment was important only for kindergarten word reading, and did not contribute to stability.

Petrill et al. (2007) assessed the longitudinal stability of reading-related skills and reading outcomes of children who were tested on two occasions in the Western Reserve Reading Project (WRRP): in kindergarten or first grade, and a year later. Measures of reading related skills included phonological awareness (PA), expressive vocabulary (VOCAB) and rapid automatized naming (RAN); outcome was measured by performance on tests of letter knowledge (LET), word knowledge (WORD), phonological decoding (PD) and passage comprehension (COMP). Results of a series of Cholesky decomposition analyses suggested that, with the exception of RAN, genetic influences accounted for a significant proportion of the observed stability (as much as 42%, depending on the measure) for all measures of reading outcome at the two assessments. Shared environmental influences contributed to stability of PA, VOCAB, and LET. Nonshared environment had small, but significant, effects on the stability of RAN, WORD, PD and COMP.

These results are highly similar to those of Harlaar et al. (2007) who examined the genetic and environmental influences on reading performance and reading stability from the Twins Early Development Study (TEDS). For this study, reading achievement was measured by teacher assessments of subjects at ages 7, 9, and 10 years, using a rating scale of general reading achievement that referenced U.K. National Curriculum (NC) achievement goals for literacy. Results of the study showed substantial heritabilities for reading performance ranging from .57 to .67. In addition, NC scores were found to be significantly correlated across all three age groups; .62 for ages 7 to 9, .59 for ages 7 to 10, and .63 for ages 9 to 10, with genetic influences contributing to 77%, 68%, and 77% respectively, of the observed stability.

To our knowledge, only one study has assessed the etiology of stability of reading deficits (Astrom et al., 2007). Subjects from this preliminary longitudinal analysis of reading disabilities of school-aged children (aged 8 – 16 years) were tested in the Colorado Learning Disabilities Research Center (CLDRC, DeFries, et al., 1997) and again approximately 5–6 years later in the Longitudinal Twin Study of Reading Disability (LTSRD, Wadsworth et al., 2007). Probands scored approximately two standard deviations below the mean of the control sample at both measurement occasions (Astrom et al., 2007), and a stability correlation of .84 was obtained between the two assessments. A bivariate extension of the DeFries-Fulker basic multiple regression model for analysis of selected twin data (DF; DeFries & Fulker, 1985; Light & DeFries, 1995) yielded an estimate of bivariate heritability of .65 (± .32), suggesting that nearly two-thirds of the proband reading deficit at follow-up was due to genetic factors that also influenced reading at initial assessment (Astrom et al., 2007).

Although our current longitudinal sample of MZ and DZ twin pairs who met criteria for inclusion in the proband sample at their initial assessment is now nearly twice as large, it is still somewhat underpowered for more complex analyses. However, since the inception of the CLDRC, we have also collected data from siblings of approximately half of the MZ and DZ probands. Consequently, using a novel application of the DF multiple regression method, we have included data from siblings of probands in the present analysis. Thus, the primary objectives of this study were twofold: (1) to assess more rigorously the etiology of the stability of reading deficits in a larger sample of twin pairs than was previously analyzed by Astrom et al. (2007); and (2) to apply an extension of the DF method (DeFries and Fulker, 1985; 1988) to analyze both twin and sibling data and test for “special twin environments” (i.e., a measure of the extent to which shared environmental influences for members of twin pairs differ from those for nontwin-sibling pairs). Based on previous findings of genetic influences on the stability of reading performance within the normal range, as well as our preliminary findings regarding the stability of reading deficits, we hypothesize that largely the same genetic influences on reading deficits are manifested at initial and follow-up assessments. Moreover, given the results obtained from a small twin and sibling study of individual differences in reading-related measures (Zieleniewski, et al., 1987), we hypothesize that shared environmental influences for reading deficits are similar for members of twin pairs and between twins and their nontwin siblings.

Methods

Participants and Measures

Subjects in the present study were first tested in the ongoing CLDRC between September 1996 and March 2003, and also participated in follow-up testing approximately five years later in the LTSRD. In order to minimize the possibility of ascertainment bias, school personnel in 27 different school districts within the state of Colorado identify twin pairs without knowledge of reading status. Parental permission is then sought to review the school records of the twins for any evidence of reading problems (i.e., low reading achievement test scores, referral to resource rooms or reading therapists, reports by classroom teachers, school psychologists, and/or parents). If either member of the twin pair displays a history of reading problems, both twins and their siblings are invited to participate in the study at the University of Colorado, Boulder, and at the University of Denver. The subjects complete a comprehensive battery of psychometric tests which includes the Wechsler Intelligence Scale for Children–Revised (WISC-R; Wechsler, 1974) or the Wechsler Adult Intelligence Scale-Revised (WAIS-R; Wechsler, 1981), the Peabody Individual Achievement Test (PIAT; Dunn & Markwardt, 1970), and measures of reading and language processes and executive functions. A discriminant function score (DISCR) is computed for each subject employing discriminant weights estimated from an analysis of data from the Reading Recognition, Reading Comprehension, and Spelling subtests of the PIAT obtained from an independent sample of 140 non-twin children with reading disabilities and 140 non-twin children without reading problems (DeFries, 1985). In order for an individual to be diagnosed as reading-disabled in the current study, he or she must have a positive history of reading problems and score at least 1 standard deviation below the mean of the control sample on the discriminant score. Additional criteria include a minimum IQ score of 85 on either the Verbal or Performance Scale of the WISC-R (Wechsler, 1974) or the WAIS-R (Wechsler, 1981); no evidence of neurological problems or severe behavioral or emotional problems; and no uncorrected visual or auditory acuity deficits. Control twin pairs are matched to probands on the basis of age, gender, and school district. For a twin pair to be included in the control sample, both members of the pair must have a negative history for reading problems. Selected items from the Nichols and Bilbro (1966) questionnaire are used to determine zygosity of same-sex twin pairs. For indeterminate cases, zygosity is confirmed by analysis of blood or buccal samples.

The current sample included 33 MZ and 64 DZ twin pairs in which at least one twin met proband criteria at initial assessment in the CLDRC, and 44 of their nontwin siblings. Selected twins and both their co-twins and siblings underwent follow-up testing approximately 5 years after their initial participation (average age of 10.6 years at initial assessment and 15.5 years at follow-up). Because concordant twin pairs were double entered, the siblings were paired with both twins in those cases. As a result, there were 58 twin/sib pairings; in 21 of those pairings the twin was selected from an MZ pair and in 37 pairings the twin was selected from a DZ pair. For standardization and transformation of the variables at initial assessment, the control sample comprised 284 subjects tested in the CLDRC during the same time period as those subjects who participated in follow-up testing. At follow-up the control sample included 171 control twins who participated in follow-up testing.

Analyses

Multiple Regression Analysis

Early twin studies of reading difficulties compared concordance rates for deviant scores in identical and fraternal twin pairs to test for genetic etiology (DeFries & Alarcón, 1996). However, because reading performance is a continuous trait, a comparison of concordance rates of “affected” and “unaffected” pairs does not make optimal use of the data. Thus, DeFries and Fulker (1985, 1988) proposed a multiple regression analysis of twin data to assess the etiology of deviant scores, as well as individual differences within selected groups. This method has become a standard of behavioral genetic analysis and the method of choice for analyzing data from selected samples. This multiple regression method is particularly appropriate when analyzing data from probands who are selected because of deviant scores on a continuous variable such as reading performance; the differential regression of MZ and DZ co-twin scores toward the mean of the unselected population provides a test of genetic etiology (DeFries & Fulker, 1985). As MZ twins are genetically identical and DZ twins share only half of their segregating genes on average, the scores of DZ co-twins should regress more toward the mean of the unselected population if the condition is heritable. Consequently, when the MZ and DZ proband means are approximately equal, a simple t-test of the difference between the MZ and DZ co-twin means provides a test of genetic etiology. However, DeFries and Fulker (1985, 1988) proposed that a multiple regression analysis of such data, in which a co-twin’s score is regressed on both the proband’s score and the coefficient of relationship, facilitates a more flexible and statistically powerful test.

The basic DF model is as follows:

C=B1P+B2R+A [1]

where C is the co-twin’s score, P is the proband’s score, R is the coefficient of relationship (R = 1.0 for MZ and 0.5 for DZ twin pairs), and A is the regression constant. The B1 coefficient is the partial regression of the co-twin’s score on the proband’s score, a measure of the average MZ and DZ twin resemblance (DeFries & Fulker, 1985; 1988). The B2 partial regression coefficient estimates the differential regression of co-twin scores by zygosity and equals twice the difference between the MZ and DZ co-twin means after covariance adjustment for any difference between the MZ and DZ proband means. Therefore, B2 provides a direct test for genetic etiology. Moreover, when the data are appropriately transformed prior to multiple-regression analysis (i.e., each score is expressed as a deviation from the mean of the unselected population and then divided by the difference between the proband and population means), B2 provides a direct estimate of heritability of the group deficit, h2g, an index of the degree to which the observed proband deficit is heritable.

In order to incorporate sibling data, a simple extension of the basic DF model can be simultaneously fitted to transformed data from selected twins, their co-twins and co-sibs. The expected transformed MZ and DZ co-twin means (see DeFries & Fulker, 1988) and the corresponding transformed co-sib means are presented in Table 1. From these expected values, it may be seen that the difference between the DZ co-twin mean and the co-sib mean is a simple function of the difference between shared environmental influences in twins versus sibling pairs.

Table 1.

Expected transformed1 co-twin and co-sib means

Subjects Model
MZ Co-twins h2g + c2g(t)
DZ Co-twins ½h2g + c2g(t)
Co-sibs ½h2g + c2g(s)
1

Scores are expressed as a deviation from the unselected population mean and then divided by the difference between the proband and population means (see DeFries & Fulker, 1988).

Thus, to test for differential c2g between twins and siblings, the following extended basic model can be simultaneously fitted to transformed data from selected twins, their co-twins and co-sibs:

C=B1P+B2R+B3S+A [2]

where C is now the co-twin’s or co-sib’s score, P is the proband’s score, R is the coefficient of relationship (1.0 for MZ pairs and 0.5 for both DZ pairs and twin/sib pairs), and S is a dummy code for pair type, i.e., twin pair versus twin-sibling pair (+.5 for MZ twins, +.5 for DZ twins and −.5 for twin-sib pairs). When this model is fitted to the data and all three partial regression coefficients are estimated simultaneously, B3 estimates the difference between c2g(t) and c2g(s) which equals the difference between the DZ co-twin (CDZ) and co-sib (CS) means, as can be observed in the Appendix. As a result, B3 provides a direct test of significance for the difference between environmental influences shared by DZ twin pairs versus those shared by twin-sib pairs, i.e., a test for “special twin environment”. As in the basic model, B2 estimates h2g from twice the difference between the MZ and DZ co-twin means.

Appendix.

Parameter estimates calculated from the expectations in Table 1 and the transformed co-twin and co-sib means

Univariate
Transformed means
 MZ Co-twin mean (CMZ) h2g + c2g(t) = .8947
 DZ Co-twin mean (CDZ) ½h2g + c2g(t) = .5620
 Co-sib mean (CS) ½h2g + c2g(s) = .4199
h2g = 2(CMZ-CDZ) = 2(.8947 – .5620) = .6654
Differential c2g
c2g(MZ) (CMZ- h2g) = (.8947 – .6654) = .2293
c2g(DZ) (CDZ - ½h2g) = (.5620 – .3327) = .2293
c2g(s) (CS - ½h2g) = (.4199 – .3327) = .0872
B3 c2g (t) - c2g (s) = (.2293 – .0872) = .1421

Bivariate1
Transformed means
 MZ Co-twin mean (CMZ) h2g(biv) + c2g(biv)(t) = .7934
 DZ Co-twin mean (CDZ) ½h2g(biv) + c2g(biv)(t) = .4989
 Co-sib mean (CS) ½h2g(biv) + c2g(biv)(s) = .4203
h2g(biv) = 2(CMZ-CDZ) = 2(.7934 – .4989) = .5890
Differential c2g
c2g(biv)(t) (CMZ- h2g(biv)) = (.7934 – .5890) = .2044
c2g(biv)(t) (CDZ - ½h2g(biv)) = (.4989 – .2945) = .2044
c2g(biv)(s) (CS - ½h2g (biv)) = (.4203 – .2945) = .1258
B3 c2g(biv)(t) - c2g(biv)(s) = (.2044 – .1258) = .0786
1

Co-twin and co-sib mean reading scores at follow-up are transformed based on the proband mean at initial assessment.

Because B3 estimates the difference between c2g(t) and c2g(s), its significance is relevant for obtaining an estimate of h2g based upon an analysis of the combined twin and co-sibling data. If B3 is small and not significant, S may be dropped from the extended model, and Equation 1 fitted to the combined data set of twins and siblings. In that case, B2 will estimate h2g from both the twin and co-sib data, rather than from only the twin data. However, if B3 is significant or relatively large, h2g should be estimated from fitting Equation 2 to the combined data set.

Because subjects were not reselected at follow-up, univariate DF models were fitted only to data from the initial assessment. Then, to assess the heritable nature of the stability of reading deficits a bivariate extension of the basic DF model was fitted to data from the initial and follow-up assessments as follows:

Cy=B1Px+B2R+A [3]

where Cy is the co-twin’s or co-sib’s composite reading score at follow-up, Px is the proband’s initial composite reading score, R is the coefficient of relationship, and A is the regression constant. B1 is the partial regression of the co-twin’s or co-sib’s follow-up reading score on the proband’s initial reading score and is a measure of average MZ and DZ cross-variable twin resemblance. Thus, B1 estimates the extent to which co-twin and co-sib scores on the follow-up measure are related to proband scores on the initial measure across zygosity. B2 is the partial regression of the co-twin’s or co-sib’s follow-up reading score on the coefficient of relationship. When the data are transformed prior to multiple-regression analysis, the bivariate B2 coefficient is a function of the square roots of the group heritabilities for reading performance at the two time points and the genetic correlation (rG) between them (i.e., hinitial x hfollow-up x rG; Light & DeFries, 1995). Therefore, B2 provides an estimate of “bivariate heritability” (h2g(Biv)), an index of the extent to which the proband reading deficit at follow-up is due to heritable factors which also influenced the reading deficit at the initial assessment. Further, the proportion of the phenotypic stability correlation (rp) attributable to genetic influences can be obtained by dividing the B2 estimate by rp. Finally, when the following extended bivariate model is fitted to both twin and sibling data (Equation 4), B3 provides a test of significance for the difference between bivariate shared environmental influences in twins and siblings:

Cy=B1Px+B2R+B3S+A [4]

Because truncate selection was employed (DeFries & Gillis, 1991), pairs in which both members met criteria for RD were double-entered for all regression analyses. This is analogous to the computation of probandwise concordance rates, in which both affected members of concordant pairs are included as probands. Standard error estimates and tests of significance were adjusted accordingly. Models were fit using linear regression in SPSS for UNIX server.

Results

Table 2 presents the standardized mean reading performance scores for MZ and DZ probands, as well as those of their co-twins and co-sibs at each assessment. The MZ and DZ proband means are highly similar at both initial and follow-up assessments (averaging about 2 standard deviations below the respective control means) suggesting that the deficit of the probands is highly stable. In addition, at each assessment there is a differential regression of the MZ co-twin, DZ co-twin and co-sib means toward the mean of the control twins. At the initial assessment, the MZ co-twin mean regressed 0.23 standard deviation units toward the control mean on average, whereas that of the DZ co-twin regressed 0.92 standard deviation units. In addition, the co-sib mean regressed 1.30 standard deviation units. Similarly, at follow-up the MZ co-twin mean regressed 0.26 standard deviation units toward the control mean on average, whereas the DZ co-twin and co-sib means regressed 0.75 and 1.00 standard deviation units suggesting that reading deficits are both stable and substantially due to heritable influences.

Table 2.

Mean standardized reading performance scores (± SD) of probands, co-twins and co-sibs at initial1 and follow-up test sessions2

Initial
MZ pairs DZ pairs Twin/sib pairs

M SD M SD M SD
Proband −2.15 ± .95 −2.11 ± .87 −2.24 ± .95
Co-twin/sib −1.92 ± 1.17 −1.19 ± 1.39 −0.94 ± 1.22

Follow-up
Proband −2.00 ± .67 −1.93 ± .64 −2.14 .70
Co-twin/sib −1.74 ± .99 −1.18 ± 1.16 −1.14 1.08
1

Standardized against the mean of 284 Control twins participating in the initial assessment in the CLDRC.

2

Standardized against the mean of 171 Control twins participating in the LTSRD at follow-up.

Corresponding transformed proband, co-twin and co-sib means are presented in Figure 1.

Figure 1.

Figure 1

Mean proband, co-twin and co-sib transformed reading scores at initial and follow-up assessments.

Table 3 presents results of the basic univariate regression analysis of twin-only data and twin-sibling data. When the basic model incorporating data from twins only (Equation 1) was fitted to the transformed proband and co-twin initial scores, the B2 estimate was .67, confirming that the proband reading deficit in this sample is due substantially to genetic influences. Similarly, as expected, when the extended model was fitted to data from both twins and siblings (Equation 2), the B2 estimate was again .67, but slightly more significant. In contrast, B3 was nonsignificant, but not trivial (.14). Therefore, although the more parsimonious model (Equation 1) was also fitted to the combined twin and sibling data for illustrative purposes, as shown in Table 3 the resulting estimate of h2g was biased upward (.79). However, if B3 were nonsignificant and relatively small, fitting the more parsimonious model to the combined data set would be appropriate.

Table 3.

Comparison of twin and twin-sibling results of univariate and bivariate DF analysis

Model Subjects Model B2 ± S.E. p B3 ± S.E. p
Univariate Twins only C = B1P + B2R + A .67 ± .23 = .004 ---- ----
Twins & siblings C = B1P + B2R + B3S + A .67 ± .22 = .003 .14 ± .10 .167
Twins & siblings1 C = B1P +B2R + A .79 ± .20 ≤ .001 ---- ----

Bivariate Twins only Cy = B1Px + B2R + A .59 ± .22 = .008 ---- ----
Twins & siblings Cy = B1Px + B2R + B3S + A .59 ± .21 = .006 .08 ± .10 .415
Twins & siblings1 Cy = B1Px + B2R + A .66 ± .20 = .001 ---- ----
1

Ignoring DZ co-twin versus co-sibling status

When the bivariate model (Equation 3) was fitted to the transformed proband initial scores and co-twin follow-up scores, the estimate of bivariate h2g was .59, suggesting that approximately 60% of the proband deficit in reading at follow-up is due to genetic factors that also influenced reading difficulties at the initial assessment. As expected, when the bivariate model was extended to include siblings (Equation 4), the resulting B2 estimate was also .59, but more highly significant. Moreover, the ratio of B2 to the observed correlation between initial and follow-up scores (.86) suggests that common genetic influences account for nearly 70% (.59/.86 = .69) of the stability of reading difficulties in this sample. However, the estimate of bivariate B3 was relatively small (.08) and non-significant (p = .415). Table 3 also presents results of fitting the more parsimonious model to the longitudinal twin and sibling data, combining the data from DZ co-twins and co-siblings.

As shown in the appendix, estimates of both univariate and bivariate h2g and differential c2g may also be readily calculated from the transformed co-twin and co-sib means.

Discussion

Results obtained from previous studies have shown that reading deficits are stable and heritable; however, the genetic and environmental etiologies of the stability of reading deficits have not been well characterized. Therefore, the primary objectives of the present study were (1) to assess more rigorously genetic and environmental influences on the stability of reading deficits using data from a larger sample of twin pairs and their siblings tested in the CLDRC and re-tested approximately 5 years later in the LTSRD and (2) to fit a novel extension of the DeFries-Fulker multiple regression model (DeFries & Fulker, 1985) to reading performance data from both twins and siblings, potentially increasing power and facilitating a test for “special twin environments.”

In the current study, the average reading performance of the probands at their initial assessment was approximately two standard deviations below those of the controls, and this deficit persisted at follow-up. The stability correlation of probands was .72 ± .04 and that for controls was .75 ± .03. This result is consistent with those of previous studies that have found both reading deficits and reading performance within the normal range to be highly stable (e.g., Bast & Reitsma, 1998; DeFries, 1988; DeFries & Baker, 1983; Hulslander, et al., 2010; Shaywitz et al., 1992; Wadsworth et al., 2006; Wadsworth et al., 2007; Wagner et al., 1997).

Results of univariate DF analysis of data from twins and siblings at their initial assessment suggested that about two-thirds of the proband deficit in this sample was due to genetic influences. Because subjects were not reselected at follow-up, univariate DF analyses were not conducted for follow-up data. When the bivariate extension of the multiple regression model was fitted to proband scores at initial assessment and co-twin/co-sib scores at follow-up, combining DZ co-twin and co-sibling status, the resulting estimate of bivariate heritability was 0.66 (± .20, p = .001), suggesting that about two-thirds of the proband deficit at follow-up was due to genetic influences which also influenced reading deficits at the initial assessment. Further, common genetic influences accounted for approximately 70% of the observed stability between reading scores at the initial and follow-up assessments.

These results are highly consistent with our previous longitudinal analysis of the stability of reading deficits which obtained an estimate of bivariate heritability of .65 (± .32) and found that about 75% of the stability of reading difficulties between initial and follow-up assessments was due to common genetic influences (Astrom et al., 2007). However, our previous study included only twins. Results of the current study suggest that including sibling data increases power at least to some extent. For example, given the effect size estimated from the basic univariate model and sample of 97 twin pairs, power to detect significance of the B2 term, i.e., h2g,is about .85. However, when the 44 siblings are also included and the basic model is fitted to the combined data sets, power improves to .97. Similarly, power for the bivariate model using only twin data is .79; including the sibs, assuming the same parameters, power is increased to about .95. Because the basic DF twin model is quite powerful, even with relatively small samples, the addition of sibling data may be especially beneficial when fitting more complex models, such as those testing hypotheses of differential genetic etiology.

The similarity of results from Astrom et al. (2007) and the current study may be expected as there is considerable overlap between the two samples. However, previous studies of individual differences in reading performance also support these findings. For example, Byrne et al. (2007) found that 90% of the observed stability of word reading in twin pairs between kindergarten and grade one was due to common genetic influences. Results from the TEDS study (Harlaar et al., 2007) were also highly similar with more than two-thirds of the phenotypic stability being genetically mediated. Although Petrill et al. (2007) reported somewhat lower estimates (at most, 42% of the phenotypic stability was due to genetic influences), their study examined the genetic variance of individual subtests of several different measures utilizing Cholesky decomposition models. Our findings are also highly similar to results from the Colorado Adoption Project (Wadsworth et al., 2006) in which data from related and unrelated sibling pairs were analyzed, and 86% of the phenotypic correlation between reading performance at ages 7 and 16 was accounted for by genetic influences. These remarkably similar results across different studies with different types of subjects, ages, measures and analytical methods suggest that this is a highly robust finding.

The extended DF model for analyzing data from both twins and siblings also provides a test for “special twin environments” (e.g., Koeppen-Schomerus et al., 2003; Medland et al., 2003; Van Grootheest et al., 2007; Young et al., 2006). In fact, B3 provides a direct estimate of the difference between shared environmental influences for members of twin pairs versus those for less contemporaneous twin/co-sibling pairs. Moreover, its magnitude and significance indicate whether the estimate of h2g should be based upon an analysis of the combined twin and co-sibling data. If B3 is small and not significant, S may be dropped from the extended model, and Equation 1 fitted to the combined data from twins and siblings. However, if B3 is significant or relatively large, h2g should be estimated from fitting Equation 2 to the data set. In the current study, no significant differences were found between twin and sibling shared environmental influences. However, the parameter estimate of .14 for differential c2g in the univariate model is not trivial. Consequently, when the B3 term was dropped from the extended twin/sib model, the resulting estimate of h2g was inflated. Thus, results of fitting the more parsimonious model (Equation 1) to the combined twin and sibling data are presented for illustrative purposes only.

Finally, although our findings suggest that reading deficits are highly stable and that this stability is due principally to genetic influences, this should not deter our best efforts with regard to environmental intervention and remediation. There are multiple pathways to poor reading (Gilger & Kaplan, 2001), and the relative contributions of genetic and environmental influences may differ depending on the measure (Gayán & Olson, 2001). Reading disabilities do not generally segregate in families in a simple Mendelian fashion; thus, multiple genetic and environmental factors almost certainly influence reading abilities and may also interact (e.g., Friend, DeFries, & Olson, 2008). Nevertheless, our results suggest that intensive remediation efforts may be needed to compensate for genetic and other biological constraints on learning rates (Olson, et al., in press, a). Therefore, in order to identify and treat at risk children more effectively and account for their individual abilities and disabilities, additional longitudinal studies of the genetic and environmental etiologies of reading disability are clearly warranted (Olson, et al., in press, b).

Acknowledgments

The Colorado Learning disabilities Research Center is supported by The Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) grant HD027802. The Longitudinal Twin Study of Reading Disability is supported by grant DC005190 from the National Institute on Deafness and other Communication Disorders (NIDCD). During the preparation of this manuscript, Raven L. Astrom was supported by institutional training Grant HD007829. The continued cooperation of the many families participating in the CLDRC and LTSRD, as well as the work of the staff members of these projects is gratefully acknowledged.

References

  1. Alarcón M, DeFries JC, Fulker DW. Etiology of individual differences in reading performance: A test of sex-limitation. Behavior Genetics. 1995;25:17–23. doi: 10.1007/BF02197238. [DOI] [PubMed] [Google Scholar]
  2. Astrom R, Wadsworth SJ, DeFries JC. Etiology of the stability of reading difficulties: The Longitudinal Twin Study of Reading Disabilities. Twin Res & Hum Gen. 2007;10:434–439. doi: 10.1375/twin.10.3.434. [DOI] [PubMed] [Google Scholar]
  3. Bast J, Reitsma P. Analyzing the development of individual differences in terms of Matthew effects in reading: Results from a Dutch longitudinal study. Developmental Psychology. 1998;34:1373–1399. doi: 10.1037//0012-1649.34.6.1373. [DOI] [PubMed] [Google Scholar]
  4. Byrne B, Samuelsson S, Wadsworth SJ, Hulslander J, Corley R, DeFries JC, Quain P, Willcutt EG, Olson RK. Longitudinal twin study of early literacy development: Preschool through Grade 1. Reading and Writing. 2007;20:77–102. [Google Scholar]
  5. DeFries JC. Colorado Reading Project. In: Gray DB, Kavanagh JF, editors. Biobehavioral measures of dyslexia. Parkton, MD: York Press; 1985. pp. 107–122. [Google Scholar]
  6. DeFries JC. Colorado Reading Project: Longitudinal analysis. Annals of Dyslexia. 1988;38:120–130. doi: 10.1007/BF02648251. [DOI] [PubMed] [Google Scholar]
  7. DeFries JC, Alarcón M. Genetics of specific reading disability. Mental Retardation and Developmental Disabilities Research Reviews. 1996;2:39–47. [Google Scholar]
  8. DeFries JC, Baker LA. Parental contributions to longitudinal stability of cognitive measure in the Colorado Family Reading Study. Child Development. 1983;54:388–395. [PubMed] [Google Scholar]
  9. DeFries JC, Filipek PA, Fulker DW, Olson RK, Pennington BF, Smith SD, Wise BW. Colorado Learning Disabilities Research Center. Learning Disabilities: A Multidisciplinary Journal. 1997;8:7–19. [Google Scholar]
  10. DeFries JC, Fulker DW. Multiple regression analysis of twin data. Behavior Genetics. 1985;15:467–473. doi: 10.1007/BF01066239. [DOI] [PubMed] [Google Scholar]
  11. DeFries JC, Fulker DW. Multiple regression analysis of twin data: Etiology of deviant scores versus individual differences. Acta Geneticae Medicae et Gemellologiae: Twin Research. 1988;37:205–216. doi: 10.1017/s0001566000003810. [DOI] [PubMed] [Google Scholar]
  12. DeFries JC, Gillis JJ. Etiology of reading deficits in learning disabilities: Quantitative genetic analysis. In: Obrzut JE, Hynd GW, editors. Neuropsychological foundations of learning disabilities: A handbook of issues, methods and practice. Orlando, Fl: Academic Press; 1991. pp. 29–47. [Google Scholar]
  13. Dunn LM, Markwardt FC. Examiner’s manual: Peabody Individual Achievement Test. Circle Pines, MN: American Guidance Service; 1970. [Google Scholar]
  14. Friend A, DeFries JC, Olson RK. Parental education moderates genetic influences on reading disabilities. Psychological Science. 2008;11:1124–1130. doi: 10.1111/j.1467-9280.2008.02213.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gayán J, Olson RK. Genetic and environmental influences on orthographic and phonological skills in children with reading disabilities. Developmental Neuropsychology. 2001;20:483–507. doi: 10.1207/S15326942DN2002_3. [DOI] [PubMed] [Google Scholar]
  16. Gilger JW, Kaplan BJ. Atypical brain development: A conceptual framework for understanding learning disabilities. Developmental Neuropsychology. 2001;20:465–481. doi: 10.1207/S15326942DN2002_2. [DOI] [PubMed] [Google Scholar]
  17. Harlaar N, Dale PS, Plomin R. From learning to read to reading to learn: Substantial and stable genetic influence. Child Development. 2007;78:116–131. doi: 10.1111/j.1467-8624.2007.00988.x. [DOI] [PubMed] [Google Scholar]
  18. Harlaar N, Spinath FM, Dale PS, Plomin R. Genetic influences on early word recognition, abilities and disabilities: a study of 7-year-old twins. Journal of Child Psychology and Psychiatry. 2005;46:373–384. doi: 10.1111/j.1469-7610.2004.00358.x. [DOI] [PubMed] [Google Scholar]
  19. Hulslander J, Olson RK, Willcutt EG, Wadsworth SJ. Longitudinal stability of reading-related skills and their prediction of reading development. Scientific Studies of Reading. 2010;14:111–136. doi: 10.1080/10888431003604058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Koeppen-Schomerus G, Spinath FM, Plomin R. Twins and non-twin siblings: different estimates of shared environmental influence in early childhood. Twin Research. 2003;6:97–105. doi: 10.1375/136905203321536227. [DOI] [PubMed] [Google Scholar]
  21. Light JG, DeFries JC. Comorbidity of reading and mathematics disabilities: Genetic and environmental etiologies. Journal of Learning Disabilities. 1995;28:96–106. doi: 10.1177/002221949502800204. [DOI] [PubMed] [Google Scholar]
  22. Medland SE, Wright MJ, Geffen GM, Hay DA, Levy F, Martin NG, Duffy DL. Special twin environments, genetic influences and their effects on the handedness of twins and their siblings. Twin Research. 2003;6:119–130. doi: 10.1375/136905203321536245. [DOI] [PubMed] [Google Scholar]
  23. Nichols RC, Bilbro WC., Jr The diagnosis of twin zygosity. Acta Geneticae Medicae et Gemellologiae. 1966;16:265–275. doi: 10.1159/000151973. [DOI] [PubMed] [Google Scholar]
  24. Olson R, Hulslander J, Christopher M, Keenan J, Wadsworth SJ, Willcutt EG, Pennington B, DeFries JC. Genetic and environmental influences on writing and their relations of language and reading. Annals of dyslexia. doi: 10.1007/s11881-011-0055-z. (in press, a) [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Olson R, Keenan J, Byrne B, Samuelsson S, Coventry W, Corley R, Wadsworth S, DeFries J, Pennington B, Hulslander J. Genetic and environmental influences on vocabulary and reading development from pre-kindergarten through grade 4. Scientific Studies of Reading. doi: 10.1007/s11145-006-9018-x. (in press, b) [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Petrill SA, Deater-Deckard K, Thompson LA, Schatschneider C, Dethorne LS, Vandenbergh DJ. Longitudinal genetic analysis of early reading. The Western Reserve Reading Project. Reading and Writing. 2007;20:127–146. doi: 10.1007/s11145-006-9021-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Shaywitz SE, Escobar MD, Shaywitz BA, Fletcher JM, Makuch R. Evidence that dyslexia may represent the lower tail of a normal distribution of reading ability. New England Journal of Medicine. 1992;326:144–150. doi: 10.1056/NEJM199201163260301. [DOI] [PubMed] [Google Scholar]
  28. Stevenson J, Graham P, Fredman G, McLoughlin V. A twin study of genetic influences on reading and spelling ability and disability. Journal of Child Psychology and Psychiatry. 1987;28:229–247. doi: 10.1111/j.1469-7610.1987.tb00207.x. [DOI] [PubMed] [Google Scholar]
  29. Van Grootheest DS, Cath DC, Beekman AT, Boomsma DI. Genetic and environmental influences on obsessive-compulsive symptoms in adults: a population-based twin-family study. Psychological Medicine. 2007;37:1635–1644. doi: 10.1017/S0033291707000980. [DOI] [PubMed] [Google Scholar]
  30. Wadsworth SJ, Corley RP, Plomin R, Hewitt JK, DeFries JC. Genetic and environment influences on continuity and change in reading achievement in the Colorado Adoption Project. In: Huston AC, Ripke MN, editors. Developmental contexts in middle childhood. New York: Cambridge University Press; 2006. [Google Scholar]
  31. Wadsworth SJ, DeFries JC, Olson RK, Willcutt EG. The Colorado Longitudinal Twin Study of Reading Disability. Annals of Dyslexia. 2007;57:139–160. doi: 10.1007/s11881-007-0009-7. [DOI] [PubMed] [Google Scholar]
  32. Wagner RK, Torgeson KK, Rashotte CA, Hecht SA, Barker TA, Burgess SR, Donahue J, Garon T. Changing relations between phonological processing abilities and word-level reading as children develop from beginning to skilled readers: A 5-year longitudinal study. Developmental Psychology. 1997;33:468–479. doi: 10.1037//0012-1649.33.3.468. [DOI] [PubMed] [Google Scholar]
  33. Wechsler D. Examiners’ manual: Wechsler Intelligence Scale for Children -Revised. New York: The Psychological Corporation; 1974. [Google Scholar]
  34. Wechsler D. Examiners’ manual: Wechsler Adult Intelligence Scale - Revised. New York: The Psychological Corporation; 1981. [Google Scholar]
  35. Young SE, Rhee SH, Stallings MC, Corley RP, Hewitt JK. Genetic and environmental vulnerabilities underlying adolescent substance use and problem use: general or specific? Behavior Genetics. 2006;36:603–615. doi: 10.1007/s10519-006-9066-7. [DOI] [PubMed] [Google Scholar]
  36. Zieleniewski AM, Fulker DW, DeFries JC, LaBuda MC. Multiple regression analysis of twin and sibling data. Personality of Individual Differences. 1987;8:787–791. [Google Scholar]

RESOURCES