Abstract
Schooling generally is positively associated with better health-related outcomes—for example, less hospitalization and later mortality—but these associations do not measure whether schooling causes better health-related outcomes. Schooling may in part be a proxy for unobserved endowments—including family background and genetics—that both are correlated with schooling and have direct causal effects on these outcomes. This study addresses the schooling-health-gradient issue with twins methodology, using rich data from the Danish Twin Registry linked to population-based registries to minimize random and systematic measurement error biases. We find strong, significantly negative associations between schooling and hospitalization and mortality, but generally no causal effects of schooling.
Keywords: Health-schooling gradients, Schooling, Mortality, Hospitalization, Twins
Introduction
Schooling differences are a major component of socioeconomic differences. Schooling also generally is positively associated with better health-related behaviors and better health/mortality outcomes. Those with more schooling tend to have better physical and mental health and to have longer lives, and these results are often interpreted to mean that increasing schooling improves health and that there is an important schooling-health gradient that disadvantages those with less schooling.1 However, these associations do not measure by how much schooling causes better health-related behaviors and better health/mortality outcomes. Schooling may in part be a proxy for unobserved endowments—including family background and genetics—that have direct causal effects, additional to any effects through schooling, on health behaviors and health/mortality outcomes.
This study’s principal goals are to describe the associations between schooling and (a) hospitalization in 1980–2002 and (b) mortality by 2003 for the 1921–1950 birth cohorts in Denmark, and to investigate for the same cohorts the causal impact of schooling on health and mortality, net of endowments, using rich data on twins from the Danish Twin Registry that are linked to Danish population-level registries. We explore four central questions:
What are the cross-sectional associations between schooling attainment and hospitalization and mortality for Danish adults?
What are the estimated causal impacts of schooling on hospitalization and mortality for these individuals when the monozygotic (MZ) or identical twin design is used to control for unobserved family endowments?
To what extent do these within–MZ twins estimated impacts differ from what cross-sectional associations suggest?
To what extent do within–MZ and within–DZ (dyzygotic, or fraternal) twin estimates differ, as they will if individual-specific endowments (as contrasted with common family endowments) are important?
The reduced-form relations that we investigate in this study may change with market conditions, public policies, aggregate economic conditions, other secular trends, and life-cycle stage. Because of this, for all these questions, we consider the 1921–1935 and 1936–1950 birth cohorts separately and test whether they can be combined in the same estimates for the 1921–1950 birth cohorts. Because there may be gender differences in such relations, we also estimate the relations for females and males separately and test whether they can be combined. The linked twins data, conditional on the assumptions necessary for the within-twins estimates to be consistent, permit us to answer these questions; the data thereby significantly increase our understanding of causal effects of schooling on health-related behaviors and mortality outcomes, net of endowments, and therefore on what underlies the schooling-health gradient.
The contribution of this article rests on using within-twins estimators with unusually rich data. We therefore first describe the methodology, previous related studies, and the data; and then we turn to the estimates. The estimates question, in brief summary, the standard inference that the strong negative association between schooling and outcomes such as hospitalization and mortality reflects important direct causal effects of schooling. Instead, schooling seems primarily to serve as a marker for parental family and individual-specific endowments that are uncontrolled in the usual estimates. This raises questions about the usual attribution to schooling of substantial positive effects on health-related behaviors and outcomes. In terms of causal effects, despite the strong associations with schooling, the real stratification appears to be with regard to parental family and individual-specific background endowments.
Methodology
We are interested in simple reduced-form relations between schooling and health-related behaviors and outcomes because we are interested in schooling’s total or gross associations with, and effects on, health-related behaviors and outcomes. This approach does not permit us to identify the underlying structural channels through which schooling may be operating, and indeed, our data do not permit the identification of those channels. Still, understanding better the gross associations with and effects of schooling on these outcomes is valuable in itself, as noted in the introduction.
We do not have very strong prior assumptions about even the sign of the schooling effects in these reduced forms, particularly for hospitalization. More schooling, for example, may have countervailing effects on hospitalization if it reduces the probability of negative health conditions that might lead to hospitalization yet increases the probability of hospitalization (e.g., through greater income, more knowledge, or better connections) for given health conditions. Likewise, more schooling may have countervailing effects on mortality if it increases income and access to health-system care but also increases higher-risk behaviors and selection into more stressful occupations. Of course, which opposing effect dominates depends on preferences, resources, technologies, markets, and policies that prevail in the context being studied. Schooling effects through increasing access to health services through higher income, for example, a priori would seem to be less important in the context of the Danish health care system, with its broad coverage and access, than in a health-care system such as that in the United States, in which many individuals are not covered, particularly prior to the recent reforms, by health insurance.
Consider the following linear representation of a reduced-form equation relating adult health-related behaviors or health/mortality outcomes Hij for the ith member of paternal family j to his or her schooling Sij and to three sets of unobserved variables representing (1) endowments hj that are common among all children of the paternal family j (e.g., exogenous features of the paternal family environment in childhood, including prices, family income, parents’ human capital characteristics, average genetic endowments among siblings, local schooling, and health-related options); (2) the component of endowments or “innate health” specific to child i in j, represented by aij (e.g., individual-specific deviations from average family genetic endowments); and (3) a random health shock specific to i in j, inclusive of measurement errors in health, represented by vij2:
(1) |
where βS is the effect of schooling. Sij is itself a function of (usually largely unobserved) variables that pertain to the paternal family and to the individual children in the paternal family3:
(2) |
where αh is the effect of the family-specific endowment hj on child schooling investment, αa is the effect of the individual-specific endowment aij of child i on schooling investment in that child, αs is the effect of the individual-specific endowment akj of sib k (which is a vector if there are multiple siblings) on schooling investment in child i,4 and uij is a disturbance that affects Sij but not Hij except indirectly through Sij (a critical assumption for the within-MZ estimators below to yield consistent estimates, as we discuss further below with reference to the Bound and Solon (1999) critique of twins studies. The stochastic term, for example, might reflect the chance event of whether one twin was assigned a more inspiring teacher than the other or happened to encounter more inspiring role models than the other. As is well known, the parameter βS is not identified in Eqs. 1 and 2 if αa or αh is not zero. βS is estimated with bias if Eq. 1 is estimated across individuals with different values of hj and aij. The regression coefficient for an ordinary least squares (OLS) estimate of relation (1) is, cov(Hij, Sij) / σ2(Sij) = [βS(σ2(Sij) + cov(Sij, hj) + cov(Sij, aij)] / σ2(Sij), which is a biased estimate of βS unless cov(Sij, hj) + cov(Sij, aij) = 0. Thus, the cross-sectional estimate of the association between schooling and health is generally a biased estimate of the causal impact of schooling on health because schooling is partially a proxy for genetic, family background, and other endowments in such estimates.
Within–MZ Twins Estimators
With no further assumptions, it is clear that βS (the health impact of schooling) is not identified even if sibling-pair data are used to control in the estimation of βS for the covariant common components of the endowment and environment hj and sj. This is because of the existence of the specific component of the endowments aij. As long as families or individuals respond to individual-specific differences in endowments, and such differences are important, then sibling estimators may not be very useful.5 In recognition of this problem, researchers (e.g., Ashenfelter and Krueger 1994; Ashenfelter and Rouse 1998; Behrman et al. 1980, 1994, 1996; Behrman and Rosenzweig 1999; Behrman and Taubman 1976; Bonjour et al. 2003; Miller et al. 1995, 1997) have employed samples of pairs of MZ twins, between whom endowment differences at conception are as minimal as possible,6 to identify βS in estimates of models parallel to Eqs. 1 and 2 with ln(earnings) as the dependent variable of interest (and in Behrman and Rosenzweig (2002, 2005), with child schooling as the dependent variable). To our knowledge, only one published study used this approach to explore the relation between schooling and health (Fujiwara and Kawachi 2009; see the section Previous Related Studies). Equations 1 and 2 can be rewritten for MZ twins:
(1A) |
(2A) |
where the superscript M refers to MZ twins. Relations parallel to Eqs. 1A and 2A can be written for the kth MZ twin in the jth family. Within–MZ twin estimators are obtained by subtracting such relations from Eqs. 1A and 2A. With a within–MZ twin estimator, all unobserved endowment components in Eqs. 1A and 2A are eliminated.
These within-MZ estimators can be used to identify the true reduced-form impact of schooling on health-related behaviors and health/mortality outcomes under the assumptions of this section for the own-health behavioral and outcome variables in the data that are introduced in the Data section. As noted, comparisons between the within-MZ estimates and OLS estimates of relation (1) may be made for the same health-related behaviors and outcomes to learn the extent to which the estimates of the impact of schooling on health βS are biased in standardly used cross-sectional estimates because of the failure to control for unobserved endowments hj and aij, which affect health and which are correlated with S because they partly determine schooling in relation (2). Comparisons also can be made of the within-MZ estimates for females versus males and across birth cohorts to check for significant gender or cohort effects in the reduced-form impact of schooling on health-related behaviors and health/mortality outcomes. Further, comparisons can be made between within-MZ and within-DZ twins to assess whether the unobserved individual-specific endowments aij are important; if so, then within-sibling estimates that control only for common family endowments hj are misleading because they still include aij − aik, which is correlated with Sij − Sik, in the disturbance term.
We now consider the key assumption, which has been questioned by Bound and Solon (1999) and others, that is necessary for the twins estimates to produce unbiased estimates: the disturbance terms in the health and schooling relations are uncorrelated (cov(uij, vij) = 0).7 Suppose that this assumption does not hold because there is some unobserved individual twin-specific factor (persistent shock) cij that directly affects both the schooling and the health of twin i and that differs from any individual twin-specific factor (persistent shock) ckj that affects both the schooling and the health of twin k in the same family j, so that Eqs. 1 and 2 are replaced by Eqs. 1B and 2B:
(1B) |
(2B) |
If γc is positive, then the impact of the persistent shock on schooling is in the same direction as the impact on health (and vice versa if γc is negative). As noted earlier with regard to the endowment effects, the own effects are plausibly greater than the cross effects, so αa > akj and |γc| > |γk|. Equations 1B and 2B can be rewritten for MZ twins to incorporate these persistent shocks:
(1C) |
(2C) |
Again, relations parallel to Eqs. 1C and 2C can be written for the kth MZ twin in the jth family. Within–MZ twin estimators are obtained by subtracting such relations from Eqs. 1C and 2C. With a within–MZ twin estimator, the unobserved endowment components hjM in Eqs. 1B and 2B again are eliminated. In this case, though, the difference in the unobserved twin-specific persistent shocks remains:
(1D) |
(2D) |
where Δ is the difference between the kth and ith MZ twin in the jth family. Estimating Eq. 1D, therefore, does not give an unbiased estimate of βS because ΔSijM and ΔcijM are correlated (see Eq. 2D); and ΔcijM is, by assumption, unobserved. The sign of this bias is determined by the sign of the correlation, which is the sign of (γc − γk). This sign is positive (negative) if the impact of the shock on schooling is in the same (opposite) direction as the impact of the shock on health; the estimate of βS from Eq. 1D, then, is an overestimate (underestimate) of the true value of βS. For example, if more favorable in utero environments owing to proximity to the placenta increase both schooling and health beyond any effect through schooling, as suggested by the results in Behrman and Rosenzweig (2004), the estimate of βS from Eq. 1D is an overestimate of the true value of βS8: likewise, if an accident or illness limits schooling and has persistent negative effects on later health. Also note that if βS truly is positive, but the estimate of βS from Eq. 1D is not significant as we find (see Estimates section), and this is because of the failure to control for ΔcijM in the estimates, that means that γc must be negative so that the unobserved factor cij, which positively affects long-run health, has negative effects on schooling (or vice versa).
Previous Related Studies
Most of the previous literature has estimated relations, such as in Eq. 1 with OLS (or logit or probit), but as noted earlier, βS (the health impact of schooling) is not identified in such estimates. However, a relatively small number of studies have used alternative estimators to try to identify the causal impact of schooling on health (for reviews of these studies, see Cutler et al. 2006; Cutler and Lleras-Muney 2008; and Grossman 2006).
Almost all these studies have used instrumental variable (IV) estimators. Selected examples are summarized here.9 In a relatively early study, Berger and Leigh (1989) used per capita income and per capita expenditures on education in the state of birth as instruments for schooling in their study of schooling effects on health in the United States. They found that with the IV estimates, schooling effects were slightly reduced but remained significant. Their instruments, however, may have been related to expenditures on health, which, if health expenditures affect health, raises questions about the validity of instruments. Most of the more recent studies used “natural experiments”—often school reforms, school age regulations, and labor market dimensions—as instruments for schooling. Lleras-Muney (2005), for example, used changes in compulsory school attendance and child labor laws from 30 states in the United States from 1915 to 1939 to identify schooling effects on mortality. She used grouped census data as well as individual data from 1985. Her results from the grouped data suggested that one additional grade of schooling lowered the 10-year death rate by 3.6%, but her estimated schooling effects from individual data were not significant. Adams (2002) adapted the identification strategy suggested by Angrist and Krueger (1991), using quarter of birth in a U.S. cross-section of individuals from 1992. Adams’ dependent variables were self-reported health and a number of variables describing functional limitations as health measures. When IV estimation was used, schooling effects increased slightly but were insignificant. Furthermore, F values on the instruments were just slightly more than 1, indicating a problem of weak instruments. Arkes (2003) estimated schooling effects on work-limiting health problems, need for personal care, and mobility limitations, using a large U.S. sample of white males aged 47–56. Arkes used within-state differences in unemployment rates as instruments for schooling. He found that schooling effects increased somewhat, allowing for endogeneity, and were significant for two of the three health measures. Spasojevic (2003) used changes in compulsory schooling in Sweden in the 1950s as instruments to analyze schooling effects on an index of bad health and on body mass index (BMI) in a healthy range over the period 1981–1991. She found positive significant effects (although only when using one-tailed tests) from completed years of schooling on both health indicators. Arendt (2005) used schooling reforms in Denmark in 1958 and 1975 as instruments but did not find any significant effects of instrumented schooling on health.
As indicated earlier, many of these IV studies seem to have suffered from problems of having weak instruments. Also, these IV estimates tended to be LATE (local average treatment effects) estimates (see Angrist et al. 1996; and Lundborg 2008), and the estimated schooling effects are therefore relevant for individuals who are at the margin of being affected by the instruments used (e.g., at the margin of unemployment or compulsory schooling levels). In heterogeneous populations, these LATE estimators can differ substantially from the more generally relevant ATE (average treatment effects) estimators that measure the average causal effect of schooling on health outcomes.
Behrman and Wolfe (1989) used adult sisters (sibling) estimators with Nicaraguan data and found that significant inverse associations of their schooling with three of four disease categories for adult women in the level estimates became much less precisely estimated and no longer significant at the standard .05 level with within-sibling estimates. As noted earlier, within-sibling estimates can identify βS under the assumption that cov(u, v) equals zero and that the covariance between the individual-specific component of endowments and schooling is zero. The latter seems a strong assumption, and was not supported by the Behrman et al. (1994) finding of significant impacts of individual-specific endowments on schooling and earnings using MZ and DZ twins (the former to estimate unbiased coefficients and the latter, conditional on those coefficients, to investigate individual-specific endowment effects). Similarly, Fletcher and Lehrer (2009), using a subset of the AddHealth data for which DNA markers are available, documented that academic achievement among DZ twins was partially related to unobserved genetic differences among DZ twins, thus further questioning the suitability of sibling estimators to obtain causal effects of schooling on health outcomes later in life.
To our knowledge, the only published study that applied within–MZ twins estimators to this problem is that by Fujiwara and Kawachi (2009). They used 702 MZ twins from the 1995 Midlife in the United States (MIDUS) survey with a continuous measure of schooling attainment. They reported generally no significant causal effects of schooling on seven health outcomes and 10 health behaviors in within-MZ estimates when males and females were combined. (An exception was for perceived global health if male and female samples were combined, but not in separate estimates for females and males.)10 Their coefficient estimates did not change substantially between the two sets of estimates, but their significance did. In an as-yet unpublished study, Lundborg (2008) used a sample of 694 MZ twins from the same MIDUS survey with a nonlinear representation of schooling by including dichotomous variables for high school, some college, and college degrees.11 His within-MZ results suggested a causal impact of self-reported schooling levels on health, with a significant positive impact on self-reported health and negative impacts on the number of chronic conditions.12 He found no evidence of bias owing to the unobserved endowments that twins share in common; in fact, his within–MZ twins estimates of schooling impacts were larger for self-reported health than were his OLS estimates even though he was not able to control for measurement error in self-reports of schooling, which in itself was likely to bias his coefficient estimates toward zero (see the end of the Data section). He noted that within–MZ twins estimates are likely to be closer to ATE than are IV estimates identified by factors like school reforms, such as those discussed earlier, that are likely to be LATE estimators. The Fujiwara and Kawachi (2009) and Lundborg (2008) studies appear to differ importantly only in their representation of schooling as a continuous variable in the former case and a nonlinear variable in the latter case. We therefore explore both options in the Estimates section.
Data
Our analyses of schooling differences and health-related behaviors and outcomes use a rich new longitudinal data set obtained through a link between the Danish Twin Registry13 with various population-based registers at Statistics Denmark. This link provides rich longitudinal information for schooling and health-related behaviors and outcomes for all Danish twins born between 1921 and 195014 that have been identified in the Danish Twin Registry. Of particular relevance for this study is the link to the Danish National Hospital Register compiled by the National Board of Health from information provided by local health authorities for all discharges from somatic hospital overnight stays. The Danish National Hospital Registry, established in 1977, includes information on about 99.4% of all discharges from nonpsychiatric hospitals in Denmark. Recorded information includes the Civil Personal Registration number (CPR) and the dates of admission and discharge (Andersen et al. 1999). In addition to the twins, the data set that we use includes a 5% same-age random sample of the Danish population. In order to assure confidentiality, access to these data is limited to a dedicated computer at the Danish Twin Registry at the University of Southern Denmark that is linked to limited-access computers at Statistics Denmark.
The data set used for our analysis encompasses 87,773 singletons in the 5% sample, 5,294 MZ twins in complete pairs, and 11,234 DZ twins in complete pairs (Tables 1 and 2). For the 5% sample, the 1936–1950 birth cohort includes about one-quarter more individuals than the 1921–1950 cohort; this reflects overall population growth between these two cohorts. For the MZ and DZ complete-pair twins samples, the 1936–1950 birth cohort includes about one-half more individuals than the 1921–1950 cohort, probably reflecting that a twin pair is eliminated if one or both twins migrated before 2003 or died before 1980—and the older birth cohort had more exposure to both migration and mortality (at older ages for mortality)—in addition to the overall population growth between these two cohorts.15 The male/female ratio is close to 1 for both cohorts (0.99, 1.02), but is a fair amount higher for both MZ and DZ twins for the 1936–1950 cohort than for the 1921–1935 cohort, with a surprisingly high value for the younger cohort for DZ twins (1.24). This apparently reflects that the recruitment of twins in 1943–1968 was partly conducted using conscription boards that focused more on men (see Kyvik et al. 1996).
Table 1.
Descriptive statistics
Cohort and Gender | Variables and Sample Sizes | 5% Sample
|
MZ Twins
|
DZ Twins
|
|||
---|---|---|---|---|---|---|---|
Mean | SD | Mean | SD | Mean | SD | ||
1921–1935, Males | Schooling (years) | 10.4 | 3.5 | 10.6 | 3.6 | 10.0 | 3.5 |
Hospital days/exposure | 3.2 | 8.3 | 3.5 | 14.8 | 3.2 | 10.9 | |
HD/Exp < 2y death | 1.8 | 4.0 | 1.7 | 3.7 | 1.8 | 5.0 | |
Mortality by 2003 | 0.44 | 0.50 | 0.35 | 0.48 | 0.41 | 0.49 | |
N | 19,373 | 970 | 2,140 | ||||
N for HD/Exp < 2y death | 19,016 | 956 | 2,099 | ||||
1921–1935, Females | Schooling (years) | 9.2 | 3.0 | 9.2 | 3.1 | 9.0 | 3.0 |
Hospital days/exposure | 3.0 | 8.2 | 2.7 | 7.2 | 3.0 | 10.7 | |
HD/Exp < 2y death | 1.8 | 4.4 | 1.6 | 3.6 | 1.6 | 3.4 | |
Mortality by 2003 | 0.30 | 0.46 | 0.24 | 0.43 | 0.28 | 0.45 | |
N | 19,525 | 1,092 | 2,184 | ||||
N for HD/Exp < 2y death | 19,305 | 1,082 | 2,156 | ||||
1936–1950, Males | Schooling (years) | 11.7 | 3.4 | 11.8 | 3.4 | 11.3 | 3.5 |
Hospital days/exposure | 1.0 | 3.7 | 0.9 | 3.3 | 1.0 | 3.9 | |
HD/Exp < 2y death | 0.7 | 2.5 | 0.7 | 2.6 | 0.8 | 3.1 | |
Mortality by 2003 | 0.11 | 0.32 | 0.09 | 0.29 | 0.12 | 0.33 | |
N | 24,649 | 1,664 | 3,828 | ||||
N for HD/Exp < 2y death | 24,546 | 1,660 | 3,802 | ||||
1936–1950, Females | Schooling (years) | 10.9 | 3.3 | 10.9 | 3.2 | 10.7 | 3.3 |
Hospital days/exposure | 1.1 | 4.2 | 1.0 | 2.6 | 1.0 | 2.6 | |
HD/Exp < 2y death | 0.8 | 2.1 | 0.8 | 2.0 | 0.8 | 1.8 | |
Mortality by 2003 | 0.08 | 0.27 | 0.07 | 0.25 | 0.07 | 0.25 | |
N | 24,226 | 1,568 | 3,082 | ||||
N for HD/Exp < 2y death | 24,174 | 1,566 | 3,075 | ||||
1921–1950, Males and Females | Schooling (years) | 10.6 | 3.4 | 10.8 | 3.4 | 10.4 | 3.4 |
Hospital days/exposure | 1.9 | 6.3 | 1.8 | 7.6 | 1.8 | 7.3 | |
HD/Exp < 2y death | 1.2 | 3.3 | 1.1 | 2.9 | 1.1 | 3.4 | |
Mortality by 2003 | 0.22 | 0.41 | 0.16 | 0.37 | 0.19 | 0.39 | |
N | 87,773 | 5,294 | 11,234 | ||||
N for HD/Exp < 2y death | 87,041 | 5,264 | 11,132 |
Notes: Hospital days/exposure refers to the 1980–2002 period. HD/Exp < 2y death is hospital days/exposure up through 2000 or up to two years before death if died prior to 2003.
Source: Danish Twin Registry linked with population registry data, as described in text.
Table 2.
Summary measures of sample composition in Table 1
5% Sample | MZ Twins | DZ Twins | |
---|---|---|---|
Total N | 87,773 | 5,294 | 11,234 |
1936–1950/1921–1935 | 1.26 | 1.57 | 1.60 |
Males/Females 1921–1935 | 0.99 | 0.89 | 0.98 |
Males/Females 1936–1950 | 1.02 | 1.06 | 1.24 |
In this article, we consider three health-related outcomes. First, we measure the number of days hospitalized per year of exposure (with exposure terminated by mortality) during 1980–2002. Although previous studies have used such measures for much shorter periods of time or based on respondent recall, we are unaware of previous studies that have been able to use such registry-based data for more than two decades. The days hospitalized per year of exposure during these 23 years vary substantially across individuals within gender-cohort groups (Table 1). The means are about three times as high for the 1921–1935 birth cohort as for the 1936–1950 birth cohort (a little more for males, a little less for females) for the 5% sample and both twins samples, reflecting the increased hospitalization rates with age. In 1980, the members of the older cohort were 45–59 years old, and those of the younger cohort were 30–44 years old; in 2002, the members of the older cohort were 67–81 years old, and those of the younger cohort were 52–66 years old. For the older cohort, the mean days hospitalized for females is slightly smaller for all three samples than for males, but there are almost no differences for the younger cohort.16 For the older cohort, this pattern may reflect that females have higher life expectancies than males, but it contrasts with some studies that suggest that females nevertheless use health services more than males.
Our second health-related outcome is the number of days hospitalized per year of exposure during 1980–2000 up to two years before mortality or before the end of the sample in 2002.17 This measure is closely related to the previous one, but attempts to abstract from hospitalization related to imminent mortality by not including the last two years of life in order to obtain a measure of the extent of hospitalization prior to the onset of terminal conditions. Because substantial hospitalization is not uncommon in the two years prior to death, the means for this measure are only about three-fifths as large as those for the first measure above for the 1921–1935 birth cohort group; the difference is not quite as large for the 1936–1950 birth cohort group because of the lower prevalence of mortality before 2003. For both cohorts, the standard deviations for this variable are considerably smaller than for the previous one, implying that a substantial portion of the variance in the hospitalization per year over the life cycle is due to large variation in the two years before death.
Third, we measure whether an individual died prior to 2003. Again, our mortality data are based on registry data, not on recall data of relatives or others (which probably is important particularly for reducing measurement error in calculating the relevant exposure periods for the health-related outcome variables described earlier). For the older, 1921–1935 birth cohort (who were 67–81 years old by 2002), about 40% of the males and 30% of the females died prior to 2003, with somewhat higher mortality rates for the 5% sample than for the twins samples, perhaps reflecting the selectivity with regard to survival of both twins to 1980, as noted earlier. For the younger, 1936–1950 birth cohort (who were 52–66 years old by 2002), about 11% of the males and 8% of the females died prior to 2003, with somewhat higher mortality rates again for the 5% sample than for the twins samples, except for male DZ twins pairs.
Our key explanatory variable is years of schooling attainment,18 as calculated from the highest vocational and academic training and the months/years of schooling associated with the highest attained degree. This information comes from administrative registers based on individual self-reports from the 1970 population census that were updated with educational institution reports following the 1973 establishment of the integrated student register, which covers about one-half of our sample (e.g., 48% of the MZ twins).19 That means that measurement error related to recall errors, which has played a major role in the twins (and siblings more generally) literature because of the exacerbation of the noise-to-signal ratio in within-twins (siblings) estimates (e.g., Ashenfelter and Krueger 1994; Behrman et al. 1994; Bishop 1976; Griliches 1979), is not likely to be as serious a problem for this study as for those studies with only recall data for their samples; we consider measurement error further later in this section. Years of schooling attainment in the 5% sample averaged 10.4 for males and 9.2 for females in the 1921–1935 cohort, and 11.7 for males and 10.9 for females in the 1936–1950 cohort. Thus, both cohorts had high levels of schooling, on average, in comparison with most countries (see UNESCO 2011) with some increase between the two periods (12% for males, 18% for females) and a reduction of the gender gap favoring males. The mean schooling grade attainment for MZ twins is about the same as for the 5% sample for each gender-birth cohort group, and that for DZ twins is a little lower. This may reflect a tendency, for these birth cohorts, for DZ twinning to be more common among lower-income families with more children.
As noted in the section Previous Related Studies, the question arises whether differences exist in schooling between twins over substantial ranges in the distribution of schooling so that ATE estimates over this range rather than LATE estimates over a much narrower range can be obtained. A detailed analysis of these schooling differences within MZ twin pairs is provided in Table 3. For the combined male and female MZ sample that is used for most of our analyses, the mean difference in years of schooling within twin pairs is 1.6, with a standard deviation (SD) of 2.5. The difference in the years of schooling is zero for about one-half of the twin pairs, but considerable variation exists in the other half. This variation in years of schooling occurs across the complete range of schooling attainment by the twins. To illustrate this pattern of schooling differences, Table 3 tabulates and summarizes the difference in years of schooling separately for all twin pairs in which at least one member attained one of the following broad educational categories20: (1) compulsory schooling/basic schooling; (2) upper secondary schooling/high school diploma; (3) (3–4 years of) vocational training; (4) short-cycle higher education; (5) medium-cycle higher education; (6) master’s and Ph.D. degrees; (7) unknown educational levels were replaced with the minimum required years of schooling (= 7 years for these cohorts). The fraction of twin pairs for which the within-twin pair difference in years of schooling equals zero is highest in the lowest educational category (category 1 in Table 3) at about 56%, and it is considerably below 50% in the remaining educational groups. This high fraction of twin pairs with zero schooling difference in the first category is due to the fact that in these cohorts, a relatively large number of twins completed the legal minimum of seven years of schooling (compulsory schooling only), resulting in a years-of-schooling difference of zero for about 56% of all twin pairs in which at least one twin has only compulsory/basic schooling. Nevertheless, in the remaining 44% of these pairs, the schooling difference is sizable, and the mean difference in the years of schooling in these pairs is 2.2 (SD = 2.9). The difference in the years of schooling within twin pairs continues to average between about 1.9–2.5 (with SDs ranging from 1.9 to 2.8) for twin pairs in which at least one twin has attained educational categories 2 through 6, and the mean and standard deviation are slightly higher among the small number of twin pairs in which at least one twin has unknown education.
Table 3.
Difference in years of schooling within twin pairs (MZ twins only), by education levels
Gender, Differences | Twin Pairs in Which at Least One Twin Has Education Category (%)
|
Total | ||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
Males | ||||||||
0 | 52.3 | 15.2 | 29.5 | 20.3 | 25.6 | 33.0 | 32.1 | 49.0 |
0 < diff. ≤ 1 | 2.3 | 15.2 | 25.3 | 43.5 | 25.0 | 10.6 | 0.0 | 17.9 |
1 < diff. ≤ 2 | 2.2 | 24.2 | 6.6 | 17.4 | 18.3 | 16.0 | 3.6 | 6.1 |
2 < diff. ≤ 3 | 4.0 | 6.1 | 4.9 | 4.3 | 11.0 | 6.4 | 3.6 | 3.8 |
3 < diff. ≤ 4 | 2.6 | 12.1 | 4.1 | 4.3 | 3.7 | 16.0 | 0.0 | 2.9 |
4 < diff. ≤ 5 | 4.6 | 18.2 | 3.5 | 1.4 | 1.8 | 4.3 | 3.6 | 2.7 |
diff. > 5 | 32.0 | 9.1 | 26.1 | 8.7 | 14.6 | 13.8 | 57.1 | 17.6 |
Mean | 2.70 | 2.64 | 2.46 | 1.53 | 2.23 | 2.50 | 4.43 | 1.79 |
SD | 3.14 | 1.89 | 2.74 | 1.89 | 2.70 | 2.88 | 3.61 | 2.62 |
N | 650 | 33 | 712 | 69 | 164 | 94 | 28 | 1,293 |
Females | ||||||||
0 | 58.7 | 22.2 | 36.8 | 11.4 | 26.5 | 43.2 | 18.2 | 58.1 |
0 < diff. ≤ 1 | 4.9 | 22.2 | 12.5 | 31.8 | 15.3 | 18.9 | 13.6 | 10.2 |
1 < diff. ≤ 2 | 3.1 | 11.1 | 6.0 | 18.2 | 14.8 | 5.4 | 0.0 | 5.7 |
2 < diff. ≤ 3 | 5.4 | 5.6 | 9.1 | 9.1 | 12.2 | 0.0 | 9.1 | 5.3 |
3 < diff. ≤ 4 | 3.6 | 11.1 | 5.9 | 6.8 | 8.5 | 18.9 | 4.5 | 3.6 |
4 < diff. ≤ 5 | 11.3 | 27.8 | 16.2 | 2.3 | 5.8 | 5.4 | 4.5 | 7.9 |
diff. > 5 | 13.1 | 0.0 | 13.5 | 20.5 | 16.9 | 8.1 | 50.0 | 9.2 |
Mean | 1.84 | 2.35 | 2.37 | 2.45 | 2.69 | 1.98 | 4.46 | 1.50 |
SD | 2.55 | 1.93 | 2.38 | 2.27 | 2.75 | 2.59 | 3.37 | 2.28 |
N | 835 | 18 | 562 | 44 | 189 | 37 | 22 | 1,309 |
Males and Females Combined | ||||||||
0 | 55.9 | 17.6 | 32.7 | 16.8 | 26.1 | 35.9 | 26.0 | 53.6 |
0 < diff. ≤ 1 | 3.8 | 17.6 | 19.6 | 38.9 | 19.8 | 13.0 | 6.0 | 14.0 |
1 < diff. ≤ 2 | 2.7 | 19.6 | 6.4 | 17.7 | 16.4 | 13.0 | 2.0 | 5.9 |
2 < diff. ≤ 3 | 4.8 | 5.9 | 6.8 | 6.2 | 11.6 | 4.6 | 6.0 | 4.5 |
3 < diff. ≤ 4 | 3.2 | 11.8 | 4.9 | 5.3 | 6.2 | 16.8 | 2.0 | 3.3 |
4 < diff. ≤ 5 | 8.4 | 21.6 | 9.1 | 1.8 | 4.0 | 4.6 | 4.0 | 5.3 |
diff. > 5 | 21.3 | 5.9 | 20.6 | 13.3 | 15.9 | 12.2 | 54.0 | 13.4 |
Mean | 2.22 | 2.54 | 2.42 | 1.89 | 2.47 | 2.35 | 4.44 | 1.64 |
SD | 2.85 | 1.89 | 2.59 | 2.09 | 2.73 | 2.80 | 3.47 | 2.46 |
N | 1,485 | 51 | 1,274 | 113 | 353 | 131 | 50 | 2,602 |
Notes: Education categories are 1 = Compulsory schooling/Basic schooling; 2 = Upper secondary schooling/High school diploma; 3 = (3–4 years of) vocational training; 4 = Short-cycle higher education; 5 = Medium-cycle higher education; 6 = Master’s and PhD degrees; and 7 = Unknown (which, for the calculations in this table was replaced with the minimum required years of schooling, 7 years for these cohorts).
The clustering of small differences in schooling among twin pairs in which at least one twin has only compulsory education, and the fairly sizable within-twin pair variation for twin pairs in which at least one twin attained more than compulsory education, imply that our estimates are ATE for about the 70% of the sample with more than seven years of schooling; this contrasts with LATE estimates for schooling close to compulsory levels that would be obtained with IV estimates using as instruments increases in compulsory schooling levels.
A related issue in assessing the properties of within–MZ twin pair estimators pertains to measurement error in schooling because there may not be much information in the within-MZ schooling differences if measurement error is large. As is well-known, the bias toward zero owing to random measurement error generally is exacerbated in fixed-effects estimates, of which within-MZ estimates are one example. Assume that measured schooling (Sij′) is linearly related to true schooling (Sij) but is measured with random measurement error wij: Sij′ = Sij + wij. Bishop (1976) and Griliches (1979) showed that if measurement error is not correlated across siblings,21 the bias toward zero in bSw, the estimated within coefficient βS, is plim bSw = βS[1−σ2(wij)/(σ2(Sij)(1−ρS))], where ρS is the correlation in schooling between siblings (which is zero in standard individual estimates). This bias toward zero owing to measurement error is likely to be greater for within-DZ estimates than for individual estimates and for within-MZ estimates than for within-DZ estimates because ρS is likely to be positive and greater for MZ than for DZ twins.
Table 4 gives some illustrations, with each row representing a different noise-to-signal ratio (σ2(wij)/σ2(Sij)) as given in column 1; the percentage biases in individual, within-DZ, and within-MZ estimates owing to measurement error are in columns 2–4, and the ratios of the coefficients from DZ estimates and MZ estimates to individual estimates owing to measurement error are in columns 5 and 6. Other twins studies that have reports from other respondents (i.e., the other member of a twin pair, the twins’ adult children) so that they can estimate measurement error models found estimated noise-to-signal ratios of 0.06–0.12 (Ashenfelter and Krueger 1994; Ashenfelter and Rouse 1998; Behrman et al. 1994). Therefore, the row with a noise-to-signal ratio of about 0.10 in Table 4 is suggestive of the extent of biases from measurement error alone in these studies and of how they differ across the three types of estimates: 10% for individual estimates, 19% for within-DZ estimates, and 27% for within-MZ estimates. Thus, fairly substantial drops in the coefficient estimates for the within-DZ and within-MZ estimates occur because of measurement errors of this magnitude even if there are no biases from unobserved endowments. These measurement error biases would result in the coefficient estimates for the within-DZ and within-MZ estimates being, respectively, 9% and 19% smaller in absolute magnitude than those for the individual estimates (columns 5 and 6).
Table 4.
Implications of random measurement error for individual, within-DZ, and within-MZ estimates
Noise-to-Signal Ratio | Biases Toward Zero in Estimated βS (percentages)
|
Ratio of estimated βS Estimated Owing to Measurement Error Biases Alone
|
|||
---|---|---|---|---|---|
σ2(wij) / σ2(Sij) (1) | Individual (2) | Within-DZ (3) | Within-MZ (4) | Within-DZ/ Individual (5) | Within-MZ/ Individual (6) |
0.02 | 2 | 4 | 5 | 0.98 | 0.97 |
0.04 | 4 | 7 | 11 | 0.96 | 0.93 |
0.06 | 6 | 11 | 16 | 0.95 | 0.89 |
0.08 | 8 | 15 | 22 | 0.93 | 0.85 |
0.10 | 10 | 19 | 27 | 0.91 | 0.81 |
0.12 | 12 | 22 | 32 | 0.88 | 0.77 |
0.14 | 14 | 26 | 38 | 0.86 | 0.72 |
0.16 | 16 | 30 | 43 | 0.84 | 0.68 |
0.18 | 18 | 33 | 49 | 0.81 | 0.63 |
0.20 | 20 | 37 | 54 | 0.79 | 0.57 |
Notes: Based on the expression for measurement error in the text, with ρ = 0 for individuals, 0.46 for DZ twins, and 0.63 for MZ twins. (MZ and DZ correlations are estimated from the twins data used in this study).
We infer, however, that measurement error is likely to be lower for our study than for the studies mentioned in the previous paragraph for three reasons. First, the self-reported schooling data are from the 1970 census, when the respondents were of ages at which their recall probably was more accurate than if they had been asked to recall their schooling more than 30 years later in 2002 at the end of the period for our health data, as in data sets that start with older individuals. Second, for 48% of the MZ twins in the sample, the administrative data on schooling that we use presumably are measured with very low error. Third, for this same subsample, we also have self-reported data, so we can estimate the measurement error relation for this subsample: the estimated noise-to-signal ratio is 0.04. If we are conservative and assume that this noise-to-signal ratio applies to the whole sample (even though we have administrative data for almost one-half of the sample), then measurement error biases would result in the coefficient estimates for the within-DZ estimates being 4% and those for the within-MZ estimates being 7% smaller in absolute magnitude than those for the individual estimates (columns 5 and 6).
Estimates
To answer the questions posed in the introduction, we apply the methodology outlined earlier to the special data described in the Data section. Thereby, we attain alternative estimates of the gross reduced-form relations between years of schooling attainment that are ATE estimators for the approximately 70% of the sample with more than seven years of schooling and (1) hospitalization per year of exposure between 1980 and 2002 (or until mortality if mortality occurred prior to 2003), (2) hospitalization per year of exposure between 1980 and 2000 (or until two years prior to mortality if mortality occurred prior to 2003), and (3) mortality prior to 2003. As noted in the introduction, we initially estimated our analyses separately for men and women in each of two cohorts (1921–1935, 1936–1950). We then tested whether the analyses can be pooled across cohorts and gender. Except for the mortality outcomes, the statistical tests suggest that the analyses can be pooled across both gender and cohorts. For hospital days, we therefore present estimates for the combined sample for men and women born in 1921–1950; and for mortality, we present gender-specific estimates for each of the two cohorts.
Tables 5 and 6 give our estimated schooling coefficients for days hospitalized per year of exposure during 1980–2002 (Table 5, top panel); days hospitalized per year of exposure up to two years before death during 1980–2000 (Table 5, bottom panel); and for the probability of mortality prior to 2003 (Table 6) for the 5% random sample of singletons, MZ twins, and DZ twins. All the estimates control for birth-year fixed effects and additive gender differences. The twins sample standard estimates control for within–twin pair clustering. Robust standard errors are reported for all analyses.22 The estimates in these tables permit us to address the questions posed in the introduction:
What are the cross-sectional associations between schooling attainment and hospitalization and mortality for Danish adults in the 1921–1950 birth cohort? These associations are given in the first column of Tables 5 and 6 for the 5% sample. They are significantly negative, consistent with the dominant finding in the literature, and fairly substantial for all three outcomes. The 5% sample estimates indicate that a one SD increase in schooling attainment is associated with declines of 9.8% of the mean days hospitalized per year prior to 2003, or death if that occurs earlier; of 7.4% of the mean days hospitalized per year prior to two years prior to death; and of 6.8%–20.7% in the mean probability of mortality by 2003 (Table 7, column 1). The associations are similar (and apparently not statistically different from those for the 5% sample) for the MZ and DZ samples, although less precisely estimated (but still significantly nonzero at the .05 level in 10 of the 12 cases and at the .10 level in 11 of the 12 cases).
What are the estimated causal impacts of schooling on hospitalization and mortality for these individuals when the MZ-twin design is used to control for unobserved family endowments? The within-MZ estimates (Tables 5 and 6, column 4) are insignificant at the .10 level for both measures of hospitalization per year in the pooled analyses, and in all four gender/cohort analyses for mortality by 2003. These estimates might be consistent with a true positive effect of schooling on health, but this seems unlikely. In particular, one possibility consistent with a true positive effect of schooling on health is that a systematic persistent factor affects schooling and health in opposite directions, as discussed at the end of the Methodology section. Although it is conceivable that unobserved persistent shocks affect health and schooling in the opposite direction, what generally are considered to be more plausible shocks are likely to affect health and schooling in the same direction (e.g., in utero environments or later illnesses or accidents that reduce both schooling and health)—and that would mean that the within-MZ estimate of the impact of schooling on health is upwardly biased. A second possibility is that measurement error bias toward zero, which is exacerbated by within-MZ estimates as is discussed at the end of the Data section, accounts for the differences. However, our conservative estimate of the extent of measurement error in our data presented at the end of the Data section would account for a drop of about 7% in the estimated coefficient between the individual and within-MZ estimates in contrast with the much larger drops that we obtain: 93% to 99% for days hospitalized, and 53% to 79% for the two estimates for mortality that do not change sign. A third possibility emerges in light of the Lundborg (2008) results of significant nonlinear school effects for the United States in contrast with the Fujiwara and Kawachi (2009) results of generally no significant casual effects with a continuous schooling measure using the same data: there are significant nonlinear effects for our Danish data that are obscured with our linear specification. If nonlinearities are introduced through dichotomous variables for completing different schooling levels, the standard cross-sectional estimates suggest some strong differences in outcomes across these educational categories. However, with the within-twin estimates, the null hypothesis of all schooling coefficients equaling zero (and therefore no differences across completed schooling categories) is not rejected (Table 9 in the Appendix). Thus, our basic results do not seem to be an artifact of unobserved factors that affect schooling and health in the same direction, measurement errors of the magnitudes estimated in other studies, or nonlinearity in schooling effects of the sort that apparently underlie the differences for the United States between the significance of the within-MZ results in Lundborg (2008) and Fujiwara and Kawachi (2009) using the same data set.
To what extent do these within–MZ twins estimated impacts differ from what cross-sectional associations suggest? Column 6 in Tables 5 and 6 gives the ratio of the within-MZ point estimates to the MZ cross-sectional point estimates. A ratio of 1 indicates that the schooling coefficient estimate is not affected by using the within estimate rather than the standard estimate. A negative sign indicates that the estimate for the within-MZ case is opposite in sign from that for the MZ standard estimate. This comparison emphasizes that the implications of the within-MZ estimates in a number of cases are very different from those of the cross-sectional MZ associations. Three of the six ratios in Tables 5 and 6 are negative, and none of the ratios is between 0.5 and 1.5: that is, in none of the cases is the standard estimate within 50% of the within-twin estimate. Of course, a number of the underlying point estimates are estimated very imprecisely, so the differences between the two estimators are not always statistically significant. If one considers the MZ standard estimates in column 2 in Tables 5 and 6 in isolation, though, all the point estimates are negative, and all but one are so at the standard .05 significance level; this is similar to, although less precise than, the pattern in column 1 for the 5% sample estimates. These negative estimates in columns 1 and 2, if a causal interpretation is given, presumably lead to a conclusion that more schooling leads to less hospitalization and lower mortality by 2003. In contrast, if one considers the within-MZ estimates in column 4 in Tables 5 and 6 in isolation, none of the six estimated coefficients is significantly different from zero at the .10 level, and three of six have positive signs (for hospitalization per year up to two years before death, and for mortality prior to 2003 for both males and females in the 1921–1935 cohort).23 The conclusion about the impact of schooling based on the within-MZ estimates is therefore much different than with the standard estimates (also see column 1 versus column 2 in Table 7).
To what extent do within–MZ and within–DZ twin estimates differ, as they will if individual-specific endowments (as contrasted with common family endowments) are important? Once again, a number of the estimates are imprecisely estimated, and the differences between the within-MZ and within–DZ twin estimates in Tables 5 and 6 are not statistically significant. However, there is a tendency—that holds in all but one of the six cases in Tables 5 and 6—for the within–DZ twin estimates to fall between the standard cross-sectional estimates and the within-MZ estimates. Thus, the patterns in the estimates suggest that individual-specific endowments may be playing a role in the DZ-within estimates, particularly because the within-DZ estimates look more like the standard estimates than do the within-MZ estimates (i.e., with more negative coefficient estimates for the within-DZ estimates, including the two that are significant at the .10 level).
Table 5.
Estimates of schooling coefficients for days hospitalized
Dependent Variable | Standard
|
Within Twins
|
Within/Standard
|
|||||
---|---|---|---|---|---|---|---|---|
5% | MZ | DZ | MZ | DZ | MZ | DZ | ||
Number of Hospital Days per Year | Coefficient | −0.056 | −0.060 | −0.053 | 0.005 | −0.008 | −0.083 | 0.145 |
SE | 0.005 | 0.020 | 0.012 | 0.034 | 0.028 | |||
t Statistic | −12.09 | −3.06 | −4.31 | 0.15 | −0.28 | |||
Number of Hospital Days per Year, up to 2 Years Prior to Death (or end of observation period) | Coefficient | −0.042 | −0.050 | −0.041 | 0.000 | −0.027 | 0.010 | 0.650 |
SE | 0.003 | 0.012 | 0.008 | 0.025 | 0.014 | |||
t Statistic | −13.10 | −4.03 | −4.92 | −0.02 | −1.84 |
Table 6.
Linear probability estimates of schooling coefficients for mortality prior to 2003
Gender, cohort | Standard
|
Within Twins
|
Within/Standard
|
|||||
---|---|---|---|---|---|---|---|---|
5% | MZ | DZ | MZ | DZ | MZ | DZ | ||
Males, 1921–1935 | Coefficient | −0.0084 | −0.0108 | −0.0076 | 0.0051 | −0.0071 | −0.471 | 0.935 |
SE | 0.0010 | 0.0046 | 0.0031 | 0.0084 | 0.0052 | |||
t Statistic | −8.58 | −2.33 | −2.47 | 0.60 | −1.35 | |||
Males, 1936–1950 | Coefficient | −0.0070 | −0.0056 | −0.0091 | −0.0012 | −0.0035 | 0.218 | 0.383 |
SE | 0.0006 | 0.0022 | 0.0016 | 0.0040 | 0.0027 | |||
t Statistic | −11.44 | −2.53 | −5.75 | −0.31 | −1.28 | |||
Females, 1921–1935 | Coefficient | −0.0085 | −0.0099 | −0.0060 | 0.0127 | −0.0006 | −1.282 | 0.095 |
SE | 0.0010 | 0.0039 | 0.0031 | 0.0079 | 0.0055 | |||
t Statistic | −8.09 | −2.52 | −1.93 | 1.61 | −0.10 | |||
Females, 1936–1950 | Coefficient | −0.0047 | −0.0032 | −0.0047 | −0.0015 | −0.0059 | 0.476 | 1.248 |
SE | 0.0005 | 0.0021 | 0.0014 | 0.0038 | 0.0026 | |||
t Statistic | −8.71 | −1.52 | −3.30 | −0.40 | −2.27 |
Table 7.
Estimated percentage changes with 1 SD increase in schooling attainment
5% Sample | MZ Twins Within | DZ Twins Within | |
---|---|---|---|
Hospital Days/Exposure | −9.8 | 1.0 | −1.5 |
Hospital Days/Exposure < 2y Death | −7.4 | −0.1 | −5.0 |
Mortality by 2003 | |||
Males, 1921–1935 | −6.8 | 5.2 | −6.0 |
Males, 1936–1950 | −20.7 | −4.5 | −10.1 |
Females, 1921–1935 | −8.4 | 16.3 | −0.6 |
Females, 1936–1950 | −19.6 | −7.3 | −27.6 |
Conclusions
We contribute to the literature by obtaining—together with the recently published study by Fujiwara and Kawachi (2009) and unpublished studies by Amin et al. (2010), Behrman et al. (2006), and Lundborg (2008)—among the first estimates of the impact of schooling on health using the within–MZ twins methodology to control for unobserved endowments. We use objective health measures, over long time periods for hospitalization, from Danish administrative hospital discharge records and the Danish national mortality register. Our estimates arguably are closer to average treatment effect (ATE) estimates for most of the sample that have more than seven years of schooling than are most of the instrumental variable (IV) estimates that recently have been used to address this question and that tend to be local average treatment effects (LATE) estimates because they depend on instruments such as school reform that change compulsory schooling levels.
Our results are very suggestive for the questions that are posed in the introduction. They imply, in summary, strong negative associations between schooling attainment and hospitalization per year during 1980–2002 and hospitalizations per year up to two years before death and mortality by 2003 for both males and females for both the 1921–1935 and 1936–1950 birth cohorts. However, our estimates also question the standard inference from other, similar estimates that these strong negative associations reflect important direct causal effects of schooling. Instead, schooling seems primarily to serve as a marker for parental family and individual-specific endowments that are uncontrolled in the usual estimates. This raises questions about the usual attribution to schooling of substantial positive effects on health-related behaviors and outcomes and about whether there is an important causal schooling-health gradient. In terms of causal effects, despite the strong associations with schooling, the real stratification appears to be with regard to parental family and individual-specific background endowments. Better endowments apparently tend to lead to more schooling, less hospitalization, and less mortality by 2003, but the resulting negative associations between schooling on one hand and hospitalization and mortality on the other do not appear to reflect significant causal effects of schooling in reducing hospitalization and mortality in the context studied. Our results are similar to estimates for a subset of the same sample that significant cross-sectional associations among individual adult social class, health, and mortality and education were not significant in within-twin estimates (Madsen et al. 2010; Osler et al. 2007). Our results contrast with those presented in the recent study using a very similar methodology by Lundborg (2008), who found that using the within–MZ twins approach did not suggest that schooling was largely a proxy for family background in the United States if schooling was allowed to have nonlinear effects. However, functional form apparently is important for the Lundborg results because Fujiwara and Kawachi (2009) did not find significant causal effects of schooling on health using the same data as Lundborg (2008) but with a continuous representation of schooling. In contrast, we find no significant effect of schooling on health whether we use a nonlinear representation of schooling (as did Lundborg) or a continuous measure (as did Fujiwara and Kawachi).24
Acknowledgments
This study is part of a large collaborative project, involving the authors of this article and others, that is supported in part by NICHD Grants HD046144 (“Causal Effects of Schooling on Adult and Child Health”) and HD043417 (“Bio-Social Determinants of Fertility and Related Behaviors”). The authors thank participants at a session in which this article was presented at the 2006 annual meetings of the Population Association of America in Los Angeles, CA; three reviewers; the editor; and Mia Madsen for helpful comments on previous versions of the article.
Appendix
Table 8.
Estimates of schooling coefficients for days hospitalized with semi-log specification
Dependent Variable | Standard
|
Within Twins
|
Within/Standard
|
|||||
---|---|---|---|---|---|---|---|---|
5% | MZ | DZ | MZ | DZ | MZ | DZ | ||
Number of Hospital Days per Year | Coefficient | −0.0159 | −0.0118 | −0.0151 | 0.0084 | −0.0079 | −0.708 | 0.521 |
SE | 0.0007 | 0.0030 | 0.0020 | 0.0052 | 0.0035 | |||
t Statistic | −22.300 | −3.990 | −7.720 | 1.610 | −2.230 | |||
Number of Hospital Days per Year, up to 2 Years Prior to Death (or end of observation period) | Coefficient | −0.0133 | −0.0110 | −0.0123 | 0.0054 | −0.0074 | −0.495 | 0.603 |
SE | 0.0006 | 0.0026 | 0.0017 | 0.0048 | 0.0029 | |||
t Statistic | −21.320 | −4.240 | −7.230 | 1.130 | −2.520 |
Table 9.
Estimates of nonlinear schooling effects using dummy variables for completed schooling levels
Dependent Variable | Standard
|
Within Twins
|
||||
---|---|---|---|---|---|---|
5% | MZ | DZ | MZ | DZ | ||
Number of Hospital Days per Year | ||||||
Education categorya | ||||||
Upper secondary schooling/High School diploma | coefficient | −0.193 | −0.623 | 0.027 | 0.400 | 0.913 |
SE | 0.135 | 0.267 | 0.905 | 0.309 | 1.054 | |
t Statistic | −1.440 | −2.340 | 0.030 | 1.290 | 0.870 | |
(3–4 years of) Vocational training | coefficient | −0.232 | −0.352 | −0.188 | 0.028 | 0.050 |
SE | 0.036 | 0.141 | 0.094 | 0.225 | 0.165 | |
t Statistic | −6.460 | −2.490 | −2.000 | 0.120 | 0.300 | |
Short-cycle higher education/Two-year college degree | coefficient | −0.407 | −0.486 | 0.156 | 0.036 | 0.705 |
SE | 0.076 | 0.220 | 0.681 | 0.279 | 0.851 | |
t Statistic | −5.330 | −2.210 | 0.230 | 0.130 | 0.830 | |
Medium-cycle higher education/Three years college or university degree | coefficient | −0.488 | −0.488 | −0.522 | 0.141 | −0.236 |
SE | 0.046 | 0.198 | 0.107 | 0.306 | 0.235 | |
t Statistic | −10.630 | −2.470 | −4.860 | 0.460 | −1.010 | |
Master’s and PhD degrees/Five-year university degree and PhD graduates | Coefficient | −0.723 | −0.701 | −0.887 | 0.116 | −0.551 |
SE | 0.065 | 0.251 | 0.139 | 0.388 | 0.435 | |
t Statistic | −11.210 | −2.790 | −6.370 | 0.300 | −1.270 | |
Unknown educational level (mostly immigrants or ongoing basic schooling) | Coefficient | 0.401 | 0.146 | −0.066 | −0.049 | 0.286 |
SE | 0.185 | 0.476 | 0.437 | 0.738 | 0.555 | |
t Statistic | 2.170 | 0.310 | −0.150 | −0.070 | 0.520 | |
F Statistic for null hypothesis that all schooling coefficients are equal to zero | 35.49 | 2.45 | 9.38 | 0.39 | 0.94 | |
p Value | <.01 | 0.02 | <.01 | .89 | .46 | |
Number of Hospital Days per Year, up to 2 Years Prior to Death (or end of observation period) | ||||||
Education categorya | ||||||
Upper secondary schooling/High school diploma | Coefficient | −0.234 | −0.503 | 0.514 | 0.199 | 1.199 |
SE | 0.111 | 0.187 | 1.098 | 0.270 | 1.264 | |
t statistic | −2.100 | −2.700 | 0.470 | 0.740 | 0.950 | |
(3–4 years of) Vocational training | Coefficient | −0.177 | −0.278 | −0.175 | 0.000 | −0.070 |
SE | 0.025 | 0.089 | 0.065 | 0.172 | 0.093 | |
t Statistic | −7.100 | −3.130 | −2.720 | 0.000 | −0.760 | |
Short-cycle higher education/Two-year college degree | Coefficient | −0.294 | −0.281 | −0.500 | 0.028 | −0.275 |
SE | 0.056 | 0.179 | 0.089 | 0.254 | 0.161 | |
t Statistic | −5.290 | −1.570 | −5.640 | 0.110 | −1.710 | |
Medium-cycle higher education/Three years college or university degree | Coefficient | −0.359 | −0.372 | −0.357 | 0.154 | −0.375 |
SE | 0.032 | 0.120 | 0.082 | 0.231 | 0.190 | |
t Statistic | −11.120 | −3.090 | −4.330 | 0.670 | −1.970 | |
Master’s and PhD degrees/Five-year university degree and PhD graduates | Coefficient | −0.517 | −0.603 | −0.610 | −0.187 | −0.216 |
SE | 0.045 | 0.171 | 0.121 | 0.266 | 0.189 | |
t Statistic | −11.390 | −3.530 | −5.050 | −0.710 | −1.140 | |
Unknown educational level (mostly immigrants or ongoing basic schooling) | Coefficient | 0.318 | 0.002 | −0.226 | 0.384 | 0.197 |
SE | 0.163 | 0.326 | 0.227 | 0.371 | 0.290 | |
t Statistic | 1.950 | 0.010 | −0.990 | 1.030 | 0.680 | |
F statistic for null hypothesis that all schooling coefficients are equal to zero | 37.06 | 3.42 | 8.66 | 0.8 | 1.27 | |
p Value | <.01 | <.01 | <.01 | .80 | .27 |
Reference category is basic and compulsory primary and upper secondary schooling.
Footnotes
Many studies have considered such possibilities, some of which have focused on the relative schooling (and other aspects of socioeconomic status) effects on health (e.g., Adams et al. 2002; Adler et al. 1994; Brooks-Gunn et al. 1997; Case et al. 2002a, b; Deaton 2001a, b; Deaton and Paxson 1998, 1999, 2001; Elo and Preston 1992, 1996; Kawachi et al. 1999; Marmot 1999; Mellor and Milyo 2002; Preston 1975; Preston and Elo 1995; Ross and Mirowsky 1999; Smith 1999; Strauss and Thomas 1998; Wilkinson 1992, 1996, 2000). Most of these studies did not control for the endogenous choice of schooling, but a relatively few studies did (see the Previous Related Studies section).
Many studies have documented the association of genetic endowments with physical and mental health (e.g., Amouyel et al. 1996; Bartres-Faz et al. 2000; Chen et al. 2001; Christensen et al. 1995, 1998, 2000a; Clee et al. 2001; Eicher et al. 2002; Forsberg et al. 2002; Frosst et al. 1995; Humbert et al. 1993; Jenny et al. 2002; Jiang et al. 2001; Kelly et al. 2002; Kluijtmans et al. 1996; Morita et al. 1997; Myllykangas et al. 2001; Pericak-Vance and Haines 1995; Roses 1998; Sawano et al. 2001; Voetsch et al. 2002). For simplicity, we represent the endowments here as if they were scalars. However, they may be vectors with different components, for example, that differentially affect schooling versus health and that are not perfectly correlated and, indeed, not necessarily positively correlated. In fact, some recent studies have suggested that innate education and health components of endowments may be negatively correlated (Behrman et al. 2008; Behrman and Rosenzweig 2004).
This formulation is consistent with standard models of intrahousehold allocation of investments in children (e.g., Becker 1991; Becker and Tomes 1976; Behrman et al. 1982, 1995). Behrman et al. (1994) used a similar formulation with MZ and DZ twins to estimate whether such intrahousehold allocations reinforce or compensate for individual-specific endowment differences among siblings (and found reinforcement).
In general, endowments of all members of a sibship affect the investments in any member of the sibship, but the effects of all siblings other than the other twin drop out in the within-twin estimators because the characteristics of other siblings are the same for the two members of a twinship. We expect the direct effect to be greater than the cross effect (αa > αs). If the coefficient of the cross effect is positive, there is compensating behavior in that parents invest more in sibling j if sibling k is better endowed. If the coefficient of the cross effect is negative, there is reinforcing behavior (i.e., reinforcing endowment differences) in that parents invest less in sibling j if sibling k is better endowed. In the within-MZ estimator, the two twins have identical individual-specific endowments, so aij and akj cancel out. The only estimates of which we are aware that are consistent with this framework and that estimate the cross-twin effect using a combination of MZ and DZ twins indicated reinforcing behavior (Behrman et al. 1994).
Behrman et al. (1994, 1996) reported evidence that the individual-specific endowments are significant and important in schooling-earning models estimated for the United States.
Differences in birth weights between MZ twins have been used to estimate the impact of in utero nutrition on subsequent life cycle outcomes, but such differences could be due to factors such as differential placement in the womb relative to the placenta even if the two members of a twinship were identical at conception (e.g., Behrman and Rosenzweig 2004; Conley et al. 2006). However, for some observable outcomes for which MZ twins are discordant in a few percent of births, such as congenital malformations, some studies suggested that the differences originate at least in part in differences in genes, perhaps related to the process of twinning (e.g., Hall 2003; Kondo et al. 2002; Lubinsky and Hall 1991), and not just in the environment in the womb. For such reasons, within–MZ twins estimates may not control perfectly for endowments at conception (as we assume herein), but they nevertheless control better than other options for such endowments.
Schnittker and Behrman (2010) and Kohler et al. (2011) provided recent evaluations of this and other criticisms of within-twins estimates and other twins designs used in the social sciences.
Amin et al. (2010) tested for the impact of this possibility by including birth weights in their estimates and found that this does not change their estimates of the schooling coefficient substantially.
Other examples that mostly focus on health-related behaviors, such as smoking, are Chou et al. (2007), de Walque (2007), Grimard and Parent (2007), Kenkel et al. (2006), and Oreopoulos (2006).
In their discussion, they state: “In summary, the current study showed possible causal effects of education on perceived global health and on smoking habits among males, but did not suggest direct associations between schooling and the other health outcomes studied.” (Fujiwara and Kawachi 2009:1320). The significant outcomes for males to which they refer, however, did not occur in within-MZ estimates, but only in within-MZ and -DZ combined estimates.
Two other unpublished studies that present within–MZ twin estimates of the impact of schooling on health outcomes and health-related behaviors are Amin et al. (2010), with estimates for the United Kingdom, Australia, and the United States; and Behrman et al. (2006), with estimates for China.
Lundborg (2008) also explored the impacts of schooling on health-related behaviors and reported little evidence of significant impacts.
This has been a valuable source of data for study of a number of biomedical, biodemographic, and socioeconomic topics. For more information and some examples, see Andersen-Ranberg et al. (1999); Bingley et al. (2009); Christensen et al. (1995, 1996, 1998, 1999, 2000b); Gaist et al. (2000); Hauge (1981); Kohler and Kohler (2002); Kohler and Rodgers (1999, 2000); Kohler et al. (1999, 2001, 2002, 2003, 2005); Kyvik et al. (1996); Rodgers et al. (2008); and Skytthe et al. (1998, 2002). Some have been concerned that twins have a different health and aging trajectory profile owing to the fact that they experience growth restrictions in the womb, but extensive tests generally have found no differences in health and aging trajectories between twins and singletons in high-income populations (Christensen and McGue 2008).
The data includes birth cohorts up to 1975, but we limit this analysis to the 1921–1950 birth cohorts because the dependent variables on which we focus are primarily relevant for older adults.
If migration or early mortality is affected by endowments that also affect schooling and the health outcomes that we study, the inability to include individuals who migrated or experienced early mortality biases the estimated impact of schooling on health using standard OLS individual estimates. However, within–MZ twins estimates control for the first-order effects of such endowments and thus do not suffer from the same biases of sample selection due to endowment-related migration or early mortality as do OLS estimates.
The hypothesis of equal means for men and women is rejected at with a p value of .016 for the cohort 1921–1935, but the means for men and women are not statistically different for the 1936–1950 cohort.
The Ns are slightly smaller for this variable than for the other two because of some mortality in 1980–1982 (see Table 1).
We use “years of schooling,” as is conventional in this literature, to mean the years of schooling required to attain a given level or grade of schooling with full-time school attendance and no grade repetition or skipping. If there is grade repetition or part-time attendance, the calendar years attending school may exceed the years of schooling as we use the term (and if there is skipping of grades, the reverse may hold).
Both individual population census self-reports and education institution reports are for training undertaken and qualifications attained. The Ministry determined which is the highest qualification separately along academic and vocational lines according to “normal” completion times. The data that are available to us are normal completion times for highest qualification achieved. In a few cases where the highest vocational/academic training is unknown in the data, the minimal required years of schooling for these cohorts (= 7 years) was used for years of schooling.
This approach is advantageous relative to an alternative tabulation of schooling differences by average twin pair schooling level because by construction, the mean difference in the years of schooling will tend to become small for twin pairs that have either a very high or very low mean schooling level.
If the correlation in measurement error between siblings (ρw) is nonzero, plim bSw = βS[1 − φσ2(wij)/σ2(Sij)], where φ = (1 − ρw)/(1 − ρS). Note that the measurement error bias in the within-sibling estimate is decreasing in ρw and is less in the within-sibling estimate than in the standard estimate if ρw > ρS. We are not aware of any estimates of ρw. What appears to be random noise in cross-sectional data may have a family component if the measurement error is due to such unobserved factors as exaggeration or modesty or to failure to control for school quality, all of which may be shared by siblings.
The specifications for these estimates are all linear. The basic thrust of the results summarized below is robust to using a semi-log specification (because the distributions are skewed to the right) for the two hospitalization outcomes. See Tables 8 and 9 in the Appendix.
With the alternative specifications in the Appendix (Tables 8 and 9), the differences between the cross-sectional estimates and the within-MZ estimates appear even stronger. For example, for the two measures of hospitalization per year both of the cross-sectional estimates are significantly negative at high levels, but in the within-MZ estimates, both of these are positive, although not significant at the .05 level.
Amin et al. (2010) also reported no significant effect of schooling regardless of whether a nonlinear or continuous measure of schooling was used.
Contributor Information
Jere R. Behrman, Email: jbehrman@econ.upenn.edu, Economics and Sociology, University of Pennsylvania, McNeil 160, 3718 Locust Walk, Philadelphia, PA 19104-6297, USA.
Hans-Peter Kohler, Department of Sociology, University of Pennsylvania, Philadelphia, PA, USA.
Vibeke Myrup Jensen, Danish National Centre for Social Research, Copenhagen, Denmark.
Dorthe Pedersen, The Danish Twin Registry and The Danish Aging Research Center, University of Southern Denmark, Copenhagen, Denmark.
Inge Petersen, The Danish Twin Registry and The Danish Aging Research Center, University of Southern Denmark, Copenhagen, Denmark.
Paul Bingley, Danish National Centre for Social Research, Copenhagen, Denmark.
Kaare Christensen, The Danish Twin Registry and The Danish Aging Research Center, University of Southern Denmark, Copenhagen, Denmark.
References
- Adams S. Educational attainment and health: Evidence from a sample of older adults. Education Economics. 2002;10:97–109. [Google Scholar]
- Adams P, Hurd MD, McFadden D, Merrill A, Ribeiro T. Healthy, wealthy, and wise? Tests for direct causal paths between health and socioeconomic status. Journal of Econometrics. 2002;112:3–56. [Google Scholar]
- Adler N, Boyce T, Chesney MA, Cohen S, Folkman S, Kahn R, Syme LS. Socioeconomic status and health: The challenge of the gradient. American Psychologist. 1994;49:15–24. doi: 10.1037//0003-066x.49.1.15. [DOI] [PubMed] [Google Scholar]
- Amin V, Behrman JR, Spector TD. Does more schooling improve health behaviours and health outcomes? Evidence from U K twins. London, UK: Economics Department, Royal Holloway College, University of London; 2010. Working paper. [Google Scholar]
- Amouyel P, Richard F, Cottel D, Amant C, Codron V, Helbecque N. The deletion allele of the angiotensin i converting enzyme gene as a genetic susceptibility factor for cognitive impairment. Neuroscience Letters. 1996;217:203–205. [PubMed] [Google Scholar]
- Andersen TF, Madsen M, Joergensen J, Mellemkjaer L, Olsen JH. The Danish National Hospital Register: A valuable source for modern health sciences. Danish Medical Bulletin. 1999;46:263–268. [PubMed] [Google Scholar]
- Andersen-Ranberg K, Christensen K, Jeune B, Skytthe A, Vasegaard L, Vaupel JW. Declining physical abilities with age: A cross-sectional study of older twins and centenarians in Denmark. Age and Aging. 1999;28:373–377. doi: 10.1093/ageing/28.4.373. [DOI] [PubMed] [Google Scholar]
- Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. Journal of the American Statistical Association. 1996;91:444–455. [Google Scholar]
- Angrist J, Krueger A. Does compulsory school attendance affect schooling and earnings? Quarterly Journal of Economics. 1991;106:979–1014. [Google Scholar]
- Arendt JN. Does education cause better health? A panel data analysis using school reforms for identification. Economics of Education Review. 2005;24:149–160. [Google Scholar]
- Arkes J. Does schooling improve adult health? Santa Monica, CA: RAND Corporation; 2003. DRU-3051. [Google Scholar]
- Ashenfelter O, Krueger A. Estimates of the economic return to schooling from a new sample of twins. American Economic Review. 1994;84:1157–1173. [Google Scholar]
- Ashenfelter O, Rouse C. Income, schooling and ability: Evidence from a new sample of identical twins. Quarterly Journal of Economics. 1998;113:153–284. [Google Scholar]
- Bartres-Faz D, Junque C, Clemente IC, Lopez-Alomar A, Valveny N, Lopez-Guillen A, Moral P, et al. Angiotensin I converting enzyme polymorphism in humans with age-associated memory impairment: Relationship with cognitive performance. Neuroscience Letters. 2000;290:177–180. doi: 10.1016/s0304-3940(00)01349-5. [DOI] [PubMed] [Google Scholar]
- Becker GS. A treatise on the family. 2. Cambridge, MA: Harvard University Press; 1991. [Google Scholar]
- Becker GS, Tomes N. Child endowments and the quantity and quality of children. Journal of Political Economy. 1976;84:S143–S162. [Google Scholar]
- Behrman JR, Hoddinott J, Maluccio JA, Soler-Hampejsek E, Behrman E, Martorell R, Stein AD, et al. What determines adult skills? Impacts of pre-schooling, schooling and post-schooling experiences in Guatemala. Washington, DC: International Food Policy Research Institute; 2008. IFPRI Discussion Paper No. 826. [Google Scholar]
- Behrman JR, Hrubec Z, Taubman P, Wales TJ. Socioeconomic success: A study of the effects of genetic endowments, family environment and schooling. Amsterdam, The Netherlands: North-Holland Publishing Company; 1980. [Google Scholar]
- Behrman JR, Li H, Suen W, Yang Z, Zhang J. Does more schooling improve health behaviors and health outcomes? Evidence from Chinese twins. Hong Kong: Chinese University of Hong Kong; 2006. Working paper. [Google Scholar]
- Behrman JR, Pollak RA, Taubman P. Parental preferences and provision for progeny. Journal of Political Economy. 1982;90:52–73. [Google Scholar]
- Behrman JR, Pollak RA, Taubman P. The wealth model: Efficiency in education and equity in the family. In: Behrman JR, Pollak RA, Taubman P, editors. From parent to child: Intrahousehold allocations and intergenerational relations in the United States. Chicago, IL: University of Chicago Press; 1995. pp. 113–138. [Google Scholar]
- Behrman JR, Rosenzweig MR. “Ability” biases in schooling returns and twins: A test and new estimates. Economics of Education Review. 1999;18:159–167. [Google Scholar]
- Behrman JR, Rosenzweig MR. Does increasing women’s schooling raise the schooling of the next generation? American Economic Review. 2002;92:323–334. [Google Scholar]
- Behrman JR, Rosenzweig MR. Returns to birthweight. Review of Economics and Statistics. 2004;86:586–601. [Google Scholar]
- Behrman JR, Rosenzweig MR. Does increasing women’s schooling raise the schooling of the next generation? American Economic Review. 2005;95:1745–1751. [Google Scholar]
- Behrman JR, Rosenzweig MR, Taubman P. Endowments and the allocation of schooling in the family and in the marriage market: The twins experiment. Journal of Political Economy. 1994;102:1131–1173. [Google Scholar]
- Behrman JR, Rosenzweig MR, Taubman P. College choice and wages: Estimates using data on female twins. Review of Economics and Statistics. 1996;73:672–685. [Google Scholar]
- Behrman JR, Taubman P. Intergenerational transmission of income and wealth. American Economic Review. 1976;66:436–440. [Google Scholar]
- Behrman JR, Wolfe BL. Does more schooling make women better nourished and healthier? Adult sibling random and fixed effects estimates for Nicaragua. Journal of Human Resources. 1989;24:644–663. [Google Scholar]
- Berger MC, Leigh JP. Education, self-selection and health. Journal of Human Resources. 1989;24:433–455. [Google Scholar]
- Bingley P, Christensen K, Jensen VM. Parental schooling and child development: Learning from twin parents. Denmark: University of Southern Denmark, and Aarhus School of Business, Aarhus University; 2009. The Danish National Centre for Social Research Working Paper 07–2009. [Google Scholar]
- Bishop J. Reporting errors and the true return to schooling. Economics Department, University of Wisconsin–Madison; 1976. Unpublished manuscript. [Google Scholar]
- Bonjour D, Cherkas L, Haskel J, Hawkes D, Spector T. Returns to education: Evidence from UK twins. American Economic Review. 2003;93:1799–1812. [Google Scholar]
- Bound J, Solon G. Double trouble: On the value of twins-based estimation of the return to schooling. Economics of Education Review. 1999;18:169–182. [Google Scholar]
- Brooks-Gunn J, Duncan GJ, Maritato N. Poor families, poor outcomes: The well-being of children and youth. In: Duncan GJ, Brooks-Gunn J, editors. Consequences of growing up poor. New York: Russell Sage Foundation; 1997. pp. 1–17. [Google Scholar]
- Case A, Fertig A, Paxson C. The development of the gradient: Earnings and health dynamics. Center for Health and Wellbeing, Princeton University; Princeton, NJ: 2002a. Unpublished manuscript Retrieved from http://www.wws.princeton.edu. [Google Scholar]
- Case A, Lubotsky D, Paxson C. Economic status and health in childhood: The origins of the gradient. American Economic Review. 2002b;92:1308–1334. doi: 10.1257/000282802762024520. [DOI] [PubMed] [Google Scholar]
- Chen W, Srinivasan SR, Elkasabany A, Ellsworth DL, Boerwinkle E, Berenson GS. Influence of lipoprotein lipase serine 447 stop polymorphism on tracking of triglycerides and hdl cholesterol from childhood to adulthood and familial risk of coronary artery disease: The Bogalusa Heart Study. Atherosclerosis. 2001;159:367–373. doi: 10.1016/s0021-9150(01)00508-1. [DOI] [PubMed] [Google Scholar]
- Chou S-Y, Liu J-T, Grossman M, Joyce T. Parental education and child health: Evidence from a natural experiment in Taiwan. Cambridge, MA: National Bureau of Economic Research; 2007. NBER Working Paper No. 1346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christensen K, Gaist D, Jeune B, Vaupel JW. A tooth per child? Lancet. 1998;352:204. doi: 10.1016/S0140-6736(05)77810-7. [DOI] [PubMed] [Google Scholar]
- Christensen K, Holm NV, McGue M, Corder L, Vaupel JW. A Danish population-based twin study on general health in the elderly. Journal of Aging and Health. 1999;11:49–64. doi: 10.1177/089826439901100103. [DOI] [PubMed] [Google Scholar]
- Christensen K, Holm NV, Vaupel JW. Alzheimer’s disease in twins. Lancet. 1996;347:976. [PubMed] [Google Scholar]
- Christensen K, Kristiansen M, Hagen-Larsen H, Skytthe A, Bathum L, Jeune B, Orstavik KH, et al. X-linked genetic factors regulate hematopoietic stem-cell kinetics in females. Blood. 2000a;95:2449–2451. [PubMed] [Google Scholar]
- Christensen K, McGue M. Academic achievement in twins: Yet another twin-singleton difference is diminishing. British Medical Journal. 2008;337:245–246. [Google Scholar]
- Christensen K, McGue M, Yashin A, Iachine I, Holm N, Vaupel J. Genetic and environmental influences on functional abilities in Danish twins aged 75 years and older. Journals of Gerontology Series A–Biological Sciences and Medical Sciences. 2000b;55:M446–M452. doi: 10.1093/gerona/55.8.m446. [DOI] [PubMed] [Google Scholar]
- Christensen K, Vaupel JW, Holm NV, Yashin AI. Mortality among twins after age 6: Fetal origins hypothesis versus twin method. British Medical Journal. 1995;310:432–436. doi: 10.1136/bmj.310.6977.432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clee SM, Loubser O, Collins J, Kastelein JJ, Hayden MR. The LPL S447X cSNP is associated with decreased blood pressure and plasma triglycerides, and reduced risk of coronary artery disease. Clinical Genetics. 2001;60:293–300. doi: 10.1034/j.1399-0004.2001.600407.x. [DOI] [PubMed] [Google Scholar]
- Conley D, Strully K, Bennett N. Twin differences in birth weight: The effects of genotype and parental environment on neonatal and post-neonatal mortality. Economics and Human Biology. 2006;4:151–183. doi: 10.1016/j.ehb.2005.12.001. [DOI] [PubMed] [Google Scholar]
- Cutler DM, Deaton AS, Lleras-Muney A. The determinants of mortality. Journal of Economic Perspectives. 2006;20(3):97–120. [Google Scholar]
- Cutler DM, Lleras-Muney A. Education and health: Evaluating theories and evidence. In: House J, Schoeni R, Kaplan G, Pollack H, editors. The effects of social and economic policy on health. New York: Russell Sage Foundation; 2008. pp. 29–60. [Google Scholar]
- Deaton AS. The policy implications of the gradient. Economics Department, Princeton University; Princeton, NJ: 2001a. Unpublished manuscript. [Google Scholar]
- Deaton AS. Relative deprivation, inequality, and mortality. Cambridge, MA: National Bureau of Economic Research; 2001b. NBER Working Paper No. 8099. [Google Scholar]
- Deaton AS, Paxson CH. Aging and inequality in income and health. American Economic Review. 1998;88:248–253. [Google Scholar]
- Deaton AS, Paxson CH. Mortality, education, income, and inequality among American cohorts. Cambridge, MA: National Bureau of Economic Research; 1999. NBER Working Paper No. 7140. [Google Scholar]
- Deaton AS, Paxson CH. Mortality, income, and income inequality over time in Britain and the United States. Center for Health and Wellbeing, Princeton University; Princeton, NJ: 2001. Unpublished manuscript. [Google Scholar]
- de Walque D. Does education affect smoking behavior? Evidence using the Vietnam draft as an instrument for college education. Journal of Health Economics. 2007;26:877–895. doi: 10.1016/j.jhealeco.2006.12.005. [DOI] [PubMed] [Google Scholar]
- Elo IT, Preston SH. Effects of early-life conditions on adult mortality: A review. Population Index. 1992;58:186–212. [PubMed] [Google Scholar]
- Elo IT, Preston SH. Educational differentials in mortality: United States, 1979–85. Social Science & Medicine. 1996;42:47–57. doi: 10.1016/0277-9536(95)00062-3. [DOI] [PubMed] [Google Scholar]
- Fletcher JM, Lehrer SF. Using genetic lotteries within families to examine the causal impact of poor health on academic achievement. Cambridge, MA: National Bureau of Economic Research; 2009. NBER Working Paper No. 15148. [Google Scholar]
- Forsberg L, Lyrenas L, de Faire U, Morgenstern R. A common functional C-T substitution polymorphism in the promoter region of the human catalase gene influences transcription factor binding, reporter gene transcription and is correlated to blood catalase levels. Free Radical Biology and Medicine. 2002;30:500–505. doi: 10.1016/s0891-5849(00)00487-1. [DOI] [PubMed] [Google Scholar]
- Frosst P, Blom HJ, Milos R, Goyette P, Sheppard CA, Matthews RG, Rozen R, et al. A candidate genetic risk factor for vascular disease: A common mutation in methylenetetrahydrofolate reductase. Nature Genetics. 1995;10:111–113. doi: 10.1038/ng0595-111. [DOI] [PubMed] [Google Scholar]
- Fujiwara T, Kawachi I. Is education causally related to better health? A twin fixed-effect study in the USA. International Journal of Epidemiology. 2009;38:1310–1322. doi: 10.1093/ije/dyp226. [DOI] [PubMed] [Google Scholar]
- Gaist D, Bathum L, Skytthe A, Jensen TK, McGue M, Vaupel JW, Christensen K. Strength and anthropometric measures in identical and fraternal twins: No evidence of masculinization of females with male co-twins. Epidemiology. 2000;11:340–343. doi: 10.1097/00001648-200005000-00020. [DOI] [PubMed] [Google Scholar]
- Griliches Z. Sibling models and data in economics: Beginnings of a survey. Journal of Political Economy. 1979;87:S37–S64. [Google Scholar]
- Grimard F, Parent D. Education and smoking: Were Vietnam draft avoiders also more likely to avoid smoking? Journal of Health Economics. 2007;26:896–926. doi: 10.1016/j.jhealeco.2007.03.004. [DOI] [PubMed] [Google Scholar]
- Grossman M. Education and non-market outcomes. In: Hanushek E, Welch F, editors. Handbook of the economics of education. Vol. 1. Amsterdam, the Netherlands: North-Holland, Elsevier Science; 2006. pp. 577–633. [Google Scholar]
- Hall JG. Twinning. Lancet. 2003;362:735–743. doi: 10.1016/S0140-6736(03)14237-7. [DOI] [PubMed] [Google Scholar]
- Hauge M. The Danish twin register. In: Mednich SA, Baert AE, Bauchmann BP, editors. Prospective longitudinal research: An empirical basis for the primary prevention of psychological disorders. Oxford, UK: Oxford University Press; 1981. pp. 217–221. [Google Scholar]
- Humbert R, Adler DA, Disteche CM, Hassett C, Omiecinski C, Furlong CE. The molecular basis of the human serum paraoxonase activity polymorphism. Nature Genetics. 1993;3:73–76. doi: 10.1038/ng0193-73. [DOI] [PubMed] [Google Scholar]
- Jenny NS, Tracy RP, Ogg MS, Luong le A, Kuller H, Arnold AM, Humphries SE, et al. In the elderly, interleukin-6 plasma levels and the 174G > C polymorphism are associated with the development of cardiovascular disease. Arteriosclerosis Thrombosis and Vascular Biology. 2002;22:2066–2071. doi: 10.1161/01.atv.0000040224.49362.60. [DOI] [PubMed] [Google Scholar]
- Jiang Z, Akey JM, Shi J, Xiong M, Wang Y, Shen Y, Jin L, et al. A polymorphism in the promoter region of catalase is associated with blood pressure levels. Human Genetics. 2001;109:95–98. doi: 10.1007/s004390100553. [DOI] [PubMed] [Google Scholar]
- Kawachi I, Kennedy BP, Wilkinson RG. The society and population health reader: Volume 1: Income inequality and health. New York: New Press; 1999. [Google Scholar]
- Kelly PJ, Rosand J, Kistler JP, Shih VE, Silveira S, Plomaritoglou A, Furie KL. Homocysteine, MTHFR 677C-T polymorphism, and risk of ischemic stroke: Results of a meta-analysis. Neurology. 2002;59:529–536. doi: 10.1212/wnl.59.4.529. [DOI] [PubMed] [Google Scholar]
- Kenkel D, Lillard D, Mathios A. The roles of high school completion and GED receipt in smoking and obesity. Journal of Labor Economics. 2006;24:635–660. [Google Scholar]
- Kluijtmans LAJ, van den Heuvel LPWJ, Boers GHJ, Frosst P, Stevens EMB, van Oost BA, Blom HJ, et al. Molecular genetic analysis in mild hyperhomocysteinemia: A common mutation in the methylenetetrahydrofolate reductase gene is a genetic risk factor for cardiovascular disease. American Journal of Human Genetics. 1996;58:35–41. [PMC free article] [PubMed] [Google Scholar]
- Kohler H-P, Behrman JR, Schnittker J. Social science methods for twins data: Integrating causality, endowments, and heritability. Biodemography and Social Biology. 2011;57:88–141. doi: 10.1080/19485565.2011.580619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kohler H-P, Behrman JR, Skytthe A. Partner + children = happiness? An assessment of the effect of fertility and partnerships on subjective well-being in Danish twins. Population and Development Review. 2005;31:407–445. [Google Scholar]
- Kohler H-P, Knudsen L, Skytthe A, Christensen K. The fertility pattern of twins and the general population compared: Evidence from Danish cohorts 1945–64. Demographic Research. 2002;6:383–408. doi: 10.4054/DemRes.2002.6.14. [DOI] [Google Scholar]
- Kohler H-P, Kohler I. The effect of marriage on mortality: Selection or protection? Evidence from Danish twins using fixed-effect survival models. Paper presented at the annual meeting of the Population Association of America; Atlanta, GA. 2002. May, [Google Scholar]
- Kohler H-P, Rodgers JL. DF-like analyses of binary, ordered and censored variables using probit and Tobit approaches. Behavior Genetics. 1999;29:221–232. [Google Scholar]
- Kohler H-P, Rodgers JL. DF-analyses of heritability with double-entry twin data: Asymptotic standard errors and efficient estimation. Behavior Genetics. 2000;31:179–191. doi: 10.1023/a:1010253411274. [DOI] [PubMed] [Google Scholar]
- Kohler H-P, Rodgers JL, Christensen K. Is fertility behavior in our genes: Findings from a Danish twin study. Population and Development Review. 1999;25:253–288. [Google Scholar]
- Kohler H-P, Rodgers JL, Christensen K. Between nurture and nature: The shifting determinants of female fertility in Danish twin cohorts 1870–1968. Social Biology. 2003;49(1–2):76–106. doi: 10.1080/19485565.2002.9989060. [DOI] [PubMed] [Google Scholar]
- Kohler H-P, Skytthe A, Christensen K. The age at first birth and completed fertility reconsidered: Findings from a sample of identical twins. Rostock, Germany: Max Planck Institute for Demographic Research; 2001. (Working Paper No. 2001–006). Retrieved from http://www.demogr.mpg.de. [Google Scholar]
- Kondo S, Schutte BC, Richardson RJ, Bjork BC, Knight AS, Watanabe Y, Murray JC, et al. Mutations in IRF6 cause Van der Woude and popliteal pterygium syndromes. Nature Genetics. 2002;32(2):285–289. doi: 10.1038/ng985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kyvik KO, Christensen K, Skytthe A, Harvald B, Holm NV. The Danish Twin Register. Danish Medical Bulletin. 1996;43:465–470. [PubMed] [Google Scholar]
- Lleras-Muney A. The relationship between education and adult mortality in the United States. Review of Economic Studies. 2005;72:189–221. [Google Scholar]
- Lubinsky MS, Hall JG. Genomic imprinting, monozygous twinning, and X inactivation. Lancet. 1991;337:1288. doi: 10.1016/0140-6736(91)92956-3. [DOI] [PubMed] [Google Scholar]
- Lundborg P. The health returns to education: What can we learn from twins? Bonn, Germany: Institute for the Study of Labor; 2008. IZA Discussion Paper No. 3399. [Google Scholar]
- Madsen M, Andersen AMN, Christensen K, Andersen PK, Osler M. Does educational status impact adult mortality in Denmark? A twin approach. American Journal of Epidemiology. 2010;72:225–234. doi: 10.1093/aje/kwq072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marmot M. Epidemiology of socioeconomic status and health: Are determinants within countries the same as between countries? In: Adler NE, Marmot M, McEwen BS, Steward J, editors. Socioeconomic status and health in industrial nations. New York: New York Academy of Sciences; 1999. pp. 16–29. [DOI] [PubMed] [Google Scholar]
- Mellor JM, Milyo J. Income inequality and health status in the United States: Evidence from the Current Population Survey. Journal of Human Resources. 2002;37:510–539. [Google Scholar]
- Miller P, Mulvey C, Martin N. What do twins studies tell us about the economic returns to education? A comparison of U.S. and Australian findings. American Economic Review. 1995;85:586–599. [Google Scholar]
- Miller P, Mulvey C, Martin N. Family characteristics and the returns to schooling: Evidence on gender differences from a sample of Australian twins. Economica. 1997;64:119–136. [Google Scholar]
- Morita H, Taguchi J, Kurihara H, Kitaoka M, Kaneda H, Kurihara Y, Yazaki Y, et al. Genetic polymorphism of 5,10-methylenetetrahydrofolate reductase (MTHFR) as a risk factor for coronary artery disease. Circulation. 1997;95:2032–2036. doi: 10.1161/01.cir.95.8.2032. [DOI] [PubMed] [Google Scholar]
- Myllykangas L, Polvikoski T, Sulkava R, Notkola IL, Rastas S, Verkkoniemi A, Haltia M, et al. Association of lipoprotein lipase Ser447Ter polymorphism with brain infarction: A population-based neuropathological study. Annals of Medicine. 2001;33:486–492. doi: 10.3109/07853890109002098. [DOI] [PubMed] [Google Scholar]
- Oreopoulos P. Estimating average and local average treatment effects of education when compulsory school laws really matter. American Economic Review. 2006;96:152–175. [Google Scholar]
- Osler M, McGue M, Christensen K. Socioeconomic position and twins’ health: A life-course analysis of 1266 pairs of middle-aged Danish twins. International Journal of Epidemiology. 2007;36:77–83. doi: 10.1093/ije/dyl266. [DOI] [PubMed] [Google Scholar]
- Pericak-Vance MA, Haines JL. Genetic susceptibility to Alzheimer disease. Trends in Genetics. 1995;11:504–508. doi: 10.1016/s0168-9525(00)89161-1. [DOI] [PubMed] [Google Scholar]
- Preston SH. The changing relation between mortality and level of economic development. Population Studies. 1975;29:231–248. [PubMed] [Google Scholar]
- Preston SH, Elo IT. Are educational differentials in adult mortality increasing in the United States? Journal of Aging and Health. 1995;7:476–496. doi: 10.1177/089826439500700402. [DOI] [PubMed] [Google Scholar]
- Rodgers JL, Kohler H-P, McGue M, Behrman JR, Petersen I, Bingley P, Christensen K. Education and cognitive ability as direct, mediated, or spurious influences on female age at first birth: Behavior genetic models fit to Danish twin data. American Journal of Sociology. 2008;114:S202–S232. doi: 10.1086/592205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roses AD. Alzheimer diseases: A model of gene mutations and susceptibility polymorphisms for complex psychiatric diseases. American Journal of Medical Genetics. 1998;81:49–57. doi: 10.1002/(sici)1096-8628(19980207)81:1<49::aid-ajmg10>3.0.co;2-w. [DOI] [PubMed] [Google Scholar]
- Ross CE, Mirowsky J. Refining the association between education and health: The effects of quantity, credential, and selectivity. Demography. 1999;36:445–460. [PubMed] [Google Scholar]
- Sawano M, Watanabe Y, Ohmura H, Shimada K, Daida H, Mokuno H, Yamaguchi H. Potentially protective effects of the Ser447-Ter mutation of the lipoprotein lipase gene against the development of coronary artery disease in Japanese subjects via a beneficial lipid profile. Japanese Circulation Journal. 2001;65:310–314. doi: 10.1253/jcj.65.310. [DOI] [PubMed] [Google Scholar]
- Schnittker J, Behrman JR. The value of twins for sociological research. Philadelphia: University of Pennsylvania; 2010. Working paper. [Google Scholar]
- Skytthe A, Holm NV, Vaupel JW, Christensen K. Establishing a population-based register of middle-aged twins in Denmark. Twin Research. 1998;1:110. [Google Scholar]
- Skytthe A, Kyvik K, Holm NV, Vaupel JW, Christensen K. The Danish Twin Registry: 127 birth cohorts of Twins. Twin Research. 2002;5:352–357. doi: 10.1375/136905202320906084. [DOI] [PubMed] [Google Scholar]
- Smith JP. Healthy bodies and thick wallets: The dual relation between health and economic status. Journal of Economic Perspectives. 1999;13:145–166. [PMC free article] [PubMed] [Google Scholar]
- Spasojevic J. Effects of education on adult health in Sweden: Results from a natural experiment. City University of New York Graduate Center; 2003. Unpublished doctoral dissertation. [Google Scholar]
- Strauss J, Thomas D. Health, nutrition and economic development. Journal of Economic Literature. 1998;36:766–817. [Google Scholar]
- UNESCO. Institute for Statistics. 2011 Retrieved from http://stats.uis.unesco.org/unesco/ReportFolders/ReportFolders.aspx?IF_ActivePath=P,50&IF_Language=eng.
- Voetsch B, Benke KS, Damasceno BP, Siqueira LH, Loscalzo J. Paraoxonase 192 Gln–>Arg polymorphism: An independent risk factor for nonfatal arterial ischemic stroke among young adults. Stroke. 2002;33:1459–1464. doi: 10.1161/01.str.0000016928.60995.bd. [DOI] [PubMed] [Google Scholar]
- Wilkinson RG. Income distribution and life expectancy. British Medical Journal. 1992;304:165–168. doi: 10.1136/bmj.304.6820.165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilkinson RG. Unhealthy societies. The affliction of inequality. London, UK: Routledge; 1996. [Google Scholar]
- Wilkinson RG. Mind the gap: Hierarchies, health, and human evolution. London, UK: Weidenfeld and Nicolson; 2000. [Google Scholar]