Abstract
This paper studies distributional effects of education on health. In 1972, England, Scotland, and Wales raised their minimum school-leaving age from 15 to 16 for students born after 9/1/1957. Using a regression discontinuity design and objective health measures for 129,000 individuals, we find that education reduced body size and increased blood pressure in middle age. The reduction in body size was concentrated at the upper tail of the distribution with an 8 percentage point reduction in obesity. The increase in blood pressure was concentrated at the lower tail of the distribution with no effect on stage 2 hypertension.
Keywords: I10, I20
There is broad disagreement about whether education has a causal effect on health (Grossman 2015). Some studies find that more education causes better health (e.g. Lleras-Muney 2005) while others find little to no effect (e.g. Clark and Royer 2013). Galama, Lleras-Muney, and Kippersluis (2018) argue that heterogeneity in these effects may underlie these conflicting results. Their model lays out conditions for such heterogeneity to exist, and evidence indicates that such conditions are met: (i) heterogeneity in the labor market returns to education (Card 2001); (ii) heterogeneity in the non-market returns to education (Barcellos, Carvalho, and Turley 2018); and (iii) heterogeneity in skill formation as a result of education (Grenet 2013).
This paper investigates whether the effects of education on health are heterogeneous by studying how education affects the distribution of health. If education affects some parts of the health distribution but not others, then the effect of education on health must vary across individuals. In particular, the effect may be larger for individuals with poorer health. In a Grossman-type model, education will cause the unhealthy to invest more in their health than the healthy because of the diminishing marginal utility of health (Kippersluis and Galama 2014).1
Since little is known ex-ante about the dimensions along which the effect of education on health may vary, a distributional approach is a suitable way to investigate heterogeneity. It avoids multiple hypotheses testing concerns and considerations about the lack of good measures of relevant dimensions. Moreover, if the effects are concentrated at particular parts of the health distribution, a distributional test may be better powered than a test of difference in means—see Online Appendix A.
We exploit a well-known schooling reform in the United Kingdom using a regression discontinuity design. In 1972, England, Scotland, and Wales raised their minimum school-leaving age from 15 to 16 for students born on or after September 1, 1957 (students born before this date could drop out at age 15), generating a discontinuity in the relationship between education and date of birth at the September 1, 1957 “cutoff.” The reform had the primary effect of inducing students who would have dropped out at age 15 to obtain exactly one additional year of education. We estimate the distributional effects of education on health from discontinuities in the cumulative distribution function (CDF) of health at the birthdate cutoff.
We use data from the UK Biobank, a study that collected multiple objective and continuous measures of health between 2006 and 2010; 34–38 years after the policy change. Using standardized protocols, healthcare technicians and nurses measured the height, weight, body fat percentage, waist and hip circumferences, lung function, and blood pressure of more than a quarter million people born in England, Scotland, and Wales between September 1, 1947 and August 31, 1967. The unprecedented availability of objective health measures for such a large sample permits well-powered estimation of distributional treatment effects. To ease concerns about multiple hypotheses testing, we focus our analysis on three health indices constructed from the multiple measures available: body size2, lung function, and blood pressure.
There are three main takeaways from the distributional analysis. First, staying in school until age 16 improves one dimension of health—there is a reduction in body size—and worsens another: there is an increase in blood pressure.3 Second, it reveals that these effects vary considerably along the health distribution. To give a sense of how effects are concentrated, staying in school until age 16 reduces the 90th percentile of the body size distribution by 0.43 of a standard deviation—that is equal to 2.5 times the effect on average body size. Third, the effects on body size and on blood pressure occur in different parts of their respective distributions: while the effect on body size is concentrated at the upper tail (i.e., among the least healthy), the effect on blood pressure is concentrated at the lower tail (i.e., among the most healthy).4
We conduct a distributional test based on Shen and Zhang (2016) to formally investigate whether these changes are statistically significant, testing differences in the bottom and top halves of the CDFs of our three health indices. The test rejects at the 5% significance level the null of equality for the top half of the body size distribution (p-value of 0.013) and for the bottom half of the blood pressure distribution (p-value of 0.028). We only find suggestive evidence of a positive effect on lung function.
The results from our distributional analyses suggest that the effects of education on health are heterogeneous. Our estimates recover the distributional effects on compliers, that is, students who stayed in school an additional year because of the schooling reform. If the effects were homogeneous, we would see effects across the whole support of the complier distribution, with the largest effect at the mode (see Online Appendix B). This is because, if education affects each complier equally, the distributional effect at a given point should roughly correspond to the mass at this point in the pre-reform complier distribution. As described above, this is not what we find: effects are concentrated at the top of the body size distribution, with the largest effect at the 83rd percentile. Therefore, our findings are not consistent with homogeneous treatment effects. Importantly, our method recovers differences in the health distribution of compliers only and does not rely on rank preservation assumptions.
We also conduct an exploratory analysis to investigate possible channels through which education may affect health. While improvements in SES and diet may explain the reduction in body size, the channels for the increase in blood pressure are less clear. We do show, however, that education allowed individuals to earn higher income in middle age, in part because they hold better jobs (Barcellos, Carvalho and Turley 2021). One possibility is that, by changing the types of occupations and careers individuals have, education might have had an effect on job responsibilities, expectations, and work-related stress. This would be consistent with evidence that academically successful African Americans in the U.S. have higher biomarkers related to cardiometabolic risk (i.e. blood pressure and stress hormones) than other groups (Brody et al. 2013, Miller et al. 2015, Chen et al. 2015) and with findings that, despite improving SES outcomes, the 1972 reform had detrimental effects on mental health (Avendano et al. 2020).5
For the sake of comparison, we also estimate the effects of education on average health. A number of studies have exploited changes in compulsory schooling laws to study such effects (e.g., Lleras-Muney 2005; Albouy and Lequien 2009; Silles 2009; Powdthavee 2010; Kemptner et al. 2011; Clark & Royer 2013; Jürges et al. 2013; Davies et al. 2018; Janke et al. 2020; Meghir et al. forthcoming). We find that staying in school until age 16 reduces body size by 0.17 of a standard deviation and increases blood pressure by 0.15 of a standard deviation, but these estimates are only significant at the 10% level.6
In an apparent contrast to the results of this paper, Clark and Royer (2013) estimate a precise null effect of the 1972 ROSLA reform on mortality. They also find no statistically significant effect on the averages of non-mortality health outcomes.7 Our estimates of the effects on average health are, however, broadly consistent with their corresponding estimates. While Clark and Royer had very large samples to estimate the effect of education on mortality, their estimates of the effects on non-mortality health outcomes are less precise because they had more limited samples for such outcomes. For this reason, our estimates lie within the 95% confidence intervals of Clark and Royer (2010, 2013)’s corresponding estimates.
In addition, our work is closely related to Davies et al. (2018) which also use the 1972 ROSLA and UK Biobank data to study the effects of education on health, albeit with a focus on the effects on average health. They find that the 1972 ROSLA reduced the risk of diabetes and mortality, which is consistent with our finding that the policy reduced body size and obesity. Their sample is almost 10 years older than Clark and Royer (2013)’s which could explain why they find effects on mortality when Clark and Royer (2013) do not.8
While the main take-away from Clark and Royer (2013) is that education does not affect health, our results suggest a more nuanced story. The distributional analysis shows that education can affect the health of some subpopulations. The heterogeneity of the effects of education on health may explain the conflicting findings in the literature (Galama et al. 2018). First, education affects different parts of the health distribution differently, implying that the effects vary across individuals. Second, the effects also vary across outcomes: the additional schooling caused a reduction in body size and an increase in blood pressure.
Despite this, the question remains as to how education could affect some health outcomes while having no downstream effects on mortality. There are several hypotheses for such a result. First, as noted above, those affected by the 1972 ROSLA would have been less than 50 years old at the time of the Clark and Royer study. It is possible that not enough time had passed to observe meaningful differences in mortality.9 Second, health is a multi-dimensional construct, and, as we see in this study, education may improve some dimensions of health while worsening others. As a result, the ultimate effect of education on mortality may be attenuated by these opposing factors.10 In short, Clark and Royer’s finding of no effect on mortality are plausibly consistent with Davies et al. (2018)’s and our results.
We note that, while the UK Biobank data are rich and the sample size is large, the sample is not nationally representative – its participants have higher SES than the target population (Fry et al. 2017). We compare objective health outcomes of compliers of the schooling reform we study (i.e. those born before the reform who dropped out of school before age 16) in the UK Biobank to compliers in the Health Survey for England – a nationally representative sample. We do not find unequivocal evidence that the UK Biobank sample is healthier than the average population (among compliers) and the health differences among the two samples, while statistically significant, are small. Still, such differences could threaten the external validity of our results.
The paper is structured as follows. Section I discusses the 1972 raising of the school leaving-age reform and the data. In Section II we present the effects of the reform on education and the effects on average health. Section III discusses the methods used to estimate the distributional effects with results shown in Section IV. Section V presents suggestive evidence on mechanisms and Section VI concludes.
I. Background and Data
A. The 1972 Raising of the School Leaving Age
The British compulsory schooling laws specify the maximum age by which children must start school and the minimum age at which they can leave school. In this paper, we exploit the 1972 Raising of School Leaving Age (ROSLA) legislation, which increased the minimum school-leaving age from 15 to 16 years of age in England, Scotland, and Wales. These laws and their implementation have been extensively documented in other studies (see Clark and Royer 2010, 2013) so we only include a brief summary of its main features here.
The UK’s 1944 Education Act raised the minimum school-leaving age from 14 to 15 years of age in England, Wales, and Scotland and gave the Minister of Education the power to further raise it to 16 years when conditions allowed. The Minister did so in January 1972 for Scotland (Statutory Instrument No. 59)11 and in March 1972 for England and Wales (Statutory Instrument No. 444)12. Both changes took effect in September 1, 1972, implying that those who were 15 or younger before that date (born on September 1, 1957 or later) had to stay in school until at least age 16 in the three countries (hereafter, we use the term “stayed in school until age 16” to refer to those who stayed in school until at least age 16). Infrastructure investments, such as school building to absorb the additional students, preceded the 1972 ROSLA but key elements of the school system did not change with the policy.
Figure 1, which displays the fraction of study participants who stayed in school until age 16 (y-axis) by quarter of birth (x-axis), shows that the policy generated a discontinuous relationship between these two variables. There is a large jump at the September 1, 1957 cutoff marked by the vertical dashed line. Those born during the summer months could in practice drop out at age 15 even after the 1972 ROSLA, since the law required students to be 16 by the start of the next school year. We estimate that the policy increased the fraction of UK Biobank participants who stayed in school until age 16 by 14 percentage points – see Table 1.13
Figure 1. Fraction Staying in School until Age 16 by Quarter of Birth.
Notes: The figure shows the fraction of study participants who stayed in school until age 16 by quarter of birth. The dashed vertical line marks the first birth cohort affected by the 1972 school-leaving age reform. Cohorts born to the right of the line had to stay in school until age 16 while cohorts born before could leave at age 15. The curves show linear trends in quarter of birth that capture birth cohort trends. The circumference of each circle reflects the number of participants born in that quarter. N = 129,370.
Table 1.
Effects on Education
Left school at age ≥ 16 | No Qualifications | CSE | ||||
|
||||||
Post | 0.136 [0.004]*** | 0.136 [0.004]*** | −0.046 [0.003]*** | −0.046 [0.003]*** | 0.069 [0.005]*** | 0.069 [0.005]*** |
Additional Controls? | No | Yes | No | Yes | No | Yes |
Mean of Y | 0.826 | 0.113 | 0.206 | |||
|
||||||
O-level | A-level | College Degree | ||||
|
||||||
Post | 0.032 [0.006]*** | 0.033 [0.006]*** | 0.009 [0.006] | 0.009 [0.006] | −0.011 [0.006]* | −0.011 [0.006]* |
Additional Controls? | No | Yes | No | Yes | No | Yes |
Mean of Y | 0.513 | 0.325 | 0.368 |
Notes: The table shows the effects of the school reform on education. Each cell corresponds to a separate regression. We report the coefficient on the indicator variable for being born on or after September 1, 1957 (i.e., “Post”). The dependent variable mean in the bottom row is the weighted mean among those born in the 12 months before September 1, 1957. All regressions control for calendar month of birth. Additional controls include male, age in days and age squared dummies for ethnicity, and dummies for country of birth. Robust standard errors. N = 129,370 for “Stayed in school until 16” and N = 128,169 for all other outcomes.
*, **, and *** denote significance at the 10, 5 and 1 percent level.
Notice there is a cyclical drop in the fraction of students staying in school until age 16, corresponding to those born between June and August. This phenomenon is not specific to our data and has been noted by others. According to Clark and Royer (2013), “when the minimum leaving age became age 16, students had to stay until part way through grade 10. Grade 10 finishes with the “O level” exam period and, technically, students finish when they complete their last exam. Since the exam period starts in late May and finishes in mid-June, starting in 1972, students born in late June, July, and August could leave at 15, technically younger than the minimum leaving age (16)” (see page 2 of their online appendix). We include calendar month of birth dummies in our regressions to control for this seasonality.
B. Data
We use data from the UK Biobank, a large, population-based prospective study initiated by the UK National Health Service (NHS) (Sudlow et al. 2015). Between 2006 and 2010, invitations were mailed to 9.2 million people between the ages of 40 and 69 who were registered with the NHS and lived up to about 25 miles from one of 22 study assessment centers distributed throughout the UK (Allen et al. 2012) – see Online Appendix Figure D1.14 The sample is formed by 502,643 individuals who agreed to participate (i.e. a response rate of 5.46%). Although the sample is not nationally representative, our estimates have internal validity because there is no differential selection on the two sides of the September 1, 1957 cutoff – see Online Appendix C.15
Physical measures, such as anthropometrics, spirometry, and blood pressure, were collected of survey participants. The collection was standardized across centers and was conducted by trained nurses or healthcare practitioners. Every participant was genotyped.
In this paper, we focus on objective and continuous measures of health. Continuous measures because we are interested in studying how education affects the distribution of health. Objective measures because research shows that discrepancies between subjective (e.g. self-reported hypertension) and objective measures of health (e.g. objectively measured hypertension) vary with socioeconomic status (e.g. Johnston et al. 2009).
We restrict ourselves to three dimensions of health (that satisfy the two criteria above and) that can be arguably affected by education: body size, lung function, and blood pressure.16 Weight and body size can be affected by education through diet and physical exercise. Lung function can be affected by education through smoking and pollution. Blood pressure may be affected by education through diet, physical exercise, medication, and stress. Moreover, these health dimensions are risk factors for high-prevalence diseases such as diabetes (Chan et al. 1994), lung cancer (Tockman et al. 1987) and cardiovascular disease (Kannel 1996). The UK Biobank has multiple measures of each of these health dimensions.
Next, we describe how each of these health dimensions is measured in the data.
1. Body Size
We use three measures of body size: BMI, body fat percentage, and waist-hip ratio.17 A bioimpedance analyzer was used to calculate body fat percentage. This device passes a low electrical current through the body. Water conducts electricity. While fat contains very little water, muscle contains 70% water. The bioimpedance analyzer calculates body fat from the speed of the current: The slower the signal travels, the greater the fat content.
2. Lung Function
A spirometry test was conducted to measure participants’ lung function. The spirometer is a small machine attached to a mouthpiece by a cable that measures the volume and speed of air after a forced exhale. Participants were asked to fill their lungs as much as possible and to blow air out as hard and as fast as possible in the mouthpiece.18 Three parameters were measured: 1) forced expiratory volume in the first second is the amount of air exhaled during the first second; 2) forced vital capacity is the total amount of air exhaled during the forced breath; and 3) peak expiratory flow is the fastest rate of exhalation. These parameters are used to assess pulmonary conditions, such as chronic obstructive pulmonary disease and asthma. We follow DeMatteis et al. (2016)’s criteria to identify acceptable expiratory maneuvers in the UK Biobank data. Valid spirometry measures are available for 80% of our sample.19
3. Blood Pressure
Two measurements were taken of the diastolic and systolic blood pressures of each study participant. We use the average of these two measurements.
4. Summary Indices
For each health dimension, we construct a summary index that is a weighted average of the different outcomes measuring that dimension:
Body size: body mass index, waist-to-hip ratio, and body fat percentage;
Lung function: forced expiratory volume in the first second, forced vital capacity, and peak expiratory flow;
Blood pressure: diastolic and systolic blood pressures.
First, each measure is standardized separately by gender, using as a reference those born in the 12 months before September 1, 1957. We then follow the procedure proposed by Anderson (2008), weighting the measures by their variance-covariance matrix. The weights are calculated to maximize the amount of information captured in the index. Finally, we construct a fourth “summary index” that is a summary of the body size, the lung function, and the blood pressure indices, using the same weighting procedure. We construct all four indices so that a higher number corresponds to worse health.
These indices have two main advantages relative to traditional biomarker measures of health such as BMI and blood pressure. First, they reduce the number of outcomes and mitigate multiple hypotheses testing concerns. Second, they decrease noise by averaging out measurement error across the outcomes measuring the same health dimension, increasing the statistical power of our estimates. In addition, since only a very small fraction of our sample has low blood pressure (0.73%) or are underweight (0.6%), we can treat the relationships between our constructed indices and health as approximately monotonic over the range of the data. The main limitation is that these indices may not be as readily interpretable as more traditional measures such as BMI because they do not have clinical meanings.
The correlation between the body size and lung function indices is 0.18. The correlation between the body size and the blood pressure indices is 0.30. The correlation between the lung function and the blood pressure indices is 0.08. The correlations between the summary index and the body size, lung function, and blood pressure indices are respectively 0.68, 0.67, and 0.68.
II. Mean Effects
A. Effects of the Compulsory Schooling Change on Education
We use a regression discontinuity design (RDD) to estimate the “first stage”, i.e., the effect of the 1972 ROSLA on education. In particular, we estimate the following regression:
(1) |
where Educi is a measure of the educational attainment of individual i; Posti is 1 if individual i was born on or after September 1, 1957 (and 0 otherwise); DoBi is individual i’s date of birth; and the vector xi contains predetermined characteristics. Date of birth is measured in days relative to the cutoff, such that DoB=0 for someone born on September 1, 1957. The function f(⋅) captures birth cohort trends in educational attainment, which are allowed to differ on either side of the September 1, 1957 cutoff. The coefficient a1 gives the effect of the 1972 ROSLA on educational attainment.20
We restrict the data to 129,730 study participants born in England, Scotland, or Wales within five years of September 1957 – that is, born between September 1, 1952 and August 31, 1962 – and use a linear trend in date of birth to capture cohort trends (i.e., function f(⋅) in equation (1)).21,22 In Online Appendix E we show our main results are robust to the choice of bandwidth and to the use of quadratic trends.23 We use triangular kernel weights that give greater weight to study participants born closer to the cutoff. All regressions include dummies for calendar month of birth to control for seasonality. We also present results adding other set of predetermined characteristics as controls including gender, age in days (at the time of the baseline assessment) and age squared, dummies for ethnicity, dummies for country of birth.24
Notice that even though previous work studying the 1972 ROSLA clustered standard errors by month-year of birth (e.g., Clark and Royer 2013; Davies et al. 2018), we do not need to cluster our standard errors because our data include exact date of birth. As Card and Lee (2008) discuss, in applications where the running variable is only reported in coarse intervals (e.g., month-year of birth), researchers have to choose a particular functional form for the model relating the outcomes of interest to the running variable. The deviation between the expected value of the outcome and the predicted value from a given functional form is modeled as a random specification error, which is incorporated in inference by clustering the standard errors for different values of the running variable. This specification error is negligible in our context because our data include exact date (day-month-year) of birth.
Table 1 shows estimates of effects of the 1972 ROSLA on education. Each cell reports results from a separate ordinary least squares estimation of (1), where we vary the dependent variable (listed in the column) and whether the predetermined characteristics are included as controls (in addition to calendar month of birth dummies). The table shows the coefficient on the indicator variable for being born on or after September 1, 1957, a1, and the mean of the dependent variable among those born in the 12 months before September 1, 1957. Robust standard errors are reported between brackets.
We estimate that the 1972 ROSLA increased the fraction of study participants staying in school until age 16 by 14 percentage points, an estimate significant at the 1% significance level.25 Studies using nationally representative data, such as Clark and Royer (2013), estimate this figure to be closer to 25 percentage points. This difference is likely due to the composition of the UK Biobank sample, which is more educated than the overall population (despite the selectivity of the UK Biobank sample, our estimates have internal validity because there is no differential selection on the two sides of the September 1, 1957 cutoff – see Online Appendix C). One consequence is that the standard errors of our two stages least squares (2SLS) estimates will be ceteris paribus larger than of studies with nationally representative data, something that is compensated by the larger sample size of the UK Biobank.
One may worry that these students forced to stay in school an extra year did not learn much if they did not put effort into it. The evidence does not support this hypothesis. By the 70’s high schools offered a series of two-year courses that ran through grades 10 and 11 and required students to take exams at the end of grade 11 (the grade they are typically in by age 16): Certificate of Secondary Education (CSE) or a General Certificate of Education (GCE) Ordinary Level (also known as an O-level). By compelling students to stay in school until grade ten, the 1972 ROSLA gave students an incentive to complete these courses and get these qualifications, which are valued in the labor market (Dickson and Smith 2011).
Figure 2 shows that the policy generated a discontinuous increase in the fraction of study participants with these qualifications. In Table 1 we estimate that the policy increased the fraction of study participants with a CSE by 7 percentage points and the fraction with an O-level by 3 percentage points. The fraction without any formal qualification dropped by 5 percentage points. All of these reduced-form estimates are statistically significant at 1%. We find no effect of the policy on having an A-level or a college degree.26 Consistent with these results, we document that the policy increased household income27, especially at lower income levels – see Online Appendix Table F1.
Figure 2. Fraction with a CSE of O-level by Quarter of Birth.
Notes: The figure shows the fraction of study participants with a CSE or O-level qualification by quarter of birth. The dashed vertical line marks the introduction of the policy. Cohorts born to the right of the line had to stay in school until age 16 while cohorts born before could leave at age 15. The curves show quadratic polynomials in quarter of birth that capture birth cohort trends. The circumference of each circle reflects the number of participants born in that quarter. N = 128,169.
B. Effects on Average Health
We now turn to the effects of the 1972 ROSLA on average health. We are interested in the relationship between health and education:
(2) |
where Healthi is a health measure for individual i. Educ16i, an indicator for whether individual i stayed in school until age 16, is our endogenous measure of education. The function g(⋅) captures birth cohort trends in health and is allowed to differ on either side of the September 1, 1957 cutoff. We substitute (1) into (2) to get the “reduced-form” effect of the 1972 ROSLA on average health:
(3) |
The coefficient γ1 gives the effect of the school leaving-age reform on average health. The RDD identifying assumption is that, in the absence of the reform, our outcomes of interest would have been smooth across the September 1, 1957 threshold. This assumption is violated if determinants of health are discontinuous at the cutoff (Lee 2008). In Online Appendix C we partially test for such violations by investigating whether the average (or the cumulative distribution function) of predetermined characteristics, such as gender and place of birth, are discontinuous around September 1, 1957. Since the UK Biobank genotyped the full sample, we also test for the smoothness of a pair of genetic variables, which are determined at conception and are objectively measured.28 Our analyses indicate that these characteristics are smooth across the September 1 1957 threshold, which strengthens our confidence that the RDD results provide unbiased estimates of the causal effects of education on the health of UK Biobank participants.
We estimate the causal effect of staying in school until age 16 on average health, β1, through two stages least squares (2SLS), using the indicator for being born on or after September 1, 1957 (i.e., Posti) to instrument for staying in school until age 16 (i.e., Educ1616i) in equation (2). We adopt the same specifications used to estimate the effects on education (see section II.A), namely: 5-year bandwidths, linear trends to capture birth cohort trends, triangular kernel weights, and the same set of controls. Online Appendix E shows our results are robust to quadratic cohort trends and other bandwidths.
Figure 3 examines the effects of the 1972 compulsory schooling change on average health. The graphs show average health (y-axis) by quarter of birth (x-axis), where health is measured by the four health indices: the body size index (top left); the lung function index (top right); the blood pressure index (bottom left); and the summary index (bottom right).
Figure 3. – Effects on Average Health.
Notes: These figures show average health by quarter of birth. See Table 2 for number of observations.
Table 2 shows regression estimates of the effects of the 1972 compulsory schooling change on average health.29 The first rows show the coefficients on the indicator variable for being born on or after September 1, 1957, γ1 in equation (3), from reduced-form estimates. The third row shows the coefficients on staying in school until age 16 from 2SLS estimates, β1 in equation (2), where the indicator variable for being born on or after September 1, 1957 is used to instrument for staying in school until age 16. Again, the health indices were constructed such that higher values correspond to worse health.
Table 2.
Effects on Average Health
Body Size | Lung Function | Blood Pressure | Summary | |||||
---|---|---|---|---|---|---|---|---|
|
||||||||
Reduced-form | ||||||||
Post | −0.023 [0.012]* | −0.023 [0.012]* | −0.019 [0.014] | −0.020 [0.014] | 0.020 [0.012] | 0.020 [0.012]* | −0.014 [0.014] | −0.015 [0.014] |
Two stages least squares | ||||||||
Stayed in school until 16 | −0.170 [0.090]* | −0.170 [0.090]* | −0.155 [0.112] | −0.164 [0.112] | 0.148 [0.091] | 0.150 [0.091]* | −0.112 [0.112] | −0.118 [0.112] |
Additional Controls? | No | Yes | No | Yes | No | Yes | No | Yes |
N Observations | 127,366 | 127,366 | 103,682 | 103,682 | 129,165 | 129,165 | 102,378 | 102,378 |
Notes: The table shows the effects on average health. The first two rows show reduced-form effects of the 1972 Raising of the School Leaving Age. The last two rows show two stages least squares estimates of the effect of staying in school until age 16 obtained by using an indicator for being born on or after September 1, 1957 to instrument for staying in school until age 16. Robust standard errors. All regressions control for calendar month of birth. Additional controls include male, age in days and age squared, dummies for ethnicity, and dummies for country of birth.
*, **, and *** denote significance at the 10, 5 and 1 percent level.
Overall Figure 3 suggests education may lead to small average improvements in health, with minor discontinuous decreases in the body size, lung function, and summary indices at the cutoff. One noteworthy exception is blood pressure. There is a discontinuous increase in the blood pressure index at the cutoff, suggesting that education may worsen this particular dimension of health. Online Appendix Figures E1–E4 assess the sensitivity of Figure 3 to changes in the bandwidth and to using quadratic trends.
Table 2 shows that the effects on body size and blood pressure are statistically significant at the 10% significance level. The 2SLS point estimates imply that staying in school until age 16 decreases the body size, the lung function, and the summary indices respectively by 0.23, 0.19–0.20, and 0.14–0.15 of a standard deviation. At the same time, staying in school until age 16 increases the blood pressure index by 0.20 of a standard deviation.
The p-value of a test of the difference between the effects on the body size and the blood pressure indices is 0.003. The difference between the effects on the lung function and the blood pressure indices has a p-value of 0.089. The difference between the effects on the body size and the lung function indices has a p-value of 0.901. These results, notably the difference between blood pressure and body size, point to the importance of treating health as multi-dimensional and considering the effects of education on different dimensions separately. Focusing on the analysis of summary measures of health can lead to misleading conclusions of no health impact if effects going on opposite directions cancel out, as is the case in Table 2.
Our estimates lie within the 95% confidence intervals of Clark and Royer (2010) – see Online Appendix Table D1.30 Clark and Royer (2010) do not estimate the effects on systolic blood pressure or on lung function. As discussed above, even though we have a larger sample than Clark and Royer (2010), we have a smaller first stage, which explains why our standard errors are not substantially smaller than theirs.
When making this comparison, one should keep in mind that the UK Biobank is not a nationally representative sample. It recruited only individuals living within 25 miles of one of the 22 assessment centers, all of which were located in urban areas. Fry et al. (2017) also document that younger, males living in socioeconomically deprived areas and in particular regions of the UK were less likely to accept the invitation to join the study.
In Online Appendix G, we compare compliers in the UK Biobank to compliers in the Health Survey for England – a nationally representative sample – in terms of objectively-measured health outcomes, namely BMI, waist-hip ratio, and diastolic and systolic blood pressures.31 We find that compliers in the UK Biobank have lower BMIs and waist-hip ratios, but that they also have higher diastolic and systolic blood pressures.32 Thus, we do not find unequivocal evidence that the UK Biobank sample is healthier than the average population (among compliers). Moreover, while these differences in health are statistically significant, their magnitudes are small. Finally, for each health outcome, there is substantial overlap between the distribution in the UK Biobank sample and in the Health Survey for England sample.
We also used genetic data and a nationally representative sample, the English Longitudinal Survey of Ageing (ELSA), to investigate the selectivity of the UK Biobank sample. Genetic data may be useful in this regard because it is fixed at conception. We calculated the distribution of the polygenic score for educational attainment—an index created from millions of genetic markers—in the ELSA and the UK Biobank samples. Online Appendix figure G5 shows that, while there are differences, there is substantial overlap in the support of the two distributions. Moreover, despite concerns about sample selection, our results retain their internal validity—see Online Appendix C.
III. Methods for Distributional Effects Estimates
Even though the effects on average health are informative, they may conceal larger effects on particular parts of the health distribution with important policy implications. Moreover, the average effects documented here are suggestive at best, being only significant at the 10% level. As illustrated in Online Appendix A, if the effects of education on health are concentrated at particular parts of the health distribution, a distributional test may be better powered than a test of difference in means. In this section, we describe the methods we use to estimate how education affects the distribution of health.
In Section II.B, we estimated the effect of education on average health (of compliers) by investigating if there was a discontinuity in the relationship between average health and date of birth at the September 1, 1957 cutoff. Here we estimate the effect of education on the health distribution (of compliers) by investigating if, at the September 1, 1957 cutoff, there is a discontinuity in the relationship between the cumulative distribution function (CDF) of health and date of birth.33
The assumptions of our proposed method are similar to those of a traditional RDD (Imbens and Lemieux 2008). The key difference is that, while a traditional RDD requires that the expected values of the potential outcomes are continuous in the running variable at the threshold, our method requires that the distributions of potential outcomes are continuous in the running variable at that point (Shen and Zheng 2015). While we cannot test this assumption formally, we find no significant discontinuity in the distributions of several predetermined variables, strengthening our confidence that this assumption holds—see Online Appendix C.
The rest of this section is structured as follows. In Section III.A, we define the reduced-form distributional effects of the 1972 ROSLA. The local distributional treatment effect is defined in Section III.B. Section III.C explains the approach we use to graphically display the estimates of the local distributional treatment effects. In these sections, we present analyses for BMI to illustrate our methods. Section III.D presents the hypothesis tests used for inference. Finally, Section III.E discusses some particularities of our approach.
A. The Reduced-Form Distributional Effect of the Compulsory Schooling Change
We begin by defining Pr(Health≤h|DoB as the CDF of health for individuals born on a specific date DoB. The CDF is a function of h, giving the fraction of those born on DoB with a health outcome smaller or equal to h. For example, if the health outcome is BMI, then some h values of particular interest are 18.5, 25, 30, and 40 because these correspond to standard clinical thresholds. To illustrate, in our data the fraction of individuals born on August 31, 1957 (i.e., DoB=−1) with a BMI below these clinical values are respectively 3%, 53%, 78%, and 100%. The CDF may differ for different dates of birth. For example, the fractions of those born on September 1, 1952 (i.e., DoB=−1826) with a BMI below 18.5, 25, 30, and 40 are respectively 0%, 22%, 69%, and 96%.
We want to investigate whether Pr(Health≤h|DoB changes discontinuously at September 1, 1957. For this purpose, it is useful to define the pre-reform and post-reform CDFs. Let the pre-reform CDF be the CDF in the limit when date of birth is converging to September 1, 1957 from the left (i.e., for those born right before September 1, 1957):
Similarly, the post-reform CDF is the CDF in the limit when date of birth is converging to September 1, 1957 from the right (i.e., for those born right after September 1, 1957):
The reduced-form distributional effect of the 1972 compulsory schooling change ϕ(⋅) is the difference between the post-reform CDF Fpostx(⋅) and the pre-reform CDF Fpre(⋅):
(4) |
which corresponds to the discontinuous change at September 1, 1957 in the fraction of the population with a health outcome less than or equal to h. To make it concrete, if the health outcome is BMI and we are interested in the effect of the ROSLA at h=30 (which is the clinically defined threshold for obesity), ϕ(30) is the effect of the compulsory schooling change on obesity rates.
We estimated the reduced-form average effect of the policy by estimating equation (3), which is reproduced here for convenience:
Similarly, the reduced-form distributional effect ϕ(⋅) is estimated by running:
(5) |
where I(Healthi≤h) is 1 if individual i’s health outcome is smaller or equal to h.
There are two key differences between (3) and (5). The first one concerns the dependent variable. In (3), the dependent variable may be for example BMI while in (5) it is an indicator for whether individual i’s BMI is smaller or equal to h. The second difference is that, while (3) is one equation, (5) is a continuum of equations for each value of h.
The coefficient γ1(h) estimates the discontinuity in Pr(Healthi≤h) at DoBi and thus provides an estimate of the reduced-form distributional effect ϕ(h). To illustrate, Figure 4 shows the usual RD graph for the fraction with a BMI below 30.34 The jump at 0 is the estimate of ϕ(30). The observed increase in the fraction of individuals with a BMI less than 30 corresponds to a reduction in obesity rates of about 1 percentage point.
Figure 4. Fraction with a BMI Below 30.
Notes: The figure shows the fraction of study participants with a BMI below 30 by quarter of birth. The dashed vertical line marks the introduction of the policy. Cohorts born to the right of the line had to stay in school until age 16 while cohorts born before could leave at age 15. The curves show linear trends in quarter of birth that capture birth cohort trends. The circumference of each circle reflects the number of participants born in that quarter. N = 128,856.
We discuss in Section III.D how we use the estimates of ϕ(⋅) to construct our hypothesis tests.
B. The Local Distributional Treatment Effect
We are interested in how staying in school until age 16 affected the CDF of those affected by the policy, that is, the CDF of “compliers” who stayed in school until age 16 because of the compulsory schooling change but who would otherwise have dropped out at 15. This local distributional treatment effect (LDTE) is given by:
(6) |
where FPost(⋅|C) and FPre(⋅|C) are the post-reform and the pre-reform CDFs for compliers (which are denoted by C). Edu16 is an indicator for staying in school until age 16. Note that the numerator of eq. 6 is equal to the reduced-form distributional effect ϕ(h).
The reader will notice the similarity between equation (6) and a local average treatment effect (LATE) estimator. As with LATE, we can estimate the LDTE through two-stage least squares (2SLS):
(7) |
using the indicator for being born on or after September 1, 1957 (i.e., Posti) to instrument for staying in school until age 16 (i.e., Educ16i). The coefficient θ1(h) is an estimate of the LDTE μ(h). It estimates the effect of staying in school until age 16 on the probability of having a health outcome smaller or equal to h.
If we estimate (7) for a BMI below 30, for example, we get that staying in school until age 16 reduced obesity rates by 8 percentage points (i.e., (30) = 0.08). We can do the same for other values of h. For example, staying in school reduces the probability of having a BMI smaller than 25 by approximately 0.3 percentage points.
C. Visualizing the Local Distributional Treatment Effect
We illustrate the LDTE in a single graph by plotting the pre- and the post-reform CDFs FPost(⋅|C) and FPre(⋅|C) for compliers. To construct the post-reform CDF, we rewrite equation (6):
That is, for any given h, the vertical distance between the pre- and post-reform CDFs is μ(h).
The pre-reform CDF FPost(h|C) and FPre(⋅|C) is defined as . To estimate it, we restrict the sample to respondents born before September 1, 1957 and who left school at age 15 or younger and estimate equation (8) via ordinary least squares (OLS):
(8) |
where the function k(⋅;h) captures pre-reform birth cohort trends in health.35 The sample used in this regression is a good approximation of the set of compliers because there are very few never-takers in our sample (i.e., individuals who drop out at age 15 irrespective of the policy). The coefficient on the constant, δ0(h), estimates Fpre(h|C). It is an estimate of the fraction of compliers born on September 1, 1957 who would have had a health outcome smaller or equal to h in the absence of the policy. We estimate, for example, that 0.68, such that 0.68 0.08 0.76.
To illustrate, Figure 5 shows the distributional treatment effect of staying in school until age 16 on BMI. The dashed black line is the pre-reform CDF. The solid red line is the post-reform CDF. The vertical distance between the post- and pre-reform CDFS for h= 30 is 0.08. Theoretically, we could estimate FPre(⋅|C) and FPost(⋅|C) for all values of . In practice, we approximate this by discretizing the support of the distribution of the health outcome and plotting and for each grid point.36
Figure 5. Distributional Effects on Body Mass Index.
Notes: The figure shows the pre- and post-reform CDFs of body mass index for compliers. See section III.C for an explanation of how the figure is constructed. N = 128,856
We emphasize that the approach described in this section is used only to graphically display our results. The hypothesis tests, which are discussed next and represent the main results of this paper, depend only on the reduced-form distributional effects discussed in Section III.A.
D. Inference
Inference based on the standard errors generated by 2SLS estimates of (7) or by OLS estimates of (5) is problematic because it leads to an infinite number of highly correlated statistical tests, raising concerns about multiple hypothesis testing. We, therefore, use a single distributional test based on Shen and Zhang (2016) to formally investigate whether education changes the distribution of health. Our test compares the pre- and post-reform CDFs FPre(⋅) and FPost(⋅) of the whole population. It is based on the reduced-form specification (5).
The basis of our test is that—under the null hypothesis of no effect on the health distribution—the function of estimates where h(τ) is the value corresponding to the th quantile of Healthi, is a Brownian bridge (Shen and Zhang 2016).37 We perform an Anderson-Darling test (Anderson and Darling 1952) using the following weighted integral as our test statistic:
(9) |
This integral is approximated numerically using rectangular integration by evaluating the integrand at each percentile.38 We chose the Anderson-Darling test because it is uniformly powered for the whole range τ∈[0,1] (Stephens 1974). In contrast, the Kolmogorov-Smirnov test used by Shen and Zhang (2016) is better powered to detect deviations of the distribution near the median. To test for differences in the bottom half of the health distribution, we use a modified version of (9), integrating only from zero to 0.5. Similarly, we test differences in the top half by integrating from 0.5 to 1.
The p-values for the test are calculated by simulation. Specifically, we generate an independent, standard normally distributed outcome for each individual (such that there is no discontinuous change in distribution at the discontinuity), and evaluate T (or the upper and lower distribution analogue) for this simulated variable. By Shen and Zhang (2016), this is equivalent to drawing from the test statistic distribution under the null. This is repeated 10,000 times. As the p-value, we report the fraction of times our simulated values of T are greater than our estimated value of T.
E. Discussion
The CDF approach described above is closely related to a quantile IV approach. The CDF approach is based on the vertical distance between the pre- and post-reform CDFs whereas a quantile approach is based on the horizontal distance between these two CDFs. Therefore, either approach would lead us to the same substantive conclusions. We opted to present the CDF approach because it is the framework used by Shen and Zhang (2016), whose results we use in our distributional tests. Nevertheless, when we present our results, we also discuss the effects on some particular quantiles of interest.
It is important to note that our method only recovers differences in the health distribution of compliers and it does not rely on rank preservation assumptions. Stronger assumptions would be needed to recover the distribution of treatment effects, but this is beyond the scope of our paper. Although we do not estimate the distribution of treatment effects, we can examine whether our estimates of the effect on the health distribution are consistent with homogeneous treatment effects. We show in Online Appendix B that, if treatment effects were homogeneous, our estimates of distributional effects would behave predictably. Namely, we would see the largest distributional effects where the largest mass of compliers is: around the mode of the distribution of compliers. Therefore, any other pattern of results is indirect evidence of heterogeneous treatment effects.
Importantly, our two-stage least squares estimates of distributional effects are not likely to be biased by heterogeneity in the first stage. There are two potential forms of first-stage heterogeneity for distributional effects: extensive- and intensive-margin heterogeneity. Extensive-margin heterogeneity means that the fraction of individuals staying in school for longer as a result of the 1972 ROSLA may vary depending on where in the health distribution these individuals would have been absent the reform. For example, unhealthy individuals may be more likely to be drop out of school at age 15 before the reform relative to their healthy peers and therefore may be more likely to be “compliers” in this setting. Intensive-margin heterogeneity means that, among compliers, one’s additional schooling associated with the ROSLA may vary depending on where in the health distribution s/he would have been in the absence of the reform. For example, healthy compliers may be more likely to stay in school for greater than one year due to the ROSLA despite only being compelled to stay for one.
Under the assumptions of the regression discontinuity design, extensive-margin heterogeneity does not affect on our two-stage least squares estimates (see Online Appendix H). The main intuition behind the proof is that two-stage least squares estimates of distributional effects recover differences in the health distribution of only compliers, who, by definition, were all affected by the policy. However, if there is heterogeneity on the intensive margin, the estimates of the distributional effects will overweigh the treatment effects of those who stayed in school longer. As a result, the largest distributional effects would not be observed at the mode of the distribution of compliers; instead, the estimates would be biased toward the mode of distribution of those who increased their schooling the most as a result of the ROSLA.
In practice, it is unlikely that heterogeneity on the intensive margin could explain our results. The main effect of the ROSLA was to keep in school until age 16 students who would have dropped out by age 15 otherwise; the increase in the fraction of students staying in school until at least age 17 as a result of the ROSLA is small and not statistically significant: about 0.5 percentage points (compared to 14 percentage points at age 16, see Online Appendix I). Clark and Royer (2013) reach a similar conclusion (see pg. 2102). These suggest little scope for first-stage, intensive-margin heterogeneity in our context.
IV. Distributional Effects of Education on Health
Figure 6 shows the distributional treatment effects of education on body size. It shows the pre- and post-reform CDFs of the body size index for compliers. As explained in Section III, the pre-reform CDF is obtained by estimating (8) for each grid point h and then plotting against h.39 The discontinuity in the CDF, μ(h), is obtained by estimating (7) through 2SLS for each grid point h. The post-reform CDF at a given grid point h is obtained by adding “vertically” to .
Figure 6. Distributional Effects on Body Size Index.
Notes: The figure shows the pre- and post-reform CDFs of the body size index for compliers. See section III.C for an explanation of how the figure is constructed. N = 127,366.
Figure 6 shows that education reduces body size: The post-reform CDF is shifted to the left relative to the pre-reform CDF. Importantly, the shift is not parallel; the gains are concentrated at the top of the distribution, among the least healthy. This result is consistent with a Grossman-type model with diminishing marginal utility of health (Kippersluis and Galama 2014). Staying in school until age 16 increases the fraction of study participants with a body size index smaller than 1 standard deviation above the mean from 77.8% to 85.3%. Similarly, the 90th percentile of the body size distribution decreases from 1.56 to 1.13 standard deviations. This effect is 2.5 times the average treatment effect (on the treated) of −0.17 standard deviations estimated in Table 2.
While Figure 6 has the advantage of graphically displaying the distributional treatment effects in just one figure, it lacks the transparency of traditional RDD graphs. In Online Appendix Figure J1, we present traditional RDD graphs for the 10th, 25th, 50th, 75th, 90th, and 95th percentiles of the distribution of the body size index (of those born between September 1, 1956 and August 31, 1957). Online Appendix Figures J2 and J3 present corresponding graphs for the lung function and blood pressure indices.
Figure 7 shows that education also improves lung function: The post-reform CDF is shifted to the left relative to the pre-reform CDF. Staying in school until age 16 increases the fraction of study participants with a lung function index smaller than 1 standard deviation above the mean from 78.1% to 85.5%. Similarly, the 90th percentile of the lung function distribution decreases from 1.47 to 1.21 standard deviations. This effect is 1.6 times the average treatment effect (on the treated) of −0.16 standard deviations estimated in Table 2.
Figure 7. Distributional Effects on Lung Function Index.
Notes: The figure shows the pre- and post-reform CDFs of the lung function index for compliers. See section III.C for an explanation of how the figure is constructed. N = 103,682.
While Figures 6 and 7 show that education improves body size and lung function, Figure 8 shows that education worsens one dimension of health: it increases blood pressure.40 The post-reform CDF lies to the right of the pre-reform CDF. Staying in school until age 16 decreases the fraction of study participants with a blood pressure index smaller than 0 from 49% to 40.1%. Similarly, the 30th percentile of the blood pressure index distribution increases from −0.49 to −0.16 standard deviations. This effect is 2.2 times the average treatment effect (on the treated) of 0.15 standard deviations estimated in Table 2. This result is particularly striking because blood pressure can be controlled through medication, diet, and exercise (Chobanian et al. 2003), and there is a positive association between education and these healthy behaviors (Park and Kang 2008; Conti, Heckman, and Urzua 2010; Cutler and Lleras-Muney 2010).
Figure 8. Distributional Effects on Blood Pressure Index.
Notes: The figure shows the pre- and post-reform CDFs of the blood pressure index for compliers. See section III.C for an explanation of how the figure is constructed. N = 129,165.
A comparison of Figures 6 and 8 shows that not only the effect on the body size and blood pressure indices have different signs, but the effects also happen in different parts of the respective distributions. While the effects on body size occur in the upper part of the body size distribution, the effects on blood pressure occur in the lower part of the blood pressure distribution among the healthiest. Online Appendix Figures E5–E10 assess the sensitivity of Figures 6–8 to changing the bandwidth, using linear trends, and the inclusion of controls.
To test whether these shifts in our health indices CDFs are significant and where they are concentrated, we use distributional tests as described in Section III above. The first row in Table 3 shows p-values of tests of the equality of the pre- and post-reform CDFs. The middle and bottom rows show p-values of tests of the equality of the bottom half (i.e., the healthiest) and the top half (i.e., the least healthy) of pre- and post-reform CDFs.
Table 3.
P-values of Distributional Tests
Body Size | Lung Function | Blood Pressure | |
---|---|---|---|
|
|||
Full Distribution | 0.0674 | 0.2972 | 0.0592 |
Bottom Half | 0.8985 | 0.4815 | 0.0282 |
Top Half | 0.0132 | 0.1880 | 0.2244 |
Notes: The table shows the p-values of tests of the equality of the full distribution, the bottom and top halves of the pre- and post-reform CDFs.
We can reject the null for the top half of the body size distribution and for the bottom half of the blood pressure distribution (at the 5% significance level). Online Appendix Tables E1–E3 assess the sensitivity of these results to changing the bandwidth, using quadratic trends, and including controls. While the p-values change across specifications, the main patterns remain: in most cases, we can reject the null of equality for the top half of the body size distribution and for the bottom half of the blood pressure distribution.
Figures 5 and 9 shed light on these findings by plotting results for measures with clinical thresholds. Figure 5 shows the pre- and post-reform CDFs of body mass index (for compliers). Figure 9 shows the pre- and post-reform CDFs of diastolic blood pressure (for compliers).41
Figure 9. Distributional Effects on Diastolic Blood Pressure.
Notes: The figure shows the pre- and post-reform CDFs of diastolic blood pressure for compliers. See section III.C for an explanation of how the figure is constructed. N = 129,165.
Figure 5 shows that the reductions in BMI caused by more education occur where they matter the most: Staying in school until age 16 reduces obesity rates (i.e., the fraction of study participants with a BMI below 30) by 8 percentage points.42 These effects are especially important because the participants were in their 50s and being obese in middle-age is associated with a number of adverse health conditions and with lower quality of life in old age (Daviglus et al. 2003). In contrast, Figure 9 shows that the increase in blood pressure does not affect the prevalence of stage 2 hypertension (classified as having a diastolic blood pressure above 90 mmHg). Staying in school until age 16 may have increased the probability of stage 1 hypertension (defined as having diastolic blood pressure between 80 mmHg and 90 mmHg; known as “prehypertension” before the 2017 redefinition) but this change is not statistically significant.43
These results illustrate the importance of studying distributional effects. While the average treatment effects show improvements in body size and deterioration in blood pressure, the distributional effects reveal in which part of the distributions these changes occur. The deterioration of blood pressure occurs with no observed consequences for the prevalence of stage 2 hypertension. In contrast, the improvements in body size are concentrated at the right tail, with a large reduction in obesity rates. These effects offer important information about trade-offs policy makers may wish to make when considering the health consequences of educational policies: while education may improve some dimensions of health while hurting others, it is important to consider where in the distribution these effects are occurring.
Our findings are not consistent with homogeneous treatment effects of education on health. The main effect of the schooling reform was to induce students who would have dropped out at age 15 (i.e., the compliers) to stay in school an additional year. Homogeneous treatment effects do not imply a constant for all h in equation (7), or equivalently, a parallel vertical shift in Figures 5–9. Rather, if this additional year of schooling affected the health of all compliers equally, we would see effects across the whole support of the complier distribution, with the largest effect at the mode of the pre-reform distribution. This is because, when more people are near some threshold h0 absent the reform, more people will cross the h0 threshold after the reform is in place. Online Appendix B provides a formal proof.
The results show a different pattern. Figures 5–9 indicate that our effects are limited to some parts of the distributions of health of compliers. Indeed, the statistical tests in Table 3 cannot reject a null effect for the bottom half of the body size and lung function distributions and the top half of the blood pressure distribution. Importantly, the effects we do find are not concentrated around the modes of these distributions. For body size, they are concentrated around the 83rd percentile of the complier distribution (69th percentile for lung function and 37th for blood pressure). These patterns imply heterogeneous effects of education on health.
V. Channels
One of the channels through which education may affect health is health behaviors. Correlational evidence shows that the more educated are more likely to use preventive care, that they manage chronic conditions more effectively, and that they are less likely to smoke and drink heavily (Cutler and Lleras-Muney 2008; Goldman and Smith 2002).44
Taking advantage of the richness of the UK Biobank data, we investigate whether education has a causal effect on three types of health behaviors: diet, smoking, and physical activity. Diet was measured using a 24-h dietary assessment tool self-completed through the Internet (Galante et al. 2016).45 Accelerometers worn for 7 days were used to measure physical activity.46 Smoking was self-reported.
Online Appendix Table F3 shows the effects on diet. Staying in school until age 16 reduces the intake of fat and saturated fat (as a fraction of total energy intake). There are, however, no effects on total caloric intake, sugars, or carbohydrates. Online Appendix Table F4 shows no effects on the measures of smoking and physical activity we have available.47
This analysis suggests that improved diet is an important channel through which education reduces body size. Those who stayed in school until age 16 had better diets in middle age – about 10% lower in fat and 15% lower in saturated fat. Even if the energy content of one’s diet is held constant, changes in diet composition can affect body weight (Hall et al. 2012).
The pathways are less clear for the harmful effect of education on blood pressure. We find no effects on a (self-reported) measure of hypertension diagnosis48 and on (current) blood pressure medication (Online Appendix Table F4). One alternative hypothesis is that, by changing the types of occupations and careers individuals have, education might have an effect on job responsibilities, expectations, and work-related stress with negative implications for blood pressure.49 In the U.S. context, for example, academically successful African Americans have higher biomarkers related to cardiometabolic risk (i.e. blood pressure and stress hormones) than other groups (Brody et al. 2013, Miller et al. 2015, Chen et al. 2015). This is potentially driven by stressors related to upward mobility, which could also be playing a role in the U.K context. We have no credible data to test this hypothesis in the UK Biobank so we leave it for future work.
VI. Conclusions
In this paper, we investigate how education affects the distribution of health along three dimensions: body size, lung function, and blood pressure. The distributional analysis shows that education affects the health of some subpopulations. We find, for example, that the school-leaving age reform shifted part of the body size distribution to the left—i.e., it caused reductions in body size. Importantly, our results do not rely on rank preservation assumptions and or on extensive-margin homogeneity in the effects of the ROSLA on education (i.e., the first stage).
Because the effects of the reform are concentrated at the upper tail of the distribution rather than at the mode, this is evidence that the effect of education on body size is heterogeneous. Indeed, in related work we show that the effect varies with one’s genetic predisposition to obesity: the reductions in body size were larger for individuals at greater genetic risk of obesity (Barcellos, Carvalho, and Turley 2018). These findings support Galama et al. (2018)’s hypothesis that heterogeneity in the effects of education on health explain why some studies find that more education causes better health while others find no effect.
We conduct a back-of-envelope calculation to illustrate the importance of accounting for the heterogeneity in the effects of education when evaluating its estimated impacts (Heckman et al. 2018). Take, for example, the relationship between BMI and mortality. It is U-shaped (Fogel 1994; Aune et al. 2016), suggesting that the reduction in mortality caused by a given reduction in BMI is larger for someone obese than for someone with normal weight. A reduction in average BMI of 0.4 caused by staying in school until age 16 (i.e., the average treatment effect on the treated) implies a reduction in the relative risk of death of 0.87%—see Online Appendix K for details. In contrast, the local distributional treatment effects yield a much larger reduction of 3.28%. That is because of the U-shaped relationship between BMI and mortality and because the effects on BMI were concentrated at the upper tail of the BMI distribution.
The effects of education on health may vary not only across people but also across different health dimensions. Besides reducing body size, the school-leaving age reform increased blood pressure. Education seems to have reduced body size through improvements in SES and diet, but it is not clear the channels through which education increased blood pressure. Interestingly, the improvement in body size was concentrated at the top half of the body size distribution (among the least healthy) while the worsening in blood pressure was concentrated at the bottom half of the blood pressure distribution (among the most healthy).
These results underscore the policy relevance of studying distributional effects. Even though the average treatment effects show a reduction in body size and an increase in blood pressure of similar magnitudes (0.15–0.17 of a standard deviation), a policy maker may wish to trade-off these effects based on which parts of the respective distributions are affected. Because the reductions in BMI occurred at the upper tail of the BMI distribution, staying in school until age 16 reduced obesity rates (i.e., BMI above 30) by 8 percentage points. In contrast, the increase in blood pressure was concentrated below the clinical threshold for stage 2 hypertension (i.e., diastolic blood pressure above 90 mmHg or systolic blood pressure above 140) with no statistically significant consequences for prevalence.
Our results offer evidence on the causal distributional health effects of one extra year of compulsory schooling at age 15 in a specific context. Therefore, it will be important to investigate whether our findings generalize to other contexts and levels of education. For example, it is not clear that increases in schooling due to other types of educational policies, such as policies that lower the cost of college attendance, would have similar effects. Moreover, our paper uses a sample which is not nationally-representative, which might threaten the external validity of our findings. Future research investigating distributional effects of education on health in other national, historic and policy contexts, preferably using nationally-representative samples, would increase our understanding of the role that education plays in population health.
Supplementary Material
Acknowledgments
The authors declare that they have no relevant or material financial interests that relate to the research described in this paper. Research reported in this publication was supported through the Roybal Center for Health Decision Making and Financial Independence in Old Age (P30AG024962-13S1 and P30AG024962-13S2), NIA grant K01AG050811-01 (Barcellos), NIA grant 1K01AG066999-01A1 (Carvalho), RF1AG055654 (Carvalho), R56AG058726 (Carvalho), R21AG060447 (Barcellos) and by the USC Population Research Center. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This research has been conducted using the UK Biobank Resource under Application Number 15666. See http://www.ukbiobank.ac.uk/register-apply/ for how to gain access to the data. Additional replication materials are available at https://doi.org/10.7910/DVN/IJMVBL. IRB approval was not obtained as there were no human subjects involved. We benefited from discussions with James Banks, Dan Benjamin, Damon Clark, Maria Fitzpatrick, Dana Goldman, Mireille Jacobson, Arie Kapteyn, Adriana Lleras-Muney, Neil Mahoney, Michael Mechine, Heather Royer, Jon Skinner, Jim P. Smith, and from the feedback of seminar participants at a number of universities, institutes and conferences. Jiewen Luo, Andrew Yu, Joao Vilela, and Sean Lee provided excellent research assistance. The online appendix can be found at http://jhr.uwpress.org/.
Footnotes
Consistent with this hypothesis, in related work we show that additional schooling due to compulsory schooling laws in the UK had larger effects on the BMI of those with greater genetic risk of obesity (Barcellos et al. 2018).
As explained below, body size is an index that includes BMI, body fat percentage, and waist-hip ratio.
In our sample, there are only 939 participants (0.73% of sample) with low blood pressure and 769 (0.6% of sample) who are underweight. Therefore, in this sample, it is safe to assume that a reduction in body size is a health improvement and an increase in blood pressure is a health deterioration.
We do not have the power to test whether some people experienced both a reduction in body size and an increase in blood pressure. However, given that there is a moderate positive correlation between these two outcomes (ρ=0.3), it is unlikely (though possible) that those who experienced reductions in body size were the same individuals experiencing increases in blood pressure.
Studying a similar compulsory schooling reform in France, Courtin et al. (2019) also finds that the additional education led to increased blood pressure among those from low parental social class. The blood pressure finding is also consistent with the literature on birth-order effects that show that while first-borns have higher educational attainment, earnings and better self-reported health, they are more likely to have high blood pressure than later-borns (Black, Devereux and Salvanes 2016).
Although the point estimates suggest an improvement in lung function, they are not statistically significant.
Clark and Royer also analyzed the effects of education on particular points of the distribution of BMI and blood pressure, but these clinical cutoffs are arguably arbitrary (Komaroff 2016) and the approach may overlook changes in other parts of the distribution.
Davies et al. (2018) note the probability of mortality in the UK Biobank is more than 4 times higher. The causes of death also differ.
This argument would not apply to their 1947 ROSLA results, but we also do not know whether the 1947 ROSLA had the same effects on health as we estimate for the 1972 ROSLA.
Davies et al. 2018 also note that Clark and Royer could not exclude immigrants (who were not affected by the reform), possibly leading to attenuated estimates.
The effect on staying in school until at least age 17 is an order of magnitude smaller: about half a percentage point. In Clark and Royer’s words (AER 2013): “the 1972 change had small, at best, effects on the fractions completing 11 or fewer years…To a first approximation therefore, one can view these law changes as forcing students that would previously have left at the earliest opportunity to stay in school for one more year” (pg. 2102).
The NHS has contact details for an estimated 98% of the UK population.
When the predetermined variable is continuous, we test whether the pre- and post-reforms distributions are different. When the predetermined variable is dichotomous, we conduct a test of difference in means.
The other objective and continuous measures of health currently available in the UK Biobank are either available for just a subsample (e.g., arterial stiffness, bone densitometry of heel, ECG) or there is no clear hypothesis on how they could be affected by education (e.g., hand grip strength).
The UK Biobank provides two measures of BMI: one calculated as weight in kilograms divided by height squared (in meters) and one using height and electrical impedance to quantify mass. We take the average of these two measures. We can get very similar results if we use exclusively the first measure of BMI (i.e., the weight in kilograms divided by height in meters squared). The waist-hip ratio is equal to the waist circumference divided by the hip circumference.
They were instructed to continue blowing until no more air came out of their lungs. Up to three attempts were allowed. The participant was allowed a third attempt if the first two blows did not satisfy the reproducibility criteria of the spirometry protocol.
Participants born before and after September 1957 are equally likely to have valid spirometry measures.
The inclusion of predetermined controls in equation (1) is not needed for identification but can improve the precision of estimates.
We opt to use 5-year bandwidths because, to our knowledge, there are no existing methods to estimate the optimal bandwidth for distributional RDD.
We started with 502,643 UKB participants. 373,273 of them were dropped because of the following reasons: date of birth was not available (N = 14); were not born in England, Scotland, or Wales (N = 49,773); were born before September 1, 1952 or after August 31, 1962 (N = 321,407); and did not report school-leaving age (N = 2,079). We are left with 129,370 observations. Out health outcomes are available for the following number of observations: 127,366 (body size); 103,682 (lung function); and 129,165 (blood pressure).
Gelman and Imbens (2019) caution against the use of higher order polynomials (higher than 2) in RDD.
Because participants were surveyed for the baseline assessment between 2006 and 2010, date of birth and age are not perfectly collinear.
Estimates of the effect of the 1972 school-leaving age reform on staying in school until age 17 or later are an order of magnitude smaller than the effect on staying in school till age 16. See Online Appendix Table I1.
The effects on college degree in Table 1 are marginally significant, but these estimates are not robust to the choice of bandwidth and polynomial specification.
These results are broadly consistent with Grenet (2013) that finds that the extra year of schooling induced by the 1972 ROSLA increased wages.
See Online Online Appendix of Barcellos, Carvalho and Turley (2018) for details about the construction of genetic variables.
Notice that while Figure 3 uses quarter of birth Table 2 uses day of birth (e.g., September 1, 1957). The same distinction applies to Figure 1 and Table 1.
In contrast, our results lie outside the confidence intervals of Davies et al. (2017), which study the same reform and data (UK Biobank). We believe this is due to differences in the weighting procedure they use, their specification (bandwidth and polynomial choice) and sample selection (they do not include Wales and Scotland).
We approximate compliers as those born before September 1957 who dropped out of school at age 15 or younger. In order to make it comparable to the Health Survey for England, the UK Biobank sample is restricted to respondents living in England. We focus on objectively-measured health outcomes to avoid concerns that differences in self-reported measures may be partly due to differences in question wording.
Fry et al. (2017) also find that the UK Biobank sample has lower BMI than the Health Survey for England sample. Notice, however, that they are not comparing compliers. They also do not compare the two samples in terms of blood pressure.
The RDD identifies differences in the marginal distributions of cohorts affected and unaffected by the reform. Stronger assumptions (such as rank preservation) would be needed to estimate the distribution of treatment effects.
There is a corresponding graph for any value h (and for any health index). In Online Appendix Figures J1–J3, we show corresponding graphs for the 10th, 25th, 50th, 75th, 90th, and 95th percentiles of each health index.
In our main specification with linear trends: k(DoBi;h)=η(h)DoBi. These terms take into account any secular trends in our outcomes.
The grid consists of 21 points uniformly distributed between h1 and h21.
More precisely, the difference in the empirical CDFs estimated in this way is a standard Brownian bridge times a scalar. See Shen and Zhang (2016) for details on calculating the scalar which allows us to transform the difference into a standard Brownian bridge.
where {τj} is a set of discrete points in 0.01 unit increments. When testing the full distribution we sum from 0.01 to 0.99, inclusive. For the lower or upper portion of the distribution, we sum from 0.01 to 0.50 or 0.50 to 0.99 inclusive, respectively.
Compliers are less healthy than always takers but not dramatically so. Online Appendix Figures D2, D3, and D4 compare the pre-reform CDFs for compliers and the whole UK Biobank sample (both estimated using equation (8)) for our 3 indices.
The fraction of people with low blood pressure in our sample is negligible; in contrast, 30% are hypertensive (see Figure 9). Therefore we interpret an increase in blood pressure as a worsening of health.
Results for systolic blood pressure, omitted due to space constraints, are similar.
This reduction in BMI at the top of the distribution is consistent with Janke et al. (2020) that show that the 1972 ROSLA reduced diabetes.
Despite no change in stage 2 hypertension, the increase in the fraction of people in the stage 1 (formerly pre-high) range might still mean a worsening of health, as observational studies indicate that death from both ischemic heart disease (IHD) and stroke increases progressively and linearly from levels as low as 75mmHg DBP (Lewington et al. 2002). In addition, longitudinal data have indicated that DBP between 85 and 89 mmHg are associated with a more than twofold increase in relative risk from cardiovascular disease as compared to those with DBP below 80mmHg (Vasan et al. 2001). However, stage 2 hypertension is more serious than stage 1: while medications are the main treatment for stage 2 hypertension, for stage 1 the focus is on lifestyle changes (Bakris and Sorrentino 2018).
There are also other possible channels through which education might affect health, such as increasing forward-looking behavior or the ability to process health-related information. Recently, Delaney and Devereux (2019) documented that the 1972 ROSLA reduced income variability over the life-cycle, which could have beneficial health implications. The UK Bionbank data do not permit investigating these alternative channels.
The Oxford WebQ collects information on the quantities of all foods and beverages consumed over the previous day. Respondents are asked whether they consumed any of 21 food groups over the previous day. A positive response results in the screen expanding to reveal a list of commonly consumed foods in the corresponding category. Respondents then select the amount of each food consumed using standard categories to indicate the amount consumed. Energy and nutrient values are generated by multiplying the quantity of each food or drink consumed by its nutrient composition. The Oxford WebQ was included at the assessment visit of the baseline measures for the last 70,724 participants and administered over the Internet to all UK Biobanks participants with a known email address, who were invited to complete the Oxford WebQ on four separate occasions over a 16-month period.
Accelerometer data were collected from May 2013 until December 2015 from 103,720 UK Biobank participants. Our outcome of interest is the average acceleration adjusted for no-wear bias (UKB field 90087): http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=90087
These are consistent with Clark and Royer (2013), who also find no effects of the 1972 ROSLA on self-reported smoking or physical activity.
The wording of the question was “Have you been told by a doctor that you have high blood pressure”?
We find evidence that education increases the socioeconomic class of occupations participants hold in middle-age. They are less likely to work on semi-routine and lower supervisory occupations, for example. See Barcellos, Carvalho and Turley (2021).
Contributor Information
Silvia H. Barcellos, University of Southern California and an affiliate of NBER.
Leandro S. Carvalho, Center for Economic and Social Research and at the Department of Economics at the University of Southern California in Los Angeles.
Patrick Turley, Department of Economics and at the Center for Economic and Social Research at the University of Southern California in Los Angeles.
REFERENCES
- Allen Naomi, Sudlow Cathie, Downey Paul, Peakman Tim, Danesh John, Elliott Paul, Gallacher John, Green Jane, Matthews Paul, Pell Jill, Sprosen Tim, and Collins Rory. 2012. “UK Biobank: Current Status and What It Means for Epidemiology.” Health Policy and Technology 1(3): 123–126. [Google Scholar]
- Albouy Valerie, and Lequien Laurent. 2009. “Does Compulsory Education Lower Mortality?.” Journal of Health Economics 28(1): 155–168. [DOI] [PubMed] [Google Scholar]
- Anderson Michael L. 2008. “Multiple Inference and Gender Differences in the Effects of Early Intervention: A Reevaluation of the Abecedarian, Perry Preschool, and Early Training Projects.” Journal of the American Statistical Association 103(484): 1481–1495. [Google Scholar]
- Anderson TW, & Darling DA 1952. Asymptotic Theory of Certain” Goodness of Fit” Criteria Based on Stochastic Processes. the Annals of Mathematical Statistics, 193–212. [Google Scholar]
- Aune Dagfinn, Sen Abhijit, Prasad Manya, Norat Teresa, Janszky Imre, Tonstad Serena, Romundstad Pål, and Vatten Lars J.. 2016. “BMI and All-Cause Mortality: Systematic Review and Non-Linear Dose-Response Meta-Analysis of 230 Cohort Studies With 3.74 Million Deaths Among 30.3 Million Participants.” bmj 353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Avendano Mauricio, De Coulon Augustin, and Nafilyan Vahé. 2020. “Does Longer Compulsory Schooling Affect Mental Health? Evidence from A British Reform.” Journal of Public Economics 183: 104137. [Google Scholar]
- Bakris George, and Sorrentino Matthew. 2018. “Vitamin D Life.” N Engl. J Med 378: 497–499. [DOI] [PubMed] [Google Scholar]
- Barcellos Silvia H., Carvalho Leandro S., and Turley Patrick. 2018. “Education Can Reduce Health Differences Related to Genetic Risk of Obesity.” Proceedings of the National Academy of Sciences 115(42): E9765–E9772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- __________. 2021. “The Effect of Education on the Relationship Between Genetics, Early-Life Disadvantages and Later-Life SES.” USC Working Paper. [Google Scholar]
- Bitler Marianne P., Gelbach Jonah B., and Hoynes Hilary W.. 2006. “What Mean Impacts Miss: Distributional Effects of Welfare Reform Experiments.” American Economic Review 96(4): 988–1012. [Google Scholar]
- Black Sandra E., Devereux Paul J., and Salvanes Kjell G.. 2016. “Healthy (?), Wealthy, and Wise: Birth Order and Adult Health.” Economics & Human Biology 23: 27–45. [DOI] [PubMed] [Google Scholar]
- Brody Gene H., Yu Tianyi, Chen Edith, Miller Gregory E., Kogan Steven M., and Beach Steven RH. 2013. “Is Resilience Only Skin Deep? Rural African Americans’ Socioeconomic Status–Related Risk and Competence in Preadolescence and Psychological Adjustment and Allostatic Load At Age 19.” Psychological Science 24(7): 1285–1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Card David. 2001. “Estimating the Return to Schooling: Progress on Some Persistent Econometric Problems.” Econometrica 69(5): 1127–1160. [Google Scholar]
- Chan June M., Rimm Eric B., Colditz Graham A., Stampfer Meir J., and Willett Walter C.. 1994. “Obesity, Fat Distribution, and Weight Gain as Risk Factors for Clinical Diabetes in Men.” Diabetes Care 17(9): 961–969. [DOI] [PubMed] [Google Scholar]
- Chen Edith, Miller Gregory E., Brody Gene H., and Lei Mankit. 2015. “Neighborhood Poverty, College Attendance, and Diverging Profiles of Substance Use and Allostatic Load In Rural African American Youth.” Clinical Psychological Science 3(5): 675–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chobanian Aram V., Bakris George L., Black Henry R., Cushman William C., Green Lee A., Izzo Joseph L. Jr, Jones Daniel W. et al. 2003. “The Seventh Report of the Joint National Committee on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure: The JNC 7 Report.” Jama 289(19): 2560–2571. [DOI] [PubMed] [Google Scholar]
- Clark Damon, and Royer Heather. 2010. “The Effect of Education on Adult Health and Mortality: Evidence from Britain” NBER Working Paper 16013. [DOI] [PubMed] [Google Scholar]
- __________. 2013. “The Effect of Education on Adult Mortality and Health: Evidence from Britain.” American Economic Review 103(6): 2087–2120. [DOI] [PubMed] [Google Scholar]
- Conti Gabriella, Heckman James, and Urzua Sergio. 2010. “The Education-Health Gradient.” American Economic Review 100(2): 234–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Courtin Emilie, Nafilyan Vahe, Avendano Mauricio, Meneton Pierre, Berkman Lisa F., Goldberg Marcel, Zins Marie, and Dowd Jennifer B.. 2019. “Longer Schooling but Not Better Off? A Quasi-Experimental Study of the Effect of Compulsory Schooling on Biomarkers in France.” Social Science & Medicine 220: 379–386. [DOI] [PubMed] [Google Scholar]
- Cutler David M., Lleras-Muney Adriana, and Vogl Tom. 2008. “Socioeconomic Status and Health: Dimensions and Mechanisms” NBER Working Paper 14333. [Google Scholar]
- Cutler David M., and Lleras-Muney Adriana. 2010. “Understanding Differences in Health Behaviors by Education.” Journal of Health Economics 29(1): 1–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davies Neil M., Dickson Matt, George Davey Smith Gerard J. Van Den Berg, and Windmeijer Frank. 2018. “The Causal Effects of Education on Health Outcomes in the UK Biobank.” Nature Human Behaviour 2(2): 117–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daviglus Martha L., Liu Kiang, Yan Lijing L., Pirzada Amber, Garside Daniel B., Schiffer Linda, Dyer Alan R., Greenland Philip, and Stamler Jeremiah. 2003. “Body Mass Index in Middle Age and Health-Related Quality of Life in Older Age: The Chicago Heart Association Detection Project in Industry Study.” Archives of Internal Medicine 163(20): 2448–2455. [DOI] [PubMed] [Google Scholar]
- Delaney Judith M., and Devereux Paul J.. 2019. “More Education, Less Volatility? the Effect of Education on Earnings Volatility Over the Life Cycle.” Journal of Labor Economics 37(1): 101–137. [Google Scholar]
- DeMatteis Sara, Jarvis Deborah, Hutchings Sally, Darnton Andy, Fishwick David, Sadhra Steven, Rushton Lesley, and Cullinan Paul. 2016. “Occupations Associated with COPD Risk in the Large Population-Based UK Biobank Cohort Study.” Occupational and Environmental Medicine 73(6): 378–384. [DOI] [PubMed] [Google Scholar]
- Dickson Matt, and Smith Sarah. 2011. “What Determines the Return to Education: An Extra Year or A Hurdle Cleared?.” Economics of Education Review 30(6): 1167–1176. [Google Scholar]
- Fogel Robert W. 1994. “Economic Growth, Population Theory, and Physiology: The Bearing of Long-Term Processes on the Making of Economic Policy.” NBER Working Paper 4638. [Google Scholar]
- Fry Anna, Littlejohns Thomas J., Sudlow Cathie, Doherty Nicola, Adamska Ligia, Sprosen Tim, Collins Rory, and Allen Naomi E.. 2017. “Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants with Those of the General Population.” American Journal of Epidemiology 186(9): 1026–1034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galama Titus J., Lleras-Muney Adriana, and Van Kippersluis Hans. 2018. “The Effect of Education on Health and Mortality: A Review of Experimental and Quasi-Experimental Evidence.” NBER Working Paper 24225. [Google Scholar]
- Galante Julieta, Adamska Ligia, Young Alan, Young Heather, Littlejohns Thomas J., Gallacher John, and Allen Naomi. 2016. “The Acceptability of Repeat Internet-Based Hybrid Diet Assessment of Previous 24-H Dietary Intake: Administration of the Oxford Webq in UK Biobank.” British Journal of Nutrition 115(4): 681–686. [DOI] [PubMed] [Google Scholar]
- Gelman Andrew, and Imbens Guido. 2019. “Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs.” Journal of Business & Economic Statistics 37(3): 447–456. [Google Scholar]
- Grenet Julien. 2013. “Is Extending Compulsory Schooling Alone Enough to Raise Earnings? Evidence from French and British Compulsory Schooling Laws.” the Scandinavian Journal of Economics 115(1): 176–210. [Google Scholar]
- Goldman Dana P., and Smith James P.. 2002. “Can Patient Self-Management Help Explain the SES Health Gradient?.” Proceedings of the National Academy of Sciences 99(16): 10929–10934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grossman Michael. 2015. “The Relationship Between Health and Schooling: What’s New?” NBER Working Paper 21609. [Google Scholar]
- Hall Kevin D., Heymsfield Steven B., Kemnitz Joseph W., Klein Samuel, Schoeller Dale A., and Speakman John R.. 2012. “Energy Balance and Its Components: Implications for Body Weight Regulation.” the American Journal of Clinical Nutrition 95(4): 989–994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heckman James J., Smith Jeffrey, and Clements Nancy. 1997. “Making the Most Out of Programme Evaluations and Social Experiments: Accounting for Heterogeneity in Programme Impacts.” the Review of Economic Studies 64(4): 487–535. [Google Scholar]
- Heckman James J., Humphries John Eric, and Veramendi Gregory. 2018. “Returns to Education: The Causal Effects of Education on Earnings, Health, and Smoking.” Journal of Political Economy 126(S1): S197–S246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Imbens Guido W., and Lemieux Thomas. 2008. “Regression Discontinuity Designs: A Guide to Practice.” Journal of Econometrics 142(2): 615–635. [Google Scholar]
- Janke Katharina, Johnston David W., Propper Carol, and Shields Michael A.. 2020. “The Causal Effect of Education on Chronic Health Conditions in the UK.” Journal of Health Economics 70: 102252. [DOI] [PubMed] [Google Scholar]
- Johnston David W., Propper Carol, and Shields Michael A.. 2009. “Comparing Subjective and Objective Measures of Health: Evidence from Hypertension for the Income/Health Gradient.” Journal of Health Economics 28(3): 540–552. [DOI] [PubMed] [Google Scholar]
- Jürges Hendrik, Kruk Eberhard, and Reinhold Steffen. 2013. “The Effect of Compulsory Schooling on Health—Evidence from Biomarkers.” Journal of Population Economics 26(2): 645–672. [Google Scholar]
- Kannel William B. 1996. “Blood Pressure as A Cardiovascular Risk Factor: Prevention and Treatment.” Jama 275(20): 1571–1576. [PubMed] [Google Scholar]
- Kemptner Daniel, Jürges Hendrik, and Reinhold Steffen. 2011. “Changes in Compulsory Schooling and the Causal Effect of Education on Health: Evidence from Germany.” Journal of Health Economics 30(2): 340–354. [DOI] [PubMed] [Google Scholar]
- Kippersluis Hans Van, and Galama Titus J.. 2014. “Wealth and Health Behavior: Testing the Concept of a Health Cost.” European Economic Review 72: 197–220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Komaroff Marina. 2016. “For Researchers on Obesity: Historical Review of Extra Body Weight Definitions.” Journal of Obesity 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee David S., and Card David. 2008. “Regression Discontinuity Inference with Specification Error.” Journal of Econometrics 142(2): 655–674. [Google Scholar]
- Lewington S 2002. “Prospective Studies Collaboration. Age-Specific Relevance of Usual Blood Pressure to Vascular Mortality: A Meta-Analysis of Individual Data for One Million Adults In 61 Prospective Studies.” Lancet 360: 1903–1913. [DOI] [PubMed] [Google Scholar]
- Lleras-Muney Adriana. 2005. “The Relationship Between Education and Adult Mortality in the United States.” the Review of Economic Studies 72(1): 189–221. [Google Scholar]
- Meghir Costas, Palme Mårten, and Simeonova Emilia. 2018. “Education and Mortality: Evidence from A Social Experiment.” American Economic Journal: Applied Economics 10(2): 234–56. [Google Scholar]
- Miller Gregory E., Yu Tianyi, Chen Edith, and Brody Gene H.. 2015. “Self-Control Forecasts Better Psychosocial Outcomes but Faster Epigenetic Aging in Low-SES Youth.” Proceedings of the National Academy of Sciences 112(33): 10325–10330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park Cheolsung, and Kang Changhui. 2008. “Does Education Induce Healthy Lifestyle?.” Journal of Health Economics 27(6): 1516–1531. [DOI] [PubMed] [Google Scholar]
- Powdthavee Nattavudh. 2010. “Does Education Reduce the Risk of Hypertension? Estimating the Biomarker Effect of Compulsory Schooling in England.” Journal of Human Capital 4(2): 173–202. [Google Scholar]
- Shen Shu, and Zhang Xiaohan. 2016. “Distributional Tests for Regression Discontinuity: Theory and Empirical Examples.” Review of Economics and Statistics 98(4): 685–700. [Google Scholar]
- Silles Mary A. 2009. “The Causal Effect of Education on Health: Evidence from the United Kingdom.” Economics of Education Review 28(1): 122–128. [Google Scholar]
- Stephens Michael A. 1974. “EDF Statistics for Goodness of Fit and Some Comparisons.” Journal of the American Statistical Association 69(347): 730–737. [Google Scholar]
- Sudlow Cathie, Gallacher John, Allen Naomi, Beral Valerie, Burton Paul, Danesh John, Downey Paul, Elliott Paul, Green Jane, Landray Martin, Liu Bettte, Matthews Paul, Ong Giok, Pell Jill, Silman Alan, Young Alan, Sprosen Tim, Peakman Tim, and Collins Rory. 2015. “UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age.” Plos Med 12(3): E1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tockman Melvyn S, Anthonisen Nicholas R, Wright Elizabeth C, and Donithan Michele G. 1987. “Airways Obstruction and the Risk for Lung Cancer.” Annals of Internal Medicine 106(4): 512–518. [DOI] [PubMed] [Google Scholar]
- Vasan Ramachandran S., Larson Martin G., Leip Eric P., Evans Jane C., O’Donnell Christopher J., Kannel William B., and Levy Daniel. 2001. “Impact of High-Normal Blood Pressure on the Risk of Cardiovascular Disease.” New England Journal of Medicine 345(18): 1291–1297. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.