Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jan 1.
Published in final edited form as: J Health Econ. 2017 Nov 24;57:1–14. doi: 10.1016/j.jhealeco.2017.10.007

Heterogeneity in the Impact of Type of Schooling on Adult Health and Lifestyle

Anirban Basu 1,*, Andrew M Jones 2, Pedro Rosa Dias 3
PMCID: PMC5905417  NIHMSID: NIHMS922748  PMID: 29179025

Abstract

Using data from a major educational reform in England and Wales, we examine heterogeneity in the long-term impacts of the exposure to different secondary schooling systems, characterized by selective early-tracking system versus non-selective comprehensive schooling, on health outcomes and smoking. We adopt a local instrumental variables approach to estimate person-centered treatment (PeT) effects, thereby recovering the full distribution of individual-level causal effects. We find that the transition from a selective early-tracking system to a non-selective one produced, on a fraction of individuals, significantly increased depression and cigarette smoking. These effects were persistent over time. Cognitive abilities did not moderate the effects, but students with lower non-cognitive skills were most likely to be negatively affected by this exposure. Keywords: Educational reform, Health, Lifestyle, Heterogeneity, instrumental variable


The positive health gradient by years of schooling is a particularly robust empirical regularity. Several papers have examined the causal nature of this relationship by exploiting changes in compulsory education laws, such as Lleras-Muney (2005), Kenkel et al. (2006), Oreopoulous (2006), Grimard and Parent (2007), Silles (2009), Van Kippersluis et al. (2011), Oreopoulous and Salvanes (2011) and Clark and Royer (2013). While the magnitude of the estimated health effects varies considerably among these studies1, in general, they tend to find positive long-term mean effects of post-compulsory years of education on longevity, health-related behaviors, and outcomes. This body of evidence has inspired some policy interventions targeting young children and adolescents in various parts of the world (e.g., The Quebec Educational Program; Best Start for Kids in King County, Washington).

Nonetheless, as made clear in Clark and Royer (2013), other key aspects of the association between education and health have received much less attention. First, most of the available evidence concerns the effects of length of schooling, rather than the type of that education; this leaves aside all qualitative aspects of an educational system, such as conditions of access, tracking, resources and educational environment. For example, there is a large literature that has looked at the impact of assimilation and also segregation of children from different cultural, socio-economic, and racial backgrounds on their long-term outcomes (Borjas 1985; Fordham and Ogbu 1986; Cutler and Glaeser 1997; Guryan 2004; Card and Rothstein 2007; Echenique et al 2006). More recently, the effects of assimilation of abilities (cognitive and non-cognitive), independent of other factors, have become a focus of attention. Ability tracking within schools, where high ability students are grouped in separate classes within the same school, have found better results from both high ability and low-ability students. For example, recently Card and Giuliano (2014) found that establishing separate “gifted/high achiever” classrooms for fourth and fifth graders leads to significant achievement gains for non-gifted but high-achiever participants in those classrooms. Beyond ability tracking, mixing of abilities within a school has found mixed results. Kang et al. (2007) exploited a policy experiment in South Korea in 1970 and found that ability-mixing compared to ability-grouping in high school had a positive but statistically insignificant effect on average adulthood earnings. However, this effect was larger for lower ability students and lower, even negative, for higher ability students. Understanding the impact of such mixing of abilities, not only based on cognitive dimensions but also on non-cognitive abilities, on long-term health and lifestyle choices has remained largely unexplored.

In this paper, we try to fill this gap in evidence through three main channels. First, we exploit the implementation of a major education reform in England and Wales that, rather than increasing the school leaving age, exposed pupils of the same age cohort to qualitatively different secondary schooling systems. Geographic variation in the pace that comprehensive schools were introduced in England and Wales means that one group was exposed to a selective early-tracking system in which children took a test to determine whether they were given the opportunity to enroll in an academically intensive secondary school (called ‘grammar schools’) or else into a less academic secondary school (called ‘secondary modern schools). Another group was exposed to a non-selective system, under which testing was abolished and where all pupils, with varying abilities, attended the same type of schools (called comprehensive schools). This allows us to examine the effect of exposure to qualitatively different educational systems, which did vary regarding the curriculum a median student was exposed to and the distribution of teacher characteristics, and importantly, allowed different levels of assimilation of abilities among the students, on health and lifestyle in adulthood. Non-pecuniary effects of exposure to different types of secondary school systems have been postulated to be sizable in the education sciences and psychology literature (Dupriez et al. 2008 and Perkins and Graham-Bermann 2012) but have received less attention in the economics literature.

Second, we examine the patterns of heterogeneity of effects on health and lifestyle of the exposure to qualitatively different educational systems. Conti et al. (2010, 2011) and Heckman et al. (2014) have used longitudinal data from the UK and the US to characterize heterogeneity health returns to post-compulsory education. Thus, as in Heckman et al. (2014), we adopt an instrumental variables identification strategy and estimate person-centered treatment (PeT) effects (Basu 2014) of selective over non-selective schooling system, thereby recovering the full distribution of the effects and identifying gainers and losers from this policy across the lifecycle.

Third, we examine the roles of cognitive and non-cognitive ability in childhood as possible moderators of the heterogeneity of the estimated effects. Using Dutch data, Bijwaard (2014) has recently shown that cognitive ability can be an important moderator of heterogeneity in the health returns to different lengths of schooling.

We use data from the National Child Development Study (NCDS), which follows a cohort of around 17,000 individuals from their birth in the week of 3rd March 1958. The NCDS cohort-members went through secondary schooling during the 1970s and attended different types of school since their secondary schooling lay within the transition period of the comprehensive education reform that was implemented in England and Wales from the mid-1960s2. This reform restructured secondary education by replacing the former system of early tracking of students with a non-selective, comprehensive, educational system. Comprehensive schooling was not implemented simultaneously across the country, and hence only some cohort members attended the selective system of state-funded education, which comprised grammar schools, secondary modern schools and a small number of technical schools. Among members of the NCDS cohort as a whole, 12 percent attended grammar schools at age 16, 25 percent attended secondary moderns, and 57 percent attended comprehensive schools. An additional 6 percent attended private fee-paying schools, independent of the state educational system and reforms3.

Previous evaluations of the comprehensive schooling reform have focused largely on their direct impact on educational attainment and indirect impact on labour market outcomes (Kerkchoff et al., 1996; Jesson, 2000; Dearden et al., 2002; Galindo-Rueda and Vignoles, 2005, Bonhomme and Sauder, 2011; Burgess et al., 2014). While earlier work that investigated health outcomes used a matching approach and focused on average effects rather than evidence of heterogeneity in responses to the reform (Jones et al., 2011). Here we focus on heterogeneous responses in health-related behaviours and health outcomes.

Our results indicate that there is substantial heterogeneity in the effects of the reform and significant persistence of these effects over time at the individual level. Compared to the selective system of schooling, the newer comprehensive schooling system produced significant negative effects on long-term health and increased smoking among a fraction of individuals. However, the ATE and TT were quantitatively similar and statistically insignificant indicating that cognitive abilities, which were the major drivers for selection into comprehensive schools, did not moderate the effects. Analyzing the PeT effects shows that individuals who had lower non-cognitive skills are most likely to be negatively affected in the long-run from exposure to mixed-ability schools. Our results also show that cigarette smoking could be an important transmission channel of the long-term impact on health outcomes.

1. THE COMPREHENSIVE EDUCATION REFORM

As Kerckhoff et al. (1996: 1-2) describe it, following the Second World War, a cycle of educational reforms was initiated in Western Europe. This was characterized by a move away from older selective systems and towards unified, non-selective, systems of comprehensive schooling. From the mid-1960s the comprehensive reorganization took-off in England and Wales, and aimed at promoting equal opportunity in society through an improvement of the quality of the schooling accessible to children from disadvantaged backgrounds. It replaced the selective educational system with a non-selective, comprehensive system of secondary schooling. This policy reform was implemented at different speeds at the local level: some Local Education Authorities (LEAs) implemented it quickly, but others resisted the change. This gradual and uneven transition allowed the two systems to co-exist for a long period.

The NCDS cohort entered state secondary schools at age 11 in 1969 and completed their compulsory schooling at age 16 in 1974; at that point, among those who attended state schools 40 percent of the cohort members still experienced the selective system, while the remaining 60 percent attended comprehensive schools. The NCDS cohort-members who still experienced the selective system attended either “grammar schools” or “secondary modern” ones. Grammar schools were academically oriented state schools that provided teaching for the entire age range 11-18 and prepared pupils to go on to higher education. Admission into these schools was determined by an exam taken at age 11 (the ‘Eleven Plus’)4. Pupils whose examination score did not permit entry into a grammar school attended either secondary modern schools, which were less academically oriented and covered the ages 11-16 or, in a small minority of cases, vocational schools aimed at providing training and technical apprenticeships5.

The comprehensive education reform affected schooling in a variety of ways: grammar and secondary modern schools were significantly different regarding their curricula, examinations, and academic environment. They were also very different regarding other qualitative aspects: among the schools attended at age 16 by the NCDS cohort members used in our analysis, 69 percent of grammar schools were single-sex, while only 13 percent of comprehensive schools were single sex. Streaming of classes by academic ability was common in secondary modern schools (42 percent) and comprehensive schools (39 percent) but rare among grammar schools (17 percent). Some comprehensive schools were former secondary moderns (26 percent) or grammar schools (19 percent), with the rest being newly created. Furthermore, the distribution of indicators of quality of the school, such as the pupil-teacher ratio and of the average number of expelled students, also differs across types of schools6.

The issue of the relative advantages of early tracking versus late tracking of students in secondary schools remains a topical education policy issue7, and the comprehensive reform has received considerable attention in the social sciences (Kerkchoff et al., 1996; Jesson, 2000; Dearden et al., 2002; Galindo-Rueda and Vignoles, 2005, Burgess et al., 2014). Galindo-Rueda and Vignoles (2005) use the NCDS data to investigate the effects of the reform on educational outcomes. Since the type of schooling system attended by each cohort member may be endogenous, they propose an instrumental variables identification strategy, relying on two main instruments: the share of comprehensive schools in a cohort member’s LEA in 1974 and the Conservative Party control of the cohort member’s LEA (which the authors claim to be negatively correlated with the probability of attending a comprehensive school, but orthogonal to the educational outcomes). However, possible selection into a particular schooling system should be based on the availability of comprehensive schools when the cohort members were 11 years old and ready to enter secondary education, rather than in 1974, when they were 16 old, and this decision had already been made. In addition, as made clear in Pischke and Manning (2006), Conservative areas tended to be significantly richer and better educated than Labour ones, hence the validity of Conservative control of individuals’ constituency as an instrument for the educational system attended is open to question8 We draw on these concerns as a basis for our identification strategy.

2. EMPIRICAL STRATEGY

As shown by Conti et al. (2010, 2011), individual health returns to schooling are likely to be heterogeneous. In our case, selection into the comprehensive or the selective schooling systems may reflect this type of heterogeneity. Our primary goal is to study such heterogeneity. To do this we apply the local instrumental variables approach of Heckman and Vytlacil (1999) and estimate person-centered treatment effect parameters (Basu 2014) and compare these to conventional IV and OLS estimates. A summary of this method and parameters are given here.

We define a model of potential outcomes and treatment choice using the Neyman-Fisher-Cox-Rubin framework of potential outcomes. Consider two treatment regimes – comprehensive schools (treated) state denoted by D = 1 and selective schools (untreated) denoted by D = 0. The corresponding potential individual outcomes in these two states are denoted by Y1 and Y0. We assume:

Y1=μ1(X0,XU,ϑ)andY0=μ0(X0,XU,ϑ), (1)

where X0 is a vector of observed random variables, XU is a vector of unobserved random variables which are also believed to influence treatment selection (they are the unobserved confounders) and ϑ is an unobserved random variable that captures all remaining unobserved random variables. (X0, XU)∐ϑ and X0XU where denotes statistical independence.

Selection into treatment (before the realization of the outcome of interest) is determined by:

D=1ifμD(X0,Z)UD>0, (2)

where Z is a (non-degenerate) vector of observed random variables (instruments) influencing the decision equation but not the potential outcome equations, μD is an unknown function of X0 and Z, and UD is a random variable that is correlated with XU and all remaining unobserved random variables influencing choice. By definition, UDϑ, which also defines the distinction between XU and ϑ in (1). Equations (1) and (2) represent the nonparametric models that conform to the Imbens and Angrist’s (1994) independence and monotonicity assumptions needed to interpret instrumental variable estimates in a model of heterogeneous returns. As in Heckman and Vytlacil (1999), we can rewrite (2) as:

D=1ifP(X0=x0,Z=z)>V (3)

where V = FUD[UD|X0 = x0, Z = z], P(x0,z)=FUD|X0,Z[μD(X0,Z)] and F represents a cumulative distribution function. Therefore, for any arbitrary distribution of UD conditional on XO and Z, by definition, V ~ Unif[0, 1] conditional on XO and Z. Under regular IV assumptions, Heckman and Vytlacil (1999) show that Marginal Treatment Effects can be identified by:

Eϑ(Y|X0=x0,Z=z)p=Eϑ((Y1Y0)|X0=x0,V=v)=MTE(x0,v) (4)

where Y = DY1 + (1 − D)Y0 is the observed outcomes and v = P(xO, z).

Basu (2014) extends the LIV methods to identify PeT effects, which, for persons who choose treatment, are defined by:

EXu|X0,P(Z),DEϑ((Y1Y0)|x0,P(z),D=1)=E((Y1Y0)|x0,V<P(z))=P(z)10P(z)MTE(x0,v)dv. (5)

Similarly, the conditional effect for a person who did not choose treatment is obtained by integrating MTEs over values of V greater than p.

Conceptually, a PeT effect is a weighted version of MTEs. For any given individual, the PeT effect identifies the specific margins where that individual may belong given its individual values of XO, P(Z) and D. It then averages the MTEs over those margins, but not all as in CATE. Therefore, a PeT effect is the X-Z-conditional Effect on the Treated for persons undergoing treatment and is the X-Z-conditional Effect on the Untreated for persons not undergoing treatment. It is important to note that in the absence of essential heterogeneity, a PeT effect reduces to an individualized CATE where the condition is enforced based on the entire vector of covariate values for an individual. Further technical details can be found in Basu (2014). However, here we provide an explanation of what the PeT effects are and why estimation of the PeT effects are important in this context.

2.1 Intuition behind and importance of PeT effects

Suppose that in our example, we presume (only to get the general intuition) comprehensive schooling to be more beneficial, in terms of long-term health, for children with higher test scores in elementary schools and for children who are self-confident. Moreover, choice of treatment is influenced by test scores and the self-confidence of the child. However, our data on children from NCDS do not have a measure of their self-confidence and only contains for each student: test scores, treatment, long-term health outcomes, and a characteristic that all agree is a powerful and valid instrument. Let the instrument be a continuous variable (e.g. distance to the comprehensive school). Greater the distance to the comprehensive less likely will be the enrollment to that school.

In this case, a traditional naïve regression would produce a biased estimate of the average treatment effect (ATE) and also the conditional average treatment effects (CATEs) for children groups with low and high test scores groups due to the endogeneity of the treatment status caused by the missingness of self-confidence levels in the analysis.

A traditional instrumental variable analysis, using a strong and valid IV, will also produce biased estimates for ATE and CATEs due to the presence of essential heterogeneity, which suggests that the treatment effects vary over unobserved confounders, which in this case is the self-confidence status.

A local instrumental variable (LIV) approach can be used to overcome these issues when a continuous instrument is available. LIV methods are used to estimate the marginal treatment effects (MTEs) parameters. MTEs are the effects for individuals for whom the influence of the observed characteristics (tests score and distance to comprehensive school) balance with the influence of the unobserved confounders (self-confidence level) on treatment choice such that they are indifferent to choosing between using the comprehensive and selective schools. To estimate an MTE, LIV methods are used to compare the outcomes of two groups of, say, students with low test scores, where one group is staying at a distance d from the comprehensive school and the other at a distance d+ε, ε representing an epsilon (very small) change in distance. Therefore, the two groups are identical in their distribution of self-confidence. Any difference in outcomes between these two groups of students should be attributed to the changes in treatment choices between these two groups, which is only brought about by the difference in distance.

However, since the difference in distance is very small, the effect can be attributed to a margin of students who were indifferent between two treatment options and were moved to change treatment choice by the small perturbation of the instrumental variable, i.e., distance. For this margin of students, one can quantify a normalized level of unobserved confounders as they must balance the observed levels. Similarly, for another dyad of distances, d’ and d’+ ε, one can estimate another MTE, which reflects the causal treatment effect on students at another level of a unobserved confounder. In this way, a full schedule of MTEs can be estimated that vary over the unobserved confounder levels (i.e. self-confidence) given any level of the observed confounder (i.e. test scores). Intuitively, LIV methods estimate these MTEs by estimating the partial derivate of the outcome with respect to the IV-based propensity score (reflecting epsilon changes).

Once MTEs are estimated over the range of observed and unobserved levels, they can then be easily aggregated to form meaningful treatment effect parameters such as the ATE, CATEs, TT and TUT and also study heterogeneity in effects using person-centered treatment (PeT) effects. The PeT effect for a student in this stylized sample is conditioned not only on that individual’s test score but his test score-specific MTEs are also averaged over a distribution of self-confidence statuses that conforms with the individual’s observed choice of the schooling system. Thus these are deemed to be personalized effect for this student.

If essential heterogeneity is present, estimation of ATE and CATE would not be possible using traditional IV methods, as explained above. Even in the absence of essential heterogeneity, where one can estimate CATEs for single covariates using traditional IV methods, there is no easy way to develop algorithms that can help understand the combination of covariates that explain treatment effect heterogeneity at the individual level. PeT allows us to develop such algorithm as we demonstrate below. In the context of the policy that we are evaluating, such an algorithm can inform a variety of stakeholders how to group students across types of schooling.

2.2 Identification

Galindo-Rueda and Vignoles (2005) suggest the existence of unobserved individual heterogeneity and selection into types of schooling based on unobserved gains (essential heterogeneity). Hence, traditional instrumental variables approaches would produce biased estimates of mean treatment effect parameters. We follow the LIV approach to estimate PeT effects. Specifically, as a continuous instrument, we use the percentage of comprehensive schools in individual’s Local Education Authority in 1969. A similar variable based on 1974 percentages has been used in earlier literature as an instrument to evaluate the reform. However, as explained above, we believe that that 1969 penetration rates of comprehensive schooling provides an even more exogenous variation in schooling system choices since, for this cohort, the transition into either the selective or the comprehensive systems of schooling took place in 1969, when cohort members were 11 years of age, rather than in 1974. Besides directly testing the strength of the instrument, we perform a variety of necessary test on observed risk factors to see whether the instrument can reduce the imbalance in these factors as compared to that observed across school systems. We also adapt the falsification test proposed by Pischke and Manning (2006) to the analysis of health outcomes and estimate the impact of comprehensive schooling on probability of cohort members suffering from at least four childhood morbidities at age 11 (i.e. before exposure to the policy took place) using the 1969 percentage of comprehensive schools in the LEA as an instrument.

2.3 Estimation

To estimate the impact of the comprehensive school system and the individual heterogeneity in these effects, we use local instrumental variable methods (LIV) to compute person-centered treatment (PeT) effects (Heckman and Vytlacil 1999; Basu 2014).

To estimate PeT effects, we follow the following algorithm (Basu, 2014):

  1. Run the first stage by regressing the indicator for the comprehensive system (D) against observed factors (XO) and the instrument (Z) using a probit model and predict the propensity score, p^(x0,z), for every individual.

  2. Ensure that p^(x0,z) has mass at every value (rounded to 0.01) under both comprehensive and selective school systems, dropping any observations that do not.

  3. Denote minp=min{p^(x0,z)} and maxp=max{p^(x0,z)}

  4. Run the second stage LIV estimand for each binary outcome Y using a probit model with the following specification (determined by a series of goodness of fit tests): Y=Φ(α0+α1X0+α2p^+α3p^X0+α4p^2), where Φ is the cumulative normal distribution function.

  5. Numerical integration: For each individual i:
    1. Draw 1000 replications of u~Uniform[minp, maxp]
    2. Compute dΦ^(.)/dp^ and evaluated it by replacing p^(x0,z) with each value of u. So there are 1000 values of dΦ^(.)/dp^ for each individual i.
    3. Compute D=Φ1(p^(x0,z))+Φ1(1u) also generating 1000 values for each individual i.
    4. Compute the PeT by averaging dΦ^(.)/dp^ over values of u for which (D* > 0) if D=1, otherwise, by averaging dΦ^(.)/dp^ over values of u for which (D* ≤ 0) if D=0.
  6. Estimated PeT effects provide individualized effects of treatment effects. Mean treatment effect parameters are computed: averaging PeTs over all observation gave ATE, averaging PeTs over over D=1 or D=0 gives the TT and TUT respectively.

We estimate the PeT effects of experiencing comprehensive schooling compared to the selective schooling system on health outcomes (indicators for self-reported excellent health, malaise and long-standing illness) and cigarette smoking (indicator for current smoking) at ages 23, 33 and 42 years separately. We use the estimated PeT effects to study the following:

  1. Differential drop-out: We examine systematic differences in the distribution of PeT effects at age 23 between those who would continue to participate at age 33 versus those who would drop out before age 33. Similarly, we looked at PeT effects at age 33 based on drop-out behaviors by age 42.

  2. Time dependence of effects: We explore the dependence of PeT effects on the same outcomes over different ages for the same individuals to study the time dependence or persistence of effects.

  3. Effects on multi-dimensional outcomes: We compare the dependence amongst the PeT effects of malaise versus long-standing illness for the same individuals to study the multi-dimensional effects of alternative schooling systems. We also repeated similar analysis between the effects on smoking at age 33 years and malaise at age 42 years to establish a potential channel through which alternative schooling systems may affect outcomes.

  4. Heterogeneity: We studied how the average effects on outcomes among those who did not attend comprehensive schooling systems differed between those who attended secondary modern schools versus grammar schools. This comparison helps to illustrate if exposure to comprehensive schools would have had differential effects on students who went to the different types of schools within the selective system.

  5. Finally, we develop a prediction algorithm based on individual factors that predict significant PeT effects on malaise and long-standing illness at age 42.

3. DATA

Members of the National Child Development Study (NCDS) cohort are followed from birth, in the week of 3rd March 1958. In the early waves of the Study information was obtained from parents, medical examinations, tests of ability and the child’s school; extensive data are thus available on the cohort-members’ childhood health, parental background, and educational achievement. The subsequent waves include self-reported information on social status in adulthood, health-related behavior and a wide range of health outcomes.

Our empirical strategy aims at identifying the impact of the type of secondary schooling experienced by members of the 1958 cohort on adult health and health-related behaviors after controlling for a rich set of pre-treatment factors9. These include measures of family socioeconomic status and the local environment during early childhood; measures of childhood health and use of health care and health within the family; and measures of cognitive and non-cognitive skills and social adjustment of the child. Because the comprehensive reform was a secondary education policy, we also condition on characteristics of the individual’s primary education, which, in our case, constitute pretreatment factors. The specific variables used are described below and listed in Table A.1 in Appendix A.

3.1 Pre-treatment factors

Childhood health, genetic endowments, and family background

The NCDS includes extensive information on the childhood characteristics that were shown to be key predictors of health and lifestyle in adulthood in studies such as Case et al. (2005) and Currie and Stabile (2004). Following Power and Peckham (1987), we create morbidity indices by aggregating twelve categories of health conditions, which affect the child at ages 7 and 11. We complement this with indicator variables for the occurrence of diabetes, epilepsy, and other chronic conditions among parents and siblings, to account for hereditary conditions in the cohort member’s family. Following Carneiro et al. (2007), the parental socioeconomic background is proxied by the occupational social class and the years of schooling of cohort members’ father. Following Case et al. (2005) and Lindeboom et al. (2009), we augment this characterisation using data on the incidence of household financial difficulties during the cohort member’s childhood and adolescence.

Local area characteristics

Rich information about the cohort members’ immediate milieu is available and aggregated at different geographic levels, allowing a detailed demographic and socioeconomic characterization of each individual’s local area: electoral constituency level, local education authority level and census enumeration district level (the smallest unit for which census statistics were then available, with an average population of about 500)10. Available variables include the percentage of the local population who are unemployed or long-term sick, working women, employed in particular sectors (manufacturing and agricultural), who are in different occupational groups (professional/managerial, other non-manual, skilled manual, semi-skilled, unskilled), owner occupiers, council tenants, non-whites, and immigrants (see Dearden et al., 2002; Galindo-Rueda and Vignoles, 2004; Pischke and Manning, 2006).

Cognitive ability and socio-emotional development

Cognitive skills, socio-emotional development, and personality traits have received growing attention in the literature on human capability formation. In particular, these factors were shown to be key determinants of health and health-related behaviors in recent papers such as Carneiro et al., (2007), Keastner (2009) Conti et al. (2010, 2011). The NCDS provides measures of early cognitive ability, and social development collected before respondents began their secondary schooling. Scores of ability tests taken at age 7 and 11 are available on a series of cognitive dimensions: mathematics, reading, copying designs and general ability. These test scores are highly correlated at the individual level leading to multicollinearity. To avoid this, we follow Galindo-Rueda and Vignoles (2005) and use principal components analysis to construct a single measure of cognitive ability using the first principal component11.

Serendipitously, the three dimensions of cognitive ability used to construct our index – mathematics, reading and general ability – along with the fact that the index derived from the first principal component gives equal weight to each dimension, mirrors the three elements of the Eleven Plus examination. Cognitive ability score at age 11 can thus be interpreted as a proxy for performance in the Eleven Plus for those who took the examination.

We draw on Carneiro et al. (2007) and use the score for the Bristol Social Adjustment Guide (BSAG) as our measure of early social development. This is a measure of problems with social adjustment at age 11: teachers were asked to report whether the child had problems in twelve behavioral domains such as hostility towards children and adults, anxiety, withdrawal, ‘writing off’ adults, un-forthcomingness, depression, restlessness, acceptance by adults, inconsequential behavior, and other psychological and nervous symptoms (Stott, 1987). One point is attributed to each positive answer; points are then summed to obtain the BSAG social maladjustment score.

3.2 Educational attainment and school characteristics

Detailed information on the cohort-members’ educational attainment is available in the NCDS. To exploit this wealth of data, we build a measure of attainment in secondary education with thirteen categories, ordered according to the grades obtained and a number of passes12. In our empirical analysis, we use a simple 0-12 scaling of this variable as a parsimonious measure of educational attainment, although we have also used models with dummy variables for each category to check the robustness of the results.

Schools are characterized using two sets of variables. First, as described in Section 1, the NCDS cohort-members attended different types of schools; this is captured by indicators of the school attended at age 16: secondary modern, grammar, and comprehensive. Second, we use information on characteristics of the school, which have been widely used as standard proxies of the quality of schooling in applied work on education and labor (see, for example, Dearden et al., 2002). These include the teacher/pupil ratio, the ratio of expelled pupils to the total number, and indicators of whether the school was single sex and whether classes were streamed by ability.

Finally, we control for information about the individuals’ primary schooling and parental aspirations for their child, which include the number of children in the child’s primary school class at age 7 in 1965, whether parents reported that their child was unhappy at school in 1965, and an indicator for whether parents wished their child to continue beyond the minimum school leaving age.

3.3 Outcomes: health and health-related behaviors

We looked three measures of health (depression, long-standing illnesses, and self-assessed health) and one measure of lifestyle behavior (current smoking).

Respondents answer to a series of questions from the Cornell Medical Index Questionnaire, each targeting a particular mental ailment. The number of positive answers given is then used as a malaise/depression score along the lines of Carneiro et al. (2007). The malaise score is a measure of psychiatric morbidity (with a special focus on depression), developed at the Institute of Psychiatry from the Cornell Medical Index (Rutter et al., 1970). The NCDS team suggest the use of a severity scale: individuals are considered normal if they score between 0 and 7 points and depressed if they score between 8 and 24 points (Rodgers et al., 1999). In our analysis, we use this scale to define an indicator variable that takes the value one if individuals are depressed (malaise score above 7) and zero otherwise.

Information on the particular medical condition associated with the long-standing illness is available and classified according to the International Classification of Diseases (ICD-10). The most prevalent conditions are illnesses of the musculoskeletal system (25.7 percent), circulatory system (11.8 percent), respiratory system (11.4 percent) and metabolic problems (9.5 percent), of which 70 percent suffer from diabetes. We used an indicator to represent the presence of any long-standing illnesses.

We also look at the probability of individuals reporting excellent health based on self-assessed health on a four-point Likert scale (poor, fair, good, excellent)

The NCDS also documents some of the important health-related behaviors. Classroom peer-effects have been shown to be long-term determinants of these, as shown by Black, Devereux and Salvanes (2008 and 2013). Amongst these health-related behaviors, cigarette smoking has received particular attention. Fletcher (2010 and 2012) provide solid evidence that the adoption of health-related lifestyles such as cigarette smoking and alcohol consumption are partly determined through peer effects during schooling years. Using data from multiple American student cohorts Fletcher (2010) estimates that, on average, increasing the proportion of classmates who smoke by 10% will increase the likelihood an individual smokes by approximately 3 percentage points, after controlling for the endogeneity of school choice. The long-lasting influence of school-based peer effects on smoking is also documented in European countries. Lundborg (2006) uses rich Swedish data to show that school-class based peer effects in cigarette smoking (and illicit-drug use) are particularly strong. Finally, Etilé and Jones (2011) use French data to show that the effect of schooling reforms can also have long-term effects on cigarette smoking. The NCDS includes self-reported information on the number of cigarettes smoked per day, which we use. These data are only available once respondents are aged 23 and above. Other behavioral data have very high amount of missing data, which precludes us from studying them accurately.

4. RESULTS

4.1 Sample selection, missing values and attrition

Table 1 shows our sample selection and the number of observations lost due to various exclusion criteria. Some NCDS cohort members are excluded from our estimation sample because they attended private schools, which are independent of state education and schooling reforms.

Table 1.

Sample selection, missing values and attrition

Criteria Excluded Total

Total subjects in NCDS Cohort at Age 16 years 16338
Private Schools 706 15632
Comprehensive school penetration rate > 80th percentile 2251 13381

Basic exclusions
 Missing covariates, instrumental variable, outcomes at age 23 9933 3448
Further Exclusions
 No overlap on predicted propensity score 20 3428

More significantly, a large number of observations is lost due to item non-response to parts of the questionnaire that are essential for our analysis or because at some stage they have attrited out of the NCDS. This phenomenon is acknowledged in several papers that feature sample sizes comparable to ours (Dearden et al., 2002; Galindo-Rueda and Vignoles, 2004, 2005; Pischke and Manning, 2006, Lindeboom et al., 2009; Jones et al., 2011). The degree, and repercussions, of non-random attrition has been extensively examined in this earlier literature (for example, Plewlis et al., 2004; Case et al., 2005 and Lindeboom et al., 2009). Table 2 shows, in the case of our analysis, the comparison between included and excluded observations. The difference in means for most covariates is small and statistically insignificant, with a few notable exceptions. Individuals included in our estimation sample tend to exhibit higher levels of cognitive and non-cognitive ability, to live in slightly more affluent areas and to be less likely to have experienced financial hardship in childhood. We discuss the implications of our results from the included sample in light of these disparities with the excluded sample.

Table 2.

Included and excluded observations: mean differences in select covariates

Covariates Excluded
N = 9,933
Included
N = 3,448
p-value
Cognitive ability, age 7 −0.05 (1.23) 0.12 (1.12) <0.001
Non-cognitive ability, age 11 8.9 (9.14) 7.2 (8.08) <0.001
Parents say pupil unhappy at primary school, age 7 6% 6% 0.40
Parents want pupil to stay in school 73% 77% <0.001
Male 52% 49% <0.001
No. of acute illnesses child suffered up until age 7
 0 19% 18%
 1 30% 28%
 > 1 51% 53% 0.12
No. of times child was hospitalized up until age 7
 0 72% 70%
 1 22% 24%
 > 1 6% 6% 0.09
Height at age 7 (in cm)
Epilepsy in family, until age 7 6.9% 7.3% 0.37
Presence of chronic illnesses in the family 2.2% 2.5% 0.25
Father SES
 Low 24% 19%
 Middle 52% 53%
 High 26% 28% 0.001
Financial hardship episode (un till child age 7) 8.2% 6.6% 0.008
% of population unemployed or disabled 5.1% 4.5% <0.001
% of women in formal employment 41.3% 41.1% 0.80
% of population in employed in mining and manufacturing 37.2% 37.4% 0.66
% of population in employed in agriculture 2.4% 2.7% 0.26
% of population in professional / managerial jobs 11.8% 12.5% 0.02
% of population in unskilled workers 7.6% 6.8% <0.001
% of population who is home owner 42.7% 48.5% <0.001
% of population who is immigrant 1.6% 1.4% 0.05

It is also informative to compare the mean differences in covariates between the cohort members in our treatment and control groups; these are reported in Table 3. Average ability levels are higher amongst selective system pupils when compared with their comprehensive counterparts. Also, selective areas tend to be slightly more affluent than comprehensive ones, as identified in Pischke and Manning (2006)13.

Table 3.

Treatment and control group observations: mean differences in select covariates

Covariates Selective
N = 1,602
(47%)
Comprehensive
N = 1,826
(53%)
p-value p-value (across IV median)
Cognitive ability, age 7 0.25 (1.1) 0.05 (1.1) <0.001 0.32
Non-cognitive ability, age 11 6.95 (7.7) 7.74 (8.7) 0.21 0.06
Parents say pupil unhappy at primary school, age 7 5.6% 6.9% 0.50 0.08
Parents want pupil to stay in school 78% 77% 0.51 0.60
Male 48% 49% 0.30 0.83
No. of acute illnesses child suffered up until age 7
 0 17% 20%
 1 30% 27%
 > 1 53% 53% 0.13 0.55
No. of times child was hospitalized up until age 7
 0 69% 71%
 1 25% 23%
 > 1 6% 6% 0.31 0.10
Height at age 7 (in cm) 122.7 (5.6) 123.0 (5.7) 0.48 0.37
Epilepsy in family, until age 7 7.4% 7.5% 0.91 0.22
Presence of chronic illnesses in the family 2.3% 2.6% 0.50 0.67
Father SES
 Low 18% 19%
 Middle 53% 54%
 High 29% 26% 0.23 0.11
Financial hardship episode (until child age 7) 5.1% 6.6% 0.24 0.36
% of population unemployed or disabled 4.3% 4.3% 0.67 0.74
% of women in formal employment 40.3% 42.2% 0.30 0.20
% of population in employed in mining and manufacturing 35.7% 37.6% 0.001 0.03
% of population in employed in agriculture 2.9% 2.6% 0.30 0.65
% of population in professional / managerial jobs 13.0% 12.3% 0.01 0.54
% of population in unskilled workers 6.5% 6.7% 0.04 0.48
% of population who is home owner 50.0% 47.1% <0.001 0.19
% of population who is immigrant 1.8% 1.6% 0.29 0.93

4.2 First stage regression results

Our instrumental variable was strong in predicting selection of comprehensive schooling (F-state = 84, p< 0.0001). Importantly, Table 3 also shows that the differences in covariates are almost eliminated across the median of our instrument;14 this balancing of observed covariates over the distribution of the instrument is reassuring of its ability to deal with the issues of selection and unobserved heterogeneity.

Figure 1 shows the distribution of the values of the propensity score amongst treated and untreated observations. There are only 20 individuals outside the common support, which are dropped before running the second-stage regression.

Figure 1.

Figure 1

Distribution of the propensity score P(X, Z): observations within and off common support

Also, our IV is not rejected by the falsification test suggested by Pischke and Manning (2006). We estimate the impact of the comprehensive school on the probability of suffering from at least four childhood morbidities at age 11. The estimated ATE, TT, and TUT from this placebo regression approach show null effects: −0.01 (0.08), −0.001 (0.09) and −0.02 (0.07) and they are not statistically significant.

4.3 Second stage regression results: health outcomes

We examine the effect of experiencing the comprehensive schooling system on our outcomes of interest, at three different points of the life-cycle: ages 23, 33 and 42. We also focus on retrieving the PeT effects for each individual in our sample, which are then averaged to calculate the average treatment effect (ATE), the treatment effect on the treated (TT) and the treatment effect on the untreated (TUT). In addition to these overall effects, the PeT effects are used to study the following:

  1. Differential drop-out: Figure 2 shows the distributions of PeT effects at ages 23 or 33 between those who would continue to participate in the next wave versus those who would drop out before the next wave. We do not observe any systematic differences in these distributions for any of the outcomes.

  2. Time dependence of effects: Figure 3 illustrates the PeT effects on the probability of depression at age 23 and age 42 for the same individuals. Figure 3 illustrates the PeT effects on the probability of depression at age 23 and age 42 for the same individuals. Comprehensive schooling, compared to selective schooling, appear to increase the probability of depression at age 42 for 90% of the sample. That is a big jump from 49% at age 23. We find persistence of the effect of comprehensive schooling over age among individuals who negatively affected beginning at age 23; most of them continue to be negatively affected at age 42. Similar persistence effects were found for other outcomes. However, among individuals who were not affected by comprehensive schooling at age 23, and even may have experienced slight positive effects (i.e., negative PeT effects), most of them would end up experiencing increases in the probability of depression by age 42 attributable to comprehensive schooling (North-west quadrant in Figure 3). It is plausible that realization of ability gap, which may have been masked under the comprehensive schooling system, have caught up with these individuals at a later age. However, this is only conjecture as we do not have sufficient data to pinpoint the channels through which such effects are working.

  3. Effects on multi-dimensional outcomes: Table 4 shows the effect of comprehensive schooling on the probability of cohort members suffering from depression, having a long-standing illness and self-assessed excellent health and smoking. In general, we do not find significant evidence of essential heterogeneity for any of the outcomes. The IV-OLS average effects are all similar to the ATEs estimated via LIV methods. Over the ages, the effect of comprehensive schools seems to have increased the probabilities of depression and long-standing illnesses and decreased the probability of excellent self-assessed health. Although none of these effects reach significance at the 5% level, the large point estimate indicates that accepting null effects based on these results would be premature. The absence of essential heterogeneity does not signify the absence of heterogeneity in effects since the observed risk factors may still moderate these effects. Indeed, we see that the point estimates of the unconditional TT and TUT effects are different from each other, especially for the probabilities of depression and long-standing illnesses.

  4. Heterogeneity: Figure 4 shows the PeT effects of comprehensive schooling on the probability of depression among untreated individuals, i.e. those who experienced the selective system of education. Interestingly, the mental health losses associated with comprehensive schooling would be similar amongst cohort-members who attended secondary modern schools (the less academically intensive schools) or the grammar schools, highlighting that cognitive abilities, which distinguish these two groups of students, may not be a driving factor for these effects.

    However, if we perform univariate variance decomposition of the PeT effects, which estimates how much of the overall variation in the PeT effects on the probability of depression at age 42 by each of the observed baseline factors, we find six univariate factors that each explain at least 5% of the variance in the PET effects (Table 5). Both cognitive and non-cognitive scores in childhood are identified amongst these factors, indicating univariate CATEs, as typical in most subgroup analysis, would have identified cognitive scores as a factor explaining heterogeneity in the effects of comprehensive schooling over selective schooling. That would be at odds with our previous evidence and shows the value of developing a prediction algorithm using the PeT effects.

  5. Prediction algorithm. Since it is important to identify the gainers and losers from comprehensive schooling over selective schooling. we use a stepwise backward logistic regression approach on the PeT effects, with very conservative inclusion criteria of p-value < 0.0001, to jointly identify the baseline factors that are most influential in predicting the individuals for whom comprehensive school would have significantly increased (at p <=0.10) in probability of depression by age 42. The results, presented in Table 6, show the key factors that identify these individuals do not include cognitive ability, in line with what we found in Figure 4. Nonetheless, children with one point increase in their social adjustment scores (which increase in non-cognitive difficulties) are at a 10% greater risk of a significant increase in the probability of depression at age 42.

Figure 2.

Figure 2

Person-centered Treatment (PeT) effects over time.

Figure 3.

Figure 3

Dependence between PeT effects of comprehensive schooling on depression at ages 23 (horizontal axis) and 42 (vertical axis)

Table 4.

Effect of comprehensive schooling on depression at ages 23, 33 and 42

Effect on Pr(Depression)
at Age 23
Mean (s.e.)
at Age 33
Mean (s.e.)
at Age 42
Mean (s.e.)
N 3426 2831 2775
Unadjusted .001 (.009) 0.014 (0.009) 0.015 (0.012)
IV-OLS 0.04 (0.06) 0.10 (0.06) 0.16 (0.08)
PeT-based
 ATE 0.02 (0.08) 0.09 (0.11) 0.14 (0.11)
 TT −0.01 (0.08) 0.07 (0.17) 0.11 (0.08)
 TUT 0.05 (0.09) 0.12 (0.17) 0.17 (0.15)

Effect on Pr(Long Standing Illnesses)
at Age 23
Mean (s.e.)
at Age 33
Mean (s.e.)
at Age 42
Mean (s.e.)

N 3426 2831 2775
Unadjusted .001 (.009) 0.014 (0.009) 0.015 (0.012)
IV-OLS 0.07 (0.05) 0.12 (0.09) 0.08 (0.10)
PeT-based
 ATE 0.08 (0.08) 0.12 (0.15) 0.07 (0.17)
 TT 0.05 (0.05) 0.08 (0.12) 0.06 (0.16)
 TUT 0.11 (0.13) 0.17 (0.15) 0.07 (0.18)

Effect on Pr(Excellent Self Assessed Health)
at Age 23
Mean (s.e.)
at Age 33
Mean (s.e.)
at Age 42
Mean (s.e.)

N 3426 2831 2775
Unadjusted .001 (.009) 0.014 (0.009) 0.015 (0.012)
IV-OLS 0.04 (0.11) 0.22 (0.13) −0.12 (0.11)
PeT-based
 ATE 0.01 (0.13) 0.20 (0.18) −0.13 (0.16)
 TT 0.01 (0.13) 0.18 (0.18) −0.13 (0.18)
 TUT 0.01 (0.13) 0.21 (0.19) −0.13 (0.15)

Effect on Pr(Smoking)
at Age 23
Mean (s.e.)
at Age 33
Mean (s.e.)
at Age 42
Mean (s.e.)

N 3426 2831 2775
Unadjusted .034 (.02) .03 (.017) .017 (.016)
IV-OLS 0.16 (0.15) 0.19 (0.12) 0.05 (0.10)
PeT-based
 ATE 0.10 (0.14) 0.15 (0.12) 0.05 (0.10)
 TT 0.11 (0.14) 0.13 (0.11) 0.04 (0.10)
 TUT 0.10 (0.13) 0.17 (0.13) 0.05 (0.10)

Figure 4.

Figure 4

PeT effects of comprehensive schooling on the untreated (i.e. cohort-members who did not attend comprehensive schools)

Table 5.

Variance decomposition: factors explaining variation in individual effects on depression at Age 42

Factors % of total variance in PeT effects on Depression at Age 42 explained by each factor

Parent say pupil unhappy at primary school, age 7 22
Non-cognitive ability, age 11 16
Height, age 7 14
Cognitive ability, age 7 12
No. of hospitalizations, by age 7 9
Parents want pupil to stay in school 6
Male 5

NOTE: Only factors explaining at least 5% of total variance are reported

Table 6.

Primary identifiers for individuals for whom comprehensive school would have significant increase in probability of depression by age 42. (Stepwise backward logistic with very stringent inclusion criteria. i.e. LR p-value < 0.0001)

Covariates Odds Ratio

Non-cognitive ability, age 11 1.1 (0.01)
Parents say pupil unhappy at primary school, age 7 0.02 (0.01)
Parents want pupil to stay in school 36.8 (12.6)
Male 28.6 (6.8)
No. of times child was hospitalized up until age 7: 1 (ref 0) 0.03 (0.01)
No. of times child was hospitalized up until age 7: > 1 4.5 (1.2)
Father SES - middle 0.03 (0.06)
% of population in employed in mining and manufacturing 10.1 (4.6)

4.4 Transmission channels: quality of schooling, cigarette smoking and health

To explore whether the effect of comprehensive schooling on long-term health outcomes such as depression and long-standing illnesses could have been mediated through its influence on lifestyle, we explore the association of PeT effects on smoking behavior at age 23 with the PeT effects on depression and long-standing illness at age 42 for the same individuals. Figure 5 shows that there appear to be strong positive correlations in both cases. The positive PeT effects of comprehensive schooling on smoking track positive effects on probabilities of suffering from depression (rho = 0.47; 95% CI: 0.13, 0.81) and long-standing illness (rho = 0.27; 95%CI: −0.10, 0.64) at age 42. In contrast, we find that the correlation between the PeT effects for smoking and long-standing illness or smoking and depression are close to zero at age 23 (rho = 0.06 and 0.09 respectively). This contrast in correlations corroborates the role of cigarette smoking as a transmission mechanism of the impact of quality of schooling on health outcomes, especially depression.

Figure 5.

Figure 5

Dependence between PeT effects of comprehensive schooling on cigarette smoking at age 23 (horizontal axis) and (a) probability of depression at age 42 (vertical axis) and (b) probability of longstanding illnesses at age 42 (vertical axis)

5. DISCUSSION

We studied the effect of exposure to selective versus non-selective schooling system among secondary school students by exploring the implementation of the comprehensive education reform in England and Wales which, rather than increasing the school leaving age, exposed pupils of the same age cohort to qualitatively different secondary schooling systems. We estimated the effects on long-term health and lifestyle outcomes within a causal framework and explored heterogeneity in these effects.

Similar to the story that Heckman et al. (2014) find regarding the effects of length of schooling health outcomes, especially mental health, our analysis shows that the ‘devil’ is in the distribution (Conway and Deb 2005). In our work, exposure to comprehensive schooling system, compared to a selective schooling system in adolescence, exposes students to a different set of curriculum and distribution of teacher quality and also a mix of peers with a much broader range of cognitive and non-cognitive skills. Our results indicate that the average impacts of such exposure led to an increased smoking prevalence, that peaks at age 33, and increase probability of depression and long-standing illness at ages 33 and 42. However, the average effects, seemingly large, are not statistically significant. In fact, we do not see differences in the average treatment effect and the effect of the treated, where selection into comprehensive schools was driven primarily cognitive ability levels. However, we do find considerable variation in individual treatments effects (in line with Heckman et al. 2014), and the magnitude of these effects are persistent within individuals as they age. We find that if all children in the NCDS cohort were sent to comprehensive schools, about 50% of them would have significant (p< 0.10) increases in the probabilities of long-standing illness or depression at some point in their adulthood (up to age 42). Those who would be significantly worse off on depression score at age 42 due to comprehensive schools are more likely to be men and those who had lower non-cognitive abilities at age 11. These results align well the recent literature on mixing abilities. Card and Giuliano (2014) show that students with lower non-cognitive skills (Plan B students) would not benefit much from exposure to high ability kids. In fact, we show that they may get hurt in the long run. Similar results were also found by Kang et al. (2007) study, which exploits a policy experiment in South Korea15.

One limitation of our analysis is that these results are contingent upon analyzing one cohort. However, this one cohort, which is the population of people born in one specific week in the entire UK, is the only of the large British cohorts whose schooling years are in the transition period of the comprehensive education reform and lines up well with their entry age to secondary schools and the timing of the reform. Thus, it enables us to study these effect right from entry, which is important because the UK government has recently announced plans for re-introducing a system of selective education based in the selective system abolished by the reform and highlights that the debate on the relative merits of comprehensive versus selective schooling is still a topical issue16. Our paper provides evidence that is relevant to this debate17.

Our results can also provide useful evidence to the ongoing debates, in the US and other countries, on the usefulness of grouping students around abilities and targeting of the curriculum. In fact, much of these debates have centered around grouping based on cognitive abilities (e.g. designated gifted schools in the US). More discussions and research are needed to precisely delineate and fully understand the effects of grouping students based on non-cognitive abilities.

Acknowledgments

Jones and Rosa Dias acknowledge funding from the Economic and Social Research Council under grant reference RES-060-0045. Basu acknowledges support from NIH research grants RC4CA155809 and R01CA155329. We are grateful for comments on earlier versions of this work from two anonymous reviewers seminar participants at the University of Bergen, University of Chicago, University of Gothenburg, University of Lugano, University of Venice Ca’ Foscari, University of Vienna, University of Southern Denmark, Monash University, University of Melbourne and the 9th World Congress of the International Health Economics Association in Dublin. Opinions expressed are the authors’ alone and do not reflect those of the University of Washington, NBER, University of York, University of Bergen and Monash University.

APPENDIX A

Figure A1.

Figure A1

Mean levels of standardized covariate levels across comprehensive school penetration rates

Table A.1.

Pre-schooling and secondary school characteristics

Variables
Pre-schooling characteristics
Cognitive ability score (age 7)
Relative rank of cognitive ability (age 11)
Non-cognitive ability (BSAG) score (age 11)
Number of children in primary school class (age 7)
Indicator for unhappy at primary school (age 7)
Indicator for parents’ wanting child to stay in school
Indicator for male
Morbidity index (age 7)
Number of hospitalisations (age 7)
Indicator for diabetes in family
Indicator for epilepsy in family
Indicator for heart disease in family
Indicator for father chronically ill
Indicator for mother chronically ill
Indicator for father’s occupational SES professional
Indicator for father’s occupational SES other non-manual
Indicator for single parent family
Indicator for financial hardship in family (age 7)
Enumeration district: percentage unemployed/long-term sick
Enumeration district: percentage women working
Enumeration district: percentage employed in manufacturing
Enumeration district: percentage emplyed in agriculture
Enumeration district: percentage in professional/managerial occupations
Enumeration district: percentage in other non-manual occupations
Enumeration district: percentage in skilled manual occupations
Enumeration district: percentage in semi-skilled manual occupations
Enumeration district: percentage in unskilled manual occupations
Enumeration district: percentage owner occupiers
Enumeration district: percentage council tenants
Enumeration district: percentage non-white
Enumeration district: percentage immigrants
Indicator for change of residence between LEAs (age 7)
Indicators for Standard Regions
Secondary school characteristics
Indicator for single sex school
Indicator for streaming by ability within school
Pupil-teacher ratio
Ratio of expelled to total pupils

Table A.2.

Detailed measure of educational attainment

%
No grade at CSE, GCE O or A levels 20.6
1+ passes at O level, grades 4 or 5 only 0.6
1+ passes at CSE, grades 4 or 5 only 8.2
1+ passes at CSE, grades 2 or 3 9.5
5+ passes at CSE, grades 2 to 5 13.6
1-4 passes at GCE O level or CSE grade 1 25.5
5 or 6 passesGCE O level or CSE 1 5.0
7+ passes at GCE O level grades A-C, or CSE grade 1 3.5
1 pass at A level, grades A-E 2.9
2 passes at A levels, up to 8pts 3.7
3+ passes at A levels, up to 8pts 2.7
2 passes at A levels and 9+ pts 0.2
3+ passes at A levels and 9+pts 4.0

N=11,086

Note: A level points are allocated as 5, 4, 3, 2 and 1 for grades A-E respectively.

Table A.3.

Breakdown of long-standing illness (LSI) by percentage with specific main conditions (ICD-10)

Wave 7 (age 46)
Infectious & parasitic diseases 0.7
Neoplasms 1.6
Diseases of blood & immune mechanism 1.5
Endocrine, nutritional and metabolic 9.5
Mental and behavioral disorders 5.9
Nervous system 5.9
Eye, ear and mastoid process 4.6
Circulatory system 11.8
Respiratory system 11.4
Digestive system 5.5
Skin 2.1
Muscoloskeletal system 25.7
Genitourinary system 2.0
Congenital malformations 0.3
Undiagnosed illness 1.8
Injury, poisoning etc 5.3
Other LSI/uncoded 4.3

N=2990

Table A.4.

First-stage regression results.

Covariates Coefficient [95% Interval] p-value confidence
Cognitive ability, age 7 −0.09 (0.03) 0.002 −0.15 −0.03
Non-cognitive ability, age 11 0.001
(0.002)
0.6 −0.003 0.01
Parents say pupil unhappy at primary school, age 7 0.02 (0.08) 0.835 −0.15 0.18
Parents want pupil to stay in school 0.11 (0.06) 0.055 0.005 0.23
Male 0.03 (0.04) 0.481 −0.05 0.11
No. of acute illnesses child suffered up until age 7: 1 (ref 0) −0.1 (0.06) 0.111 −0.22 0.02
No. of acute illnesses child suffered up until age 7: >1 −0.03 (0.07) 0.662 −0.16 0.1
No. of times child was hospitalized up until age 7: 1 (ref 0) −0.09 (0.05) 0.05 −0.19 0.0005
No. of times child was hospitalized up until age 7: > 1 −0.22 (0.1) 0.033 −0.42 −0.02
Height at age 7 (in cm) 0.002
(0.004)
0.672 −0.01 0.01
Epilepsy in family, until age 7 −0.07 (0.08) 0.39 −0.23 0.09
Presence of chronic illnesses in the family 0.05 (0.14) 0.694 −0.22 0.33
Father SES - high (Ref: ‘low SES’) −0.07 (0.08) 0.344 −0.23 0.08
Father SES - middle (Ref: ‘low SES’) 0.03 (0.06) 0.667 −0.09 0.14
Financial hardship episode (un till child age 7) −0.02 (0.08) 0.766 −0.19 0.14
Northwest −0.13 (0.19) 0.483 −0.5 0.23
EW riding 0.53 (0.23) 0.02 0.08 0.99
North Midlands −0.07 (0.21) 0.75 −0.47 0.34
Midlands 0.13 (0.17) 0.453 −0.21 0.47
East 0.08 (0.23) 0.706 −0.36 0.53
Southeast −0.08 (0.2) 0.697 −0.48 0.32
Southwest 0.04 (0.21) 0.843 −0.37 0.45
Wales 0.55 (0.24) 0.02 0.09 1.02
% of population unemployed or disabled −0.12 (0.38) 0.755 −0.85 0.62
% of women in formal employment 0.2 (0.12) 0.109 −0.04 0.43
% of population in employed in mining and manufacturing 0.15 (0.2) 0.443 −0.24 0.55
% of population in employed in agriculture −0.21 (0.24) 0.386 −0.68 0.26
% of population in professional / managerial jobs 0.16 (0.23) 0.484 −0.3 0.62
% of population in unskilled workers 0.01 (0.32) 0.974 −0.62 0.64
% of population who is home owner −0.33 (0.09) 0.000 −0.5 −0.16
% of population who is immigrant −0.29 (0.43) 0.5 −1.13 0.55

IV: % comprehensive schools (F-stat for IV is 83) 0.02 (0.004) 0.000 0.008 0.03

Constant −0.44 (0.54) 0.421 −1.51 0.63

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

1

These studies use different data, obtain identification from distinct policy reforms and employ different methodologies, hence their results are not directly comparable. Nonetheless, Royer and Clark (2013) find much smaller effects of post-compulsory years of schooling on longevity then Lheras Muney (2005) and Kippersluis (2011). The possible reasons for this, as well as the limitations of the different studies, are discussed in detail in Royer and Clark (2013).

2

Comprehensive schooling was introduced in Scotland much earlier than in England and Wales, preventing the use of the Scottish sub-sample as an adequate comparison group for types of school and educational qualifications.

3

This subsample is thus not used in our analysis.

4

The earliest comprehensive schools in England and Wales were set up after the Second World War. But the large expansion of comprehensives followed from the policy adopted by the Labour government in 1965. This was implemented through an instruction to local education authorities (LEAs) to plan to convert their schools to comprehensives. In 1970 the new Conservative government, ended the compulsion on local authorities, however, for most the process was well underway. By the mid-1970s most LEAs in England and Wales had converted to the comprehensive system with virtually no secondary modern schools remaining. Many secondary moderns and grammar schools were amalgamated to form comprehensives and a number of new schools were built. As part of the comprehensive reform, the Eleven Plus was abolished as a national examination in England and Wales by the 1976 Education Act. Despite this, the selective system and the existence of grammar schools has persisted in certain areas, such as Kent.

5

In a few cases, pupils whose CSE grades were sufficient transferred to grammar schools or sixth form colleges to complete their A-levels.

6

While the reform affected multiple dimensions of the cohort members schooling experience, evidence from the health literature suggest that the association between many of aspects of the policy (single-sex schools, number of pupils per class, number of pupils expelled from school) and health behaviors and outcomes, is weak both in the short and long run: see for example Jones et al. (2011). Thus, the literature has focused on the effect of the policy through the ability mix.

7

For example, the OECD evaluation of the results of the Programme for International Student Assessment (PISA) - OECD (2005) - devotes great attention to comparing the relative performance of national education systems with different degrees of selectivity.

8

These concerns have been reinforced in Bonhomme and Sauder (2011). In fact it is difficult to discern a clear pattern in the pace and geographic expansion of the reform, as this was determined simultaneously by a myriad of factors, ranging from existing infrastructure and school size to the evolution of educational theories in the 1960s and 1970s, as well as political affiliation of the respective constituencies.

9

As noted by Dearden et al. (2002), these reflect early life circumstances, occur prior to secondary schooling, and are not influenced by subsequent educational choices.

10

These small area data are available for the years 1971 and 1981 under a special license, which imposes restrictions on the handling and usage of the data.

11

At age 11, the first principal component accounts for 85 percent of the joint variation and, strikingly, the weights attached to the three dimensions – 0.583, 0.567 and 0.582 – are virtually identical.

12

This is variable ‘E386’ in the NCDS data dictionary; this information was collected in the 1978 Survey of Public Exams, based on a questionnaire sent to the school attended by NCDS respondents at wave 3.

13

The small number of LEAs where the rate of penetration of comprehensive schools was very close to 100% was systematically different from all the others in terms of observables and was therefore dropped from our analysis.

14

We also study how these observed characteristics (scaled by their corresponding standard deviation) are balanced across the entire range of the IV and not just above or below the median. The observed levels do not seem to vary more than 0.1SD in most cases (Figure A1 in Appendix A). The largest variation across that entire range of the penetration rate was in mining manufacturing occupation rate, but it was less that 1 SD.

15

Our results also align well with the earlier education literature on the comprehensive education reform. Galindo-Rueda and Vignoles (2005) evaluate this reform in terms of educational attainment at age 16. Their estimates indicate that pupils in the top 20% of the cognitive ability distribution do significantly worse under the non-selective (mixed ability) system, relative to the selective one. Also low-ability students did not improve their scores in the mixed ability system, leading the authors to conclude that they did not benefit from having more able peers.

16

The UK Government plans for the re-introduction of early tracking selective schooling includes bringing back the grammar school that were removed by the comprehensive schooling reform. This issue has dominated the front pages in the UK press as seen, for example, here: https://www.theguardian.com/education/grammarschools

17

As mention above, a second limitation of this analysis (which is common to the entire literature on this reform) is its inability to fully disentangle all the channels through which it affected the behavior and outcomes of the NCDS cohort members. This would require sources of independent in its multiple dimensions, which is not available.

Contributor Information

Anirban Basu, Pharmaceutical Outcomes and Policy Program, Department of Pharmacy, and Departments of Health Services and Economics, University of Washington, Seattle, USA And the National Bureau of Economic Research, Cambridge MA, USA.

Andrew M. Jones, Department of Economics and Related Studies, University of York, Department of Economics, York, UK, and University of Bergen and Centre for Health Economics, Monash University, Victoria, Australia

Pedro Rosa Dias, Imperial College Business School, London UK.

References

  1. Basu A. Person-Centered Treatment (PeT) effects using instrumental variables: An application to evaluating prostate cancer treatments. Journal of Applied Econometrics. 2014;29:671–691. doi: 10.1002/jae.2343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bijwaard G, Kippersluis H, Veenman J. Education and health: the role of cognitive ability. IZA DP7648. 2013 doi: 10.1016/j.jhealeco.2015.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bonhomme S, Sauders U. Recovering distributions in difference-in-differences models: a comparison of selective and comprehensive schooling. Review of Economics and Statistics. 2011;93(2):479–494. [Google Scholar]
  4. Borjas G. Assimilation, Changes in Cohort Quality and the Earnings of Immigrants. Journal of Labor Economics. 1985;3(4):463–489. doi: 10.1086/298373. [DOI] [PubMed] [Google Scholar]
  5. Burgess S, Dickson M, Macmillan L. Selective schooling systems increase inequality. Institute of Education working paper. 2014:14–09. [Google Scholar]
  6. Card D, Giuliano L. (NBER Working Paper # 20453).Does gifted education work? For which students? 2014 http://www.nber.org/papers/w20453.
  7. Card A, Rothstein J. Racial segregation and black-white test score gap. Journal of Public Economics. 2007;XCI:2158–2184. [Google Scholar]
  8. Case A, Fertig A, Paxon C. The lasting impact of childhood health and circumstance. Journal of Health Economics. 2005;24:365–389. doi: 10.1016/j.jhealeco.2004.09.008. [DOI] [PubMed] [Google Scholar]
  9. Carneiro P, Crawford C, Goodman A. The impact of cognitive and non-cognitive skills on later outcomes. CEE Discussion Papers 2007 [Google Scholar]
  10. Clark D, Royer H. The Effect of Education on Adult Mortality and Health: Evidence from Britain. American Economic Review. 2013;103(6):2087–2120. doi: 10.1257/aer.103.6.2087. [DOI] [PubMed] [Google Scholar]
  11. Conti G, Heckman J, Urzúa S. The Education-Health Gradient. American Economic Review. 2010;100(2):234–38. doi: 10.1257/aer.100.2.234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Conti G, Heckman J, Urzúa S. Early endowments, education and health. University of Chicago; 2011. (Human Capital and Economic Opportunity Working Group Working paper 2011 011). [Google Scholar]
  13. Conway KS, Deb P. Is prenatal care really ineffective? Or, is the ‘devil in the distribution? Journal of Health Economics. 2005;24(3):489–513. doi: 10.1016/j.jhealeco.2004.09.012. [DOI] [PubMed] [Google Scholar]
  14. Cunha F, Heckman J. The Technology of Skill Formation. American Economic Review. 2007;97(2):31–47. [Google Scholar]
  15. Cunha F, Heckman J. Formulating, identifying and estimating the technology of cognitive and noncognitive skill formation. Journal of Human Resources. 2008 doi: 10.3982/ECTA6551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cunha F, J Heckman J, Schennach S. Estimating the Technology of Cognitive and Noncognitive Skill Formation. Econometrica. 2010;78(3):883–931. doi: 10.3982/ECTA6551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Currie J, Moretti E. Mother’s education and the intergenerational transmission of human capital: evidence from college openings. Quarterly Journal of Economics. 2003;118:1495–1532. [Google Scholar]
  18. Currie J, Stabile M. Socioeconomic status and health: why is the relationship stronger for older children. American Economic Review. 2004;93:1813–1823. doi: 10.1257/000282803322655563. [DOI] [PubMed] [Google Scholar]
  19. Cutler D, Glaeser E. Are ghettos good or bad? Quarterly Journal of Economics. 1997;CXII:827–872. [Google Scholar]
  20. Department of Education and Science. Statistics of Education – 1969. Her Majesty’s Stationary Office; London: 1971. [Google Scholar]
  21. Dearden L, Ferri J, Meghir C. The effect of school quality on educational attainment and wages. Review of Economics and Statistics. 2002;84:1–20. [Google Scholar]
  22. Dupriez V, Dumay X, Vause A. How do school systems manage pupil’s heterogeneity? Comparative Education Review. 2008;52(2):245–273. [Google Scholar]
  23. Echenique F, Fryer R, Kaufman A. Is school segregation good or bad? American Economic Review. 2006;XCVI:265–269. [Google Scholar]
  24. Etilé F, Jones AM. Schooling and smoking among the baby boomers – An evaluation of the impact of educational expansion in France. Journal of Health Economics. 2011;30:4811–831. doi: 10.1016/j.jhealeco.2011.05.002. [DOI] [PubMed] [Google Scholar]
  25. Fletcher J. Social Interactions and Smoking Decisions: Evidence Using Multiple Cohorts, Instrumental Variables, and School Fixed Effects. Health Economics. 2010;19(4):466–484. doi: 10.1002/hec.1488. [DOI] [PubMed] [Google Scholar]
  26. Fletcher J. Peer influences on adolescent alcohol consumption: evidence using an instrumental variables/fixed effect approach. Journal of Population Economics. 2012;25(4):1265–1286. [Google Scholar]
  27. Fordham S, Ogbu JU. Black Students’ School Success: Coping with the Burden of ‘Acting’ White. The Urban Review. 1986;18(3):176–206. [Google Scholar]
  28. Galindo-Rueda F, Vignoles A. (IZA Discussion Paper No. 1245).The heterogeneous effect of selection in secondary schools: understanding the changing role of ability. 2004 [Google Scholar]
  29. Galindo-Rueda F, Vignoles A. The declining relative importance of ability in predicting educational attainment. Journal of Human Resources. 2005;40:335–353. [Google Scholar]
  30. Grimard F, Parent D. Education and smoking: Were Vietnam War draft avoiders also more likely to avoid smoking. Journal of Health Economics. 2007;26:896–926. doi: 10.1016/j.jhealeco.2007.03.004. [DOI] [PubMed] [Google Scholar]
  31. Guryan J. Desegregation and black drop-out rates. American Economic Review. 2004;XCIV:919–944. [Google Scholar]
  32. Heckman JJ, Humphries JE, Veramendi G, Urzúa S. (NBER working paper 19971).Education, health and wages. 2014 [Google Scholar]
  33. Heckman JJ, Urzúa S, Vytlacil E. Understanding instrumental variables in models with essential heterogeneity. Review of Economics and Statistics. 2006;88(3):389–432. [Google Scholar]
  34. Heckman JJ, Vytlacil EJ. Local instrumental variables and latent variable models for identifying and bounding treatment effects. Proceedings of the National Academy of Sciences. 1999;96(8):4730–34. doi: 10.1073/pnas.96.8.4730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Imbens G, Angrist J. Identification and estimation of local average treatment effects. Econometrica. 1994;62(2):467–475. [Google Scholar]
  36. Jesson D. The comparative evaluation of GCSE value-added performance by type of school and LEA. 2000. (University of York discussion paper No. 2000/52). [Google Scholar]
  37. Johnson R. The Health Returns to Education Policies: From Preschool to High School & Beyond. American Economic Review. 2010:188–94. doi: 10.1257/aer.100.2.188. [DOI] [PubMed] [Google Scholar]
  38. Jones AM, Rice N, Rosa Dias P. Long-term effects of cognitive skills, social adjustment and schooling on health and lifestyle: evidence from a reform of selective schooling. Journal of Human Capital. 2011;5(3):342–76. [Google Scholar]
  39. Lundborg P. Having the wrong friends? Peer effects in adolescent substance use. Journal of Health Economics. 2006;25(2):214–33. doi: 10.1016/j.jhealeco.2005.02.001. 2006 Mar. [DOI] [PubMed] [Google Scholar]
  40. Kang C, Park C, Lee MJ. Effects of ability mixing in high school on adulthood earnings: quasi-experimental evidence from South Korea. Journal of Population Economics. 2007;20:269–297. [Google Scholar]
  41. Keastner R. Adolescent cognitive and non-cognitive correlates of adult health. National Bureau of Economic Research; 2009. (Technical report). [Google Scholar]
  42. Kenkel D, Lillard D, Mathios A. The roles of high school completion and GED receipt in smoking and obesity. Journal of Labor Economics. 2006;24:635–660. [Google Scholar]
  43. Kerkchoff AC, Fogelman K, Crook D, Reeder D. Going comprehensive in England and Wales: A Study of Uneven Change. London: The Woburn Press; 1996. [Google Scholar]
  44. Lindeboom M, Llena-Nozal A, Van der Klaauw B. Parental education and child health: evidence from a schooling reform. Journal of Health Economics. 2009;28:109–131. doi: 10.1016/j.jhealeco.2008.08.003. [DOI] [PubMed] [Google Scholar]
  45. Lleras-Muney A. The relationship between education and adult mortality in the United States. Review of Economic Studies. 2005;72:189–221. [Google Scholar]
  46. Oreopoulos P. Estimating average and local average treatment effects of education when compulsory schooling laws really matter. American Economic Review. 2006;96:152–175. [Google Scholar]
  47. Oreopoulous P, Salvanes K. Priceless: The Nonpecuniary Benefits of Schooling. Journal of Economic Perspectives. 2011;25(1):159–84. [Google Scholar]
  48. Perkins S, Graham-Bermann S. Violence exposure and the development of school-related functioning: mental health, neurocognition and learning. Aggressive and Violent Behavior. 2012;17(1):89–98. doi: 10.1016/j.avb.2011.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Pischke S, Manning A. (IZA DP No. 2072).Comprehensive versus selective schooling in England and Wales: what do we know? 2006 [Google Scholar]
  50. Plewis I, Calderwood L, et al. Changes in the NCDS and BCS 70 populations and samples over time. London: Centre for Longitudinal Studies; 2004. [Google Scholar]
  51. Power C, Peckham C. (National Child Development Study User Support Working Paper No. 37).Childhood morbidity and adult ill-health. 1987 [Google Scholar]
  52. Rodgers B, Pickles A, Power C, Collishaw S, Maughan B. Validity of the Malaise Inventory in general population samples. Social Psychiatry and Psychiatric Epidemiology. 1999;34:333–341. doi: 10.1007/s001270050153. [DOI] [PubMed] [Google Scholar]
  53. Rutter M, Tizard J, Whitemore K. Education, Health and Behaviour. London: 1970. [Google Scholar]
  54. Silles M. The causal effect of education on health: evidence from the United Kingdom. Economics of Education Review. 2009;28:122–128. [Google Scholar]
  55. Stott DH. The social adjustment of children: Manual to the Bristol Social Adjustment Guides. London: Hodder and Stoughton; 1987. [Google Scholar]
  56. Van Kippersluis H, O’Donnell O, Doorslaer E van. Long Run Returns to Education: Does Schooling Lead to an Extended Old Age? Journal of Human Resources. 2011;46(4):695–721. [PMC free article] [PubMed] [Google Scholar]

RESOURCES