Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Mar 1.
Published in final edited form as: J Child Psychol Psychiatry. 2020 Jun 2;62(3):339–348. doi: 10.1111/jcpp.13276

School quality ratings are weak predictors of students’ achievement and wellbeing

Sophie von Stumm 1, Emily Smith-Woolley 2, Rosa Cheesman 3, Jean-Baptiste Pingault 3,4, Kathryn Asbury 1, Philip S Dale 5, Rebecca Allen 6, Yulia Kovas 7,8, Robert Plomin 3
PMCID: PMC8298902  NIHMSID: NIHMS1711945  PMID: 32488912

Abstract

Background:

In England, all state-funded schools are inspected by an independent government agency, the Office for Standards in Education (Ofsted). Inspections aim to hold schools accountable and to promote the improvement of education, with the results made available to the public. Ofsted reports intend to index school quality but their influence on students’ individual outcomes has not been previously studied. The aim of the current study was to explore the extent to which school quality, as indexed by Ofsted ratings, is associated with students’ educational achievement, wellbeing, and school engagement.

Methods:

We use an England population-based sample of 4,391 individuals, for whom school performance at age 11 and GCSE grades at age 16 were accessed from the National Pupil Database, and who completed measures of wellbeing and school engagement at age 16.

Results:

We found that Ofsted ratings of secondary school quality accounted for 4% of the variance in students’ educational achievement at age 16, which was further reduced to 1% of the variance after we accounted for prior school performance at age 11 and family socioeconomic status. Furthermore, Ofsted ratings were poor predictors of school engagement and student wellbeing, with an average correlation of .03.

Conclusion:

Our findings suggest that differences in school quality, as indexed by Ofsted ratings, have little relation with students’ individual outcomes. Accordingly, our results challenge the usefulness of Ofsted ratings as guides for parents and students when choosing secondary schools.

Keywords: School quality, Ofsted, wellbeing, educational achievement, school engagement

Introduction

In the England, parents can choose where to send their children to secondary school. To help with this decision-making process, many turn to the reports by the Office of Standards in Education, Children’s Services and Skills (Ofsted). Ofsted is an independent government agency whose purpose is to “inspect and regulate services that care for children and young people” (Ofsted, 2018). The primary aim of these inspections is to drive improvement within schools and hold them to account. School inspections happen once every four years and comprise lesson observations, teacher meetings, paperwork checks, and pupil interviews. Once an inspection has been conducted, a school is awarded an overall effectiveness rating that informs parents and the government of the quality of education that pupils of the school receive. This score falls into one of four categories: ‘Outstanding’ (21% of schools received this rating in 2018), ‘Good’ (64%), ‘Requires Improvement’ (11%) or ‘Inadequate’ (4%; Institute for Government, 2019). Especially for those schools that are deemed to be ‘Outstanding’, this rating can act as a marketing tool, driving up interest from parents, students, potential teachers (Waterreus, 2003) and even house prices (Black, 1999; Gibbons & Machin, 2008; Leech & Campos, 2003). In contrast, schools that are judged to be underperforming suffer reputational damage and special measures are taken to improve the school, including the dismissal of senior managers and teaching staff and the replacement of the school governors by an appointed executive committee (Hutchinson, 2016; Roberts, 2019). These schools will also be placed under further, more frequent inspections. Although there is no doubt that Ofsted serves an important function by inspecting and rating schools’ quality, it is less clear whether differences between schools in Ofsted ratings are associated with educational and social-emotional outcomes for children.

Ofsted inspections

All state-funded schools in England are inspected by Ofsted. In 2017/18, £44 million was spent on 6,079 school inspections, with an average of £7,200 per school inspection (National Audit Office, 2018). The frequency of visits and the length of inspection depends on the school’s existing rating. For example, a school judged to be ‘Good’ at their last inspection will normally receive a one-day short inspection every four years (Ofsted, 2015). At the other end of the rating scale, a school whose overall effectiveness category is judged to be ‘Inadequate’ will receive more regular inspections and can even be closed down (Ofsted, 2015; Roberts, 2019).

After the inspection, schools receive a detailed report which includes the overall effectiveness rating (Inadequate, Requires Improvement, Good or Outstanding). This rating is published by Ofsted for each school and publicly available on the internet. In particular, these reports are deemed useful by parents when deciding where to send their children to secondary school. A survey of 1,000 parents in the UK found that Ofsted ratings were the third most important factor to parents when choosing a school, after location and suitability to the child’s needs (Wespieser, Durbin, & Sims, 2015). A separate report of over 1,000 parents found that Ofsted ratings are the second most important information source for parents choosing schools, after word of mouth from other parents (Ofsted, 2017a).

Ofsted inspections and individual-level outcomes

Why do parents look to Ofsted reports of schools? Because they believe that Ofsted ratings index aspects of school quality that shapes students’ individual outcomes, including their educational achievement and also their wellbeing and happiness (Coldron & Boulton, 1991, 1996). But to what extent does the Ofsted rating of a school actually predict such individual-level outcomes? Although parents and students evidently want to know if going to a better Ofsted-rated school means better exam results or better student wellbeing, we could not find a single published study looking at the association between school-level Ofsted ratings and individual-level outcomes.

However, several studies have tested associations between individual student outcomes and school quality measured in other ways (Karvonen, Tokola, & Rimpelä, 2018), for example student-rated (Keith & Cool, 1992), parent-rated (Gibbons & Silva, 2011), and teacher-rated school quality (Hoy, Hannum, & Tschannen-Moran, 1998), as well as more objective measures of school quality, such as pupil-teacher ratio, percentage of teachers with advanced degrees, and pupil expenditure (Eide & Showalter, 1998). These studies reported small to moderate associations between of school quality on pupil outcomes. For example, an analysis of the Trends in International Mathematics and Science Study (TIMSS) showed that class size, teacher education, and teacher experience, which are objective markers of school quality, are inconsistently and weakly associated with students’ test scores in maths and science across 40 countries (Hanushek & Luque, 2003; see also Hanushek, 1986, for US focused analyses). With regard to social-emotional outcomes, one study of more than 10,000 pupils from the Longitudinal Survey of Young People in England found that school quality was only weakly associated with pupil happiness and wellbeing at school (Gibbons & Silva, 2011).

Overall, existing research converges on the conclusion that ratings of school quality tend to inform and dominate parents’ perceptions of educational excellence, but they are not strongly associated with students’ educational achievement or enjoyment of the learning environment (Gibbons & Silva, 2011; Kutsyuruba, Klinger, & Hussain, 2015). Here, we explore for the first time if Ofsted ratings, which intend to index school quality, fit this pattern or if they indicate a domain of school quality that meaningfully adds to pupil’s individual outcomes.

The present research

Students are non-randomly distributed across schools, because parents’ choice of school for their children depends on a variety of factors, including personal preferences, resources, and schools’ reputation. Furthermore, in some cases, schools use students’ individual characteristics, such as ability or achievement on school entry exams, to select their student population. Thus, any observed associations between school quality and pupils’ individual outcomes may attributable to systematic differences between children that attend different schools (i.e. selection biases). To isolate any unique effects of school quality on student outcomes, it is important to account for child covariates (Karvonen et al., 2018; Rivkin, Hanushek, & Kain, 2005). In the present study, we focus on the influence of Ofsted-rated quality of secondary school on students’ educational achievement, wellbeing and their school engagement, after taking their family background and their prior educational achievement into account. Any remaining differences in achievement gains can be thought of as the school’s influence on academic progress or its ‘added value’.

We use an England representative sample of 4,391 teenagers for whom independent Ofsted quality ratings of their secondary school were available, as well as extensive information on individual outcomes at age 16 and their academic achievement prior to entering secondary school at age 11. Our primary goal was to investigate whether the overall Ofsted ratings were associated with a range of individual student outcomes, including academic achievement, wellbeing and school engagement while accounting for differences between students on entry into the school. We predicted significant but weak associations between Ofsted ratings and individual student outcomes, and we expected these associations to reduce substantially when students’ prior achievement and family background were considered.

Methods

Sample

The sample for this study was drawn from the Twins Early Development Study (TEDS). TEDS is a large, population-based sample of twin pairs born in England and Wales between 1994–1996 and followed from birth to the present day (Rimfeld et al., 2019). Ethical approval for this study was received from King’s College London Ethics Committee. In the present study, we included 4,391 unrelated individuals (one twin randomly from a pair, to preserve independence of data) who attended state school in England at age 16 years, and for whose schools Ofsted school quality ratings were available. In other parts of the UK, specifically in Wales, Northern Ireland, and Scotland, state schools are also regularly inspected, as are private independent schools across the United Kingdom. However, respective inspection agencies and assessment frameworks, scope, and criteria differ between countries. Participants with severe medical or psychiatric problems or whose mothers had severe medical complications during pregnancy were excluded from the analysis. We also excluded those who attended non-mainstream schools such as special schools for those with learning disabilities. The analysis sample included 2,403 females (55%) and 1,988 males (45%). This discrepancy in gender distribution resulted from boys’ or men’s greater attrition relative to girls’ or women’s: the 50% of the boys that were assessed at 18 months (relative to 50% girls) reduced to 49% boys in early childhood, and then to 48% in adolescence, and finally to 45% at age 18 years (Rimfeld et al, 2019), when they were asked to consent to sharing their school’s Ofsted rating. Similar gender differences in attrition have been widely observed (Watson & Wooden, 2009). Written informed consent was given for all participants involved. This sample of 4,391 individuals is broadly representative of the United Kingdom’s population for education and socioeconomic characteristics (see Table S1).

Measures

Ofsted-rated school quality

Headline quality rating.

In the current study, there were 4,391 participants for whom we had the overall Ofsted ratings of the school that they attended at age 16 (‘Overall effectiveness: How good is the school?’). Of these, 27% attended an ‘Outstanding’ school, 47% attended a ‘Good’ school, 22% attended a ‘Requires Improvement’ school, and 4% attended a school rated as ‘Inadequate’. These statistics were roughly similar to the national percentages previously reported (Ofsted, 2017b). Ofsted reports, which include the overall quality rating are publicly available on the internet for all state-funded secondary schools: https://reports.beta.ofsted.gov.uk/https://reports.beta.ofsted.gov.uk/https://reports.beta.ofsted.gov.uk/. Test-retest reliability of Ofsted ratings is not available; however, in 2015/16, Ofsted carried out inspections on the same schools by different inspectors. Of the 24 schools inspected, inspectors agreed on the outcome in 22 cases (National Audit Office, 2018).

Individual items.

Depending on the length of the Ofsted inspection and the risk criteria addressed in their visit, we also had data available on up to 23 individual inspection items, such as “The extent to which pupils contribute to the school and wider community” and “The schools capacity for sustained improvement”. The inter-correlations among the 26 individual Ofsted items revealed moderate to high associations, with an average correlation of r = .59 (see Figure S1). See Table S2 for the individual items, along with their sample sizes, means and standard deviations.

To guide our decision on the most appropriate measure of Ofsted-rated school quality to use, we conducted principal components analysis (PCA) on the 26 individual items (Table S3). The scree plot (Figure S2) and item loadings (Table S4) supported one general ‘school quality’ principal component, explaining 59% of the variance. The extracted unrotated component correlated highly with all 26 individual items (Figure S2; average r = .77), as well as with the Ofsted overall quality rating (Figure S2; r = .93). This suggests that the Ofsted overall quality rating captures what is in common among the individual items. This result justified our use of the overall quality rating in subsequent analyses in order to maximise sample size (N of overall quality rating = 4,391; N of Ofsted extracted component, which requires complete data for all items = 1,114).

Outcomes at age 16

Educational achievement.

At the end of compulsory education, students in the UK sit the ‘General Certificate for Secondary Education’ (GCSE) examinations. Almost all students take the three core subjects: English, mathematics and science. In addition, students take a range of other subjects such as geography, history and art. All subjects were graded from 4 (G, the lowest grade) to 11 (A*, the best possible grade), in line with the GCSE grading system that was in place when the twins were 16 years old (i.e. 2010 to 2012). In the current sample, GCSE results were obtained in three ways: from questionnaires sent via mail; from telephone interviews with twins and their parents; and with data from the National Pupil Database (NPD; https://www.gov.uk/government/collections/national-pupil-database). The NPD is a pupil-level database that matches pupil and school characteristic data to pupil level attainment in England. GCSE scores from NPD and TEDS correlate at .99; therefore, we used NPD ratings when TEDS data was missing. There were 4,379 students who had GCSE data and Ofsted data.

In the present study, we focused on the three core subjects: English, mathematics and science, which are taken by all students. Because English, mathematics and science grades correlated highly (r = 0.70–0.82), we created a GCSE composite requiring at least two grades to be present.

Student-reported school engagement.

At age 16, students answered seven questionnaires about their experience of school engagement, including: teacher-student relations, control over and relevance of schoolwork, peer support for learning, family support for learning, homework behaviour, homework feedback, attitudes to school and peer victimisation. Details of these questionnaires can be found in Appendix S1.

Academic wellbeing.

At age 16, students also answered six questionnaires relating to their academic wellbeing. These questionnaires assessed: academic self-concept, future aspirations and goals, life satisfaction in relation to school, subjective happiness, grit and ambition. Details of these questionnaires can be found in Appendix S1.

Student covariates

To estimate the relationship between school quality and pupil outcomes more rigorously, we considered individual characteristics of students as covariates. We selected two covariates that previous studies have shown to be influential on student achievement: family socioeconomic status and prior achievement (Hemmings, Grootenboer, & Kay, 2011; Sirin, 2005; von Stumm, 2017).

Socioeconomic status.

A measure of family socio-economic status was created by calculating the mean of five measures: maternal and paternal education (measured on a scale from 1–8, where 1 = no education and 8 = postgraduate qualifications), maternal and paternal occupation (indexed by the Standard Occupational Classification on a scale from 1–9, where 1 = elementary administration and service occupations and 9 = managers, directors and senior officials), and maternal age at birth of first child. These measures were collected at first contact, when the study members were on average 18 months old. All measures were standardised to have a mean of 0 and a SD of 1 and at least three measures were required to calculate the arithmetic mean.

Prior achievement.

Children’s academic performance at the end of primary school, which in the UK is a different institution than their secondary school, was assessed with a standardized exam at age 11. The exam spans English, mathematics and science tests. We used the ‘fine point score’ of each of these tests from the NPD (for details on the scoring method, Department of Education, 2017).

Analysis

Associations between Ofsted ratings and individual outcomes

We calculated Spearman’s Rank correlation to explore the relationship between the Ofsted overall quality rating and achievement, wellbeing, and student engagement measures. In addition to investigating individual differences in outcomes, we also estimated the average differences among students attending schools of different quality using ANOVA with polynomial trend analysis and planned contrasts. Trend analysis tests the relationship between the group means (Inadequate/Requires Improvement/Good/Outstanding) comparing linear, quadratic and cubic trends. A linear trend would suggest a proportionate change in the value of the outcome across ordered categories, for example GCSE scores increasing proportionately across each Ofsted categories (Inadequate/Requires improvement/Good/Outstanding). By contrast, quadratic and cubic trends suggest that the relationship between outcome measures (educational achievement, wellbeing, and student engagement) and Ofsted-rated school quality changes across the ordered categories of Ofsted school quality.

To test the influence of Ofsted-rated school quality on individual achievement, independent of student characteristics (family socioeconomic status and prior achievement), we conducted regressions and observed the unique variance explained by Ofsted-rated school quality. We also looked at the unstandardized beta coefficients to estimate the average GCSE difference between different Ofsted-rated schools. Finally, we ran ANCOVA to investigate the adjusted means of the Ofsted-rated school quality categories.

Results

Associations between Ofsted ratings and educational achievement

The Ofsted overall quality rating correlated .21 with students’ GCSE scores, accounting for 4.4% of the variance. Figure 1 depicts the flow of pupils from the four quality categories to GCSE grades. The figure shows that fewer students in Outstanding schools achieved lower grades as compared to students in schools rated ‘Requires Improvement’ or ‘Inadequate’. Despite the mean differences, what is striking is the variability of GCSE grades obtained by students attending schools of different quality. Each school quality category contains students who achieved a wide mix of grades at GCSE.

Figure 1.

Figure 1.

Flow of Ofsted ratings to GCSE grades.

Turning to our analyses of means, a linear trend best described the relationship between the Ofsted school quality categories and students’ educational achievement (F = 201.96, p = 7.68 × 10−45; Table S5). The difference between Inadequate and Requires Improvement schools was a third (.33) of a grade (t = 3.06, p <.05), which was similar to the difference between Requires Improvement and Good (0.30 of a grade; t = 6.35, p <.001), and to the difference between Good and Outstanding (0.34 of a grade; t – 7.78, p <.001). The biggest GCSE difference was therefore between those attending Inadequate schools and those attending Outstanding schools, with almost a grade difference (0.94 of a grade; t = 9.93, p <.001). Students attending Inadequate schools scored on average a GCSE grade of C (M = 8.17, SD = 1.23), whereas those in Outstanding schools had a mean GCSE grade of B (M = 9.11, SD = 1.20).

Once we controlled for student covariates, the variance in GCSE predicted by the Ofsted overall quality rating fell from 4.4% to less than 1% (Table S6). Furthermore, the unstandardized beta associated with the Ofsted overall quality rating (B = .13) indicated that the average GCSE difference between the categories (Inadequate/Requires Improvement/Good/Outstanding) was now approximately one tenth of a grade, which was confirmed by the ANCOVA with pairwise comparisons (Table S7). At the extremes, between Inadequate and Outstanding schools, the grade difference was 0.4 (p = 2.91 × 10−9). The GCSE difference between attending an Ofsted-rated ‘Good’ school (the most common Ofsted category) and an Outstanding school was approximately 0.1 of a GCSE grade (p = .001), once student covariates are taken into account. Figure 2 shows the raw and unadjusted GCSE means for each Ofsted school quality category.

Figure 2.

Figure 2.

Raw and adjusted GCSE means and 95% confidence intervals. GCSE was graded from 4 (lowest grade: G) to 11 (highest grade: A*).

Associations between Ofsted ratings and students’ wellbeing and school engagement

Spearman’s correlations between the Ofsted overall quality rating and the 14 student-reported measures of wellbeing and engagement ranged from −.04 (Ambition) to .07 (Homework behaviour), with an average correlation of .03 (see Figure S3). After correction for multiple testing, only the correlation between Ofsted ratings and Homework behaviour remained significant. A series of additional ANOVAs supported these results (see Table S8). Figure 3 depicts the means and 95% confidence intervals for wellbeing and school engagement for students in schools rated as Inadequate, Requires Improvement, Good and Outstanding. It shows that students attending ‘Inadequate’ rated schools reported similar levels of happiness, attitudes to school, homework, student teacher relations and ambition as those attending ‘Outstanding’ rated schools.

Figure 3.

Figure 3.

Means and 95% confidence intervals for wellbeing and school experiences measures for students attending schools rated as: Inadequate, Requires Improvement, Good and Outstanding by Ofsted. Note: the maximum scores for each of the scales are in brackets.

Discussion

The purpose of this study was to explore the relationship between school quality as rated by Ofsted and individual-level outcomes for pupils. We found that the Ofsted overall quality rating ‘Overall effectiveness: how good is the school’ accounted for 4.4% of the differences in educational achievement at age 16. However, most of this association could be attributed to family socioeconomic status and prior achievement in primary school. Once the covariates were included, Ofsted ratings of school quality predicted less than 1% of the observed differences in GCSE examination grades. This finding suggests that even the small benefits of school quality for students’ individual outcomes can be largely attributed to schools’ selection of student intake, not to their added value. We also found that Ofsted-rated school quality was a weak predictor of student wellbeing and student engagement. Overall, our findings suggest that individual student outcomes are largely independent of schools’ Ofsted rated quality. Our findings align with earlier reports that pupils’ individual outcomes show little relation with markers of school quality (Gibbons & Silva, 2011; Hanushek & Luque, 2003; Hanushek, 1986).

Ofsted states that their ratings “allow parents to make informed decisions about where to educate their children” (Ofsted strategy 2017–22, p3). Indeed, one of Ofsted’s priorities is to make their reports “better focused on the issues that parents care about when choosing or seeking assurances about a school” (p9). However, we find that the factors that parents care about most – educational achievement and students’ wellbeing – are negligibly predicted by Ofsted ratings. Pupils’ average GCSE difference between schools of varying quality was just a tenth of a GCSE grade. Put another way, attending a ‘Good’ school over a ‘Requires improvement’ school is associated with a GCSE boost of just 0.1 of a grade on average.

By accounting statistically for student covariates, such as prior achievement, in the prediction of GCSEs we generate a proxy of academic progress. Academic progress (referred to as ‘Progress 8’ by the Department for Education) is calculated as achievement at age 16 independent of previous achievement at 11, and is thought to index value added by schools. In other words, academic progress is students’ change (i.e. gains and losses) in school performance between the age of 11 and 16 years. In the present study, we find that Ofsted-rated quality of a school has little relation with the progress students make during secondary school.

This finding is important for two reasons. Firstly, in a survey of parent views (Ofsted, 2017a), 32% of parents with children aged up to 18 years said that they would want to find out about children’s progress in maths at a school when deciding on which school to send their child to. However, if this is weakly predicted at secondary-school level by Ofsted-rated school quality, then parents may want to prioritise other factors when choosing secondary schools, for example the physical distance between the family home and school. Secondly, it highlights that the examination differences between students attending different Ofsted-rated quality schools are largely accounted for by the school’s student population intake: schools with higher Ofsted ratings admit better performing students (cf. Hutchinson, 2016). This is in line with previous research suggesting that when schools are responsible for their own admissions, they are more likely to select more able pupils (Rimfeld, et al., 2019; Rivkin et al., 2005; Smith-Woolley et al., 2018; West, 2006).

Although achievement outcomes are important to parents, they are not the only reason why parents opt to send their children to one school over another (Coldron & Boulton, 1991, 1996). The factors most often cited in the literature on parental choice in education are student happiness, wellbeing, and pupil behaviour. In the present study, we find that the correlations between Ofsted ratings and measures of student wellbeing and school engagement were very small (average r = .03) and non-significant. This suggests school quality, as rated by Ofsted, has little influence on individual-level wellbeing factors. Put another way, students attending schools with the worst Ofsted ratings report similar levels of happiness, bullying, future aspirations, satisfaction with school, and ambition as those students attending schools with the highest Ofsted ratings. These results are in line with previous research that showed that parent-rated school quality is not strongly associated with pupil happiness and wellbeing at the school (Gibbons & Silva, 2011).

There are several limitations to our study. First, we did not consider the impact of school quality at younger ages. The present study focuses on Ofsted reports of secondary schools only. School quality may be more important at younger ages. Indeed, a review of primary school quality on academic achievement across 29 countries concluded that the quality of primary schools and teacher quality contributed meaningfully to student achievement, especially in low-income countries (Heyneman & Loxley, 1983). In the present study, we go some way to account for differences between pupils when they enter secondary school by controlling for their prior achievement and family socioeconomic status. However, for testing potential cumulative effects of school quality across primary and secondary education, comprehensive longitudinal research is needed that elucidates the academic trajectories of students as they move through schools of varying quality.

Another limitation of the present study is the lack of objective measures of student wellbeing and student engagement. Instead, we analysed data from 14 self-report measures. It is possible that students would be happier at different schools; yet, because they only have experience of attending their own school, they lack a comparative perspective. One way to explore this possibility would be to look at students who have attended multiple schools of varying quality and compare their wellbeing and satisfaction levels at each school. However, these students may not be representative of the student population and are often moved for a reason, such as family separation, military deployment, exclusion or bullying. Indeed, students who switch schools are, on average, from lower income families and have greater behaviour problems and social interaction difficulties (Gasper, DeLuca, & Estacion, 2012; Sorin & Iloste, 2006). Furthermore, Ofsted ratings were only available for the secondary schools that students in our sample attended at age 16 but any secondary school changes that they may have experienced earlier were not recorded.

A final limitation to note is that the current sample was drawn from a twin study. Although we only used one twin from a pair for the current study, being a twin might influence the results. However, our sample appears to be largely representative of the general population for achievement (Table S5) and previous research has shown twins to be broadly representative of the general population for health (Andrew et al., 2001), personality (Johnson, Krueger, Bouchard, & McGue, 2002), psychiatric problems (Kendler, Martin, Heath, & Eaves, 1995), emotional/behavioural problems (Moilanen et al., 1999), and educational achievement (Rimfeld et al., 2019).

Conclusion

In the current study, we find that Ofsted-rated school quality is a weak predictor of secondary-school outcomes at age 16, including educational achievement, wellbeing, and student engagement, once student characteristics have been taken into account. These findings call into question the usefulness of Ofsted ratings as a guide for parents who are looking for information to make an informed choice for their children’s secondary school. Furthermore, our study contests the notion that Ofsted inspections, which are perceived as exhausting, stressful, and demoralizing by teachers and other school staff (de Wolf & Janssens, 2007; Hopkins et al., 2016), capture differences in school quality that matter for students’ individual outcomes.

Supplementary Material

supinfo

Appendix S1. Supplementary measures.

Table S1. Representativeness of the current sample.

Table S2. Sample sizes, means and standard deviations (SD) for Ofsted ratings.

Table S3. Principal component analysis of Ofsted items.

Table S4. Ofsted individual item loadings.

Table S5. Analysis of variance with polynomial trend analysis and planned contrasts of GCSE scores between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate.

Table S6. Results from multiple regression analysis predicting examination results at age 16 (GCSEs) from student covariates and Ofsted Headline Quality Rating.

Table S7. Analysis of covariance of GCSE scores between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate, accounting for covariates of prior achievement and socioeconomic status.

Table S8. Analysis of variance with polynomial trend analysis of school engagement and wellbeing measures between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate.

Figure S1. Correlation matrix of Ofsted individual items.

Figure S2. Scree plot showing eigenvalues for each principal component after performing PCA on individual Ofsted items.

Figure S3. Spearman correlation coefficients (rs) for the relationship between measures of student wellbeing and engagement and the Ofsted headline quality rating.

Key points and relevance.

What’s known

In England, a government agency inspects all state-funded schools to objectively assess differences among schools in the quality of the education that children receive. The inspection reports are widely used by parents to guide which school they will send their children to.

What’s new

Our findings show that differences in school quality, as indexed by government inspections, have little influence on students’ educational achievement, wellbeing, and school engagement.

What’s relevant

Parents who are looking for information to make an informed choice for their children’s secondary school may be ill-advised to draw conclusions about individual student outcomes based on government school inspection reports.

Acknowledgements

R.P. is supported by a Medical Research Council Professorship award (G19/2). S.v.S. is supported by a Jacobs Foundation Early Career Fellowship (2017–2019). R.C. is supported by an ESRC studentship. The authors gratefully acknowledge the on-going contribution of the participants in the Twins Early Development Study (TEDS) and their families. TEDS is supported by a program grant to R.P. from the UK Medical Research Council (MR/M021475/1 and previously G0901245), with additional support from the US National Institutes of Health (AG046938) and the European Commission (602768; 295366). Data for this study came from the Twins Early Development Study (TEDS) and the National Pupil Database. Researchers can apply for access to both (TEDS: https://www.teds.ac.uk/researchers/teds-data-access-policy; NPD: https://www.gov.uk/guidance/how-to-access-department-for-education-dfe-data-extracts). The authors have declared that they have no competing or potential conflicts of interest.

Footnotes

Conflict of interest: No conflicts declared.

Supporting information

Additional supporting information may be found online in the Supporting Information section at the end of the article:

References

  1. Andrew T, Hart DJ, Snieder H, de Lange M, Spector TD, & MacGregor AJ (2001). Are twins and singletons comparable? A study of disease-related and lifestyle characteristics in adult women. Twin Research and Human Genetics, 4(6), 464–477. [DOI] [PubMed] [Google Scholar]
  2. Black SE (1999). Do better schools matter? Parental valuation of elementary education. The Quarterly Journal of Economics, 114(2), 577–599. [Google Scholar]
  3. Coldron J, & Boulton P (1991). ‘happiness’ as a criterion of parents’ choice of school. Journal of Education Policy, 6(2), 169–178. [Google Scholar]
  4. Coldron J, & Boulton P (1996). What do Parents Mean when they Talk About ‘Discipline’in Relation to their Children’s Schools? British Journal of sociology of education, 17(1), 53–64. [Google Scholar]
  5. De Los Reyes A, & Kazdin AE (2005). Informant discrepancies in the assessment of childhood psychopathology: a critical review, theoretical framework, and recommendations for further study. Psychological bulletin, 131(4), 483. [DOI] [PubMed] [Google Scholar]
  6. Department for Education. (2017). National Pupil Database, Key Stage 4, Tier 4, 2002–2016: Secure Access. 4th Edition. UK Data Service. SN: 7627, 10.5255/UKDA-SN-7627-4. [DOI] [Google Scholar]
  7. Eide E, & Showalter MH (1998). The effect of school quality on student performance: A quantile regression approach. Economics letters, 58(3), 345–350. [Google Scholar]
  8. Gasper J, DeLuca S, & Estacion A (2012). Switching schools: Revisiting the relationship between school mobility and high school dropout. American Educational Research Journal, 49(3), 487–519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Gibbons S, & Machin S (2008). Valuing school quality, better transport, and lower crime: evidence from house prices. Oxford Review of Economic Policy, 24(1), 99–119. [Google Scholar]
  10. Gibbons S, & Silva O (2011). School quality, child wellbeing and parents’ satisfaction. Economics of Education Review, 30(2), 312–331. [Google Scholar]
  11. Haworth CM, Davis OS, & Plomin R (2013). Twins Early Development Study (TEDS): a genetically sensitive investigation of cognitive and behavioral development from childhood to young adulthood. Twin Research and Human Genetics, 16(1), 117–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hemmings B, Grootenboer P, & Kay R (2011). Predicting mathematics achievement: The influence of prior achievement and attitudes. International Journal of Science and Mathematics Education, 9(3), 691–705. [Google Scholar]
  13. Heyneman SP, & Loxley WA (1983). The Effect of Primary-School Quality on Academic Achievement Across Twenty-nine High- and Low-Income Countries. American Journal of Sociology, 88, 6, 1162–1194. [DOI] [PubMed] [Google Scholar]
  14. Hopkins E, Hendry H, Garrod F, McClare S, Pettit D, Smith L, et al. (2016). Teachers’ views of the impact of school evaluation and external inspection processes. Improving Schools, 19(1), 52–61. [Google Scholar]
  15. Hoy WK, Hannum J, & Tschannen-Moran M (1998). Organizational climate and student achievement: A parsimonious and longitudinal view. Journal of School Leadership, 8, 336–359. [Google Scholar]
  16. Hutchinson J (2016). School inspection in England: Is there room to improve? Report; Education Policy Institute. [Google Scholar]
  17. Institute for Government (2019). https://www.instituteforgovernment.org.uk/explainers/schools-10-key-facts. Accessed 07 October 2019.
  18. Johnson W, Krueger RF, Bouchard TJ, & McGue M (2002). The personalities of twins: Just ordinary folks. Twin Research and Human Genetics, 5(2), 125–131. [DOI] [PubMed] [Google Scholar]
  19. Karvonen S, Tokola K, & Rimpelä A (2018). Wellbeing and academic achievement-differences between schools from 2002 to 2010 in the Helsinki Metropolitan Area. Journal of School Health, 88(11), 821–829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Keith TZ, & Cool VA (1992). Testing models of school learning: Effects of quality of instruction, motivation, academic coursework, and homework on academic achievement. School Psychology Quarterly, 7(3), 207. [Google Scholar]
  21. Kendler KS, Martin NG, Heath AC, & Eaves LJ (1995). Self-report psychiatric symptoms in twins and their nontwin relatives: Are twins different? American Journal of Medical Genetics, 60(6), 588–591. [DOI] [PubMed] [Google Scholar]
  22. Kutsyuruba B, Klinger DA and Hussain A (2015). Relationships among school climate, school safety, and student achievement and well-being: a review of the literature. Review of Education, 3, 103–135. doi: 10.1002/rev3.3043 [DOI] [Google Scholar]
  23. Leech D, & Campos E (2003). Is comprehensive education really free?: a case-study of the effects of secondary school admissions policies on house prices in one local area. Journal of the Royal Statistical Society: Series A (Statistics in Society), 166(1), 135–154. [Google Scholar]
  24. Hanushek EA, & Luque JA (2003). Efficiency and equity in schools around the world. Economics of Education Review, 22(5), 481–502. [Google Scholar]
  25. Hanushek EA (1986). The Economics of Schooling Production and Efficiency in Public Schools. Journal of Economic Literature, 24, 3, 1141–1177. [Google Scholar]
  26. Moilanen I, Linna S-L, Ebeling H, Kumpulainen K, Tamminen T, Piha J, et al. (1999). Are twins’ behavioural/emotional problems different from singletons’? European child & adolescent psychiatry, 8(4), S62. [DOI] [PubMed] [Google Scholar]
  27. National Audit Office. (2018). Ofsted’s inspection of schools. London, UK. [Google Scholar]
  28. Ofsted. (2015). School Inspection Handbook: Handbook for Inspecting Schools in England Under Section 5 of the Education Act 2005: Ofsted Manchester. [Google Scholar]
  29. Ofsted. (2017a). Annual parents survey 2017: parents’ awareness and perceptions of Ofsted. London: Office for Standards in Education, Children’s Services and Skills. [Google Scholar]
  30. Ofsted. (2017b). The Annual Report of Her Majesty’s Chief Inspector of Education, Children’s Services and Skills 2016/17. London, UK: Office of Standards in Education. [Google Scholar]
  31. Rimfeld K, Malanchini M, Spargo T, Spickernell G, Selzam S, McMillan A, … Plomin R (2019). Twins Early Development Study: a genetically sensitive investigation into behavioural and cognitive development from infancy to emerging adulthood. 10.31234/osf.io/xqh52 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rivkin SG, Hanushek EA, & Kain JF (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417–458. [Google Scholar]
  33. Roberts N (2019). School inspections in England: Ofsted. House of Commons Library, Briefing Paper, Number 07091, 5 August 2019. [Google Scholar]
  34. Sirin SR (2005). Socioeconomic status and academic achievement: A meta-analytic review of research. Review of educational research, 75(3), 417–453. [Google Scholar]
  35. Smith-Woolley E, Pingault J-B, Selzam S, Rimfeld K, Krapohl E, von Stumm S, et al. (2018). Differences in exam performance between pupils attending selective and non-selective schools mirror the genetic differences between them. npj Science of Learning, 3(1), 3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Sorin R, & Iloste R (2006). Moving schools: Antecedents, impact on students and interventions. Australian Journal of Education, 50(3), 227–241. [Google Scholar]
  37. von Stumm S (2017). Socioeconomic status amplifies the achievement gap throughout compulsory education independent of intelligence. Intelligence, 60, 57–62. [Google Scholar]
  38. Waterreus JM (2003). Lessons in teacher pay: Studies on incentives and the labor market for teachers. University of Amsterdam, Netherlands. [Google Scholar]
  39. Watson N, & Wooden M (2009). Identifying Factors Affecting Longitudinal Survey Response. In Methodology of Longitudinal Surveys, Eds. Groves RM, Kalton G, Rao JNK, Schwarz N, Skinner C and Lynn P. doi: 10.1002/9780470743874.ch10 [DOI] [Google Scholar]
  40. Wespieser K, Durbin B, & Sims S (2015). School Choice: The Parent View. Slough: National Foundation for Educational Research. [Google Scholar]
  41. West A (2006). School choice, equity and social justice: the case for more control. British Journal of Educational Studies, 54(1), 15–33. [Google Scholar]
  42. de Wolf IF, & Janssens FJG (2007). Effects and side effects of inspections and accountability in education: an overview of empirical studies. Oxford Review of Education, 33(3), 379–396. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supinfo

Appendix S1. Supplementary measures.

Table S1. Representativeness of the current sample.

Table S2. Sample sizes, means and standard deviations (SD) for Ofsted ratings.

Table S3. Principal component analysis of Ofsted items.

Table S4. Ofsted individual item loadings.

Table S5. Analysis of variance with polynomial trend analysis and planned contrasts of GCSE scores between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate.

Table S6. Results from multiple regression analysis predicting examination results at age 16 (GCSEs) from student covariates and Ofsted Headline Quality Rating.

Table S7. Analysis of covariance of GCSE scores between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate, accounting for covariates of prior achievement and socioeconomic status.

Table S8. Analysis of variance with polynomial trend analysis of school engagement and wellbeing measures between students attending schools rated as: Outstanding, Good, Requires Improvement and Inadequate.

Figure S1. Correlation matrix of Ofsted individual items.

Figure S2. Scree plot showing eigenvalues for each principal component after performing PCA on individual Ofsted items.

Figure S3. Spearman correlation coefficients (rs) for the relationship between measures of student wellbeing and engagement and the Ofsted headline quality rating.

RESOURCES