Abstract
Non-cognitive skills, such as motivation and self-regulation, are partly heritable and predict academic achievement beyond cognitive skills. However, how the relationship between non-cognitive skills and academic achievement changes over development is unclear. The current study examined how cognitive and non-cognitive skills are associated with academic achievement from ages 7 to 16 years in a sample of over 10,000 children from England and Wales. The results showed that the association between non-cognitive skills and academic achievement increased across development. Twin and polygenic scores analyses found that the links between non-cognitive genetics and academic achievement became stronger over the school years. The results from within-family analyses indicated that non-cognitive genetic effects on academic achievement could not simply be attributed to confounding by environmental differences between nuclear families, consistent with a possible role for evocative/active gene–environment correlations. By studying genetic associations through a developmental lens, we provide further insights into the role of non-cognitive skills in academic development.
Subject terms: Behavioural genetics, Human behaviour
Malanchini et al. find that non-cognitive skills increasingly predict academic achievement over development, driven by shared genetic factors whose influence grows over school years. These effects persist across socio-economic contexts and suggest the importance of fostering non-cognitive skills in education.
Main
Children who are emotionally stable, motivated and capable of regulating their attention and impulses do better in school, independent of their level of cognitive (Cog) ability1–7. These important socio-emotional characteristics have been broadly described as non-cognitive skills (NCS)8. ‘Non-cognitive’ is an imperfect term that primarily serves to differentiate these characteristics from what they are not—performance on standardized tests of cognitive ability. The panoply of non-cognitive skills that predict better educational outcomes can be organized into three partly overlapping domains: motivational factors, self-regulatory strategies and personality traits9.
Twin research has shown that genetic differences between people contribute to their differences in non-cognitive skills. Most domains of non-cognitive skills, including academic motivation10,11, self-regulation12 and personality13, are moderately heritable (~30–50%). In addition, twin studies have found evidence that non-cognitive skills are genetically correlated with academic achievement14,15. That is, some of the same genetic differences that are associated with variation in academic achievement are also associated with non-cognitive skills.
DNA-based methods have confirmed genetic links between non-cognitive skills and academic performance. Genome-wide association studies (GWAS) of educational attainment (EA) (that is, years of formal education completed) have identified genetic variants that are correlated with completing formal education16,17. A polygenic score (PGS) constructed from these GWAS results predicts higher levels of self-control18, more adaptive personality traits (higher conscientiousness, agreeableness and openness to experience) and greater academic motivation19. Additionally, previous GWAS work has identified associations between DNA variants and EA that were independent of cognitive test performance, essentially performing a GWAS of non-cognitive skills20. The genetics of non-cognitive skills were found to be related to conscientiousness, openness to experience, delay of gratification and health-risk behaviours20.
The current study uses both twin and DNA-based methods to expand our understanding of the association between non-cognitive skills and academic achievement over development. We address four key questions (Fig. 1). First, does the strength of the association between non-cognitive skills and academic achievement change over development (from age 7 to age 16 years)? Second, do genetic dispositions towards non-cognitive skills vary in their association with academic achievement across development? Third, to what extent are these associations accounted for by between-family processes, such as environmental influences shared between individuals in a family? Fourth, do non-cognitive genetic associations with academic achievement vary by socio-economic status (SES)?
First, we investigated the links between non-cognitive skills and academic achievement across the school years. Developmental studies that have investigated the association between non-cognitive skills and academic achievement remain scarce and have focused on a few specific measures over relatively short time frames21. Here, we analyse a comprehensive battery of developmental data collected from over 10,000 children born in England and Wales who were followed across compulsory education (Fig. 1, left). Furthermore, we simultaneously consider the role of cognitive skills in these associations. Past research has highlighted how skills that are broadly considered non-cognitive, such as self-control, rely on cognitive competencies22. Therefore, it is important to take into account developing cognitive skills when assessing the relationship between non-cognitive skills and academic achievement over time.
Second, we investigated whether genetic dispositions towards non-cognitive skills become increasingly predictive of academic achievement across development. Twin studies focusing on specific moments in childhood or adolescence23 have found that heritable variation in non-cognitive skills, such as motivation and self-regulation, contribute to academic achievement beyond cognitive competencies24. However, this relationship across development remains underinvestigated. We triangulate evidence across different methods, including twin and PGS analyses, to investigate the association between genetic factors linked to cognitive and non-cognitive skills and academic achievement across compulsory education.
Third, with a sibling-difference design, we examined to what extent the developmental relationship between genetic propensity for non-cognitive skills and academic achievement is accounted for by family-wide environmental processes. Sibling differences in genotypes are randomized by meiosis, such that siblings have an equal probability of inheriting any given parental allele. Therefore, within-sibling pair PGS associations are thought to be less confounded by environmental differences between nuclear families, including population stratification and indirect genetic effects25. Indirect genetic effects refer to the association between the non-transmitted parental genotypes and the offspring phenotype, potentially reflecting rearing environments, although they can also capture broader demographic phenomena, such as assortative mating26.
Conversely, differences between siblings in PGS associations are often referred to as ‘direct’ genetic effects27 in that they are consistent with a causal effect of genetic variants within an individual on their phenotype. However, even direct genetic effects involve mediation through environmental processes. For example, children with a greater motivation towards academic achievement might actively select, modify and create environmental experiences that foster further achievement, such as deciding to take advanced classes28. That is, genetic differences between children can result in differential exposure to learning environments, which, in turn, can affect their academic achievement29. These active/evocative gene–environment correlations amplify the effects of genetic difference and are one theorized mechanism for increasing genetic associations over development30,31.
Fourth, we explored whether genetic associations with academic achievement varied by SES. Genetic and environmental processes might interact such that the effects of environmental experiences on a trait might be partly dependent on genetic factors and vice versa32,33. Studies that examined this possibility have focused on the role of socio-economic disadvantage across a broad range of contexts, including family SES34,35 and the school environment36,37. We explore whether the cognitive and non-cognitive PGS prediction of academic achievement differs at different levels of socio-economic disadvantage across development.
Under a developmental lens, these analyses address four core research questions providing a detailed account of the processes through which cognitive and non-cognitive skills are linked to individual differences in academic achievement. We triangulated evidence across multiple genetic methods. Since each method is subject to different and unrelated assumptions and limitations, triangulating multiple methods provides a powerful tool to increase the reliability of our results38.
Results
Associations between non-cognitive skills and academic achievement
Parents, teachers and twins rated different non-cognitive skills at different ages. On the basis of extant literature and measures availability, we focused on two broad dimensions of non-cognitive skills, which were modelled as latent factors (Fig. 1): (1) education-specific non-cognitive skills, including measures of academic interest, attitudes towards learning and academic self-efficacy and (2) domain-general self-regulation skills, including measures of behavioural and emotional regulation not necessarily related to the school context (Fig. 1 and Methods). Here, we report analyses of these two dimensions. Analyses of individual measures are reported in Supplementary Information (Supplementary Note 1 and Supplementary Tables 1b and 2).
Latent factors of education-specific non-cognitive skills and domain-general self-regulation skills (Supplementary Tables 3 and 4) were positively correlated with academic achievement at all developmental stages. Effect sizes differed by rater and developmental stage and tended to increase with age. For example, the association between self-rated education-specific non-cognitive skills and academic achievement increased from r = 0.10 (95% confidence interval (CI) 0.07 to 0.14) at 9 years of age, to r = 0.41 (95% CI 0.38 to 0.44) at 12 years of age and to r = 0.51 (95% CI 0.48 to 0.55) at 16 years of age (Supplementary Note 1, Supplementary Fig. 1 and Supplementary Table 5). Latent non-cognitive factors were also modestly correlated with latent factors of general cognitive ability (Supplementary Table 6) at the same age (Supplementary Table 7).
We examined whether general cognitive ability could account for the associations between non-cognitive skills and academic achievement. Multiple regression analyses showed that both non-cognitive factors were substantially and significantly associated with academic achievement beyond cognitive skills at every stage of compulsory education (Fig. 2a and Supplementary Table 8). The relative association between non-cognitive skills and academic achievement increased developmentally, particularly when considering self-reported measures. For self-reported education-specific non-cognitive skills, the effect size of the relative prediction of achievement increased from β = 0.10 (95% CI 0.06 to 0.13) at 9 years of age (effect size for Cog ability: β = 0.46, 95% CI 0.44 to 0.48) to β = 0.28 (95% CI 0.24–0.32) at 12 years of age (effect size for Cog ability: β = 0.36, 95% CI 0.32 to 0.40) to β = 0.58 (95% CI 0.54 to 0.62) at 16 years of age (effect size for Cog ability: β = 0.39, 95% CI 0.37 to 0.41). A developmental increase was also observed for self-reported measures of domain-general self-regulation skills, for which the predictive power increased from β = 0.11 (95% CI 0.07 to 0.15) at 9 years of age to β = 0.21 (95% CI 0.19 to 0.23) at 16 years of age, after accounting for general cognitive ability (Supplementary Table 8).
Univariate and multivariate twin analyses
Applying twin designs (Methods), we found that the heritability (that is, the extent to which observed differences in a trait are accounted for by genetic differences) of non-cognitive skills differed significantly across raters and developmental stages (Supplementary Note 2, Supplementary Table 9 and Supplementary Figs. 2–6). Heritability estimates of latent non-cognitive factors, which exclude error of measurement, ranged between 70% (95% CI 0.63 to 0.77) for self-reported education-specific skills at 9 years of age and 93% (95% CI 0.91 to 0.96) for parent-reported education-specific non-cognitive skills at 9 years of age (Supplementary Note 2, Supplementary Tables 10 and 11 and Supplementary Fig. 7). These substantial heritability estimates are consistent with previous studies that investigated the heritability of latent dimensions of non-cognitive skills11 and of a general factor of psychopathology across different raters39. The correlation between non-cognitive measures and academic achievement was mostly accounted for by genetic factors and, to a lesser extent, by non-shared environmental factors (Supplementary Note 2 and Supplementary Fig. 6).
We then investigated whether the observed genetic associations between latent non-cognitive factors and academic achievement could be accounted for by genetic factors associated with cognitive skills. We investigated this question with a series of trivariate Cholesky decompositions (Methods), the results of which are presented in Fig. 2b, which reports standardized squared path estimates, and Supplementary Tables 12 and 13, which report standardized path estimates and 95% CIs. Similar to hierarchical regression, the Cholesky approach parses the genetic and environmental variation in each trait into that accounted for by traits previously entered into the model and the variance which is unique to a newly entered trait.
Each bar in Fig. 2b is the outcome of a different trivariate Cholesky decomposition of the heritability of academic achievement (the total length of the bar) into genetic effects associated with non-cognitive skills after controlling for genetic effects associated with cognitive skills at the same age. We found that genetic effects associated with cognitive skills accounted for between 21% and 36% of the total variance in academic achievement, as indicated by standardized paths ranging between 0.46 (95% CI 0.37 to 0.54) and 0.60 (95% CI 0.50 to 0.70). Genetic effects associated with non-cognitive skills, independent of cognitive skills, accounted for between 0.1% and 32.5% of the variance in academic achievement. Standardized paths ranged between 0.01 (95% CI −0.16 to 0.17) for self-reported self-regulation at 9 years of age and 0.57 (95% CI 0.48 to 0.67) for teacher reported education-specific non-cognitive skills at 9 years of age. Last, we found that between 5% and 37% of the variance in academic achievement was independent of genetic effects associated with cognitive and non-cognitive skills. Standardized paths ranged between 0.23 (95% CI 0.13 to 0.33) and 0.61 (95% CI 0.52 to 0.70).
The top three rows of Fig. 2b illustrate the developmental increase in how the genetics of self-reported non-cognitive skills contribute to the genetics of academic achievement. Focusing on education-specific non-cognitive skills, we found that standardized squared path estimates increased from explaining 1% of the total variance in academic achievement at 9 years of age (standardized path estimate of 0.01, 95% CI −0.16 to 0.17) to 4% at 12 years of age (standardized path estimate of 0.16, 95% CI 0.02 to 0.30) and 12% of the total variance in achievement at 16 years of age (standardized path estimate of 0.35, 95% CI 0.26 to 0.44) (Supplementary Tables 12 and 13). This increased contribution beyond cognitive skills was also observed for domain-general self-regulation. See Supplementary Fig. 7 for the full models’ results, which include shared and non-shared environmental estimates.
A PGS of non-cognitive skills
To obtain a PGS for use in subsequent analyses, we first extended previous work using the GWAS-by-subtraction approach to identify genetic variants associated with non-cognitive skills20. Previous GWAS-by-subtraction work leveraged genomic structural equation modelling (SEM)40 and the two GWAS of EA and cognitive performance (CP) to separate the genetic variance in EA into a cognitive component and a residual non-cognitive component. We extended this model in two directions. First, we extended the latent cognitive factor by including GWAS summary statistics from additional cognitive measures (episodic memory; processing speed, executive functions and reaction time)41. Second, we included other socio-economic attainment variables, including Townsend Deprivation and Income42, in addition to EA17. The resulting non-cognitive factor can therefore be defined as genetic variation shared by EA, income and neighbourhood deprivation that is independent of all measured cognitive abilities. Akin to Demange et al.20, we then fitted a Cholesky model (Methods) where indicators of the non-cognitive latent factor (henceforth NonCog) were regressed on the cognitive latent factor (henceforth Cog; Fig. 3a and Supplementary Table 14).
Our Cog and NonCog factors correlated strongly with those obtained from Demange et al.20 (Supplementary Table 15). The genetic correlation (rg) was 0.96 between the Cog factors and 0.93 between the NonCog factors. The rg between Cog and NonCog was 0.15. Supplementary Fig. 8 shows the rg values between the Cog and NonCog genetics and 18 psychiatric, personality and socio-economic traits, which we compared with the rg values obtained by Demange et al.20. The pattern of associations was largely consistent across the two models. However, in some instances, the results diverged. Specifically, with respect to psychiatric traits, autism, anorexia and attention deficit/hyperactive disorder, a larger gap was observed between the Cog and NonCog factors compared to Demange et al., where differences in the correlations were less pronounced or absent. As expected, the results differed most for socio-economic traits, with stronger correlations for NonCog than Cog with longevity (rg = 0.52, 95% CI 0.44 to 0.60, P = 1.04 × 10−45 versus rg = 0.35, 95% CI 0.29 to 0.41, P = 6.40 × 10−31), neighbourhood DE (rg = −0.66, 95% CI −0.74 to −0.58, P = 3.85 × 10−54 versus rg = −0.28, 95% CI −0.36 to −0.21, P = 5.98 × 10−12) and EA (rg = 0.83, 95% CI 0.81 to 0.85, P = 0.00 versus rg = 0.65, 95% CI 0.63 to 0.67, P = 0.00; Supplementary Fig. 8 and Supplementary Table 15).
NonCog PGS effects increase developmentally
We calculated PGS for Cog and NonCog and examined their association with cognitive, non-cognitive and academic phenotypes over development. PGS analyses leverage findings from GWAS and aggregate single-nucleotide polymorphisms (SNPs) across the genome into a single composite index that summarizes genetic influence on a target trait. We calculated PGS as the sum of SNPs at all loci weighted by the effect size of their association (Methods). We first investigated whether and to what extent Cog and NonCog PGS predicted individual differences in non-cognitive skills across development by modelling both PGSs in a multiple regression model (Methods).
In line with our previously obtained results showing a moderate association between Cog and NonCog traits, we found that the Cog PGS significantly predicted variation in non-cognitive skills across development, with standardized effect sizes ranging between β = 0.04 (95% CI 0.001 to 0.079) and β = 0.22 (95% CI 0.18 to 0.26; Supplementary Fig. 9 and Supplementary Table 16). The NonCog PGS, independent of the Cog PGS, predicted observed variation in non-cognitive skills at all developmental stages. Associations were small at earlier ages (for example, β = 0.07, 95% CI 0.03 to 0.11, P (corrected) = 1.93 × 10−3) for parent-reported education-specific non-cognitive skills at 9 years and β = 0.10, 95% CI 0.08 to 0.12, P (corrected) = 2.24 × 10−11 for parent-reported self-regulation at 7 years) but they increased developmentally, particularly for self-reported education-specific non-cognitive measures (β = 0.16, 95% CI 0.12 to 0.20, P (corrected) = 8.30 × 10−17 at 16 years of age). The only exception was observed for self-reported education-specific non-cognitive skills at 9 years, for which the prediction was negative (β = −0.03, 95%CI −0.07 to 0.01) and did not reach significance after accounting for multiple testing (Supplementary Table 16).
In Supplementary Note 3a, we show that this increase in prediction was significant overtime for the NonCog PGS, but not for the Cog PGS. Furthermore, we show that this increase is not explained by the NonCog PGS capturing more Cog variance later in adolescence (Supplementary Note 3b) or by SES (Supplementary Note 3c).
Cog and NonCog PGSs predicted variation in general cognitive ability, verbal ability and non-verbal ability at all developmental stages. As expected, the Cog PGS prediction of cognitive phenotypes was substantially stronger than the NonCog prediction, with estimates ranging between β = 0.19, 95% CI 0.17 to 0.21, P (corrected) = 3.77 × 10−42 and β = 0.27, 95% CI 0.23 to 0.31, P (corrected) = 1.04 × 10−52 for the Cog PGS and between β = 0.10, 95% CI 0.06 to 0.14, P (corrected) = 4.41 × 10−10 and β = 0.18, 95% CI 0.14 to 0.22, P (corrected) 5.51 × 10−21 for the NonCog PGS (Supplementary Fig. 10 and Supplementary Table 16).
Next, we considered the effects of the Cog and NonCog PGS on academic achievement over development. We detected associations between the Cog PGS and achievement as early as 7 years (β = 0.24, 95% CI 0.22 to 0.26, P (corrected) = 3.68 × 10−86), these associations remained largely consistent across development (β = 0.26, 95% CI 0.24 to 0.28, P (corrected) = 2.71 × 10−126 at 16 years of age). Although we observed weaker effects for the NonCog PGS in early childhood (β = 0.10, 95% CI 0.08 to 0.12, P (corrected) = 8.12 × 10−15) compared with the Cog PGS, these increased across development and reached effects comparable to those of the Cog PGS at 16 years (β = 0.22, 95% CI 0.20 to 0.24, P (corrected) = 1.85 × 10−84; Fig. 3b and Supplementary Table 16). The same pattern of associations was observed also when considering achievement in English and mathematics, separately (Supplementary Table 16). This observed increase in the NonCog PGS prediction of academic achievement over development is consistent with transactional models of gene–environment correlation (rGE), driven by NonCog genetics. These PGS predictions were in line with those obtained from the PGSs created using the GWAS-by-subtraction method published by Demange et al. (Supplementary Table 17).
Within-family PGS–achievement associations
Given our observation of an increase in the NonCog PGS associations with academic achievement across development, we extended our pre-registered analyses (https://osf.io/m5f7j/) to examine whether and to what extent this increase was accounted for by family-wide processes. Specifically, using a sibling difference design, we separated the NonCog PGS associations into within-family effects, indexing direct genetic effects from between-family effects, which may include indirect genetic effects and demographic confounding (Methods). We examined Cog and NonCog within and between-family predictions of academic achievement from age 7 to 16 years.
Two main findings emerged from this analysis (Fig. 3c). First, we observed that the effect sizes for the direct effects of NonCog were about half the size of the population-level associations (Supplementary Table 18). Similarly, the prediction from the Cog PGS was reduced by over one-third, consistent with previous evidence43. Second, while the Cog direct and indirect genetic effects did not vary substantially over the developmental period considered (from β = 0.20, 95% CI 0.16 to 0.24, P = 2.75 × 10−20 to β = 0.23, 95% CI 0.19 to 0.27, P = 4.12 × 10−32), NonCog effects showed an increase from age 7 to age 16 (from β = 0.06, 95% CI 0.02 to 0.10, P = 0.005 to β = 0.15, 95% CI 0.11 to 0.19, P = 1.39 × 10−14; Fig. 3c and Supplementary Table 18). These results suggested that the developmental increase in the between-family PGS prediction was mostly driven by NonCog rather than Cog skills. In addition, this developmental increase could be observed for both indirect and direct genetic effects. We conducted sensitivity analyses and replicated the results with the PGSs constructed using the method published by Demange et al. (Supplementary Table 18b).
PGS–SES interactions on achievement
Last, we extended our pre-registered analyses to test whether SES could explain or modify the observed pattern of developmental associations between PGS and academic achievement. We fitted multivariable models at each developmental stage, including Cog/NonCog PGS effects, along with SES at recruitment, covariates and their two-way interactions (Methods) to test whether SES moderated Cog and NonCog PGS effects on academic achievement. After adjusting for SES, the same pattern of relationships was observed, with a relatively stable association between the Cog PGS and achievement, and a steeper increase in the NonCog PGS prediction, even though all effects were attenuated (Supplementary Table 19). We did not detect significant interaction effects between either the Cog or the NonCog PGS and SES (Supplementary Table 19).
Figure 3d depicts mutually adjusted slopes for the Cog and NonCog PGS prediction against academic achievement at different levels of family SES. The figure shows that although higher SES corresponded to greater achievement on average, the slope of the association between the Cog and NonCog PGS and achievement did not differ across socio-economic strata. Higher PGS, for both cognitive and non-cognitive skills, corresponded to higher academic achievement, and higher SES corresponded to both higher mean PGSs and higher achievement, indicating a correlation rather than an interaction between genetic and environmental influences on academic achievement.
Discussion
We investigated the association between cognitive and non-cognitive genetics and academic achievement during compulsory education in a UK-based sample. Four complementary findings emerged. First, non-cognitive skills increasingly predicted academic achievement over the school years, and these associations remained substantial even after accounting for cognitive skills. Second, the links between non-cognitive skills and academic achievement were mainly due to shared genetic factors, whose relative contribution also increased over the school years. For example, the NonCog PGS prediction of academic achievement nearly doubled over the school years while the Cog PGS prediction remained relatively stable. Third, the increasingly important role of non-cognitive genetics persisted even after accounting for family-fixed effects. Fourth, PGS associations with academic achievement over development did not differ across socio-economic contexts. Together, these findings highlight the important role that non-cognitive skills play during primary and secondary education and suggest that fostering such skills might provide an avenue for successful educational strategies and interventions.
The first set of interesting findings about development emerged from twin analyses of the covariance between non-cognitive traits and academic achievement. First, we found that genetic factors accounted for most of the observed correlations between non-cognitive skills and academic achievement at all developmental stages. Second, both phenotypic and genetic correlation values increased developmentally, particularly for self-reported measures of non-cognitive traits. Third, our twin analyses showed that genetic factors accounted for most of the correlations between non-cognitive skills and academic achievement after accounting for cognitive skills. Finally, this independent genetic contribution of non-cognitive skills to academic achievement increased developmentally. This increase was observed for both education-specific non-cognitive skills, where the measures included in the general factors changed developmentally, as well as for domain-general self-regulation skills, for which the same measures were collected at all developmental stages. Therefore, the observed developmental increase in phenotypic and genetic associations independent of cognitive skills is unlikely to be an artefact of inconsistencies in measurement but rather reflects the increasingly important role of non-cognitive skills across compulsory education.
A further aim of the current study was to better understand what was captured by the NonCog PGS constructed using GWAS-by-subtraction20, particularly in relation to what other skills beyond cognitive ability propel students down different educational trajectories. Given the link between SES and academic achievement44, we were specifically interested in whether the NonCog PGS also indexed socio-economic-related factors. To this end, we extended the GWAS-by-subtraction model in two directions. First, with the aim of making a more refined cognitive factor, we added summary statistics from several other GWASs of fluid intelligence. Second, we included GWASs of other traits known to associate with achievement beyond cognitive abilities, specifically targeting SES-related traits, such as household income and social deprivation, making the NonCog PGS factor more explicitly socio-economic relevant.
It should be highlighted that GWAS of SES-relevant measures may be more subject to socio-demographic confounds, such that estimates of SNP effects will also capture population stratification phenomena, such as geographic clustering45. This limitation is particularly relevant for the GWAS of social deprivation as the measure is an area-based score of social inequality. Interestingly, the results obtained from this model paralleled those we obtained when we applied the Cog and NonCog PGSs from the original GWAS-by-subtraction model, which only used EA to define the NonCog factor. This suggests that the PGS measure of non-cognitive skills from Demange et al. may have already captured some SES-related effects. Importantly, our employment of a within-family comparison helped us to mitigate possible confounds associated with uncontrolled population stratification.
Paralleling our multivariate twin results, we observed that the effects of the prediction from NonCog PGS to academic achievement increased from childhood to adolescence, beyond the effects of the Cog PGS. A few explanations are possible for this finding. First, this could be attributable to rGE, which could be passive, evocative or active29,46. Another explanation could be that PGSs become increasingly predictive during development as our sample gets closer in age to the adult samples where GWAS effect sizes were estimated in the case of EA and CP17. However, it is worth noting that this increase in prediction was not observed for the Cog PGS, for which effects on academic achievement were mostly developmentally stable. Moreover, our triangulation of results across multiple methods (including phenotypic and twin analyses) adds support to our finding of these developmental differences between Cog and NonCog genetics.
We applied a within-sibling design43 to test whether environmental variables that are shared by siblings and that potentially confound PGS associations could explain the observed increase in the predictive power of the NonCog PGS. While the contributions of both PGSs were attenuated within family, suggesting a substantial role for environmental confounds shared by family members, an increase in the contribution of NonCog PGS to academic achievement from age 7 to 16 years was still evident when comparing siblings. In contrast, the within-family contribution of the Cog PGS remained relatively stable. The increase in the NonCog PGS prediction at the within-family level is consistent with transactional processes driven by active or evocative rGE30,46,47 for NonCog PGS. As children grow up, they actively evoke or shape their environmental experiences based in part on their genetic dispositions, and these experiences in turn contribute to their academic development. Our findings suggest that children’s educational experiences are increasingly shaped by their propensity towards non-cognitive skills.
To delve deeper into the role of socio-economic factors, we tested whether SES could modify the relationship between Cog and NonCog PGSs and academic achievement over development. While we did not find evidence for interaction effects in this regard, the Cog and NonCog PGS were conditionally independent in a multivariable model including SES, further indicating that the genetics captured by the NonCog factor were at least partly independent of SES-related genetic and environmental effects.
One caveat of these gene–environment interaction analyses is that adjusting for a heritable covariate, such as SES, can yield biased estimates in multivariable models including PGS48,49. Future work is needed to determine whether this is the case, perhaps leveraging results of within-family GWAS to construct PGS for ‘direct’ effects within families50. This limitation also pertains to our within-sibling PGS analyses, as it might be difficult to separate direct and indirect effects using population-based GWAS effects as a starting point51. Follow-up of these analyses employing PGS for direct effects obtained from family-based GWAS will shed light on this potential limitation. It should also be acknowledged that indirect genetic effects on education might reflect social stratification across generations in addition to nurturing processes that operate within nuclear families52. A further caveat of the present work is that, while we investigated genetic effects on non-cognitive skills and their link with academic achievement across development, we did not investigate stability and change using longitudinal models. Future work explicitly investigating developmental change at the phenotypic53, genetic28 and genomic40,54 level, for example, using latent growth models55, will address further developmental questions related to the role of non-cognitive skills in academic development.
To conclude, our study provides an in-depth investigation of the role of non-cognitive genetics in academic development. Triangulating multiple genetic and genomic methods, we found consistent evidence for the increasingly important role that non-cognitive skills play during compulsory education. Genetic dispositions towards non-cognitive skills become increasingly predictive of academic achievement and, by late adolescence, they explain as much variance in achievement as do genetic dispositions towards cognitive skills. Results from within-family and developmental analyses are consistent with theorized transactional processes of active/evocative rGE by which, as they grow up, children evoke and actively select academic environments that correlate with their genetic disposition towards non-cognitive skills29,46. Fostering non-cognitive skills might provide a successful avenue for educational interventions.
Methods
Sample
Participants are part of the Twins Early Development Study (TEDS), a longitudinal study of twins born in England and Wales between 1994 and 1996. The families in TEDS are representative of the British population for their cohort in terms of socio-economic distribution, ethnicity and parental occupation. In total, 10,000 families are still actively involved with the TEDS study over 20 years after the first data collection wave (see ref. 56 for additional information on the TEDS sample). The present study includes data collected in TEDS across multiple waves. Specifically, we analysed data collected over five waves, when the twins were 4, 7, 9, 12 and 16 years old. The sample size and composition differ between collection waves, numbers for all measures included in the study are reported in Supplementary Table 1a. Individuals with severe medical conditions were excluded from analyses. These conditions include detrimental prenatal and postnatal conditions, as well as other conditions that could seriously impact later development. In addition, twins with uncertain and unknown zygosity were excluded from the analyses. TEDS has ethical approval from the research ethics committee of Kings College London (references: PNM/09/10–104 and HR/DP‐20/21–22060). Consent was obtained before data collection at every wave.
Measures
Below we provide a brief description of all the measures included in the present study. Please refer to https://www.teds.ac.uk/datadictionary for detailed descriptions of each measure and information on the items included in each construct.
Education-specific non-cognitive skills
At 9 years of age, data on education-specific non-cognitive skills were collected from parents, teachers and self-reports from the twins. Measures of academic self-perceived ability57, academic interest57 and the Classroom Environment Questionnaire (CEQ58) were available from all raters. The CEQ included the following subscales rated by parents and twins: (1) CEQ classroom satisfaction scale, (2) CEQ educational opportunities scale and (3) CEQ adventures scales that assessed enjoyment of learning. Ratings on the CEQ classroom satisfaction scale were also provided by the teachers.
At 12 years of age, data on education-specific non-cognitive skills were collected from parents, teachers and self-reports. The following measures were collected: academic self-perceived ability57, academic interest57, the mathematics environment questionnaire59 and the literacy environment questionnaire60. The questionnaires asked several questions related to literacy and mathematics, including items, such as ‘reading is one of my favourite activities’, ‘when I read books, I learn a lot’ and ‘in school, how often do you do maths problems from text books?’, all rated on a four-point Likert scale.
At 16 years of age, education-specific non-cognitive skills were assessed via self-reports provided by the twins. The battery of education-specific non-cognitive constructs included the following measures:
The brief academic self-concept scale included ten items (adapted from ref. 61), such as ‘I like having difficult work to do’ and ‘I am clever’, rated on a five-point Likert scale.
School engagement62 includes five subscales: teacher–student relations, control and relevance of schoolwork, peer support for learning, future aspirations and goals and family support for learning. The school engagement scale includes items, such as ‘I enjoy talking to the teachers at my school’, ‘I feel like I have a say about what happens to me at school’, ‘school is important for achieving my future goals’ and ‘when I have problems at school, my family/carer(s) are willing to help me’, rated on a four-point Likert scale.
Grit was assessed with eight items from the Short Grit Scale63 asking the twins to report on their academic perseverance answering questions such as ‘setbacks don’t discourage me’ and ‘I am a hard worker’, rated on a five-point Likert scale.
Academic ambition64 was measured with five items asking participants to rate statements, such as the following ‘I am ambitious’ and ‘achieving something of lasting importance is the highest goal in life’ on a five-point Likert scale.
Time spent studying mathematics was assessed with three items asking participants how much time every week they spent in ‘regular lessons in mathematics at school’, ‘out-of school-time lessons in mathematics’ and ‘study or homework in mathematics by themselves’.
Mathematics self-efficacy65 was measured with eight items asking students how confident they felt about having to perform different mathematics tasks, for example: ‘calculating how many square metres of tiles you need to cover a floor’ and ‘understanding graphs presented in newspapers’, rated on a four-point Likert scale
Mathematics interest65 asked participants to respond to three questions related to interest in mathematics, including ‘I do mathematics because I enjoy it’ and ‘I am interested in the things I learn in mathematics’.
Curiosity was assessed with seven items66 asking participants to rate statements, such as ‘when I am actively interested in something, it takes a great deal to interrupt me’ and ‘everywhere I go, I am looking out for new things or experiences’ on a seven-point Likert scale
Attitudes towards school was measured using the Programme for International Student Assessment attitudes to school measure65, which included four items, such as ‘school has helped give me confidence to make decisions’ and ‘school has taught me things, which could be useful in a job’ rated on a four-point Likert scale.
Self-regulation
Emotional and behavioural self-regulation was assessed at all ages using the Strengths and Difficulties Questionnaire (SDQ)67. Data on domain-general self-regulation skills were collected from parents, teachers and self-reported by the twins. The SDQ includes five subscales: hyperactivity, conduct problems, peer problems, emotional problems and pro-social behaviour. Composite scores for all subscales except pro-social behaviour were reversed so that higher scores indicated higher levels of domain-general self-regulation skills. At 7 years of age, domain-general self-regulation skills were rated by the parents; at 9 years and 12 years by the parents, teachers and self-reported by the twins; and at 16 years self-reported by the twins.
Cognitive ability
At 7 years of age, cognitive ability was measured using four tests that were administered over the telephone by trained research assistants. Two tests assessed verbal ability: a 13-item similarity test and 18-item vocabulary test, both derived from the Wechsler Intelligence Scale for Children (WISC-III)68. Non-verbal ability was measured using two tests: a nine-item conceptual groupings test69 and a 21-item WISC picture completion test68. Verbal and non-verbal ability composites were created by taking the mean of the standardized test scores within each domain. A general cognitive ability (g) composite was derived taking the mean of the two standardized verbal and two standardized non-verbal test scores.
At 9 years of age, cognitive ability was assessed using four tests that were administered as booklets sent to TEDS families by post. Verbal ability was measured using the first 20 items from WISC-III-PI words test70 and the first 18 items from WISC-III-PI general knowledge test70. Non-verbal ability was assessed using the shapes test (CAT3 Figure Classification)71 and the puzzle test (CAT3 Figure Analogies)71. Verbal and non-verbal ability composites were created by taking the mean of the standardized test scores within each domain. A g composite was derived taking the mean of the two standardized verbal and two standardized non-verbal test scores.
At 12 years of age, cognitive ability was measured using four tests that were administered online. Verbal ability was measured using the full versions of the verbal ability tests administered at 9 years: the full 30 items from WISC-III-PI words test70 and 30 items from WISC-III-PI general knowledge test70. Non-verbal ability was measured with the 24-item pattern test (derived from the Raven’s Standard Progressive Matrices)72 and the 30-item picture completion test (WISC-III-UK)68. Verbal and non-verbal ability composites were created by taking the mean of the standardized test scores within each domain. A g composite was derived from the mean of the two standardized verbal and two standardized non-verbal test scores.
At 16 years of age, cognitive ability was assessed using a composite of one verbal and one non-verbal test administered online. Verbal ability was assessed using an adaptation of the Mill Hill Vocabulary test73 and non-verbal ability was measured using an adapted version of the Raven’s Standard Progressive Matrices test72. A g composite was derived taking the mean of the two standardized tests.
Academic achievement
At 7 years of age, academic achievement was measured with standardized teacher reports and consisted of standardized mean scores of students’ achievements in English and mathematics, in line with the National Curriculum level. Performance in English was assessed in four domains: speaking, listening, reading and writing abilities. Performance in maths was assessed in three domains: applying mathematics, as well as knowledge about numbers, shapes, space and measures.
At 9 years of age, academic achievement was again assessed using teacher reports. The domains assessed were the same for English and mathematics (although on age-appropriate content). In addition, performance in science was assessed considering two key domains: scientific enquiry and knowledge and understanding of life processes, living things and physical processes.
At 12 years of age, academic achievement was assessed in the same way as at age 9, with two exceptions. Mathematics added a fourth domain, data handling, and science added a third domain, materials and their properties. These additions were in line with the changes made to the National Curriculum teacher ratings.
At 16 years of age, academic achievement was measured using the General Certificate of Secondary Education (GCSE) examination scores. The GCSE is the UK nationwide examination usually taken by 16 year olds at the end of compulsory secondary education74. Twins’ GCSE scores were obtained via mailing examination results forms to the families shortly after completion of the GCSE exams by the twins. For the GCSE, students could choose from a wide range of subjects. In the current analyses the mean score of the three compulsory GCSE subjects: English language and/or English literature, mathematics and a science composite (a mean score of any of the scientific subjects taken, including physics, chemistry and biology).
Family SES
At first contact, parents of TEDS twins received a questionnaire by post, and were asked to provide information about their educational qualifications, employment and mothers’ age at first birth. A SES composite was created by standardizing these three variables and calculating their mean. The same measures, except for mother’s age at first birth, were used to measure family SES at 7 years of age. At age 16 years, data on SES were collected using a web questionnaire, and a total score was calculated from the standardized mean of five items: household IN, mother’s and father’s highest qualifications and mother’s and father’s employment status.
Genetic data
Two different genotyping platforms were used because genotyping was undertaken in two separate waves, 5 years apart. AffymetrixGeneChip 6.0 SNP arrays were used to genotype 3,665 individuals. Additionally, 8,122 individuals (including 3,607 dizygotic (DZ) co-twin samples) were genotyped on Illumina HumanOmniExpressExome-8v1.2 arrays. Genotypes from a total of 10,346 samples (including 3,320 DZ twin pairs and 7,026 unrelated individuals) passed quality control, including 3,057 individuals genotyped on Affymetrix and 7,289 individuals genotyped on Illumina. The final data contained 7,363,646 genotyped or well-imputed SNPs. For additional information on the treatment of these samples see ref. 75.
Analytic strategies
Our analyses fully adhered to our pre-registration, available at the following link: https://osf.io/m5f7j. Although we did not deviate from our pre-registered analyses, we extended them to include two further analyses. First, we explored within-family PGS associations to examine whether and to what extent the PGS prediction of academic achievement could be accounted for by family-wide processes. Second, we explored PGS × SES interactions in predicting academic achievement across compulsory education.
Factor analysis, correlations and regressions
Confirmatory factor analysis was employed to create latent dimensions of non-cognitive skills and general Cog ability at all ages using the lavaan package for R76. On the basis of well-established literature on general cognitive ability (g) and previous work in the TEDS sample77, we constructed one factor for g at each developmental stage. Each g factor was created by taking the weighted loadings of two verbal and two non-verbal tests (Measures and Supplementary Table 6). Confirmatory factor analysis was also employed to construct dimensions of non-cognitive characteristics. On the basis of previous meta-analytic work on the non-cognitive characteristics that matter for educational outcomes9,78, we embraced a theoretical distinction between education-specific non-cognitive skills (for example, motivations, attitudes and goals) and broader, more de-contextualized measures of self-regulation (for example, behavioural and emotional regulation), and created separate factors for (1) education-specific non-cognitive skills and (2) domain-general self-regulation skills separately for ages and raters, including all the measures available at each age for each rater (see Supplementary Tables 2 and 3 for factor loadings and model fit indices).
We applied phenotypic correlations to examine the associations between non-cognitive skills (both observed measures and factors) and general cognitive ability and academic achievement at each age. We applied multiple regressions to explore the associations between non-cognitive skills and academic achievement accounting for general cognitive ability. We applied Benjamini–Hochberg correction79 to account for multiple testing.
The twin method
The twin method allows for the decomposition of individual differences in a trait into genetic and environmental sources of variance by capitalizing on the genetic relatedness between monozygotic (MZ) twins, who share 100% of their genetic makeup, and DZ twins, who share on average 50% of the genes that differ between individuals. The method is further grounded in the assumption that both types of twins who are raised in the same family share their rearing environments to approximately the same extent80. By comparing how similar MZ and DZ twins are for a given trait (intraclass correlations), it is possible to estimate the relative contribution of genetic factors and environments to variation in that trait. Heritability, the amount of variance in a trait that can be attributed to genetic variance (A), can be roughly estimated as double the difference between the MZ and DZ twin intraclass correlations80. The ACE model further partitions the variance into shared environment (C), which describes the extent to which twins raised in the same family resemble each other beyond their shared genetic variance, and non-shared environment (E), which describes environmental variance that does not contribute to similarities between twin pairs (and also includes measurement error).
The twin method can be extended to the exploration of the covariance between two or more traits (multivariate genetic analysis). Multivariate genetic analysis allows for the decomposition of the covariance between multiple traits into genetic and environmental sources of variance, by modelling the cross-twin cross-trait covariances. Cross-twin cross-trait covariances describe the association between two variables, with twin one’s score on variable one correlated with twin two’s score on variable two, which are calculated separately for MZ and DZ twins. The examination of shared variance between traits can be further extended to test the aetiology of the variance that is common between traits and of the residual variance that is specific to individual traits. We conducted these analyses in OpenMx for R81.
It is possible to apply SEM to decompose latent factors into A, C and E components, applying models, such as the common pathway model. The common pathway model is a multivariate genetic model in which the variance common to all measures included in the analysis can be reduced to a common latent factor, for which the A, C and E components are estimated. As well as estimating the aetiology of the common latent factor, the model allows for the estimation of the A, C and E components of the residual variance in each measure that is not captured by the latent construct82. We conducted these analyses in Mplus83.
A further multivariate twin method, grounded in SEM, is the Cholesky decomposition, which examines the genetic and environmental underpinnings of the associations between multiple variables or latent factors. The Cholesky approach parses the genetic and environmental variation in each trait into that accounted for by traits previously entered into the model and the variance, which is unique to a newly entered trait. In our case, the Cholesky decomposition partitions the genetic and environmental variance that is common across cognitive, non-cognitive and achievement measures from the genetic and environmental variance that is common between non-cognitive skills and achievement, independently of that accounted for by general cognitive ability. Cholesky decompositions were conducted on latent dimensions of cognitive and non-cognitive skills and observed variation in academic achievement (Supplementary Tables 12 and 13). We conducted these analyses in Mplus83.
Genomic SEM
Genomic SEM40 is an approach to conduct multivariate genome-wide association analyses. On the basis of the principles of SEM widely used in twin analyses and integrated with linkage disequilibrium score regression84, genomic SEM jointly analyses GWAS summary statistics for multiple traits to test hypotheses about the structure of the genetic covariance between traits. Here, we employed genomic SEM to create latent GWAS summary statistics for unmeasured traits on the basis of other traits for which GWAS summary statistics exist. Recent work applied a GWAS-by-subtraction approach20 leveraging the GWASs of EA17 and CP17,85 to obtain a GWAS of non-cognitive skills. The GWAS-by-subtraction approach estimates, for each SNP, an effect on EA that is independent of that SNP’s effect on CP (therefore, indexing residual ‘non-cognitive’ SNP effects). The model regresses the EA and CP summary statistics on two latent variables, Cog and NonCog. EA and CP are both regressed on the Cog latent variable and only EA is regressed on the NonCog latent factor. The Cog and NonCog factors are specified to be uncorrelated and residual covariances across factor indicators are set to zero. Cog and NonCog are then regressed on each SNP, iterating across all SNPs in the genome.
We extended the GWAS-by-subtraction with the aim of obtaining potentially more fine-grained Cog and NonCog factors. Specifically, the model was extended as follows: loading exclusively on the Cog factor: five UK Biobank Cog traits (CP85, symbol digit (SD) substitution, memory, trail making test and RT)41. Loading on both the Cog and NonCog factors: EA17, Townsend deprivation index86 and income42. An additional difference from the original GWAS-by-subtraction is that we let residual variances vary freely (that is, we did not constrain them to 0; Fig. 3a and Supplementary Table 14).
PGS analyses
PGSs were calculated as the weighted sums of each individual’s genotype across all SNPs, using LDpred weights87. LDpred is a Bayesian shrinkage method that corrects for local linkage disequilibrium (that is, correlations between SNPs) using information from a reference panel (we used the target sample (TEDS) limited to unrelated individuals) and a prior for the genetic architecture of the trait. We constructed PGS using an infinitesimal prior, that is, assuming that all SNPs are involved in the genetic architecture of the trait, as this has been found to perform well with highly polygenic traits, such as EA, and in line with the approach adopted by Demange et al.20. In regression analyses, as with Demange et al.20, both the Cog and NonCog PGSs were included in multiple regressions together with the following covariates: age, sex, the first ten principal components of ancestry and genotyping chip and batch. We accounted for non-independence of observation using the generalized estimating equation (GEE) R package88.
Within and between-family analyses
We conducted within-sibling analyses using DZ twins to estimate family-fixed effects of both Cog and NonCog PGS on achievement across development43. A mixed model was fit to the data, including a random intercept to adjust for family clustering, and two family-fixed effects in addition to covariates (age, sex, the first ten principal components of ancestry and genotyping chip and batch): a between-family effect, indexed by the mean family PGS (that is, the average of the DZ twins’ PGS within a family), and a within-family effect, indexed by the difference between each twin’s PGS from the family mean PGS. Analyses were repeated with the PGS from Demange et al.20 as sensitivity analyses.
Gene–environment interaction analyses
We conducted gene–environment interaction analyses to test whether SES moderated the effects of the Cog and NonCog PGS prediction on academic achievement over development. We fit a linear mixed model including Cog and NonCog PGS (the extensions), SES and their two-way interactions after adjusting for covariates (as above) and two-way interactions between predictors and covariates, plus a random intercept to adjust for family clustering. We adjusted for multiple testing using the Benjamini–Hochberg false discovery rate method79 for all PGS analyses, at an α level of 0.05.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We gratefully acknowledge the ongoing contribution of the participants in the TEDS and their families. TEDS has been supported by a programme grant to R.P. from the UK Medical Research Council (MR/M021475/1 and previously G0901245), with additional support from the US National Institutes of Health (AG046938). M.M. is supported by a starting grant from the School of Biological and Behavioural Sciences at Queen Mary University of London. R.P. is supported by a Medical Research Council Professorship award (G19/2). A.D.G. was supported by NIH Grants R01MH120219 and RF1AG073593. J.-B.P. is funded by the European Research Council under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 863981) and the UK Research and Innovation (UKRI) under the UK government’s Horizon Europe funding guarantee (grant number 575067), attributed to J.-B.P. and supporting A.G.A. P.A.D. is supported by the European Union (grant agreement number 101045526). Views and opinions expressed are, however, those of the authors only and do not necessarily reflect those of the European Union or the European Research Council Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Author contributions
M.M., A.G.A. and R.P. conceived and designed the study.; M.M. and A.G.A. analysed the data with helpful contributions from M.G.N. and P.B.; M.M., A.G.A., K.P.H. and R.P. wrote the paper with helpful contributions from M.G.N., P.B., K.R., R.C., S.v.S., P.A.D., E.v.B., A.D.G., L.R., J.D.l.F., J.-B.P. and E.M.T.-D. All authors contributed to the interpretation of data, provided critical feedback on manuscript drafts and approved the final draft.
Peer review
Peer review information
Nature Human Behaviour thanks Abdel Abdellaoui and the other, anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
Researchers can apply for access to the TEDS data through their data access mechanism (www.teds.ac.uk/researchers/teds-data-access-policy). Summary statistics for the extended Cog and NonCog factors can be found via figshare at https://figshare.com/s/25abf6cc4ca207468c6c (ref. 89).
Code availability
Code is available via GitHub at https://github.com/CoDEresearchlab/NoncognitiveGenetics.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Margherita Malanchini, Andrea G. Allegrini.
Contributor Information
Margherita Malanchini, Email: m.malanchini@qmul.ac.uk.
Andrea G. Allegrini, Email: a.allergini@ucl.ac.uk
Supplementary information
The online version contains supplementary material available at 10.1038/s41562-024-01967-9.
References
- 1.Tangney, J. P., Baumeister, R. F. & Boone, A. L. High self-control predicts good adjustment, less pathology, better grades, and interpersonal success. J. Pers.72, 271–324 (2004). [DOI] [PubMed] [Google Scholar]
- 2.Diamond, A. Executive functions. Annu. Rev. Clin. Psychol.64, 135–168 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Muenks, K., Wigfield, A., Yang, J. S. & O’Neal, C. R. How true is grit? Assessing its relations to high school and college students’ personality characteristics, self-regulation, engagement, and achievement. J. Educ. Psychol.109, 599–620 (2017). [Google Scholar]
- 4.Garon-Carrier, G. et al. Intrinsic motivation and achievement in mathematics in elementary school: a longitudinal investigation of their association. Child Dev.87, 165–175 (2016). [DOI] [PubMed] [Google Scholar]
- 5.Tucker-Drob, E. M. & Harden, K. P. Intellectual interest mediates gene-by-SES interaction on adolescent academic achievement. Child Dev.83, 743–757 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tucker-Drob, E. M. & Harden, K. P. National GDP, science interest and science achievement: a person-by-nation interaction. Psychol. Sci.25, 2047–2057 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chamorro-Premuzic, T., Harlaar, N., Greven, C. U. & Plomin, R. More than just IQ: a longitudinal examination of self-perceived abilities as predictors of academic performance in a large sample of UK twins. Intelligence38, 385–392 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Heckman, J. & Rubinstein, Y. The importance of noncognitive skills: lessons from the GED testing program. Am. Econ. Rev.91, 145–149 (2001). [Google Scholar]
- 9.Schneider, M. & Preckel, F. Variables associated with achievement in higher education: a systematic review of meta-analyses. Psychol. Bull.143, 565–600 (2017). [DOI] [PubMed] [Google Scholar]
- 10.Kovas, Y. et al. Why children differ in motivation to learn: insights from over 13,000 twins from 6 countries. Pers. Individ. Dif.80, 51–63 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tucker-Drob, E. M., Briley, D. A., Engelhardt, L. E., Mann, F. D. & Harden, K. P. Genetically-mediated associations between measures of childhood character and academic achievement. J. Pers. Soc. Psychol.111, 790–815 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Malanchini, M., Engelhardt, L. E., Grotzinger, A. D., Harden, K. P. & Tucker-drob, E. M. ‘Same but different’: associations between multiple aspects of self-regulation, cognition and academic abilities. J. Pers. Soc. Psychol.117, 1164–1188 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Briley, D. A. & Tucker-Drob, E. M. Comparing the developmental genetics of cognition and personality over the life span. J. Pers.85, 51–64 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Rimfeld, K., Kovas, Y., Dale, P. S. & Plomin, R. True grit and genetics: predicting academic achievement from personality. J. Pers. Soc. Psychol.111, 780–789 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Malanchini, M. et al. Genetic factors underlie the association between anxiety, attitudes and performance in mathematics. Transl. Psychiatry10, 12 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rietveld, C. A. et al. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science340, 1467–1471 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet.50, 1112–1121 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Belsky, D. W. et al. The genetics of success: how single-nucleotide polymorphisms associated with educational attainment relate to life-course development. Psychol. Sci.27, 957–972 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Smith-Woolley, E., Selzam, S. & Plomin, R. Polygenic score for educational attainment captures DNA variants shared between personality traits and educational achievement. J. Pers. Soc. Psychol.117, 1145–1163 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Demange, P. A. et al. Investigating the genetic architecture of noncognitive skills using GWAS-by-subtraction. Nat. Genet.53, 35–44 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Boman, B. The influence of SES, cognitive, and non-cognitive abilities on grades: cross-sectional and longitudinal evidence from two Swedish cohorts. Eur. J. Psychol. Edu.38, 587–603 (2023). [Google Scholar]
- 22.Duckworth, A. L. & Yeager, D. S. Measurement matters: assessing personal qualities other than cognitive ability for educational purposes. Educ. Res.44, 237–251 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Krapohl, E. et al. The high heritability of educational achievement reflects many genetically influenced traits, not just intelligence. Proc. Natl Acad. Sci. USA111, 15273–15278 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Malanchini, M., Rimfeld, K., Allegrini, A. G., Ritchie, S. J. & Plomin, R. Cognitive ability and education: how behavioural genetic research has advanced our knowledge and understanding of their association. Neurosci. Biobehav Rev.111, 229–245 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kong, A. et al. The nature of nurture: effects of parental genotypes. Science359, 424–428 (2018). [DOI] [PubMed] [Google Scholar]
- 26.Veller, C. & Coop, G. M. Interpreting population-and family-based genome-wide association studies in the presence of confounding. PLoS Biol.22, e3002511 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Young, A. I., Benonisdottir, S., Przeworski, M. & Kong, A. Deconstructing the sources of genotype-phenotype associations in humans. Science365, 1396–1400 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Malanchini, M. et al. Reading self-perceived ability, enjoyment and achievement: a genetically informative study of their reciprocal links over time. Dev. Psychol.53, 698–712 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Plomin, R. Genotype-environment correlation in the era of DNA. Behav. Genet44, 629–638 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tucker-Drob, E. M. & Harden, K. P. Early childhood cognitive development and parental cognitive stimulation: evidence for reciprocal gene–environment transactions. Dev. Sci.15, 250–259 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Tucker-Drob, E. M. & Harden, K. P. A. in Genetics, Ethics and Education (eds Grigorenko, E. L. et al.) 134–158 (Cambridge Univ. Press, 2017).
- 32.Plomin, R., DeFries, J. C. & Loehlin, J. C. Genotype-environment interaction and correlation in the analysis of human behavior. Psychol. Bull.84, 309–322 (1977). [PubMed] [Google Scholar]
- 33.Belsky, D. W. et al. Genetic analysis of social-class mobility in five longitudinal studies. Proc. Natl Acad. Sci. USA115, E7275–E7284 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tucker-Drob, E. M. & Bates, T. C. Large cross-national differences in gene × socioeconomic status interaction on intelligence. Psychol. Sci.27, 138–149 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Allegrini, A. G. et al. Multivariable G–E interplay in the prediction of educational achievement. PLoS Genet.16, e1009153 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Harden, K. P. et al. Genetic associations with mathematics tracking and persistence in secondary school. npj Sci. Learn.5, 1 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Cheesman, R. et al. A population-wide gene-environment interaction study on how genes, schools, and residential areas shape achievement. npj Sci. Learn.7, 29 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Munafo, M. R. & Davey Smith, G. Repeating experiments is not enough. Nature553, 399–401 (2018).29368721 [Google Scholar]
- 39.Allegrini, A. G. et al. The p factor: genetic analyses support a general dimension of psychopathology in childhood and adolescence. J. Child Psychol. Psychiatry61, 30–39 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Grotzinger, A. D. et al. Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat. Hum. Behav.3, 513–525 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.de la Fuente, J., Davies, G., Grotzinger, A. D., Tucker-Drob, E. M. & Deary, I. J. A general dimension of genetic sharing across diverse cognitive traits inferred from molecular data. Nat. Hum. Behav.5, 49–58 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hill, W. D. et al. Genome-wide analysis identifies molecular systems and 149 genetic loci associated with income. Nat. Commun.10, 5741 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Selzam, S. et al. Comparing within- and between-family polygenic score prediction. Am. J. Hum. Genet105, 351–363 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Krapohl, E. & Plomin, R. Genetic link between family socioeconomic status and children’s educational achievement estimated from genome-wide SNPs. Mol. Psychiatry21, 437–443 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Abdellaoui, A., Dolan, C. V., Verweij, K. J. H. & Nivard, M. G. Gene–environment correlations across geographic regions affect genome-wide association studies. Nat. Genet.54, 1345–1354 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tucker-Drob, E. M. & Harden, K. P. in Current Perspectives in Psychology: Genetics, Ethics and Education (eds Bouregy, S. E. et al.) 134–158 (Cambridge Univ. Press, 2017).
- 47.Bronfenbrenner, U. & Ceci, S. J. Nature–nuture reconceptualized in developmental perspective: a bioecological model. Psychol. Rev.101, 568–586 (1994). [DOI] [PubMed] [Google Scholar]
- 48.Akimova, E. T., Breen, R., Brazel, D. M. & Mills, M. C. Gene-environment dependencies lead to collider bias in models with polygenic scores. Sci. Rep.11, 9457 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pingault, J.-B. et al. Genetic sensitivity analysis: adjusting for genetic confounding in epidemiological associations. PLoS Genet.17, e1009590 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Howe, L. J. et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat. Genet.54, 581–592 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Fletcher, J., Wu, Y., Li, T. & Lu, Q. Interpreting polygenic score effects in sibling analysis. PLoS ONE19, e0282212 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Nivard, M. G. et al. More than nature and nurture, indirect genetic effects on children’s academic achievement are consequences of dynastic social processes. Nat. Hum. Behav.8, 771–778 (2024). [DOI] [PubMed] [Google Scholar]
- 53.Wang, Z., Oh, W., Malanchini, M. & Borriello, G. A. The developmental trajectories of mathematics anxiety: cognitive, personality, and environmental correlates. Contemp. Educ. Psychol.61, 101876 (2020). [Google Scholar]
- 54.Pritikin, J. N., Neale, M. C., Prom-Wormley, E. C., Clark, S. L. & Verhulst, B. GW-SEM 2.0: efficient, flexible, and accessible multivariate GWAS. Behav. Genet51, 343–357 (2021). [DOI] [PubMed] [Google Scholar]
- 55.von Stumm, S. et al. Predicting educational achievement from genomic measures and socioeconomic status. Dev. Sci.23, e12925 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rimfeld, K. et al. Twins early development study: a genetically sensitive investigation into behavioral and cognitive development from infancy to emerging adulthood. Twin Res. Hum. Genet.22, 508–513 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Spinath, B., Spinath, F. M., Harlaar, N. & Plomin, R. Predicting school achievement from general cognitive ability, self-perceived ability, and intrinsic value. Intelligence34, 363–374 (2006). [Google Scholar]
- 58.Walker, S. O. & Plomin, R. Nature, nurture, and perceptions of the classroom environment as they relate to teacher‐assessed academic achievement: a twin study of nine‐year‐olds. Educ. Psychol.26, 541–561 (2006). [Google Scholar]
- 59.The Nation’s Report Card, Grade 4, Student Background Questionnaire on Mathematics (NAEP, 2005).
- 60.The Nation’s Report Card, Grade 4, Student Background Questionnaire on Reading (NAEP, 2005).
- 61.Burden, R. Assessing children’s perceptions of themselves as learners and problem-solvers: the construction of the Myself-as-Learner Scale (MALS). Sch. Psychol. Int.19, 291–305 (1998). [Google Scholar]
- 62.Appleton, J. J., Christenson, S. L., Kim, D. & Reschly, A. L. Measuring cognitive and psychological engagement: validation of the student engagement instrument. J. Sch. Psychol.44, 427–445 (2006). [Google Scholar]
- 63.Duckworth, A. L. & Quinn, P. D. Development and validation of the short grit scale (Grit-S). J. Pers. Assess.91, 166–174 (2009). [DOI] [PubMed] [Google Scholar]
- 64.Duckworth, A. L., Peterson, C., Matthews, M. D. & Kelly, D. R. Grit: perseverance and passion for long-term goals. J. Pers. Soc. Psychol.92, 1087–1101 (2007). [DOI] [PubMed] [Google Scholar]
- 65.Programme for International Student Assessment (PISA). Student questionnaire. OECDhttps://www.oecd.org/pisa/ (2006).
- 66.Kashdan, T. B., Rose, P. & Fincham, F. D. Curiosity and exploration: facilitating positive subjective experiences and personal growth opportunities. J. Pers. Assess.82, 291–305 (2004). [DOI] [PubMed] [Google Scholar]
- 67.Goodman, R. The strengths and difficulties questionnaire: a research note. J. Child Psychol. Psychiatry38, 581–586 (1997). [DOI] [PubMed] [Google Scholar]
- 68.Wechsler, D. Wechsler Intelligence Scale for Children (WISC-III UK) (Psychological Corporation, 1992).
- 69.McCarthy, D. McCarthy Scales of Children’s Abilities (Psychological Corporation, 1972).
- 70.Kaplan, E., Fein, D., Kramer, J., Delis, D. & Morris, R. The WISC-III as a Process Instrument (WISC-III PI) (Psychological Corporation, 1999).
- 71.Smith, P., Fernandes, C. & Strand, S. Cognitive Abilities Test 3 (CAT3) (nferNELSON, 2001). [Google Scholar]
- 72.Raven, J., Raven, J. C. & Court, J. Manual for Raven’s Progressive Matrices and Vocabulary Scales (Oxford Univ. Press, 1996).
- 73.Raven, J. C., Raven, J. & Court, J. H. Mill Hill Vocabulary Scale (Oxford Univ. Press, 1998). [Google Scholar]
- 74.Shakeshaft, N. G. et al. Strong genetic influence on a UK nationwide test of educational achievement at the end of compulsory education at age 16. PLoS ONE8, e80341 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Selzam, S. et al. Evidence for gene–environment correlation in child feeding: links between common genetic variation for BMI in children and parental feeding practices. PLoS Genet.14, e1007757 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Rosseel, Y. lavaan: an R package for structural equation modeling. J. Stat. Softw.48, 1–36 (2012). [Google Scholar]
- 77.Malanchini, M. et al. Pathfinder: a gamified measure to integrate general cognitive ability into the biological, medical, and behavioural sciences. Mol. Psychiatry26, 7823–7837 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Richardson, M., Abraham, C. & Bond, R. Psychological correlates of university students’ academic performance: a systematic review and meta-analysis. Psychol. Bull.138, 353–387 (2012). [DOI] [PubMed] [Google Scholar]
- 79.Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc.57, 289–300 (1995). [Google Scholar]
- 80.Knopik, V. S., Neiderhiser, J. M., Defries, J. C. & Plomin, R. Behavioral Genetics 7th edn (Worth Publishers, Macmillan Higher Education, 2016).
- 81.Neale, M. C. et al. OpenMx 2.0: extended structural equation and statistical modeling. Psychometrika81, 535–549 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Rijsdijk, F. V. in Encyclopedia of Statistics in Behavioral Science (eds Everitt, B. S. & Howell, D. C.) 330–331 (Wiley, 2005).
- 83.Muthén, L. K. & Muthén, B. O. Mplus User’s Guide 8th edn (Muthén & Muthén, 2017).
- 84.Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet.47, 1236–1241 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Savage, J. E. et al. GWAS meta-analysis (N = 279,930) identifies new genes and functional links to intelligence. Nat. Genet.50, 912–919 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Hill, W. D. et al. Molecular genetic contributions to social deprivation and household income in UK Biobank. Curr. Biol.26, 3083–3089 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Vilhjálmsson, B. J. et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet97, 576–592 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Vanegas, L. H., Rondón, L. M. & Paula, G. A. Generalized estimating equations using the new R package glmtoolbox5. The R Journal15, 105–133 (2023). [Google Scholar]
- 89.Malanchini, M. et al. Summary statistics for multivariate GWAS extension of cognitive and non-cognitive skills. figsharehttps://figshare.com/s/25abf6cc4ca207468c6c (2024).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Researchers can apply for access to the TEDS data through their data access mechanism (www.teds.ac.uk/researchers/teds-data-access-policy). Summary statistics for the extended Cog and NonCog factors can be found via figshare at https://figshare.com/s/25abf6cc4ca207468c6c (ref. 89).
Code is available via GitHub at https://github.com/CoDEresearchlab/NoncognitiveGenetics.