Abstract
Families transmit genes and environments across generations. When parents’ genetics affect their children’s environments, these two modes of inheritance can produce an “indirect genetic effect.” Such indirect genetic effects may account for up to half of the estimated genetic variance in educational attainment. We tested if indirect genetic effects reflect within-nuclear-family transmission (“genetic nurture”) or instead a multi-generational process of social stratification (“dynastic effects”). We analyzed indirect genetic effects on children’s academic achievement in their 5th-9th years of schooling in N=37,117 parent-offspring trios in the Norwegian Mother, Father, and Child Cohort Study (MoBa). We used pairs of genetically-related families (parents were siblings, children were cousins; N=10,913) to distinguish within-nuclear-family genetic-nurture effects from dynastic effects shared by cousins in different nuclear families. We found that indirect genetic effects on children’s academic achievement cannot be explained by processes that operate exclusively within the nuclear family.
Introduction
Genetically-informed research designs offer strong evidence that education is transmitted across generations via the inheritance of environmental advantage. Adoption1, twin2, molecular genetic3, and genome wide association studies (GWAS)4–6 all offer evidence that the intergenerational transmission of educational attainment occurs via both genetic and environmental (i.e. social) mechanisms (Table 1). Studies of adoptees show that children resemble their adoptive (social) parents in education, despite not being genetically related to them7. Studies of twins reveal educational similarities within monozygotic and dizygotic pairs that are in line with a role for both the genome and the environment8. Molecular genetics studies have also established evidence for environmental mechanisms of intergenerational transmission of educational attainment (EA)9,10. Molecular-genetic studies of parent-offspring trios, of adopted parent-child dyads, and of biological siblings all show that people’s EA is associated with the genetic variants they did not inherit—an association that can only operate via the environment/social context11–14.
Table 1.
Evidence from Genetically Informed Studies for Environmental Transmission of Educational Outcomes
| Design | Key Comparison |
|---|---|
| Twins | Are dizygotic twins more similar in their EA than can be accounted for by their genetic relatedness? |
| Twins + offspring | Are children of (e.g., female) monozygotic twins more similar in their EA to their mother than to their aunt? |
| Adoptees | Do adopted offspring resemble their adoptive parents more than their biological parents in their EA? |
| Adoptees + siblings | Do offspring adopted into more environmentally advantaged homes have higher EA than their siblings who were not adopted away? |
| Adoptees + PGIs | Do adoptive parents’ PGIs predict adopted children’s EA? Is the association between one’s own PGI and one’s own EA stronger if raised by biological parents than by adoptive parents? |
| Parent-offspring trios + PGIs | Is the portion of the parental genotype that is not inherited by the offspring (untransmitted PGI) associated with offspring’s EA? |
| Siblings + PGIs | Is the PGI-EA association attenuated after controlling for a family-specific effect or family’s socioeconomic status (SES)? |
A new wave of molecular genetic studies measures genetic correlations with EA using polygenic indices (PGIs). The PGI method uses results from GWAS to summarize information about hundreds of thousands of genetic variants associated with a target trait or behavior into a single number for each research participant. Taken at face value, PGIs based on GWAS of EA can predict as much as 12-16% of variation in EA in independent, population-based samples15, a level of explanatory power similar to parental education. In these studies, PGIs are measured from DNA collected from the same individuals whose education is being measured. However, PGI associations with EA reflect more than direct genetic influences on the development of characteristics that promote success in school. Studies of gene-environment correlations reveal that children’s EA PGIs are correlated with environments they inherit from their parents, including the social and economic circumstances of their families and neighborhoods14,16,17. EA PGIs therefore measure not just a child’s genetic background, but their environment as well.
EA PGIs are associated with not just educational outcomes, but a range of social and economic behaviors, including where and how far people move from home, who they have children with, and how they parent16–19. A parent generation’s EA PGIs therefore become their children’s environment. In family-based PGI studies, researchers can separate out the effects of genetics that are passed on from parents to children (transmitted genotypes) from those that parents’ possess, but which their children do not inherit (non-transmitted genotypes). These studies find that children’s EA is influenced by PGIs based on both the transmitted and non-transmitted genotypes11,12. The effects of the non-transmitted genotypes reflect a process of inheritance that is mediated by the environment. These “indirect” genetic effects will be detected in GWAS and subsequently affect downstream PGI analyses of EA20. Indirect genetic effects are also viewed as a means to study how parental traits affect childhood outcomes, while accounting for the direct genetic effects on offspring outcomes21.
Evidence for indirect genetic effects on EA come from PGI studies of siblings and adoptees. In sibling studies, GWAS of educational differences between siblings estimate lower heritability as compared to GWAS of unrelated individuals4. PGI studies based on these GWAS find that effect-sizes for PGIs based on sibling-difference GWAS are smaller than effect-sizes for PGIs based on between-family GWAS6,22. In adoption studies, two findings stand out. First, adoptive parents’ PGIs are associated with their adopted children’s outcomes23, an association that could not be mediated by direct genetic transmission. Second, PGI effect-sizes are larger for children living with their biological parents than for adoptees who live with social parents to whom they are not genetically related24. A key difference between these two groups of children is that the adoptee’s PGIs are uncoupled from environments correlated with their parents’ genotypes.
While there is consistent evidence for the presence of indirect genetic effects on EA, the mechanisms that produce these effects remain unclear. Among the most evocative descriptions of indirect genetic effects on EA are “Genetic nurture” and “dynastic effects”11,25. “Genetic nurture” invokes the transmission of skills and values from parents to children within nuclear families via “nurturing” parenting behaviors. Evidence for genetic nurture-type mechanisms comes from studies that find parenting behaviors mediate indirect-genetic-effect associations of parental genotypes with their children’s educational outcomes26. The term “dynastic effects” invokes the transmission of wealth and social status within family lineages across generations. Evidence for dynastic-type mechanisms comes from studies that show multi-generational socioeconomic stratification correlated to the parental genotype22, assortative mating27–29, and subtle population stratification30,31.
Indirect genetic effects are defined as the association of one individual’s genotype with another individual’s phenotype, above and beyond that individual’s own genotype. Within the regression framework, indirect genetic effects on EA are estimated by including the child’s genotype in a model testing the association between their parents’ genotype and the child’s EA. The control for the child’s genotype isolates the environmentally-mediated portion of the effect of the parent’s genotype. We define nurture effects as the consequence of mechanisms, operating within the nuclear family, flowing from parental actions or status that introduce a correlation between parental genotypes and child outcomes above and beyond the effect of the genotypes transmitted to the child. We negatively define “dynastic effects” as any social or historical process that introduces a correlation between parental genotype and offspring outcomes that is not nurture, as such it includes processes like assortative mating and population stratification. Assortative mating may contribute to indirect effects by capturing the genetic component of the phenotype with which non-transmitted alleles of the parents are correlated.32 Population stratification occurs when differences in genotype frequency spuriously correlate with environmental differences, and this induces confounding between genotype and outcome. We choose these specific definitions of nurture and dynastic effects as in our design we can sharply distinguish nurture from other causes of indirect genetic effects, but we cannot directly differentiate between other mechanisms behind the indirect effects.
An extended-pedigree design that includes multiple families in which some of the parents are siblings makes possible a further decomposition of the effect of the parent’s genotype. In the extended-pedigree design, the indirect genetic effect isolated by the control for the child’s genotype can be further divided into two components: (1) the between-family indirect genetic effect, identified as the effect of the average genotype among the siblings in the parental generation, and (2) the within-family indirect genetic effect, identified as the effect of the deviation of the parent’s genotype from their sibship average. Within the regression framework, this is accomplished by including the parental-sibship-average genotype as a covariate in the model. This covariate effectively captures the effects of the grandparental genotype as well as effects of any environments that are shared within the extended-family pedigree (the parent and their siblings) and correlated to genotype. This design further includes a control for subtle population stratification, as the parent and their sibling have identical ancestry. An alternate specification is to regress the child’s educational attainment on their own genotype, their parents genotype and their uncle or aunts’ genotype. If the parental genotype correlates with the child’s outcome because of nurture in the nuclear family, the genotype of the child’s aunt/uncle should be independent from the child’s outcome conditional on the parents genotype.
Within the extended-pedigree genetic-nurture model, the within-family indirect genetic effect represents “nurture”, i.e., environmentally-mediated effects operating within the nuclear family environment. This could include effects mediated by parenting behaviors or direct investments by parents in their children. In parallel, the estimate of the between-family indirect genetic effect captures both indirect effects that operate via “dynastic transmission” and those that act via nurture. This could include effects mediated by multigenerational stratification in environments. Between family indirect genetic effects could also be a function of a bias introduced in the estimated relations between genotype and phenotype introduced by people systematically selecting mates that are similar in terms of education or related traits (“assortative mating”) across multiple generations. This bias does not persist within families. Therefore assortment among spouses would introduce a between family indirect genetic effect but not a within family indirect genetic effect.
We conducted extended-pedigree analysis of indirect genetic effects on academic achievement in the Norwegian Mother, Father, and Child Cohort Study (MoBa), in which both children and their parents are genotyped, and which includes over 10,000 sibling pairs in the parental generation. MoBa recorded children’s grade-5, −8, and −9 standardized test scores on three subjects (reading comprehension in Norwegian [for almost all children their first language], math, English). We computed four PGIs from the most recent GWASs of EA and related phenotypes; a PGI for EA (GWAS N > 3 million)15, PGIs for cognitive- (GWAS N = 257,700) and non-cognitive-skill (GWAS N = 510,795) contributions to EA33, and a PGI based on a GWAS of EA performed within sibling pairs (GWAS N = 128,777)4.
We structure our analysis around four models (outlined in detail in the methods section). We first establish that the children’s own PGI’s are related to their academic achievement in this sample (model 1). Next, we establish the presence of indirect genetic effects following the standard approach of regressing children’s achievement on parental PGIs while conditioning on children’s own PGIs (model 2). Then, in order to test whether the indirect effects reflect genetic nurture or dynastic effects, we specify a model that includes the mean of one parent and sibling PGIs, the parent-sibling’s deviation from that mean, and the PGI of the parent that does not have a sibling in the data (model 3). The presence of a within family indirect effect is consistent with nurture-like processes, while its absence in the presence of a between family indirect effect is consistent with dynastic-like processes. Finally, we consider an alternative specification where the child’s achievement is regressed on their own PGI, their parents’ PGI’s and the PGI of a sibling of one of the parents (i.e. the child’s aunt’s or uncle’s PGI). In this final model, the logic is that the PGI of the aunt or uncle would not relate to the child’s achievement through nurture within the nuclear family (model 4).
Results
Figure 1 shows results from all four models estimated on the parent-sibling sample, with numerical results available in Supplementary Tables 2–4.
Figure 1. Results from four models of academic achievement using three definitions of polygenic scores.

From left to right results for the models 1 and 2 (N=37,117 families) and 3 and 4 (N=10,913 families) where achievement is regressed on a set of PGIs and covariates (age, year, test, genomic principal components), with a child-specific random effect. Figure only presents the effect of the PGIs of interest. Upper panel shows coefficients the educational attainment (EA4) PGI; middle panel, show coefficients for cognitive skills (blue) and noncognitive skills (orange) PGIs and lower panel coefficients for within-family PGI for educational attainment. Symbols represent point estimates and vertical error bars represent 95% confidence intervals. Dashed line is a reference value for the indirect genetic effect as established in previous meta-analysis (that did not include MoBa) of educational outcomes.
Associations of children’s PGIs with academic achievement
In the MoBa child cohort included in our analysis, the effect-size (standardized beta) for the association of the EA4 PGI with academic achievement was 0.24 (t(10287) = 30.3; SE = 0.008; p<0.001). For the PGIs of cognitive (Cog) and non-cognitive (NonCog) contributions to education, which were analyzed as concurrent predictors, effect-sizes were 0.26 (t(10318) = 32.9; SE=0.008; p<0.001) for Cog and 0.14 (t(10281) = 17.1; SE=0.008; p<0.001) for Non-Cog. For the PGI from the within-family GWAS of EA (WFEA), the effect-size was 0.17 (t(10315) = 22.1; SE=0.008; p<0.001).
Indirect genetic effect estimates from parent-offspring data
In all models that include an indirect genetic effect, the direct genetic effects remained significant, but were attenuated, with standardized betas that were reduced by 15% - 35% compared to models that did not include indirect genetic effects (see Figure 1 and Supplementary Tables S2–S4 for numerical results). The parent-offspring model (model 2) includes PGIs for parents and their child as concurrent predictors of the child’s academic achievement. In these models, the effect-estimate for the parental PGIs can be interpreted as an indirect genetic effect (because directly inherited genetic influences are captured by the child’s PGI). Effect-sizes for indirect genetic effects were modest, but in the expected direction and statistically different from zero at the alpha=0.05 level. For the EA4 PGI, effect-sizes for fathers and mothers were 0.05 (mothers: t(10271) = 5.22; SE=0.01; p<0.001; fathers: t(10293) = 4.72; SE=0.01; p<0.001); for the within-family GWAS PGI, the effect-size for fathers was 0.04 (t(10283) = 4.2; SE=0.02; p<0.001) and for mothers was 0.05 (t(10341) = 4.7; SE=0.01; p<0.001). For the Cog and Non-Cog PGIs, which were tested in the same model, Cog effect-sizes were 0.03 (t(10277) = 3.3, SE=0.01, p<0.001) for fathers and 0.04 (t(10295) = 4.0, SE=0.01, p<0.001) for mothers and Non-Cog effect-sizes were 0.05 (t(10285) = 4.8, SE=0.01, p<0.001) for fathers and 0.04 (t(10267) = 4.1, SE = 0.01, p<0.001) for mothers. The model confirms an indirect genetic effect.
Indirect genetic effects estimates from extended pedigrees
Models 3 and 4 are extended-family models. model 3 includes PGIs for parents and their child as well as the mean PGI for parental siblings as concurrent predictors of the child’s academic achievement. In model 3, the estimate for the parental PGIs can be interpreted as a within-family indirect genetic effect because directly inherited genetic influences are captured by the child’s PGI and between-family indirect genetic effects are captured by the parental-sibship-mean PGIs.
Considering the PGI based on the EA4 GWAS we find a large effect of the child’s PGI (beta = 0.184, t(10273) = 16.2, SE = 0.011, p<0.001) on their test score, a modest but significant (p<0.05) effect for the PGI of the parent for whom no sibling is in the data (beta = 0.051, t(10294) = 5.3, SE= 0.011), a similar effect of the mean sibling PGI for the parent and their sibling (who is an aunt/uncle to the child) (beta = 0.053, t(10261) = 5.0, SE= 0.011, p<0.001), while the deviance of their parent relative to their sibling is not significant (beta = 0.014, t(10324) = 0.82, SE = 0.017, p=0.412).
Using the PGI based on the WFEA GWAS we also find a large effect of the child’s PGI (beta = 0.126, t(10305) = 10.8, SE = 0.011, p<0.001) on their test score, a modest but significant effect for the PGI of the parent for whom no sibling is in the data (beta = 0.036, t(10329) = 3.6, SE = 0.010, p<0.001), a similar effect of the mean sibling PGI for the parent and their sibling (beta = 0.058, t(10294) = 5.7, SE= 0.010, p<0.001), while the deviance of their parent relative to their sibling is not significant (beta = 0.020, t(10292) = 1.14, SE= 0.017, p=0.252).
We consider the Cog and NonCog PGI jointly, and observe a substantial effect of the child’s Cog PGI (beta = 0.222, t(10276) = 19.3, SE= 0.011, p<0.001) and a modest effect of the child’s NonCog PGI (beta = 0.089, t(10271) = 7.71, se = 0.011, p<0.001). The between parental-sibling-pair PGIs were significant for both the Cog PGI (beta = 0.042, t(10280) = 3.87, SE = 0.011, p<0.001) and the NonCog PGI (beta = 0.053, t(10277) = 4.94, SE = 0.011, p<0.001). Crucially, neither the effect of the within parental-sibling-pair Cog (beta = −0.002, t(10270) = −0.14, SE=0.017, p=0.882) nor NonCog (beta = 0.027, t(10339) = 1.59, SE=0.016, p=0.112) PGI was significantly different from zero.
We performed one-tailed tests of the hypothesis that the difference between the between parental-sibling-pair coefficient(s) and the within parental-sibling-pair coefficient(s) is equal to or smaller than zero (cf. Supplementary Table 5). Tests reject the hypothesis for results obtained with all three PGIs: EA4 (t = 2.16, p = 0.0155, ), Cog/NonCog (t = 2.38, p = 0.0087, ), and WFEA (t = 2.09, p = 0.0183, ).
The alternate specification, model 4, includes the PGIs of the child, the parent for whom no sibling is in the data, the parent who has a sibling in the data, and that parent’s sibling. The results are as expected (see Table 2 for expectations). Considering the PGI based on the EA4 GWAS, we find a large direct effect of the child’s PGI (beta = 0.184, t(10273) = 16.2, SE = 0.011, p<0.001), a modest but significant effect for the PGI of the parent for whom no sibling is in the data (beta = 0.051, t(10294) = 5.3, SE= 0.011, p<0.001), a smaller, and significant effect for the parent for whom a sibling is available (beta = 0.034, t(10298) = 3.09, SE= 0.011, p<0.001), and a significant effect of the PGI of the aunt/uncle on the child (beta = 0.019, t(10315) = 2.08, SE= 0.009, p = 0.038).
Table 2.
Expectations for indirect genetic effect parameter estimates for two model specifications under three conditions.
We find a large effect of the child’s WFEA PGI (beta = 0.126, t(10305) = 10.7, SE = 0.012, p<0.001), a modest but significant effect for the PGI of the parent for whom no sibling is in the data (beta = 0.036, t(10330) = 3.63, SE = 0.010, p < 0.001), a smaller, and significant effect for the parent for whom a sibling is available (beta = 0.038, t(10296) = 3.5, SE= 0.011, p < 0.001), and a significant effect of the PGI of the aunt/uncle on the child (beta = 0.019, t(10286) = 2.11, SE= 0.009, p = 0.035).
Finally, the effect of the child’s Cog PGI is quite pronounced (beta = 0.222, t(10276) = 19.4, SE = 0.011, p < 0.001), the effect of the NonCog PGI is modest (beta = 0.088, t(10271) = 7.71, SE = 0.011, p < 0.001), the effect of the PGI of the parent for whom no sibling is in the data is significant and modest (Cog: beta = 0.040, t(10298) = 4.12, SE= 0.01, p<0.001; Non-Cog: beta = 0.040, t(10268) = 4.12, SE = 0.01, p<0.001), the effects of the PGIs for an aunt/uncle are significant for Cog but not NonCog (Cog: beta = 0.021, t(10282) = 2.27, SE = 0.009, p=0.023; Non-Cog: beta = 0.014, t(10333) = 1.48, SE= 0.009, p=0.139), while the Cog and NonCog effects of the parent for whom a sibling is in the data is modest (and insignificant for Cog: beta = 0.020, t(10265) = 1.84, SE= 0.011, p=0.066; Non-Cog: beta = 0.039, t(10307) = 3.57, SE= 0.011, p<0.001).
Results from models 1 and 2 estimated on the largest possible sample, and stratified by school grade and test subject, are shown in Supplementary Figure 1. Results from models 1–4 estimated on the sample of all parent-sibling families, and stratified by school grade and test subject are reported in Supplementary Figure 2–9.
Discussion
The discovery of specific genetic variants associated with EA has given researchers a new tool for investigating the intergenerational transmission of education. In particular, the observation of indirect genetic effects, whereby the parental genotype is associated with offspring outcomes beyond the child’s genotype, illustrates the role of the environment in the intergenerational transmission. Our indirect genetic effect estimates are consistent with previous analysis of MoBa34, and of Dutch35 and UK22 cohorts, but somewhat smaller than the average effect from a recent meta-analysis13. The meta-analytic average may be stronger because it included studies of educational outcomes obtained from later stages of the life course, such as adult educational attainment.
We used an unique extended pedigree dataset with genotyped relatives from multiple generations to study the environmental processes driving the indirect genetic effect. In contrast to the processes implied by the phrase “genetic nurture,” we did not find evidence that a large portion of the indirect genetic effects repeatedly established in previous work, predominantly runs through environmental mechanisms within the nuclear family, such as for example parental behaviors or investments. Instead, our findings suggest that the majority of the indirect genetic effect in academic achievement, though not necessarily all of it, does not arise within the nuclear family, but instead reflects processes shared across families with common grandparents. Specifically, after accounting for genetics shared at the extended-family-level (between a child’s parent and their aunt or uncle), and the child’s own genetics, the “genetic nurture” association of a parent’s PGI with their child’s educational achievement was not statistically different from zero. Another important source of indirect genetic effects that is consistent with our findings is a major role for assortative mating, where repeated spousal selection on characteristics that are related to educational success would introduce indirect genetic effects that in our extended-family design were fully controlled with parental sibling pairs, as those are matched on their history of genetic assortment.
In contrast, the extended-family-level PGI did show a statistically significant association with the child’s educational achievement that was comparable in size to the original indirect-genetic-effect estimate. This result does not rule out the presence of within-nuclear-family indirect genetic effects on EA. But it does suggest that genetic nurture processes unique to the nuclear family are likely to be a minor contributor to the indirect genetic effects observed in studies of trios or parent-child dyads.
We acknowledge limitations of the study. There are known biases to models that use PGIs to separate direct from indirect genetic effects. The GWAS from which we derived the per-SNP effect estimates for the Cog, NonCog and EA PGIs are influenced by unmodeled indirect genetic effects. Thus, for each SNP, we rely on an effect estimate that is a mix of direct and indirect effects. This mixture can result in bias to within-family analysis of PGIs derived from these GWAS36,37. However, our findings persist in analysis using a PGI derived from within-family GWAS, in which the bias in SNP effects that go into the PGI, induced by gene-environment correlation, are sharply attenuated, though not entirely absent38.
Our analysis may be under-powered to detect very small within-nuclear-family indirect genetic effects. Our analysis utilizes the largest sample for extended-pedigree indirect-genetic-effect analysis currently available. Our results are consistent across analysis of 3 PGIs and 2 different specifications. Nevertheless, there could be a non-zero within-family indirect genetic effect undetected in our analysis. Power calculations reveal that our analysis was powered to detect effects of half the size (0.04) of the indirect genetic effect on educational outcomes estimated in a recent meta-analysis13. Finally selection bias could affect results. MoBa has relatively high participation rates. Our use of national register data to determine educational achievement limits loss to follow-up as a source of bias. However, the MoBa sample is healthier and wealthier than the Norwegian population. Replications with other samples with the appropriate data structure (for example, the HUNT study39) should be a primary concern. Replication should be closely followed by generalization to other educational outcomes, such as high school completion or college enrollment. These outcomes are believed to be more strongly influenced by the nuclear family environment than are children’s scores on standardized tests.35 A recent meta-analysis of indirect genetic effects on a variety of educational outcomes including adult attainment reported an effect size (marked by dashed line in Figure 1) stronger than those obtained in our analysis. The specific educational tests we use have limited consequences for educational careers and therefore could be considered low stakes tests while parents may be more invested (and seek more influence) for educational outcomes that are closer linked to the child’s future social position. It would further be desirable to triangulate our result across alternate designs, for example, adoption in the parental generation or directly observed grandparental genotypes. Finally, there is a need to generalize beyond contemporary Norway, which has relatively low income inequality, a high quality tuition-free public education system, and a generous welfare state.
Our results are consistent with the interpretation of indirect genetic effects on academic achievement as in part or largely due to “dynastic effects”. Such effects could reflect subtle socioeconomic and genetic-ancestry stratification co-occurring within homogeneous populations30,31,4. According to this interpretation, the extended-family-level PGI is correlated with a set of inherited social circumstances which affect children’s academic achievement. An alternative interpretation is that dynastic effects reflect extended-family-level behaviors and investments that contribute to children’s academic achievement. Our results are further consistent with a bias in the population GWAS and PGI estimates introduced by assortative mating. Our analysis cannot isolate the precise mechanisms of indirect genetic effects on EA. However, we can conclude that, for childhood academic achievement in the context of contemporary Norway, the mechanisms that give rise to indirect genetic effects, as indexed by current PGIs, operate mostly beyond the boundaries of nuclear families.
Children from higher SES families perform better on standardized tests of academic achievement. As with PGI associations, correlations between children’s test scores and SES might reflect “nurture” processes occurring within the nuclear family (e.g., parents actively using their resources to support their children’s educational careers with higher incomes pay for private tutoring) as well as dynastic processes that persist across generations (e.g., accumulation of wealth and access to social capital). Our results suggest that, to the extent that currently available PGIs come to be correlated with child academic achievement because they are systematically associated with SES-related environmental variation, they are capturing multigenerational effects of socioeconomic privilege, rather than the more local advantages conferred by individual parents on their own offspring.
Our results do not imply that parenting behaviors or a nurturing family environment do not affect school performance. Instead, they shed light on the mechanisms behind the widely observed indirect genetic effect of parental education-related PGI on offspring education outcomes11,13,22–24. Any effects of parenting that are not correlated to the parental educational PGIs are not detected in studies of the indirect genetic effect. By focusing on parental PGI for education-related traits, we omit potentially important parental influences. The education PGI used here would for example not index all parental life events or circumstances that may relate to worse educational outcomes for children. While our analysis can speak to the widely studied effect of parental educational PGI on childhood academic achievement, and is well-designed to avoid genetic confounding, it does not represent a comprehensive evaluation of parental influences on their children’s educational outcomes.
There are strategies to leverage genetic data to study parenting without relying on education-related PGI. One follow-up would be to repeat the current analysis as a GWAS, regressing child outcomes on each SNP in the child, the same SNP in the parent and include the mean SNP of the parent and their sibling as a third covariate. A GWAS of parental effects on childhood outcomes, while using the parental sibling structure to control for confounding (i.e., a within-sibling GWAS), would yield SNP level summary statistics that would allow analytical techniques like LD score regression to test genetic correlations between the indirect effects and hundreds of heritable parental traits like personality, psychopathology, wellbeing and physical health. The primary constraint on this type of analysis is sample size. However, with the continued development of national genetic databases, such extended-family GWAS of genetic nurture may soon be possible. Ultimately, a better understanding of the environmental/social intergenerational transmission of education will benefit from a tighter integration between social scientific data and genetic data.
Methods
Participants
The Norwegian Mother, Father and Child Cohort Study (MoBa) is a population-based pregnancy cohort study conducted by the Norwegian Institute of Public Health40. Participants were recruited from all over Norway from 1999-2008. Women consented to participation in 41% of the pregnancies. The cohort now includes 114,500 children, 95,200 mothers and 75,200 fathers. Not all participants have yet been genotyped, and legal restrictions related to consent reduce our effective sample size relative to some other versions of the data. The current study is based on version 12 of the quality-assured survey data files released for research in January 2019 and MoBaPsychGen v.1. The establishment of MoBa and initial data collection was based on a license from the Norwegian Data Protection Agency and approval from The Regional Committees for Medical and Health Research Ethics. The MoBa cohort is now based on regulations related to the Norwegian Health Registry Act. The current study was considered by The Regional Committees for Medical and Health Research Ethics.
In our version of the data, 39,230 nuclear families have genotype information on complete trios (mother, father, and child) where both parents were born in Norway. The Norwegian system of personal ID numbers facilitates linking of data from MoBa to register-based information for educational outcomes, basic demography and links between parents and their siblings. The data structure is illustrated in Supplementary Figure 10. For 37,117 complete trio families, children have one or more educational outcomes available. In our analytic sample, there are 10.913 nuclear families where the child and both parents are genotyped, the child has taken at least one standardized test, and one parent has a sibling that is a genotyped parent in another MoBa family in the data set.
Measures
Academic achievement is measured by children’s results on national standardized tests (“Nasjonale prøver”) in reading (i.e., reading comprehension in Norwegian, for almost all children their first language), math, and English. Reading and math were administered in 5th, 8th and 9th grades and English in 5th and 8th grades. Nearly all MoBa children have data on 5th grade tests, while the youngest cohorts do not yet have data on 8th and 9th grade tests. The test scores were obtained from Norwegian administrative registries. The scores were standardized within the test and year to control for test version and changes over time.
PGIs were computed for all individuals using the LDpred2 software. GWAS summary statistics were obtained from a GWAS-by-subtraction, for cognitive (Cog) and non-cognitive (Non-Cog) SNP effects on EA33, and from the within-family GWAS of EA 15,41. For the EA PGIs, we relied on the top 10,000 publicly reported SNPs.15
Statistical analysis
Regression models
We fit four models. All eight test scores are included, with test fixed-effects and a child-specific random intercept included in all models. The first two models establish the presence of an indirect genetic effect in the sample of all genotyped parent-offspring trios. model 1 estimates the total genetic effect measured by the PGI.
| (model 1) |
Model 2 adds additional parameters for parents’ PGIs and decomposes the total genetic effect into a direct component, measured by (for the child’s PGI) and indirect components, measured by and (for the parents’ PGIs).
| (model 2) |
We then decompose the indirect genetic effect into within- and between-family components. First, we select parents with one or more siblings in the MoBa sample. We next compute the following predictors:
where is the mean PGI for a sibship of size m;
where is the deviation of each parent’s PGI from their sibship-mean , and , which is the PGI of parents who do not have a sibling in the data. Finally, we combine these parameters in the equation for model 3:
| (model 3) |
Children for whom neither parent has a sibling in the data are omitted (N=28,317). The model specification, which follows previous work22 results in identical between-family and within-family effects in the absence of population stratification and/or the absence of a multigenerational effect on childhood academic achievement42. In the presence of either, we expect the between parental-sibling-pair effect to be larger than the within parental-sibling-pair effect.
As an alternative specification to Model 3, we fit a parallel model that parametrizes within- and between-family indirect genetic effects using a different approach. In this alternative specification, model 4, we include the PGIs of the parents and their siblings (i.e., the uncle or aunt of the child) in the regression.
| (model 4) |
Here, an indirect genetic effect consisting of purely within-family (“genetic nurture”) mechanisms would result in a parameter estimate of zero for . In contrast, an indirect effect consisting of only between-family (“dynastic effect”) mechanisms would result in a parameter estimate of zero for .
All regression models include a set of child covariates indicated by the ellipsis (…): sex, birth year, test subject and grade fixed effects (to account for systematic differences in achievement between tests), the first 10 genetic principal components (to account for population stratification), and genotyping-batch fixed effects (to account for batch-to-batch variation in genotype processing and measurement). We performed a power analysis and established we had > 80% power to detect a sibling-pair effect half that of previous meta-analytic indirect genetic effects13. All tests are two-tailed unless otherwise specified.
Relation between parameter and conceptual processes
In the regression models above we define 3 parameters that relate in the following way to underlying mechanisms that generate associations between parental PGIs and child’s outcome conditional on the child’s own PGI. In model 2 we define: and which are the sum of influences of genetic nurture, dynastic effects and assortative mating. In model 3 and 4 we define which is a consequence of genetic nurture but not dynastic effects or assortment. and are again the sum of the effects of influences of genetic nurture, dynastic effects and assortative mating.
If is not different from zero we find no evidence for “genetic nurture”, while if is not different from to and not different from the average of and (which are estimated in a larger sample, and hence with more power) this would be consistent with the absence of the influence of a “dynastic effect” or “assortative mating”. For convenience summarize the relations between the mechanisms that can generate PGI-phenotype associations and the regression parameters we estimate in Table 2. The relationships are confirmed through simulations available on the GitHub repository that accompanies this paper.
Supplementary Material
Acknowledgements
We thank the Norwegian Institute of Public Health (NIPH) for generating high-quality genomic data. This research is part of the HARVEST collaboration, supported by the Research Council of Norway (#229624). We also thank the NORMENT Centre for providing genotype data, funded by the Research Council of Norway (#223273), South East Norway Health Authority and KG Jebsen Stiftelsen. We further thank the Center for Diabetes Research, the University of Bergen for providing genotype data and performing quality control and imputation of the data funded by the ERC AdG project SELECTionPREDISPOSED, Stiftelsen Kristian Gerhard Jebsen, Trond Mohn Foundation, the Research Council of Norway, the Novo Nordisk Foundation, the University of Bergen, and the Western Norway health Authorities (Helse Vest) The Norwegian Mother, Father and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. We are grateful to all the participating families in Norway who take part in this on-going cohort study.
MGN is supported by ZonMW grants 849200011 and 531003014 from The Netherlands Organisation for Health Research and Development, a VENI grant awarded by NWO (VI.Veni.191G.030), and NIH grant R01MH120219. KPH is supported by grant R01HD092548 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), and by NICHD grant P2CHD042849 awarded to the Population Research Center at The University of Texas at Austin. DWB was supported by US National Institute on Aging grants R01AG066887, R01AG073402, Russell Sage Foundation BioSS Grant 1810-08987, and the Canadian Institute for Advanced Research. EvB is supported by ZonMw grant 531003014 and NWO Gravitation grant 024.001.003. THL and TB are supported by Horizon2020 ERC Consolidator grant #818420 OPENFLUX. MGN, DWB, KPH, and EvB are all past or present Jacobs Foundation Research Fellows. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Footnotes
Code availability statement
No custom computer code was used in the study. The software used in the data preparation and analysis were R 4.0, LDPred 2, and plink 1.9. R scripts for data preparation and analysis are available at http://github.com/torkildl/nurturenature
Competing interest statement
The authors declare no competing interests.
Data availability statement
The data analyzed in the study are administrative data maintained by Statistics Norway and genotype data from MoBa Genetics. The data are not publicly available, but available to researchers upon application to the respective data owners. Such applications require approval by the appropriate ethics / research data access authorities. Access to administrative data from Statistics Norway can be applied for at Statistics Norway (http://www.ssb.no/mikrodata/) and access to MoBa Genetics can be applied for at the Norwegian Public Health Institute (http://www.fhi.no/studier/moba/). In Norway, the appropriate ethics and research data boards are the Regional Committee on Medical Research Ethics (REK) or SIKT. The consent given by the MoBa participants does not open for storage of data on an individual level in repositories or journals.
References
- 1.Lundborg P, Nordin M & Rooth DO The intergenerational transmission of human capital: the role of skills and health. Journal of Population Economics vol. 31 1035–1065 Preprint at 10.1007/s00148-018-0702-3 (2018). [DOI] [Google Scholar]
- 2.Silventoinen K. et al. Genetic and environmental variation in educational attainment: an individual-based analysis of 28 twin cohorts. Sci. Rep 10, 12681 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Young AI et al. Relatedness disequilibrium regression estimates heritability without environmental bias. Nat. Genet 50, 1304–1310 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Howe LJ et al. Within-sibship GWAS improve estimates of direct genetic effects. bioRxiv 2021.03.05.433935 (2021) doi: 10.1101/2021.03.05.433935. [DOI] [Google Scholar]
- 5.Lee JJ et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet 50, 1112–1121 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Okbay A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sacerdote B. Nature and nurture effects on children’s outcomes: What have we learned from studies of twins and adoptees? Handbook of Social Economics 1A, 1–30 (2011). [Google Scholar]
- 8.Branigan AR, McCallum KJ & Freese J Variation in the heritability of educational attainment: An international meta-analysis. Soc. Forces 92, 109–140 (2013). [Google Scholar]
- 9.Demange PA et al. Estimating effects of parents’ cognitive and non-cognitive skills on offspring education using polygenic scores. bioRxiv (2020) doi: 10.1101/2020.09.15.296236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Liu H. Social and Genetic Pathways in Multigenerational Transmission of Educational Attainment. Am. Sociol. Rev 83, 278–304 (2018). [Google Scholar]
- 11.Kong A. et al. The nature of nurture: Effects of parental genotypes. Science 359, 424–428 (2018). [DOI] [PubMed] [Google Scholar]
- 12.Bates TC et al. The Nature of Nurture: Using a Virtual-Parent Design to Test Parenting Effects on Children’s Educational Attainment in Genotyped Families. Twin Res Hum. Genet 21, 73–83 (2018). [DOI] [PubMed] [Google Scholar]
- 13.Wang B. et al. Robust genetic nurture effects on education: A systematic review and meta-analysis based on 38,654 families across 8 cohorts. Am. J. Hum. Genet (2021) doi: 10.1016/j.ajhg.2021.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Belsky DW et al. Genetic analysis of social-class mobility in five longitudinal studies. Proc. Natl. Acad. Sci. U. S. A 115, E7275–E7284 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Okbay A. et al. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nat. Genet 54, 437–449 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Belsky DW et al. Genetics and the geography of health, behaviour and attainment. Nat Hum Behav 3, 576–586 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Abdellaoui A. et al. Genetic correlates of social stratification in Great Britain. Nat Hum Behav 3, 1332–1342 (2019). [DOI] [PubMed] [Google Scholar]
- 18.Wertz J. et al. Genetics of nurture: A test of the hypothesis that parents’ genetics predict their observed caregiving. Dev. Psychol 55, 1461–1472 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Belsky DW et al. The genetics of success: How single-nucleotide polymorphisms associated with educational attainment relate to life-course development. Psychol. Sci 27, 957–972 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Morris TT, Davies NM, Hemani G & Smith GD Population phenomena inflate genetic associations of complex social traits. Sci Adv 6, eaay0328 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Koellinger PD & Harden KP Using nature to understand nurture. Science vol. 359 386–387 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Selzam S. et al. Comparing Within- and Between-Family Polygenic Score Prediction. Am. J. Hum. Genet 105, 351–363 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Domingue BW & Fletcher J Separating Measured Genetic and Environmental Effects: Evidence Linking Parental Genotype and Adopted Child Outcomes. Behav. Genet 50, 301–309 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Cheesman R. et al. Comparison of Adopted and Nonadopted Individuals Reveals Gene-Environment Interplay for Education in the UK Biobank. Psychol. Sci 31, 582–591 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brumpton B. et al. Avoiding dynastic, assortative mating, and population stratification biases in Mendelian randomization through within-family analyses. Nat. Commun 11, 3519 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wertz J. et al. Using DNA From Mothers and Children to Study Parental Investment in Children’s Educational Attainment. Child Dev. 00, cdev.13329 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yengo L. et al. Imprint of assortative mating on the human genome. Nat Hum Behav 2, 948–954 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Robinson MR et al. Genetic evidence of assortative mating in humans. Nat. Hum. Behav 1, 0016 (2017). [Google Scholar]
- 29.Border R. et al. Assortative Mating Biases Marker-based Heritability Estimators. Preprint at 10.1101/2021.03.18.436091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zaidi AA & Mathieson I Demographic history mediates the effect of stratification on polygenic scores. Elife 9, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Mostafavi H. et al. Variable prediction accuracy of polygenic scores within an ancestry group. (2020) doi: 10.7554/eLife.48376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Young AS Estimation of indirect genetic effects and heritability under assortative mating. bioRxiv (2023) doi: 10.1101/2023.07.10.548458. [DOI] [Google Scholar]
- 33.Demange PA et al. Investigating the genetic architecture of noncognitive skills using GWAS-by-subtraction. Nat. Genet 53, 35–44 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Isungset M. et al. Social and Genetic Effects on Educational Performance in Early Adolescence. Preprint at 10.3386/w28498 (2021). [DOI] [Google Scholar]
- 35.Zeeuw E. L. de et al. Intergenerational transmission of education and ADHD: Effects of parental genotypes. Preprint at 10.1101/664128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Trejo S & Domingue BW Genetic Nature or Genetic Nurture? Quantifying Bias in Analyses Using Polygenic Scores. Preprint at 10.1101/524850. [DOI] [Google Scholar]
- 37.Fletcher J, Wu Y, Li T & Lu Q Interpreting Polygenic Score Effects in Sibling Analysis. bioRxiv 2021.07.16.452740 (2021) doi: 10.1101/2021.07.16.452740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Veller C & Coop G Interpreting population and family-based genome-wide association studies in the presence of confounding. bioRxiv 2023.02.26.530052 (2023) doi: 10.1101/2023.02.26.530052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Krokstad S. et al. Cohort Profile: The HUNT Study, Norway. International Journal of Epidemiology vol. 42 968–977 Preprint at 10.1093/ije/dys095 (2013). [DOI] [PubMed] [Google Scholar]
- 40.Magnus P. et al. Cohort Profile Update: The Norwegian Mother and Child Cohort Study (MoBa). Int. J. Epidemiol 45, 382–388 (2016). [DOI] [PubMed] [Google Scholar]
- 41.Howe LJ et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat. Genet 54, 581–592 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Carlin JB, Gurrin LC, Sterne JA, Morley R & Dwyer T Regression models for twin studies: a critical review. Int. J. Epidemiol 34, 1089–1099 (2005). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data analyzed in the study are administrative data maintained by Statistics Norway and genotype data from MoBa Genetics. The data are not publicly available, but available to researchers upon application to the respective data owners. Such applications require approval by the appropriate ethics / research data access authorities. Access to administrative data from Statistics Norway can be applied for at Statistics Norway (http://www.ssb.no/mikrodata/) and access to MoBa Genetics can be applied for at the Norwegian Public Health Institute (http://www.fhi.no/studier/moba/). In Norway, the appropriate ethics and research data boards are the Regional Committee on Medical Research Ethics (REK) or SIKT. The consent given by the MoBa participants does not open for storage of data on an individual level in repositories or journals.
