Skip to main content
PLOS One logoLink to PLOS One
. 2022 Dec 12;17(12):e0267254. doi: 10.1371/journal.pone.0267254

The intergenerational transmission of educational attainment: A closer look at the (interrelated) roles of paternal involvement and genetic inheritance

Renske Marianne Verweij 1,*, Renske Keizer 1
Editor: Marie-Pierre Dubé2
PMCID: PMC9744317  PMID: 36508409

Abstract

Numerous studies have documented a strong intergenerational transmission of educational attainment. In explaining this transmission, separate fields of research have studied separate mechanisms. To obtain a more complete understanding, the current study integrates insights from the fields of behavioural sciences and genetics and examines the extent to which paternal involvement and children’s polygenic score (PGS) are unique underlying mechanisms, correlate with each other, and/or act as important confounders in the intergenerational transmission of fathers’ educational attainment. To answer our research questions, we use rich data from The National Longitudinal Study of Adolescent to Adult Health (n = 4,579). Firstly, results from our mediation analyses showed a significant association between fathers’ educational attainment and children’s educational attainment (0.303). This association is for about 4 per cent accounted for by paternal involvement, whereas a much larger share, 21 per cent, is accounted for by children’s education PGS. Secondly, our results showed that these genetic and behavioural factors are significantly correlated with each other (correlations between 0.06 and 0.09). Thirdly, we found support for genetic confounding, as adding children’s education PGS to the model reduced the association between paternal involvement and children’s educational attainment by 11 per cent. Fourthly, evidence for social confounding was almost negligible (the association between child’s education PGS and educational attainment was only reduced by half of a per cent). Our findings highlight the importance of integrating insights and data from multiple disciplines in understanding the mechanisms underlying the intergenerational transmission of inequality, as our study reveals that behavioural and genetic influences overlap, correlate, and confound each other as mechanisms underlying this transmission.

Introduction

Individuals with highly educated parents generally achieve better outcomes in school than individuals with lower educated parents [1]. For example, during primary and secondary school years, children with highly educated parents obtain higher school grades [2], and eventually, children with at least one highly educated parent are more than twice as likely to obtain tertiary education themselves compared to children without highly educated parents [3]. Different research fields focus on different factors in explaining this intergenerational transmission of educational attainment. Research from the field of behavioural sciences aims at explaining this intergenerational transmission by considering, amongst others, characteristics of the family environment, such as parenting practices and the level of social, cultural, and economic capital within families [4, 5]. In contrast, research from the field of genetics focuses on the role of genetic transmissions [6].

With a few exceptions [710], studies that have looked at the interrelated linkages between behavioural and genetic factors are rare. That such research is rare is very unfortunate; investigating behavioural and genetic factors in isolation most likely overestimates the unique contribution each factor makes. In line with this idea, previous studies have revealed that controlling for genetics reduced the impact of parenting quality [8] and family environment [10] on children’s educational achievement, which indicates that part of the assumed social effect might be genetic, so-called ‘genetic confounding’. Genetic confounding arises amongst others because the shared genetic influences that parents pass on to their offspring collectively contribute to both increased genetic predisposition for variation in educational attainment as well as the environments that parents design for their children. This might be because traits that are assumed to be environmental, such as family SES, are partly accounted for by genetic factors [7]. Research has also shown the reverse; part of the genetic effect on education might be socially confounded; empirical studies have revealed that the association between parents’ and children’s education-related genes and children’s educational attainment is reduced when parenting practices and family SES are taken into account [911].

These findings underscore the importance of taking both behavioural and genetic factors into account to obtain a clear understanding of the mechanisms underlying the intergenerational transmission of educational attainment, which is the aim of the current paper. Our study builds forward on work by Wertz et al (2020). Although these authors did not investigate the intergenerational transmission of educational attainment, they did scrutinize the role of both maternal behaviour and genes in explaining children’s educational attainment. Their study revealed that mothers’ cognitively stimulating parenting accounted for the effect of mothers’ education polygenic score (PGS) on children’s educational attainment and that the inclusion of children’s education PGS slightly reduced the association between mothers’ parenting (cognitive stimulation, warmth, and sensitivity) and children’s education attainment [8]. This, amongst others, shows that part of the association between parental genes and children’s education is accounted for by the parenting mothers provide to their children. In the current paper, we turn our attention to the role of fathers (whilst controlling for the role mothers play).

Traditionally, studies on the intergenerational transmission of education/socioeconomic status (SES) focused on obtaining information on fathers’ educational attainment or profession. As women, particularly in previous decades, often retreated from the labour market after marriage or childbirth, information on mother’s educational attainment or profession was not always available for each child. In contrast, information on the role fathers played in parenting was often neglected or overlooked, as mothers were the primary source of information to report on child development. During the last 50 years, however, fathers have become more and more involved in parenting [12, 13]. Although some scholars argue and show that certain roles might still be most prominent amongst solely mothers or solely fathers, the roles of fathers (and mothers alike) are increasingly being expanded [14], which have made fathers and mothers more similar in their roles as caregivers [15]. Even though some scholars construct a gender-differentiated vision of how mothers and fathers parent, there is increasing consensus among scholars that there are little to no differences in how well mothers and fathers parent [e.g. 15]. In line with this, and pertaining to educational attainment, studies show that the relationship between parental involvement and academic achievement is similar for mothers and fathers [16].

That said, we do expect to see greater variation in paternal than in maternal involvement. Over the years, fathers in two-parent families have spent more time with their children [17], a pattern especially common among higher educated fathers [18]. After divorce, which is more common amongst lower-educated families [19], fathers often are, or become, less involved in their children’s lives, which is especially the case among lower-educated families [20]. This greater variation in father involvement across social strata suggests that father involvement could be an important underlying mechanism of the intergenerational transmission of educational attainment, and thus possible leverage to reduce inequality.

Several studies have investigated the role that paternal involvement plays in children’s educational attainment and in the intergenerational transmission of educational attainment [21, 22] Unfortunately, however, these studies did not control for genetic effects. Even though twin studies show that both genetic aspects, as well as the shared environment, are important in accounting for children’s educational attainment, the shared environment in these studies remains unmeasured [6, 2325].

In sum, the current study aims to provide a clearer understanding of the (interrelated) roles that paternal involvement and genes play in the intergenerational transmission of educational attainment. We use children’s PGS for educational attainment as our genetic indicator. This PGS is based on a genome-wide association study (GWAS) conducted among 1.1 million individuals [26]. In this GWAS, the association between hundreds of thousands of genetic variants and educational attainments is assessed. These GWAS summary statistics are used to calculate the sum of all risk alleles, weighted by their reported effect sizes. A PGS can therefore be seen as the summary measure of the genetic propensity for a trait based on a large number of genetic variants [27]. The Education PGS has been found to account for about 11–13% of the variation in educational attainment [26]. We will examine whether and to what extent genes as well as father involvement mediate the relationship between father’s and child’s educational attainment, whether and to what extent these mediators are correlated, and/or act as important confounders in the intergenerational transmission of fathers’ educational attainment.

Theory and hypotheses

Paternal involvement as mechanism underlying the intergenerational transmission of educational attainment

To build the argument that fathers’ involvement in their children’s lives is an underlying mechanism for the intergenerational transmission of educational attainment, we would first have to argue and show that (1) fathers’ educational attainment is significantly associated with children’s educational attainment, that (2) fathers’ involvement is significantly associated with children’s educational attainment and (3) fathers’ educational attainment is significantly associated with fathers’ involvement in their children’s lives (see Fig 1 top part).

Fig 1. Graphical presentation of hypotheses 1, 2, and 3: Mediation pathways and correlated effects, hypothesis 4: Genetic confounding and hypothesis 5: Social confounding.

Fig 1

Firstly, numerous studies have revealed that fathers’ educational attainment is associated with children’s cognitive functioning [28]. Children of highly educated fathers not only show better cognitive functioning but also obtain better school grades throughout their educational career [2] and higher educational levels [3]. For example, in the US less than 20% of the children without tertiary educated parents obtain tertiary education themselves, while around 55% of those with at least one tertiary educated parent do so [3]. Secondly, fathers’ involvement in their children’s lives has been argued to positively affect their children’s educational attainment. Different dimensions of fathers’ involvement might uniquely influence children’s educational performances. First of all, and most directly, fathers may influence their children’s educational outcomes through their involvement in teaching-related activities such as coaching, helping with homework, communication with school personnel, and active participation in classroom or school activities [21, 29, 30]. The involvement of fathers may benefit children’s school achievement, amongst others because the help fathers provide and the time that fathers practice with their children can improve their children’s skills [31]. In addition, the greater involvement of fathers in schoolwork could positively impact children’s educational achievements, because greater school involvement of parents is associated with more ambitious school aspirations, greater motivation to perform well in school, and higher school attendance [32]. The rationale behind these findings is that greater involvement of parents is indicative of the value they attach to educational achievements, and children internalize these values [31]. Previous research found that children who have fathers that are more involved in their schoolwork obtained higher grades in high school [21, 29, 30] (correlation of 0.113 between father involvement and academic achievement [29] These positive associations remain after accounting for the involvement of mothers [21, 30]. Besides fathers’ involvement in school (work), the frequency of joint activities undertaken by father and child is also associated with children’s educational attainment [33, 34]. Previous studies show that children who spend more time alone with their father score higher on cognitive tests [33] (also when controlling for the involvement of the mother). In addition, studies show that the more time fathers spend with their children on care-related tasks as well as playing and reading, the better children fare in school tasks [34]. Thirdly, there is empirical evidence showing that fathers’ educational attainment is related to fathers’ involvement, as fathers who finished more than 16 years of education spend 4,76 more hours on childcare than high school dropouts [18]. Highly educated fathers are generally more involved in their children’s lives, amongst others because these fathers feel more confident in helping their children with schoolwork [35, 36], because they have more intensive parenting ideologies [5] and they have more time available [18].

Based on the abovementioned theoretical and empirical work, it is likely that fathers’ involvement mediates the association between fathers’ and children’s educational attainment. Our first hypothesis therefore reads: Fathers’ involvement (school-specific involvement and leisure involvement) is an underlying mechanism for the association between fathers’ educational attainment and children’s educational attainment (H1). A small number of studies have shown that fathers’ involvement indeed serves as a mediator between educational attainment and child outcomes [29, 3739]. A clear limitation of these studies is that they did not control for children’s genetic characteristics, and therefore might not have been able to obtain a reliable understanding of the role father involvement plays in the intergenerational transmission of educational attainment.

Genetic influences as mechanism underlying the intergenerational transmission of educational attainment

A large number of twin studies showed that educational attainment is approximately 40% heritable, that is, 40% of the variation in education is accounted for by genetic variation [6]. This heritability can be explained partly by the heritability of intelligence, but also by the heritable component of amongst others personality traits, self-efficacy, and behavioural problems [40]. Children with genes that are positively related to higher educational attainment tend to be more open, agreeable, conscientious, and show more academic motivation (correlations between 0.01 and 0.03), which are linked to better educational achievements [41]. As parents and children are 50% genetically similar, part of the intergenerational transmission of education is posed to be through genetic influences. Several previous studies showed that genes can account for part of the intergenerational transmission of education [4244]. Based on the above-described literature we expect to find that genetic influences are an underlying mechanism for the association between fathers’ educational attainment and children’s educational attainment (H2).

Three types of correlations between genetic influences and father involvement

There are three different reasons why we can expect to find a correlation between father involvement and children’s education PGS. These three reasons are related to the three types of rGE that can theoretically be distinguished [45]. The first type is an evocative gene-environment correlation: parents will behave differently to a child based on the child’s characteristics (which are genetically influenced) [46]. In our case, children’s education PGS does not only capture the intelligence of the child but also personality traits, academic motivation, behavioural problems, and self-efficacy [40, 41]. It is therefore likely that children with a higher education PGS more often evoke their father’s involvement (in particular with school), as these children are also more likely to be more interested in learning. If this is true, this would result in a positive rGE.

The second type is an active gene-environment correlation: children with higher education PGS might not only be more likely to be more interested in learning, but they might also be more prone to actively seek help from their parents with homework or discuss school matters with their father, which may then result in greater involvement of the father.

The third type is a passive gene-environment correlation: children inherit half of their genes from each biological parent, and, if children live with their biological parents, these same parents also rear them and shape their environment. This can result in a correlation between the child’s education PGS and father’s involvement in two ways. Firstly, children with a high education PGS are more likely to have a parent with a high education PGS, and the parent’s education PGS is not only associated with their own educational attainment but also their parenting practices [9]. Secondly, children with a higher education PGS generally have a highly educated father, and because of reasons such as status maintenance motives [47], these highly educated fathers are more likely to be more involved in their children’s lives.

Based on the abovementioned theoretical considerations, we expect that father involvement (school-specific involvement and leisure involvement) is positively correlated with children’s education PGS in their relation to education (H3). Several previous studies showed a correlation between genes and different aspects of the family environment, such as parental sensitivity, warmth, stimulating parenting, and parental SES [79, 48], and one study looked specifically at the correlated effect of genes and family environment on education [10]. No study has looked at the correlation between genes and father involvement in their relation to education.

Implications of correlations between genetic factors and father involvement for understanding the mechanisms underlying the intergenerational transmission of educational attainment: Genetic confounding

In the context of the expected correlations between genetic factors and father involvement, it might be the case that part of the effect of father involvement is driven by genetic factors, so-called genetic confounding; both fathers’ involvement as well children’s educational attainment are accounted for by the same genetic factors (see Fig 1 middle part for a graphical representation). This is due to the passive rGE. For example, the same genes that are associated with higher education, are also associated with greater involvement of fathers through personality traits or a sense of responsibility of the father. Previous research by Wertz and colleagues showed that controlling for genetics reduced the association between parental warmth/sensitivity and child’s educational achievement by about 8% [8]. In line with this finding, we expect that the genetic influences partly account for the behavioural mechanism underlying the intergenerational transmission of educational attainment (H4).

Implications of correlations between genetic factors and father involvement for understanding the mechanisms underlying the intergenerational transmission of educational attainment: Social confounding

Alternatively, it might be the case that part of the association between children’s education PGS and children’s educational attainment is confounded by the environment, in our case father involvement (see Fig 1 bottom part for a graphical presentation of this hypothesis). The rationale for the existence of social confounding is that the education PGS is based on the GWAS of a large number of SNPs that are associated with educational attainment. Yet, these associations do not imply causation, and the pathways from genetic variants to education are diverse [49]. GWAS studies that are used to create PGSs cannot distinguish between, on the one hand, associations between genes and education through personal traits, such as intelligence and motivation, and on the other hand, associations between genes and education due to the environment, such as the family environment and parenting practices. As such, part of the effect of our genetic factors might be driven by our behavioural factors; children’s education PGS is associated with children’s educational attainment, because children’s education PGS is associated with the parenting of the child’s father, which is driven by his education PGS, and it is father involvement that is shaping children’s educational attainment. Previous research found that the association between mother’s genes and children’s education is partly accounted for by mothers’ cognitively stimulating parenting [8] and that the association between children’s own genes and their educational attainment is reduced once family SES and household chaos are taken into account [10]. Studies also showed that the genes that parents do not transmit to their children are associated with their children’s educational attainment, which is likely due to the environment parents provide to their children [5052]. This hints towards the idea that also the genes that parents do transmit to their children partly influence their children through the home environment. Based on the above, we expect that father involvement partly accounts for the genetic mechanism underlying the intergenerational transmission of education (H5).

Zooming in: Understanding which type of gene-environment correlation is at play

Previous research has not always been able to differentiate passive rGE from active and evocative correlations, which is unfortunate if one wants to understand why this correlation exists. In our study, we can differentiate active/evocative from passive rGE, by using a subsample of our data that includes sibling data. The presence of within-family correlations between children’s education PGS and father involvement (i.e. differences in the education PGS of siblings correlate with differences in father involvement towards these siblings) would imply active/evocative rGE. However, if within families we find a substantially smaller correlation between father involvement and children’s genes as compared to between-families, this would indicate that the correlation is mainly driven by characteristics of the family, which is indicative of passive rGE. Although no previous studies examined passive versus active rGE regarding father involvement, research found support for child evoked rGE, showing that children’s genotype evokes parental warmth, sensitivity, harsh discipline, negative effect [48], but also passive rGE has been found for example regarding warmth, sensitive and stimulating parenting [8, 9, 53]. We expect to find passive rGE for both indicators of father involvement (father’s school-specific involvement and leisure involvement) (H6). In addition, given that father involvement might be explicitly activated or evoked based on children’s genetic disposition towards school, we expect that child-evoked/active rGE will only be found for father’s school-specific involvement (H7).

Current study

This study aims to gain a better understanding of the mechanisms underlying the intergenerational transmission of educational attainment by scrutinizing the interrelatedness of genetic influences and father involvement. We will answer 5 related research questions: 1) To what extent do genes and father involvement independently account for the intergenerational transmission of education, 2) to what extent do genes and behaviour correlate in their relation to educational attainment, 3) to what extent do genes account for part of the behaviour mechanism (‘genetic confounding’), 4) to what extent does behaviour account for part of the genes mechanism (‘social confounding’) and 5) to what extent can we distinguish between passive and active/evocative rGE? We will examine these research questions using data from the National Longitudinal Study of Adolescent to Adult Health (Add Health) of 5,021 genotyped individuals. These data contain information on father involvement, as well as the same information for mothers. Controlling for maternal involvement is important, as this allows us to isolate the independent effects of paternal involvement. Due to amongst others behavioural contagion, in which parents copy each other’s behaviour, mothers and fathers within the same household are more likely to show similar behaviours [54, 55] also concerning parental involvement [56]. Because our sample of respondents has been genotyped, we can include the child’s education PGS, which captures the additive effect of common genetic influences on education. In addition, using sibling data, available for a subsample within our data, we can test the hypothesized causes of any correlation between father involvement and education PGS. These sibling data allow us to unravel whether observed correlations between father involvement and education PGS are attributable to passive versus active/evocative gene-environment correlations, which have not been assessed yet for father involvement. Respondents in our sample are followed from age 12–19 up to age 32–42.

Methods

Data

The Add Health data is a longitudinal study of adolescents from the United States who were in grades 7 to 12 in wave 1 in 1994/95 [57]. The data were collected using a school-based stratified sample, in which 80 high schools in the United States were selected, and from these 80 high schools, 20,745 adolescents in grades 7 to 12 participated [57]. In this first wave, respondents were between 12 and 19 years old, with the majority between 14 and 18 years old. In the second wave, in 1996, respondents were between 13 and 21 years old. In the third wave, in 2001/02, respondents were between 18 and 26. At wave 4 in 2008/09, respondents were between 24 and 32. Finally, during wave 5 in 2016/18, respondents were between 32 and 42 years old.

From the 20.745 respondents, we selected respondents who were followed up to wave 4 or 5, since these respondents are between the ages 24 and 42 and were thus most likely to have finished their educational training. This restriction reduced our sample size to 17,535. After this, we selected only respondents who reported the educational attainment of their biological father and biological mother, which narrowed down our sample size to 14,712. After this, we only included respondents with an education PGS, which reduced the sample to 5,021 individuals. Finally, we removed individuals with missing information on parental involvement, resulting in a final sample size of 4,579 respondents. Although Add Health is a nationally representative sample, selecting only respondents with information about their own educational level, the educational level of their parents, and information about parental involvement resulted in a sample that was relatively higher educated and contained respondents who relatively more often lived with both their parents during the first wave of data collection. Furthermore, selecting only genotyped respondents led to a sample that only contained White respondents.

In total 96% of the participants in wave 5 gave saliva, and 80% of them consented with long-term archiving of their data and were eligible for genome-wide genotyping [58]. This resulted in approximately 12,200 genotyped respondents. After quality control, data were available for 9,974 individuals (please see Highland et al [59] for details on quality control). Genetically-determined non-European descent individuals were removed from the sample, as PGSs based on a GWAS with European-descent individuals is much less suitable for non-European descent individuals, leaving 5,787 individuals. After additional quality control, 5,690 individuals remained with valid genetic information (of which we use 4,579 who also have information of their educational level and parental involvement available).

Individuals were genotyped using the Illumina Omni1-Quad BeadChip (80% of the sample) and the Illumina Omni2.5-Quad BeadChip (20% of the sample). Before quality control, 609,130 SNPs that were common across both genotyping platforms were available. After quality control (call rate <0.98, HW p-value <10−4, MAF<0.01) 346,754 SNPs remained and were used for imputation. Imputation was done using the Haplotype Reference Consortium (HRC) v1.1 European reference panel.

Sibling data: The Add Health contains in total 3,139 sibling pairs, consisting of full-siblings, half-siblings, and MZ and DZ twin pairs and unrelated siblings. Reducing this sample to only sibling pairs with education PGS available reduces the sample size to 619 sibling pairs, of which 429 full siblings and DZ twin pairs, which is further reduced to 380 sibling pairs once we only include those who have information of father involvement available.

Ethical considerations

The Medical Ethics Review Board of the Erasmus Medical Center Rotterdam considered whether or not this research falls within the scope of the Medical Research Involving Human Subjects Act (WMO). It was concluded that the research is not a clinical research with test subjects as meant in the Medical Research Involving Human Subjects Act (WMO). Therefore, the Medical Ethics Review Board of the Erasmus Medical Center Rotterdam had no task in reviewing the protocol. Therefore it was concluded that we were allowed to conduct the research. Add Health participants provided written informed consent for participation in all aspects of Add Health.

Measures

Years of education of the child: In waves 4 and 5 respondents were asked about the highest level of education that they completed. Answer categories ranged from ‘8th grade or less’, to ‘completing a post-baccalaureate professional degree’. We recoded this variable to years of education. Response options and the years of education that correspond to them (in parentheses) were: 8th grade or less (8), some high school (10), high school graduate (12), some vocational/technical training (13), completed vocational/ technical training (14), some college (14), completed college (16), some graduate school (17), completed a master’s degree (18), some graduate training beyond a master’s degree (19), completed a doctoral degree (20), some post-baccalaureate professional education (18), and completed post-baccalaureate professional education (19). This is in line with previous studies that coded years of education in AddHealth [26, 60, 61]. To reduce missing data, for those respondents who did not give information in wave 5, we used the information from wave 4.

Years of education of parent: In wave 1, respondents were asked about the educational attainment of their biological parents. Answer categories ranged from ‘8th grade or less’, to ‘professional training beyond a four-year college or university’. We recoded this variable to years of education. Response options and the years of education that correspond to them (in parentheses) were: never went to school (0), 8th grade or less (8), more than eighth grade, but did not graduate from high school (10), went to a business, trade, or vocational school instead of high school (10), high school graduate (12), completed a GED (12), went to a business, trade, or vocational school after high school (14), went to college, but did not graduate (14), graduated from a college or university (16), professional training beyond a four-year college or university (18). For a subset of the respondents, parents provided information on their educational attainment themselves. Our robustness checks show that the correspondence between child reports and parent reports is high: correlation of 0.87 for mothers and 0.79 for fathers, see Supplementary Material part 3 in S1 File).

Father’s school-specific involvement: In wave 1, respondents were asked if, in the past four weeks, they talked with their father about schoolwork or grades (yes or no), worked with their father on a project for school (yes or no), and talked with their father about other things they are doing in school (yes or no). These measures were added up, ranging from 0 for children who did none of these school-related activities with their father to 3 for those who did all these activities. The Cronbach’s alpha for this scale is 0.77.

Father’s leisure involvement: In wave 1, respondents were asked if in the past four weeks they went shopping with their father (yes or no), played sports with their father (yes or no), talked with their father about someone they’re dating or a party they went to (yes or no), went to a movie, play, museum, concert or sports event with their father (yes or no) or talked about a personal problem with their father (yes or no). These activities were added up, ranging from 0 if the respondent did none of these activities with their father, to 5, if they did all these activities with their father. The Cronbach’s alpha for this scale is 0.78.

Polygenic score (PGS) for years of education: We include PGS for years of education based on the GWAS conducted among 1.1 million individuals [26]. The PGS was constructed by the Social Science Genetic Association Consortium (SSGAS), based on the discovery sample that did not include Add Health [58] and is provided by Add Health. This PGS was created using LDpred, which uses all SNPs and weights them according to their conditional effect, given all other SNPs.

Controls

Father residence: We distinguish between whether the child lived with the biological father at wave 1 (resident father = 1) or not (non-resident father = 0).

Principal components: To control for population stratification, which is the case if certain SNPs are more common in certain ancestry groups than others, we control for the first 10 genetic principal components (PCs). These genetic PCs were created by the SSGAS [58].

In our models, we control for the age the respondent had at the first interview, which ranged between 12 and 21, with the majority of the respondents between 14 and 18. We control for the sex of the respondent, and whether or not the respondent was enrolled in school at the last wave of data collection.

To examine the unique contribution of father involvement, we control for certain characteristics of the mother, namely years of education of the mother and maternal involvement, namely mother’s school-specific involvement, mother’s leisure involvement, and whether or not the child lived with the biological mother at wave 1.

Analyses

Path model to test for mediation (hypothesis 1 and 2) and confounding (hypothesis 4 and 5)

To estimate the extent to which fathers’ involvement and children’s PGS for years of education mediated the relationship between fathers’ years of education and children’s years of education (hypothesis 1 and 2), a path model was estimated using the Lavaan package in R [62]. Since the respondents in our sample are not independent but nested within households, we ran a multilevel path model of individuals nested within households [63].

In our path model, we simultaneously tested the direct effect of the father’s years of education on the years of education of his child, as well as the mediated effect via father’s involvement and the child’s PGS for years of education. These mediated effects (hypothesis 1 and 2) are estimated by multiplying the coefficient of (a) the independent variable on the mediators and (b) the mediators on the outcome [64].

Because we wanted to assess potential confounding of genetic effects by father involvement and vice versa (hypothesis 4 and 5), three nested path models were fitted, first a multiple mediation model in which only the two aspects of father involvement are assessed simultaneously, second a mediation model in which the role of children’s PGS for years of education is examined, and third a multiple mediation model that includes both father involvement and children’s education PGS. To quantify the extent to which the effect of father involvement in explaining the intergenerational transmission of education is confounded by the education PGS (hypothesis 4), we compare the coefficients of father involvement between the first model (in which only father involvement is included as a mediator) and the third model (in which also the education PGS is included) [64]. The other way around, to quantify the extent to which the effect of the education PGS is partly socially confounded (hypothesis 5), and can be accounted for by father involvement, we compare the coefficient of the education PGS between the second model (in which only the education PGS is included as a confounder), and the third model (in which both the education PGS and father involvement are included). We regress all our control variables on educational attainment of the child, and we include the first 10 genetic PC’s on the path from father’s education to the child’s PGS.

rGE (hypothesis 3, 6, and 7)

Correlated effects: To examine to what extent the education PGS and father involvement correlate in their relation to educational attainment, we estimated the correlation between children’s years of education predicted from the PGS model and children’s years of education predicted from the father involvement model. To this end, we first regressed children’s years of education on both parents’ years of education, the first 10 genetic PCs, and all other control variables using a multilevel model that takes into account the nested structure of the data. The individual level residuals from this model were used and regressed on the education PGS (model 1), father’s school-specific involvement (model 2), and father’s leisure involvement (model 3). Finally, we assessed the correlation between the predicted values from model 1 with model 2 and model 3, the correlated effects. This approach allows us to not only assess the correlation between the PGS and father involvement, but also whether and to what extent their associations with years of education correlate.

To examine active/evocative and passive rGE, we estimate rGE between and within families. Between families, we cannot simply examine the correlation between father involvement and the education PGS, as we not only have to control for spurious associations based on ancestral differences but also because we have to take the nested structure of the data into account. Therefore, we estimated multilevel regression models in which we explain the two measures of father involvement by the education PGS while controlling for the first 10 genetic PCs (hypothesis 3).

To assess the rGE within families, we do not have to control for the first 10 genetic PCs. The reason that we do not have to control for the first 10 genetic PC’s is that siblings share their ancestry, and the genetic PCs are used to control for ancestral differences. We also do not have to take into account the nested structure of the data as there is only one sibling pair per family. Therefore, we estimated linear regression models in which we explain the difference in father involvement between siblings by the difference in the education PGS between siblings. To distinguish between active and passive rGE (hypotheses 6 and 7), we will compare estimates of rGE within and between families.

Results

Univariate descriptives

Sample descriptives can be found in Table 1. The respondents in our sample finished 14.63 years of education on average, which equals to some years of education after finishing high school (12 years equals to finishing high school and 16 years equals to finishing a bachelor’s degree). Their parents on average were in school a bit shorter (on average 13.5 years). Fathers were on average less involved with their children’s school and leisure activities than mothers were. We expected to find greater variation in father involvement compared to mother involvement. In contrast, however, the variation in these two aspects of parental involvement did not substantially differ between fathers and mothers. Children’s age at the first interview was, on average, 16 years and they were almost, on average, 36 at the final wave when they were asked about their final education. 71% lived with their father at the first wave, and 29% did not, while a little under 10% did not live with their mother at the time of the first wave of data collection.

Table 1. Univariate descriptives of the sample.

Mean SD Min Max
Years of education child 14.62 2.32 8 20
Years of education father 13.52 2.54 0 18
Years of education mother 13.48 2.35 0 18
PGS education child 0.55 0.15 0 1.05
Father’s school-specific involvement 1.16 1.01 0 3
Father’s leisure involvement 1.44 1.29 0 5
Mother’s school-specific involvement 1.32 1 0 3
Mother’s leisure involvement 2.08 1.21 0 5
Age first interview 16.01 1.7 12 21.33
Age asked education 35.83 4.13 25 43
Yes n % No n %
Enrolled in educ last wave 355 7.75 4223 92.25
Live with father at the first wave 3427 74.84 1152 25.16
Live with mother at the first wave 4181 91.31 398 8.69

Bivariate descriptives

Fig 2 and S2 Table show the correlation between the variables in our analyses. Children’s years of education is positively correlated with the education of the father, mother and with children’s education PGS (0.45, 0,43 and 0,38 respectively). Furthermore, both father’s and child’s years of education are positively correlated with father’s school-specific involvement (0.14 and 0.17 respectively) and father’s leisure involvement (0.14 and 0.12 respectively). The child’s education PGS is positively correlated with father’s school-specific involvement (0.09) and father’s leisure involvement (0.06). There is also a positive correlation between the two different aspects of father involvement of 0.30 (father’s school-specific involvement and father’s leisure involvement).

Fig 2. Correlation table graphically displayed.

Fig 2

Non-significant correlations are not displayed. The size and color represent respectively the strength and direction of the correlation. Education = Years of education child; Education father = Years of education father; Education mother = Years of education mother; pgs = Child’s Polygenic score for education; father school contact = father’s school-specific involvement; father activities = father’s leisure involvement; mother school contact = Mother’s school-specific involvement; mother activities = mother’s leisure involvement; age w1 = age at wave 1; age final = age at the final wave; father resident = lived with father at wave 1, 0 = no 1 = yes; mother resident = lived with mother at wave 1, 0 = no 1 = yes; enrolled w1: was the respondent enrolled in education at wave 1, 0 = no, 1 = yes.

Part 1) Regression results: Mediation

Our first two hypotheses focused on the extent to which the association between fathers’ years of education and children’s years of education is mediated by father involvement (hypothesis 1) and the child’s education PGS (hypothesis 2). To this end, we estimated path models in which we examined the direct association between father’s educational attainment and children’s educational attainment and the significance of the indirect effects via father involvement and education PGS. All measures are standardized. The results are displayed in S1 Table and Fig 3. The total effect of father’s education on child’s education is 0.303 (Se 0.016). Father’s school-specific involvement mediates 2.3% of the intergenerational transmission (0.007 of the total effect of 0.303), father’s leisure involvement mediates 1.3% (0.004 of 0.303), and the education PGS mediates 21.45% (0.065 of 0.303). These findings support both hypotheses 1 and 2 and show that both genes and father involvement are significant mediators. In addition, our findings show that the education PGS mediates a much larger proportion of the intergenerational transmission of father’s years of education than our two measures of father involvement.

Fig 3. Graphical presentation of the mediation and confounding results.

Fig 3

G mediate refers to the part of the effect that is genetically mediated, FLI mediate refers to the mediated effect by father’s leisure involvement, FSI mediate refers to the mediated effect by father’s school-specific involvement, G confound to the genetically confounded part and S confound to the social confounding by father involvement. The controls included in this model are years of education mother, age child at wave 1, child’s sex, resident father or not, enrolled in education, resident mother or not, mother’s school involvement, mother’s leisure involvement and the first 10 genetic PC’s. All measures are standardized. Standard errors are displayed in parentheses, and non-significant estimates in light grey.

Part 2) rGE

Table 2 displays the correlations between the child’s education PGS and our two dimensions of father involvement. The correlated effects are displayed in the left panel of Table 2. We find correlated effects between father’s school-specific involvement and the education PGS of 0.092, and somewhat smaller positive correlated effects between father’s leisure involvement and the education PGS of 0.063 (see Table 2 left panel).

Table 2. Correlations between education PGS and father involvement: Correlated effects, between-family, and within-family correlations.

Education PGS
Correlated effects Between families Within families
Corr. 95% CI β 95% CI Corr. 95% CI
Father’s school-specific involvement 0.092*** 0.063–0.121 0.088*** 0.059–0.118 0.096 -0.004–0.194
Father’s leisure involvement 0.063*** 0.034–0.092 0.058*** 0.029–0.088 0.006 -0.094–0.107
N individuals 4579 4579 760
N families 4154 4154 380

*p<0.05,

**p<0.01,

***p<0.001.

The between-family associations are based on multilevel regression models, in which father involvement is explained by the education PGS while controlling for the first 10 genetic PCs and taking into account the nested structure of the data. We display the standardized beta coefficients. The within-family estimates refer to the correlation between the difference in education PGS between siblings with the difference in father involvement between siblings.

Part 3) Genetic confounding

Thirdly, we were interested in genetic confounding; the extent to which accounting for genetic influences reduces the association between father involvement and children’s educational attainment. Results are displayed in Fig 3. The association between father’s school-specific involvement and children’s educational attainment is reduced from 0.056 to 0.050 when child’s education PGS is added to the model, which implies 10.7% genetic confounding, while the association between father’s leisure involvement and children’s educational attainment is not significantly reduced when including children’s education PGS. The former shows that a small, but nonnegligible, part of the role that father’s school-specific involvement plays in the intergenerational transmission of years of education is genetically confounded, which supports hypothesis 4. The latter shows that the role that father’s leisure involvement plays in the intergenerational transmission of years of education is not genetically confounded, which contrasts with hypothesis 4.

Part 4) Social confounding

Furthermore, we were interested in social confounding: the extent to which the role of children’s genes in the intergenerational transmission of years of education is reduced when information on father involvement is included in the model. Results are again displayed in Fig 3. Our results showed that, once father involvement is added to the model, the role of children’s education PGS as the underlying mechanism is only slightly reduced, from 0.220 to 0.219, which implies a mere 0.5% social confounding. Furthermore, this social confounding only holds for the father’s school-specific involvement. These findings support hypothesis 5 but are very small in magnitude.

Part 5) rGE within versus between families

Comparing the between-families and within-families associations between father involvement and children’s education PGS provides insights into the extent to which the rGE is either active/evocative or passive. For father’s school-specific involvement, we do not find evidence for within-family correlation, but we do find a between-family correlation (0.088, 95% CI 0.059–0.118), see Table 2. Similarly, for father’s leisure involvement, we only find a between-family correlation (0.058, 95% CI 0.029–0.088), implying passive rGE from the side of the child. These findings are in line with hypothesis 6, showing passive rGE, but are in contrast to hypothesis 7 on active or child-evoked rGE for father’s school-specific involvement.

Robustness checks

We conducted several additional analyses to assess the robustness of our findings. To assess the robustness of our findings with respect to the education PGS, we tested whether and to what extent our findings were inflated by indirect effects, due to population stratification, or due to assortative mating. We found that the within-family effect of the education PGS on the education of the child was smaller than the between family effect, yet it remained significant, which shows that controlling for indirect effects and population stratification does not fully explain our findings regarding the association between the education PGS and years of education. Our findings indicate that assortative mating could to a small extent result in an overestimation of the effect of the education PGS (see S1 Robustness checks in S1 File). We furthermore assessed whether the association between our measures of father involvement and education depends on the age of the respondents during the first interview. Our robustness checks reveal it does not (see S2 Robustness checks in S1 File). We also assessed whether multicollinearity issues arise when father involvement and mother involvement were added simultaneously to the model. Our analyses revealed no multicollinearity issues (see S2 Robustness checks in S1 File). We furthermore assessed whether the results in our sample with only resident fathers differed from our findings in the complete sample and found that the results are largely comparable and do not change any substantive conclusions. Finally, we examined the robustness of our findings by assessing whether our results differ when we use a parent-report or child-report on parental education. Our analyses revealed that our conclusions are similar, irrespective of the reported used (see S3 Robustness checks in S1 File).

Conclusion and discussion

This study aimed to gain a better understanding of the mechanisms, and their interrelatedness, underlying the intergenerational transmission of educational attainment. To this end, we answered five interrelated questions.

Our first question was to examine whether and to what extent father involvement and children’s education PGS were unique and independent mechanisms underlying this intergenerational transmission of educational attainment. Our results revealed that both dimensions of father involvement as well as children’s education PGS were unique mechanisms. Noteworthy is that our results indicated that children’s education PGS was a much more important underlying mechanism than the two dimensions of father involvement we considered in the current study. Our relatively small effects of the family environment compared to the genetic component is in contrast to findings from Allegrini and colleagues [10], who found that multiple PGSs can account for 15% of the variance in educational attainment, while environmental factors can account for 28%. The higher percentage for environmental factors found in their study is likely attributable to the fact that they used a much broader indication of the family environment, consisting of amongst others the educational attainment of both parents, employment status of parents, chaos at home, and important life events.

Our second question focused on the extent to which children’s education PGS and father involvement correlate as mechanisms underlying the intergenerational transmission of educational attainment. Our results indicated that both mechanisms indeed correlate. These findings add to the previous research that shows a correlation between other aspects of parenting and the education PGS, such as parental warmth and sensitivity [9]. This correlation implies that many children are either growing up with double disadvantages–having both a less involved father and a lower education PGS–or double advantages–a highly involved father and a higher education PGS. This highlights the importance for social scientists to take genetic influences into account when examining the importance of parenting in the intergenerational transmission of education, and for genetic scientists to take into account social pathways.

Our third question centred on genetic confounding; to what extent does children’s education PGS account for the role that father involvement plays in the intergenerational transmission of educational attainment. Our results indicated that genetic confounding plays a part, albeit small, in accounting for the role that fathers’ school-specific involvement plays as an underlying mechanism in the intergenerational transmission of educational attainment. This indicates that findings from the field of behavioural sciences have likely, at least to some extent, overestimated the role that fathers’ school-specific involvement plays in the intergenerational transmission of educational attainment. It furthermore suggests that part of “what we think of as measures of ‘environment’ are better described as external factors that might be partly under genetic control” [65]. However, and intriguingly, the role of fathers’ leisure involvement as an underlying mechanism hardly changed when information on children’s genes was added to the model. Furthermore, fathers’ school-specific involvement was not fully mediated by the children’s education PGS, indicating that also school-specific involvement independently explains the intergenerational transmission of education. Also, our suggestive finding of active gene-environment correlation within our sibling sample does not exclude the possibility that genetic confounding is largely caused by child evoked genetic correlation. Our findings regarding the relevance of father involvement in the intergenerational transmission of educational attainment therefore suggest that father involvement should not be dismissed as a potential candidate for an intervention to aid in breaking the intergenerational cycle of (dis)advantage. Several interventions, such as educational sessions on parenting skills, as well as policies, such as parental leave reserved for fathers, have been proven to increase father’s involvement [66, 67].

Our fourth question revolved around behavioural confounding; to what extent is the role that children’s education PGS plays in the intergenerational transmission of educational attainment accounted for by father involvement. Our results indicate that behavioural confounding plays a negligible role. These findings differ from the findings from previous research that examined social confounding of genetic effects [8, 10]. Again, these differences are likely attributable to differences in how broadly the family environment was defined.

Fifth, we assessed active/evocative versus passive rGE by looking at correlations within and between families. Our findings reveal that the correlation between the education PGS and activities between father and child is more passive from the perspective of the child, given that we do not find a within-family correlation. This indicates that children with a high education PGS grow up in families with highly educated fathers/fathers with a high education PGS, who have a more active parenting style and therefore perform more activities with their child.

Some limitations need to be considered when interpreting these findings. First, we believe that our measures of father involvement are relevant and cover important dimensions of involvement of fathers, yet there are other aspects of father involvement that are also relevant that we did not cover. Our measures largely tap into the quantity of involvement (how much do fathers and children discuss school matters and undertake activities together). Other studies have emphasized that pertaining to children’s educational outcomes, the quality of parental involvement is also important [68]. It is therefore likely that the current study only yielded an underestimation of the role father involvement plays in the intergenerational transmission of educational attainment.

Similarly, our genetic measure only captures a part of the genetic component of education, as twin studies showed a heritability of 40% [6] while the education PGS accounts for approximately 10%. This discrepancy between relatively high heritability estimates from twin studies and lower explained variance from PGSs has been called ‘missing heritability’ and has been found for many traits [69]. Possible reasons are amongst others that PGSs only captures common genetic variants and only additive effects, while rare variants and non-additive effects might be relevant as well. Therefore, our estimates of genetic mediation of the intergenerational transmission of education, and genetic confounding of father involvement, are underestimations and only capture part of the genetic effect. Furthermore, it is important to take into account that PGSs do not imply genetic causality, but are merely based on associations between genetic variants and phenotypes (in our case educational attainment). Therefore, we cannot say that the mediating role of the PGS indicates that this part of intergenerational transmission is due to genes. In fact, this study shows that one of the pathways between PGS and educational attainment is environmental, namely through father involvement.

A suggestion for future research would be to include the genotype of the father, as is done with mother’s genotypes in the study by Wertz and colleagues [8]. By incorporating father’s genotype, on would be able to examine to what extent the relationship between father’s education and father’s involvement is due to genetic transmission. It would furthermore allow one to examine the extent to which the non-transmitted part of the father’s genotype is related to the child’s education, and to what extent the association between the non-transmitted part of the genotype and the child’s education is accounted for by parental involvement.

When interpreting our findings regarding father involvement, we must consider that our data allow us to investigate the role of fathers in the United States in 1994. At that time, involvement of fathers was much lower than in more recent cohorts [70]. The greater involvement of fathers in more recent cohorts could imply that fathers play an even bigger role in the educational achievements of their children, and therefore in the intergenerational transmission of education in these cohorts. However, the group of fathers that exhibited strong involvement in their children’s lives was most likely more selective in older cohorts than in recent ones. Consequently, it might also be the case that father involvement played a less substantial role as underlying mechanism in more recent cohorts than in older ones.

Our study is limited by the fact that our sample only includes respondents of European ancestry (i.e. White respondents). The reason is that the GWAS for educational attainment is based only on European ancestry individuals, and PGSs created from such GWASs have lower predictive power among non-European samples [71]. Therefore, we cannot generalize our findings to other ancestry groups. For this reason, we also cannot use our findings to explain differences in educational achievement between different ancestry groups.

To summarize, we find that both genes and father involvement are underlying mechanisms in the intergenerational transmission of educational attainment, that these mechanisms are correlated with each other, and that part of the role that fathers’ school-specific involvement plays as underlying mechanism is confounded by children’s education PGS. Our findings underscore the need to control for genetic effects in studies that examine the role of parenting in the intergenerational transmission of inequality, but also the need to control for parental involvement and the family environment in general when considering the role that genes play in this intergenerational transmission. Thus, our study underscores that in order to fully understand the mechanisms that underly intergenerational reproduction of (dis)advantages, scholars need to integrate both insights and data from different disciplines.

Supporting information

S1 File

(DOCX)

Acknowledgments

Add Health was designed by J. Richard Udry, Peter S. Bearman, and Kathleen Mullan Harris at the University of North Carolina at Chapel Hill.

Data Availability

Data cannot be shared publicly because of the extensive restricted-use data. Information on how to obtain the Add Health data files is available on the Add Health website (http://www.cpc.unc.edu/addhealth). The data underlying the results presented in the study are available from CPC Data Portal (https://data.cpc.unc.edu/projects/2/view). Restricted-use data will be distributed only to certified researchers who commit themselves to maintaining limited access. The authors had no special access privileges, and other researchers will be able to access the data in the same manner.

Funding Statement

The present study was supported by a grant from the Netherlands Organization for Scientific Research to RK (NWO MaGW VIDI; grant no. 452-17-005) (https://www.nwo.nl/en) and by a grant from the European Research Council to RK (ERC StG; grant no. 757210) (https://erc.europa.eu/). Add Health is directed by Robert A. Hummer and funded by the National Institute on Aging cooperative agreements U01 AG071448 (Hummer) and U01AG071450 (Aiello and Hummer) at the University of North Carolina at Chapel Hill. Waves I-V data are from the Add Health Program Project, grant P01 HD319121 (Harris) from Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), with cooperative funding from 23 other federal agencies and foundations The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Pfeffer FT. Persistent inequality in educational attainment and its institutional context. Eur Sociol Rev. 2008;24: 543–565. doi: 10.1093/esr/jcn026 [DOI] [Google Scholar]
  • 2.Passaretta G, Skopek J. Roots and Development of Achievement Gaps. A Longitudinal Assessment in Selected European Countries. Dublin; 2018. Report No.: D1.3. [Google Scholar]
  • 3.OECD. Education at a Glance 2017. Education at a Glance 2017. 2017. 10.1787/eag-2017-74-en [DOI] [Google Scholar]
  • 4.Keizer R, Van Lissa CJ, Tiemeier H, Lucassen N. The Influence of Fathers and Mothers Equally Sharing Childcare Responsibilities on Children’s Cognitive Development from Early Childhood to School Age: An Overlooked Mechanism in the Intergenerational Transmission of (Dis)Advantages? Eur Sociol Rev. 2020;36: 1–15. doi: 10.1093/esr/jcz046 [DOI] [Google Scholar]
  • 5.Lareau A. Invisible inequality: Social class and childrearing in black families and white families. Am Sociol Rev. 2002;67: 747–776. doi: 10.2307/3088916 [DOI] [Google Scholar]
  • 6.Branigan AR, McCallum KJ, Freese J. Variation in the Heritability of Educational Attainment: An International Meta-Analysis. Soc Forces. 2013;92: 109–140. doi: 10.1093/sf/sot076 [DOI] [Google Scholar]
  • 7.Krapohl E, Hannigan LJ, Pingault J-B, Patel H, Kadeva N, Curtis C, et al. Widespread covariation of early environmental exposures and trait-associated polygenic variation. Proc Natl Acad Sci. 2017; 201707178. doi: 10.1073/pnas.1707178114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wertz J, Moffitt TE, Agnew-Blais J, Arseneault L, Belsky DW, Corcoran DL, et al. Using DNA from mothers and children to study parental investment in children’s educational attainment. Child Dev. 2020;91: 1745–1761. doi: 10.1111/cdev.13329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wertz J, Belsky J, Moffitt TE, Belsky DW, Harrington H, Avinun R, et al. Genetics of nurture: A test of the hypothesis that parents’ genetics predict their observed caregiving. Dev Psychol. 2019. doi: 10.1037/dev0000709 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Allegrini AG, Karhunen V, Coleman JRI, Selzam S, Rimfeld K, Von Stumm S, et al. Multivariable G-E interplay in the prediction of educational achievement. PLoS Genet. 2020; 1–20. doi: 10.1371/journal.pgen.1009153 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bates TC, Maher BS, Medland SE, McAloney K, Wright MJ, Hansell NK, et al. The Nature of Nurture: Using a Virtual-Parent Design to Test Parenting Effects on Children’s Educational Attainment in Genotyped Families. Twin Res Hum Genet. 2018;21: 73–83. doi: 10.1017/thg.2018.11 [DOI] [PubMed] [Google Scholar]
  • 12.Hook JL. Care in context: Men’s unpaid work in countries, 1965–2003. Am Sociol Rev. 2006;71: 639–660. [Google Scholar]
  • 13.Yeung WJ, Sandberg JF, Davis-kean PE, Hofferth SL. Children’s Time With Fathers in Intact Families. J marriage Fam. 2010;63: 136–154. [Google Scholar]
  • 14.Cabrera NJ, Tamis-lemonda CS, Bradley RH, Hofferth S, Lamb ME. Fatherhood in the Twenty-First Century. 2000;71: 127–137. doi: 10.1111/1467-8624.00126 [DOI] [PubMed] [Google Scholar]
  • 15.Fagan J, Lamb ME, Cabrera NJ. Should Researchers Conceptualize Differently the Dimensions of Parenting for Fathers and Mothers? J Fam Theory Rev. 2014;6: 390–405. doi: 10.1111/jftr.12044 [DOI] [Google Scholar]
  • 16.Kim SW, Hill NE. Including fathers in the picture: A meta-analysis of parental involvement and students’ academic achievement. J Educ Psychol. 2015;107: 919–934. doi: 10.1037/edu0000023 [DOI] [Google Scholar]
  • 17.Altintas E, Sullivan O. Trends in fathers’ contribution to housework and childcare under different welfare policy regimes. Soc Polit. 2017;24: 81–108. doi: 10.1093/sp/jxw007 [DOI] [Google Scholar]
  • 18.Guryan J, Hurst E, Kearney M. Parental Education and Parental Time with Children. J Econ Perspect. 2008;22: 23–46. doi: 10.1257/jep.22.3.23 [DOI] [Google Scholar]
  • 19.Aughinbaugh A, Robles O, Sun H. Marriage and divorce: patterns by gender, race, and educational attainment. Mon Labor Rev. 2013. doi: 10.21916/mlr.2013.32 [DOI] [Google Scholar]
  • 20.Kitterød RH, Lyngstad J. Characteristics of parents with shared residence and father sole custody. Evidence from Norway 2012. 2014.
  • 21.Gordon MS. Self-perception and relationship quality as mediators of father’s school-specific involvement and adolescent’s academic achievement. Child Youth Serv Rev. 2017;77: 94–100. doi: 10.1016/j.childyouth.2017.04.001 [DOI] [Google Scholar]
  • 22.Whitney SD, Prewett S, Wang Z, Chen H. Fathers’ Importance in Adolescents’ Academic Achievement. Int J Child, Youth Fam Stud. 2018;8: 101. doi: 10.18357/ijcyfs83/4201718073 [DOI] [Google Scholar]
  • 23.Nielsen F. Achievement and ascription in educational attainment: Genetic and environmental influences on adolescent schooling. Soc Forces. 2006;85: 193–216. doi: 10.1353/sof.2006.0135 [DOI] [Google Scholar]
  • 24.Johnson W, Deary IJ, Iacono WG. Genetic and environmental transactions underlying educational attainment. Intelligence. 2009;37: 466–478. doi: 10.1016/j.intell.2009.05.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schulz W, Schunck R, Diewald M, Johnson W. Pathways of Intergenerational Transmission of Advantages during Adolescence: Social Background, Cognitive Ability, and Educational Attainment. J Youth Adolesc. 2017;46: 2194–2214. doi: 10.1007/s10964-017-0718-0 [DOI] [PubMed] [Google Scholar]
  • 26.Lee JJ, Wedow R, Okbay A. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50: 1112–1121. doi: 10.1038/s41588-018-0147-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Maier RM, Visscher PM, Robinson MR, Wray NR. Embracing polygenicity: a review of methods and tools for psychiatric genetics research. Psychol Med. 2017; 1–19. doi: 10.1017/S0033291717002318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Varghese C, Wachen J. The Determinants of Father Involvement and Connections to Children’s Literacy and Language Outcomes: Review of the Literature. Marriage Fam Rev. 2016;52: 331–359. doi: 10.1080/01494929.2015.1099587 [DOI] [Google Scholar]
  • 29.Morales-castillo M. Family Contributions to School Performance of Adolescents: The Role of Fathers ‘ Perceived Involvement. J Fam Issues. 2021. doi: 10.1177/0192513X21994143 [DOI] [Google Scholar]
  • 30.Wilder S. Effects of parental involvement on academic achievement: A meta-synthesis. Educ Rev. 2014;66: 377–397. doi: 10.1080/00131911.2013.780009 [DOI] [Google Scholar]
  • 31.Pomerantz EM, Moorman EA, Litwack SD. The How, Whom, and Why of Parents ‘ Involvement in Children ‘ s Academic Lives: More Is Not Always Better. Rev Educ Res. 2007;77: 373–410. doi: 10.3102/003465430305567 [DOI] [Google Scholar]
  • 32.Hill NE, Castellino DR, Lansford JE, Nowlin P, Dodge KA, Pettit GS. Parent Academic Involvement as Related to School Behavior, Achievement, and Aspirations: Demographic Variations Across Adolescence. Child Dev. 2009;75: 1491–1509. doi: 10.1111/j.1467-8624.2004.00753.x.Parent [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Cano T, Perales F, Baxter J. A Matter of Time: Father Involvement and Child Cognitive Outcomes. J Marriage Fam. 2018;81: 164–184. doi: 10.1111/jomf.12532 [DOI] [Google Scholar]
  • 34.Huerta MC, Adema W, Baxter J, Han W-J, Lausten M, Lee R, et al. Fathers’ Leave, Fathers’ Involvement, and Child Development: Are They Related? Evidence from Four OECD Countries. Paris; 2013. Report No.: 140. [Google Scholar]
  • 35.Eccles JS, Harold RD. Family involvement in children’s and adolescents’ schooling. In: Booth A, Dunn JF, editors. Family-school links: How do they affect educational outcomes. 1996. pp. 3–34. [Google Scholar]
  • 36.Hoover-Dempsey K V., Sandler HM. Why do parents become involved in their children’s education? Rev Educ Res. 1997;67: 3–42. doi: 10.3102/00346543067001003 [DOI] [Google Scholar]
  • 37.Byford M, Kuh D, Richards M. Parenting practices and intergenerational associations in cognitive ability. Int J Epidemiol. 2012;41: 263–272. doi: 10.1093/ije/dyr188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Miller DP, Thomas MMC, Waller MR, Nepomnyaschy L, Emory AD. Father Involvement and Socioeconomic Disparities in Child Academic Outcomes. J Marriage Fam. 2020;82: 515–533. doi: 10.1111/jomf.12666 [DOI] [Google Scholar]
  • 39.Matsuoka R, Nakamuro M, Inui T. Emerging inequality in effort: A longitudinal investigation of parental involvement and early elementary school-aged children’s learning time in Japan. Soc Sci Res. 2015;54: 159–176. doi: 10.1016/j.ssresearch.2015.06.009 [DOI] [PubMed] [Google Scholar]
  • 40.Krapohl E, Rimfeld K, Shakeshaft NG, Trzaskowski M, McMillan A, Pingault J-B, et al. The high heritability of educational achievement reflects many genetically influenced traits, not just intelligence. Proc Natl Acad Sci. 2014;111: 15273–15278. doi: 10.1073/pnas.1408777111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Smith-Woolley E, Selzam S, Plomin R. Polygenic Score for Educational Attainment Captures DNA Variants Shared Between Personality Traits and Educational Achievement. J Pers Soc Psychol. 2019. doi: 10.1037/pspp0000241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Liu H. Social and Genetic Pathways in Multigenerational Transmission of Educational Attainment. Am Sociol Rev. 2018;83: 278–304. doi: 10.1177/0003122418759651 [DOI] [Google Scholar]
  • 43.Ayorech Z, Krapohl E, Plomin R, von Stumm S. Genetic Influence on Intergenerational Educational Attainment. Psychol Sci. 2017;28: 1302–1310. doi: 10.1177/0956797617707270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Krapohl E, Plomin R. Genetic link between family socioeconomic status and children’s educational achievement estimated from genome-wide SNPs. Mol Psychiatry. 2016;21: 437–443. doi: 10.1038/mp.2015.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Plomin R, DeFries JC, Loehlin JC. Genotype-environment interaction and correlation in the analysis of human behavior. Psychol Bull. 1977;84: 309–322. doi: 10.1037/0033-2909.84.2.309 [DOI] [PubMed] [Google Scholar]
  • 46.Ayoub M, Briley DA, Grotzinger A, Patterson MW, Engelhardt LE, Tackett JL, et al. Genetic and Environmental Associations Between Child Personality and Parenting. Soc Psychol Personal Sci. 2019;10: 711–721. doi: 10.1177/1948550618784890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Baizán P, Domínguez M, González MJ. Couple Bargaining or Socio-Economic Status?: Why some parents spend more time with their children than others. Eur Soc. 2014;16: 3–27. doi: 10.1080/14616696.2013.859717 [DOI] [Google Scholar]
  • 48.Avinun R, Knafo A. Parenting as a Reaction Evoked by Children’s Genotype: A Meta-Analysis of Children-as-Twins Studies. Personal Soc Psychol Rev. 2014;18: 87–102. doi: 10.1177/1088868313498308 [DOI] [PubMed] [Google Scholar]
  • 49.Young AI, Benonisdottir S, Przeworski M, Kong A. Deconstructing the sources of genotype-phenotype associations in humans. Science (80-). 2019;365: 1396–1400. doi: 10.1126/science.aax3710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.de Zeeuw EL, Hottenga JJ, Ouwens KG, Dolan C V., Ehli EA, Davies GE, et al. Intergenerational Transmission of Education and ADHD: Effects of Parental Genotypes. Behav Genet. 2020;50: 221–232. doi: 10.1007/s10519-020-09992-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kong A, Thorleifsson G, Frigge ML, Vilhjalmsson BJ, Young AI, Thorgeirsson TE, et al. The nature of nurture: Effects of parental genotypes. Science (80-). 2018;359: 424–428. doi: 10.1126/science.aan6877 [DOI] [PubMed] [Google Scholar]
  • 52.Hwang L, Tubbs JD, Luong J, Lundberg M, Moen H, Wang G, et al. Estimating indirect parental genetic effects on offspring phenotypes using virtual parental genotypes derived from sibling and half sibling pairs. PLoS Genet. 2020; 1–29. doi: 10.1371/journal.pgen.1009154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Klahr AM, Burt SA. Elucidating the Etiology of Individual Differences in Parenting: A Meta-Analysis of Behavioral Genetic Research. Psychol Bull. 2014;140: 544–586. doi: 10.1037/a0034205 [DOI] [PubMed] [Google Scholar]
  • 54.Guay F, Ratelle CF, Duchesne S, Dubois P. Mothers’ and fathers’ autonomy-supportive and controlling behaviors: An analysis of interparental contributions. Parenting. 2018;18: 45–65. doi: 10.1080/15295192.2017.1337461 [DOI] [Google Scholar]
  • 55.Barnett MA, Deng M, Mills-Koonce WR, Willoughby M, Cox M. Interdependence of Parenting of Mothers and Fathers of Infants. J Fam Psychol. 2008;22: 561–573. doi: 10.1037/0893-3200.22.3.561 [DOI] [PubMed] [Google Scholar]
  • 56.Pleck JH, Hofferth SL. Mother Involvement as an Influence on Father Involvement with Early Adolescents. Fathering. 2008;6: 1–7. doi: 10.3149/fth.0603.267 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Harris KM, Halpern CT, Whitsel EA, Hussey JM, Killeya-jones LA, Tabor J, et al. Cohort Profile: The National Longitudinal Study of Adolescent to Adult Health (Add Health). Int J Epidemiol. 2019; 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Okbay A, Turley P, Benjamin D, Visscher PM, Braudt D, Mullan Harris K. SSGAC Polygenic Scores (PGSs) in the National Longitudinal Study of Adolescent to Adult Health (Add Health). 2018. [Google Scholar]
  • 59.Highland HM, Avery CL, Duan Q, Li Y, Harris KM. Quality control analysis of Add Health GWAS data. Chapel Hill; 2018. [Google Scholar]
  • 60.Liu H, Motz RT, Tanksley PT, Barnes JC, Harris KM. Adolescent Criminal Justice Involvement, Educational Attainment, and Genetic Inheritance: Testing an Integrative Model Using the Add Health Data. Journal of Developmental and Life-Course Criminology. Springer International Publishing; 2021. doi: 10.1007/s40865-021-00166-8 [DOI] [Google Scholar]
  • 61.Domingue BW, Belsky DW, Conley D, Mullan Harris K, Boardman JD. Polygenic Influence on Educational Attainment: New evidence from The National Longitudinal Study of Adolescent to Adult Health. AERA Open. 2015;1: 1–13. doi: 10.1177/2332858415599972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Rosseel Y. lavaan: An R Package for Structural Equation. J Stat Softw. 2012;48: 1–36. doi: 10.18637/jss.v048.i02 [DOI] [Google Scholar]
  • 63.Snijders TAB, Bosker RJ. Multilevel Analysis: An introduction to basic and advanced multilevel modeling. 2nd ed. London: Sage publications; 2012. [Google Scholar]
  • 64.Mackinnon DP, Krull JL, Lockwood CM. Equivalence of the Mediation, Confounding and Suppression Effect. Prev Sci. 2000;1. doi: 10.1023/a:1026595011371 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Vinkhuyzen AAE, van der Sluis S, de Geus EJC, Boomsma DI, Posthuma D. Genetic influences on ‘environmental’ factors. Genes, Brain Behav. 2010;9: 276–287. doi: 10.1111/j.1601-183X.2009.00554.x [DOI] [PubMed] [Google Scholar]
  • 66.Kalembo FW, Kendall GE. A systematic review of interventions that have the potential to foster engaged fathering to enhance children’s health and development. Child Fam Soc Work. 2021; 1–22. doi: 10.1111/cfs.12897 [DOI] [Google Scholar]
  • 67.Huerta MC, Adema W, Baxter J, Han WJ, Lausten M, Lee R, et al. Fathers’ Leave and Fathers’ Involvement: Evidence from Four OECD Countries. Eur J Soc Secur. 2014;16: 308–346. doi: 10.1177/138826271401600403 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Hoskins D. Consequences of parenting on adolescent outcomes. Societies. 2014;4: 506–531. doi: 10.3390/soc4030506 [DOI] [Google Scholar]
  • 69.Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature. 2009;461: 747–53. doi: 10.1038/nature08494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Dotti Sani GM, Treas J. Educational gradients in parents’ child-care time across countries, 1965–2012. J Marriage Fam. 2016;78: 1083–1096. doi: 10.1111/jomf.12305 [DOI] [Google Scholar]
  • 71.Ware EB, Schmitz LL, Faul J, Gard A, Mitchell C, Smith JA, et al. Heterogeneity in polygenic scores for common human traits. bioRxiv. 2017; 1–13. [Google Scholar]

Decision Letter 0

Marie-Pierre Dubé

17 Jun 2022

PONE-D-22-08657The intergenerational transmission of educational attainment: A closer look at the (interrelated) roles of paternal involvement and genetic inheritancePLOS ONE

Dear Dr. Verweij,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Aug 01 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Marie-Pierre Dubé, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"The present study was supported by a grant from the Netherlands Organization for Scientific Research to RK (NWO MaGW VIDI; grant no. 452-17-005) and by a grant from the European Research Council to RK (ERC StG; grant no. 757210). Both authors declare no conflict of interest. 

Add Health is directed by Robert A. Hummer and funded by the National Institute on Aging cooperative agreements U01 AG071448 (Hummer) and U01AG071450 (Aiello and Hummer) at the University of North Carolina at Chapel Hill. Waves I-V data are from the Add Health Program Project, grant P01 HD31921 (Harris) from Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), with cooperative funding from 23 other federal agencies and foundations. Add Health was designed by J. Richard Udry, Peter S. Bearman, and Kathleen Mullan Harris at the University of North Carolina at Chapel Hill."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

"The present study was supported by a grant from the Netherlands Organization for Scientific Research to RK (NWO MaGW VIDI; grant no. 452-17-005) (https://www.nwo.nl/en) and by a grant from the European Research Council to RK (ERC StG; grant no. 757210) (https://erc.europa.eu/)."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

Reviewer #4: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Reviewer #4: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you for inviting me to review this manuscript, which analyses the contributions of genes and paternal involvement to the intergenerational transmission of educational attainment. Data come from the Add Health Study of 5,021 genotyped adolescents and their parents. The study finds that both genes and father involvement (over and above mother involvement) are contribute to the intergenerational transmission of educational attainment; that these mechanisms are correlated with each other; and that part of the association of fathers’ school- specific involvement and educational attainment is potentially confounded by children’s education genetics.

Overall I thought this study was well-done and well-written. I particularly appreciated how the manuscript very clearly and systematically laid out and tested each hypothesis. I only have a few comments; most of these are about the language used when describing how different variables relate to each other. The study cannot establish causality, yet the language that is used often implies causality. This is problematic especially in genetics studies, because genetics studies are very prone to misinterpretation (which can have real-life devastating consequences, see recent events in the US). I used the following guide to replace terms in the paper, I may not have found every instance of the words used though and would ask the authors to carefully go through their manuscript and replace causal language with the appropriate wording:

shaping = is associated with

results in = is associated with

has an effect on = is associated with

impacts on = is associated with

leads to = is associated with

My list of suggested word-changes with page number is appended below; other comments I had were the following:

(1) For a general readership, it would be helpful to explain a bit more what a polygenic score is (i.e. explain what a GWAS does; emphasise that this is an aggregate score made up of many genetic variants, i.e. it is not a candidate gene study

(2) On Page 4 and 6 it says in the headlines for each hypothesis “independent” (e.g. “Genetic influences as an independent mechanism underlying the intergenerational transmission of educational attainment”) – it is not clear what the “independent” refers to (independent of what?) - would delete or clarify

(3) On Page 7 it says “children inherit half of their genes from each parent, and these parents also rear them and shape their environment.” it would be good to emphasise that this is only the case in biological families, which not all families are (e.g. sth like “children inherit half of their genes from each biological parent, and, if children live with their biological parents, these same parents also rear them and shape their environment)

(4) The distinction between direct and indirect effects (p9) is a little vague. Direct effects are genetic associations with an individual’s outcome that originate in that individual’s genetics; indirect genetic effects are associations that originate in another individual’s genetics (e.g. parents). In the current description it sounds as if direct genetic effects refer to effects that are mediated by a person’s behaviour, and indirect genetic effects are effects that are mediated by environments. However, this is incorrect, because effects of an individual’s genetics could be mediated by the environment (e.g. via evocative gene-environment correlation) yet they would still be direct effects (because they originate in that individual’s genetics). See also this paper:

Young, A. I., Benonisdottir, S., Przeworski, M., & Kong, A. (2019). Deconstructing the sources of genotype-phenotype associations in humans. Science, 365(6460), 1396-1400.

(5) The methods say “Non-European descent individuals were removed from the sample” – how was this done? Using self-identified ancestry? Genetically-identified ancestry?

(6) What are the estimates in Table 2 – standardised beta coefficients? Or unstandardised estimates? Please clarify.

(7) Authors interpret their findings to suggest that “that a substantial part of the role the father’s school-specific involvement plays in the intergenerational transmission of years of education is genetically confounded” (p 21), but the confounding is 10%. That doesn’t seem very substantial. Would rephrase accordingly.

(8) Another proposed robustness check: what happens, at least for the main analyses (as the sibling sample would probably end up too small), if the sample is restricted to those where children live with their father?

Here are my suggested changes of wording to replace language that implies causality (not these are suggested changes – happy for the authors to use their own wording, as long as it fixes the issue).

(9) page 6, change “40% of the variation in education can be explained by genetic variation” to “40% of the variation in education is associated with genetic variation”, because “explained” implies causality, and there is nothing causal about variance decomposition analysis. Likewise, page 6 “Children with genes that are positively related to higher educational attainment are more open”, change to “Children with genes that are positively related to higher educational attainment tend to be more open” or “On average, children with genes..” and then the second part of the sentence reads “..which all result in better educational achievements”, which again heavily implies causality and should be changed to sth like “which are linked with better educational achievements”.

(10) page 7 “the parent’s education PGS not only shapes their own educational attainment” should be sth like “the parent’s education PGS is not only associated with their own educational attainment”.

(11) page 8 “both fathers’ involvement as well children’s educational attainment is shaped by the same genetic factors” replace with “both fathers’ involvement as well children’s educational attainment is associated with the same genetic factors” ; same page “the same genes that result in higher education, also result in” should say “the same genes that are associated with higher education, are also associated with..”

(12) page 10: “do genes and father involvement independently explain the intergenerational transmission of education” change to sth like “are genes and father involvement independently associated with the intergenerational transmission of education” ; “to what extent do genes explain part of the behaviour mechanism” change to “to what extent do genes account for the behaviour mechanism” or “to what extent do genes confound the behaviour mechanism”

(13) page 11: “we can tap into the causes of the hypothesized correlation ” change to “we can test the hypothesized causes of any correlation ”

Reviewer #2: The manuscript “The intergenerational transmission of educational attainment: A closer look at the (interrelated) roles of paternal involvement and genetic inheritance” PONE-D-22-08657 investigates the joint effect of paternal involvement and education polygenic score (PGS) in predicting educational attainment, as well as their contribution as mechanisms of intergenerational transmission of education. The study is well motivated (with some reservations described below) to make a reasonable contribution to our understanding in an important topic. However, some empirical decisions and their presentation is confusing and the article overall requires polishing before the publication can be recommended.

I am admittingly not an expert in path modelling, but I struggle to understand the analysis description:

First, could the authors explain the intuition behind the multilevel models assessing rGE? As described on page 16, they fit model of something like:

Education=education_mother+education_father+PCs+controls+ζ+ε,

and then use ε (individual-level residual, I assume, although the authors do not specify which of the two residual terms ζ/ε, or even both, they use) of the model above to fit:

ε = PGS (or corresponding models examining two dimensions of father’s involvement)

What is the advantage of this approach compared to, for example, the more straightforward method of fitting first a model without mediators and then with mediators, and assessing the attenuation between these models?

Second, for hypothesis 3, why there is no need for other controls that PCs (page 17)?

Third, for the description of hypotheses 1, 2 ,4 and 5, it may be beneficial to state explicitly in what way the coefficients are compared. In addition, I would like to see some details on bootstrap simulations (method, has the multilevel structure been taken into account in sampling, how many replications).

Motivational and interpretational issues:

Authors motivate their focus on paternal involvement based on that “we do expect to see greater variation in paternal than in maternal involvement, and this is our main rationale for choosing to focus on paternal involvement in the current paper” (page 3). Based on Table 1, the difference in standard deviations between both dimensions of paternal and maternal involvement seems to be rather trivial. I tend to think that the paper might be stronger if both paternal and maternal involvement were on the focus, but I do not demand such change if authors think, for example, that this makes focus too scattered. Nevertheless, based on the evidence, I do not buy this specific argument for the current focus.

On page 24 authors state that “findings from the field of behavioural sciences have likely overestimated the role that fathers’ school-specific involvement plays in the intergenerational transmission of educational attainment.” Is this interpretation consistent with the results? Within-family analysis did not show any attenuations . Although subject of low power and thus only suggestive, wouldn’t this mean that the correlation may stem fully from active rGE. Would this mean that the causal direction would flow from pgs to father involvement, i.e. the involvement is not confounded by the PGS, but acts as one of the mechanisms via which PGS operates. Am I missing something?

On variables, and related issues:

Does controlling for enrolment in school involve a potential “bad control” problem? It can be a mediator (or even a collider) instead of a confounder given the analysis focus.

On page 15, authors claim to control for “the first 10 principal components (PCs).” PCs of what data? I think I can guess the answer, but scientific writing should not leave readers with guessing games.

Are there overlapping samples between GWAS and analysis data?

Could/should the genotyping chip be controlled in the models?

The relative importance of mediators is hard to assess, as they are all in the different scales. Could they be standardized to SD units?

There is a new generation PGS of education (Okbay et al. 2022, Nature genetics, 54(4), 437-449.). Could the new score be accommodated in the revision? However, the improvement is likely to be marginal, as the increase in sample size comes from 23andme, which usually cannot be shared. Thus, if this cannot be easily done, I understand if the authors want to skip this suggestion.

Reliability assessments of father involvement scales (Cronbach’s α or similar) would be nice.

Presentational issues:

• Standard errors (or Confidence intervals if authors prefer) could be presented in all tables & Figures. If not possible, then at least in the text when referring to estimates. There is also no need for vague description of p-values (e.g. “borderline significant” p.21) or asterisks when referring to them in the text. Precise values could be presented as easily.

• Figures 1-3 could be integrated into different panels of one figure.

• I prefer an old-school correlation matrix (with numbers) as Figure 4 relative to heat map

• Figure 5: It is hard to follow, where 0.016(ns) refers. I guess that between PGS and leisure involvement, but it took its time to understand (if correct).

• “Correlated effects” is as exotic term. What is this actually? The correlation, simply, or something else? Possibly clarifying the methods section may help here as well.

• Overall, it is very unconventional to present outcomes of regression models in the table rows as done in Table 2. This may cause misconceptions.

• There is no need to put different row on “N sibling pairs” in table 2. The old rows, “N individuals” and “N families” could accommodate also sibling design nicely

Issues (again mostly presentational) regarding the analyses of appendix

• Table S1 (upper panel). How can R2 drop between models 2 and 3?

• I would like to see direct effect in the lower panel of table S1

• Table S2 Contrasting OLS and FE models may be a category mistake. Linear FE models are also typically estimated via OLS. And even if not in this specific case, the essential substance-related difference is not the estimation method.

• P value can never be exactly 0

Reviewer #3: This study employed the National Longitudinal Study of Adolescent to Adult Health (Add Health) to provide a deeper understanding of the potential role of paternal involvement in intergenerational transmission in academic attainment. The results revealed that both genetic influences as well as father involvement effectively mediate the association between paternal and offspring academic attainment. I believe this study addresses an interesting topic, particularly its emphasis on paternal involvement, is generally well-written, and has the potential to make a meaningful contribution to the extant literature. With that said, however, there are a few areas that can be improved to more effectively display the underlying contribution and further inform future research. I’ve provided a summary of these areas below with some suggestions for the authors to consider. Best of luck with your revisions and thank you for the opportunity to review your work.

On page 2, the authors, rightfully, point out that previous studies have revealed that the association between family environment and educational attainment may be artificially inflated in light of shared genetic influences passed from parents to offspring that collectively contribute to both increased genetic predisposition for variation in educational attainment as well as the environments that parents design for their children. The latter is, at least to some degree, also a reflection of parent predisposition toward educational attainment—and related phenotypes. The resulting covariation between genetic predisposition and these specialized environments that explain variance in educational attainment is, as the authors note, an example of genetic confounding (as well as a passive rGE more specifically). All of this is to say that I completely agree with the authors assessment of this limitation in the literature, but I think it would be beneficial to expand on the underlying meaning of “genetic confounding” in this context as some readers may not be as familiar with this concept and the theoretical and methodological problems that it may give rise to.

Similar to my previous comment, the authors summarize Wertz et al.’s (2020) findings on pages 2-3 of the manuscript. Again, the authors provide a sufficient and accurate description of the concept of “genetic nurture” but I think a slightly more expanded definition and description of the supposed underlying mechanisms underlying genetic nurture would be beneficial for readers that are either unfamiliar with this concept or who are trying to understand how it applies to the current study more directly.

I think the authors do a great job of setting up their arguments for shifting focus to paternal involvement within the context of the current study throughout the literature review; however, once they reach the penultimate paragraph (the final full paragraph on page 4) I believe the authors can be a bit more direct. They mention that they are examining paternal involvement and genes (from a GWAS), but they do not provide any indication of how they will examine these two sources of influence. Will the GWAS measure simply serve as a control? Will they examine gene-environment interplay? Again, just a couple of sentences here to flesh things out a bit more may provide readers with valuable information regarding the primary goals of the study.

Hypothesis 1 frame paternal involvement as a mechanism of intergenerational educational attainment. Based on the arguments offered by the authors, I believe this is a reasonable hypothesis. With that said, do the authors believe it is at least possible that at least some of the covariation between paternal educational attainment and involvement is the result of a set of a single suite of genetic influences (or related genetic influences)? I think it is at least possible that educational attainment and parental involvement may be the result of shared genetic influences operating on higher order phenotypes (e.g., impulsivity), of which educational attainment and involvement may reflect more proximately. This could be addressed methodologically with paternal GWAS scores for educational attainment (or involvement, I suppose), but I think the authors need to at least explore/discuss this possibility more directly.

The addition of maternal involvement and educational attainment into the multivariate equation significantly strengthens the estimated models. Given the estimated indirect effects, however, it is not currently clear how these measures were “controlled.” In other words, did the authors regress the examined outcome (child educational attainment), mediators (father involvement and the PGS), and primary IV (father educational attainment) on the examined controls or just a subset of these measures?

The extent to which the examined PGS mediated the association between parental and offspring educational attainment was extremely interesting, in my opinion. The authors briefly discuss these findings on pages 22-23, but I think some additional expansion would be useful. More specifically, there has been much discussion surrounding the utility of PGSs as of late with many critics (perhaps, rightfully) challenging the notion of genetic influences as a source of causality. I’m not suggesting the authors tread into these choppy waters, but I do think that framing a PGS as a potential mechanism rather than a source of causal influence may be beneficial given their findings. We are still trying to figure out exactly what the variance explained by a PGS is and how to best leverage these measures. I believe the authors’ findings may provide some additional and useful insight in that perhaps we are better suited examining PGSs as a source of intergenerational transmission (when appropriate) rather than a source of more general causality. I don’t think the authors need to go too far down this rabbit hole, but some additional expansion here would be beneficial for future research in this area and also highlights an additional contribution of the current study to the extant literature.

Reviewer #4: This is an overall well-written paper exploring the mechanisms explaining the intergenerational transmission of educational attainment focusing on paternal involvement. While we know little about how intergenerational transmission works, and know little about maternal influences on educational outcomes, we know even less about paternal involvement in explaining educational outcomes in offspring. The paper adds to this major research gap. The manuscript is methodologically sound and suited for publication in PlosOne. The topic is very important and of interest to researchers from varied disciplines as well as for policymakers and will hopefully spark more research into intergenerational transmission of educational attainment using genetically sensitive designs. I have some minor concerns for the authors to consider.

• “To obtain a more complete understanding, the current study integrates insights from the fields of behavioral sciences and genetics and examines the extent to which factors from each field are unique underlying mechanisms, correlate with each other, and/or act as important confounders in the intergenerational transmission of educational attainment.” � this to me seemed like the study was looking into several mechanisms, that would be parental, grandparental, sibling, and societal effects on offspring/sibling outcomes, etc. I suggest rephrasing the sentence in the abstract to reflect more precisely what the study was looking into. Paternal effects are grossly understudied, so it is a very valuable study on its own.

• I suggest adding effect sizes to the abstract. That is, before talking about mediation analyses, state the direct effect, what is the effect size of the correlations between behavioral and genetic influences, etc.

• Nicely written introduction. I suggest including that SES itself is partly explained by genetic factors, while often assumed to be environmental. It would also be helpful if effect sizes are including in the introduction, for example, the magnitude of correlations between paternal teaching-related activities and offspring educational attainment, etc. (or for example, Children with genes that are positively related to higher educational attainment are more open, agreeable, conscientious, and show more academic motivation, which all result in better educational achievements – what are the effect sizes here?).

• A nice addition would have been to include data about parental genotypes- perhaps this could be discussed in the paper?

• Could you please unpack this? “PGSs cannot distinguish between “direct” genetic effects-associations between genes and education through intelligence and motivation” - What is the direct genetic effect? The genetic variants are not coding for educational outcomes, not even through intelligence and/or motivation. Same here: “and indirect genetic effects -associations between genes and education due to the family environment and parenting practices” Do the authors mean genetic factors explaining family environment and parenting?

• I suggest discussing the representativeness of the sample. Is the data missing at random? Does the genotyped sample reduce the representativeness? How about information available about paternal involvement? Some sensitivity analysis would be useful.

• The methodology is sound, however, I suggest talking about effect sizes rather than significance, e.g., “The effect of the father’s school-specific involvement is significantly reduced from 0.056 to 0.050 when including the child’s education PGS, which implies 10.7% genetic confounding, while the effect of the father’s leisure involvement is not significantly reduced when including children’s education PGS.’ � this is an interesting but a small effect, it is significant because of the large sample size. It is important to note this.

• Was the analysis plan preregistered? How was multiple testing controlled for?

• “Comparing the between-families and within-families associations between father involvement and children’s education PGS provides insights into the extent to which the rGE is active or passive.” � I do not think you can distinguish between active and evocative rGE?

• I suggest not using the term “borderline significant’ it is either significant or not significant. It is especially questionable to interpret the results as showing effect or ‘hinting’ to an effect.

• It is commendable that several robustness checks were done. I suggest doing some checks about missing data and representativeness as well.

• I suggest adding precise figure legends, what is presented, what are the error terms in parentheses (e.g. figure 5)

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Reviewer #4: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Marie-Pierre Dubé

23 Nov 2022

The intergenerational transmission of educational attainment: A closer look at the (interrelated) roles of paternal involvement and genetic inheritance

PONE-D-22-08657R1

Dear Dr. Verweij,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Marie-Pierre Dubé, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: No

Reviewer #4: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have taken great care in responding to my comments, which I appreciate. I'm satisfied with their revisions.

Reviewer #2: The authors have provided reasonable answers to my concerns in the first round. I have no further requests, and recommend the publication of the study!

Reviewer #3: The authors have adequately addressed all of my concerns. I see no reason the study cannot be published in its current form.

Reviewer #4: Thank you for addressing all the reviewer comments so carefully. I am generally very happy with the revision, although I do not agree with your decision about not correcting for multiple testing, especially as the analyses plan was not preregistered. At the very least, I suggest adding the justification for the decision to the manuscript.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jasmin Wertz

Reviewer #2: No

Reviewer #3: No

Reviewer #4: No

**********

Acceptance letter

Marie-Pierre Dubé

1 Dec 2022

PONE-D-22-08657R1

The intergenerational transmission of educational attainment: A closer look at the (interrelated) roles of paternal involvement and genetic inheritance

Dear Dr. Verweij:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Marie-Pierre Dubé

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (DOCX)

    Attachment

    Submitted filename: Response to reviewers.pdf

    Data Availability Statement

    Data cannot be shared publicly because of the extensive restricted-use data. Information on how to obtain the Add Health data files is available on the Add Health website (http://www.cpc.unc.edu/addhealth). The data underlying the results presented in the study are available from CPC Data Portal (https://data.cpc.unc.edu/projects/2/view). Restricted-use data will be distributed only to certified researchers who commit themselves to maintaining limited access. The authors had no special access privileges, and other researchers will be able to access the data in the same manner.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES