Abstract
Perinatal traits are influenced by genetic variants from both fetal and maternal genomes. Genome-wide association studies (GWAS) of these phenotypes have typically involved separate fetal and maternal scans, however, this approach may be inefficient as it does not utilize the information shared across the individual GWAS. In this manuscript we investigate the performance of three strategies to detect loci in maternal and fetal GWAS of the same trait: (i) the traditional strategy of analysing maternal and fetal GWAS separately; (ii) a novel two degree of freedom test which combines information from maternal and fetal GWAS; and (iii) a novel one degree of freedom test where signals from maternal and fetal GWAS are meta-analysed together conditional on the estimated sample overlap. We demonstrate through a combination of analytical formulae and data simulation that the optimal strategy depends on the extent of sample overlap/relatedness between the maternal and fetal GWAS, the correlation between own and offspring phenotypes, whether loci jointly exhibit fetal and maternal effects, and if so, whether these effects are directionally concordant. We apply our methods to summary results statistics from a recent GWAS meta-analysis of birth weight from deCODE, the UK Biobank and the Early Growth Genetics (EGG) consortium. Both the two degree of freedom (213 loci) and meta-analytic approach (226 loci) dramatically increase the number of robustly associated genetic loci for birth weight relative to separately analysing the scans (183 loci). Our best strategy identifies an additional 62 novel loci compared to the most recent published meta-analysis of birth weight and implicates both known and new biological pathways in the aetiology of the trait. We implement our methods in the online DINGO (Direct and INdirect effects analysis of Genetic lOci) software package, which allows users to perform one and/or two degree of freedom tests easily and computationally efficiently across the genome. We conclude that whilst the novel two degree of freedom test may be particularly useful for the analysis of certain perinatal phenotypes where many loci exhibit discordant maternal and fetal genetic effects, for most phenotypes, a simple meta-analytic strategy is likely to perform best, particularly in situations where maternal and fetal GWAS only partially overlap.
Introduction
Perinatal traits like birth weight are influenced by genetic variants from both offspring and maternal genomes 1–6. Historically, genome-wide associations studies (GWAS) of these traits have involved separate analyses of offspring and maternal genomes i.e. a “fetal GWAS” where an individual’s own phenotype is regressed on their own genotype, and a separate “maternal GWAS” where offspring phenotype is regressed on maternal genotype. Genetic variants showing genome-wide significant associations in either scan are then followed up using conditional association analyses (or transmitted and non-transmitted haplotype analyses) to investigate whether their effects are likely mediated through the fetal genome (i.e. direct effects of an individual’s genome on their own trait), the maternal genome (i.e. indirect effects of the mother’s genome mediated through e.g. the intrauterine environment on an individual’s trait) or some combination of both 6,7. Using this strategy, large-scale GWAS meta-analyses have identified over 280 variants robustly associated with a range of perinatal traits through both the fetal and/or maternal genomes 1–6,8–10.
Whilst this strategy of conducting separate fetal and maternal GWAS has been successful in terms of locus identification, it is not optimal statistically, because it does not utilize information shared across the individual GWAS. Since maternal and offspring genotypes are correlated, a GWAS of an individual’s own phenotype also provides information on indirect maternal genetic effects (i.e. in addition to information on direct fetal genetic effects). Likewise, GWASs of offspring phenotype also provide information on direct fetal genetic effects (i.e. in addition to indirect maternal genetic effects). Performing separate fetal and maternal GWAS therefore does not take full advantage of the information contained across both GWAS. This issue is most pronounced when the fetal and maternal GWAS are only partially overlapping (and hence contain independent information)- which has certainly been the case in the past for the GWAS meta-analysis of many phenotypes within the Early Growth Genetics (EGG) consortium including birth weight 1–6. However, it may not be clear how best to combine information from maternal and fetal GWAS, particularly when the degree of sample overlap/relatedness is unknown. In addition, when conducting two or more GWAS (e.g. GWAS of one’s own trait and then GWAS of the same trait in one’s offspring), investigators should ideally increase the statistical penalty due multiple testing- although this is not often done in practice and may not be easy to do optimally given unknown sample overlap and correlation between the traits. Data simulations and asymptotic power calculations we have performed previously using a computationally intensive structural equation model (i.e. that is not suitable for GWAS) have hinted at the gains in power that might be achieved by modelling and testing indirect and direct effects simultaneously in one analysis7,11.
In this manuscript we examine the performance of a computationally simple two degree of freedom test that uses GWAS summary results statistics from fetal and maternal GWAS. Our method uses LD score regression 12 to estimate an effective sample overlap across fetal and maternal GWAS, and then utilizes this information to estimate the sampling variance and covariance of the conditional direct fetal and indirect maternal genetic effect estimates. In addition, by essentially performing a single GWAS, the method avoids the thorny issue of how best to adjust for multiple testing across two correlated GWAS that contain an unknown proportion of overlapping/related individuals. We also compare the two degree of freedom test to a simple one degree of freedom test where signals from maternal and fetal GWAS are meta-analysed together conditional on the estimated sample overlap, as well as to the traditional strategy of separately analysing maternal and fetal GWAS. We investigate the power of the different approaches through a combination of analytical formulae and data simulation and provide users a web tool to conduct their own asymptotic power calculations. We apply the different methods to summary statistics from the most recent GWAS of birth weight involving the deCODE Study, EGG Consortium and UK Biobank5 and illustrate how combining the data dramatically increases the number of significantly associated loci for these traits relative to analysing the data separately or via the commonly used MTAG program13. We show how our methods can be extended to investigate the genetic aetiology of other phenotypes like educational attainment and IQ that are putatively influenced by indirect maternal and indirect paternal genetic effects as well as the direct effect from individuals’ own genomes14–16. Finally, we implement our methods in the DINGO (Direct and INdirect effects analysis of Genetic lOci) software package, part of the online Complex Trait Genetics Virtual Laboratory (CTG-VL)(https://vl.genoma.io/)17, which allows users to perform these tests easily and computationally efficiently across the genome using summary results GWAS data.
Method
Formulation of the Two Degree of Freedom Test
Simply regressing own phenotype or offspring phenotype on own genotype will lead to inconsistent and biased estimates of association in the presence of indirect maternal and direct fetal genetic effects at a locus (see Supplementary Note). We and others have shown previously how consistent and unbiased estimates of indirect maternal and direct fetal genetic effects at individual variants can be obtained by combining summary statistic linear regression coefficient estimates of SNP-trait associations from maternal and fetal GWAS 6,18 i.e.:
where and are estimates of the true population direct fetal and indirect maternal genetic effects , and and are linear regression coefficients from a GWAS of own phenotype on own genotype, and offspring phenotype on own (maternal) genotype respectively. The standard error of these terms can be estimated using the below formulae:
where “” and “” refer to the sampling variance and covariance of the terms respectively 6,18. Note how the standard error of both terms is a function of the sampling covariance of and . When GWAS of own and offspring traits are performed in separate unrelated families, this term can be assumed to be zero. However, this is rarely the case in practice. Wu et al. (2021)18 show how this term is a function of the sample overlap and the phenotypic correlation between overlapping mothers and children, and can be estimated using the intercept from bivariate LD score regression 12:
where is the sample size of the GWAS of own genotype and own outcome (i.e. the “fetal GWAS”), is the sample size of the GWAS of maternal genotype and offspring phenotype (i.e. the “maternal GWAS”), is the effective number of overlapping individuals across both GWAS (e.g. in situations where all mothers report their own and their offspring’s birth weight the effective sample overlap would be 100%, whereas for genotyped mother-offspring pairs where only the offspring’s phenotype is measured, the effective sample overlap would be 50%, since offspring and maternal genotypes are correlated 0.5), and is the estimated bivariate LD score regression intercept. The parameter refers to the correlation between own and offspring phenotype among the overlapping individuals.
In large samples, the hypothesis that the estimated maternal effect (or estimated fetal effect ) differs from zero can be evaluated for significance against the standard normal distribution i.e.:
In this manuscript, we refer to these tests as “conditional one degree of freedom” tests. These conditional one degree of freedom tests have primary utility in locus characterization. That is, following the identification of the loci in GWAS, it is of interest to know whether the associated variant exerts its effect “directly” through the fetal genome, indirectly through the maternal genome, or some combination of both. However, we have shown that these conditional one degree of freedom tests often lack statistical power7,11 and typically do not produce stronger p-values than performing (potentially biased) unconditional tests of SNP-trait association in mothers or children separately. In other words, whilst conditional one degree of freedom tests are useful for locus characterization, in most situations they have limited utility for locus detection.
The joint sampling distribution of the estimated conditional maternal and fetal effects is bivariate normal in large samples i.e.:
To increase the power of locus identification in GWAS of own and offspring phenotypes, we propose , a two degree of freedom test that considers the joint distribution of the two effect estimates. It follows from elementary statistical theory that under the null hypothesis of no association between variant and maternal and fetal genotype (i.e. ) the test statistic:
is asymptotically distributed as a chi-square statistic with two degrees of freedom (see the Supplementary Note for the derivation19). We verified using simulated data that conforms with our expectations under the null (Supplementary Figure 1).
This manuscript primarily focuses on perinatal traits like birth weight that are known to be influenced by variants in both the maternal and offspring genomes. However, we also derive an analogous three degree of freedom test in the Supplementary Note that could be useful for identifying loci influencing phenotypes like educational attainment that are likely to be a function of direct genetic, indirect maternal and indirect paternal effects.
Formulation of One Degree of Freedom Meta-analytic Tests
We also investigated the power of a simple procedure for meta-analysing the results of (potentially overlapping) maternal and fetal GWAS. We illustrate this procedure in the case of fetal genetic effects noting that an analogous test for maternal genetic effects can also be derived in a complementary fashion. We first assume that maternal genetic effects are not present at the locus. Since maternal and fetal genotypes are correlated 0.5, we are able to obtain estimates of the fetal genetic effect from the maternal GWAS by simply doubling the estimated maternal effect regression coefficient (i.e. ), where is the coefficient of the regression of offspring phenotype on maternal genotype. Likewise, the sampling variance of this estimate will be . This estimate is then combined with estimates of the fetal genetic effect from the fetal GWAS by inverse variance weighted meta-analysis. The variance of this combined estimate is obtained by the usual inverse weighted variance formula plus an additional term due to twice the weighted covariance between the estimates:
where is the coefficient from the regression of own phenotype on own genotype and is the coefficient from the regression of offspring phenotype on maternal genotype, and the inverse variance weighted estimate of the fetal effect across both these scans. The effective sample overlap between the scans is estimated using bivariate LD score regression and the covariance between the regression coefficients estimated from this quantity:
In large samples, under the null hypothesis of no association between SNP and trait (i.e. fetal or maternally mediated), the regression coefficients over their standard errors are distributed as standard normal:
We refer to this test/procedure as the one degree of freedom meta-analytic strategy. We note that when both maternal and fetal effects are present at a locus, simply meta-analysing the data as described above will result in biased estimates of maternal and fetal genetic effects. However, since our interest is on locus discovery (i.e. regardless of whether a genetic effect is mediated by the fetal or maternal genome), we argue that this bias can be ignored, as the type I error for both tests should be well calibrated under the null (i.e. when neither maternal nor fetal effects contribute to trait variation at the locus). Indeed, if desired, unbiased estimates of maternal and fetal genetic effects can be obtained post-hoc using the two degree of freedom procedure described in the previous section.
With a couple of notable exceptions (e.g. the Norwegian MOBA cohort), the majority of the world’s birth cohorts contain only limited genotype information on the children’s fathers. Nevertheless, in some situations it may be useful to include information from a GWAS of offspring phenotype regressed on father’s genotype. This could be useful when paternal effects are suspected to contribute to offspring trait variation, or (in their absence) when fathers provide information on their offspring in the absence of offspring genotype information (i.e. in which case the fathers are providing information indirectly on offspring mediated effects). In the Supplementary Note we derive analogous formulae for meta-analyses when paternal, maternal, and fetal GWAS are available.
Power
We performed extensive data simulations to investigate the power of the three locus detection strategies described in this manuscript i.e. (i) the usual strategy of separately analysing maternal and fetal GWAS with one degree of freedom tests; ii) two degree of freedom tests applied to the maternal and fetal GWAS concurrently; and (iii) meta-analysing maternal and fetal GWAS to obtain estimates of the maternal and fetal effect using one degree of freedom tests. We simulated bivariate normally distributed quantitative traits of unit variance for own and offspring phenotype that were influenced by a single additive locus simulations). We varied locus effect size (three conditions: (i) and ; (ii) and ; (iii) and ), residual correlation between maternal and fetal phenotype (three conditions: (i) ; (ii) ; (iii) ), sample size of each GWAS (19 conditions: to 900, increment by 100, and then to 10,000, increment by 1000), relative sample size (two conditions: (i) equal sized maternal and fetal GWAS; (ii) maternal GWAS half the size of fetal GWAS) and extent of sample overlap (three conditions: (i) zero overlap; (ii) 50% of individuals in the maternal GWAS in the fetal GWAS; (iii) all of the mothers in the maternal GWAS in the fetal GWAS).
Since the locus detection strategies involving one degree of freedom tests entail performing double the number of statistical tests relative to the two degree of freedom test (i.e. performing genome-wide tests for a fetal effect, and genome-wide tests for a maternal effect), we evaluated power for the one degree of freedom tests against a genome-wide Bonferroni corrected alpha level of (for a justification of why a Bonferroni correction is reasonable, see the Supplementary Note). However, since our focus is on locus discovery, and so technically we are agnostic as to whether evidence for association comes from the maternal or fetal meta-analysis, we evaluated power according to whether either the maternal or the fetal test was significant for a given replicate. Note that for our data simulations we adopted a slightly more stringent significance level than the typical threshold employed in most GWAS because of the increased number of low frequency variants contained within modern imputation panels (see Morris et al (2019) for further justification of this threshold)20.
In the Supplementary Note we derive analytical expressions for the non-centrality parameter (and hence power) of all the statistical tests described in this manuscript and confirm that the asymptotic power implied by the expressions matches the results of our simulations closely. We encode several of our asymptotic formulae into an online web calculator that allows users to approximate power in their own GWAS (https://evansgroup.di.uq.edu.au/DINGO).
Empirical application
We applied our strategies to birth weight GWAS summary results statistics from the most recent GWAS of birth weight6 which contains meta-analysed data from deCODE, the EGG Consortium and the UK Biobank. The GWAS of own birth weight included 423,683 individuals of European ancestry and the GWAS of offspring birth weight included 270,002 women of European ancestry as defined by the authors of the study (https://www.decode.com/summarydata/). Given that the standard errors were not provided in the GWAS summary statistics, we derived them using p-values and beta effect estimates and the qnorm function in R, i.e. standard error = abs(beta / qnorm(p-value / 2)). Among 33,409,268 single nucleotide polymorphisms (SNPs) from the GWAS of own birth weight and 33,453,671 SNPs from the GWAS of offspring birth weight, 1,276,496 and 1,220,850 SNPs were removed, respectively, due to p-value = 1 or beta = 0 because in either of these conditions standard errors cannot be derived. For the remaining SNPs, 31,033,794 were available in both GWAS and were included in the two degree of freedom test and in the meta-analyses of the estimated fetal and maternal effects. We performed clumping analyses for SNPs within a 500-kb window size using of 0.05, and , and the 1000 Genome Phase 3 EUR reference panel using the online platform FUMA21. We compared genome-wide significant loci from the two degree of freedom test against those from the published GWAS that used a one degree of freedom test (evaluated at ) and with loci identified in the meta-analyses of maternal/fetal effects (evaluated at ). For genome-wide significant SNPs, we then applied conditional one degree of freedom tests (evaluated at ) to test for whether effects were likely to be mediated through the maternal and/or fetal genomes. Finally, for lead variants at loci not reported as genome-wide significant in the most recent deCODE meta-analysis of birth weight5, we searched the GWAS catalogue and variants in strong linkage disequilibrium for reported trait associations using the FUMA software package21.
Comparison with other multivariate software
differs from other commonly used multi-trait association analyses that do not precisely model the correlation between the parents and their offspring. We therefore compared the performance of against MTAG (evaluated at ), a commonly used tool for boosting statistical power to detect loci underlying correlated traits using summary results statistics 13. We applied MTAG to the same summary results statistics of fetal and maternal birth weight as described above and compared the results against those from .
Results
Power analyses
Figure 1 and Supplementary Figures 2 – 6 compare the power to detect association (as assessed by simulation) across the three analytic strategies examined in this manuscript. When there was minimal sample overlap between maternal and fetal GWAS, then the usual strategy of analysing maternal and fetal GWAS separately was typically less powerful than strategies that combined information from both GWAS (Figure 1 bottom panel). In general, in these situations, the meta-analysis strategy was often slightly more powerful than the two degree of freedom test in detecting loci that exhibited one type of effect only (Figure 1 bottom middle panel) and when detecting loci with concordant maternal and fetal effects (Figure 1 bottom left panel).
Figure 1. Power to detect association (as evaluated using simulation) using a traditional strategy of analysing separate maternal and fetal GWAS, a strategy involving one degree of freedom meta-analyses, and a strategy of performing two degree of freedom tests across the genome.
and refer to maternal and fetal genetic effects on a standardized trait. In the top row, results are shown for replicates where there is complete sample overlap. Data were simulated assuming a high residual correlation between maternal and offspring phenotypes (). In the bottom row, results are shown for replicates where there is no sample overlap. For the traditional strategy of running separate maternal and fetal GWAS and the one degree of freedom meta-analysis strategy, we set the alpha value to , i.e. half the type I error rate of the two degree of freedom test in order to take into account that we are performing twice the number of statistical tests in the former situations. In the case of the traditional strategy of running separate maternal and fetal GWAS and the one degree of freedom meta-analysis strategy, we evaluated power with respect to whether either test met the criterion for genome-wide significance for each replicate.
In contrast, at the other extreme, when there was complete sample overlap, then the optimal strategy depended upon whether the locus exhibited both type of effects concurrently and the residual correlation between variables. For example, Figure 1 (middle panel) shows an instance of where the usual strategy of analysing the maternal and fetal GWAS separately was most powerful- in this case where loci exhibited only indirect maternal (or only direct fetal) genetic effects and in which the residual correlation was very high. In contrast, when loci exhibited concordant maternal and fetal effects of similar magnitude then both the two degree of freedom test and the meta-analysis strategy often gave superior power compared to separately analysing the scans (Figure 1 top left panel).
Interestingly, in the case of loci that exhibited discordant maternal and fetal effects (of similar magnitude), the two degree of freedom test provided increased power to detect association relative to the other strategies, particularly in overlapping samples (Figure 1 top right panel). Notably, the simple meta-analysis strategy did not perform well in these situations (see Supplementary Table 2 for further details).
In the Supplementary Note we derive asymptotic formulae for the unconditional GWAS tests, the meta-analytic strategy, and the two degree of freedom tests of association. We confirm that our formulae for asymptotic power of the two degree of freedom test derived in the Supplementary Note gave similar results to those obtained using simulated data (results not shown) and implement our calculations in a web calculator (https://evansgroup.di.uq.edu.au/DINGO) that investigators can use to investigate power in their own studies.
Empirical analyses of birth weight using the two degree of freedom test
The GWAS of birth weight using the two degree of freedom test identified 332 independent SNP associations at 213 loci with a p-value < 6.6 × 10−9 (Figures 2 and 3, Supplementary Table 3). This included 34 independent SNPs at 32 loci that did not reach genome-wide significance in either the separate GWAS of birth weight or offspring birth weight (Supplementary Table 3). These results are consistent with the preceding power calculations which suggest that performing a two degree of freedom test can be substantially more powerful than performing separate maternal and fetal GWAS when there is only partial overlap between the GWAS. Conditional analyses suggested that among 34 genome-wide significant SNPs at the 32 loci, 10 acted primarily through the fetal genome, 12 acted primarily through the maternal genome, and 12 acted through both (p-value < 0.05) (Figure 4). Interestingly, of the 12 SNPs that showed evidence for both maternal and fetal effects in the conditional analyses, all 12 SNPs exhibited effects in the same direction (Figure 4). These results were at first surprising, given that the power calculations from the previous section suggested that whilst the two degree of freedom test should increase the power to detect loci that show concordant maternal and fetal effects when there is partial sample overlap, the largest gains in power should involve the detection of SNPs that have discordant maternal and fetal effects of similar magnitude. Indeed only 22 of 332 genome-wide significant SNPs identified using the two degree of freedom test exhibited conditional maternal (p-value < 0.05) and fetal effects (p-value < 0.05) in opposite directions- and all these SNPs were also identified in the separate GWAS analyses of own or offspring birth weight. Figure 5 shows the likely reason for this observation- most birth weight associated SNPs that exhibited discordant maternal and fetal genetic effects have estimated maternal and fetal effect sizes of quite different magnitude. Indeed, although there are some SNPs that show sizeable (discordant) maternal and fetal effects, these effects were so large that they were also picked up by one degree of freedom tests in the separate GWAS of birth weight and offspring birth weight. In contrast, there were many SNPs that showed concordant maternal and fetal effects of similar magnitude and in many of these cases the two degree of freedom test was more powerful than separate GWAS (Figure 5).
Figure 2. P-value scatter plots and Venn diagram comparing results from separate GWAS of own and offspring birth weight versus the combined two degree of freedom test across 31,033,794 SNPs from the deCODE study of birth weight.
The −log10p-value from the two degree of freedom test (y-axis) was compared against the −log10p-value of SNPs from (A) the separate GWAS of own birth weight, (B) the separate GWAS of offspring birth weight, and (C) the stronger p-value from among A and B (x-axis). Red dashed lines denote genome-wide significant thresholds of for the separate GWAS of own birth weight and offspring birth weight which involve one degree of freedom tests, and for the two degree of freedom test. The blue diagonal lines indicate x = y. Orange circles are SNPs that are only genome-wide significant in the two degree of freedom test. Panel (D) illustrates the degree of overlap across genome-wide significant loci identified from the two strategies.
Figure 3. Manhattan plots showing results of using the typical strategy of conducting separate GWAS of birth weight and offspring birth weight (top panel- best p-value at each locus shown), a simple meta-analytic strategy (middle panel- best p-value at each locus shown), and the two degree of freedom test (bottom panel).
The −log10p-value for the SNP association is plotted on the y-axis. Red dashed lines indicate genome-wide significant thresholds of p-value = 3.3 × 10−9 for one degree of freedom tests, and p-value = 6.6 × 10−9 for the two degree of freedom tests. Orange points represents SNPs that are genome-wide significant in the separate GWAS of birth weight/offspring birth weight, but not in the two degree of freedom test. Purple points represent SNPs that are genome-wide significant in the meta-analysis of fetal or maternal effects, but not in the separate GWAS of birth weight/offspring birth weight. Blue points represent SNPs that are genome-wide significant in the two degree of freedom test, but not in the separate GWAS of birth weight/offspring birth weight.
Figure 4. Estimated conditional fetal and maternal effects and 95% confidence intervals for the 34 independent genome-wide significant SNPs identified in the two degree of freedom analysis (p-value < 6.6 × 10−9) that did not meet genome-wide significance in the separate GWAS of birth weight and offspring birth weight (p-value < 3.3 × 10−9).
Effects are oriented to the allele that is associated with an increased fetal effect on birth weight. The SNPs are labelled according to the physically closest gene. The upper 95% confidence interval for rs55914222 has been truncated.
Figure 5. Estimates of conditional fetal and maternal effects for 332 genome-wide significant SNPs identified in the two degree of freedom GWAS.
76 SNPs* that reached genome-wide significance in the two degree of freedom test (p-value < 6.6 × 10−9), but not in either the GWAS of birth weight or offspring birth weight (p-value < 6.6 × 10−99) are displayed in colour, whilst the other 256 SNPs are shown in grey. Effect sizes and p-values represent conditional one degree of freedom fetal and maternal tests calculated from the GWAS of birth weight and offspring birth weight using DINGO. The colour of each point represents the p-value for the conditional maternal effect and the shape of each point the p-value for the conditional fetal effect. SNPs with large conditional effects are labelled with the name of the closest gene. Note in particular the paucity of SNPs that show discordant maternal and fetal effects of similar magnitude.
* It is possible that other SNPs at the same loci as these SNPs reached genome-wide significance in the GWAS of birth weight or offspring birth weight
There were five (one) SNPs that reached genome-wide significance in the GWAS of own birth weight (offspring birth weight) (p-value < 3.3 × 10−9) but did not reach significance in the two degree of freedom GWAS (p-value < 6.6 × 10−9), however, only two of these (both from the GWAS of own birth weight) involved loci not identified by the two degree of freedom test. All these SNPs exhibited either predominantly fetal or maternal effects only (Supplementary Table 4).
Finally, the two degree of freedom test outperformed the one degree of freedom tests implemented in the popular multivariate GWAS package MTAG (Supplementary Figure 7), which only identified 23 novel independent significant SNP associations with p-value < 3.3 × 10−9, all of which were identified in both the 2 degree of freedom test and the one degree of freedom meta-analyses described in the next section (Supplementary Table 5). In conditional analyses, all 23 of these SNPs had estimated maternal and fetal effects in the same direction.
Empirical analyses of birth weight using one degree of freedom meta-analyses
The one degree of freedom meta-analytic strategy identified 369 independent SNPs at 226 loci that were genome-wide significant (p-value < 3.3 × 10−9) in either the meta-analysis of fetal effects, or the meta-analysis of maternal effects – an even greater number of loci than were identified using the two degree of freedom strategy (Supplementary Table 6). These included 21 independent genome-wide significant SNPs at 21 loci that were not identified in the two degree of freedom tests (Figure 6, Supplementary Table 6), although all only narrowly missed out on significance in the two degree of freedom test, having a p-value smaller than 5 × 10−8. Interestingly, eight independent SNPs (at eight loci) were significant in the test but not in the meta-analysis of fetal effects, or the meta-analysis of maternal effects (Figure 6, Supplementary Table 3, Supplementary Table 6). Four of these SNPs exhibited significant (p-value <0.05) discordant maternal and fetal effects in the conditional analyses– most notably rs7034200 at GLIS3 , a known type 2 diabetes locus, that was highly significant in the two degree of freedom test (p-value2df = 1.15 × 10−12), but only exhibited relatively modest non-significant p-values in the maternal and fetal meta-analyses (minimum pmeta = 7.39 × 10−5), and likewise rs560887 at G6PC2, a known locus for fasting glucose which also exhibited radically different p-values across the two tests (p-value2df = 4.37 × 10−20, minimum pmeta = 6.25 × 10−7).
Figure 6. Quantile-Quantile plots and Venn diagram comparing one degree of freedom meta-analysis tests versus the combined two degree of freedom DINGO test for birth weight across 31,033,794 SNPs.
The −log10p-value of the two degree of freedom DINGO test calculated using the GWAS of fetal birth weight and the GWAS of maternal birth weight was compared against the −log10p-value of SNPs from (A) the GWAS meta-analysis of fetal birth weight, (B) the GWAS meta-analysis of maternal birth weight, and (C) the stronger p-value from among A and B. Red dashed lines denote to genome-wide significant thresholds of for the one degree of freedom maternal and fetal tests and for the two degree of freedom DINGO test. Blue diagonal lines indicate x = y. Orange circles are SNPs that are only genome-wide significant in the two degree of freedom test. There are a considerable number of SNPs that reach genome-wide significance in the GWAS meta-analysis but in the not the two degree of freedom test. These SNPs are all clustered in a very small region of Figures 6a, 6b and 6c in the bottom right quadrant adjacent to the thresholds for genome-wide significance. (D) illustrates the overlap between genome-wide significant loci identified from one and two degree of freedom tests.
We compared the results of our one degree of freedom meta-analyses against the list of variants claimed as genome-wide significant in the most recent GWAS analysis of birth weight5. Our analyses identified an additional 68 SNPs at 62 novel loci compared to the deCODE paper (Table 1). These included many SNPs previously associated with anthropometric traits (height, body mass index etc), blood pressure, bone mineral density and type 2 diabetes at genome-wide significant levels (Supplementary Table 7). These are all phenotypes that are known to be causally or pleiotropically associated with birth weight, increasing our confidence that our new findings represent genuine associations. Finally, we also found several new associations between SNPs and birth weight that could implicate new biological pathways in the aetiology of the trait. These include an interesting association between rs11085720, an intronic variant in the DNA methylase transferase 1 (DNMT1) gene, and birth weight.
Table 1. Novel genetic variants for birth weight identified in the one degree of freedom meta-analyses.
We identified 68 novel independent genome-wide significant associations (p-value < 3.3×10−9) at 62 genetic loci that were at least 500kb apart from those previously reported in the latest deCODE analysis of birth weight5. Genome-wide significance in the deCODE study was determined using class-based significance thresholds i.e. p-value < 2.49 × 10−7 for high impact variants; p-value < 4.97 × 10−8 for moderate impact variants; p-value < 4.52 × 10−9 for low-impact variants, p-value < 2.26 × 10−9 for other low-impact variants in DNase I hypersensitivity sites and p-value < 7.53 ×10−10 for all other variants.
| Variant | Genomic locus | CHR:BP | Nearest Gene | EA/NEA | GWAS own birth weight | GWAS offspring birth weight | 1 df meta-analysis | Conditional fetal effect | Conditional maternal effect | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Beta | SE | P-value | Beta | SE | P-value | PFetal | PMaternal | Beta | SE | P-value | Beta | SE | P-value | |||||
| rs2235566 | 1 | 1:6594993 | KLHL21 | A:G | 0.012 | 0.003 | 9.70E−06 | 0.015 | 0.003 | 6.50E−07 | 1.07E−08 | 1.27E−09 | 0.006 | 0.004 | 1.17E−01 | 0.012 | 0.004 | 3.54E−03 |
| rs7542242 | 2 | 1:22151000 | WNT4 | T:C | −0.014 | 0.003 | 3.20E−07 | −0.013 | 0.003 | 1.30E−05 | 1.19E−09 | 4.50E−09 | −0.010 | 0.004 | 9.33E−03 | −0.008 | 0.004 | 4.99E−02 |
| rs2807319 | 2 | 1:22227683 | MIR4418 | A:G | −0.023 | 0.004 | 8.00E−09 | −0.012 | 0.005 | 1.20E−02 | 2.86E−09 | 1.83E−06 | −0.023 | 0.006 | 7.18E−05 | −0.001 | 0.006 | 9.18E−01 |
| rs10889792 | 3 | 1:41025213 | SLFNL1 | T:G | 0.013 | 0.003 | 2.70E−07 | 0.011 | 0.003 | 7.50E−05 | 2.71E−09 | 2.72E−08 | 0.010 | 0.004 | 4.92E−03 | 0.006 | 0.004 | 1.14E−01 |
| rs787488 | 4 | 1:67622756 | RN7SL392P | G:A | 0.010 | 0.003 | 2.30E−04 | 0.018 | 0.003 | 1.40E−08 | 1.16E−07 | 2.94E−10 | 0.001 | 0.004 | 7.30E−01 | 0.017 | 0.004 | 5.69E−05 |
| rs6604061 | 5 | 1:91863056 | TGFBR3 | T:A | 0.014 | 0.003 | 6.60E−08 | 0.013 | 0.003 | 3.40E−06 | 1.16E−10 | 4.90E−10 | 0.010 | 0.004 | 5.93E−03 | 0.008 | 0.004 | 3.69E−02 |
| rs823118 | 6 | 1:205754444 | NUCKS1 | T:C | −0.013 | 0.002 | 1.80E−07 | −0.012 | 0.003 | 3.40E−05 | 1.32E−09 | 7.07E−09 | −0.009 | 0.004 | 8.44E−03 | −0.007 | 0.004 | 6.22E−02 |
| rs708119 | 7 | 1:228015567 | WNT3A | G:C | −0.016 | 0.003 | 4.10E−09 | −0.011 | 0.003 | 3.00E−04 | 1.16E−10 | 1.83E−08 | −0.014 | 0.004 | 2.67E−04 | −0.004 | 0.004 | 3.35E−01 |
| rs6735330 | 8 | 2:61504343 | XPO1 | A:G | −0.014 | 0.004 | 7.20E−05 | −0.021 | 0.004 | 9.90E−08 | 4.74E−08 | 7.57E−10 | −0.005 | 0.005 | 3.48E−01 | −0.019 | 0.005 | 5.16E−04 |
| rs10195405 | 9 | 2:111999014 | MERTK | T:A | 0.012 | 0.003 | 1.00E−05 | 0.015 | 0.003 | 1.60E−06 | 1.90E−08 | 2.91E−09 | 0.006 | 0.004 | 1.20E−01 | 0.012 | 0.004 | 4.75E−03 |
| rs1523918 | 10 | 2:224519711 | CUL3 | G:A | −0.013 | 0.002 | 1.00E−07 | −0.013 | 0.003 | 8.20E−06 | 3.75E−10 | 9.68E−10 | −0.009 | 0.003 | 1.31E−02 | −0.009 | 0.004 | 2.81E−02 |
| rs11720622 | 11 | 3:48205189 | MIR4443 | A:G | 0.012 | 0.003 | 3.00E−06 | 0.016 | 0.003 | 1.50E−07 | 2.13E−09 | 1.25E−10 | 0.005 | 0.004 | 1.46E−01 | 0.013 | 0.004 | 1.24E−03 |
| rs4681725 | 12 | 3:56658293 | FAM208A | G:T | −0.013 | 0.003 | 2.40E−07 | −0.011 | 0.003 | 9.20E−05 | 2.79E−09 | 2.98E−08 | −0.010 | 0.004 | 4.87E−03 | −0.006 | 0.004 | 1.18E−01 |
| rs9877935 | 13 | 3:66986060 | KBTBD8 | T:C | 0.017 | 0.003 | 7.80E−09 | 0.015 | 0.003 | 1.20E−05 | 3.33E−11 | 4.44E−10 | 0.013 | 0.004 | 2.50E−03 | 0.009 | 0.005 | 6.25E−02 |
| rs6780776 | 14 | 3:71505034 | FOXP1 | T:C | 0.009 | 0.002 | 1.30E−04 | 0.015 | 0.003 | 1.50E−07 | 1.68E−07 | 1.61E−09 | 0.002 | 0.003 | 5.54E−01 | 0.014 | 0.004 | 2.88E−04 |
| rs55914222 | 15 | 3:128484100 | GATA2 | C:G | −0.026 | 0.007 | 4.80E−04 | −0.048 | 0.008 | 1.20E−08 | 2.23E−07 | 4.45E−10 | −0.003 | 0.011 | 8.00E−01 | −0.047 | 0.011 | 4.74E−05 |
| rs2659689 | 15 | 3:128484163 | GATA2 | G:T | 0.011 | 0.002 | 4.10E−06 | 0.017 | 0.003 | 1.20E−08 | 1.48E−09 | 1.28E−11 | 0.003 | 0.003 | 3.34E−01 | 0.015 | 0.004 | 1.38E−04 |
| rs73236111 | 16 | 4:55152901 | KDR | C:A | −0.012 | 0.003 | 1.20E−05 | −0.016 | 0.003 | 2.10E−07 | 8.76E−09 | 4.70E−10 | −0.005 | 0.004 | 1.69E−01 | −0.013 | 0.004 | 1.51E−03 |
| rs3806788 | 17 | 4:88823674 | FAM13A | G:A | −0.013 | 0.003 | 5.70E−07 | −0.016 | 0.003 | 1.20E−07 | 2.89E−10 | 3.74E−11 | −0.007 | 0.004 | 7.14E−02 | −0.013 | 0.004 | 2.03E−03 |
| rs7664440 | 18 | 4:119657385 | PDE5A | G:A | 0.007 | 0.003 | 6.20E−03 | 0.018 | 0.003 | 6.60E−09 | 7.54E−06 | 2.80E−09 | −0.003 | 0.004 | 4.68E−01 | 0.019 | 0.004 | 4.07E−06 |
| rs10029651 | 19 | 4:155510873 | LOC105377505 | C:T | −0.011 | 0.003 | 5.00E−05 | −0.016 | 0.003 | 5.00E−08 | 1.99E−08 | 3.22E−10 | −0.004 | 0.004 | 2.93E−01 | −0.014 | 0.004 | 4.96E−04 |
| rs12643599 | 19 | 4:155718694 | GUCY1A3 | G:A | 0.011 | 0.002 | 6.90E−06 | 0.014 | 0.003 | 2.00E−06 | 1.67E−08 | 2.59E−09 | 0.005 | 0.004 | 1.28E−01 | 0.011 | 0.004 | 4.46E−03 |
| rs10941242 | 20 | 5:35306293 | PRLR | A:C | 0.010 | 0.003 | 2.50E−04 | 0.018 | 0.003 | 9.00E−08 | 3.29E−07 | 1.67E−09 | 0.001 | 0.004 | 7.35E−01 | 0.017 | 0.005 | 1.37E−04 |
| rs10942862 | 21 | 5:78550717 | LHFPL2 | T:C | 0.014 | 0.002 | 1.80E−08 | 0.012 | 0.003 | 5.50E−05 | 1.86E−10 | 3.22E−09 | 0.011 | 0.004 | 2.73E−03 | 0.007 | 0.004 | 9.79E−02 |
| rs860457 | 22 | 5:143308758 | NR3C1 | C:T | 0.014 | 0.003 | 7.80E−08 | 0.012 | 0.003 | 4.30E−05 | 6.10E−10 | 7.06E−09 | 0.011 | 0.004 | 3.78E−03 | 0.007 | 0.004 | 9.54E−02 |
| rs148833559 | 23 | 5:173328063 | STC2 | A:C | 0.210 | 0.038 | 3.40E−08 | 0.185 | 0.041 | 6.70E−06 | 8.63E−11 | 7.38E−10 | 0.157 | 0.053 | 3.32E−03 | 0.107 | 0.056 | 5.80E−02 |
| rs10039241 | 24 | 5:177151071 | NSD1 | A:G | 0.019 | 0.004 | 6.40E−08 | 0.015 | 0.004 | 2.40E−04 | 1.43E−09 | 3.87E−08 | 0.015 | 0.005 | 2.16E−03 | 0.007 | 0.006 | 1.86E−01 |
| rs836548 | 25 | 7:6399817 | RAC1 | T:A | 0.011 | 0.003 | 4.40E−05 | 0.016 | 0.003 | 4.70E−07 | 6.51E−08 | 2.21E−09 | 0.004 | 0.004 | 2.98E−01 | 0.014 | 0.004 | 1.15E−03 |
| rs6959903 | 26 | 7:85010596 | SEMA3D | C:T | −0.014 | 0.003 | 5.10E−08 | −0.012 | 0.003 | 1.40E−04 | 9.05E−10 | 1.38E−08 | −0.011 | 0.004 | 3.93E−03 | −0.007 | 0.004 | 1.17E−01 |
| rs73187803 | 27 | 7:102098423 | CUX1 | C:G | 0.014 | 0.003 | 5.60E−07 | 0.015 | 0.003 | 5.10E−06 | 1.66E−09 | 1.63E−09 | 0.009 | 0.004 | 2.98E−02 | 0.011 | 0.004 | 1.68E−02 |
| rs7015547 | 28 | 8:11821159 | FDFT1 | C:A | −0.011 | 0.003 | 1.60E−05 | −0.014 | 0.003 | 9.00E−07 | 2.18E−08 | 2.35E−09 | −0.005 | 0.004 | 1.38E−01 | −0.011 | 0.004 | 3.55E−03 |
| rs2285290 | 29 | 8:17346307 | MTMR7 | A:G | 0.020 | 0.003 | 6.00E−09 | 0.011 | 0.004 | 7.10E−03 | 1.49E−09 | 8.33E−07 | 0.019 | 0.005 | 8.35E−05 | 0.001 | 0.006 | 8.10E−01 |
| rs17243549 | 30 | 8:26164559 | LOC105379336 | A:G | 0.013 | 0.003 | 1.60E−06 | 0.017 | 0.003 | 2.00E−07 | 1.32E−09 | 1.06E−10 | 0.006 | 0.004 | 1.23E−01 | 0.014 | 0.004 | 1.55E−03 |
| rs17217757 | 31 | 8:105601093 | ZFPM2 | C:G | 0.017 | 0.003 | 8.80E−09 | 0.020 | 0.003 | 5.90E−09 | 8.61E−13 | 1.46E−13 | 0.009 | 0.004 | 2.64E−02 | 0.015 | 0.005 | 1.02E−03 |
| rs16873442 | 31 | 8:105612802 | ZFPM2 | C:T | −0.019 | 0.005 | 6.00E−05 | −0.027 | 0.005 | 5.40E−07 | 8.27E−08 | 3.24E−09 | −0.007 | 0.007 | 2.74E−01 | −0.023 | 0.007 | 1.46E−03 |
| rs4871404 | 32 | 8:123649734 | KLHL38 | T:C | 0.014 | 0.003 | 5.90E−06 | 0.019 | 0.004 | 2.00E−07 | 4.98E−09 | 2.58E−10 | 0.006 | 0.004 | 1.74E−01 | 0.016 | 0.005 | 1.24E−03 |
| rs768056 | 33 | 9:91504004 | ROR2 | T:C | 0.017 | 0.003 | 6.70E−08 | 0.019 | 0.004 | 4.70E−07 | 6.32E−11 | 3.75E−11 | 0.010 | 0.005 | 2.65E−02 | 0.014 | 0.005 | 6.11E−03 |
| rs6485660 | 34 | 11:46050176 | PHF21A | T:C | −0.017 | 0.004 | 7.20E−05 | −0.025 | 0.005 | 4.50E−07 | 1.02E−07 | 3.03E−09 | −0.006 | 0.006 | 3.24E−01 | −0.022 | 0.007 | 1.08E−03 |
| rs10437653 | 34 | 11:46276080 | CREB3L1 | C:A | −0.010 | 0.002 | 6.00E−05 | −0.016 | 0.003 | 1.20E−08 | 1.78E−08 | 9.39E−11 | −0.003 | 0.004 | 4.49E−01 | −0.015 | 0.004 | 1.26E−04 |
| rs12802960 | 35 | 11:58407302 | OR5B3 | C:T | 0.017 | 0.003 | 1.50E−08 | 0.012 | 0.003 | 3.90E−04 | 4.69E−10 | 4.24E−08 | 0.015 | 0.004 | 5.50E−04 | 0.005 | 0.005 | 3.11E−01 |
| rs7941126 | 36 | 11:65597158 | KCNK7 | G:A | −0.015 | 0.003 | 5.10E−09 | −0.012 | 0.003 | 4.50E−05 | 4.52E−11 | 1.79E−09 | −0.012 | 0.004 | 9.80E−04 | −0.006 | 0.004 | 1.34E−01 |
| rs147188702 | 37 | 11:68393706 | LRP5 | A:G | −0.017 | 0.003 | 2.60E−07 | −0.015 | 0.004 | 5.10E−05 | 2.19E−09 | 1.58E−08 | −0.013 | 0.005 | 6.59E−03 | −0.009 | 0.005 | 8.60E−02 |
| rs14133 | 38 | 11:111911973 | CRYAB | C:G | −0.016 | 0.003 | 1.00E−08 | −0.012 | 0.003 | 7.30E−05 | 1.07E−10 | 6.12E−09 | −0.013 | 0.004 | 6.66E−04 | −0.005 | 0.004 | 1.98E−01 |
| rs11171733 | 39 | 12:56062482 | ERBB3 | A:G | −0.011 | 0.002 | 6.10E−06 | −0.014 | 0.003 | 2.20E−06 | 1.59E−08 | 2.57E−09 | −0.005 | 0.003 | 1.27E−01 | −0.011 | 0.004 | 4.58E−03 |
| rs4764861 | 40 | 12:101979031 | DRAM1 | C:T | −0.013 | 0.003 | 5.80E−07 | −0.013 | 0.003 | 2.40E−06 | 8.32E−10 | 1.13E−09 | −0.009 | 0.004 | 1.71E−02 | −0.009 | 0.004 | 2.20E−02 |
| rs1541597 | 41 | 12:109607113 | MVK | T:G | 0.010 | 0.003 | 1.50E−03 | 0.020 | 0.003 | 9.90E−09 | 7.67E−07 | 9.20E−10 | 0.000 | 0.004 | 1.00E+00 | 0.020 | 0.005 | 2.69E−05 |
| rs11067734 | 41 | 12:109658764 | 7SK | T:C | −0.011 | 0.003 | 1.40E−05 | −0.016 | 0.003 | 1.70E−08 | 3.64E−09 | 4.75E−11 | −0.004 | 0.004 | 2.63E−01 | −0.014 | 0.004 | 2.96E−04 |
| rs117053076 | 42 | 12:121277142 | CAMKK2 | A:G | −0.066 | 0.012 | 5.90E−08 | −0.059 | 0.014 | 3.90E−05 | 4.82E−10 | 4.33E−09 | −0.049 | 0.017 | 5.07E−03 | −0.035 | 0.019 | 7.47E−02 |
| rs172179 | 43 | 13:29603200 | SLC7A1 | G:C | 0.009 | 0.002 | 2.40E−04 | 0.015 | 0.003 | 7.40E−08 | 1.91E−07 | 1.35E−09 | 0.002 | 0.003 | 5.64E−01 | 0.014 | 0.004 | 2.25E−04 |
| rs9584547 | 44 | 13:97319743 | MBNL2 | G:A | 0.015 | 0.003 | 1.70E−08 | 0.009 | 0.003 | 2.40E−03 | 1.80E−09 | 4.60E−07 | 0.014 | 0.004 | 1.90E−04 | 0.002 | 0.004 | 6.21E−01 |
| rs2260766 | 45 | 13:113532485 | TMCO3 | G:A | −0.014 | 0.003 | 2.00E−07 | −0.012 | 0.003 | 1.40E−04 | 3.20E−09 | 3.50E−08 | −0.011 | 0.004 | 5.42E−03 | −0.007 | 0.004 | 1.19E−01 |
| rs2319627 | 46 | 14:21097030 | ZNF219 | C:T | −0.016 | 0.003 | 1.70E−06 | −0.020 | 0.004 | 5.20E−08 | 4.94E−10 | 3.75E−11 | −0.008 | 0.005 | 8.89E−02 | −0.016 | 0.005 | 1.44E−03 |
| rs4903162 | 47 | 14:73978916 | ENTPD5 | G:A | −0.015 | 0.003 | 7.00E−09 | −0.008 | 0.003 | 1.10E−02 | 2.34E−09 | 1.40E−06 | −0.015 | 0.004 | 8.07E−05 | −0.001 | 0.004 | 8.75E−01 |
| rs62023488 | 48 | 15:52782755 | ONECUT1 | A:G | −0.024 | 0.004 | 3.90E−09 | −0.019 | 0.005 | 3.30E−05 | 2.81E−11 | 1.25E−09 | −0.019 | 0.006 | 7.86E−04 | −0.009 | 0.006 | 1.35E−01 |
| rs12903747 | 49 | 15:79157285 | LOC107984750 | C:A | −0.013 | 0.002 | 1.20E−07 | −0.011 | 0.003 | 1.80E−04 | 2.31E−09 | 3.24E−08 | −0.010 | 0.004 | 4.44E−03 | −0.006 | 0.004 | 1.31E−01 |
| rs17612964 | 50 | 15:93298966 | LOC105370982 | A:G | 0.014 | 0.002 | 1.70E−08 | 0.009 | 0.003 | 1.90E−03 | 1.51E−09 | 2.55E−07 | 0.013 | 0.004 | 3.39E−04 | 0.003 | 0.004 | 4.98E−01 |
| rs62621400 | 51 | 15:101178034 | CHSY1 | G:C | −0.026 | 0.005 | 2.00E−06 | −0.034 | 0.006 | 6.00E−08 | 7.86E−10 | 4.28E−11 | −0.012 | 0.008 | 1.22E−01 | −0.028 | 0.009 | 1.03E−03 |
| rs4985152 | 52 | 16:15042762 | NTAN1 | A:T | −0.014 | 0.003 | 1.30E−07 | −0.012 | 0.003 | 1.90E−04 | 2.63E−09 | 3.34E−08 | −0.011 | 0.004 | 5.09E−03 | −0.007 | 0.004 | 1.25E−01 |
| rs72801818 | 53 | 16:53442130 | RBL2 | T:C | 0.015 | 0.003 | 8.60E−08 | 0.018 | 0.003 | 2.60E−08 | 1.85E−11 | 2.64E−12 | 0.008 | 0.004 | 4.44E−02 | 0.014 | 0.004 | 1.44E−03 |
| rs4783585 | 54 | 16:67233376 | FHOD1 | A:G | 0.025 | 0.005 | 1.20E−07 | 0.029 | 0.005 | 7.00E−08 | 3.87E−11 | 9.20E−12 | 0.014 | 0.007 | 3.64E−02 | 0.022 | 0.007 | 2.66E−03 |
| rs3115960 | 55 | 16:75482636 | CHST6 | G:C | 0.008 | 0.002 | 1.20E−03 | 0.017 | 0.003 | 5.40E−09 | 6.98E−07 | 4.97E−10 | −0.001 | 0.004 | 8.50E−01 | 0.017 | 0.004 | 1.15E−05 |
| rs2338115 | 56 | 17:38773325 | PIP4K2B | C:T | −0.013 | 0.002 | 5.60E−08 | −0.010 | 0.003 | 5.20E−04 | 2.06E−09 | 7.43E−08 | −0.011 | 0.003 | 1.88E−03 | −0.005 | 0.004 | 2.31E−01 |
| rs17384555 | 57 | 18:5618174 | EPB41L3 | T:G | 0.017 | 0.003 | 5.60E−09 | 0.015 | 0.003 | 1.90E−06 | 7.09E−12 | 7.55E−11 | 0.013 | 0.004 | 1.96E−03 | 0.009 | 0.004 | 4.45E−02 |
| rs11085720 | 58 | 19:10207087 | DNMT1 | A:G | 0.014 | 0.003 | 4.00E−08 | 0.016 | 0.003 | 1.10E−08 | 4.20E−12 | 8.58E−13 | 0.008 | 0.004 | 2.57E−02 | 0.012 | 0.004 | 1.72E−03 |
| rs10423961 | 59 | 19:17721493 | MAP1S | G:A | 0.011 | 0.002 | 3.50E−06 | 0.015 | 0.003 | 1.80E−07 | 3.03E−09 | 1.57E−10 | 0.005 | 0.003 | 1.70E−01 | 0.013 | 0.004 | 1.12E−03 |
| rs4809731 | 60 | 20:48879230 | LOC105372648 | G:C | 0.019 | 0.004 | 1.80E−07 | 0.022 | 0.004 | 1.20E−07 | 7.71E−11 | 2.00E−11 | 0.011 | 0.005 | 3.87E−02 | 0.017 | 0.006 | 3.21E−03 |
| rs6122888 | 61 | 20:50210343 | CEBPB | C:T | −0.013 | 0.002 | 1.60E−07 | −0.011 | 0.003 | 1.70E−04 | 2.90E−09 | 3.75E−08 | −0.010 | 0.004 | 4.73E−03 | −0.006 | 0.004 | 1.30E−01 |
| rs4823084 | 62 | 22:30280862 | MTMR3 | A:G | −0.015 | 0.003 | 2.20E−08 | −0.009 | 0.003 | 2.80E−03 | 2.54E−09 | 5.95E−07 | −0.014 | 0.004 | 2.18E−04 | −0.002 | 0.004 | 6.26E−01 |
Discussion
In this manuscript we compared the power of three strategies to detect loci in maternal and fetal GWAS of the same trait: (i) the traditional strategy of analysing maternal and fetal GWAS separately; (ii) a two degree of freedom test which combines information from maternal and fetal GWAS; and (iii) a one degree of freedom test where signals from maternal and fetal GWAS are meta-analysed together conditional on the estimated sample overlap. We found that which strategy performed best depended on the degree of sample overlap between maternal and fetal GWAS, the correlation between own and offspring phenotype, whether individual trait loci were simultaneously influenced by maternal and fetal effects, and if so, whether the effects were directionally concordant.
Our simulations showed that in general, when maternal and fetal GWAS contained only partially overlapping samples (e.g. as is the case for most published GWAS meta-analyses of perinatal traits to date), considerable gains in power could be achieved using the meta-analytic strategy or using the two degree of freedom test compared to the traditional approach of analysing the scans separately. This makes sense intuitively since the traditional strategy of performing separate GWAS does not make use of the information contained across the scans. In these situations, for loci exhibiting concordant maternal and fetal effects (of similar magnitude) the two degree of freedom and the meta-analytic strategy typically performed similarly, whereas for loci involving only maternal or only fetal effects, a simple meta-analysis was often most powerful. Our results imply that many existing GWAS for perinatal traits could benefit from reanalysis using a two degree of freedom test or meta-analytic strategy (a point we demonstrate in this manuscript with the empirical analysis of birth weight).
In contrast, when maternal and fetal GWAS involved completely overlapping samples (as is the case for cohorts where all individuals report their own and their offspring’s phenotype), then which strategy was best depended on whether individual trait loci were simultaneously influenced by maternal and fetal effects, whether the effects were directionally concordant, and the background correlation between own and offspring phenotype. For loci that only exhibited effects through the fetal or maternal genome, then the traditional strategy of analysing maternal and fetal GWAS separately performed best. For loci that exhibited concordant maternal and fetal effects, the meta-analytic strategy was best (with the two degree of freedom test a close second).
Interestingly, our simulations also showed that when a single locus exhibited maternal and fetal effects that operated in different directions, a two degree of freedom test that modelled these effects often had vastly improved power to detect association compared to simple one degree of freedom tests (regardless of the degree of sample overlap). This result is similar to what we have observed previously in tests of genetic association involving individual level genotypes using a structural equation modelling framework7,11. Our results make sense intuitively as the two degree of freedom test expends an extra degree of freedom productively modelling both maternal and fetal effects, whereas the meta-analytic strategy combines effects in such a way that discordant effects may cancel out. Loci that exhibit maternal and fetal genetic effects in opposite directions are not uncommon in GWAS of perinatal traits related to growth 6,7. The implication of our power analyses is that a proportion of these loci may be missed if only one degree of freedom tests are performed across the genome (i.e. either by meta-analysis or separate maternal and fetal GWAS).
In empirical analyses of birth weight, and consistent with our simulations, both the two degree of freedom test and the meta-analytic strategy identified greater numbers of loci meeting genome-wide significance than analysing maternal and fetal GWAS separately. Both the meta-analytic strategy and the two degree of freedom test also identified more genome-wide significant loci for birth weight than MTAG 13, a popular software package that is commonly used in the multivariate analysis of GWAS summary results statistics. This makes sense since the two degree of freedom test (and the meta-analytic strategy) are derived by specifically modelling the correlation between maternal and fetal genotypes whereas other multivariate tests e.g. MTAG13 use models that are agnostic to the mother-offspring relationship and do not take advantage of this known source of information.
So which type of analysis is optimal in the case of birth weight and (potentially) other perinatal phenotypes, particularly in situations where there is only partial overlap between maternal and fetal GWAS? The meta-analytic strategy yielded considerably greater numbers of loci meeting genome-wide significance than the two degree of freedom strategy (i.e. 226 vs 213 loci), suggesting that even for a trait like birth weight, a simple meta-analytic strategy may be optimal when maternal and fetal GWAs are only partially overlapping. However, whilst all loci that met genome-wide significance using the simple meta-analytic strategy were also significant or almost significant using DINGO (i.e. all loci would be flagged by researchers as significant or suggestive), many genome-wide significant loci that displayed directionally discordant effects in the two degree of freedom DINGO test exhibited p-values that were far less extreme using the simple meta-analytic strategy, including some loci that may have been missed entirely (e.g. the GLIS3 locus discussed below). The implication is that a two degree of freedom test may be the preferable strategy in the case of phenotypes that are known to exhibit substantial numbers of loci with directionally discordant effects (e.g. perinatal growth-related traits like birth weight), whereas a simple meta-analytic strategy may be the superior strategy in the case of other traits (bearing in mind that such knowledge about likely underlying genetic architecture may not always be available from previous GWAS a priori). We have implemented both the meta-analytic tests and the two degree of freedom test as part of the DINGO package in the publicly available web-based software CTG-VL17.
We would like to highlight two findings from our empirical GWAS of birth weight that we think deserve further attention. The first is the GLIS3 locus. As mentioned above, it is unlikely that this locus would have been flagged by researchers adopting a simple meta-analytic strategy, although interestingly the locus was identified previously in the deCODE paper using separate maternal and fetal GWAS (although not discussed explicitly by the authors). GLIS3 encodes a member of the GLI-similar zinc finger protein family. The GLIS3 protein functions as both a repressor and activator of transcription that is involved in the development of pancreatic beta cells. Homozygous mutant Glis3 mice develop neonatal diabetes due to insufficient pancreatic beta cells 22, and deletions of the gene in humans are associated with neonatal diabetes 23–25. The “A” allele of the rs7034200 variant within the GLIS3 gene (i.e. which in our study is associated with increased offspring birth weight when present in the maternal genome, but decreased birth weight when present in the fetal genome), has also been associated with increased risk of type 2 diabetes 26–31, higher fasting glucose levels 28,32,33 and impaired beta cell function 28,33–35. The glucose raising allele at this SNP is also strongly associated with impaired beta-cell function in non-diabetic adults 33,35 and in healthy children and adolescents 32. We therefore hypothesise that diabetes predisposing variants at this locus may increase levels of circulating glucose in pregnant mothers (i.e. which would tend to increase intrauterine growth and offspring birth weight), but simultaneously act to decrease offspring birth weight when the same alleles are transmitted to the fetus (e.g. by impairing beta cell function and therefore fetal insulin secretion). Similar mechanisms are thought to underlie other diabetes predisposing variants that show directionally inconsistent maternal and fetal genetic associations with birth weight6,7,36.
The second result we would like to highlight is the genome-wide significant SNP rs11085720 in an intron in the DNMT1 gene. Although this SNP was not significant in the deCODE meta-analysis, it was strongly significant in both the two degree of freedom (p-value = 8.6 × 10−13) and one degree of freedom meta-analyses (pmin = 2.3 × 10−12). The gene DNMT1 codes for the enzyme DNA methyltransferase 1 which is responsible for the addition of methyl groups to specific CpG structures in DNA. Methylation of CpG islands is associated with transcriptional silencing and knockout experiments suggest that this enzyme is responsible for the bulk of methylation in mouse cells and is essential for embryonic development37. In humans, rare variants in the DNMT1 gene are associated with forms of cerebellar ataxia38 and neuropathy39,40. Although we cannot be sure that DNMT1 is indeed the causative gene at this locus, we highlight its potential involvement because of the considerable interest in the relationship between adverse environmental exposures during pregnancy, possible mediation via DNA methylation and long-term effects on disease risk due to putative intrauterine programming. In this respect, it is interesting to note that conditional analyses suggest that this locus involves both maternal and fetal effects. How methylation levels in mothers could in turn affect the birth weight of their offspring is unclear, however it is interesting that studies of knock-out mice have shown that a lack of both maternal and zygotic Dnmt1 results in complete demethylation of imprinted genes in (mouse) blastocysts41.
Several simple extensions to/applications of our methods present themselves. First, although we have applied our methods to birth weight, there exists many other traits where maternal and offspring GWAS have been conducted including gestational duration9,10,42 and placental weight43 which involve only partially overlapping samples and could benefit from reanalysis using the framework that we have outlined in this manuscript. Second, there is increasing interest in and growing realization in the scientific community that indirect genetic effects contribute not only to perinatal traits like birth weight1,6,7,18,44, but also to many later life socially patterned traits like educational attainment14–16. The implication is that substantial gains in power to detect loci underlying many of these traits might be achieved by combining parental and offspring GWAS either through a simple meta-analytic strategy, or by performing multi degree of freedom tests across the genome to GWAS summary results data derived from parents and children. Third, although the development of DINGO was motivated by the analysis of traits that are affected by indirect maternal/paternal effects, there is no reason why analogous methods cannot also be applied to traits where e.g. the offspring’s genome affects e.g. the mother’s phenotype. For example, offspring genotype is thought to affect risk of maternal preeclampsia and so it would be possible in theory to combine a fetal GWAS of preeclampsia with a GWAS of maternal preeclampsia using a similar procedure for binary traits45,46 (see below). Other phenotypes such as twinning and fertility/fecundity might also be interesting candidates for such analyses. Fourth, we note that our framework can be easily extended to a three degree of freedom test that incorporates summary results statistics from maternal, paternal and “fetal” GWAS (see Supplementary Note). Whilst a three degree of freedom test would most likely be inefficient for the analysis of perinatal traits (i.e. where indirect paternal genetic effects are likely to be negligible and so the estimation of a paternal genetic effect parameter would be of little benefit whilst requiring an extra degree of freedom), such a test could potentially be useful for the analysis of traits like educational attainment which are known to involve all three sorts of effect 16. Alternatively, employing a simple meta-analytic approach, but combining across maternal, paternal and offspring GWAS could also be advantageous (also see Supplementary Note).
There are several limitations to the methods proposed in this manuscript. First, the two degree of freedom test and the meta-analytic strategy utilize LD score regression to provide an accurate estimate of effective sample overlap. Therefore, all factors discussed in the literature that may affect the accuracy/precision of the LD score intercept apply mutatis mutandis to the present methods. We recommend that the sample sizes of both maternal and fetal GWAS be large and as ancestrally homogenous as possible to accurately estimate sample overlap. Second, the formulae for the two degree of freedom and meta-analytic tests that we have derived requires that the coefficients from the maternal and fetal GWAS are from a linear regression analysis of quantitative traits. It is critically important that these different GWAS are analysed on (or can be transformed to) the same scale. In the case of binary traits, a number of approaches exist for transforming logistic regression coefficients to a (normal) liability scale47–50. In theory it should be possible to transform these logistic coefficients (and their standard errors) to the liability scale and then apply the methods outlined in this manuscript to permit a similar analysis of binary traits. We expect that analogous to quantitative traits, such a method would improve the power to detect loci that simultaneously exert direct and indirect genetic effects and would accurately model their contribution to trait variation through maternal and fetal pathways. In contrast, existing methods for analysing binary data are geared towards increasing the power to detect direct genetic effects only through the utilization of self-report data from ungenotyped related individuals, or are appropriate for the analysis of individual level genotypes (i.e. rather than summary results data)51–53. Finally, our power simulations and empirical analyses used a Bonferroni correction to determine the genome-wide significance of the one degree of freedom tests. Simulations we have conducted suggest that this correction is appropriate although very slightly conservative in the case of substantial sample overlap and phenotypic correlation (see Supplementary Note). Thus, our analyses may have slightly underestimated the true power of the one degree of freedom tests under these conditions.
In conclusion, in this manuscript we have introduced two new methods designed to increase the power of locus discovery in meta-analyses of summary results statistics from maternal and fetal GWAS. We have shown through a combination of analytical results, simulation and empirical analysis that a simple one degree of freedom meta-analytic strategy where signals from maternal and fetal GWAS are meta-analysed together conditional on the estimated sample overlap is likely to be powerful strategy in the analysis of perinatal traits. Indeed, adopting this strategy increased the number of known birth weight loci by 62 and implicated a number of known and novel pathways in the aetiology of the trait. We have also shown that a two degree of freedom test may be particularly powerful strategy for analysing traits where a substantial proportion of loci involve discordant maternal and fetal genetic effects of similar magnitude. The two degree of freedom test and the one degree of freedom meta-analytic procedure described in this manuscript are implemented in the “Direct and INdirect effects analysis of Genetic lOci” (DINGO) package, available as part of the CTG-VL software.17
Supplementary Material
Acknowledgements
This work was supported by Australian National Health and Medical Research Council (NHMRC) grants (GNT1183074, GNT1157714). D.M.E. is supported by an NHMRC Leadership Fellowship (2017942). N.M.W is supported by an NHMRC Emerging Leadership Fellowship (2008723). G.H.M. is the recipient of an Australian Research Council Discovery Early Career Award (Project number: DE220101226) funded by the Australian Government and supported by the Research Council of Norway (Project grant: 325640). L.Y. is supported by a Australian Research Council Future Fellowship (FT220100069). R.M.F. and R.N.B. are supported by a Wellcome Senior Research Fellowship (WT220390). R.M.F. is also supported by a grant from the Eunice Kennedy Shriver National Institute Of Child Health & Human Development of the National Institutes of Health under Award Number R01HD101669. This research was funded in part by the Wellcome Trust (WT220390). Human genotype and phenotype data from the UKB on which the results of simulations were based were accessed with accession ID 53641.
References
- 1.Beaumont R.N. et al. Genome-wide association study of offspring birth weight in 86 577 women identifies five novel loci and highlights maternal genetic effects that are independent of fetal genetics. Hum Mol Genet 27, 742–756 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Freathy R.M. et al. Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight. Nat Genet 42, 430–5 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Horikoshi M. et al. Genome-wide associations for birth weight and correlations with adult disease. Nature 538, 248–252 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Horikoshi M. et al. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nat Genet 45, 76–82 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Juliusdottir T. et al. Distinction between the effects of parental and fetal genomes on fetal growth. Nat Genet 53, 1135–1142 (2021). [DOI] [PubMed] [Google Scholar]
- 6.Warrington N.M. et al. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat Genet 51, 804–814 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Warrington N.M., Freathy R.M., Neale M.C. & Evans D.M. Using structural equation modelling to jointly estimate maternal and fetal effects on birthweight in the UK Biobank. Int J Epidemiol 47, 1229–1241 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Liu X. et al. Variants in the fetal genome near pro-inflammatory cytokine genes on 2q13 associate with gestational duration. Nat Commun 10, 3927 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhang G. et al. Genetic Associations with Gestational Duration and Spontaneous Preterm Birth. N Engl J Med 377, 1156–1167 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sole-Navais P. et al. Genetic effects on the timing of parturition and links to fetal birth weight. Nat Genet (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Moen G.H., Hemani G., Warrington N.M. & Evans D.M. Calculating Power to Detect Maternal and Offspring Genetic Effects in Genetic Association Studies. Behav Genet 49, 327–339 (2019). [DOI] [PubMed] [Google Scholar]
- 12.Bulik-Sullivan B. et al. An atlas of genetic correlations across human diseases and traits. Nat Genet 47, 1236–41 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Turley P. et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet 50, 229–237 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Demange P.A. et al. Estimating effects of parents’ cognitive and non-cognitive skills on offspring education using polygenic scores. Nat Commun 13, 4801 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Howe L.J. et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat Genet 54, 581–592 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hwang L.D., Moen G.H. & Evans D.M. Using adopted individuals to partition indirect maternal genetic effects into prenatal and postnatal effects on offspring phenotypes. Elife 11(2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cuellar-Partida G. et al. Complex-Traits Genetics Virtual Lab: A community-driven web platform for post-GWAS analyses. Bioarchiv, 10.1101/518027 (2019). [DOI] [Google Scholar]
- 18.Wu Y. et al. Estimating genetic nurture with summary statistics of multigenerational genome-wide association studies. Proc Natl Acad Sci U S A 118(2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Li A. et al. mBAT-combo: a more powerful test to detect gene-trait associations from GWAS data. BioRxiv, https://www.biorxiv.org/content/10.1101/2022.06.27.497850v1 (2022). [DOI] [PMC free article] [PubMed]
- 20.Morris J.A. et al. An atlas of genetic influences on osteoporosis in humans and mice. Nat Genet 51, 258–266 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Watanabe K., Taskesen E., van Bochoven A. & Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun 8, 1826 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kang H.S., Beak J.Y., Kim Y.S., Herbert R. & Jetten A.M. Glis3 is associated with primary cilia and Wwtr1/TAZ and implicated in polycystic kidney disease. Mol Cell Biol 29, 2556–69 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Senee V. et al. Mutations in GLIS3 are responsible for a rare syndrome with neonatal diabetes mellitus and congenital hypothyroidism. Nat Genet 38, 682–7 (2006). [DOI] [PubMed] [Google Scholar]
- 24.Taha D., Barbar M., Kanaan H. & Williamson Balfe J. Neonatal diabetes mellitus, congenital hypothyroidism, hepatic fibrosis, polycystic kidneys, and congenital glaucoma: a new autosomal recessive syndrome? Am J Med Genet A 122A, 269–73 (2003). [DOI] [PubMed] [Google Scholar]
- 25.Dimitri P. et al. Novel GLIS3 mutations demonstrate an extended multisystem phenotype. Eur J Endocrinol 164, 437–43 (2011). [DOI] [PubMed] [Google Scholar]
- 26.Dou H.Y. et al. Association between genetic variants and characteristic symptoms of type 2 diabetes: A matched case-control study. Chin J Integr Med 23, 415–424 (2017). [DOI] [PubMed] [Google Scholar]
- 27.Hu C. et al. Variants from GIPR, TCF7L2, DGKB, MADD, CRY2, GLIS3, PROX1, SLC30A8 and IGF1 are associated with glucose metabolism in the Chinese. PLoS One 5, e15542 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Liu C. et al. Variants in GLIS3 and CRY2 are associated with type 2 diabetes and impaired fasting glucose in Chinese Hans. PLoS One 6, e21464 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Miranda-Lora A.L. et al. Genetic polymorphisms associated with pediatric-onset type 2 diabetes: A family-based transmission disequilibrium test and case-control study. Pediatr Diabetes 20, 239–245 (2019). [DOI] [PubMed] [Google Scholar]
- 30.Rees S.D. et al. Effects of 16 genetic variants on fasting glucose and type 2 diabetes in South Asians: ADCY5 and GLIS3 variants may predispose to type 2 diabetes. PLoS One 6, e24710 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Inshaw J.R.J. et al. Analysis of overlapping genetic association in type 1 and type 2 diabetes. Diabetologia 64, 1342–1347 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Barker A. et al. Association of genetic Loci with glucose levels in childhood and adolescence: a meta-analysis of over 6,000 children. Diabetes 60, 1805–12 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dupuis J. et al. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 42, 105–16 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Boesgaard T.W. et al. Variants at DGKB/TMEM195, ADRA2A, GLIS3 and C2CD4B loci are associated with reduced glucose-stimulated beta cell function in middle-aged Danish people. Diabetologia 53, 1647–55 (2010). [DOI] [PubMed] [Google Scholar]
- 35.Hong K.W., Chung M. & Cho S.B. Meta-analysis of genome-wide association study of homeostasis model assessment beta cell function and insulin resistance in an East Asian population and the European results. Mol Genet Genomics 289, 1247–55 (2014). [DOI] [PubMed] [Google Scholar]
- 36.Hattersley A.T. & Tooke J.E. The fetal insulin hypothesis: an alternative explanation of the association of low birthweight with diabetes and vascular disease. Lancet 353, 1789–92 (1999). [DOI] [PubMed] [Google Scholar]
- 37.Li E., Bestor T.H. & Jaenisch R. Targeted mutation of the DNA methyltransferase gene results in embryonic lethality. Cell 69, 915–26 (1992). [DOI] [PubMed] [Google Scholar]
- 38.Winkelmann J. et al. Mutations in DNMT1 cause autosomal dominant cerebellar ataxia, deafness and narcolepsy. Hum Mol Genet 21, 2205–10 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Klein C.J. et al. DNMT1 mutation hot spot causes varied phenotypes of HSAN1 with dementia and hearing loss. Neurology 80, 824–8 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Klein C.J. et al. Mutations in DNMT1 cause hereditary sensory neuropathy with dementia and hearing loss. Nat Genet 43, 595–600 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hirasawa R. et al. Maternal and zygotic Dnmt1 are necessary and sufficient for the maintenance of DNA methylation imprints during preimplantation development. Genes Dev 22, 1607–16 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Sole-Navais P. e.a. Genetic effects on the timing of parturition and links to fetal birth weight. medRxiv, 10.1101/2022.05.04.22274624 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Beaumont R. e.a. Genome-wide association study of placental weight in 179,025 children and parents reveals distinct and shared genetic influences between placental and fetal growth. medRxiv, 10.1101/2022.11.25.22282723 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Warrington N.M., Hwang L.D., Nivard M.G. & Evans D.M. Estimating direct and indirect genetic effects on offspring phenotypes using genome-wide summary results data. Nat Commun 12, 5420 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gray K.J., Saxena R. & Karumanchi S.A. Genetic predisposition to preeclampsia is conferred by fetal DNA variants near FLT1, a gene involved in the regulation of angiogenesis. Am J Obstet Gynecol 218, 211–218 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Steinthorsdottir V. et al. Genetic predisposition to hypertension is associated with preeclampsia in European and Central Asian women. Nat Commun 11, 5976 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gillett A.C., Vassos E. & Lewis C.M. Transforming Summary Statistics from Logistic Regression to the Liability Scale: Application to Genetic and Environmental Risk Scores. Hum Hered 83, 210–224 (2018). [DOI] [PubMed] [Google Scholar]
- 48.Pawitan Y., Seng K.C. & Magnusson P.K. How many genetic variants remain to be discovered? PLoS One 4, e7969 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.So H.C., Gui A.H., Cherny S.S. & Sham P.C. Evaluating the heritability explained by known susceptibility variants: a survey of ten complex diseases. Genet Epidemiol 35, 310–7 (2011). [DOI] [PubMed] [Google Scholar]
- 50.Wu T. & Sham P.C. On the Transformation of Genetic Effect Size from Logit to Liability Scale. Behav Genet 51, 215–222 (2021). [DOI] [PubMed] [Google Scholar]
- 51.de la Fuente J., Grotzinger A.D., Marioni R.E., Nivard M.G. & Tucker-Drob E.M. Integrated analysis of direct and proxy genome wide association studies highlights polygenicity of Alzheimer’s disease outside of the APOE region. PLoS Genet 18, e1010208 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Liu J.Z., Erlich Y. & Pickrell J.K. Case-control association mapping by proxy using family history of disease. Nat Genet 49, 325–331 (2017). [DOI] [PubMed] [Google Scholar]
- 53.Hujoel M.L.A., Gazal S., Loh P.R., Patterson N. & Price A.L. Liability threshold modeling of case-control status and family history of disease increases association power. Nat Genet 52, 541–547 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






