
This is a preprint.

It has not yet been peer reviewed by a journal.


medRxiv [Preprint]. 2023 Aug 15:2023.08.10.23293932. [Version 1] doi: 10.1101/2023.08.10.23293932

Pregnancy-Associated Bleeding and Genetics: Five Sequence Variants in the Myometrium and Progesterone Signaling Pathway are associated with postpartum hemorrhage

David Westergaard 1,2,3, Valgerdur Steinthorsdottir 4, Lilja Stefansdottir 4, Palle Duun Rohde 5, Xiaoping Wu 6,7, Frank Geller 6,7, Jaakko Tyrmi 8, Aki S Havulinna 9,10, Pol Sole Navais 11, Christopher Flatley 11, Sisse Rye Ostrowski 6,12, Ole Birger Pedersen 13,12, Christian Erikstrup 14,15, Erik Sørensen 6, Christina Mikkelsen 6,16, Mie Topholm Brun 17,18, Bitten Aagaard Jensen 19, Thorsten Brodersen 13, Henrik Ullum 7; FinnGen20; Danish Blood Donor Study Genomic Consortium21; Estonian Biobank Research Team22; Nordic Collaboration for Womens and Reproductive Health23, Per Magnus 24, Ole A Andreassen 25,26,27, Pål R Njolstad 28,29, Astrid Marie Kolte 30, Lone Krebs 1,12, Mette Nyegaard 5, Thomas Folkmann Hansen 2,31, Bjarke Fenstra 6,7, Mark Daly 9,32,33, Cecilia M Lindgren 34,35,36,33, Gudmar Thorleifsson 4, Olafur A Stefansson 4, Gardar Sveinbjornsson 4, Daniel F Gudbjartsson 4,37, Unnur Thorsteinsdottir 4,38, Karina Banasik 1,2, Bo Jacobsson 11,24, Triin Laisk 22, Hannele Laivuori 9,39,40,8, Kari Stefansson 4,38, Søren Brunak 2, Henriette Svarre Nielsen 30,12
PMCID: PMC10462219  PMID: 37645979

Abstract

Bleeding in early pregnancy and postpartum hemorrhage (PPH) bear substantial risks, with the former closely associated with pregnancy loss and the latter being the foremost cause of maternal death, underscoring the severity of these complications in maternal-fetal health. Here, we investigated the genetic variation underlying aspects of pregnancy-associated bleeding and identified five loci associated with PPH through a meta-analysis of 21,512 cases and 259,500 controls. Functional annotation analysis indicated candidate genes, HAND2, TBX3, and RAP2C/FRMD7, at three loci and showed that at each locus, associated variants were located within binding sites for progesterone receptors (PGR). Furthermore, there were strong genetic correlations with birth weight, gestational duration, and uterine fibroids. Early bleeding during pregnancy (28,898 cases and 302,894 controls) yielded no genome-wide association signals, but showed strong genetic correlation with a variety of human traits, indicative of polygenic and pleiotropic effects. Our results suggest that postpartum bleeding is related to myometrium dysregulation, whereas early bleeding is a complex trait related to underlying health and possibly socioeconomic status.

Introduction

Pregnancy-associated bleeding can occur at all stages of pregnancy. Bleeding in early pregnancy can range in significance from a benign event with no adverse effects, to an indication of ongoing pregnancy loss, and even serve as a potential marker for later pregnancy loss, obstetric complications, and long-term maternal comorbidities1,2. Postpartum hemorrhage (PPH) is the leading cause of maternal mortality, with approximately 100,000 young and otherwise healthy women dying every year worldwide3. Despite affecting more than one in ten births and being a heritable condition, PPH remains unexplored at the genetic and molecular level4. Prior candidate gene studies have focused on genes involved in the coagulation pathways5. Even though the etiology of PPH is multifactorial, it often occurs even when established risk factors are not present6,7.

The primary cause of PPH is uterine atony, which accounts for 70% of all cases3. Other causes include retained placental tissue, trauma, and congenital or acquired coagulation disorders. Early identification and correct management of PPH can prevent maternal mortality and morbidity8. Therefore, there is great interest in assessing PPH risk prior to labor, and a large body of literature has described detailed prognostic models. However, a recent review showed that almost half of the existing prognostic models include features that can only be obtained postpartum9. Consequently, there is an urgent clinical need to understand the molecular etiology and identify novel biomarkers that characterize high-risk women prior to labor to initiate timely preventive measures and monitoring.

Here, we report the results of genome-wide association studies (GWAS) of up to 302,894 women from six Northern European cohorts to identify the genetic etiology of bleeding during different stages of pregnancy. Our results reveal complexity in the genetics of early bleeding and highlight the importance of the myometrium and progesterone-responsive genes in the etiology of PPH.

Results

Overall findings

Combining data from six Northern European cohorts including up to 331,792 women, we investigated the genetic architecture of three phenotypes related to bleeding during pregnancy: early bleeding (28,898 cases), antepartum hemorrhage (3,236 cases), and postpartum hemorrhage (PPH) (21,521 cases) (Figure 1A). We further divided early bleeding into “early bleeding with any outcome” (28,898 cases) and “early bleeding ending in live birth” (6,356 cases) (Supplementary Table 1). We included up to 18,009,056 sequence variants in a meta-analysis and identified five loci (on chromosomes 4, 6, 10, 12, and X) that were associated with PPH using a functionally informed multiple testing correction (Figure 1B, Table 1). The effect sizes were similar across all cohorts (Supplementary Figure 1A), and conditional analysis revealed no secondary signals. We observed no significant associations for early bleeding and antepartum hemorrhage (Supplementary Figures 2–4). In addition, we analyzed uterine atony (13,048 cases and 261,809 controls) and retained placental tissue (6,256 cases and 266,427 controls), where three (chromosomes 4, 6, and 10) and one (chromosome X) of the five associated loci passed multiple testing correction, respectively (Figure 1C, Supplementary Table 2). We did not observe any significant differences in effect sizes between uterine atony and retained placental tissue when comparing the lead variants from the five loci (Supplementary Table 2). We found no evidence of confounding or inflation in any of the analyses (Supplementary Table 3).

Figure 1.


(A) Overview of the phenotypes under investigation. Early bleeding occurs up until the 20th gestational week, antepartum hemorrhage between the 20th gestational week and birth, and postpartum hemorrhage after birth. (B) Manhattan plot of postpartum hemorrhage showing the 18M variants, with SNPs passing the functionally informed multiple testing criteria highlighted in green. (C) Miami plot comparing postpartum hemorrhage due to atony (top) and retained placenta (bottom). Green dots indicate SNPs passing the multiple testing threshold.

Figure 2.


(A) MAGMA single-cell enrichment from the Human Protein Atlas. Smooth muscle cells and endothelial cells were both enriched (FDR < 0.05). (B) MAGMA bulk tissue enrichment from the Human Protein Atlas showed an enrichment of endometrial, smooth muscle, seminal vesicle, and thyroid gland tissue (FDR < 0.05).

Table 1.

Effect sizes across loci for PPH, endometriosis and uterine fibroids. Endometriosis and uterine fibroid estimates come from the datasets listed in Supplementary Table 4.

| CHR | BP (hg38) | RSID | Effect allele | Effect allele frequency | Postpartum hemorrhage, OR (95% CI; P-value) | Endometriosis, OR (95% CI; P-value) | Uterine fibroids, OR (95% CI; P-value) |
|---|---|---|---|---|---|---|---|
| 4 | 173807552 | rs13141656 | T | 0.30 | 1.10 (1.08–1.13; 1.42e-17) | 0.98 (0.96–1.0; 0.014) | 0.98 (0.96–0.99; 0.00076) |
| 6 | 143642758 | rs12195857 | A | 0.32 | 1.10 (1.08–1.13; 9.86e-17) | 0.97 (0.95–0.99; 0.0022) | 0.97 (0.96–0.98; 2.8e-5) |
| 10 | 31660483 | rs11591307 | A | 0.22 | 1.08 (1.05–1.11; 1.3e-9) | 1.03 (1.03–1.05; 0.015) | 0.94 (0.93–0.95; 4.6e-16) |
| 12 | 114656455 | rs11067228 | G | 0.42 | 1.07 (1.05–1.10; 4.33e-11) | 0.96 (0.95–0.98; 2.8e-5) | 1.00 (0.99–1.02; 0.74) |
| X | 132131995 | rs2747025 | A | 0.32 | 0.91 (0.89–0.94; 9e-15) | 0.93 (0.91–0.95; 1.2e-14) | 1.17 (1.15–1.18; 4.6e-113) |

Prior evidence of SNPs

According to the GWAS catalog10, the lead variant on chromosome 12 has previously been found in association with heel bone mineral density and prostate-specific antigen levels in males, both of which involve hormone-responsive tissues. Additionally, the lead variants on chromosomes 10 and X were in strong (r2 > 0.8) linkage disequilibrium (LD) with variants associated with uterine fibroids and endometriosis, while the lead variant on chromosome 6 was in strong LD with a sequence variant associated with educational attainment (Supplementary Table 4). Furthermore, we investigated the genome-wide significant lead variants in the FinnGen cohort (R9) and found that the lead variants on chromosome 12 (TBX3) and chromosome X (FRMD7/RAP2C) were also associated with endometriosis, and the loci on chromosomes 6 (PHACTR2), 10 (ZEB1), and X (FRMD7/RAP2C) were associated with uterine fibroids (Table 1) (Supplementary Figure 1B).

Table 2.

PPH signals were enriched (p <0.05, Bonferroni corrected) within binding sites for progesterone receptor (PGR) defined in human embryonic stem cells (hESC). Shown are nominally significant results i.e., where uncorrected p-value <0.05. We defined binding sites by ChIP-seq data available through Remap2022 database (website: remap.univ-amu.fr).

| DNA binding protein | Tissue/Cell line | Annotated PPH signals, n | Expected proportion of annotated PPH signals | p-value |
|---|---|---|---|---|
| PGR | hESC | 4/5 | 5% | 5e-06 |
| ZNF558 | HEK293 | 4/5 | 27% | 7e-05 |
| PGR | myometrium | 5/5 | 17% | 0.001 |
| PGR | leiomyoma | 3/5 | 8.6% | 0.002 |
| IRF2BP2 | HEK293 | 3/5 | 9.9% | 0.005 |
| MED12 | leiomyoma | 3/5 | 11% | 0.007 |
| MYOG | RH4 | 3/5 | 11% | 0.009 |
| MED12 | myometrium | 3/5 | 14% | 0.01 |
| ONECUT1 | Hep-G2 | 3/5 | 15% | 0.019 |
| FOXA1 | prostate | 4/5 | 17% | 0.02 |
| ZNF3 | Hep-G2 | 3/5 | 7% | 0.023 |

Functional analysis of loci

We annotated the five PPH lead variants and their correlated variants (r2 > 0.80), hereafter referred to as PPH signals, according to their location in the ENCODE encyclopedia of candidate cis-regulatory elements (cCRE)11. Collectively, cCREs span 291 Mb of the genome and contain 10.2% of sequence variants. We found that all five PPH signals were located within either the distal or proximal enhancer-like sequences (dELS, pELS), suggesting non-coding regulatory functions (Supplementary Tables 5–7).

The predicted gene targets for these regulatory elements in uterine tissue are TBX3 (12q24.21), FRMD7 and RAP2C (Xq26.2) according to EpiMap12 (Supplementary Table 8). Furthermore, there is evidence that the lead SNP at the chromosome 4 locus, rs13141656, targets HAND2 in endometrial tissue13,14. None of these genes have been directly associated with PPH. HAND2 and TBX3 are involved in stromal-epithelial communication during implantation. HAND2 is implicated in preterm birth and gestational duration and has previously been found to be critical for implantation15,16. The function of the RAP2C/FRMD7 gene cluster is currently unknown, but variants in the RAP2C locus are associated with gestational duration17. None of the proteins are known to physically interact, according to the STRING database (v11.5)18.

We tested the PPH signals for enrichment within binding sites for 1,210 transcription factors (TFs) in DNA of various cell types and tissues19, amounting to a total of 4,143 tests, and used Bonferroni correction to set the significance threshold at p < 0.05/4,143 ≈ 1.2·10−5. The number of PPH signals found in PGR binding sites in human embryonic stem cells was significantly higher than expected (p=5·10−6, Table 2). PGR is an important factor in the establishment and maintenance of pregnancy and is therefore relevant in the context of PPH.
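The Bonferroni threshold used here can be reproduced with a few lines of arithmetic. The following is a minimal sketch rather than the authors' code; the function name `passes_bonferroni` is hypothetical, and the two p-values are taken from Table 2.

```python
# Bonferroni threshold for the TF binding-site enrichment tests.
n_tests = 4143   # number of tests, as stated in the text
alpha = 0.05     # family-wise error rate

threshold = alpha / n_tests  # roughly 1.2e-5


def passes_bonferroni(p_value: float) -> bool:
    """Return True when a p-value is below the corrected threshold (hypothetical helper)."""
    return p_value < threshold


pgr_hesc_p = 5e-6   # PGR in hESC (Table 2)
znf558_p = 7e-5     # ZNF558 in HEK293 (Table 2)

print(passes_bonferroni(pgr_hesc_p), passes_bonferroni(znf558_p))
```

Under this threshold only the PGR binding sites in hESC remain significant, which is consistent with the remaining rows of Table 2 being reported as nominally significant.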

We used MAGMA20 to test for tissue-specific enrichment using expression data from the Human Protein Atlas bulk tissue and single-cell datasets21. We found that the endometrium, smooth muscle, seminal vesicle, and thyroid gland tissue were enriched, as well as endothelial cells (FDR < 5%) (Figure 2A,B).

Maternal and fetal transmission

We performed a haplotype-specific analysis of the five PPH-associated variants in the MoBa and deCODE cohorts to distinguish between maternal and fetal effects. These results were consistent with all five variants affecting the risk of PPH primarily through the maternal genome (Supplementary Figure 5, Supplementary Table 10). However, we cannot exclude any effect from the fetal genome.

Heritability of pregnancy-associated bleeding traits

We estimated the SNP heritability of early bleeding in pregnancy and PPH to be 12.7% (95% CI 7.8–17.6%) and 16.5% (95% CI 10.2–22.8%), respectively, in the Danish cohort, assuming a population prevalence of 25% and 15%, respectively. We selected the prevalences based on a literature review2,8.
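The assumed population prevalence matters because heritability of a binary trait is usually reported on the liability scale. The sketch below shows the commonly used liability-threshold transformation from an observed-scale estimate; it is an illustration under that assumption, not necessarily the procedure used in this study, and the function name and the input values in the example are hypothetical.

```python
from statistics import NormalDist


def observed_to_liability(h2_observed: float,
                          population_prevalence: float,
                          sample_prevalence: float) -> float:
    """Convert observed-scale SNP heritability to the liability scale.

    Uses the standard liability-threshold transformation:
        h2_liab = h2_obs * K^2 (1 - K)^2 / (z^2 * P (1 - P))
    where K is the population prevalence, P the case fraction in the sample,
    and z the standard normal density at the liability threshold.
    """
    k, p = population_prevalence, sample_prevalence
    std_normal = NormalDist()
    liability_threshold = std_normal.inv_cdf(1.0 - k)
    z = std_normal.pdf(liability_threshold)
    return h2_observed * (k * (1.0 - k)) ** 2 / (z ** 2 * p * (1.0 - p))


# Hypothetical inputs: observed-scale estimate of 0.05 in a sample with 10% cases,
# converted under the 15% population prevalence assumed for PPH in the text.
print(observed_to_liability(0.05, population_prevalence=0.15, sample_prevalence=0.10))
```

Changing `population_prevalence` while holding the other inputs fixed shows how sensitive the liability-scale estimate is to the prevalence chosen from the literature.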

Genetic correlations between pregnancy-associated bleeding traits

We characterized the intra-phenotypic genetic correlations among the five bleeding in pregnancy phenotypes investigated in this study: “early bleeding in pregnancy, any outcome”, “early bleeding in pregnancy, live birth”, “PPH”, “PPH due to atony”, and “PPH due to retained placenta”. Antepartum hemorrhage did not have sufficient polygenic signal to be investigated (LDSC χ2 < 1.02). Early bleeding during pregnancy did not exhibit any significant genetic correlation with PPH or any of its subtypes (Figure 3A). Notably, there was strong genetic correlation between PPH due to uterine atony and PPH due to retained placenta (rg = 0.77, 95% CI 0.49–1.05).

Figure 3.


(A) Cross-trait genetic correlation of all bleeding in pregnancy phenotypes (95% confidence interval). Postpartum hemorrhage and early bleeding in pregnancy show no noteworthy genetic correlation. Postpartum hemorrhage due to atony or retained placenta are genetically indistinguishable. (B) Genetic correlations between postpartum bleeding and selected disorders. (C) Genetic correlations between early bleeding and selected traits. Correlations that are significant after accounting for the number of traits tested are highlighted in yellow. Error bars represent the 95% confidence interval. The data sets used for the analysis are described in Supplementary Table 5.

Phenotypes correlated with pregnancy-associated bleeding

Next, we aimed to characterize the genetic overlap of early bleeding (any outcome) and PPH with other co-occurring diseases and other phenotypes. The range of phenotypes that may co-occur with early bleeding during pregnancy and PPH has not been extensively characterized. Consequently, we looked for associations in three distinct cohorts: the Estonian Biobank (n=17,094), UK Biobank (n=12,490), and a Danish nationwide cohort (n=2,320,776). Following a meta-analysis of 417 and 628 ICD-10 codes at the third level for early bleeding and PPH, respectively, we found that 120 codes were significantly associated with PPH (FDR < 0.05) and 625 codes with early bleeding (Supplementary File 1).

Based on the literature, known risk factors, lifestyle, socioeconomic factors, and the pairwise phenotype-to-phenotype correlation analyses presented here, we identified a list of phenotypes for which we could find suitable summary statistics (Supplementary Table 11). We additionally included socioeconomic and cardiometabolic traits, such as BMI, smoking, and blood pressure. These traits are not recorded in the registries, but are highly correlated with the diseases we found in the phenotype-to-phenotype correlation analysis. At the genetic level, PPH was strongly positively correlated with birth weight (maternal and fetal) and gestational duration (maternal), and inversely correlated with uterine fibroids (Bonferroni-corrected p < 0.05) (Figure 3B; see Supplementary Table 11 for a description of the summary statistics). No other traits displayed a significant genetic correlation with PPH after multiple testing correction. Although no sequence variants were found in association with early bleeding, we nonetheless found genetic correlations with reproductive, socioeconomic, cardiovascular, and psychiatric traits (Figure 3C).

Polygenic risk scores

Utilizing 25,118 pregnancies (n=19,026 women) since 2012 from the Danish cohort, we found that a logistic regression model including the polygenic risk score (PRS) for PPH and birth weight yielded an improved model (p < 2 · 10−16, likelihood ratio test, Supplementary Table 12), compared to a model that included only age, pre-pregnancy BMI, parity, prior number of cesarean sections, and prior number of PPHs. The variance explained (Nagelkerke R2) increased from 3.2% (2.7%; 3.8%) to 3.8% (3.4%; 4.5%), yielding a net improvement of 0.7% (0.5%; 0.9%). Similarly, the AUC increased from 0.60 (0.59; 0.61) to 0.61 (0.60; 0.62), improving marginally (0.008, 0.005; 0.011).
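The model comparison reported here rests on two standard quantities: a likelihood ratio test between nested logistic models and Nagelkerke's R2. The sketch below illustrates both from log-likelihoods alone. It assumes that the extended model adds exactly two terms (so the test has 2 degrees of freedom, for which the chi-square tail probability has a closed form), which is one reading of the text and not confirmed by it; the function names and the log-likelihood values are hypothetical.

```python
import math


def likelihood_ratio_test_2df(loglik_base: float, loglik_full: float):
    """Likelihood ratio test for nested models differing by two parameters.

    For 2 degrees of freedom the chi-square survival function is exp(-x / 2),
    so no statistics library is needed. Returns (statistic, p_value).
    """
    statistic = 2.0 * (loglik_full - loglik_base)
    p_value = math.exp(-statistic / 2.0)
    return statistic, p_value


def nagelkerke_r2(loglik_null: float, loglik_model: float, n: int) -> float:
    """Nagelkerke R2: Cox-Snell R2 rescaled by its maximum attainable value."""
    cox_snell = 1.0 - math.exp(2.0 * (loglik_null - loglik_model) / n)
    max_cox_snell = 1.0 - math.exp(2.0 * loglik_null / n)
    return cox_snell / max_cox_snell


# Hypothetical log-likelihoods for an intercept-only model, a clinical model,
# and a clinical model extended with two polygenic terms.
n_pregnancies = 25118          # from the text
ll_null, ll_clinical, ll_extended = -10600.0, -10400.0, -10350.0

stat, p = likelihood_ratio_test_2df(ll_clinical, ll_extended)
gain = (nagelkerke_r2(ll_null, ll_extended, n_pregnancies)
        - nagelkerke_r2(ll_null, ll_clinical, n_pregnancies))
print(stat, p, gain)
```

A highly significant likelihood ratio test can coexist with a small gain in R2 when the sample is large, which is the pattern described above.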

Discussion

Summary

In this study, we investigated the genetic architecture of bleeding associated with pregnancy, which is one of the most common complications of pregnancy associated with both maternal and fetal morbidity and mortality. We identified five loci associated with PPH, with strong functional evidence of association with genes involved in implantation and contraction. Furthermore, enrichment of progesterone receptor binding sites substantiates the importance of hormone regulation in the etiology of PPH and suggests organ-specific dysregulation. However, in the absence of relevant tissue (myometrium sampled during or right before pregnancy), we were not able to locate the point or points in pregnancy at which the sequence variants exert their effect. There was no evidence of a genetic correlation between PPH and diseases. Our study revealed that early bleeding is highly polygenic with genetic correlations spanning various different categories of human traits, and PPH is a disorder of hormone-responsive genes. Overall, this study provides new insights into the genetic basis of bleeding during pregnancy, and suggests different genetic pathways for early bleeding and PPH.

Strengths and limitations

In this study, data from six Northern European cohorts were analyzed, representing six different countries with similar, albeit varying, universal healthcare systems, protocols for pregnancy care, and levels of available clinical information. However, it is important to note that PPH disproportionately affects women in developing countries, and further research is needed to integrate more diverse populations into studies of this kind. Additionally, the registration of early bleeding during pregnancy depends heavily on the healthcare-seeking behavior of the individual and the organization of early pregnancy care, and is most likely affected by the heterogeneous causes of early bleeding. Not all cohorts had information on early bleeding during pregnancy, and only three cohorts could distinguish between events leading to live births and those that did not. Another factor that should be considered is that oxytocin, a drug used to prevent or treat PPH, is administered preemptively based on other factors, such as cesarean section and PPH in a previous pregnancy. This bias most likely results in a smaller observed effect, thereby requiring a larger sample size for detection of associated loci.

Comparison with other literature

In this study, the potential causal genes at the five loci that may contribute to the development of PPH were not related to previously suggested causes, such as the oxytocin receptor or coagulation cascade5,22. The latter was expected, as women with known coagulation disorders were excluded. The identified loci were found to be significantly enriched for progesterone receptor binding sites in human embryonic stem cells and showed nominal significance in the myometrium, the smooth muscle layer of the uterus responsible for contractions during labor and delivery. Progesterone is known to relax the myometrium and reduce contractility23, which is vital for maintaining a healthy pregnancy. The presence of progesterone receptor binding sites suggests that the genes located in these regions may be involved in regulating myometrial contractility, and that abnormal contractions can lead to PPH. Furthermore, these loci were also associated with endometriosis and/or uterine fibroids. Endometriosis and uterine fibroids are both treated with selective progesterone receptor modulators (SPRMs), which target the progesterone receptor24. Observational studies suggest that early bleeding, antepartum hemorrhage, and postpartum hemorrhage are correlated2,25. However, we did not observe any evidence of a shared genetic etiology.

We established early bleeding as a complex trait, substantiated by significant heritability, polygenic signals, and widespread pleiotropy across disease areas. Early bleeding is related to pregnancy loss and may be an indication of the maternal body not coping well with the pregnancy. Genetic correlation with post-traumatic stress disorder and a variety of seemingly unrelated diseases and traits may be an indication of an extreme response to stress and a general low tolerance of the added burden of pregnancy upon maternal systems with underlying weaknesses. Possibly due to the high heterogeneity in the phenotype, we did not identify any variants associating with early bleeding; therefore, we could not test for causality using e.g., Mendelian randomization. Nonetheless, a previous study indicated a causal relationship between early bleeding and cardiometabolic diseases1.

The use of polygenic risk scores resulted in marginal improvements in the predictive capability for PPH. Nonetheless, as genetic studies become better powered, we can expect an improvement in their predictive capability. Consequently, the addition of polygenic risk scores to prognostic models should be considered in future studies to enable early stratification of women at a high risk of PPH.

Conclusion

Our findings reveal complex genetics of early bleeding in pregnancy. They further provide valuable insights into the potential underlying mechanisms of PPH and may inform the development of more effective prevention strategies.

Methods

Study Cohorts

This was a multi-national study that included six cohorts of Western European ancestry: the Copenhagen Hospital Biobank study on Reproduction (Denmark), Estonian Biobank (Estonia), FinnGen (Finland), deCODE genetics (Iceland), UK Biobank (England), and Norwegian Mother, Father and Child Cohort Study (Norway). All studies were approved by the relevant institutional ethics review boards (Supplementary Text).

Copenhagen Hospital Biobank study on Reproduction and the Danish Blood Donor Study

The Copenhagen Hospital Biobank (CHB) is based on EDTA blood samples collected from patients for blood typing and red cell antibody screening at hospitals in the Greater Copenhagen Area26. The CHB study on Reproduction (CHB-Repro) cohort focuses on patients with fertility and obstetric complications, identified through the Danish National Patient Registry. We also included blood donors from the Danish Blood Donor Study Genomic Cohort (DBDS-GC). DBDS-GC is described by Hansen et al27. All samples were genotyped at deCODE genetics using the Illumina Infinium Global Screening array. Samples were imputed using an in-house pan-Scandinavian reference panel28. Association analysis was performed using software developed at deCODE genetics29.

Estonian Biobank

The EstBB is a population-based biobank with over 200,000 participants (corresponding to 20% of the total Estonian population). Details of EstBB genotyping procedure have been described previously30,31. Briefly, all EstBB participants were genotyped using Illumina arrays at the Core Genotyping Lab of the Institute of Genomics, University of Tartu. Samples were imputed using a population specific imputation reference of 2,297 whole genome sequencing samples32. Association analysis was performed using SAIGE 0.43.1.

FinnGen

FinnGen is a public–private partnership research project that combines imputed genotype data generated from newly collected and legacy samples from Finnish biobanks and digital health record data from Finnish health registries (https://www.finngen.fi/en) with the aim to provide new insights into disease genetics33. FinnGen includes 9 Finnish biobanks, research institutes, universities and university hospitals, 13 international pharmaceutical industry partners and the Finnish Biobank Cooperative (FINBB) in a pre-competitive partnership. As of November 2022 (release 10 described in this article), samples from 412,181 individuals have been analysed with the final aim to have a cohort of 500,000 participants. The project utilizes data from the nationwide longitudinal health register collected since 1969 from every resident in Finland.

deCODE genetics

The deCODE cohort is a nation-wide sample collection recruited in Iceland since 1997. All participants who donated blood signed an informed consent. Variants were identified through whole genome sequencing of 63,460 individuals. They were imputed into 173,025 chip-genotyped Icelanders using long-range phasing, and into their untyped close relatives based on genealogy29,34. We used logistic regression to test for association of sequence variants assuming an additive genetic model, using software developed at deCODE genetics29.

Norwegian Mother, Father and Child Cohort Study

The Norwegian Mother, Father and Child Cohort Study (MoBa) is a population-based pregnancy cohort study conducted by the Norwegian Institute of Public Health. Participants were recruited from all over Norway from 1999 to 200835. The women consented to participation in 41% of the pregnancies. The cohort includes approximately 114,500 children, 95,200 mothers and 75,200 fathers. The current study is based on version 12 of the quality-assured data files released for research. Details about PPH were obtained from the Medical Birth Registry, a national health registry containing information about all births in Norway. Sample QC and imputation have previously been described36. In brief, individuals were genotyped using different Illumina arrays (HumanCoreExome-12 v1.1, HumanCoreExome-24 v1.0, Global Screening Array v1.0, InfiniumOmniExpress-24 v.2, HumanOmniExpress-24 v1.0). Individual-level QC was performed to remove ancestry outliers and individuals with sex discrepancy or a call rate < 0.98. Furthermore, SNPs with a MAF < 1%, deviating from Hardy-Weinberg equilibrium (p < 1e-4), or with a call rate < 0.98 were removed. Imputation was done using SHAPEITv2 + PBWT on the Sanger imputation server, with HRC v1.1 as the imputation reference panel. Association analysis was done using regenie37.
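The SNP-level exclusion criteria listed for MoBa can be expressed as a simple filter. The sketch below is illustrative only and is not the pipeline used in the study; the `SnpStats` record, the function name, and the example variants are hypothetical, while the three cut-offs are those stated in the text.

```python
from dataclasses import dataclass


@dataclass
class SnpStats:
    """Per-variant summary statistics (hypothetical record layout)."""
    rsid: str
    maf: float        # minor allele frequency
    hwe_p: float      # Hardy-Weinberg equilibrium test p-value
    call_rate: float  # fraction of samples with a genotype call


def passes_snp_qc(snp: SnpStats,
                  min_maf: float = 0.01,
                  min_hwe_p: float = 1e-4,
                  min_call_rate: float = 0.98) -> bool:
    """Keep a SNP unless MAF < 1%, HWE p < 1e-4, or call rate < 0.98."""
    return (snp.maf >= min_maf
            and snp.hwe_p >= min_hwe_p
            and snp.call_rate >= min_call_rate)


# Made-up example variants, one failing each criterion and one passing all three.
snps = [
    SnpStats("example_pass", maf=0.20, hwe_p=0.50, call_rate=0.995),
    SnpStats("example_rare", maf=0.005, hwe_p=0.50, call_rate=0.995),
    SnpStats("example_hwe", maf=0.20, hwe_p=1e-6, call_rate=0.995),
    SnpStats("example_missing", maf=0.20, hwe_p=0.50, call_rate=0.95),
]
kept = [s.rsid for s in snps if passes_snp_qc(s)]
print(kept)
```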

UK Biobank

The UK Biobank is a prospective cohort of ~500,000 individuals from across the United Kingdom, recruited at ages 40–69. Genotyping was done in two batches, using the Affymetrix chip UK BiLEVE Axiom87 and Affymetrix UK Biobank Axiom array. Imputation was done using a sample of 150,000 whole genome sequenced individuals from the UK Biobank38. Only individuals with a registered live or stillbirth (identified through the HESIN delivery table) and of European descent were included in the analysis. Association analysis was performed using software developed at deCODE genetics29. The UKB resource was used under application no. 56270. All phenotype and genotype data were collected following informed consent obtained from all participants.

Phenotype definitions

We divided bleeding in pregnancy into three categories and the following sub-phenotypes:

  1. Bleeding in early pregnancy (<20+0 gestational weeks)

    1. Bleeding in early pregnancy leading to live birth

    2. Bleeding in early pregnancy ending in any outcome (live birth, pregnancy loss, termination of pregnancy, ectopic pregnancy, molar pregnancy, pregnancy of unknown location)

  2. Antepartum hemorrhage (>20th gestational week, prior to birth)

  3. Postpartum hemorrhage (PPH, hemorrhage following birth)

    1. PPH due to atony

    2. PPH due to retained placenta

We categorized each phenotype using hospital admission codes, although not all codes were available in all countries. A phenotype definition list is provided in Supplementary Table 13. We adjusted analyses for age, parity, gestational duration, and weight of the child, if possible. Women with known coagulation disorders were excluded (ICD-10 codes D66-D69, O46.0, O67.0). Furthermore, we excluded multiple pregnancies for antepartum hemorrhage and PPH, if possible. Lastly, we excluded pregnancies delivered by cesarean section in the PPH analysis, if possible.

Meta-analysis

For the meta-analyses, we combined GWASs from the respective cohorts using a fixed-effects inverse variance method based on effect estimates and standard errors in which each dataset was assumed to have a common odds-ratio but allowed to have different population frequencies for alleles and genotypes. Sequence variants were mapped to NCBI Build38 and matched on position and alleles to harmonize the datasets. After excluding variants with discrepant allele frequency between cohorts, variants with MAF < 0.001% in all cohorts or variants only present in one dataset, 18,009,056 variants were included in the meta-analysis. The threshold for genome-wide significance was corrected for multiple testing with a weighted Bonferroni adjustment that controls for the family-wise error rate, using as weights the enrichment of variant classes with predicted functional impact among association signals39. The significance threshold then becomes 4.56 × 10−7 for high-impact variants (including stop-gained, frameshift, splice acceptor or donor), 9.12 × 10−8 for moderate-impact variants (including missense, splice-region variants and in-frame indels), 8.28 × 10−9 for low-impact variants (synonymous, 5’ and 3’ UTR, upstream and downstream variants), 4.19 × 10−9 for other DNase I hypersensitivity sites (DHS) variants and 1.38 × 10−9 for other non-DHS variants. In a random-effects method, a likelihood ratio test was performed in all genome-wide associations to test the heterogeneity of the effect estimate in the four datasets; the null hypothesis is that the effects are the same in all datasets and the alternative hypothesis is that the effects differ between datasets.
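For readers unfamiliar with the fixed-effects inverse variance method, the sketch below shows how per-cohort effect estimates and standard errors are combined for a single variant. It is a minimal illustration, not the software used in the study; the function name and the per-cohort values are hypothetical, and effects are assumed to be log odds ratios aligned to the same effect allele.

```python
import math


def fixed_effects_meta(betas, standard_errors):
    """Inverse-variance-weighted fixed-effects meta-analysis for one variant.

    Each cohort contributes its effect estimate weighted by 1 / SE^2.
    Returns (pooled_beta, pooled_se, z_score, two_sided_p).
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z_score = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    two_sided_p = math.erfc(abs(z_score) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z_score, two_sided_p


# Hypothetical log odds ratios and standard errors from six cohorts.
cohort_betas = [0.10, 0.08, 0.12, 0.09, 0.11, 0.10]
cohort_ses = [0.02, 0.03, 0.04, 0.025, 0.05, 0.03]

beta, se, z, p = fixed_effects_meta(cohort_betas, cohort_ses)
print(f"OR = {math.exp(beta):.3f}, P = {p:.2e}")
```

The resulting p-value would then be compared with the class-specific threshold for the variant's annotation, for example 1.38 × 10−9 for a non-DHS variant.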

Conditional analysis

Conditional association analyses were performed on the GWASs from Iceland, the UK, and Denmark using true imputed genotypes of participants. Approximate conditional analyses (COJO), implemented in the GCTA-software, were applied on the lead variants in the Finnish, Estonian and MoBa summary statistics40,41. Linkage disequilibrium between variants was estimated using a set of 5,000 WGS Icelanders. The analyses were restricted to variants within 1 Mb from the index variants. The p-values were combined for all six datasets to identify any secondary signals. Based on the number of variants tested we required secondary signals to pass a threshold of p < 5 × 10−8 after correcting for the lead variant.

Comparison of effect sizes for retained placenta and uterine atony

We compared effect sizes for retained placenta and uterine atony by doing a case-case analysis of the summary statistics using ReAct42. Only genome-wide significant SNPs found in the main analysis of PPH, according to the functionally informed multiple testing correction, were included. We assumed no overlap between cases, and a full overlap between controls.

Lookup of variants

Variants and variants in strong LD were looked up in the GWAS catalog to identify prior associations to other phenotypes, using the LDlinkR package10,43. Furthermore, we investigated the association of the variants to endometriosis and uterine fibroids in the FinnGen cohort (r10). The analysis was part of the FinnGen core analysis, done using regenie, in which the analysis was adjusted for age, the first ten principal components, genotyping chip, and batch37. We adjusted p-values for the number of phenotypes (two) and variants (234) tested (p<0.05/(2 · 234) = 0.0001).

Mapping of GWA signals to non-coding annotations

We downloaded annotations of candidate cis-regulatory elements (cCREs; version 3) from the ENCODE project (website: screen.encodeproject.org)11. We then determined whether the lead PPH sequence variants or any of their correlated variants (r² > 0.80), together referred to as PPH signals, were located within cell-type-agnostic cCREs or within cCREs defined in tissue samples relevant to PPH, i.e. uterus tissue. In the same way, we annotated the PPH signals with respect to enhancer elements (Active/Genic) as defined for 833 samples (representing 33 groups of tissues/organs) in EpiMap (website: compbio.mit.edu/epimap)12. EpiMap further provides predicted links between enhancers and genes and, based on these pre-computed predictions, we looked for candidate gene targets for each signal in uterus tissue (website: personal.broadinstitute.org/cboix/epimap/links/links_corr_only). We also annotated the PPH signals with respect to DNA binding sites for 1,210 transcription factors (TFs), mapped experimentally by various researchers, notably the ENCODE project, using ChIP-seq in different tissues, cell types and conditions. These data were made available by ReMap 2022 (website: remap2022.univ-amu.fr) and amount to a total of 4,143 ChIP-seq experiments.
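The overlap test described above can be sketched as a simple interval lookup. This is an illustrative sketch only: the function names are hypothetical, elements are assumed to be given as (chromosome, start, end) tuples with closed 1-based coordinates, and a signal is treated as annotated if the lead variant or any correlated variant falls inside an element.

```python
def variant_in_elements(elements, chrom, pos):
    """True if the position lies inside any element on the same chromosome.

    elements: iterable of (chrom, start, end); coordinates assumed closed, 1-based.
    """
    return any(c == chrom and start <= pos <= end for c, start, end in elements)

def signal_is_annotated(elements, signal_variants):
    """A signal is the lead variant plus its correlated variants (r2 > 0.80).

    signal_variants: iterable of (chrom, pos) for every variant in the signal.
    """
    return any(variant_in_elements(elements, c, p) for c, p in signal_variants)
```

A linear scan is used for clarity; for genome-wide annotation maps an interval tree or sorted index would be the more practical choice.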

Enrichment of association signals in functional annotations

We used GWA signals from the GWAS catalog (see details in the subsection "GWAS catalog") to obtain the null distribution in our enrichment analyses for functional annotations of the genome. The number of sequence variants found in high linkage disequilibrium (LD; r² > 0.80) with each of the five PPH association signals was expected to influence the probability of finding an overlap with a given functional annotation map. We therefore randomly selected five GWA signals from the GWAS catalog, one for each of the five PPH signals, ensuring that the randomly selected signals were matched to the PPH signals with respect to the number of sequence variants found in high LD. We then counted the number of randomly selected signals that intersected with a given annotation (this count is denoted as z). This procedure was repeated N = 200,000 times. In summary, we simulated the five PPH signals in terms of a) the number of sequence variants in high LD with each PPH signal and b) the property of being a GWA signal associated with a human multifactorial trait.

Let $z_i$ denote the number of annotated signals in the $i$-th sample. The probability $p$ of finding an intersection with a given annotation among randomly sampled GWA signals is therefore $p = \frac{\sum_{i=1}^{N} z_i}{5N}$, where $5N$ is the total number of randomly sampled GWA signals from the GWAS catalog (five randomly selected GWA signals in each of $N$ samples); this is the expected proportion of annotated GWA signals. We then define $X \sim \mathrm{Bin}(n, p)$, where $X$ is the number of annotated PPH signals and $n$ is the number of PPH signals ($n = 5$). The five PPH signals are found on different chromosomes, and we therefore assume that they are independent. We then determine the probability of observing $x$ or more PPH signals in a given annotation, where $x$ is the observed number of PPH signals that intersect with the given annotation. We are therefore interested in $P(X \ge x) = j/N$, where $j$ is the number of times we found $x$ or more annotated GWA signals in the aforementioned $N$ random samples of GWA signals. We then used Bonferroni correction to set the threshold for significance.
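The sampling scheme above can be sketched as follows. This is a simplified illustration rather than the code used in the study: the function name and data layout are hypothetical, and matching is assumed to be exact on the number of variants in high LD, which the text does not specify.

```python
import random

def empirical_enrichment(catalog, ld_counts, annotated, x_obs, n_draws=2000, seed=0):
    """Estimate p (expected annotated proportion) and P(X >= x) = j / N.

    catalog:   dict mapping an LD-variant count to the list of catalog signal
               ids with that count (assumed exact matching; hypothetical layout).
    ld_counts: LD-variant count of each PPH signal (five entries in the study).
    annotated: set of catalog signal ids that intersect the annotation.
    x_obs:     observed number of PPH signals intersecting the annotation.
    """
    rng = random.Random(seed)
    z_total = 0
    j = 0
    for _ in range(n_draws):
        # Draw one matched catalog signal per PPH signal and count overlaps (z_i).
        z = sum(rng.choice(catalog[c]) in annotated for c in ld_counts)
        z_total += z
        if z >= x_obs:
            j += 1
    p_hat = z_total / (len(ld_counts) * n_draws)
    return p_hat, j / n_draws
```

The default of 2,000 draws only keeps the sketch fast; the analysis described above used N = 200,000.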

GWAS catalog:

We compiled a robust set of association signals from the NHGRI-EBI catalog of GWAS association signals, downloaded on 4-AUG-2021 (GWAS catalog v1.00; website: www.ebi.ac.uk/gwas)10. GWAS catalog lead variants were matched to in-house variant calls on the basis of rs-identifiers, genome position and MAF (GWAS catalog entries with missing information in any of these fields were omitted). In the GWAS catalog, the same trait has often been studied by many different research groups, and many associations are therefore "repeated" and not independent. We used the following procedure to compile a set of independent associations for each trait in the GWAS catalog. First, we extracted all associations with the trait with P-value < 1e-9. Second, we selected the most significant association and added it to the list of independent associations. Third, we added the most significant remaining association with P-value < 1e-9 located more than 1 Mb away from all other independent associations. We then repeated this third step until no more associations were found with P-value < 1e-9 that were also located > 1 Mb away from those already added to the list of independent associations. We omitted traits classified as "blood protein measurement" (mostly representing GWASs for serum protein assays) and sixteen other traits (e.g. heel bone mineral density) with an unusually large number of associations. Further, as our enrichment method takes LD into account (computed in whole-genome sequenced individuals from the Icelandic population), we selected GWASs carried out in individuals of European descent. This resulted in 27,546 GWA association signals for 1,173 diseases or other human traits.
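The three-step selection of independent associations amounts to a greedy pass over the associations sorted by significance. The sketch below illustrates this for a single trait; the function name and tuple layout are hypothetical, and associations on different chromosomes are assumed to count as more than 1 Mb apart.

```python
def independent_associations(assocs, p_max=1e-9, min_dist=1_000_000):
    """Greedy selection of independent associations for one trait.

    assocs: list of (chrom, pos, p_value) tuples.
    Returns the retained associations, most significant first.
    """
    # Step 1: keep associations below the p-value threshold, sorted by significance.
    candidates = sorted((a for a in assocs if a[2] < p_max), key=lambda a: a[2])
    kept = []
    # Steps 2 and 3: repeatedly add the most significant association that lies
    # more than min_dist away from every association already kept.
    for chrom, pos, p in candidates:
        if all(c != chrom or abs(pos - q) > min_dist for c, q, _ in kept):
            kept.append((chrom, pos, p))
    return kept
```

A single pass in sorted order gives the same result as repeating the third step, because an association that lies within 1 Mb of an already retained signal can never become eligible later.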

Functional enrichment and tissue specificity

We used MAGMA to investigate tissue expression specificity20. Preprocessed consensus bulk and single-cell RNA-Seq data were downloaded from the Human Protein Atlas (HPA)21. In short, the HPA consensus tissue gene data summarize expression at the gene level across 62 tissues and include data from the Human Protein Atlas, GTEx, and FANTOM5. The RNA single-cell consensus dataset covers 51 cell types across 13 tissues, from 14 different studies. We used the 1,000 Genomes Phase 3 European data as reference (downloaded from https://ctg.cncr.nl/software/magma).

Comorbidity analysis

Comorbidities associated with early bleeding in pregnancy and PPH were identified across three cohorts (Denmark, Estonian Biobank, and the UK Biobank). The Danish cohort utilized nationwide data from the Danish National Patient Register (DNPR) and the Danish Medical Birth Register (DMBR)44,45. The DNPR contains hospital admissions since 1977, and the DMBR contains births since 1973. We identified all women born after 1957, which ensured a full reproductive history from their 20th birthday onwards. We analyzed associations between early bleeding in pregnancy, PPH and all other diagnoses (excluding chapters regarding infections, obstetric diagnoses, injuries, and contacts with the healthcare system). Similarly, a PheWAS was performed in the Estonian Biobank and the UK Biobank. In the UK Biobank, we included only women present in the HESIN delivery tables. Odds ratios were determined using logistic regression, adjusting for year of birth. Data from the three cohorts were meta-analyzed using inverse-variance weighting as implemented in the R package metafor. We controlled for multiple testing by calculating q-values and selecting associations with a q-value < 0.05.
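To illustrate the multiple-testing step, the sketch below computes Benjamini-Hochberg adjusted p-values. This is a stand-in chosen for illustration: the text does not state which q-value estimator was used, and Benjamini-Hochberg adjusted p-values coincide with q-values only when the proportion of true null hypotheses is fixed at 1. The function name is hypothetical.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Associations would then be retained when their adjusted value is below 0.05, mirroring the selection rule described above.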

Heritability and genetic correlations

SNP heritability was estimated using RHE-mc, an efficient and scalable estimator that uses individual-level data46. We selected genotyped SNPs in the CHB with MAF > 1% and missing in less than 1% of samples, excluded SNPs deviating from Hardy-Weinberg equilibrium (HWE; p < 10⁻⁷), and excluded the MHC region, as per the authors’ recommendations. We adjusted the analysis for year of birth, year of birth squared, and the first 10 principal components.

Genetic correlations were estimated using LD Score Regression47. We selected phenotypes based on prior knowledge about risk factors, associations from the comorbidity analysis, and availability. In this analysis, we used results for about 1.2 million well-imputed variants, and for LD information we used precomputed LD scores for European populations (downloaded from: https://data.broadinstitute.org/alkesgroup/LDSCORE/eur_w_ld_chr.tar.bz2). Genetic correlation of pregnancy bleeding subtypes was calculated between the Danish primary trait and the meta-analysis of the relevant secondary trait, excluding Danes, and vice versa. The results of the two analyses were then meta-analyzed. Genetic correlation of Early bleeding - birth was calculated using only the Danish data for the primary trait, as the sample size for the remaining populations was too small.

Polygenic Risk Scores

Polygenic Risk Scores (PRS) were created using LDpred248. Autosomal genotype data from 138,669 individuals in the Copenhagen Hospital Biobank study on Reproduction were filtered to include only variants present in LDpred2’s recommended set of 1,054,330 reference variants. Missing genotype information was imputed as the reference allele of the affected locus. GWAS summary statistics for birth weight from Warrington et al. were pre-processed with MungeSumstats49,50. The birth weight summary statistics contain a very small fraction of Danish samples from other cohorts. We excluded any Danes from the summary statistics used for the PPH PRS to avoid inflation.

The effects of polygenic risk scores were estimated using a logistic regression model, adjusted for maternal age at conception, parity, pre-pregnancy BMI, previous number of cesarean sections, and previous number of PPH events. We compared models with and without polygenic risk scores using a likelihood ratio test. Furthermore, we compared the C-index and Nagelkerke’s R². We used a bootstrap resampling approach to obtain optimism-corrected values, which provide a conservative estimate of performance on unseen data and serve as an internal validation51. We repeated the bootstrap resampling 100 times, and we report the 95% percentile bootstrap confidence intervals. Standard errors were corrected for the inherent clustering due to multiple pregnancies from the same woman using the Huber-White method.
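The optimism correction described above follows the general bootstrap scheme in which the apparent performance is reduced by the average difference between a bootstrap model's performance on its own resample and on the original data. The sketch below shows that scheme in generic form. It is not the code used in this study: the function names are hypothetical, and `fit` and `predict` are placeholders for any model-fitting and scoring functions, such as a logistic regression.

```python
import random

def c_index(scores, labels):
    """Concordance for a binary outcome; tied scores count as 0.5."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    concordant = sum((a > b) + 0.5 * (a == b) for a in cases for b in controls)
    return concordant / (len(cases) * len(controls))

def optimism_corrected(X, y, fit, predict, metric, n_boot=100, seed=0):
    """Apparent performance minus the mean bootstrap optimism.

    fit(X, y) returns a model; predict(model, X) returns one score per row.
    """
    rng = random.Random(seed)
    n = len(y)
    apparent = metric(predict(fit(X, y), X), y)
    optimism, used = 0.0, 0
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        Xb = [X[i] for i in idx]
        yb = [y[i] for i in idx]
        if len(set(yb)) < 2:
            continue  # skip resamples that lack cases or controls
        model = fit(Xb, yb)
        # Optimism: performance on the resample minus performance on the original data.
        optimism += metric(predict(model, Xb), yb) - metric(predict(model, X), y)
        used += 1
    return apparent - optimism / used
```

The same routine can be called with a different `metric`, for example a function returning Nagelkerke’s R², to correct that statistic in the same way.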

Haplotype analysis

We explored whether the effects of the identified variants on PPH depend on maternal, fetal or maternal and fetal origins by performing an association analysis using the parental transmitted and non-transmitted alleles. We used phased genotype data from the MoBa cohort (n = 22,330 parent-offspring trios) and deCODE study to infer the parent of origin of fetal alleles. The analysis of the deCODE data was done on 106,622 parent-offspring trios (2,558 cases and 104,064 controls) with at least one genotyped individual. This included 19,488 fully genotyped trios, 5,991 with only child and mother and 1,835 with only child and father genotyped, 39,390 with both parents genotyped but not the child, and 1,661, 26,582 and 11,675 with only child, mother or father genotyped, respectively.

For each lead variant, the following logistic regression model was fit:

PPH=MnT+MT+PnT+PT+covariates

where MnT and MT refer to the maternal non-transmitted and transmitted alleles, respectively, and PnT and PT refer to the paternal non-transmitted and transmitted alleles, respectively. The PT effect is interpreted as a fetal-only genetic effect, whereas the effect of the maternal non-transmitted allele is interpreted as a maternal-only genetic effect. In the deCODE study, we used maximum likelihood estimation to estimate the effects, as previously described52. Estimates from the two cohorts were combined using fixed-effect meta-analysis.
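To make the four predictors concrete, the sketch below derives MT, MnT, PT and PnT for one fully genotyped trio at a single variant. It is a simplified illustration under stated assumptions: the function name is hypothetical, alleles are coded as 0/1 counts of the effect allele per haplotype, and the fetal genotype is assumed to be already phased by parent of origin. It does not cover the incomplete trios handled by the maximum likelihood approach mentioned above.

```python
def split_alleles(mother, father, child):
    """Derive transmitted and non-transmitted allele indicators for one trio.

    mother, father: (hap_a, hap_b), effect-allele indicators (0 or 1) of the
                    two haplotypes of each parent.
    child:          (maternal_allele, paternal_allele), phased by parent of origin.
    """
    mt, pt = child
    # The non-transmitted allele is the parental dosage minus the transmitted allele.
    mnt = sum(mother) - mt
    pnt = sum(father) - pt
    if mnt not in (0, 1) or pnt not in (0, 1):
        raise ValueError("genotypes inconsistent with Mendelian transmission")
    return {"MnT": mnt, "MT": mt, "PnT": pnt, "PT": pt}
```

The resulting four indicators, together with the covariates, would then enter the logistic regression as separate predictors.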

Supplementary Material

Supplement 1
media-1.docx (78KB, docx)
Supplement 2
media-2.xlsx (59.3KB, xlsx)
Supplement 3

Acknowledgements

The work was carried out as a part of the BRIDGE – Translational Excellence Programme (bridge.ku.dk) at the Faculty of Health and Medical Sciences, University of Copenhagen, funded by the Novo Nordisk Foundation (grant agreements NNF18SA0034956, NNF14CC0001, and NNF17OC0027594). Furthermore, we would like to acknowledge funding from the Ole Kirk Foundation and Rigshospitalet’s Research Fund.

B.J. received funding from The Swedish Research Council, Stockholm, Sweden (2019–01004), The Research Council of Norway, Oslo, Norway (FRIMEDBIO #547711), March of Dimes (#21-FY16–121), Agreement concerning research and education of doctors (ALFGBG-965353). Research by B.J. was also supported by the Eunice Kennedy Shriver National Institute Of Child Health & Human Development of the National Institutes of Health under Award Number R01HD101669. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. We thank the Norwegian Institute of Public Health (NIPH) for generating high-quality genomic data. This research is part of the HARVEST collaboration, supported by the Research Council of Norway (#229624). We also thank deCODE genetics and the NORMENT Centre for providing genotype data, funded by the Research Council of Norway (#223273), South East Norway Health Authorities and Stiftelsen Kristian Gerhard Jebsen. We further thank the Center for Diabetes Research, the University of Bergen for providing genotype data and performing quality control and imputation of the data funded by the ERC AdG project SELECTionPREDISPOSED, Stiftelsen Kristian Gerhard Jebsen, Trond Mohn Foundation, the Research Council of Norway, the Novo Nordisk Foundation, the University of Bergen, and the Western Norway Health Authorities.

We want to acknowledge the participants and investigators of the FinnGen study. The FinnGen project is funded by two grants from Business Finland (HUS 4685/31/2016 and UH 4386/31/2016) and the following industry partners: AbbVie Inc., AstraZeneca UK Ltd, Biogen MA Inc., Bristol Myers Squibb (and Celgene Corporation & Celgene International II Sàrl), Genentech Inc., Merck Sharp & Dohme LLC, Pfizer Inc., GlaxoSmithKline Intellectual Property Development Ltd., Sanofi US Services Inc., Maze Therapeutics Inc., Janssen Biotech Inc, Novartis AG, and Boehringer Ingelheim International GmbH. The following biobanks are acknowledged for delivering biobank samples to FinnGen: Auria Biobank (www.auria.fi/biopankki), THL Biobank (www.thl.fi/biobank), Helsinki Biobank (www.helsinginbiopankki.fi), Biobank Borealis of Northern Finland (https://www.ppshp.fi/Tutkimus-ja-opetus/Biopankki/Pages/Biobank-Borealis-briefly-in-English.aspx), Finnish Clinical Biobank Tampere (www.tays.fi/en-US/Research_and_development/Finnish_Clinical_Biobank_Tampere), Biobank of Eastern Finland (www.ita-suomenbiopankki.fi/en), Central Finland Biobank (www.ksshp.fi/fi-FI/Potilaalle/Biopankki), Finnish Red Cross Blood Service Biobank (www.veripalvelu.fi/verenluovutus/biopankkitoiminta), Terveystalo Biobank (www.terveystalo.com/fi/Yritystietoa/Terveystalo-Biopankki/Biopankki/) and Arctic Biobank (https://www.oulu.fi/en/university/faculties-and-units/faculty-medicine/northern-finland-birth-cohorts-and-arctic-biobank). All Finnish Biobanks are members of the BBMRI.fi infrastructure (www.bbmri.fi). The Finnish Biobank Cooperative, FINBB (https://finbb.fi/), is the coordinator of BBMRI-ERIC operations in Finland. The Finnish biobank data can be accessed through the Fingenious® services (https://site.fingenious.fi/en/) managed by FINBB.

This Estonian Biobank study was funded by the European Union through the European Regional Development Fund Project No. 2014–2020.4.01.15–0012 GENTRANSMED. Data analysis was carried out in part in the High-Performance Computing Center of the University of Tartu.

We acknowledge the Estonian Biobank research team: Andres Metspalu, Lili Milani, Reedik Mägi, Mari Nelis, and Georgi Hudjashov, giving them credit for data collection, genotyping, QC, and imputation.

Footnotes

Competing interests

H.S.N. obtained speaker fees from Ferring Pharmaceuticals, Merck A/S, AstraZeneca and Cook Medical. S.B. has ownership in Hoba Therapeutics Aps, Novo Nordisk A/S, Lundbeck A/S, ALK Abello and managing board memberships in Proscion A/S and Intomics A/S. All authors affiliated with deCODE genetics are employees of deCODE genetics, a subsidiary of Amgen.

Data availability

Meta-analysis summary statistics will be made available upon publication.

References

  • 1. Dudukina E., Horváth-Puhó E., Sørensen H. T. & Ehrenstein V. Risk of diabetes and cardiovascular diseases in women with vaginal bleeding before 20 gestational weeks: Danish population-based cohort study. 2022.03.18.22272466 Preprint at 10.1101/2022.03.18.22272466 (2022).
  • 2. Lykke J. A., Dideriksen K. L., Lidegaard Ø. & Langhoff-Roos J. First-trimester vaginal bleeding and complications later in pregnancy. Obstet. Gynecol. 115, 935–944 (2010).
  • 3. Bienstock J. L., Eke A. C. & Hueppchen N. A. Postpartum Hemorrhage. N. Engl. J. Med. 384, 1635–1645 (2021).
  • 4. Oberg A. S. et al. Genetic contribution to postpartum haemorrhage in Swedish population: cohort study of 466 686 births. BMJ 349, g4984 (2014).
  • 5. Biguzzi E. et al. Genetic background and risk of postpartum haemorrhage: results from an Italian cohort of 3219 women. Haemoph. Off. J. World Fed. Hemoph. 20, e377–383 (2014).
  • 6. Biguzzi E. et al. Risk factors for postpartum hemorrhage in a cohort of 6011 Italian women. Thromb. Res. 129, e1–7 (2012).
  • 7. Committee on Practice Bulletins-Obstetrics. Practice Bulletin No. 183: Postpartum Hemorrhage. Obstet. Gynecol. 130, e168–e186 (2017).
  • 8. World Health Organization. WHO recommendations for the prevention and treatment of postpartum haemorrhage. (2012).
  • 9. Neary C., Naheed S., McLernon D. & Black M. Predicting risk of postpartum haemorrhage: a systematic review. BJOG Int. J. Obstet. Gynaecol. 128, 46–53 (2021).
  • 10. Sollis E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).
  • 11. Moore J. E. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710 (2020).
  • 12. Boix C. A., James B. T., Park Y. P., Meuleman W. & Kellis M. Regulatory genomic circuitry of human disease loci by integrative epigenomics. Nature 590, 300–307 (2021).
  • 13. Marinić M., Mika K., Chigurupati S. & Lynch V. J. Evolutionary transcriptomics implicates HAND2 in the origins of implantation and regulation of gestation length. eLife 10, e61257 (2021).
  • 14. Sakabe N. J. et al. Transcriptome and regulatory maps of decidua-derived stromal cells inform gene discovery in preterm birth. Sci. Adv. 6, eabc8696 (2020).
  • 15. Li Q. et al. The antiproliferative action of progesterone in uterine epithelium is mediated by Hand2. Science 331, 912–916 (2011).
  • 16. Solé-Navais P. et al. Genetic effects on the timing of parturition and links to fetal birth weight. Nat. Genet. 55, 559–567 (2023).
  • 17. Zhang G. et al. Genetic Associations with Gestational Duration and Spontaneous Preterm Birth. N. Engl. J. Med. 377, 1156–1167 (2017).
  • 18. Szklarczyk D. et al. The STRING database in 2023: protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 51, D638–D646 (2023).
  • 19. Hammal F., de Langen P., Bergon A., Lopez F. & Ballester B. ReMap 2022: a database of Human, Mouse, Drosophila and Arabidopsis regulatory regions from an integrative analysis of DNA-binding sequencing experiments. Nucleic Acids Res. 50, D316–D325 (2021).
  • 20. de Leeuw C. A., Mooij J. M., Heskes T. & Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
  • 21. Karlsson M. et al. A single–cell type transcriptomics map of human tissues. Sci. Adv. 7, eabh2169 (2021).
  • 22. Erickson E. N., Krol K. M., Perkeybile A. M., Connelly J. J. & Myatt L. Oxytocin receptor single nucleotide polymorphism predicts atony-related postpartum hemorrhage. BMC Pregnancy Childbirth 22, 884 (2022).
  • 23. Mesiano S. Myometrial Progesterone Responsiveness. Semin. Reprod. Med. 25, 5–13 (2007).
  • 24. Islam M. S., Afrin S., Jones S. I. & Segars J. Selective Progesterone Receptor Modulators—Mechanisms and Therapeutic Utility. Endocr. Rev. 41, bnaa012 (2020).
  • 25. Saraswat L., Bhattacharya S., Maheshwari A. & Bhattacharya S. Maternal and perinatal outcome in women with threatened miscarriage in the first trimester: a systematic review. BJOG Int. J. Obstet. Gynaecol. 117, 245–257 (2010).
  • 26. Sørensen E. et al. Data Resource Profile: The Copenhagen Hospital Biobank (CHB). Int. J. Epidemiol. 50, 719–720e (2021).
  • 27. Hansen T. F. et al. DBDS Genomic Cohort, a prospective and comprehensive resource for integrative and temporal analysis of genetic, environmental and lifestyle factors affecting health of blood donors. BMJ Open 9, e028401 (2019).
  • 28. Banasik K. et al. DanMAC5: a browser of aggregated sequence variants from 8,671 whole genome sequenced Danish individuals. BMC Genomic Data 24, 30 (2023).
  • 29. Gudbjartsson D. F. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 47, 435–444 (2015).
  • 30. Koel M. et al. GWAS meta-analyses clarify the genetics of cervical phenotypes and inform risk stratification for cervical cancer. Hum. Mol. Genet. 32, 2103–2116 (2023).
  • 31. Pujol-Gualdo N. et al. Advancing our understanding of genetic risk factors and potential personalized strategies for pelvic organ prolapse. Nat. Commun. 13, 3584 (2022).
  • 32. Mitt M. et al. Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel. Eur. J. Hum. Genet. EJHG 25, 869–876 (2017).
  • 33. Kurki M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023).
  • 34. Kong A. et al. Detection of sharing by descent, long-range phasing and haplotype imputation. Nat. Genet. 40, 1068–1075 (2008).
  • 35. Magnus P. et al. Cohort Profile Update: The Norwegian Mother and Child Cohort Study (MoBa). Int. J. Epidemiol. 45, 382–388 (2016).
  • 36. Helgeland Ø. et al. Characterization of the genetic architecture of infant and early childhood body mass index. Nat. Metab. 4, 344–358 (2022).
  • 37. Mbatchou J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021).
  • 38. Halldorsson B. V. et al. The sequences of 150,119 genomes in the UK Biobank. Nature 607, 732–740 (2022).
  • 39. Sveinbjornsson G. Weighting sequence variants based on their annotation increases power of whole-genome association studies. Nat. Genet. 48, 314–317 (2016).
  • 40. Yang J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375, S1–3 (2012).
  • 41. Yang J., Lee S. H., Goddard M. E. & Visscher P. M. GCTA: A Tool for Genome-wide Complex Trait Analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
  • 42. Yang Z., Paschou P. & Drineas P. Reconstructing SNP Allele and Genotype Frequencies from GWAS Summary Statistics. 2021.04.02.438281 Preprint at 10.1101/2021.04.02.438281 (2021).
  • 43. Myers T. A., Chanock S. J. & Machiela M. J. LDlinkR: An R Package for Rapidly Calculating Linkage Disequilibrium Statistics in Diverse Populations. Front. Genet. 11, (2020).
  • 44. Knudsen L. B. & Olsen J. The Danish Medical Birth Registry. Dan. Med. Bull. 45, 320–323 (1998).
  • 45. Schmidt M. et al. The Danish National Patient Registry: a review of content, data quality, and research potential. Clin. Epidemiol. 7, 449–490 (2015).
  • 46. Pazokitoroudi A. et al. Efficient variance components analysis across millions of genomes. Nat. Commun. 11, 4020 (2020).
  • 47. Bulik-Sullivan B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
  • 48. Privé F., Arbel J. & Vilhjálmsson B. J. LDpred2: better, faster, stronger. Bioinforma. Oxf. Engl. 36, 5424–5431 (2020).
  • 49. Warrington N. M. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat. Genet. 51, 804–814 (2019).
  • 50. Murphy A. E., Schilder B. M. & Skene N. G. MungeSumstats: a Bioconductor package for the standardization and quality control of many GWAS summary statistics. Bioinformatics 37, 4593–4596 (2021).
  • 51. Harrell F. E. Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. (Springer International Publishing, 2015). doi: 10.1007/978-3-319-19425-7.
  • 52. Juliusdottir T. et al. Distinction between the effects of parental and fetal genomes on fetal growth. Nat. Genet. 53, 1135–1142 (2021).

Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints
