Skip to main content
Bioinformatics logoLink to Bioinformatics
. 2024 Nov 8;40(11):btae666. doi: 10.1093/bioinformatics/btae666

MR Corge: sensitivity analysis of Mendelian randomization based on the core gene hypothesis for polygenic exposures

Wenmin Zhang 1, Chen-Yang Su 2,3, Satoshi Yoshiji 4,5,6,7,8, Tianyuan Lu 9,10,11,12,13,
Editor: Russell Schwartz
PMCID: PMC11578597  PMID: 39513749

Abstract

Summary

Mendelian randomization is being utilized to assess causal effects of polygenic exposures, where many genetic instruments are subject to horizontal pleiotropy. Existing methods for detecting and correcting for horizontal pleiotropy have important assumptions that may not be fulfilled. Built upon the core gene hypothesis, we developed MR Corge for performing sensitivity analysis of Mendelian randomization. MR Corge identifies a small number of putative core instruments that are more likely to affect genes with a direct biological role in an exposure and obtains causal effect estimates based on these instruments, thereby reducing the risk of horizontal pleiotropy. Using positive and negative controls, we demonstrated that MR Corge estimates aligned with established biomedical knowledge and the results of randomized controlled trials. MR Corge may be widely applied to investigate polygenic exposure-outcome relationships.

Availability and implementation

An open-sourced R package is available at https://github.com/zhwm/MRCorge.

1 Introduction

Mendelian randomization (MR) is a promising tool for assessing potential causal relationships (Davey Smith and Ebrahim 2003, Smith and Ebrahim 2004) since randomized controlled trials are often limited by ethical and practical constraints. Using genetic variants as instrumental variables, MR may effectively mitigate the risks of confounding and reverse causation that frequently bias observational association studies (Davey Smith and Ebrahim 2003, Smith and Ebrahim 2004). MR relies on essential instrumental variable assumptions of relevance, independence, and exclusion restriction (Skrivankova et al. 2021a,b). Specifically, genetic instruments must be strongly associated with the exposure, be independent of any confounders, and affect the outcome only through the exposure. The relevance assumption can be fulfilled by selecting genetic variants demonstrating genome-wide significance in genome-wide association studies (GWAS) of exposures, while the independence assumption may be examined by testing for instrument-confounder associations. However, exclusion restriction, also known as the assumption of no horizontal pleiotropy, remains challenging to verify.

With the goal to robustly derive MR estimates in the presence of horizontal pleiotropy, several methods have been developed beyond the original Wald ratio and inverse variance weighted (IVW) methods (Burgess et al. 2013). For instance, the median-based methods (e.g. simple median, weighted median, and penalized weighted median methods) require that at least 50% of the instruments or their weights are valid (Bowden et al. 2016, Burgess et al. 2017). The mode-based methods (e.g. simple mode and weighted mode methods) require that the mode of the causal effect estimates or their weights is based on valid instruments (Hartwig et al. 2017). The MR PRESSO method identifies and removes outlier instruments (Verbanck et al. 2018), while the contamination mixture method identifies one or more groups of instruments with consistent causal estimates (Burgess et al. 2020), both having the assumption that a plurality of the instruments should be valid (Verbanck et al. 2018, Burgess et al. 2020). The MR Egger method and MR RAPS method allow for pervasive pleiotropic effects but have an additional assumption that the strength of the genetic instruments is independent of their direct effects on the outcome (i.e. InSIDE assumption) (Burgess and Thompson 2017, Zhao et al. 2020).

In recent years, MR has been increasingly utilized to estimate the potential causal effects of polygenic exposures, for which tens or hundreds of independent genetic variants can be identified as candidate instruments from GWAS of rapidly growing sample sizes (Visscher et al. 2017, Abdellaoui et al. 2023). It has been recognized that many, sometimes the vast majority, of these candidate instruments do not have a biologically interpretable role in the exposures and can be involved in multiple distinct biological pathways (Boyle et al. 2017, Liu et al. 2019b, Sinnott-Armstrong et al. 2021a). This substantially increases the risk of violating the additional assumptions of the aforementioned MR methods.

The core gene hypothesis in the theory of the omnigenic model (Boyle et al. 2017) has been proposed to explain the high polygenicity of complex traits. The core gene hypothesis states that genetic variants within core genes or pathways have more direct and specific effects on a polygenic trait than those within peripheral genes or pathways, which more likely play regulatory roles. We hypothesize that for polygenic exposures, using instruments from core genes or pathways may reduce the risk of horizontal pleiotropy.

In this work, we present MR Corge for performing sensitivity analysis of MR based on the core gene hypothesis. MR Corge estimates the effect of a polygenic exposure using a group of putative core instruments. We demonstrate the utility of MR Corge in assessing polygenic exposure-outcome relationships using positive and negative controls.

2 Methods

MR Corge relies on two important assumptions derived from the core gene hypothesis (Boyle et al. 2017). First, for a polygenic exposure, a small number of core genes have a direct biological impact on the trait. Second, genetic instruments with a stronger variant-trait association (i.e. core instruments) are more likely to act through core genes, whereas those with a weaker variant-trait association (i.e. peripheral instruments) are more likely to act through peripheral genes and be subject to horizontal pleiotropy. With these assumptions, MR estimates obtained based on the putative core instruments are less likely to be biased by horizontal pleiotropy than those based on all instruments.

In MR Corge, the most crucial step is to identify putative core instruments from all candidate instruments, which are typically pre-selected through linkage disequilibrium (LD) clumping based on GWAS summary statistics. Four instrument ranking options have been made available, based on

  1. |β^j| (default setting): the per-allele effect of the jth genetic variant on the exposure, estimated in the GWAS of the exposure;

  2. fj1-fjβ^j2 : the per-variant heritability of the jth genetic variant, which is proportional to the squared per-allele effect normalized by the minor allele frequency (fj);

  3. |β^j|SEj : the significance of the jth genetic variant in the GWAS of the exposure;

  4. |β^j|SEjNj : the sample size (Nj)-normalized significance of the jth genetic variant, which can differ from (3) especially in meta-analyses of GWAS with multiple participating studies.

All pre-selected candidate instruments are ranked based on one of these four metrics in descending order. Among the tied instruments, ranks are randomly assigned. Then, the instruments are categorized into K groups, where each group contains approximately the same number of instruments. To showcase the utility of MR Corge, we choose K such that each group contains 5 to 10 instruments in this study. In practice, multiple sensitivity analyses should be conducted using different values of K.

MR Corge then generates group-specific and cumulative MR estimates that can be based on any existing methods (Fig. 1A). By default, MR Corge applies the IVW, weighted median, and weighted mode methods for estimation. The kth cumulative MR estimate (k{1,,K}) is obtained based on all instruments in the kth and all preceding groups, if any. An F-statistic is calculated for each group of instruments to assess whether the MR estimates may be subject to weak instrument bias (Burgess et al. 2011). MR estimates based on the first group of instruments, which are putative core instruments, are the primary MR Corge estimates.

Figure 1.

Figure 1.

Benchmarking the performance of MR Corge. (A) Group-specific and cumulative MR Corge estimates of the effect of serum high-density lipoprotein cholesterol (HDL) levels on the risk of coronary artery disease (CAD). Estimates obtained using the inverse variance weighted method are illustrated with error bars indicating 95% confidence intervals. SD, standard deviation. (B) Significance of association estimates in positive and negative controls. Association estimates obtained with nonspecific negative control outcomes are presented in Supplementary Fig. S1. Asterisks (*) indicate estimates with a P-value < 0.05. P-values < 1.0 × 10−10 were set to 1.0 × 10−10 for visualization purposes. Positive values of signed significance represent that increased exposure was predicted to increase the outcome while negative values of signed significance represent that increased exposure was predicted to decrease the outcome. Instruments were ranked based on the default ranking criterion based on the absolute value of per-allele effect size.

We utilized several positive and negative controls to examine whether sensible MR estimates could be obtained using MR Corge.

Positive controls:

  1. Serum insulin-like growth factor 1 (IGF1)-height association: IGF1 mediates the effects of growth hormone in various tissues, promotes the proliferation and differentiation of chondrocytes and myoblasts, and supports energy production, resulting in bone elongation and height increase (Martin and Baxter 1986, Clemmons 2018, Yakar et al. 2018);

  2. Serum urate-gout association: Gout is primarily caused by elevated levels of circulating urate, which can form urate crystals in joints and trigger inflammatory responses (Dalbeth et al. 2019);

  3. Serum low-density lipoprotein cholesterol (LDL)-coronary artery disease (CAD), body mass index (BMI)-CAD, and alcohol intake-CAD associations: The risk-increasing effects of hyperlipidemia, obesity, and excessive alcohol intake on CAD have been confirmed in previous randomized controlled trials as well as Mendelian randomization and observational studies (Wilson 1994, Dansinger et al. 2005, Baigent et al. 2010, Sabatine 2019, Biddinger et al. 2022).

Negative controls:

  1. Serum high-density lipoprotein cholesterol (HDL)-CAD and vitamin D-CAD associations: Although increased HDL and vitamin D levels have been associated with a decreased risk of CAD in observational studies, randomized controlled trials have shown no meaningful protective effects of HDL-increasing drugs or vitamin D supplements against CAD in the general population with normal HDL or vitamin D levels (Nissen et al. 2007, Lincoff et al. 2017, Scragg et al. 2017, Manson et al. 2019);

  2. Height-cognitive performance association: Height is a significant predictor of cognitive performance in observational studies (Tanner 1966), yet this association can be strongly confounded by various factors, such as socioeconomic status, nutrition, and assortative mating. We assessed the height-cognitive performance association using LD score regression (version 1.0.1) (Bulik-Sullivan et al. 2015a,b) to estimate the genetic correlation based on within-sibship studies, which can effectively control for unmeasured confounders (Howe et al. 2022).

  3. Following previous work (Sanderson et al. 2021, Hu et al. 2024), we also used natural hair color and self-reported ease of skin tanning(Watanabe et al. 2019) as nonspecific negative control outcomes, with IGF1, urate, LDL, BMI, HDL, vitamin D, and height as the exposures.

GWAS summary statistics of these exposures and outcomes were obtained from large-scale studies (Supplementary Table S1) (Köttgen et al. 2013, Willer et al. 2013, Wood et al. 2014, Locke et al. 2015, Lee et al. 2018, Liu et al. 2019a, Manousaki et al. 2020, Sinnott-Armstrong et al. 2021b, Aragam et al. 2022). For each exposure, we performed LD clumping using the ieugwasr R package (version 1.0.1) (Hemani et al. 2018) with an LD reference panel generated from 50 000 randomly sampled UK Biobank participants of European ancestry (Bycroft et al. 2018). We retained independent genetic variants (LD r2 < 0.001, window size = 1 Mb) that demonstrated genome-wide significance (P-value < 5.0×10−8) as candidate instruments. The summary statistics of these instruments were obtained from the GWAS of the corresponding outcomes. When an instrument was not present, we attempted to identify its proxy variant with an LD r2 > 0.8 using the same LD reference panel. Data harmonization was performed using the TwoSampleMR R package (version 0.5.7) (Hemani et al. 2018). Palindromic variants a minor allele frequency > 0.42 were excluded. MR Corge estimates were obtained using the IVW, weighted median, and weighted mode methods, with K = 5 for the alcohol intake, K = 20 for IGF1, urate, LDL, BMI, HDL, and vitamin D, and K = 100 for height, where each instrument group contains 5–10 instruments. All genetic instruments and their functional impacts on nearby genes annotated using the Ensembl Variant Effect Predictor (release 112) (McLaren et al. 2016) are detailed in Supplementary Table S2.

We also derived MR estimates for each of these exposure-outcome pairs based on all selected instruments throughout the genome, using the IVW, simple median, weighted median, simple mode, weighted mode, penalized weighted median, MR Egger, MR RAPS, MR PRESSO, and contamination mixture methods. All MR estimates are provided in Supplementary Tables S3 and S4.

3 Results

3.1 Positive controls

Using the default instrument ranking criterion based on the absolute value of per-allele effect size, the putative core instrument groups contained genetic variants mapping to genes with well-established biological functions related to the exposures. For instance, the putative core instruments for serum IGF1 levels included rs856540, an intergenic variant near IGFBP3 (main carrier of IGF1), while the putative core instruments for serum urate levels included rs1165196, a missense variant of SLC17A1, and rs1171614, an intronic variant of SLC16A9, where both SLC17A1 and SLC16A9 are important urate solute carriers. MR Corge estimates consistently showed that increased serum IGF1 levels could lead to increased height while increased serum urate levels could elevate the risk of gout (Fig. 1B). Meanwhile, serum LDL levels and BMI were predicted to increase the risk of CAD (Fig. 1B). Without selecting instruments, the simple mode, weighted mode, and MR Egger methods did not identify all of these associations (Fig. 1B).

The strongest instrument for alcohol intake was rs1229984, a missense variant of ADH1B (alcohol dehydrogenase). ADH1B regulates the accumulation of acetaldehyde which can cause unpleasant effects such as flushing, nausea, and rapid heart rate (Polimanti and Gelernter 2018, Lu et al. 2023a,b). However, most of the other instruments do not have a direct role in the metabolism of alcohol. With the core instruments, MR Corge estimates obtained using the weighted median and weighted mode methods identified the risk-increasing effect of alcohol intake on CAD, although the MR Corge estimate using the IVW method, along with four other methods based on all instruments, did not identify this effect (Fig. 1B).

3.2 Negative controls

Furthermore, MR Corge estimates consistently showed that serum HDL and vitamin D levels may not be causally associated with CAD, while based on all instruments, the IVW, simple median, weighted median, penalized weighted median, MR RAPS, and MR PRESSO suggested protective effects of at least one exposure (Fig. 1B). The putative core instruments for HDL levels included rs9989419, an intergenic variant near CETP [cholesteryl ester transfer protein, transferring cholesterol esters from HDL particles to other lipoproteins (Chapman et al. 2010)], while the putative core instruments for vitamin D levels included rs117576073, an intronic variant of CYP2R1 (vitamin D 25-hydroxylase), rs4694423, an intergenic variant near GC (vitamin D binding protein), and rs12803256, an intergenic variant near DHCR7 [7-dehydrocholesterol reductase which balances biosynthesis of cholesterol and vitamin D (Prabhu et al. 2016)]. Interestingly, consistent with MR Corge estimates, CETP-inhibitors have been a recent focus of drug development yet have not shown effectiveness in reducing cardiovascular disease risks in randomized controlled trials, despite their capacity to significantly elevate serum HDL levels (Nissen et al. 2007, Lincoff et al. 2017).

Meanwhile, MR Corge estimates indicated that height was not associated with cognitive performance, consistent with the absence of genetic correlation based on within-sibship studies (Supplementary Table S5). In contrast, when all instruments were utilized, eight of the ten MR methods suggested a causal effect of increased height on increased cognitive performance (Fig. 1B).

In addition, MR Corge demonstrated appropriate false positive control with natural hair color and self-reported ease of skin tanning as nonspecific negative control outcomes (Supplementary Fig. S1).

4 Discussion

Instrumental variable assumptions are essential for deriving valid MR estimates. It has been recognized that carefully selecting genetic instruments which represent relevant biological mechanisms may mitigate the risk of horizontal pleiotropy, such as using lead variants in cis-quantitative trait loci for instrumenting the abundances of mRNA transcripts, proteins, and certain metabolites (Porcu et al. 2019, Zhou et al. 2021, Chen et al. 2023, Lu et al. 2023a,b, Yoshiji et al. 2023, Lu et al. 2024). However, assessing potential causal effects of many polygenic exposures using MR remains challenging since the underlying biological mechanisms may not have been fully elucidated and a large number of candidate instruments can be subject to horizontal pleiotropy. MR Corge can be a useful tool for performing sensitivity analysis for such exposures.

With positive and negative controls, we demonstrated that MR estimates based on all instruments could be inconsistent with established biomedical knowledge or the results of randomized controlled trials, even when robust MR methods were used. This inconsistency is likely due to the violation of additional assumptions of these methods. In contrast, with a small number of putative core instruments, the assumptions of robust MR methods, such as the weighted median and weighted mode methods, are more likely to be fulfilled due to the enrichment of valid instruments. As a result, MR Corge could consistently provide sensible MR estimates. Moreover, because the putative core instruments had the strongest associations with the exposures, the F-statistics of the core instrument groups were higher than the overall F-statistics based on all instruments (Supplementary Table S4), indicating a low risk of introducing weak instrument bias (Burgess et al. 2011) and further supporting that appropriate instrument selection should prevail over maximizing the proportion of exposure variance explained by the instruments. This principle likely also applies to analyses using genetic risk scores as instrumental variables, where existing methods for addressing horizontal pleiotropy are not applicable.

Similar to other methods that operate on GWAS summary statistics, MR Corge may yield biased results if the GWAS populations are subject to selection bias (Pirastu et al. 2021) or if the genetic effect estimates capture the effects of demography or relatives (Howe et al. 2022). Identifying data sources less likely to introduce such bias remains crucial for analyses using MR Corge.

Notably, the choice of K directly reflects the fundamental assumption regarding the number of core instruments, where a larger K corresponds to sparser core instruments and core genes. In practice, given the significant knowledge gap between the genetic architecture and biological mechanisms of most polygenic traits, a universal rule for determining core instruments may not exist, and the core instrument-peripheral instrument classification may not be binary (Boyle et al. 2017). We recommend conducting multiple sensitivity analyses with different values of K (Supplementary Fig. S2), where consistent results are more likely to support causal interpretations. Furthermore, empirically, we found that ranking based on the absolute value of per-allele effect size outperformed other ranking criteria (Supplementary Fig. S3). However, interpreting MR Corge estimates always requires cautious investigation of the putative core instruments, as well as triangulation of multiple lines of evidence, as with other MR or epidemiological methods.

In summary, built upon the core gene hypothesis, MR Corge may identify putative core instruments to obtain MR estimates that are less likely to be biased by horizontal pleiotropy. MR Corge may have wide utility as a tool for performing sensitivity analysis when the exposures demonstrate high polygenicity, while the underlying biological mechanisms remain unclear.

Supplementary Material

btae666_Supplementary_Data

Acknowledgements

This study was enabled in part by support provided by Calcul Québec and Compute Canada.

Author contributions

W.Z.: Software, Formal analysis, Visualization, Writing—Original Draft, Writing—Review & Editing; C.-Y.S.: Resources, Data curation, Writing—Review & Editing; S.Y.: Resources, Data curation, Writing—Review & Editing; T.L.: Conceptualization, Methodology, Formal analysis, Visualization, Investigation, Writing—Original Draft, Writing—Review & Editing, Project administration, Funding acquisition.

Contributor Information

Wenmin Zhang, Montreal Heart Institute, Montreal, QC, H1T 1C8, Canada.

Chen-Yang Su, Quantitative Life Sciences Program, McGill University, Montreal, QC, H3A 0G4, Canada; Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC, H3A 0G1, Canada.

Satoshi Yoshiji, Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC, H3A 0G1, Canada; Department of Human Genetics, McGill University, Montreal, QC, H3A 0G1, Canada; Lady Davis Institute for Medical Research, Montreal, QC, H3T 1E2, Canada; Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, United States; Harvard Medical School, Boston, MA, 02115, United States.

Tianyuan Lu, Department of Population Health Sciences, University of Wisconsin-Madison, Madison, WI, 53726, United States; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53726, United States; Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, 53706, United States; Center for Genomic Science Innovation, University of Wisconsin-Madison, Madison, WI, 53706, United States; Center for Human Genomics and Precision Medicine, University of Wisconsin-Madison, Madison, WI, 53705, United States.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest: W.Z. and T.L. have been providing consulting services to Five Prime Sciences Inc. for research programs unrelated to this work. The other authors have no conflicts of interest to disclose.

Funding

W.Z. was supported by an Institut de valorisation des données Postdoctoral Fellowship. C.-Y.S. was supported by a Canadian Institutes of Health Research Canada Graduate Scholarship Doctoral Award, a Fonds de recherche du Québec doctoral training scholarship, and a Lady Davis Institute/TD-Bank Scholarship. S.Y. was supported by the Japan Society for the Promotion of Science. T.L. was supported by start-up funding from the Office of the Vice Chancellor for Research and Graduate Education, School of Medicine and Public Health, and Department of Population Health Sciences at the University of Wisconsin-Madison.

References

  1. Abdellaoui A, Yengo L, Verweij KJH. et al. 15 years of GWAS discovery: realizing the promise. Am J Hum Genet 2023;110:179–94. 10.1016/j.ajhg.2022.12.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Aragam KG, Jiang T, Goel A. et al. ; CARDIoGRAMplusC4D Consortium. Discovery and systematic characterization of risk variants and genes for coronary artery disease in over a million participants. Nat Genet 2022;54:1803–15. 10.1038/s41588-022-01233-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baigent C, Blackwell L, Emberson J. et al. ; Cholesterol Treatment Trialists’ (CTT) Collaboration. Efficacy and safety of more intensive lowering of LDL cholesterol: a meta-analysis of data from 170,000 participants in 26 randomised trials. Lancet 2010;376:1670–81. 10.1016/S0140-6736(10)61350-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Biddinger KJ, Emdin CA, Haas ME. et al. Association of habitual alcohol intake with risk of cardiovascular disease. JAMA Netw Open 2022;5:e223849. 10.1001/jamanetworkopen.2022.3849 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bowden J, Davey Smith G, Haycock PC. et al. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet Epidemiol 2016;40:304–14. 10.1002/gepi.21965 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Boyle EA, Li YI, Pritchard JK.. An expanded view of complex traits: from polygenic to omnigenic. Cell 2017;169:1177–86. 10.1016/j.cell.2017.05.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bulik-Sullivan B, Finucane HK, Anttila V. et al. ; Genetic Consortium for Anorexia Nervosa of the Wellcome Trust Case Control Consortium 3. An atlas of genetic correlations across human diseases and traits. Nat Genet 2015a;47:1236–41. 10.1038/ng.3406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bulik-Sullivan BK, Loh PR, Finucane HK. et al. ; Consortium Schizophrenia Working Group of the Psychiatric Genomics. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 2015b;47:291–5. 10.1038/ng.3211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Burgess S, Bowden J, Fall T. et al. Sensitivity analyses for robust causal inference from Mendelian randomization analyses with multiple genetic variants. Epidemiology 2017;28:30–42. 10.1097/EDE.0000000000000559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Burgess S, Butterworth A, Thompson SG.. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol 2013;37:658–65. 10.1002/gepi.21758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Burgess S, Foley CN, Allara E. et al. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat Commun 2020;11:376. 10.1038/s41467-019-14156-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Burgess S, Thompson SG.. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol 2017;32:377–89. 10.1007/s10654-017-0255-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Burgess S, Thompson SG; CRP CHD Genetics Collaboration Avoiding bias from weak instruments in Mendelian randomization studies. Int J Epidemiol 2011;40:755–64. 10.1093/ije/dyr036 [DOI] [PubMed] [Google Scholar]
  14. Bycroft C, Freeman C, Petkova D. et al. The UK biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9. 10.1038/s41586-018-0579-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chapman MJ, Le Goff W, Guerin M. et al. Cholesteryl ester transfer protein: at the heart of the action of lipid-modulating therapy with statins, fibrates, niacin, and cholesteryl ester transfer protein inhibitors. Eur Heart J 2010;31:149–64. 10.1093/eurheartj/ehp399 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chen Y, Lu T, Pettersson-Kymmer U. et al. Genomic atlas of the plasma metabolome prioritizes metabolites implicated in human diseases. Nat Genet 2023;55:44–53. 10.1038/s41588-022-01270-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Clemmons DR. Role of IGF-Binding proteins in regulating IGF responses to changes in metabolism. J Mol Endocrinol 2018;61:T139–69. 10.1530/JME-18-0016 [DOI] [PubMed] [Google Scholar]
  18. Dalbeth N, Choi HK, Joosten LAB. et al. Gout. Nat Rev Dis Primers 2019;5:69. 10.1038/s41572-019-0115-y [DOI] [PubMed] [Google Scholar]
  19. Dansinger ML, Gleason JA, Griffith JL. et al. Comparison of the atkins, ornish, weight watchers, and zone diets for weight loss and heart disease risk ReductionA randomized trial. JAMA 2005;293:43–53. 10.1001/jama.293.1.43 [DOI] [PubMed] [Google Scholar]
  20. Davey Smith G, Ebrahim S.. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 2003;32:1–22. [DOI] [PubMed] [Google Scholar]
  21. Hartwig FP, Davey Smith G, Bowden J.. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int J Epidemiol 2017;46:1985–98. 10.1093/ije/dyx102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hemani G, Zheng J, Elsworth B. et al. The MR-base platform supports systematic causal inference across the human phenome. Elife 2018;7:e34408. 10.7554/eLife.34408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Howe LJ, Nivard MG, Morris TT, Within Family Consortium et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat Genet 2022;54:581–92. 10.1038/s41588-022-01062-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hu X, Cai M, Xiao J. et al. Benchmarking Mendelian randomization methods for causal inference using Genome-Wide association study summary statistics. Am J Hum Genet 2024;111:1717–35. 10.1016/j.ajhg.2024.06.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Köttgen A, Albrecht E, Teumer A. et al. ; MAGIC Consortium. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat Genet 2013;45:145–54. 10.1038/ng.2500 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lee JJ, Wedow R, Okbay A. et al. ; Social Science Genetic Association Consortium. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet 2018;50:1112–21. 10.1038/s41588-018-0147-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lincoff AM, Nicholls SJ, Riesmeyer JS. et al. ; ACCELERATE Investigators. Evacetrapib and cardiovascular outcomes in high-risk vascular disease. N Engl J Med 2017;376:1933–42. 10.1056/NEJMoa1609581 [DOI] [PubMed] [Google Scholar]
  28. Liu M, Jiang Y, Wedow R. et al. ; HUNT All-In Psychiatry. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat Genet 2019a;51:237–44. 10.1038/s41588-018-0307-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Liu X, Li YI, Pritchard JK.. Trans effects on gene expression can drive omnigenic inheritance. Cell 2019b;177:1022–34.e6. 10.1016/j.cell.2019.04.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Locke AE, Kahali B, Berndt SI. et al. ; International Endogene Consortium. Genetic studies of body mass index yield new insights for obesity biology. Nature 2015;518:197–206. 10.1038/nature14177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lu T, Chen Y, Yoshiji S. et al. Circulating metabolite abundances associated with risks of bipolar disorder, schizophrenia, and depression: a Mendelian randomization study. Biol Psychiatry 2024;96:782–91. 10.1016/j.biopsych.2024.04.016 [DOI] [PubMed] [Google Scholar]
  32. Lu T, Forgetta V, Greenwood CMT. et al. Circulating proteins influencing psychiatric disease: a Mendelian randomization study. Biol Psychiatry 2023a;93:82–91. 10.1016/j.biopsych.2022.08.015 [DOI] [PubMed] [Google Scholar]
  33. Lu T, Nakanishi T, Yoshiji S. et al. Dose-dependent association of alcohol consumption with obesity and type 2 diabetes: Mendelian randomization analyses. J Clin Endocrinol Metab 2023b;108:3320–9. 10.1210/clinem/dgad324 [DOI] [PubMed] [Google Scholar]
  34. Manousaki D, Mitchell R, Dudding T. et al. Genome-wide association study for vitamin D levels reveals 69 independent loci. Am J Hum Genet 2020;106:327–37. 10.1016/j.ajhg.2020.01.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Manson JE, Cook NR, Lee I-M. et al. ; VITAL Research Group. Vitamin D supplements and prevention of cancer and cardiovascular disease. N Engl J Med 2019;380:33–44. 10.1056/NEJMoa1809944 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Martin JL, Baxter RC.. Insulin-like growth factor-binding protein from human plasma. Purification and characterization. J Biol Chem 1986;261:8754–60. [PubMed] [Google Scholar]
  37. McLaren W, Gil L, Hunt SE. et al. The ensembl variant effect predictor. Genome Biol 2016;17:122. 10.1186/s13059-016-0974-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Nissen SE, Tardif J-C, Nicholls SJ. et al. ; ILLUSTRATE Investigators. Effect of torcetrapib on the progression of coronary atherosclerosis. N Engl J Med 2007;356:1304–16. 10.1056/NEJMoa070635 [DOI] [PubMed] [Google Scholar]
  39. Pirastu N, Cordioli M, Nandakumar P. et al. ; iPSYCH Consortium. Genetic analyses identify widespread sex-differential participation bias. Nat Genet 2021;53:663–71. 10.1038/s41588-021-00846-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Polimanti R, Gelernter J.. ADH1B: from alcoholism, natural selection, and cancer to the human phenome. Am J Med Genet B Neuropsychiatr Genet 2018;177:113–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Porcu E, Rüeger S, Lepik K, BIOS Consortium et al. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun 2019;10:3300. 10.1038/s41467-019-10936-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Prabhu AV, Luu W, Li D. et al. DHCR7: a vital enzyme switch between cholesterol and vitamin D production. Progress Lipid Res 2016;64:138–51. 10.1016/j.plipres.2016.09.003 [DOI] [PubMed] [Google Scholar]
  43. Sabatine MS. PCSK9 inhibitors: clinical evidence and implementation. Nat Rev Cardiol 2019;16:155–65. 10.1038/s41569-018-0107-8 [DOI] [PubMed] [Google Scholar]
  44. Sanderson E, Richardson TG, Hemani G. et al. The use of negative control outcomes in Mendelian randomization to detect potential population stratification. Int J Epidemiol 2021;50:1350–61. 10.1093/ije/dyaa288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Scragg R, Stewart AW, Waayer D. et al. Effect of monthly high-dose vitamin D supplementation on cardiovascular disease in the vitamin D assessment study : a randomized clinical trial. JAMA Cardiol 2017;2:608–16. 10.1001/jamacardio.2017.0175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Sinnott-Armstrong N, Naqvi S, Rivas M. et al. “GWAS of three molecular traits highlights core genes and pathways alongside a highly polygenic background.” edited by jonathan flint, patricia J wittkopp, vincent J lynch, naomi wray, and aravinda chakravarti. Elife 2021a;10:e58615. 10.7554/eLife.58615 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sinnott-Armstrong N, Tanigawa Y, Amar D. et al. ; FinnGen. Genetics of 35 blood and urine biomarkers in the UK biobank. Nature Genetics 2021b;53:185–94. 10.1038/s41588-020-00757-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Skrivankova VW, Richmond RC, Woolf BAR. et al. Strengthening the reporting of observational studies in epidemiology using Mendelian randomization: the STROBE-MR statement. JAMA 2021a;326:1614–21. 10.1001/jama.2021.18236 [DOI] [PubMed] [Google Scholar]
  49. Skrivankova VW, Richmond RC, Woolf BAR. et al. Strengthening the reporting of observational studies in epidemiology using Mendelian randomisation (STROBE-MR): explanation and elaboration. BMJ 2021b;375:n2233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Smith GD, Ebrahim S.. Mendelian randomization: prospects, potentials, and limitations. Int J Epidemiol 2004;33:30–42. 10.1093/ije/dyh132 [DOI] [PubMed] [Google Scholar]
  51. Tanner JM. Galtonian eugenics and the study of growth: the relation of body size, intelligence test score, and social circumstances in children and adults. Eugenics Rev 1966;58:122–35. [PMC free article] [PubMed] [Google Scholar]
  52. Verbanck M, Chen C-Y, Neale B. et al. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet 2018;50:693–8. 10.1038/s41588-018-0099-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Visscher PM, Wray NR, Zhang Q. et al. 10 Years of GWAS discovery: biology, function, and translation. Am J Hum Genet 2017;101:5–22. 10.1016/j.ajhg.2017.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Watanabe K, Stringer S, Frei O. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat Genet 2019;51:1339–48. 10.1038/s41588-019-0481-0 [DOI] [PubMed] [Google Scholar]
  55. Willer CJ, Schmidt EM, Sengupta S. et al. ; Global Lipids Genetics Consortium. Discovery and refinement of loci associated with lipid levels. Nature Genetics 2013;45:1274–83. 10.1038/ng.2797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Wilson PW, Established risk factors and coronary artery disease: the Framingham study. Am J Hypertens 1994;7:7S–12S. 10.1093/ajh/7.7.7S [DOI] [PubMed] [Google Scholar]
  57. Wood AR, Esko T, Yang J. et al. ; LifeLines Cohort Study. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet 2014;46:1173–86. 10.1038/ng.3097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Yakar S, Werner H, Rosen CJ.. Insulin-like growth factors: actions on the skeleton. J Mol Endocrinol 2018;61:T115–T137. 10.1530/JME-17-0298 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Yoshiji S, Butler-Laporte G, Lu T. et al. Proteome-wide Mendelian randomization implicates nephronectin as an actionable mediator of the effect of obesity on COVID-19 severity. Nat Metab 2023;5:248–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Zhao Q, Wang J, Hemani G. et al. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. Ann Statist 2020;48:1742–69. 10.1214/19-AOS1866 [DOI] [Google Scholar]
  61. Zhou S, Butler-Laporte G, Nakanishi T. et al. A neanderthal OAS1 isoform protects individuals of European ancestry against COVID-19 susceptibility and severity. Nat Med 2021;27:659–67. 10.1038/s41591-021-01281-1 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

btae666_Supplementary_Data

Articles from Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES