Abstract
Mendelian randomisation is an approach in genetic epidemiology that uses genetic variants as instrumental variables to investigate the causal relationship between genetically proxied exposures and health outcomes. During the last years, the number of published Mendelian randomisation studies increased tremendously. There are several opportunities of Mendelian randomisation including obtaining potential causal relationships between both exogenous and endogenous exposures and outcomes and for identifying and prioritising drug-targets to inform clinical trials. However, it is also important to be aware of its challenges. This includes the reliability of results under the assumptions on instrumental variables, being aware of potential biases, the correct and critical interpretation of findings and comparison to the results of randomised controlled trials, as well as the availability of genetic data on specific subgroups. This review provides a comprehensive overview of the opportunities and challenges of Mendelian randomisation and presents key future perspectives.
Keywords: Mendelian randomisation, opportunities, challenges, perspective, subgroups, sex-specific, omics
Introduction
The concept of causality
Understanding causal relationships helps to unravel disease etiology, which provides mechanistic understanding to inform drug development and preventative action, thereby guiding health policy and clinical practice. In 1986, Holland stated, ‘no causation without manipulation’ (1), emphasizing that manipulation of an exposure is essential to investigate the causal effect of an exposure on an outcome. However, many epidemiological studies are observational by design. Consequently, when assessing relationships between exposures and outcomes in observational studies, we cannot draw any conclusions about causality. One of the main reasons is that observational studies are prone to confounding, that is, there may exist third factors that are related to both the exposure and outcome of interest, which bias observational associations. Potential solutions are to adjust, restrict, or stratify statistical models. However, in reality, residual confounding may still exist after applying these methods, since either we do not know all confounding factors or we have not assessed them in our study. Accordingly, randomised controlled trials (RCTs) are the gold standard for assessing causality. The random allocation of participants to an intervention or control group should—with large enough sample sizes—ensure an equal distribution of known and unknown confounding factors. As such, the only difference between the groups is the exposure to the intervention. However, RCTs can be infeasible since they are often very expensive and time consuming and sometimes it may not be ethically justifiable to conduct them. Consequently, alternative methods are needed.
Mendelian randomisation
Mendelian randomisation (MR), a specific application of instrumental variable analysis, lies in its ability to assess causality based on genetic data. The use of instrumental variables was introduced in 1928 in the context of econometrics (2). Building on these principles, the concept of MR was first described in 1986 (3), and MR was formally introduced in 2003 (4). The number of reported MR studies in scientific literature has grown exponentially every year since then. By February 2025, more than 15,000 articles using or describing MR had been published in PubMed (see Figure 1).
Figure 1.
Number of publications on Mendelian randomisation studies until 2024. The data depicted in this figure have been extracted from PubMed on February 24th 2025 using the search terms “Mendelian randomization” OR “Mendelian randomisation”. Publications from 2025 have been excluded.
MR allows estimating the effect of an exposure on an outcome by selecting genetic variants (e.g., from GWAS data) strongly associated with the exposure of interest. These genetic variants serve as instrumental variables to proxy the exposure. Specifically, the effect size of the association of the genetic instrumental variable with the exposure and outcome, respectively, are used to estimate the causal effect of an exposure on an outcome (5,6). Since genetic variants are randomly allocated at gamete formation, MR is robust to confounding and reverse causation under three core assumptions: (1) the genetic variants are strongly associated with the exposure of interest (the relevance assumption); (2) there is no common cause between genetic variants and the outcome (the independence assumption); and (3) the genetic variants influence the outcome only through their effect on the exposure (i.e., absence of horizontal pleiotropy/exclusion restriction assumption).
Interpretation of Mendelian randomisation study results
The main objective of many MR studies is to understand whether the relationship between an exposure and an outcome is causal or not. However, MR studies usually provide an effect size to estimate the causal effect of an exposure on an outcome (7). Its interpretation depends on the types of variables studied. In case the outcome of interest is binary, the effect size is often presented as odds ratio, while for continuous outcomes, beta coefficients are usually reported. If the exposure of interest is continuous, the relationship is often described as per unit or standard deviation increase or decrease and the exposure is referred to as “genetically proxied” or “genetically predicted”. For instance, a MR study investigating the link between interleukin 6 and coronary artery disease (8) found an odds ratio of 0.64 (95% confidence interval 0.54 to 0.76). The authors described their findings as ‘[g]enetically predicted circulating IL-6 levels were significantly inversely associated with CAD’ (8). In case the exposure of interest is categorical, the relationship can usually be interpreted per unit increase in the log odds of the exposure and the wording “genetic liability to” is often used. For instance, a MR study investigated the relationship between insomnia and different CVD outcomes and described their findings as “genetic liability to insomnia was associated with higher odds of six CVDs” (9).
Conducting MR analyses has become more accessible over the past years due to the availability of statistical tools, advanced software, and publicly accessible genetic data. This accessibility has allowed researchers to study both hypothesised and novel causal effects. For example, hypothesis-generating methods such as phenome-wide MR analyses systematically assess causality between a wide range of traits (10,11). This could uncover unexpected causal effects as a valuable starting point for further research. However, the relative ease of applying MR also comes with potential concerns. There is a fine line between meaningful rigorous exploration and what might be considered a phishing expedition, where the risk of chance findings or biased findings increases. Indeed, MR findings may be spurious or biased, and results are not always clinically relevant. An example of this is the relationship between schizophrenia and snoring. Using sex-specific GWAS data from up to 76,755 individuals with schizophrenia and 243,649 control individuals (12), as well as data from 408,317 UK Biobank participants including around 152,000 participants who reported snoring during sleep (13), we found that schizophrenia appears to be causally related to a lower risk of snoring in males (odds ratio 0.97, 95% confidence interval 0.95 to 0.99). Relationships like these are unlikely to have a direct preventive or clinical interpretation, as intervening is complex for several reasons, and such findings may reflect a bias rather than true causality. This underscores the importance of interpreting MR results cautiously, considering their clinical context and validity.
In the following sections, we will explain the opportunities of MR, elaborate on some of its challenges, and discuss future perspectives on its application. An overview is provided in Table 1.
Table 1.
Opportunities, challenges, and future directions of Mendelian randomisation.
|
---|
MENDELIAN RANDOMISATION |
|
Opportunities |
|
|
|
|
|
Challenges |
|
|
|
|
|
|
|
|
|
Future perspectives |
|
|
|
|
|
|
|
Opportunities of Mendelian Randomisation
Exposure-outcome relationships
Exposures assessed in MR can be both exogenous (i.e., originating outside the body such as environmental or lifestyle factors) and endogenous (i.e., originating within the body such as metabolic by-products or circulating proteins levels). Furthermore, one can study continuous exposures (e.g., levels of C-reactive protein [CRP]) as well as binary exposures (e.g., smoking behaviour). This provides a wide range of potential applications of MR.
MR can be used to study the causal effect of health-related behaviours (such as smoking and alcohol consumption) or complex biological traits (such as diabetes and obesity) on the risk of disease. For example, MR studies have shown that genetic liability to smoking initiation is related to a range of circulatory system diseases including aortic aneurysm, atrial fibrillation, coronary artery disease, and peripheral artery disease, among others, as well as to several digestive and nervous system diseases and cancer types (14). Additionally, higher genetically proxied body mass index has been linked to an increased risk for several cancer types including digestive system cancers, lung cancer, and uterine cancer (15). MR can also assess whether various types of circulating lipids, metabolites, or proteins are causally related to health outcomes. For example, CRP is observationally associated with coronary heart disease (CHD) (16), but prior to robust MR studies it was unknown whether this association was causal (17). MR analyses have shown that CRP is not likely to be a causal risk factor for CHD but rather a marker of the inflammation contributing to CHD (18,19). Similarly, in 1989, the causal relationship between fibrinogen and myocardial infarction was studied by using a polymorphism in the fibrinogen beta-gene as a proxy for plasma fibrinogen levels (20), implicitly applying the principles of what was later formally introduced as MR. The study found that increased fibrinogen levels associated with the genetic factor did not increase the risk of myocardial infarction. In addition to circulating constituents of the blood, interest has also emerged in other biomarkers as exposure of interest, which can provide additional insights into biological changes associated with disease processes. For example, metabolites measured in urine could serve as non-invasive markers of cardiovascular disease (CVD), which may be used in prognosis or diagnosis. Messenger ribonucleic acid (mRNA) expression is another example of a potential exposure of interest in MR and provides deeper insights into the causal pathways linking genes to disease. GWASs have identified thousands of genetic variants associated with complex diseases, including CVD. By using mRNA as an exposure in MR studies, it becomes possible to identify genes with expression causally related to disease (21). Using tissue-level mRNA expression data as an exposure in MR allows for a further refined understanding of the biological mechanisms underlying GWAS-identified associations (22,23,24,25,26). For example, in the context of chronic kidney disease, recent analyses of renal transcriptomes have demonstrated the causal effects of kidney-specific gene expression on kidney function traits (25).
Drug-target prioritisation
MR can also be used for drug-target prioritisation. When a potential drug-target is the exposure of interest, MR is referred to as drug-target MR. While traditional genome-wide MR selects instrumental variables from genetic variants across the genome, drug-target MR selects instrumental variables from genetic variants in or around the gene encoding the drug-target (27). This approach can identify novel drug-targets and prioritise proteins for further validation in registry data studies, wet lab studies, and eventually, RCTs. Previous research has shown that genetically supported targets have higher success rates in phases II and III clinical trials (28,29), and given the size of the human proteome, prioritisation (and deprioritisation) of potential targets can streamline drug development, optimizing resource allocation and reducing the risk of costly late-stage trial failures (30,31). In addition to (de)prioritisation, drug repurposing—defined as identifying new therapeutic uses for existing and approved drugs—has gained attention (32).
Challenges in Mendelian Randomisation
Despite the opportunities of MR to estimate causal effects of various types of exposures and the robustness of MR to most types of confounding, which affect observational study designs, MR also has important challenges and limitations.
Validity of the MR assumptions
MR has three core assumptions, as described above. The relevance assumptions states that the instrumental variable are strongly related to the exposure of interest. In MR studies, an option to select genetic variants to be included in the instrumental variables is biological knowledge. For instance, it is possible to use variants within the CRP gene region to genetically proxy CRP levels (18). Alternatively, instrumental variables can be built based on results from large-scale GWASs. This has, for example, been done in a phenome-wide study on body mass index (33). When relying on GWAS findings, it is important that large-scale studies are available in order to construct valid instrumental variables. The second assumption of MR, the independence assumption, usually holds given that genotypes are defined at gamete formation and are rarely changed by environmental factors. However, there are factors that could invalidate this assumption including population structure or assortative mating, which are in general greater concerns for traits linked to social patterns (34). Another potential threat to the validity of the MR assumptions is horizontal pleiotropy, where selected genetic variants influence the outcome through pathways independently of the exposure of interest (35,36,37,38). This would violate the so-called exclusion restriction assumption. Moreover, genetic variants in linkage disequilibrium could be correlated with other variants related to a confounder that may affect the outcome (34). In general, it is not possible to fully exclude horizontal pleiotropy in an MR. However, there are methods that allow to further investigate potential horizontal pleiotropy such as MR-Egger (39) and MR-PRESSO (36).
Selection bias
While the independence assumption usually holds in MR, the independence of the instrumental variable and potential confounders can be at risk. For example, conditioning on a variable that is influenced by both the instrumental variable and one or more of the confounders of the exposure-outcome association (i.e., conditioning on a collider) induces an association between the two (40), known as collider bias (Figure 2). If the exposure or the outcome of interest in MR is associated with increased mortality, selection of participants for MR is inherently dependent on the levels of the exposure or outcome, inducing selection bias—a form of collider bias. As such, selection bias in MR can lead to an association between the instrumental variable and the outcome in the absence of a causal effect of the exposure on the outcome (41). Methods have been developed to address this issue, but it cannot be ruled out completely (42).
Figure 2.
Directed acyclic graphs on selection bias in Mendelian randomisation analyses. Directed acyclic graphs where G represents the instrumental variable, X the exposure, U the confounder, Y the outcome and where selection (S) leads to selection bias if it is dependent on levels of the exposure (panel A) or outcome (panel B).
Mendelian randomisation versus randomised controlled trials
MR studies are often compared to RCTs. While both methods share a conceptual basis in randomisation (either to study arms in RCTs, or to genotypes in MR), they differ in several ways (43). One difference is the timing of randomisation. In RCTs, participants are randomly allocated to a study arm soon after study recruitment, ensuring that well-conducted RCTs are free of confounding and selection bias. MR, in contrast, relies on the random allocation of genetic material at gamete formation, which occurs typically long before study recruitment. Consequently, MR can also be affected by selection bias, specifically survivor bias, as participants need to be survivors of their genotype. However, this is of course also the case for being included in an RCT. Furthermore, individuals with high-risk traits may have received interventions that may modify their phenotypic trait or their disease risk later in life. Consequently, this needs to be taken into account when performing GWAS as well as MR studies. For instance, a large-scale GWAS for blood pressure traits adjusted their analyses for antihypertensive medication use (44). Another consequence of the difference in timing of randomisation is that in MR, genetic variants used to proxy the exposure of interest may have a lifelong cumulative effect, while a RCT estimates the effect of an intervention administered over a specific and usually shorter time frame (43). This implies that MR estimates cannot be compared directly with effect sizes from RCTs. Furthermore, this MR may not be valid to assess disease risk due to acute alterations of risk factors (43).
As mentioned above, MR can also be applied to drug prioritisation. However, there are several challenges of drug-target MR (5,45). A crucial initial step in drug-target MR is the selection of the underlying instrumental variable, which can be based on protein quantitative trait loci (pQTL), expression quantitative trait loci (eQTL) data, or on biomarker data. For instance, if one wants to apply drug-target MR to inform on potential effects of proprotein convertase subtilisin/kexin type 9 (PCSK9) inhibitors, one could study variations in the PCSK9 gene by either using eQTL data on PCSK9 expression, pQTL data on the PCSK9 protein, or genetic variants related to the underlying biomarker (i.e., low-density lipoprotein cholesterol) within the PCSK9 gene region. Each of these scenarios has its advantages and drawbacks (5,45). In general, it is important to note that drug-target MR may help to inform clinical trials but cannot replace them (43). For instance, drug-target MR relies on the principle that the underlying drugs work via a specific target. However, many drugs have so-called pleiotropic or off-targets effects, which may be missed when only studying genetic variants in a specific gene region. For these reasons, MR and RCTs are complementary and not interchangeable and validation is required before MR results can be translated to clinical practice. Ultimately, RCTs may follow, which will provide additional insights in whether targeting the identified drug-targets can reverse the disease process or prevent disease onset.
Subgroup-specific GWAS data
Another limitation to the application of MR is the limited availability of subgroup-specific GWAS data, such as sex-specific or ancestry-specific data, which are essential for conducting subgroup-specific MR. Sex-specific MR is relevant because sex-combined MR can mask potential sex differences in either the direction or magnitude of causal effects, resulting in estimates that may not hold true for either of the sexes (46). For example, when using MR to estimate causal risk factor effects, a sex-specific approach may be valuable at a population level to know if targeting risk factors in a sex-specific way is needed to obtain similar levels of risk reduction across the full population. Similarly, when performing MR to estimate the effects of protein perturbation, sex-stratification is relevant because protein expression levels, protein function and the effect of protein perturbation may differ between females and males.
Sex-specific MRs are best performed using sex-specific GWAS, as heterogeneity in genetic effects between sexes may invalidate MR analysis relying on sex-combined GWAS estimates for the association of the instrumental variables with the exposure of interest. Notably, recent UK Biobank analyses found that 70% of non-binary and almost 10% of binary traits had at least one autosomal genetic variant showing a significantly different association in women than in men (at a p < 1 × 10–8) (47). Sex-differentiated genetic effects were confirmed, for example, for schizophrenia (48), serum uric acid concentrations (49), waist-hip ratio (50), and central obesity and adipose distribution (51). Although the first sex-stratified GWAS was reported in 2008 (reporting genetic associations with height), sex-stratification in GWAS is not common practice. Even in the GWAS that do stratify on sex, challenges may persist. For example, coronary artery disease GWAS in females often suffer from weaker statistical power as compared to the GWAS in males, due to differences in disease occurrence (52) as well as smaller effect sizes. Although the majority of coronary artery disease is caused by atherosclerosis, pathophysiological differences in coronary artery disease between sexes such as spontaneous coronary artery dissection (SCAD) in women may be the reason for smaller effect sizes as the genetic architecture of SCAD is distinct (53). In addition, GWAS may suffer from misclassification of cases, as some coronary artery disease subtypes such as ischemia with no obstructive arteries (INOCA) may not be captured under standard outcome definitions misclassifying patients with INOCA under the “control” group (54).
Future Perspectives
Availability and integration of omics data
A promising future opportunity of MR is the integration of ‘omics data. In the future, technological advancements are likely to increase the scale and scope of omics studies such as proteomics, transcriptomics, and metabolomics. Despite the important role of proteins in disease pathogenesis, proteomic studies at a large scale have only recently become feasible (55,56,57,58,59,60). These pQTL studies have enabled research into the causal effect of proteins on disease. Potential remains in scaling up these studies in terms of both sample size and coverage of the proteome. Moreover, blood is the primary source of current pQTL studies. While blood is most accessible, it may not reflect protein levels in tissues more directly involved in disease processes, highlighting the need for more tissue-specific studies. In addition to proteomics, other types of omics data provide further insights and expand the applications of MR. As described above, transcriptomics (RNA-sequencing) can be used within MR to study the effects of gene expression, and metabolomics can be used to further improve understanding of metabolic changes involved in disease progression. Additionally, methylation data allows to study regulatory mechanisms of gene expression (61). Such omics studies, with expected advancements in sample size, coverage, and quality, are expected to provide novel opportunities for MR to study disease mechanisms and identify therapeutic targets.
Future MR efforts to unravel disease mechanisms and identify novel drug targets or repurposing opportunities should focus on triangulating evidence from various types of omics data, using publicly available data supported by open science practices (62). The integration of multi-omics data provides a comprehensive understanding of complex biological systems, as each type of omics data offers different insights and poses different assumptions regarding for example pleiotropy within MR (5). In line with the notion of triangulation in epidemiology where multiple lines of evidence are needed to support robust conclusions, combining multiple type of omics within MR can reduce the limitations of individual approaches and achieve a more complete view of the disease mechanisms.
Availability of subgroup-specific GWAS data
The field of MR is rapidly evolving, and both sex-stratified and other subgroup-stratified MR studies are likely to become more common when advancements in genotyping and growing computational power drive the availability of subgroup-specific GWAS data. Currently, sex stratification in GWAS is not the common practice, probably because it is not considered to be a priority or because of concerns about a loss of statistical power from smaller sample sizes. However, combining results of sex-specific GWAS in a meta-analysis has shown to be effective in overcoming this potential loss of power (46). Large consortia, such as DIAGRAM (63) and MAGIC (64), have already published such meta-analyses of sex-specific GWAS data for various glycaemic traits and diabetes, enabling to study sex-specific effects and test for sex differences in effects that might otherwise be obscured in combined analyses. The same applies to ancestry-specific GWAS data. Although many GWAS are still conducted in European populations, there is growing recognition that performing GWAS across multiple ancestries is essential because of genetic heterogeneity across ancestries (65,66) and ancestry-specific GWAS studies have already been published (67,68,69,70).
A subgroup-specific approach in multi-omics data integration
Combining the two research fields of subgroup-specific and multi-omics approaches would be specifically valuable. For instance, previous studies have shown sex differences in gene expression across various human tissues (71,72), and found that sex influences biological gene networks through variations in gene co-expression (73). These findings suggest that sex-specific biological mechanisms may influence the efficacy of existing treatments. Novel therapeutic strategies, informed by sex-specific multi-omics analyses, may better target underlying pathways when they are unique to females or males or different between the sexes, offering more effective prevention and treatment options. However, such sex-specific analyses rely on sex-specific data, which are often not available. For example, numerous pQTL for plasma proteins have been identified, but studies reporting pQTL in a sex-stratified population remain scarce. To bridge this gap, we emphasize the need for sex-specific data, not only for risk factors, but also for proteomics, metabolomics, transcriptomics, and other omics data.
Conclusions
The framework of MR, a type of instrumental variable analysis, allows estimating relationships between exposures and outcomes based on genetic variants. It has a wide range of application, including the identification of potential causal associations between risk factors and diseases and the prioritisation of drug-targets. However, with the growing number of MR studies published, it is important to acknowledge its challenges. Large-scale subgroup-specific GWAS and multi-omics approaches are still needed to better understand specific disease mechanisms. This could increase our ability to address the complexities of diseases and guide the development of more personalised therapeutic strategies, ultimately improving individual health.
Funding Statement
SAEP is supported by a VIDI Fellowship from the Dutch Organisation for Health Research and Development (ZonMw) (09150172010050).
Competing Interests
The authors have no competing interests to declare.
Author Contributions
Sophie C. de Ruiter and Lena Tschiderer contributed equally.
References
- 1.Holland PW. Statistics and causal inference. Journal of the American Statistical Association. 1986;81(396):945–960. DOI: 10.1080/01621459.1986.10478354 [DOI] [PubMed] [Google Scholar]
- 2.Staiger DO, Stock JH. Instrumental variables regression with weak instruments. National Bureau of Economic Research Cambridge, Mass., USA; 1994. DOI: 10.3386/t0151 [DOI] [Google Scholar]
- 3.Katan MB. Apolipoprotein E isoforms, serum cholesterol, and cancer. The Lancet. 1986;327:507–508. DOI: 10.1016/S0140-6736(86)92972-7 [DOI] [PubMed] [Google Scholar]
- 4.Davey Smith G, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International Journal of Epidemiology. 2003;32(1):1–22. DOI: 10.1093/ije/dyg070 [DOI] [PubMed] [Google Scholar]
- 5.Schmidt AF, Finan C, Gordillo-Marañón M, Asselbergs FW, Freitag DF, Patel RS, et al. Genetic drug target validation using Mendelian randomisation. Nature Communications. 2020;11(1):3255. DOI: 10.1038/s41467-020-16969-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Schmidt AF, Hingorani AD, Finan C. Human genomics and drug development. Cold Spring Harb Perspect Med. 2022;12(2):a039230. DOI: 10.1101/cshperspect.a039230 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Burgess S, Thompson SG. Mendelian randomization: methods for using genetic variants in causal estimation. CRC Press; 2015. [Google Scholar]
- 8.Yuan S, Lin A, He QQ, Burgess S, Larsson SC. Circulating interleukins in relation to coronary artery disease, atrial fibrillation and ischemic stroke and its subtypes: A two-sample Mendelian randomization study. Int J Cardiol. 2020;313:99–104. DOI: 10.1016/j.ijcard.2020.03.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yuan S, Mason AM, Burgess S, Larsson SC. Genetic liability to insomnia in relation to cardiovascular diseases: a Mendelian randomisation study. Eur J Epidemiol. 2021;36(4):393–400. DOI: 10.1007/s10654-021-00737-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang L, Zhang X, Meng X, Koskeridis F, Georgiou A, Yu L, et al. Methodology in phenome-wide association studies: a systematic review. Journal of Medical Genetics. 2021;58(11):720–728. DOI: 10.1136/jmedgenet-2021-107696 [DOI] [PubMed] [Google Scholar]
- 11.Zheng J, Haberland V, Baird D, Walker V, Haycock PC, Hurle MR, et al. Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. Nature Genetics. 2020;52(10):1122–1131. DOI: 10.1038/s41588-020-0682-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Trubetskoy V, Pardiñas AF, Qi T, Panagiotaropoulou G, Awasthi S, Bigdeli TB, et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature. 2022;604(7906):502–508. DOI: 10.1038/s41586-022-04434-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Campos AI, García-Marín LM, Byrne EM, Martin NG, Cuéllar-Partida G, Rentería ME. Insights into the aetiology of snoring from observational and genetic investigations in the UK Biobank. Nature Communications. 2020;11(1):817. DOI: 10.1038/s41467-020-14625-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Larsson SC, Burgess S. Appraising the causal role of smoking in multiple diseases: A systematic review and meta-analysis of Mendelian randomization studies. EBioMedicine. 2022;82:104154. DOI: 10.1016/j.ebiom.2022.104154 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Vithayathil M, Carter P, Kar S, Mason AM, Burgess S, Larsson SC. Body size and composition and risk of site-specific cancers in the UK Biobank and large international consortia: A mendelian randomisation study. PLoS Med. 2021;18(7):e1003706. DOI: 10.1371/journal.pmed.1003706 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Collaboration ERF. C-reactive protein concentration and risk of coronary heart disease, stroke, and mortality: an individual participant meta-analysis. The Lancet. 2010;375(9709):132–140. DOI: 10.1016/S0140-6736(09)61717-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Danesh J, Pepys MB. C-reactive protein and coronary disease: is there a causal link? Am Heart Assoc. 2009;120(101):2036–2039. DOI: 10.1161/CIRCULATIONAHA.109.907212 [DOI] [PubMed] [Google Scholar]
- 18.Collaboration CRPCHDG. Association between C reactive protein and coronary heart disease: mendelian randomisation analysis based on individual participant data. BMJ. 2011;15(342):d548. DOI: 10.1136/bmj.d548 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zacho J, Tybjærg-Hansen A, Jensen JS, Grande P, Sillesen H, Nordestgaard BG. Genetically elevated C-reactive protein and ischemic vascular disease. New England Journal of Medicine. 2008;359(18):1897–1908. DOI: 10.1056/NEJMoa0707402 [DOI] [PubMed] [Google Scholar]
- 20.van der Bom JG, de Maat MP, Bots ML, Haverkate F, De Jong P, Hofman A, et al. Elevated plasma fibrinogen: cause or consequence of cardiovascular disease? Arteriosclerosis, Thrombosis, and Vascular Biology. 1998;18(4):621–625. DOI: 10.1161/01.ATV.18.4.621 [DOI] [PubMed] [Google Scholar]
- 21.Porcu E, Rüeger S, Lepik K, Santoni FA, Reymond A, Kutalik Z. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nature Communications. 2019;10(1):3300. DOI: 10.1038/s41467-019-10936-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Khankari NK, Keaton JM, Walker VM, Lee KM, Shuey MM, Clarke SL, et al. Using Mendelian randomisation to identify opportunities for type 2 diabetes prevention by repurposing medications used for lipid management. EBioMedicine. 2022;80:104038. DOI: 10.1016/j.ebiom.2022.104038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Richardson TG, Hemani G, Gaunt TR, Relton CL, Davey Smith G. A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome. Nature Communications. 2020;11(1):185. DOI: 10.1038/s41467-019-13921-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Taylor K, Davey Smith G, Relton CL, Gaunt TR, Richardson TG. Prioritizing putative influential genes in cardiovascular disease susceptibility by applying tissue-specific Mendelian randomization. Genome Medicine. 2019;11:1–15. DOI: 10.1186/s13073-019-0613-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Xu X, Eales JM, Akbarov A, Guo H, Becker L, Talavera D, et al. Molecular insights into genome-wide association studies of chronic kidney disease-defining traits. Nature Communications. 2018;9(1):4800. DOI: 10.1038/s41467-018-07260-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wu X, Jiang L, Qi H, Hu C, Jia X, Lin H, et al. Brain tissue-and cell type-specific eQTL Mendelian randomization reveals efficacy of FADS1 and FADS2 on cognitive function. Translational Psychiatry. 2024;14(1):77. DOI: 10.1038/s41398-024-02784-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Schmidt AF, Finan C, Gordillo-Marañón M, Asselbergs FW, Freitag DF, Patel RS, et al. Genetic drug target validation using Mendelian randomisation. Nature Communications. 2020;11(1):1–12. DOI: 10.1038/s41467-020-16969-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.King EA, Davis JW, Degner JF. Are drug targets with genetic support twice as likely to be approved? Revised estimates of the impact of genetic support for drug mechanisms on the probability of drug approval. PLoS Genetics. 2019;15(12):e1008489. DOI: 10.1371/journal.pgen.1008489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ochoa D, Karim M, Ghoussaini M, Hulcoop DG, McDonagh EM, Dunham I. Human genetics evidence supports two-thirds of the 2021 FDA-approved drugs. Nat Rev Drug Discov. 2022;21(8):551. DOI: 10.1038/d41573-022-00120-3 [DOI] [PubMed] [Google Scholar]
- 30.Plenge RM, Scolnick EM, Altshuler D. Validating therapeutic targets through human genetics. Nature Reviews Drug Discovery. 2013;12(8):581–594. DOI: 10.1038/nrd4051 [DOI] [PubMed] [Google Scholar]
- 31.Voight BF, Peloso GM, Orho-Melander M, Frikke-Schmidt R, Barbalic M, Jensen MK, et al. Plasma HDL cholesterol and risk of myocardial infarction: a mendelian randomisation study. The Lancet. 2012;380(9841):572–580. DOI: 10.1016/S0140-6736(12)60312-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pushpakom S, Iorio F, Eyers PA, Escott KJ, Hopper S, Wells A, et al. Drug repurposing: progress, challenges and recommendations. Nature Reviews Drug Discovery. 2019;18(1):41–58. DOI: 10.1038/nrd.2018.168 [DOI] [PubMed] [Google Scholar]
- 33.Huang J, Huffman JE, Huang Y, Do Valle I, Assimes TL, Raghavan S, et al. Genomics and phenomics of body mass index reveals a complex disease network. Nature Communications. 2022;13(1):7973. DOI: 10.1038/s41467-022-35553-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Burgess S, Davey Smith G, Davies NM, Dudbridge F, Gill D, Glymour MM, et al. Guidelines for performing Mendelian randomization investigations: update for summer 2023. Wellcome Open Res. 2019;4:186. DOI: 10.12688/wellcomeopenres.15555.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey Smith G. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Statistics in Medicine. 2008;27(8):1133–1163. DOI: 10.1002/sim.3034 [DOI] [PubMed] [Google Scholar]
- 36.Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nature Genetics. 2018;50(5):693–698. DOI: 10.1038/s41588-018-0099-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Sheehan NA, Meng S, Didelez V. Mendelian randomisation: a tool for assessing causality in observational epidemiology. Genetic Epidemiology. 2011;153–166. DOI: 10.1007/978-1-60327-416-6_12 [DOI] [PubMed] [Google Scholar]
- 38.VanderWeele TJ, Tchetgen EJT, Cornelis M, Kraft P. Methodological challenges in mendelian randomization. Epidemiology. 2014;25(3):427–435. DOI: 10.1097/EDE.0000000000000081 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Burgess S, Thompson SG. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol. 2017;32(5):377–389. DOI: 10.1007/s10654-017-0255-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Swanson SA, Robins JM, Miller M, Hernán MA. Selecting on treatment: a pervasive form of bias in instrumental variable analyses. American Journal of Epidemiology. 2015;181(3):191–197. DOI: 10.1093/aje/kwu284 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Canan C, Lesko C, Lau B. Instrumental variable analyses and selection bias. Epidemiology. 2017;28(3):396–398. DOI: 10.1097/EDE.0000000000000639 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Vansteelandt S, Dukes O, Martinussen T. Survivor bias in Mendelian randomization analysis. Biostatistics. 2018;19(4):426–443. DOI: 10.1093/biostatistics/kxx050 [DOI] [PubMed] [Google Scholar]
- 43.Burgess S, Butterworth A, Malarstig A, Thompson SG. Use of Mendelian randomisation to assess potential benefit of clinical intervention. BMJ. 2012;345:e7325. DOI: 10.1136/bmj.e7325 [DOI] [PubMed] [Google Scholar]
- 44.Evangelou E, Warren HR, Mosen-Ansorena D, Mifsud B, Pazoki R, Gao H, et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nature Genetics. 2018;50(10):1412–1425. DOI: 10.1038/s41588-018-0205-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gill D, Dib MJ, Cronje HT, Karhunen V, Woolf B, Gagnon E, et al. Common pitfalls in drug target Mendelian randomization and how to avoid them. BMC Med. 2024;22(1):473. DOI: 10.1186/s12916-024-03700-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Magi R, Lindgren CM, Morris AP. Meta-analysis of sex-specific genome-wide association studies. Genetic Epidemiology. 2010;34(8):846–853. DOI: 10.1002/gepi.20540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bernabeu E, Canela-Xandri O, Rawlik K, Talenti A, Prendergast J, Tenesa A. Sex differences in genetic architecture in the UK Biobank. Nature Genetics. 2021;53(9):1283–1289. DOI: 10.1038/s41588-021-00912-0 [DOI] [PubMed] [Google Scholar]
- 48.Shifman S, Johannesson M, Bronstein M, Chen SX, Collier DA, Craddock NJ, et al. Genome-wide association identifies a common variant in the reelin gene that increases the risk of schizophrenia only in women. PLoS Genetics. 2008;4(2):e28. DOI: 10.1371/journal.pgen.0040028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Döring A, Gieger C, Mehta D, Gohlke H, Prokisch H, Coassin S, et al. SLC2 A9 influences uric acid concentrations with pronounced sex-specific effects. Nature Genetics. 2008;40(4):430–436. DOI: 10.1038/ng.107 [DOI] [PubMed] [Google Scholar]
- 50.Lindgren CM, Heid IM, Randall JC, Lamina C, Steinthorsdottir V, Qi L, et al. Genome-wide association scan meta-analysis identifies three Loci influencing adiposity and fat distribution. PLoS Genetics. 2009;5(6):e1000508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lumish HS, O’Reilly M, Reilly MP. Sex differences in genomic drivers of adipose distribution and related cardiometabolic disorders: opportunities for precision medicine. Arteriosclerosis, Thrombosis, and Vascular Biology. 2020;40(1):45–60. DOI: 10.1161/ATVBAHA.119.313154 [DOI] [PubMed] [Google Scholar]
- 52.Aragam KG, Jiang T, Goel A, Kanoni S, Wolford BN, Atri DS, et al. Discovery and systematic characterization of risk variants and genes for coronary artery disease in over a million participants. Nature Genetics. 2022;54:1803–1815. DOI: 10.1038/s41588-022-01233-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Adlam D, Berrandou TE, Georges A, Nelson CP, Giannoulatou E, Henry J, et al. Genome-wide association meta-analysis of spontaneous coronary artery dissection identifies risk variants and genes related to artery integrity and tissue-mediated coagulation. Nature Genetics. 2023;55(6):964–972. DOI: 10.1038/s41588-023-01410-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Sakkers TR, Mokry M, Civelek M, Erdmann J, Pasterkamp G, Benavente ED, et al. Sex differences in the genetic and molecular mechanisms of coronary artery disease. Atherosclerosis. 2023; 384:117279. DOI: 10.1016/j.atherosclerosis.2023.117279 [DOI] [PubMed] [Google Scholar]
- 55.Suhre K, Arnold M, Bhagwat AM, Cotton RJ, Engelke R, Raffler J, et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nature Communications. 2017;8(1):14357. DOI: 10.1038/ncomms14357 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Emilsson V, Ilkov M, Lamb JR, Finkel N, Gudmundsson EF, Pitts R, et al. Co-regulatory networks of human serum proteins link genetics to disease. Science. 2018;361(6404):769–773. DOI: 10.1126/science.aaq1327 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sun BB, Maranville JC, Peters JE, Stacey D, Staley JR, Blackshaw J, et al. Genomic atlas of the human plasma proteome. Nature. 2018;558(7708):73–79. DOI: 10.1038/s41586-018-0175-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Benson MD, Yang Q, Ngo D, Zhu Y, Shen D, Farrell LA, et al. Genetic architecture of the cardiovascular risk proteome. Circulation. 2018;137(11):1158–1172. DOI: 10.1161/CIRCULATIONAHA.117.029536 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Yao C, Chen G, Song C, Keefe J, Mendelson M, Huan T, et al. Genome-wide mapping of plasma protein QTLs identifies putatively causal genes and pathways for cardiovascular disease. Nature Communications. 2018;9(1):1–11. DOI: 10.1038/s41467-018-06231-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Zhernakova DV, Le TH, Kurilshikov A, Atanasovska B, Bonder MJ, Sanna S, et al. Individual variations in cardiovascular-disease-related protein levels are driven by genetics and gut microbiome. Nature Genetics. 2018;50(11):1524–1532. DOI: 10.1038/s41588-018-0224-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Richardson TG, Zheng J, Smith GD, Timpson NJ, Gaunt TR, Relton CL, et al. Mendelian randomization analysis identifies CpG sites as putative mediators for genetic influences on cardiovascular disease risk. The American Journal of Human Genetics. 2017;101(4):590–602. DOI: 10.1016/j.ajhg.2017.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Reay WR, Cairns MJ. Advancing the use of genome-wide association studies for drug repurposing. Nature Reviews Genetics. 2021;22(10):658–671. DOI: 10.1038/s41576-021-00387-z [DOI] [PubMed] [Google Scholar]
- 63.Mahajan A, Taliun D, Thurner M, Robertson NR, Torres JM, Rayner NW, et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nature Genetics. 2018;50(11):1505–1513. DOI: 10.1038/s41588-018-0241-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Chen J, Spracklen CN, Marenne G, Varshney A, Corbin LJ, Luan J, et al. The trans-ancestral genomic architecture of glycemic traits. Nature Genetics. 2021;53(6):840–860. DOI: 10.1038/s41588-021-00852-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Ntzani EE, Liberopoulos G, Manolio TA, Ioannidis JP. Consistency of genome-wide associations across major ancestral groups. Human Genetics. 2012;131:1057–1071. DOI: 10.1007/s00439-011-1124-4 [DOI] [PubMed] [Google Scholar]
- 66.Peterson RE, Kuchenbaecker K, Walters RK, Chen C-Y, Popejoy AB, Periyasamy S, et al. Genome-wide association studies in ancestrally diverse populations: opportunities, methods, pitfalls, and recommendations. Cell. 2019;179(3):589–603. DOI: 10.1016/j.cell.2019.08.051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Gurdasani D, Carstensen T, Fatumo S, Chen G, Franklin CS, Prado-Martinez J, et al. Uganda genome resource enables insights into population history and genomic discovery in Africa. Cell. 2019;179(4):984–1002. e36. DOI: 10.1016/j.cell.2019.10.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Assimes T, Tcheandjieu C, Zhu X, Hilliard A, Clarke S, Napolioni V, et al. A large-scale multi-ethnic genome-wide association study of coronary artery disease. Nature Medicine. 2022;28:1679–1692. DOI: 10.1038/s41591-022-01891-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ishigaki K, Sakaue S, Terao C, Luo Y, Sonehara K, Yamaguchi K, et al. Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis. Nature Genetics. 2022;54(11):1640–1651. DOI: 10.1038/s41588-022-01213-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Pervjakova N, Moen G-H, Borges M-C, Ferreira T, Cook JP, Allard C, et al. Multi-ancestry genome-wide association study of gestational diabetes mellitus highlights genetic links with type 2 diabetes. Human Molecular Genetics. 2022;31(19):3377–3391. DOI: 10.1093/hmg/ddac050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Oliva M, Muñoz-Aguirre M, Kim-Hellmuth S, Wucher V, Gewirtz AD, Cotter DJ, et al. The impact of sex on gene expression across human tissues. Science. 2020;369(6509):eaba3066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Melé M, Ferreira PG, Reverter F, DeLuca DS, Monlong J, Sammeth M, et al. The human transcriptome across tissues and individuals. Science. 2015;348(6235):660–665. DOI: 10.1126/science.aaa0355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Hartman RJ, Mokry M, Pasterkamp G, den Ruijter HM. Sex-dependent gene co-expression in the human body. Scientific Reports. 2021;11(1):18758. DOI: 10.1038/s41598-021-98059-9 [DOI] [PMC free article] [PubMed] [Google Scholar]