Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
. 2024 Jan 4;111(1):165–180. doi: 10.1016/j.ajhg.2023.12.002

A Bayesian approach to Mendelian randomization using summary statistics in the univariable and multivariable settings with correlated pleiotropy

Andrew J Grant 1,2,, Stephen Burgess 1,3
PMCID: PMC10806746  EMSID: EMS193234  PMID: 38181732

Summary

Mendelian randomization uses genetic variants as instrumental variables to make causal inferences on the effect of an exposure on an outcome. Due to the recent abundance of high-powered genome-wide association studies, many putative causal exposures of interest have large numbers of independent genetic variants with which they associate, each representing a potential instrument for use in a Mendelian randomization analysis. Such polygenic analyses increase the power of the study design to detect causal effects; however, they also increase the potential for bias due to instrument invalidity. Recent attention has been given to dealing with bias caused by correlated pleiotropy, which results from violation of the “instrument strength independent of direct effect” assumption. Although methods have been proposed that can account for this bias, a number of restrictive conditions remain in many commonly used techniques. In this paper, we propose a Bayesian framework for Mendelian randomization that provides valid causal inference under very general settings. We propose the methods MR-Horse and MVMR-Horse, which can be performed without access to individual-level data, using only summary statistics of the type commonly published by genome-wide association studies, and can account for both correlated and uncorrelated pleiotropy. In simulation studies, we show that the approach retains type I error rates below nominal levels even in high-pleiotropy scenarios. We demonstrate the proposed approaches in applied examples in both univariable and multivariable settings, some with very weak instruments.

Keywords: causal inference, instrumental variable, Mendelian randomization, multivariable, pleiotropy, shrinkage


We propose a Bayesian framework for performing Mendelian randomization with potentially pleiotropic genetic instruments. Our proposed method, MR-Horse, is shown to produce unbiased causal effect estimates while avoiding inflated false positive rates, even with high levels of pleiotropy and weak instruments. We further extend the approach to the multivariable setting.

Introduction

Mendelian randomization is a technique that exploits genetic variation to make causal inferences on the effect of one or more exposures on an outcome of interest.1 The underlying idea is that, because of the Mendelian laws of inheritance, a genetic variant that is associated with an exposure of interest can serve as a proxy for the exposure. Such a proxy should be less susceptible to the confounding and reverse causation that can inhibit causal inference from observational data. Methods for Mendelian randomization have been developed that can combine information from multiple genetic variants associated with the same exposure. Furthermore, many of these methods can be performed using only summary statistics taken from genome-wide association studies (GWASs), removing the need to access sensitive individual-level data. Under the two-sample framework, summary statistics for the exposure and outcome can be taken from separate, ideally non-overlapping, samples.2 Thus, any combination of exposure and outcome for which there exists publicly available GWAS summary statistics can potentially be considered in a Mendelian randomization study. However, in order to provide robust evidence for causal effects, strong assumptions must be made pertaining to the relationships between the genetic variants and the exposures, outcome, and any confounder variables.

Mendelian randomization can be considered as a form of instrumental variables analysis, where each genetic variant must satisfy the three instrumental variables assumptions. In the single exposure case, these assumptions are that each genetic variant:

  • (1)

    is associated with the exposure,

  • (2)

    is not associated with the outcome via any confounding pathway, and

  • (3)

    does not affect the outcome other than via the exposure.

The relationships between a valid genetic variant and other relevant traits is illustrated graphically in the directed acyclic graph (DAG) in Figure 1A.3

Figure 1.

Figure 1

Directed acyclic graphs

Shown is the relationship between the jth genetic variant (Gj), exposure (X), outcome (Y), and unmeasured confounders (U) when: (A) Gj is a valid instrument, (B) Gj is invalid due to a direct effect on Y, (C) Gj is invalid due to both a direct effect on Y and an indirect effect via U, (D) Gj is invalid due to both a direct effect on Y and being caused by U, and (E) Gj is invalid due to either (i) a direct effect on Y or (ii) both a direct effect on Y and an indirect effect via U but with no direct effect on X.

A common source of instrument invalidity is pleiotropy, which is where genetic variants associate with multiple traits. If a genetic instrument associates with a predictor of the outcome via a causal pathway that bypasses the exposure, then this will open up a pleiotropic pathway and invalidate the third instrumental variables assumption. Another source of invalidity is population structure, in which there are differences in the genetic distributions among subgroups of the population being studied. This results in confounding of the genetic variant-outcome relationship and invalidates the second instrumental variables assumption. Such confounding may occur if there is population stratification, dynastic effects, or assortative mating.4

Much attention has been given in the literature to developing methods for Mendelian randomization that are robust to instrument invalidity. Many of these methods have been developed motivated by providing robustness against pleiotropy. Often, the InSIDE (instrument strength independent of direct effect) assumption is required for these methods to give valid results, which means that any pleiotropic effects are independent of the genetic variant-exposure associations.5 Methods that require InSIDE will not provide robustness to population structure, since the confounding pathway may create a correlation between the genetic effects on the exposure and the direct effects on the outcome. Recently, more attention has been given to methods that can also handle violations of the InSIDE assumption, a scenario referred to as correlated pleiotropy.6 However, methods that provide such robustness typically suffer from a combination of inflated type I error rates, low power, or further restrictive assumptions.

In this paper we propose a Bayesian approach to Mendelian randomization that is robust to both uncorrelated and correlated pleiotropy. We show that this approach is able to provide unbiased and efficient causal effect estimation without inflated type I error rates in very general settings, with favorable performance in comparison with existing methods. A further advantage of the proposed method is that it is easily generalized to the multivariable Mendelian randomization paradigm, which has far fewer robust methods than the single exposure case. We demonstrate the approach in an applied example that looks at the relationship between combinations of four exposures (LDL cholesterol, triglycerides, fasting glucose, and birth weight) and three outcomes (coronary artery disease, type 2 diabetes, and asthma). We further consider an example looking at the effects of intelligence, years of education, and household income on the risk of Alzheimer disease in a multivariable model.

Material and methods

Models for instrument invalidity

We consider a potentially causal exposure X for an outcome Y with unmeasured confounders U. We denote by θ the causal effect of X on Y and assume that this effect is linear and homogeneous in the population (that is, that there is no effect modification). This latter assumption allows for causal effect estimation in the instrumental variables framework.

The relationships between a valid genetic instrument, Gj, and the exposure, outcome, and unmeasured confounders are shown in Figure 1A. If we have a single such instrument, we can estimate the causal effect using the Wald ratio, βˆYj/βˆXj, where βˆXj and βˆYj are estimates of the associations between Gj and X and Y, respectively. These association estimates are typically obtained by using linear regression for continuous traits and logistic regression for binary traits. If we have J valid instruments, the inverse-variance weighted (IVW) method fits the model

βˆYjN(θβˆXj,σYj2),j=1,,J, (Equation 1)

where σYj is the standard error of the jth genetic variant-outcome association estimate.7 Note that the standard error of the jth genetic variant-exposure association estimate, σXj, does not appear in the IVW model, which reflects the fact that this approach implicitly assumes that the genetic associations with the exposure are measured without error.

Most pleiotropy robust methods for Mendelian randomization make the assumption that some of the instruments are valid and allow for varying levels of invalidity. MR-PRESSO assumes that a small number of instruments are invalid, which will result in outliers in the Wald ratio estimates.8 Under the InSIDE assumption, MR-PRESSO tests for outliers and estimates the causal effect using IVW with the identified outliers removed. If at least half of the instruments are valid, a weighted median of the Wald ratios gives a consistent estimate of the causal effect, whether or not InSIDE is satisfied.9

Figure 1B illustrates the case where there is a pleiotropic pathway to the outcome that does not pass via X or U (although it may go via other unmeasured traits that do not associate with the exposure). In this case, the direct effect α˜j is independent of the instrument strength β˜Xj, and hence InSIDE is met. The causal effect can be estimated using the MR-Egger method, which fits the model

βˆYjN(θβˆXj+α0,σYj2),j=1,,J. (Equation 2)

The intercept term α0 represents the average pleiotropic effect across all variants. If this average effect is equal to zero, then it is said that pleiotropy is balanced. If the average effect is non-zero, the pleiotropy is said to be directional. Under the MR-Egger model, potentially all variants may be invalid, but only in such a way that the InSIDE assumption is satisfied.

Figure 1C shows the case where the genetic variant has a pathway to the outcome via one or both of a direct path and a path that passes through a confounder of the exposure-outcome relationship. This latter pathway creates correlated pleiotropy, or, in other words, violates InSIDE. One commonly used model for this scenario is

βˆYjN(θβˆXj+αj,σYj2),j=1,,J, (Equation 3)

where αj represents the effect α˜j+γjδY and βˆXj is an estimate of βXj=β˜Xj+γjδX. Under the assumption that some of the instruments are valid and some are invalid, Kang et al.10 and Rees et al.11 proposed estimating θ using penalized regression with a lasso-style penalty on the αj terms. In this way, the estimates of αj relating to the valid instruments are shrunk toward zero. A post-lasso approach fits the IVW model using only the instruments whose αj estimates were shrunk to zero in the initial lasso regression. Although these lasso-based approaches provide unbiased effect estimates, they can suffer from substantially inflated type I error rates due to the post-selection inference problem. Kang et al.10 showed that the causal effect is identifiable in the model given in Equation 3 when a plurality of instruments are valid. That is, if there are groups of variants for which the single-instrument causal effect estimate is the same, the largest of these groups is the set of valid instruments. Alternative methods that make the plurality valid assumption include the Contamination Mixture method (ConMix)12 and MR-Mix,13 which are based on fitting mixture distributions to the Wald ratio estimates, and weighted mode approaches.14,15

Xue et al.16 proposed a method that fits the model

βˆYjN(θβXj+αj,σYj2),βˆXjN(βXj,σXj2),j=1,,J, (Equation 4)

using constrained maximum likelihood, referred to as MR-cML. This method provides consistent causal effect estimation under the plurality valid assumption. Similar to the lasso regression approaches, MR-cML maximizes the likelihood in such a way that the αjs pertaining to valid instruments will tend toward zero. However, through the use of model averaging, it suffers less from inflated type I errors. Furthermore, the model accounts for the uncertainty in the βXj estimates. The MR-RAPS method is also based on the model given in Equation 4, where pleiotropy is accounted for using a robust loss function. However, MR-RAPS makes the InSIDE assumption, requires balanced pleiotropy, and was shown to have lower power than MR-cML by Xue et al.16

Bayesian methods have been far less utilized in the Mendelian randomization literature, despite a number of potential advantages over frequentist approaches in fitting a model such as that given in Equation 4. Berzuini et al.17 developed a Bayesian approach where parameters representing pleiotropic effects are given a shrinkage prior. This has the effect of shrinking these terms pertaining to the valid instruments toward zero, similar to the lasso-based approaches. The use of credible intervals obtained from the posterior distribution was shown to provide better coverage than the weighted median method, suggesting that the Bayesian approach is better able to estimate the uncertainty in the effect estimates. The method requires individual-level data and an assumption akin to InSIDE. Zhao et al.18 (BWMR) and Bucur et al.19 (BAYESMR) have proposed Bayesian approaches using summary-level data. Similar to Berzuini et al.,17 these approaches place a shrinkage prior on the αj terms and make the InSIDE assumption.

Recently developed approaches aimed at the correlated pleiotropy case include CAUSE,6 MR-Corr2,20 and MR-CUE,21 which each utilize Bayesian methods. MR-Corr2 uses a spike-and-slab prior for pleiotropic terms, similar to BAYESMR, and also includes a parameter to account for correlated pleiotropy. However, it does not allow for both uncorrelated and correlated pleiotropy among the variants. CAUSE and MR-CUE make the assumption, illustrated in Figure 1E, that genetic instruments are either directly associated with the exposure or only associated with the exposure via a confounder. Xue et al.16 showed that CAUSE is particularly sensitive to violations of this assumption. Another feature of these approaches is that they utilize genome-wide summary statistics, as opposed to just summary statistics relating to a curated set of instruments. The genome-wide summary statistics are used to estimate nuisance parameters that are inputs to the Bayesian models.

Figure 1D shows the case where correlated pleiotropy is caused by a confounder of the genetic variant-outcome relationship, for example, due to population structure. Although the cause of instrument invalidity differs from that shown in Figure 1C, we can model it in the same way. We thus consider Equation 4 to be a very general parametric model for Mendelian randomization with pleiotropy. It can represent invalidity due to both pleiotropy and population structure, and it does not make the assumption of CAUSE and MR-CUE that the instruments cannot both be directly associated with the exposure and subject to correlated pleiotropy.

Univariable Mendelian randomization

Suppose we have, for J genetic variants, summary statistics of their association with the exposure and outcome. We denote the estimate of the association between the jth variant and the exposure and outcome by βˆXj and βˆYj, respectively, and the standard errors of the estimates by σXj and σYj. We model the association estimates by

βˆYjN(θβXj+αj,σYj2)
βˆXjN(βXj,σXj2).

We place a jointly normal prior on (βXj,αj),

(βXjαj)N((μ0),(sX2ρjsXφjτρjsXφjτφj2τ2)),

and we use the following weakly informative priors:

μN(0,1)sX2N+(0,1)θN(0,1),

where N+(·,·) is the half-normal distribution (that is, the truncated normal distribution bounded below at 0).

In this model, the βXj term represents the effect of the jth genetic variant on the exposure, and the αj term represents the effect of the variant on the outcome that bypasses the exposure. If the jth variant is a valid instrument, then the true value of αj is zero. Otherwise, the variant is pleiotropic, and the pleiotropic effect is accounted for by a non-zero αj. If the jth variant is either a cause of, or caused by, U, then βXj and αj will be correlated. This correlation is modeled by the ρj parameter. Thus, the model can account for valid instruments (αj=0), uncorrelated pleiotropy (αj0,ρj=0), and correlated pleiotropy (αj0,ρ0). Any prior distribution for ρj that restricts its values to the interval (1,1) may be used and should be informed by domain knowledge of the likely biological mechanisms underlying the genetic variant associations. For our simulation study and applied examples, we set ρj=2rj1 where rjBeta(10,10). This places the majority of the mass in the prior distribution around zero, reflecting the fact that many of the genetic variants will either be valid instruments or have pleiotropic effects that are close to uncorrelated with their exposure association. However, this prior still allows for substantial correlation when this is present.

For the final two parameters in the model, we use the priors

φjCauchy+(0,1)τCauchy+(0,1),

where Cauchy+(·,·) is the half-Cauchy distribution. Under this parametrization, the marginal distribution of αj is the horseshoe prior.22 The horseshoe is a shrinkage prior that will shrink some of the αjs toward zero, while non-zero αjs will escape shrinkage better than when using a Laplace prior (which is equivalent to lasso regression). Thus, the horseshoe prior is less likely to underestimate the value of non-zero αjs. The τ parameter controls the global level of shrinkage such that a lower value will shrink more of the αjs close to zero. In our case, we assign τ a half-Cauchy prior and allow its posterior distribution to be estimated from data. However, if there was prior knowledge on the level of sparsity, this could be incorporated by adjusting this prior or setting a fixed value.23 The horseshoe prior has previously been proposed for pleiotropic effects in Mendelian randomization by Berzuini et al.17 for the individual-level data setting.

In practice, we have implemented the method using BUGS via the rjags package in R.24 We take 10,000 samples from the posterior distribution (with a burn-in of 10,000 iterations) from 3 chains, which, in all cases reported in the simulation study and applied examples, resulted in R-hat values for the parameters of interest close to 1, indicating good convergence. The causal effect estimate is the mean of the posterior samples, and the 95% credible interval is the lower and upper 2.5th percentiles. We call our method MR-Horse (after the horseshoe prior).

Multivariable Mendelian randomization

Multivariable Mendelian randomization considers the effect of multiple exposures on the outcome.25,26 Originally motivated as a method for accounting for measured pleiotropy, it can also be used to simultaneously estimate causal effects of multiple correlated exposures,27 to perform risk factor selection,28 and in mediation analysis.29 A genetic variant is a valid instrumental variable for multivariable Mendelian randomization if it is associated with at least one exposure, it is not associated with the outcome via any confounding pathway, and it does not affect the outcome other than via one or more of the measured exposures.

The multivariable IVW approach fits the model given in Equation 1, but where the βˆXjs are vectors of the association estimates between the jth genetic variant and each exposure. There are far fewer pleiotropy robust methods for multivariable Mendelian randomization compared with the univariable setting. Commonly used approaches include multivariable versions of MR-Egger30 and MR-PRESSO, a multivariable median-based approach,31 and GRAPPLE,32 akin to a multivariable version of MR-RAPS. Recently, a multivariable version of MR-cML has been developed.33

Our proposed model for univariable Mendelian randomization can be easily generalized to the multivariable case as follows. We denote by βXj· the vector of associations between the jth genetic variant and the K exposures, and by βˆXj· their estimates that have covariance matrix ΣXj. As with the standard errors in the univariable case, the ΣXjs are assumed known. In practice, they can be estimated either using individual-level data or from GWAS summary statistics (see, for example, Sanderson et al.34 and Grant et al.35). We fit the model

βˆYjN(θβXj·+αj,σYj2)
βˆXj·N(βXj·,ΣXj),

where the kth element of the vector θ represents the direct causal effect of the kth exposure on the outcome. We place a joint prior on the (βXj·,αj) as follows:

(βXj·αj)N((μ0),(VXdiag(VX)1/2ρjφjτ{diag(VX)1/2}ρjφjτφj2τ2)),

and we use the following priors:

μN(0,IK)
VXN+(0,IK)
θkN(0,1)
φjCauchy+(0,1)
τCauchy+(0,1).

As in the univariable case, this parametrization puts a horseshoe shrinkage prior on the αjs with global shrinkage controlled by the τ parameter. The correlation between the genetic variant-exposure effects and the pleiotropic effects are accounted for via the ρj parameters, where, again, we set ρj=2rj1 with rjBeta(10,10).

As before, we have implemented the method using BUGS and the rjags package in R, with 10,000 samples taken from the posterior distribution (with a burn-in of 10,000 iterations) from 3 chains. In all cases reported in the simulation study, R-hat values for the parameters of interest were close to 1, indicating good convergence. We call the multivariable method MVMR-Horse.

Simulation study

Primary simulation parametrization

In order to examine the performance of our method in comparison with alternatives, we generated data based on the simulation parametrization of Xue et al.,16 with some parameters adapted to ensure levels of instrument strength and variation of the exposure explained that is typical of realistic settings. The data generating mechanism is

U=j=1JγjGj+εU
X=j=1Jβ˜XjGj+U+εX
Y=θX+j=1Jα˜j+U+εY,

where GjBinomial(2,fj), fjUniform(0.1,0.3), εU,εYN(0,1), and εXN(0,ξ2), independently. For each j, β˜Xj was generated uniformly on the interval (0.15,0.05)(0.05,0.15), γjUniform(0.1,0.1), and δX=δY=1. We considered scenarios where either 20%, 40%, or 60% of variants were invalid instruments. Unlike Xue et al.,16 we considered both balanced and directional pleiotropy scenarios. In the balanced pleiotropy case, αjUniform(0.2,0.2), and in the directional pleiotropy case, αjUniform(0.1,0.3), when the jth variant was invalid, and in both cases αj=0 when the jth variant was valid. The causal effect was either θ=0.1 or 0. Note that under this parametrization, InSIDE was violated in all scenarios. In each of 1,000 replications, two independent samples of sizes nX and nY were generated. The genetic association estimates with the exposure were taken in the sample of size nX and with the outcome in that of size nY. The association estimates and their standard errors were obtained using simple linear regression of the exposure and outcome on each genetic variant in turn.

We considered three primary settings. In the first, we set J=100, ξ2=1, and nX=nY=50,000. This generated strong genetic instruments (with mean F-statistic across all scenarios between 77.4 and 85.0), which explained a high proportion of the variation in the exposure (with mean R2 value across all scenarios between 15.4% and 16.9%). In the second setting, we set J=100, ξ2=4, nX=5,000, and nY=50,000. This generated weak genetic instruments (mean F-statistic between 4.4 and 4.7) but that still explained a reasonably high proportion of the variance in the exposure (mean R2 between 8.6% and 9.3%). In the third setting, we set J=20, ξ2=1, and nX=nY=50,000. This generated strong genetic instruments (mean F-statistic between 87.7 and 98.2) that explained a reasonably low proportion of the variance in the exposure (mean R2 between 3.5% and 3.9%).

Comparison methods

We compared our approach with standard IVW estimation and also an oracle estimator that performed IVW using only the truly valid instruments. We used a weighted median method, MR-Median, which requires half of the variants to be valid but does not make the InSIDE assumption nor requires balanced pleiotropy. We used BWMR to compare with another Bayesian approach, noting that this method assumes InSIDE. Finally, we compared with MR-cML as well as an extension to this approach, MR-cML-DP. These latter two methods both make the plurality valid assumption and aim to shrink estimates of the pleiotropic effects pertaining to valid instruments toward zero. The extension uses data perturbation (DP) to produce multiple perturbed samples (the default, which we use, being 200 samples) and then takes averages across the samples for estimation of the causal effect and standard error. This significantly increases the computational time but has been shown to reduce bias and type I error rates.

Other common methods not considered include MR-Egger, MR-RAPS, and MR-PRESSO (InSIDE and balanced pleiotropy) and also ConMix, MR-Mix, MR-Mode, and MR-Lasso (plurality valid, can handle some directional pleiotropy). In the simulation study of Xue et al.,16 MR-cML generally outperformed each of these methods in terms of bias, power, and type I error rates. Because our simulation study is largely based on that of Xue et al.,16 it is expected that MR-cML would be the best performing out of these, and thus for brevity we don’t compare with all these methods in our study. Note that MR-cML also performed favorably compared with MR-Median in the study of Xue et al.;16 however, we retain MR-Median here so we have another comparison method that does not require balanced pleiotropy or InSIDE.

Genetic variants acting on the exposure via a single mechanism

Under the primary simulation parametrization, each genetic variant acts directly on the exposure, and each invalid genetic variant also acts on the exposure indirectly via the unmeasured exposure-outcome confounders. To explore a more parsimonious setting where genetic variants act on the exposure through only one of these mechanisms, we repeated the simulation study in the first setting (that is, with J=100, ξ2=1, and nX=nY=50,000), but where either βXj=0 or γj=0, with an equal probability of both, for all j relating to invalid instruments. In addition to the methods used in the primary simulation study, we also performed MR-CUE, using its default settings, in this scenario, which explicitly makes this assumption.

Genome-wide simulation study

We performed a simulation study that simulated a set of summary statistics representing genetic associations from independent variants across the genome, in order to compare our approach with CAUSE. We simulated J=100,000 genetic variant associations such that either p=100 or 20 variants were expected to have non-zero association with the outcome, setting the heritabilities of X and Y to hX=hY=0.3, respectively. We considered scenarios where either 20%, 40%, or 60% of these variants were expected to be pleiotropic. To do this, we first simulated β¯Xj, α¯j, and γ¯j from the joint mixture distribution

(β¯Xjα¯jγ¯j)i=13πiN((000),(cXi000cαi000cγi)),

where

(cX1,cX2,cX3)=(hXp,2hX3p,0),
(cα1,cα2,cα3)=(0,hYθ2hXp,0),
(cγ1,cγ2,cγ3)=(0,hX3p,0),and
(π1,π2,π3)=(p(1q)J,pqJ,1pJ),

and q is the proportion of variants associated with the exposure that are invalid. These were converted to non-standardized effect sizes by βXj=(β¯Xj+γ¯j)/2fj(1fj) and αj=(α¯j+γ¯j)/2fj(1fj). The genetic association estimates were then generated by βˆXjN(βXj,s2), βˆYjN(θβXj+αj,sj2), where sj2=1/(2nfj(1fj)), n=50,000. We considered scenarios where θ=0.1 or 0.

For each variant, we computed a p value using βˆXj and sj. We performed the same methods used in the primary simulation, taking as inputs variants with p value <5×108 (that is, which pass the conventional genome-wide significance threshold). We also performed CAUSE using all simulated genetic variants, and with its default threshold p value <0.001 as classifying that a variant is associated with the exposure. Due to the extra computational time required to run the simulations, we restricted this simulation study to 200 replications per scenario.

Multivariable simulation study

To compare methods in the multivariable setting, we simulated from the same model as in the primary simulation with J=100 and nX=nY, but where there were two exposures generated according to

Xk=j=1Jβ˜XjkGj+U+εXk
Y=k=12θkXk+j=1Jα˜j+U+εY,

for k=1,2, where θ1=0.1 or 0, θ2=0.1, and (εXk,εXk) were simulated from the bivariate normal distribution with marginal variances of 1 and covariance of 0.3.

As well as the multivariable IVW and multivariable IVW-oracle methods, we also compared with the multivariable median approach (MVMR-Median), GRAPPLE, and the constrained maximum likelihood approach with data perturbation (with 100 perturbed samples), which we denote MVMR-cML-DP. Note that for the methods that require the full covariance matrices of the genetic variant-exposures association estimates (that is, GRAPPLE, MVMR-cML-DP, and MVMR-Horse), we input these assuming that the traits are independent, even though they are not. This is to reflect the typical scenario where only GWAS summary statistics are available without trait correlation estimates. Due to the extra computational time required to run the simulation, we restricted this simulation study to 200 replications per scenario.

Applied examples

In a first applied example, we considered combinations of four exposures (LDL cholesterol [LDL], triglycerides [TGs], fasting glucose [FG], and birth weight [BW]) on three outcomes (coronary artery disease [CAD], type 2 diabetes [T2D], and asthma). Each of these exposure-outcome relationships were considered by both Morrison et al.6 and Xue et al.,16 some with conflicting results. Following Morrison et al.,6 we classify the exposure-outcome pairs as either having a causal effect supported by the literature (LDL-CAD, TG-CAD, FG-T2D), having unknown or conflicting evidence of a causal effect (BW-CAD, TG-T2D, FG-CAD, BW-T2D, LDL-T2D), or having implausible or unsupported evidence of a causal effect (LDL-Asthma, FG-Asthma, BW-Asthma, TG-Asthma).

We analyzed each of the exposure-outcome pairs using two-sample Mendelian randomization from publicly available GWAS results, as shown in Table 1. All genetic associations were estimated in samples of individuals of European, or predominantly European, ancestry. Note that some of these GWAS datasets differ from those used in Morrison et al.6 and Xue et al.,16 and so there are some differences in the results that we report here.

Table 1.

Sources of genetic association estimates for the exposure and outcome traits considered in the applied examples

Trait Study Sample size Instruments
LDL cholesterol Willer et al.36 188,577 79
Triglycerides Willer et al.36 188,577 55
Fasting glucose Chen et al.37 281,416 70
Birth weight Horikoshi et al.38 153,781 50
Intelligence Hill et al.39 248,482 213
Years of education Okbay et al.40 293,723 213
Household income Neale lab 361,194 213
Coronary artery disease Nikpay et al.41 184,305
Type 2 diabetes Mahajan et al.42 898,130
Asthma Demenais et al.43 142,486
Alzheimer disease Lambert et al.44 54,162

Instruments is the number of genetic instruments for each exposure trait used in the primary analyses. Genetic association estimates from the Neale lab are sourced from http://www.nealelab.is/uk-biobank/.

For each exposure, we chose a set of instruments by first selecting all genetic variants that associated with the trait with a p value <5×108, then pruned these to have r2<0.001. The number of selected variants for each exposure is shown in Table 1. The set of variants was then harmonized with each outcome dataset, and only variants that appeared in both exposure and outcome datasets were retained. The F-statistics across each of the 12 exposure-outcome combinations ranged between 51.6 and 172.4. We performed analysis using the IVW, CAUSE, MR-cML, MR-cML-DP, and MR-Horse methods. We used the TwoSampleMR R package45 for the pruning and harmonization procedures and the MendelianRandomization R package46,47 for implementing the IVW, MR-cML, and MR-cML-DP methods.

In a second applied example, we considered a multivariable scenario with conditionally weak instruments. We re-analyzed the example described by Grant and Burgess,31 and also considered by Woolf et al.,48 which looked at the effects of intelligence, years of education, and household income on the risk of Alzheimer disease. The phenotype referred to here as intelligence is an individual’s score on a verbal-numerical reasoning test used as a proxy for general cognitive function, as described in Hill et al.39 and Davies et al.49 The effects of intelligence and years of education on Alzheimer disease were first considered by Davies et al.49 using summary statistics from the GWAS of Hill et al.39 (intelligence), Okbay et al.40 (years of education), and Lambert et al.44 (Alzheimer disease). The genetic instruments were selected as those that associate with either intelligence or years of education with a p value below the genome-wide significance level of 5×108. In Grant and Burgess,31 household income was also included using genetic associations for these same genetic variants taken from the UK Biobank sourced from the Neale Lab (http://www.nealelab.is/uk-biobank/). The 213 genetic variants were strong predictors of the exposures separately, with univariable F-statistics all over the conventional threshold of 10. However, the conditional F-statistics were 2.4 (intelligence), 1.6 (years of education), and 1.6 (household income), and so the variants were conditionally weak instruments.

In a supplementary analysis, we repeated this second applied example restricting the set of genetic variants to those that associate with at least one of the three exposures with a p value below 5×1010. This approach selected 49 instruments that were separately very strong predictors of the exposures, with univariable F-statistics all over 18. However, the conditional F-statistics were each still quite low (between 1.8 and 2.2) due to the high correlation between the traits.

Results

Simulation study results

The mean estimates, standard deviation of estimates, coverage, and power/type I error rates for each scenario are shown in Table 2 (J=100, strong instruments), Table 3 (J=100, weak instruments), Table 4 (J=20, strong instruments), Table 5 (multivariable setting, showing results relating to estimates of θ1 only), Table S1 (single genetic mechanism), and Table S2 (genome-wide study). For MR-Horse and MVMR-Horse, we computed coverage as the proportion of replications for which the true causal effect was within the 95% credible interval and rejection rate as the proportion of replications for which the 95% credible interval did not contain zero. For the other methods, except for CAUSE, we used the confidence intervals (or credible intervals) produced by their respective software packages to evaluate coverage and rejection rates in the same way (where 95% confidence intervals for BWMR and MR-CUE were produced using the effect estimates and standard errors given by their respective package). For CAUSE, we evaluated coverage using the 95% credible interval and the rejection rate as the proportion of replications for which the one-sided p value (comparing the causal model to the non-causal sharing model) was less than 0.05.

Table 2.

Results of simulations for the scenarios with 100 strong genetic instruments

20% invalid 40% invalid 60% invalid
Method Mean SD Cov. Rej. Mean SD Cov. Rej. Mean SD Cov. Rej.

θ=0.1, balanced pleiotropy

IVW 0.156 0.061 0.789 0.807 0.207 0.081 0.673 0.780 0.254 0.093 0.558 0.805
IVW-Oracle 0.098 0.013 0.953 1.000 0.100 0.015 0.953 1.000 0.098 0.019 0.962 0.994
MR-Median 0.107 0.020 0.911 1.000 0.125 0.032 0.737 0.998 0.157 0.059 0.507 0.984
BWMR 0.100 0.017 0.943 0.996 0.196 0.085 0.691 0.768 0.256 0.093 0.560 0.816
MR-cML 0.100 0.014 0.928 1.000 0.101 0.020 0.870 0.999 0.098 0.030 0.774 0.979
MR-cML-DP 0.098 0.014 0.968 1.000 0.099 0.019 0.965 0.999 0.093 0.028 0.948 0.871
MR-Horse 0.100 0.014 0.982 1.000 0.105 0.020 0.987 0.999 0.111 0.030 0.976 0.905

θ=0.1, directional pleiotropy

IVW 0.158 0.078 0.817 0.627 0.207 0.100 0.778 0.588 0.256 0.117 0.704 0.602
IVW-Oracle 0.098 0.014 0.953 1.000 0.099 0.017 0.945 1.000 0.098 0.021 0.937 0.999
MR-Median 0.108 0.022 0.893 0.999 0.126 0.033 0.729 0.997 0.161 0.065 0.512 0.989
BWMR 0.100 0.016 0.941 1.000 0.166 0.097 0.815 0.745 0.256 0.117 0.707 0.606
MR-cML 0.100 0.015 0.921 1.000 0.102 0.021 0.848 1.000 0.103 0.031 0.773 0.980
MR-cML-DP 0.099 0.015 0.964 1.000 0.100 0.020 0.944 0.997 0.099 0.029 0.942 0.923
MR-Horse 0.101 0.015 0.975 1.000 0.106 0.021 0.981 0.995 0.116 0.031 0.974 0.911

θ=0, balanced pleiotropy

IVW 0.055 0.061 0.782 0.218 0.106 0.078 0.687 0.313 0.152 0.091 0.573 0.427
IVW-Oracle 0.000 0.013 0.948 0.052 0.000 0.014 0.961 0.039 0.000 0.018 0.954 0.046
MR-Median 0.009 0.020 0.891 0.109 0.024 0.029 0.721 0.279 0.056 0.057 0.502 0.498
BWMR 0.001 0.017 0.935 0.065 0.096 0.084 0.718 0.282 0.153 0.089 0.573 0.427
MR-cML 0.000 0.013 0.925 0.075 0.000 0.018 0.889 0.111 0.000 0.027 0.792 0.208
MR-cML-DP 0.000 0.013 0.975 0.025 0.000 0.017 0.972 0.028 0.000 0.024 0.965 0.035
MR-Horse 0.001 0.014 0.987 0.013 0.004 0.018 0.988 0.012 0.011 0.028 0.977 0.023

θ=0, directional pleiotropy

IVW 0.060 0.076 0.830 0.170 0.113 0.102 0.747 0.253 0.152 0.119 0.706 0.294
IVW-Oracle 0.001 0.013 0.953 0.047 0.001 0.015 0.955 0.045 0.000 0.018 0.962 0.038
MR-Median 0.011 0.020 0.892 0.108 0.027 0.031 0.702 0.298 0.057 0.058 0.480 0.520
BWMR 0.002 0.015 0.942 0.058 0.065 0.098 0.815 0.185 0.150 0.118 0.712 0.288
MR-cML 0.001 0.014 0.924 0.076 0.003 0.018 0.887 0.113 0.003 0.026 0.808 0.192
MR-cML-DP 0.001 0.013 0.981 0.019 0.003 0.016 0.973 0.027 0.003 0.023 0.974 0.026
MR-Horse 0.002 0.014 0.984 0.016 0.007 0.018 0.993 0.007 0.014 0.026 0.985 0.015

Reported is the mean estimate (mean), standard deviation of estimates (SD), the proportion of replications that the 95% confidence interval (or credible interval in the case of MR-Horse) contained the true causal effect (Cov.), and the proportion of replications that the 95% confidence interval (or credible interval) did not contain zero (Rej.).

Table 3.

Results of simulations for the scenarios with 100 weak genetic instruments

20% invalid 40% invalid 60% invalid
Method Mean SD Cov. Rej. Mean SD Cov. Rej. Mean SD Cov. Rej.

θ=0.1, balanced pleiotropy

IVW 0.123 0.054 0.894 0.701 0.160 0.073 0.830 0.674 0.200 0.083 0.726 0.699
IVW-Oracle 0.076 0.013 0.535 1.000 0.076 0.015 0.623 1.000 0.077 0.019 0.735 0.986
MR-Median 0.077 0.020 0.720 0.983 0.086 0.029 0.809 0.958 0.114 0.051 0.797 0.938
BWMR 0.099 0.029 0.916 0.989 0.190 0.098 0.775 0.641 0.254 0.105 0.647 0.698
MR-cML 0.104 0.020 0.883 1.000 0.267 0.433 0.725 0.955 1.313 0.673 0.106 0.959
MR-cML-DP 0.094 0.017 0.978 0.998 0.116 0.140 0.968 0.942 0.717 0.639 0.568 0.727
MR-Horse 0.093 0.017 0.964 0.999 0.093 0.022 0.983 0.957 0.106 0.034 0.988 0.805

θ=0.1, directional pleiotropy

IVW 0.123 0.068 0.909 0.518 0.168 0.091 0.845 0.509 0.204 0.101 0.824 0.527
IVW-Oracle 0.076 0.012 0.492 1.000 0.077 0.015 0.656 0.998 0.076 0.019 0.742 0.977
MR-Median 0.075 0.019 0.695 0.991 0.088 0.029 0.836 0.962 0.113 0.054 0.815 0.938
BWMR 0.097 0.019 0.931 0.999 0.168 0.111 0.835 0.636 0.255 0.129 0.780 0.506
MR-cML 0.103 0.019 0.891 1.000 0.249 0.452 0.756 0.961 1.392 0.918 0.153 0.939
MR-cML-DP 0.093 0.017 0.985 0.997 0.115 0.154 0.978 0.951 0.634 0.718 0.680 0.691
MR-Horse 0.092 0.016 0.977 1.000 0.095 0.021 0.992 0.963 0.106 0.036 0.988 0.745

θ=0, balanced pleiotropy

IVW 0.045 0.053 0.828 0.172 0.084 0.069 0.754 0.246 0.120 0.082 0.660 0.340
IVW-Oracle 0.000 0.011 0.946 0.054 0.000 0.013 0.957 0.043 0.000 0.016 0.955 0.045
MR-Median 0.006 0.017 0.936 0.064 0.016 0.025 0.832 0.168 0.038 0.050 0.664 0.336
BWMR 0.001 0.018 0.939 0.061 0.094 0.092 0.783 0.217 0.152 0.103 0.667 0.333
MR-cML 0.000 0.016 0.882 0.118 0.072 0.371 0.789 0.211 0.950 1.186 0.167 0.833
MR-cML-DP 0.000 0.015 0.996 0.004 0.001 0.043 0.994 0.006 0.378 0.693 0.753 0.247
MR-Horse 0.001 0.014 0.986 0.014 0.004 0.019 0.992 0.008 0.011 0.030 0.985 0.015

θ=0, directional pleiotropy

IVW 0.040 0.067 0.873 0.127 0.079 0.091 0.829 0.171 0.127 0.106 0.739 0.261
IVW-Oracle 0.000 0.011 0.947 0.053 0.001 0.013 0.967 0.033 0.000 0.016 0.964 0.036
MR-Median 0.005 0.017 0.948 0.052 0.015 0.025 0.844 0.156 0.042 0.054 0.643 0.357
BWMR 0.001 0.016 0.945 0.055 0.054 0.103 0.872 0.128 0.157 0.133 0.745 0.255
MR-cML 0.000 0.016 0.878 0.122 0.036 0.349 0.834 0.166 0.969 1.319 0.210 0.790
MR-cML-DP 0.000 0.015 0.990 0.010 0.004 0.061 0.996 0.004 0.326 0.725 0.802 0.198
MR-Horse 0.001 0.014 0.980 0.020 0.005 0.018 0.998 0.002 0.014 0.028 0.988 0.012

Reported is the mean estimate (mean), standard deviation of estimates (SD), the proportion of replications that the 95% confidence interval (or credible interval in the case of MR-Horse) contained the true causal effect (Cov.), and the proportion of replications that the 95% confidence interval (or credible interval) did not contain zero (Rej.).

Table 4.

Results of simulations for the scenarios with 20 strong genetic instruments

20% invalid 40% invalid 60% invalid
Method Mean SD Cov. Rej. Mean SD Cov. Rej. Mean SD Cov. Rej.

θ=0.1, balanced pleiotropy

IVW 0.150 0.135 0.897 0.307 0.190 0.178 0.869 0.262 0.244 0.209 0.853 0.277
IVW-Oracle 0.100 0.029 0.965 0.899 0.100 0.034 0.959 0.819 0.099 0.041 0.970 0.610
MR-Median 0.109 0.048 0.919 0.750 0.129 0.086 0.806 0.716 0.198 0.198 0.631 0.730
BWMR 0.135 0.120 0.895 0.456 0.188 0.177 0.870 0.263 0.248 0.207 0.847 0.277
MR-cML 0.101 0.033 0.919 0.905 0.100 0.042 0.874 0.807 0.096 0.061 0.802 0.632
MR-cML-DP 0.097 0.032 0.951 0.806 0.093 0.040 0.935 0.600 0.085 0.053 0.914 0.377
MR-Horse 0.102 0.033 0.989 0.701 0.105 0.044 0.990 0.395 0.122 0.084 0.983 0.211

θ=0.1, directional pleiotropy

IVW 0.157 0.166 0.902 0.265 0.199 0.221 0.911 0.188 0.261 0.268 0.865 0.230
IVW-Oracle 0.099 0.030 0.957 0.891 0.099 0.035 0.956 0.763 0.097 0.042 0.970 0.568
MR-Median 0.109 0.053 0.893 0.749 0.132 0.093 0.808 0.712 0.201 0.204 0.599 0.713
BWMR 0.134 0.128 0.896 0.528 0.195 0.216 0.885 0.213 0.262 0.265 0.875 0.220
MR-cML 0.101 0.033 0.924 0.909 0.101 0.042 0.876 0.780 0.098 0.059 0.817 0.628
MR-cML-DP 0.097 0.033 0.957 0.830 0.095 0.041 0.934 0.608 0.089 0.053 0.911 0.401
MR-Horse 0.102 0.034 0.983 0.731 0.106 0.044 0.991 0.386 0.122 0.076 0.991 0.183

θ=0, balanced pleiotropy

IVW 0.046 0.132 0.894 0.106 0.100 0.176 0.882 0.118 0.147 0.204 0.836 0.164
IVW-Oracle −0.001 0.027 0.969 0.031 0.000 0.032 0.959 0.041 0.000 0.040 0.965 0.035
MR-Median 0.009 0.044 0.915 0.085 0.033 0.093 0.791 0.209 0.093 0.181 0.587 0.413
BWMR 0.032 0.118 0.888 0.112 0.097 0.176 0.877 0.123 0.147 0.204 0.843 0.157
MR-cML −0.001 0.029 0.939 0.061 0.000 0.039 0.890 0.110 0.000 0.055 0.835 0.165
MR-cML-DP −0.001 0.027 0.975 0.025 0.000 0.035 0.955 0.045 0.000 0.047 0.942 0.058
MR-Horse 0.000 0.030 0.985 0.015 0.005 0.041 0.994 0.006 0.017 0.071 0.984 0.016

θ=0, directional pleiotropy

IVW 0.047 0.174 0.883 0.117 0.096 0.227 0.888 0.112 0.153 0.268 0.871 0.129
IVW-Oracle −0.001 0.028 0.957 0.043 0.001 0.031 0.961 0.039 0.000 0.040 0.963 0.037
MR-Median 0.008 0.048 0.907 0.093 0.031 0.087 0.813 0.187 0.097 0.216 0.591 0.409
BWMR 0.025 0.131 0.886 0.114 0.088 0.219 0.884 0.116 0.152 0.265 0.877 0.123
MR-cML 0.000 0.030 0.932 0.068 0.002 0.037 0.901 0.099 0.003 0.055 0.827 0.173
MR-cML-DP 0.000 0.028 0.969 0.031 0.001 0.034 0.958 0.042 0.002 0.047 0.938 0.062
MR-Horse 0.001 0.031 0.991 0.009 0.006 0.040 0.993 0.007 0.019 0.071 0.993 0.007

Reported is the mean estimate (mean), standard deviation of estimates (SD), the proportion of replications that the 95% confidence interval (or credible interval in the case of MR-Horse) contained the true causal effect (Cov.), and the proportion of replications that the 95% confidence interval (or credible interval) did not contain zero (Rej.).

Table 5.

Results of simulations for the multivariable scenarios (relating to the causal effect of X1 only)

20% invalid 40% invalid 60% invalid
Method Mean SD Cov. Rej. Mean SD Cov. Rej. Mean SD Cov. Rej.

θ1=0.1, balanced pleiotropy

IVW 0.155 0.066 0.785 0.805 0.204 0.078 0.720 0.765 0.229 0.088 0.670 0.700
IVW-Oracle 0.100 0.014 0.950 1.000 0.098 0.016 0.965 1.000 0.097 0.020 0.970 0.990
MVMR-Median 0.107 0.020 0.935 1.000 0.117 0.027 0.845 1.000 0.137 0.047 0.700 0.975
GRAPPLE 0.109 0.032 0.990 0.775 0.169 0.068 0.875 0.685 0.213 0.088 0.795 0.660
MVMR-cML-DP 0.101 0.015 0.975 1.000 0.099 0.020 0.945 1.000 0.096 0.031 0.935 0.850
MVMR-Horse 0.102 0.015 0.980 1.000 0.105 0.020 1.000 0.990 0.114 0.032 0.985 0.880

θ1=0.1, directional pleiotropy

IVW 0.154 0.077 0.855 0.620 0.208 0.109 0.790 0.585 0.229 0.105 0.805 0.535
IVW-Oracle 0.098 0.014 0.955 1.000 0.100 0.017 0.965 1.000 0.098 0.022 0.940 0.990
MVMR-Median 0.105 0.019 0.970 1.000 0.119 0.032 0.820 0.995 0.142 0.051 0.670 0.965
GRAPPLE 0.109 0.031 0.995 0.680 0.175 0.087 0.900 0.510 0.217 0.103 0.830 0.490
MVMR-cML-DP 0.100 0.016 0.975 1.000 0.103 0.020 0.955 0.990 0.106 0.030 0.950 0.935
MVMR-Horse 0.101 0.015 0.975 1.000 0.109 0.021 0.985 0.995 0.121 0.034 0.970 0.870

θ1=0, balanced pleiotropy

IVW 0.055 0.061 0.770 0.230 0.103 0.078 0.690 0.310 0.138 0.086 0.705 0.295
IVW-Oracle −0.001 0.013 0.975 0.025 0.003 0.017 0.925 0.075 −0.004 0.020 0.950 0.050
MVMR-Median 0.004 0.017 0.960 0.040 0.020 0.028 0.775 0.225 0.030 0.042 0.750 0.250
GRAPPLE 0.006 0.027 1.000 0.000 0.070 0.068 0.835 0.165 0.118 0.083 0.775 0.225
MVMR-cML-DP −0.001 0.014 0.970 0.030 0.002 0.020 0.955 0.045 −0.008 0.028 0.930 0.070
MVMR-Horse 0.000 0.014 1.000 0.000 0.008 0.022 0.995 0.005 0.007 0.030 0.980 0.020

θ1=0, directional pleiotropy

IVW 0.049 0.072 0.870 0.130 0.096 0.104 0.795 0.205 0.133 0.115 0.810 0.190
IVW-Oracle 0.000 0.014 0.940 0.060 −0.002 0.017 0.960 0.040 −0.001 0.021 0.960 0.040
MVMR-Median 0.006 0.020 0.945 0.055 0.014 0.030 0.830 0.170 0.031 0.041 0.770 0.230
GRAPPLE 0.011 0.028 1.000 0.000 0.059 0.083 0.915 0.085 0.112 0.111 0.840 0.160
MVMR-cML-DP 0.001 0.016 0.950 0.050 0.000 0.020 0.955 0.045 0.002 0.031 0.910 0.090
MVMR-Horse 0.002 0.016 0.975 0.025 0.005 0.022 0.990 0.010 0.014 0.031 0.965 0.035

Reported is the mean estimate (mean), standard deviation of estimates (SD), the proportion of replications that the 95% confidence interval (or credible interval in the case of MVMR-Horse) contained the true causal effect (Cov.), and the proportion of replications that the 95% confidence interval (or credible interval) did not contain zero (Rej.).

When genetic instruments were strong and explained a large proportion of the variance in the exposure, the mean estimates from MR-cML, MR-cML-DP, and MR-Horse were close to IVW-Oracle at all levels of pleiotropy. The IVW and BWMR methods showed considerable bias from 40% pleiotropy, as did MR-Median at 60% pleiotropy. Only MR-cML-DP and MR-Horse retained nominal coverage and type I error rates in all scenarios. In general, MR-cML-DP showed slightly higher power, whereas MR-Horse showed slightly higher coverage rates. These results were similar in the case where this setting was repeated but with genetic variants acting on the exposure only via a single mechanism. Furthermore, MR-CUE, which assumes the single mechanism and allows for correlated pleiotropy, showed substantial bias and inflated type I error rates in these settings.

When the genetic instruments were weak, the estimates from IVW and IVW-Oracle attenuated to the null, which is the expected behavior of these methods with weak instruments. MR-Median and BWMR showed bias and inflated type I error rates across all medium to high-level pleiotropy settings. MR-Median does not account for the uncertainty in the genetic variant-exposure association estimates and so is not robust to weak instruments. BWMR does account for this uncertainty, but does not allow for InSIDE violations. MR-cML also showed considerable bias and inflated type I error rates in the medium to high-pleiotropy settings, as did MR-cML-DP in the high-pleiotropy settings. On the other hand, MR-Horse had low levels of bias and had type I error rates below the nominal level in all scenarios.

One possible explanation for the performance of the constrained maximum likelihood methods with weak and pleiotropic instruments is the difficulty in selecting the invalid instruments in these settings. These approaches rely on a parameter (which we denote K˜) representing the number of invalid instruments, which determines the constraint placed on the likelihood maximization. Xue et al.16 showed that the methods will asymptotically estimate the correct value of K˜ but noted that this consistency in selection may not be obtained when there are weak and pleiotropic instruments. Here, we implemented their recommended model averaging approach, which fits the model for all possible values of K˜ and computes a weighted average of the effect estimates with weights determined by the Bayesian information criterion for each fit. In a further supplementary simulation study, we re-analyzed the simulated data from the weak instruments scenario but fixed the values of K˜ instead of using model averaging. We set the value of K˜ at 20, 40, and 60 for each setting regardless of the true level of pleiotropy. The results are shown in Tables S3 and S4. Setting the value of K˜ to its true value substantially reduced the bias from these methods; however, the bias was still greater than that of MR-Horse in the non-null settings. Furthermore, MR-cML still had inflated type I error rates, and MR-cML-DP had very low power. Setting the value of K˜ conservatively to 60 led to very low power when the true number of invalid instruments was below 60. On the other hand, fixing K˜ lower than the true value (for example, setting K˜=20 when the true value was 40 or 60) resulted in substantial bias.

When the genetic instruments were strong but explained a low proportion of the variation in the exposure, the performances of each method were broadly similar to in the first setting but with lower power across the board. At the high pleiotropy levels, MR-cML-DP had type I error rates above the nominal level, whereas MR-Horse did not. In the genome-wide simulation study, MR-Horse had coverage and bias comparable to CAUSE, with substantially higher power in the causal settings. It again had slightly higher coverage and lower type I error rates compared with MR-cML-DP and outperformed all other methods across each metric. In the multivariable simulation study, MVMR-Horse outperformed IVW, MVMR-Median, and GRAPPLE in terms of bias, precision, and type I error rates in all scenarios. Similar to the univariable settings, MVMR-Horse retained type I error rates below the nominal level in all scenarios, with the trade-off of lower power compared with MVMR-cML-DP. MVMR-cML-DP had inflated type I error rates in the higher pleiotropy scenarios.

Overall, the results from the simulation study show that MR-Horse and MVMR-Horse have levels of performance similar to MR-cML-DP and MVMR-cML-DP when genetic instruments are strong and explain a high amount of the variation in the exposure. However, particularly at high levels of pleiotropy, MR-Horse outperforms MR-cML-DP when instruments are weak and/or explain only a small amount of variation in the exposure. Although conservative, MR-Horse retained low bias and type I error rates below the nominal level in all scenarios considered and was the only method to do so out of those considered.

Applied examples results

For the univariable analyses, Figure 2 shows the estimated log odds ratio of each outcome per standard deviation increase in the genetically predicted levels of each exposure, as well as the 95% confidence intervals (IVW, MR-cML, MR-cML-DP) or 95% credible intervals (CAUSE, MR-Horse). For the exposure-outcome combinations with established causality, the results from each method were broadly in line across the different methods. For LDL-T2D, CAUSE did not show evidence of a causal effect. MR-Horse agreed with MR-cML(-DP) that increased LDL may reduce T2D risk, although with a 95% credible interval that came closer to the null. The other conflicting result was that CAUSE and MR-cML showed evidence of a causal effect of TGs on T2D risk, whereas MR-cML-DP and MR-Horse produced a null result. We note, however, that the one-sided p value from the CAUSE test comparing the causal model to the non-causal model was 0.066. Because MR-cML-DP and MR-Horse showed lower type I error rates in our simulation studies, one possible explanation for this difference is that MR-cML may be producing a false-positive result in this case. All methods showed a causal effect of BW on the risk of CAD and T2D, though with the estimates from CAUSE much closer to the null than the others. Finally, all methods gave a null result for the causal effect of LDL, TGs, and BW on the risk of asthma. However, MR-Horse showed a possible causal relationship between fasting glucose and risk of asthma, agreeing with MR-cML(-DP).

Figure 2.

Figure 2

Mendelian randomization estimates and 95% confidence intervals (IVW, MR-cML, MR-cML-DP) or 95% credible intervals (CAUSE, MR-Horse) for the different exposure-outcome combinations

Plots colored in orange indicate exposure-outcome relationships considered causal. Plots colored in yellow indicate exposure-outcome pairs with unknown causal relationships. Plots colored in blue indicate exposure-outcome relationships considered not causal. In each plot, the vertical dashed line indicates a log odds ratio of zero.

For the multivariable analysis with weak instruments, Table 6 shows the estimated log odds ratio of Alzheimer disease, as well as the 95% confidence intervals or credible intervals, from the multivariable IVW, MVMR-Median, MVMR-cML, MVMR-cML-DP, and MVMR-Horse methods. The reported estimates are per one standard deviation increase in genetically predicted intelligence and years of education and per unit increase in log odds ratio for a higher household income bracket. For the effect of intelligence on Alzheimer disease, the estimates were broadly in line across each method; however, MVMR-cML-DP had a very wide confidence interval. For the effects of years of education and household income, MVMR-cML and MVMR-cML-DP showed substantially different point estimates compared to the other methods and very wide confidence intervals. On the other hand, the effect estimates from MVMR-Horse were similar to those from MVMR-Median, though with an estimate in the opposite direction for years of education. These results from MVMR-Horse, including the directions of effects, are similar to the effect estimates reported in Grant and Burgess31 using the MVMR-Lasso method, but with wider confidence intervals (noting that MVMR-Lasso has been shown to produce unbiased effect estimates but with confidence intervals that are too narrow).

Table 6.

Mendelian randomization estimates for the effects of intelligence, years of education, and household income on Alzheimer disease using multivariable models

Method Intelligence Education Income
IVW −0.47 (−0.86, −0.07) −0.24 (−0.92, −0.43) 0.42 (−0.25, −1.08)
MVMR-Median −0.57 (−1.05, −0.10) −0.13 (−0.89, −0.62) 0.37 (−0.39, −1.12)
MVMR-cML −0.35 (−1.01, −0.32) −2.90 (−4.92, −0.87) 3.52 (−1.58, −5.46)
MVMR-cML-DP −0.73 (−2.12, −0.66) −0.39 (−5.27, −4.50) 1.08 (−3.52, −5.68)
MVMR-Horse −0.74 (−1.32, −0.15) −0.12 (−0.81, −1.06) 0.46 (−0.49, −1.39)
Univariable F-statistic 31.4 21.0 10.5
Conditional F-statistic 2.4 1.6 1.6

Reported are the estimated log odds ratios of Alzheimer disease and 95% confidence intervals (IVW, MVMR-Median, MVMR-cML, MVMR-cML-DP) or 95% credible intervals (MVMR-Horse). Also reported are the univariable and conditional F-statistics for each exposure.

The results of the multivariable analysis re-analyzed with unconditionally stronger instruments are shown in Table S5. Due to the reduced number of instruments, the confidence intervals and credible intervals are much wider than the original analysis. Although there are some differences in effect estimates, the confidence intervals and credible intervals are overlapping with the original ones, and the results from IVW, MVMR-Median, and MVMR-Horse are still broadly in line with each other. The results from MR-cML and MR-cML-DP are still producing substantially different effect estimates to the other results, with particularly wide confidence intervals.

Computational run time

In order to give an idea of the comparative computational load of the proposed methods, we measured the run time of the applied analyses on a machine running an Apple M1 Pro chip. To perform all 12 univariable analyses in the first applied example (with between 19 and 78 genetic instruments), MR-Horse took 141 s, compared with MR-cML-DP, which took 201 s. The run time for MVMR-Horse increased with multiple exposures and larger numbers of genetic variants. In the second applied example, the analysis with 3 exposures and 213 genetic variants took 996 s with MVMR-Horse compared to 220 s with MVMR-cML-DP.

Discussion

In this paper, we have presented a Bayesian framework for performing Mendelian randomization with summary-level data that is robust to general forms of instrument invalidity. The approach does not require the InSIDE assumption, and so it can handle correlated pleiotropy. The method is easily generalized to the multivariable Mendelian randomization setting, providing another robust method for multivariable studies. Our simulation studies showed that the approach has favorable performance in comparison with commonly used methods across a variety of settings.

Our approach has similarities to previously proposed Bayesian approaches but has been designed to avoid many of the restrictions that these methods pose. Berzuini et al.17 first proposed a Bayesian framework for Mendelian randomization with pleiotropy and suggested the use of the horseshoe prior to account for the presence of some valid and some invalid instruments. However, their method requires individual-level data and makes the InSIDE assumption. Zhao et al.18 and Bucur et al.19 developed versions of the Berzuini et al. approach using summary-level data, again requiring InSIDE to be satisfied. Cheng et al.20 introduced a correlation parameter into a Bayesian model to account for correlated pleiotropy; however, their model requires that pleiotropic effects are either all correlated or all uncorrelated with the variant-exposure effects. The methods of Morrison et al.6 and Cheng et al.21 are able to account for both correlated and uncorrelated pleiotropy, but they make a further assumption that the genetic instruments are associated with the exposure either entirely through the confounders or entirely through a direct effect. Our approach utilizes the horseshoe prior as suggested by Berzuini et al.,17 can be applied using summary-level data, and can account for both correlated and uncorrelated pleiotropy without any assumption on the pathways between the genetic instruments and exposure. Furthermore, we have shown how to extend the framework to be used for multivariable Mendelian randomization.

In comparison to frequentist approaches for Mendelian randomization, our method is based on the same causal model as that used by Xue et al.,16 which we believe is a very general model for instrument invalidity. With strong genetic instruments, the performance of our method in simulation studies was comparable to Xue et al.’s method when the latter was applied with the data perturbation extension in scenarios with low to mid-levels of pleiotropy. However, when the genetic instruments were weak and/or explained a small amount of variation in the exposure, MR-Horse outperformed all other methods considered. In practice, Mendelian randomization studies with weak instruments, both in the sense of low F-statistics and low proportion of variation in the exposure explained, are common, particularly with the sort of complex exposures that are being increasingly studied with this design. It is also likely that, particularly when many genetic variants are selected as instruments from high-powered GWASs, the proportion of invalid instruments is similar to the high pleiotropy scenarios considered in our simulation study. Thus, the simulation scenarios whose results are shown in Table 3 (weak instruments) and Table 4 (low proportion of variation explained) are likely to closely reflect many real-world settings. MR-Horse was the only method considered that retained low levels of bias and that showed coverage and type I error rates at their nominal levels in these cases.

The trade-off to the Bayesian approach retaining low type I error rates is that it sometimes had lower power than the frequentist approaches. The increased conservatism in the Bayesian approach supports the assertion that the horseshoe prior is able to avoid shrinkage of the pleiotropic effects pertaining to invalid instruments. Besides improved type I error control, another advantage of our framework is that it allows for the model specification to be adapted to specific settings. For example, the prior distribution of the global shrinkage parameter could be adjusted, or set at a fixed value, based on prior knowledge of the level and nature of pleiotropy. As long as these prior assumptions are reasonable, such incorporation of prior knowledge would be expected to reduce the uncertainty in the estimates and increase power.

Although the implementation of the method in this paper has used some weakly informative priors for the model parameters, one case where we have used a more informative prior is for the ρj parameters that model the correlation between the genetic variant-exposure associations and any pleiotropic effects. The justification for using a prior that places the majority of the mass close to zero is that this is consistent with our a priori assumption that some genetic instruments are valid and some are invalid. Those that are valid will have a true value of ρj of zero, as there are no pleiotropic effects. Relaxing this prior to, say, a uniform distribution on (1,1) would allow for all variant-exposure associations to have potentially substantial correlation; however, this would come at a cost of lower power. On the other hand, this prior could be strengthened by adding even more weight close to zero, possibly in combination with a stricter prior on the global shrinkage parameter, as discussed above. An interesting area for future development of the approach would be to explore the use of domain knowledge to further inform the model inputs.

Not taken into account in our model are sample overlap and linkage disequilibrium. Most of the methods that are commonly used for Mendelian randomization assume independent genetic variants, and they achieve this by pruning candidate instruments according to an r2 threshold. In general, the use of correlated variants is unlikely to add substantially more information to an analysis, and methods that are able to incorporate linkage disequilibrium tend to be very sensitive to mis-specification of genetic correlations.50 Nonetheless, there are situations where correlated variants may be desirable, such as in a cis-Mendelian randomization study exploring drug targets.51 In this case, correlation estimates could be incorporated into our model by specifying a joint distribution for the genetic variant-exposure association estimates and for the genetic variant-outcome association estimates that included genetic correlations, as is done by Cheng et al.21 to account for linkage disequilibrium.

Sample overlap is more likely to be an issue in practice, since many published GWAS results are from meta-analyses that include common data sources such as the UK Biobank.52 The models of Morrison et al.6 and Cheng et al.21 account for sample overlap by including correlation between βˆYj and βˆXj in a joint distribution. Our model could be adapted in a similar way to account for sample overlap if it was believed to be substantial.

Assumptions that we have made are of the linearity of the effect of the exposures on the outcome, and homogeneity or no effect modification. Furthermore, an implicit assumption that we have made in developing the method and simulation study is that all traits are continuous. In practice, summary-level Mendelian randomization methods can be performed with categorical traits, for example, by taking genetic association estimates from logistic regression. This may bias causal effect estimates because of the non-collapsibility of odds ratios; however, the bias will tend to be toward the null, at least in the univariable case.53 One further drawback to Bayesian methods generally is that they tend to be computationally intensive. However, in the univariable case, MR-Horse actually ran the applied analyses more quickly than MR-cML-DP. In the multivariable case, MVMR-Horse took longer to run the applied analyses than MVMR-cML-DP; however, the run time was not prohibitively slow, even with a fairly large set of genetic instruments.

Overall, our Bayesian framework is an effective and flexible method for both univariable and multivariable Mendelian randomization. It provides an important sensitivity analysis to these studies that is robust to violations of the InSIDE assumption and directional pleiotropy. Furthermore, it can be easily adapted to handle sample overlap or linkage disequilibrium.

Data and code availability

All the data used in this paper are publicly available and can be accessed via the references given. R code for performing the proposed methods, and for reproducing the simulation results and applied analyses, can be found at https://github.com/aj-grant/mrhorse.

Acknowledgments

S.B. is supported by the Wellcome Trust (225790/Z/22/Z) and the United Kingdom Research and Innovation Medical Research Council (MC_UU_00002/7). This research was supported by the National Institute for Health Research Cambridge Biomedical Research Centre (NIHR203312). The views expressed are those of the authors and not necessarily those of the National Institute for Health Research or the Department of Health and Social Care. For the purpose of open access, the author has applied a CC-BY public copyright license to any Author Accepted Manuscript version arising from this submission. The authors acknowledge the Sydney Informatics Hub and the University of Sydney’s high-performance computing cluster, Artemis, for providing the computing resources that have contributed to the results reported herein.

Declaration of interests

The authors declare no competing interests.

Published: January 4, 2024

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.ajhg.2023.12.002.

Supplemental information

Document S1. Tables S1–S5
mmc1.pdf (74.8KB, pdf)
Document S2. Article plus supplemental information
mmc2.pdf (1.9MB, pdf)

References

  • 1.Lawlor D.A., Harbord R.M., Sterne J.A.C., Timpson N., Davey Smith G. Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology. Stat. Med. 2008;27:1133–1163. doi: 10.1002/sim.3034. [DOI] [PubMed] [Google Scholar]
  • 2.Pierce B.L., Burgess S. Efficient Design for Mendelian Randomization Studies: Subsample and 2-Sample Instrumental Variable Estimators. Am. J. Epidemiol. 2013;178:1177–1184. doi: 10.1093/aje/kwt084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Greenland S. An introduction to instrumental variables for epidemiologists. Int. J. Epidemiol. 2000;29:722–729. doi: 10.1093/ije/29.4.722. [DOI] [PubMed] [Google Scholar]
  • 4.Brumpton B., Sanderson E., Heilbron K., Hartwig F.P., Harrison S., Vie G.Å., Cho Y., Howe L.D., Hughes A., Boomsma D.I., et al. Avoiding dynastic, assortative mating, and population stratification biases in Mendelian randomization through within-family analyses. Nat. Commun. 2020;11:3519. doi: 10.1038/s41467-020-17117-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bowden J., Davey Smith G., Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Morrison J., Knoblauch N., Marcus J.H., Stephens M., He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nat. Genet. 2020;52:740–747. doi: 10.1038/s41588-020-0631-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Burgess S., Butterworth A., Thompson S.G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 2013;37:658–665. doi: 10.1002/gepi.21758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Verbanck M., Chen C.Y., Neale B., Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 2018;50:693–698. doi: 10.1038/s41588-018-0099-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bowden J., Davey Smith G., Haycock P.C., Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kang H., Zhang A., Cai T.T., Small D.S. Instrumental variables estimation with some invalid instruments and its application to Mendelian randomization. J. Am. Stat. Assoc. 2016;111:132–144. [Google Scholar]
  • 11.Rees J.M.B., Wood A.M., Dudbridge F., Burgess S. Robust methods in Mendelian randomization via penalization of heterogeneous causal estimates. PLoS One. 2019;14:e0222362. doi: 10.1371/journal.pone.0222362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Burgess S., Foley C.N., Allara E., Staley J.R., Howson J.M.M. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat. Commun. 2020;11:376. doi: 10.1038/s41467-019-14156-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Qi G., Chatterjee N. Mendelian randomization analysis using mixture models for robust and efficient estimation of causal effects. Nat. Commun. 2019;10:1941. doi: 10.1038/s41467-019-09432-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hartwig F.P., Davey Smith G., Bowden J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int. J. Epidemiol. 2017;46:1985–1998. doi: 10.1093/ije/dyx102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Guo Z., Kang H., Tony Cai T., Small D.S. Confidence intervals for causal effects with invalid instruments by using two-stage hard thresholding with voting. J. R. Stat. Soc. Series B Stat. Methodol. 2018;80:793–815. [Google Scholar]
  • 16.Xue H., Shen X., Pan W. Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. Am. J. Hum. Genet. 2021;108:1251–1269. doi: 10.1016/j.ajhg.2021.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Berzuini C., Guo H., Burgess S., Bernardinelli L. A Bayesian approach to Mendelian randomization with multiple pleiotropic variants. Biostatistics. 2020;21:86–101. doi: 10.1093/biostatistics/kxy027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhao J., Ming J., Hu X., Chen G., Liu J., Yang C. Bayesian weighted Mendelian randomization for causal inference based on summary statistics. Bioinformatics. 2020;36:1501–1508. doi: 10.1093/bioinformatics/btz749. [DOI] [PubMed] [Google Scholar]
  • 19.Bucur I.G., Claassen T., Heskes T. Inferring the direction of a causal link and estimating its effect via a Bayesian Mendelian randomization approach. Stat. Methods Med. Res. 2020;29:1081–1111. doi: 10.1177/0962280219851817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Cheng Q., Qiu T., Chai X., Sun B., Xia Y., Shi X., Liu J. MR-Corr2: a two-sample Mendelian randomization method that accounts for correlated horizontal pleiotropy using correlated instrumental variants. Bioinformatics. 2022;38:303–310. doi: 10.1093/bioinformatics/btab646. [DOI] [PubMed] [Google Scholar]
  • 21.Cheng Q., Zhang X., Chen L.S., Liu J. Mendelian randomization accounting for complex correlated horizontal pleiotropy while elucidating shared genetic etiology. Nat. Commun. 2022;13:6490. doi: 10.1038/s41467-022-34164-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Carvalho C.M., Polson N.G., Scott J.G. The horseshoe estimator for sparse signals. Biometrika. 2010;97:465–480. [Google Scholar]
  • 23.Piironen J., Vehtari A. In: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. Singh A., Zhu J., editors. Vol. 54. Proc. Mach. Learn. Res.; 2017. On the Hyperprior Choice for the Global Shrinkage Parameter in the Horseshoe Prior; pp. 905–913. [Google Scholar]
  • 24.Plummer M. 2022. rjags: Bayesian Graphical Models using MCMC. R package version 4-13. [Google Scholar]
  • 25.Burgess S., Thompson S.G. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am. J. Epidemiol. 2015;181:251–260. doi: 10.1093/aje/kwu283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Burgess S., Dudbridge F., Thompson S.G. Re: “Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects”. Am. J. Epidemiol. 2015;181:290–291. doi: 10.1093/aje/kwv017. [DOI] [PubMed] [Google Scholar]
  • 27.Sanderson E., Davey Smith G., Windmeijer F., Bowden J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int. J. Epidemiol. 2019;48:713–727. doi: 10.1093/ije/dyy262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zuber V., Colijn J.M., Klaver C., Burgess S. Selecting likely causal risk factors from high-throughput experiments using multivariable Mendelian randomization. Nat. Commun. 2020;11:29. doi: 10.1038/s41467-019-13870-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Carter A.R., Sanderson E., Hammerton G., Richmond R.C., Davey Smith G., Heron J., Taylor A.E., Davies N.M., Howe L.D. Mendelian randomisation for mediation analysis: current methods and challenges for implementation. Eur. J. Epidemiol. 2021;36:465–478. doi: 10.1007/s10654-021-00757-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rees J.M.B., Wood A.M., Burgess S. Extending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy. Stat. Med. 2017;36:4705–4718. doi: 10.1002/sim.7492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Grant A.J., Burgess S. Pleiotropy robust methods for multivariable Mendelian randomization. Stat. Med. 2021;40:5813–5830. doi: 10.1002/sim.9156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wang J., Zhao Q., Bowden J., Hemani G., Davey Smith G., Small D.S., Zhang N.R. Causal inference for heritable phenotypic risk factors using heterogeneous genetic instruments. PLoS Genet. 2021;17:e1009575. doi: 10.1371/journal.pgen.1009575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lin Z., Xue H., Pan W. Robust multivariable Mendelian randomization based on constrained maximum likelihood. Am. J. Hum. Genet. 2023;110:592–605. doi: 10.1016/j.ajhg.2023.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Sanderson E., Spiller W., Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Stat. Med. 2021;40:5434–5452. doi: 10.1002/sim.9133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Grant A.J., Gill D., Kirk P.D.W., Burgess S. Noise-augmented directional clustering of genetic association data identifies distinct mechanisms underlying obesity. PLoS Genet. 2022;18 doi: 10.1371/journal.pgen.1009975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Willer C.J., Schmidt E.M., Sengupta S., Peloso G.M., Gustafsson S., Kanoni S., Ganna A., Chen J., Buchkovich M.L., Mora S., et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 2013;45:1274–1283. doi: 10.1038/ng.2797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen J., Spracklen C.N., Marenne G., Varshney A., Corbin L.J., Luan J., Willems S.M., Wu Y., Zhang X., Horikoshi M., et al. The trans-ancestral genomic architecture of glycemic traits. Nat. Genet. 2021;53:840–860. doi: 10.1038/s41588-021-00852-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Horikoshi M., Beaumont R.N., Day F.R., Warrington N.M., Kooijman M.N., Fernandez-Tajes J., Feenstra B., van Zuydam N.R., Gaulton K.J., Grarup N., et al. Genome-wide associations for birth weight and correlations with adult disease. Nature. 2016;538:248–252. doi: 10.1038/nature19806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Hill W.D., Marioni R.E., Maghzian O., Ritchie S.J., Hagenaars S.P., McIntosh A.M., Gale C.R., Davies G., Deary I.J. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol. Psychiatr. 2019;24:169–181. doi: 10.1038/s41380-017-0001-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Okbay A., Beauchamp J.P., Fontana M.A., Lee J.J., Pers T.H., Rietveld C.A., Turley P., Chen G.B., Emilsson V., Meddens S.F.W., et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533:539–542. doi: 10.1038/nature17671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nikpay M., Goel A., Won H.H., Hall L.M., Willenborg C., Kanoni S., Saleheen D., Kyriakou T., Nelson C.P., Hopewell J.C., et al. A comprehensive 1000 genomes–based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 2015;47:1121–1130. doi: 10.1038/ng.3396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mahajan A., Taliun D., Thurner M., Robertson N.R., Torres J.M., Rayner N.W., Payne A.J., Steinthorsdottir V., Scott R.A., Grarup N., et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 2018;50:1505–1513. doi: 10.1038/s41588-018-0241-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Demenais F., Margaritte-Jeannin P., Barnes K.C., Cookson W.O.C., Altmüller J., Ang W., Barr R.G., Beaty T.H., Becker A.B., Beilby J., et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat. Genet. 2018;50:42–53. doi: 10.1038/s41588-017-0014-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lambert J.C., Ibrahim-Verbaas C.A., Harold D., Naj A.C., Sims R., Bellenguez C., DeStafano A.L., Bis J.C., Beecham G.W., Grenier-Boley B., et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat. Genet. 2013;45:1452–1458. doi: 10.1038/ng.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hemani G., Zheng J., Elsworth B., Wade K.H., Haberland V., Baird D., Laurin C., Burgess S., Bowden J., Langdon R., et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7 doi: 10.7554/eLife.34408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Yavorska O.O., Burgess S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int. J. Epidemiol. 2017;46:1734–1739. doi: 10.1093/ije/dyx034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Broadbent J.R., Foley C.N., Grant A.J., Mason A.M., Staley J.R., Burgess S. MendelianRandomization v0.5.0: updates to an R package for performing Mendelian randomization analyses using summarized data [version 2; peer review: 1 approved, 2 approved with reservations] Wellcome Open Res. 2020;5:252. doi: 10.12688/wellcomeopenres.16374.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Woolf B., Gill D., Grant A.J., Burgess S. MVMRmode: Introducing an R package for plurality valid estimators for multivariable Mendelian randomisation. medRxiv. 2023 doi: 10.1371/journal.pone.0291183. Preprint at. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Davies N.M., Hill W.D., Anderson E.L., Sanderson E., Deary I.J., Davey Smith G. Multivariable two-sample Mendelian randomization estimates of the effects of intelligence and education on health. Elife. 2019;8 doi: 10.7554/eLife.43990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Burgess S., Zuber V., Valdes-Marquez E., Sun B.B., Hopewell J.C. Mendelian randomization with fine-mapped genetic data: Choosing from large numbers of correlated instrumental variables. Genet. Epidemiol. 2017;41:714–725. doi: 10.1002/gepi.22077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Schmidt A.F., Finan C., Gordillo-Marañón M., Asselbergs F.W., Freitag D.F., Patel R.S., Tyl B., Chopade S., Faraway R., Zwierzyna M., et al. Hingorani A.D. Genetic drug target validation using Mendelian randomisation. Nat. Commun. 2020;11:3255. doi: 10.1038/s41467-020-16969-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sudlow C., Gallacher J., Allen N., Beral V., Burton P., Danesh J., Downey P., Elliott P., Green J., Landray M., et al. UK Biobank: An open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Bowden J., Del Greco M F., Minelli C., Davey Smith G., Sheehan N., Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 2017;36:1783–1802. doi: 10.1002/sim.7221. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Tables S1–S5
mmc1.pdf (74.8KB, pdf)
Document S2. Article plus supplemental information
mmc2.pdf (1.9MB, pdf)

Data Availability Statement

All the data used in this paper are publicly available and can be accessed via the references given. R code for performing the proposed methods, and for reproducing the simulation results and applied analyses, can be found at https://github.com/aj-grant/mrhorse.


Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES