Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
. 2023 Mar 21;110(4):592–605. doi: 10.1016/j.ajhg.2023.02.014

Robust multivariable Mendelian randomization based on constrained maximum likelihood

Zhaotong Lin 1, Haoran Xue 1, Wei Pan 1,
PMCID: PMC10119150  PMID: 36948188

Summary

Mendelian randomization (MR) is a powerful tool for causal inference with observational genome-wide association study (GWAS) summary data. Compared to the more commonly used univariable MR (UVMR), multivariable MR (MVMR) not only is more robust to the notorious problem of genetic (horizontal) pleiotropy but also estimates the direct effect of each exposure on the outcome after accounting for possible mediating effects of other exposures. Despite promising applications, there is a lack of studies on MVMR’s theoretical properties and robustness in applications. In this work, we propose an efficient and robust MVMR method based on constrained maximum likelihood (cML), called MVMR-cML, with strong theoretical support. Extensive simulations demonstrate that MVMR-cML performs better than other existing MVMR methods while possessing the above two advantages over its univariable counterpart. An application to several large-scale GWAS summary datasets to infer causal relationships between eight cardiometabolic risk factors and coronary artery disease (CAD) highlights the usefulness and some advantages of the proposed method. For example, after accounting for possible pleiotropic and mediating effects, triglyceride (TG), low-density lipoprotein cholesterol (LDL), and systolic blood pressure (SBP) had direct effects on CAD; in contrast, the effects of high-density lipoprotein cholesterol (HDL), diastolic blood pressure (DBP), and body height diminished after accounting for other risk factors.

Keywords: direct causal effect, GWAS summary data, instrumental variable, IV, mediation analysis, pleiotropy


A robust and efficient multivariable Mendelian randomization method is proposed to estimate the direct effect of each of multiple exposures on an outcome after accounting for possible mediating effects through other exposures. An application to infer causal relationships between eight cardiometabolic risk factors and coronary artery disease highlights its usefulness.

Introduction

Mendelian randomization (MR), including its default version, univariable MR (UVMR), is an instrumental variable (IV) method that utilizes genetic variants as IVs to infer the causal relationship between an exposure and an outcome.1,2,3 With numerous publicly available large-scale genome-wide association study (GWAS) summary data, MR has recently become popular and powerful to infer causal relationships even in the presence of unmeasured confounding and reverse causation.4 In UVMR, a valid IV is required to satisfy the following three assumptions:

UV-A1: the IV is (marginally) associated with the exposure;

UV-A2: the IV is independent of the unmeasured confounder;

UV-A3: the IV is independent of the outcome conditional on the exposure and confounder.

Despite promising and wide applications of UVMR to observational data for causal inference, in reality, these assumptions may not always hold. In particular, the widespread pleiotropy is a major concern, violating UV-A2 or UV-A3; that is, a genetic variant is associated with the outcome other than through the exposure of interest.5 Screening procedures can be applied to avoid the use of pleiotropic variants, but there may be only few or even no variants solely associated with the exposure, leading to loss of power, in addition to likely biases. A number of UVMR methods robust to pleiotropy have been proposed, but under different untestable assumptions.5,6,7,8,9,10,11 Alternatively, one can alleviate the problem by including other associated risk factors as multiple exposures in the model, motivating the use of multivariable MR (MVMR).12 MVMR includes multiple exposures in the model and allows the genetic variants to be associated with one or more of them without violating the IV assumption. This is useful especially when a set of related risk factors for the outcome of interest share many commonly associated genetic variants, e.g., various lipids.13 Another scenario where MVMR would be useful is in mediation analysis.14 In MVMR, the direct effect of one exposure on the outcome not mediated through the rest of the exposures is estimated, while in UVMR, only the total effect of the exposure on the outcome, including that mediated through other exposures, is estimated. Therefore, MVMR would be useful when the aim is to study the causal mechanism of a set of risk factors on the outcome after accounting for potential causal pathways among the risk factors.

Despite these exciting advantages of MVMR over UVMR, some assumptions are still required to have valid IVs for MVMR. Paralleling that in UVMR, in MVMR a valid IV must satisfy the following:

MV-A1: the IV is associated with at least one exposure conditional on the other exposures included in the model;

MV-A2: the IV is independent of any confounder of each exposure-outcome pair;

MV-A3: the IV is independent of the outcome conditional on all exposures included in the model and the confounders.

Since a marginal association does not imply a conditional association, while the reverse is also true but less likely, in that sense assumption MV-A1 is stronger than UV-A1 while MV-A3 is weaker than UV-A3. We give an example on each of the two cases in the simulation results for mediation analysis. We note that the above assumptions are for one IV instead of a set of IVs as considered by others.15 The main reason is that, differing from the previous MVMR approaches, we would like to consider robust MVMR in the presence of some invalid IVs violating one or more of the above three assumptions. While there are many robust UVMR methods in the literature, however, only few MVMR methods exist, which may not be sufficiently robust and efficient (as to be shown).16,17,18 In this work, we propose a robust and efficient MVMR method by extending the univariable MR-cML7 (referred to as UVMR-cML throughout the paper) to the multivariable setting, called MVMR-cML. Based on constrained maximum likelihood (cML), UVMR-cML is robust to the violation of all three IV assumptions (in the univariable case): it allows the presence of invalid IVs violating any or all of the three IV assumptions as long as the plurality condition and several other mild conditions hold. Under some mild conditions, it can consistently identify invalid IV(s) with either or both of correlated and uncorrelated pleiotropy, yielding a consistent and asymptotic normal estimator of the causal effect. It has been shown to have robust performance under various simulation setups and real data analyses.7 Under a similar constrained maximum likelihood framework, MVMR-cML is expected to enjoy the same good properties as its univariable version. In particular, as its UV counterpart, it is robust to the presence of invalid IVs violating some or all of the three IV assumptions, including those with correlated or uncorrelated pleiotropy as to be confirmed in simulations. The flexible likelihood framework also allows us to account for possible correlations among any sets of GWAS summary statistics (e.g., due to overlapping samples), and thus the method can be widely applied in different scenarios, including one-sample, two-sample, or even mixed-sample designs. We develop an efficient R package with Rcpp to integrate different versions of MR-cML for various uses.

The rest of the paper is organized as follows. First, we extend the plurality condition for model identification from the univariable case19 to the current multivariable context. Then we introduce MVMR-cML and its two variants based on model selection and data perturbation, respectively. We show the superior performance of the proposed method over other existing MVMR methods through extensive simulations. Lastly, we consider an application to study the direct effects of eight cardio-metabolic risk factors on coronary artery disease (CAD [MIM: 608320]).

Material and methods

Causal model and its interpretation

Suppose we have m independent SNPs, L exposures Xl’s with l=1,,L, an unmeasured confounder (ensemble) U, and an outcome Y. We consider the following true causal model (Figure 1A), which can be viewed as a multivariable version of the true causal model presented in Xue et al.7:

bYi=θ1bX1i++θLbXLi+αi+βYUφi:=θ1bX1i++θLbXLi+ri, (Equation 1)

where bXli and bYi are the (true) marginal associations between SNP/IV Gi and the l-th exposure and the outcome, respectively; θl is the direct causal effect of Xl on Y; and ri=αi+βYUφi represents the total pleiotropic effect of Gi on Y (not through Xl’s). For example, in the absence of any causal relationship among the exposures in Figure 1A, we have bXli=γXli+βXlUφi, for l=1,,L; in Figure 1B with a causal relation from X1 to X2, we have bX1i=γX1i+βX1Uφi and bX2i=γX2i+βX2Uφi+ξbX1i.

Figure 1.

Figure 1

True causal models for multivariable MR

(A and B) A general (A) and a specific (B) causal graph showing the relationships among one IV (Gi), multiple exposures (X1,,XL), an unmeasured confounder (U), and the outcome (Y).

The three valid IV assumptions for a valid IV Gi require, respectively, γXli0 (for some 1lL), φi=0, and αi=0, the last two of which imply no pleiotropic effects with ri=0. On the other hand, αi0 and φi0 lead to so-called uncorrelated and correlated pleiotropy, respectively. It is a major goal here to consider robust MVMR analysis in the possible presence of some invalid IVs with ri0.

In the true causal model depicted in Figure 1A, for MVMR we do not need to specify causal relationships among the L exposures, the presence of which is likely and, importantly, implies that the direct causal effect θl is in general different from the total causal effect of Xl on Y. For illustration, we consider a simple example with L=2 exposures, one of which mediates the effect of the other on the outcome (Figure 1B); more examples can be found elsewhere.15,20 The corresponding linear regression model for MVMR is

Y=X1θ1+X2θ2+UβYU+Giαi+ϵ,

where ϵ is a random error. Accordingly, we can interpret the direct (causal) effect θ1 of X1 on Y as that of changing X1 while holding X2 fixed. In contrast, by replacing X2=ξX1+GiγX2i+UβX2U+e2 (with e2’s being a random error), we obtain the corresponding linear regression model for UVMR for X1 as

Y=X1(θ1+ξθ2)+U(βYU+βX2Uθ2)+Gi(αi+γX2iθ2)+(ϵ+e2θ2),

giving the total effect of X1 on Y as θ1+ξθ2, including both the direct effect θ1 and the indirect effect ξθ2 mediated through X2. In addition, this example also illustrates an advantage of MVMR over UVMR: if αi=0 and φi=0, Gi is likely a valid IV with no pleiotropic effect (i.e., ri=0) in MVMR; however, in UVMR for X1, Gi is an invalid IV with pleiotropic effect ri=γX2iθ20.

Model identification

Suppose the ground truth to Equation 1 is given by bYi=θ1bX1i++θLbXLi+ri, and the set of valid IVs is V={i:ri=0} with m0=|V|. Here, we use the asterisk to denote the true value of a parameter and only consider relevant IVs with bXi=(bX1i,,bXLi)T0. We say that the true parameters r=(r1,,rm)T and θ=(θ1,,θL)T are identifiable if, given the (true) marginal associations bYi and bXi0 for i=1,,m, there is a unique solution (r,θ) (as the ground truth) to

bYi=ri+θ1bX1i++θLbXLi,i=1,,m, (Equation 2)

under the constraint that any solution is obtained from the largest set(s) of IVs with their corresponding ri=0 (as a part of the solution). Note that the constraint is needed because otherwise we would have many solutions with m+L unknown parameters (ri’s and θj’s) in m linear equations.

There are two aspects to consider for model identification. First, unique to MVMR (i.e., different from UVMR), even in the ideal case of using only m0 valid IVs (with ri=0), in order to have a unique solution to the system of linear equations in Equation 2, we require that the marginal association matrix BV=(bX1,bX2,,bXm0)T to be of full column rank, which implies at least one valid IV for each exposure and thus m0L. This condition ensures the marginal associations between the IVs and exposures are not multicollinear, as pointed out by others; a conditional F-test has been proposed to test for possible violation of this condition.17 The other aspect, similar to that in UVMR, is a plurality condition in the presence of invalid IVs with ri0: the (true) valid IVs form the largest group to give the same causal parameter estimate. The two aspects are combined into the following assumption.

Assumption 1: Suppose the matrix of bXi’s for iV, BV=[bXi]iVRm0×L, has full column rank L. Moreover, the following multivariable plurality condition holds:

|V|>max{c0,cRL}|{i:ri=bXiTc}|. (Equation 3)

Theorem 1: Given bYi and bXi for i=1,,m, the true parameters θ and r in Equation 2 are identifiable if and only if Assumption 1 holds.

The proof of Theorem 1 is given in section S1.1 in the supplemental information. We first give some intuitive explanation on why the plurality condition is needed. If it is violated, it means that some invalid IVs form the largest group with a solution c+θθ to Equation 2. Next, we note that the plurality condition in Equation 3 is a generalization of that for the univariable case with L=1. Theorem 1 in Guo et al.19 states that, given bYi and bX1i, the model parameters θ1 and r1,,rm are identifiable if and only if the following plurality rule condition holds:

|V|>maxc0|{i:ri/bX1i=c}|, (Equation 4)

where V is the set of valid IVs with ri=0 and bX1i0. It is clear that Equation 3 reduces to Equation 4 with L=1.

New method: Multivariable MRcML (MVMR-cML)

We extend the UVMR-cML proposed in Xue et al.7 to the multivariable case. The goal of MVMR-cML is to estimate the direct causal effect of each exposure on the outcome (while relaxing the no-pleiotropy condition for valid IVs). Denote {βˆX1i,,βˆXLi,βˆYi, σˆX1i2,,σˆXLi2,σˆYi2}i=1m as the GWAS summary statistics of m (nearly) independent SNPs/IVs, each of which is selected on the basis of its significant marginal association with at least one of the exposures (e.g., at the usual genome-wide significance level of p value < 5e−8). We have

βˆXli=bXli+ϵXli,l=1,,L,βˆYi=bYi+ϵYi,

with var(ϵXli)=σXli2 and var(ϵYi)=σYi2. We assume σXli2 and σYi2 are known or well estimated as σˆXli2 and σˆYi2, respectively. Then we have the model for SNP i:

βˆi=(βˆX1i,,βˆXLi,βˆYi)TN(bi=(bX1i,,bXLi,l=1LθlbXli+ri)T,Σi), (Equation 5)

where Σi=(σX1i2ρ12σX1iσX2iρ1LσX1iσXLiρ1YσX1iσYiσX2i2ρ2LσX2iσXLiρ2YσX2iσYiσXLi2ρLYσXLiσYiσYi2), ρll (l,l=1,,L and ll) is the correlation between the two GWAS summary estimates for exposures Xl and Xl, and ρlY (l=1,,L) is the correlation between the two GWAS summary datasets for exposure Xl and outcome Y. When the L+1 GWAS summary datasets are calculated from L+1 sets of non-overlapping samples, respectively, ρll=ρlY=0 and Σi is diagonal. Otherwise, we can use bivariate linkage disequilibrium score regression (LDSC)21 or the sample correlation between two sets of GWAS Z scores across null SNPs22 to estimate all the ρ’s.

Since the m IVs are independent, the log likelihood of the observed GWAS data (up to some constants) is

l(θ,{bXi},{ri};{βˆi,Σi})=i=1mli(θ,bXi,ri;βˆi,Σi)=12i=1m(βˆibi)TΣi1(βˆibi), (Equation 6)

where θ=(θ1,,θL)T, and we use {bXi}={(bX1i,,bXLi)T,i=1,,m} to represent a set of the parameters and similarly for {ri} and {βˆi,Σi}.

Under the constraint that the number of invalid IVs is K, we estimate the unknown parameters by solving the following constrained maximum likelihood:

(θˆ,{bˆXi},{rˆi})=argmaxθ,{bXi},{ri}l(θ,{bXi},{ri};{βˆi,Σi})subjecttoi=1mI(ri0)=K.

For a given number of invalid IVs, K, a coordinate descent-like algorithm is implemented as follows: at the (t+1)-th iteration,

Step 1: calculate rˆi(t+1) by solving liri|θ(t+1),bXi(t)=0, order di(t+1)=li(θ(t),bXi(t),rˆi(t+1);βˆi,Σi)li(θ(t),bXi(t),0;βˆi,Σi) decreasingly, then for i=1,,K, update r(i)(t+1)=rˆ(i)(t+1); for i=K+1,,m, update r(i)(t+1)=0;

Step 2: update bXi(t+1) by solving libXi|θ(t),ri(t+1)=0 for i=1,,m;

Step 3: update θ(t+1) by solving lθ|{bXi(t+1),ri(t+1)}=0.

We repeat the above three steps until convergence, obtaining the final estimates θˆ(K) and {bˆXi(K),rˆi(K)}i=1m. As in UVMR-cML,7 it is notable that at the convergence the (estimated) invalid IVs (with rˆi0) do not contribute to estimating θ, and the resulting cMLE of θ is the same as the maximum (profile) likelihood estimator being applied to all (selected) valid IVs.

We select the number of invalid IVs, K, from a candidate set K based on the following Bayesian information criterion (BIC):

BIC(K)=2l(θˆ(K),{bˆXi(K),rˆi(K)};{βˆi,Σi})+log(N)·K, (Equation 7)

where N is the minimum sample size of all GWAS datasets used in the model. We select Kˆ=argminKKBIC(K) and Vˆ={i|rˆi(Kˆ)=0,i=1,,m}. Then the final cMLE of θ is θˆ=θˆ(Kˆ). The standard errors are calculated on the basis of the observed Fisher information matrix from the (profile) likelihood with all selected valid IVs (in Vˆ).7,23 With θˆ=(θˆ1,,θˆL)T and corresponding standard errors, we draw inference on the basis of the asymptotic normal distribution. We call this method MVMR-cML-BIC. We note that our proposed cMLE θˆ also enjoys the nice statistical properties of estimation and selection consistency as its univariable counterpart UVMR-cML.7 Here, we state the main conclusions with the proofs relegated to the supplemental information.

Assumption 2: For every SNP i=1,,m,

βˆi=(βˆX1i,,βˆXLi,βˆYi)TN(bi=(bX1i,,bXLi,l=1LθlbXli+ri)T,Σi),

with known covariance matrix Σi. Furthermore, the m vectors {βˆi}i=1m are mutually independent.

Assumption 3: Let N=min(NX1,,NXL,NY). There exist positive constants c1 and c2 such that we have c1/NσXli2c2/N, and c1/NσYi2c2/N, for l=1,,L, i=1,,m.

Theorem 2: With Assumptions 1 to 3 satisfied, our proposed BIC consistently selects valid IVs, i.e., P(Kˆ=mm0)1 and P(Vˆ=V)1 as N. Furthermore, the proposed constrained maximum likelihood estimator θˆ, combined with the use of the BIC, is consistent for the true causal parameter θ, and asymptotically normal with

V12(θˆθ)dN(0,I),

where V=E[2l˜(θ)/θθ] is the expected Fisher information matrix for the profile log likelihood that can be consistently estimated by its sample version.

For typical GWAS summary data with large sample sizes, Assumptions 2 and 3 are reasonable. Based on the plurality condition, the range of K can be varied from 0, i.e., no invalid IV, up to m(L+1). However in practice, we suggest first try a smaller range of K, for example from 0 to m/2, or based on other methods such as MVMR-Lasso,16 and one can keep expanding the range if the best K selected is on or close to the upper bound. One reason is that the proportion of invalid IVs is relatively low in many real data examples using UVMR,6,7 which is expected to be lower in MVMR when we explicitly include other exposures in the model. Moreover, this can speed up the computation of MVMR-cML dramatically, especially for the data perturbation version to be described next.

To better account for the uncertainty in model selection described above, we adopt the data perturbation approach7; in a UVMR context, the data perturbation on GWAS summary data is shown to be equivalent to bootstrapping GWAS individual-level data.6 For the b-th perturbation, b=1,,B, we generate the perturbed GWAS summary data:

βˆi(b)=(βˆX1i(b),,βˆXLi(b),βˆYi(b))TN(βˆi=(βˆX1i,,βˆXLi,βˆYi)T,Σi),

for i=1,,m independently. Then we apply MVMR-cML-BIC on the b-th perturbed sample and obtain θˆ(b). We use the (element-wise) sample mean and sample covariance of θˆ(1),,θˆ(B) as the final estimate of θ and its covariance matrix, respectively. The number of perturbations B is suggested to be at least 100. We call this method MVMR-cML-DP.

Simulations

Comparison of MVMR-cML and other MVMR methods in the presence of pleiotropy

We first compare the performance of our proposed method, MVMR-cML, with other existing MVMR methods in the presence of pleiotropy. Following Grant and Burgess,16 we simulated data as follows:

U=Gφ+eU,Xl=GγXl+0.25U+eXl,l=1,,4,Y=l=14θl·Xl+Gα+U+eY,

where each component of eU,eXl,eY was independently and identically distributed (iid) as the standard normal N(0,1), G was the genotype matrix with 20 IVs, each generated independently from a binomial distribution with minor allele frequency (MAF) 0.3 and γXliUniform(0,0.22) iid. Two sets of values for the causal effects were considered: (1) (θ1,θ2,θ3,θ4)=(0.2,0.1,0.3,0.4) and (2) (θ1,θ2,θ3,θ4)=(0,0.1,0.1,0.2). We considered three scenarios with different patterns of pleiotropy, and considered 30% and 50% invalid IVs for each scenario.

S1: Balanced and uncorrelated pleiotropy (with InSIDE satisfied): φ was set to 0 and the αi’s corresponding to invalid IVs were generated from N(0,0.22).

S2: Directional and uncorrelated pleiotropy (with InSIDE satisfied): φ was set to 0 and the αi’s corresponding to invalid IVs were generated from N(0.1,0.22).

S3: Directional and correlated pleiotropy (with InSIDE violated): for invalid IVs, φi’s were generated from Uniform(0,0.1) and the αi’s were generated from N(0,0.22).

We also considered a scenario (S4) where some IVs violated assumption MV-A1 and some had uncorrelated pleiotropy (φ=0). Specifically, we randomly selected 30% SNPs and set γX1i=γX2i=γX3i=γX4i=0. We also randomly selected 30% SNPs that had pleiotropic effect with αiN(0.1,0.12). Note that these two sets of (30%) invalid IVs could have overlap.

The GWAS sample sizes for all traits were set as N=50,000. The GWAS summary statistics for the four risk factors were calculated with the same 50,000 individuals, and the outcome GWAS summary statistics were calculated with the other non-overlapping 50,000 individuals. We calculated ρll as the sample correlation between Xl and Xl, and ρlY was set to zero in MVMR-cML.

For each simulation set-up, we ran 500 replications and compared MVMR-cML-BIC, MVMR-cML-DP, and some existing MVMR methods including MVMR-IVW,12 MVMR-Egger,18 MVMR-median, MVMR-robust, and MVMR-Lasso.16

Comparison of MVMR-cML and other MVMR methods in the presence of weak IVs

In this section, we simulated an MVMR model with two exposures and 45 IVs. We considered two different ways to generate the strengths of SNP-exposure associations. In the first scenario, some weak IVs were simulated with relatively small γX1i and γX2i, i.e., the exposures were marginally weakly associated with the IVs. In the second scenario, some marginally strong but conditionally weak IVs were generated by introducing a strong correlation between γX1i and γX2i; that is, each exposure was strongly associated with the IVs marginally but only weakly associated conditional on the other exposure. In both scenarios, we maintained the two-sample conditional F-statistic proposed in Sanderson et al.17 smaller than 10, the conventional cut-off for detecting weak IVs in MVMR. Details of simulation set-ups are given in the supplemental information. For each set-up, we ran 500 replications and compared the proposed MVMR-cML with other existing MVMR methods.

Mediation analysis: MVMR versus UVMR

In this simulation, we illustrate two advantages of using MVMR over UVMR. First, when there are causal relationships among the risk factors, MVMR can distinguish a direct effect from a total effect of one risk factor on the outcome, while UVMR estimates the total effect on the outcome. Second, ignoring causal pathways via other risk factors may lead to the violation of the IV assumptions for some IVs and thus of the plurality condition in UVMR.

We considered L=2 risk factors X1 and X2 in Figure 1B, with a causal effect X1X2, and simulated the GWAS summary statistics according to the three scenarios in Figure 2. More details are given in the supplemental information.

Figure 2.

Figure 2

True causal models in simulations

(A–C) Three scenarios of simulated genetic instruments for i=1,,K (A); i=K+1,,K+m1 (B); and i=K+m1+1,,20 (C). γX1i,γX2iiid.U(0,0.22), αiN(0.1,0.22), and (θ1,θ2)=(0.1,0.2).

We applied MVMR-cML and other MVMR methods with both X1 and X2 as exposures and applied UVMR-cML and UVMR-IVW with only X1 as the exposure. In particular, the SNPs in Figure 2A had direct/pleiotropic effects on Y, not going through either X1 or X2, and thus they were invalid IVs in both UVMR and MVMR analyses; the SNPs in Figure 2B did not have direct effects on Y conditional on X1, so they were valid IVs in both UVMR (for X1 only) and MVMR analyses; the SNPs in Figure 2C were directly associated with X2 in addition to a mediating route via X1, and thus they were invalid IVs in UVMR but not in MVMR. We considered different combinations of (K,m1) from {(0,1),(0,14),(0,18),(3,11),(3,15),(10,4)}. We note that when K=0 (i.e., no SNPs generated from scenario Figure 2A), all IVs were valid in MVMR analysis; when (K,m1)=(0,1), there was only one valid IV for UVMR and thus the plurality condition for UMVR7 was violated.

GWAS data

In the real data application, we focused on assessing the causal effects of eight cardio-metabolic risk factors on CAD.24 The eight risk factors were triglyceride (TG), low-density lipoprotein cholesterol (LDL), high-density lipoprotein cholesterol (HDL),25 body mass index (BMI),26 height,27 fasting glucose (FG),28 systolic blood pressure (SBP), and diastolic blood pressure (DBP).29 The sample size of the nine GWAS datasets ranges from around 130,000 to around 750,000. We extracted the SNPs as IVs by using the mv_extract_exposures function in the R package TwoSampleMR.30 After harmonizing data, we retained 201 IVs; for each of the eight exposures, the number of significantly associated IVs ranged from 14 to 79.

Results

Simulations: Better performance of MVMR-cML over other MVMR methods

Robustness to pleiotropy

Here, we show some representative results for estimation and inference of the direct causal effect of exposure X1 on the outcome, θ1, while the results for θ2,θ3,θ4 are provided in the supplemental information with the same conclusion. Tables 1 and 2 show the results of different MVMR methods under scenarios S1–S3 when θ1=0.2 and θ1=0, respectively. Table 3 shows the results under scenario S4.

Table 1.

Mean and standard deviation (SD) of estimates, mean standard error (SE), coverage rate (Cov), power, and mean squared error (MSE) when m=20 and θ1=0.2

Method 30% invalid
50% invalid
Mean SD SE Cov Power MSE Mean SD SE Cov Power MSE
Scenario 1: Balanced pleiotropy, InSIDE met

MVMR-cML-BIC 0.200 0.070 0.054 0.872 0.912 0.005 0.195 0.227 0.064 0.678 0.794 0.052
MVMR-cML-DP 0.199 0.073 0.084 0.974 0.706 0.005 0.193 0.182 0.192 0.968 0.322 0.033
MVMR-Egger 0.206 0.440 0.428 0.950 0.092 0.194 0.156 0.599 0.559 0.940 0.070 0.360
MVMR-IVW 0.207 0.372 0.361 0.944 0.114 0.138 0.163 0.502 0.475 0.938 0.080 0.253
MVMR-Lasso 0.194 0.095 0.058 0.894 0.870 0.009 0.205 0.255 0.084 0.686 0.714 0.065
MVMR-median 0.198 0.114 0.083 0.922 0.730 0.013 0.209 0.279 0.121 0.740 0.580 0.078
MVMR-robust 0.200 0.085 0.096 0.921 0.661 0.007 0.191 0.381 0.413 0.927 0.156 0.145

Scenario 2: Directional pleiotropy, InSIDE met

MVMR-cML-BIC 0.200 0.076 0.055 0.860 0.896 0.006 0.203 0.223 0.067 0.694 0.796 0.050
MVMR-cML-DP 0.199 0.075 0.085 0.958 0.712 0.006 0.204 0.190 0.173 0.930 0.370 0.036
MVMR-Egger 0.204 0.509 0.484 0.946 0.086 0.259 0.207 0.649 0.601 0.926 0.098 0.422
MVMR-IVW 0.267 0.432 0.409 0.926 0.130 0.191 0.297 0.555 0.510 0.914 0.128 0.317
MVMR-Lasso 0.203 0.113 0.063 0.840 0.844 0.013 0.244 0.389 0.093 0.580 0.714 0.153
MVMR-median 0.210 0.134 0.090 0.904 0.686 0.018 0.249 0.402 0.142 0.658 0.568 0.164
MVMR-robust 0.204 0.109 0.102 0.899 0.662 0.012 0.267 0.429 0.474 0.919 0.136 0.188

Scenario 3: Directional pleiotropy, InSIDE violated

MVMR-cML-BIC 0.198 0.069 0.055 0.872 0.908 0.005 0.203 0.201 0.065 0.698 0.780 0.041
MVMR-cML-DP 0.200 0.069 0.083 0.974 0.714 0.005 0.202 0.159 0.180 0.950 0.354 0.025
MVMR-Egger 0.221 0.507 0.452 0.922 0.138 0.258 0.265 0.609 0.569 0.912 0.098 0.376
MVMR-IVW 0.211 0.433 0.384 0.916 0.128 0.187 0.268 0.508 0.484 0.926 0.108 0.263
MVMR-Lasso 0.202 0.141 0.059 0.878 0.902 0.020 0.221 0.334 0.089 0.664 0.718 0.112
MVMR-median 0.206 0.163 0.087 0.924 0.688 0.027 0.227 0.337 0.133 0.762 0.576 0.114
MVMR-robust 0.202 0.089 0.100 0.907 0.659 0.008 0.243 0.404 0.405 0.929 0.190 0.165
Table 2.

Mean and standard deviation (SD) of estimates, mean standard error (SE), coverage rate (Cov), type-I error, and mean squared error (MSE) when m=20 and θ1=0

Method 30% invalid
50% invalid
Mean SD SE Cov Type-I MSE Mean SD SE Cov Type-I MSE
Scenario 1: Balanced pleiotropy, InSIDE met

MVMR-cML-BIC 0.001 0.056 0.044 0.882 0.118 0.003 −0.004 0.149 0.054 0.700 0.300 0.022
MVMR-cML-DP 0.001 0.056 0.069 0.974 0.026 0.003 −0.003 0.127 0.143 0.952 0.048 0.016
MVMR-Egger 0.008 0.441 0.427 0.946 0.054 0.194 −0.043 0.601 0.559 0.930 0.070 0.363
MVMR-IVW 0.008 0.371 0.361 0.944 0.056 0.138 −0.037 0.504 0.475 0.936 0.064 0.256
MVMR-Lasso −0.003 0.085 0.049 0.906 0.094 0.007 0.007 0.243 0.074 0.684 0.316 0.059
MVMR-median 0.000 0.096 0.070 0.920 0.080 0.009 0.005 0.276 0.108 0.736 0.264 0.076
MVMR-robust 0.003 0.077 0.079 0.907 0.093 0.006 −0.006 0.381 0.417 0.921 0.079 0.145

Scenario 2: Directional pleiotropy, InSIDE met

MVMR-cML-BIC −0.002 0.055 0.045 0.880 0.120 0.003 0.009 0.156 0.055 0.746 0.254 0.024
MVMR-cML-DP −0.001 0.057 0.069 0.970 0.030 0.003 0.006 0.144 0.144 0.954 0.046 0.021
MVMR-Egger 0.006 0.508 0.483 0.942 0.058 0.258 0.009 0.648 0.601 0.930 0.070 0.420
MVMR-IVW 0.067 0.431 0.408 0.918 0.082 0.190 0.096 0.553 0.510 0.910 0.090 0.315
MVMR-Lasso 0.002 0.104 0.052 0.868 0.132 0.011 0.042 0.380 0.084 0.590 0.410 0.146
MVMR-median 0.008 0.122 0.075 0.916 0.084 0.015 0.046 0.392 0.128 0.640 0.360 0.156
MVMR-robust 0.000 0.078 0.076 0.906 0.094 0.006 0.060 0.422 0.478 0.921 0.079 0.182

Scenario 3: Directional pleiotropy, InSIDE violated

MVMR-cML-BIC 0.002 0.062 0.045 0.876 0.124 0.004 0.007 0.197 0.054 0.706 0.294 0.039
MVMR-cML-DP 0.001 0.056 0.067 0.966 0.034 0.003 0.007 0.136 0.145 0.938 0.062 0.019
MVMR-Egger 0.024 0.506 0.451 0.918 0.082 0.257 0.066 0.610 0.569 0.908 0.092 0.377
MVMR-IVW 0.012 0.433 0.383 0.914 0.086 0.187 0.068 0.508 0.483 0.926 0.074 0.263
MVMR-Lasso 0.004 0.116 0.051 0.874 0.126 0.014 0.029 0.326 0.079 0.674 0.326 0.107
MVMR-median 0.007 0.154 0.074 0.922 0.078 0.024 0.024 0.329 0.119 0.740 0.260 0.109
MVMR-robust 0.003 0.070 0.078 0.887 0.113 0.005 0.049 0.396 0.413 0.925 0.075 0.159
Table 3.

Results for scenario S4

Method θ1=0.2
θ1=0
Mean SD SE Cov Power MSE Mean SD SE Cov Type-I MSE
MVMR-cML-BIC 0.205 0.142 0.067 0.802 0.814 0.020 0.013 0.216 0.058 0.806 0.194 0.047
MVMR-cML-DP 0.207 0.129 0.137 0.960 0.496 0.017 0.014 0.137 0.140 0.970 0.030 0.019
MVMR-Egger 0.259 0.313 0.325 0.948 0.152 0.101 0.054 0.324 0.330 0.936 0.064 0.108
MVMR-IVW 0.266 0.294 0.309 0.946 0.176 0.091 0.062 0.297 0.315 0.936 0.064 0.092
MVMR-Lasso 0.213 0.179 0.078 0.762 0.756 0.032 0.019 0.208 0.069 0.786 0.214 0.044
MVMR-median 0.226 0.181 0.112 0.836 0.618 0.033 0.022 0.226 0.099 0.812 0.188 0.052
MVMR-robust 0.204 0.169 0.118 0.851 0.632 0.029 0.023 0.166 0.095 0.814 0.186 0.028

Mean and standard deviation (SD) of estimates, mean standard error (SE), coverage rate (Cov), power/type-I error, and mean squared error (MSE).

First, across all considered scenarios, our proposed method MVMR-cML had the smallest bias and mean squared error (MSE) among all the methods, followed by MVMR-Lasso and MVMR-robust. The advantage of our methods over other methods was more pronounced when there was a higher proportion of invalid IVs at 50%. MVMR-Egger and MVMR-IVW had much less precise estimates than other methods, leading to much lower power and larger MSEs in general. Second, MVMR-cML-DP was the only method that could control the type-I error below the nominal level 5% (Table 2), followed by MVMR-IVW, MVMR-Egger, and MVMR-robust, while at the same time MVMR-cML-DP had much higher power than the other three methods (Table 1). On the other hand, MVMR-Lasso had the largest inflated type-I error rate. Lastly, although MVMR-cML-BIC had good performance in estimation in terms of a small bias and MSE, its inference might not be satisfactory. In the presence of invalid IVs, due to the uncertainty in model selection, its mean standard error was lower than the sample standard deviation of the estimates, leading to an anti-conservative coverage rate and an inflated type-I error rate (Table 2), And the issue was more severe when 50% IVs were invalid. Nevertheless, the data perturbation approach was able to alleviate this problem, giving a satisfactory coverage rate close to the nominal level 95% with a well-controlled type-I error rate. In addition, MVMR-cML-DP had a smaller MSE than MVMR-cML-BIC in the presence of 50% invalid IVs. Hence, overall, MVMR-cML-DP performed best and would be recommended.

Robustness to weak IVs

In this section, we considered a scenario with marginally (and conditionally) weak IVs (Table 4) and a scenario with marginally strong but conditionally weak IVs (Table 5), while all IVs were valid with no pleiotropic effects. The average conditional F-statistics across 500 simulation replicates were 6.73 and 6.70 for the two exposures, respectively, in the first scenario, and 9.35 and 9.38 in the second scenario. We can see that, only MVMR-cML yielded (almost) unbiased estimates (with the smallest MSE) in both scenarios, suggesting its robustness to weak instrument bias. We note that data perturbation might produce a slightly conservative confidence interval as observed in the previous section. In the null causal effect case (Tables S7 and S8), all methods gave unbiased results.

Table 4.

Simulation results for weak IVs

Method θ1=0.5
θ2=0.3
Mean SD SE Cov Power MSE Mean SD SE Cov Power MSE
MVMR-cML-BIC 0.501 0.081 0.073 0.924 1.000 0.007 −0.301 0.082 0.073 0.922 0.972 0.007
MVMR-cML-DP 0.500 0.081 0.102 0.984 1.000 0.007 −0.301 0.081 0.103 0.984 0.878 0.007
MVMR-Egger 0.449 0.097 0.094 0.904 0.992 0.012 −0.225 0.078 0.077 0.824 0.826 0.012
MVMR-IVW 0.433 0.068 0.066 0.824 1.000 0.009 −0.233 0.068 0.067 0.820 0.920 0.009
MVMR-Lasso 0.430 0.080 0.158 0.868 0.764 0.011 −0.233 0.080 0.165 0.872 0.502 0.011
MVMR-median 0.430 0.083 0.090 0.906 1.000 0.012 −0.233 0.083 0.091 0.894 0.748 0.011
MVMR-robust 0.432 0.069 0.069 0.828 1.000 0.009 −0.233 0.069 0.069 0.818 0.900 0.009

The average conditional F-statistics across 500 replicates for X1 and X2 are 6.73 and 6.70 (with SD 1.44 and 1.41), respectively. Mean and standard deviation (SD) of estimates, mean standard error (SE), coverage rate (Cov), power, and mean squared error (MSE).

Table 5.

Simulation results for conditionally weak IVs

Method θ1=0.5
θ2=0.3
Mean SD SE Cov Power MSE Mean SD SE Cov Power MSE
MVMR-cML-BIC 0.501 0.064 0.059 0.940 1.000 0.004 −0.301 0.045 0.042 0.932 1.000 0.002
MVMR-cML-DP 0.503 0.065 0.081 0.984 1.000 0.004 −0.302 0.046 0.057 0.982 1.000 0.002
MVMR-Egger 0.450 0.057 0.057 0.854 1.000 0.006 −0.265 0.040 0.040 0.846 1.000 0.003
MVMR-IVW 0.450 0.055 0.056 0.846 1.000 0.006 −0.265 0.039 0.039 0.850 1.000 0.003
MVMR-Lasso 0.449 0.066 0.134 0.869 0.860 0.007 −0.265 0.047 0.094 0.869 0.801 0.003
MVMR-median 0.449 0.071 0.073 0.882 1.000 0.008 −0.265 0.050 0.051 0.882 1.000 0.004
MVMR-robust 0.450 0.058 0.058 0.822 1.000 0.006 −0.265 0.041 0.041 0.826 1.000 0.003

The average conditional F-statistics across 500 replicates for X1 and X2 are 9.35 and 9.38 (with SD 1.95 and 1.96), respectively. Mean and standard deviation (SD) of estimates, mean standard error (SE), coverage rate (Cov), power, and mean squared error (MSE).

As shown in the supplemental information, we also considered other scenarios with both weak and pleiotropic IVs. Under these more challenging situations, all MVMR methods yielded biased estimates, but MVMR-cML-DP was least biased and controlled type-I error best.

Simulations: Advantages of MVMR-cML for mediation analysis

In this simulation, we considered a scenario for mediation analysis with two exposures, one of which (X2) mediated the effect of the other (X1) on the outcome. We compared MVMR-cML with other MVMR methods and two representative UVMR methods. We show some representative results here while more results are given in the supplemental information. First, Table 6 shows the results with the number of invalid IVs K=10 and the number of valid IVs (for both MVMR and UVMR) m1=4 (the remaining six IVs are valid for MVMR but not for UVMR). With 50% invalid IVs for MVMR, all other methods yielded severely biased estimates for the direct causal effect of each exposure on the outcome, while only MVMR-cML gave (almost) unbiased estimates. As shown in the supplemental information, when the number of invalid IVs K decreased, the performance of other robust MVMR methods improved and was more similar to that of MVMR-cML. In the UVMR analysis, since most of the IVs were invalid, UVMR-IVW yielded much more biased estimates for the total causal effect of X1 than those of UVMR-cML. As shown in the supplemental information (Tables S14–S17), with more valid IVs, UVMR-cML yielded almost unbiased estimates for the total causal effect.

Table 6.

Mean and standard deviation (SD) of estimates, mean standard error (SE), and power when K=10, m1=4

Method θ1=0.1(θ1T=0.2)
θ2=0.2
Mean SD SE Power Mean SD SE Power
MVMR-cML-DP 0.102 0.029 0.032 0.866 0.199 0.025 0.027 0.992
MVMR-Egger 0.389 0.580 0.638 0.062 −0.261 0.344 0.589 0.004
MVMR-IVW 0.699 0.438 0.495 0.270 −0.142 0.288 0.571 0.002
MVMR-Lasso 0.201 0.290 0.035 0.920 0.144 0.181 0.029 0.946
MVMR-median 0.227 0.290 0.055 0.900 0.130 0.180 0.042 0.892
MVMR-robust 0.269 0.339 0.491 0.123 0.105 0.204 0.287 0.319
UVMR-cML-DP 0.209 0.064 0.025 0.928 N/A N/A N/A N/A
UVMR-IVW 0.591 0.249 0.266 0.628 N/A N/A N/A N/A

The total causal effect of X1 is θ1T=0.2.

Second, Table 7 shows the results when K=0 and m1=1, in which case only one IV was valid in UVMR analysis, thus the plurality condition required by UVMR-cML was violated. MVMR-cML had similar performance to MVMR-IVW, which can be considered as the oracle estimator as all Ivs were valid in MVMR analysis. All MVMR methods yielded (almost) unbiased estimates for the direct causal effect. On the other hand, as the plurality condition was violated in UVMR, both UVMR-cML and UVMR-IVW performed poorly in estimating the total effect of X1.

Table 7.

Mean and standard deviation (SD) of estimates, mean standard error (SE), and power when K=0, m1=1

Method θ1=0.1(θ1T=0.2)
θ2=0.2
Mean SD SE Power Mean SD SE Power
MVMR-cML-DP 0.101 0.020 0.021 0.992 0.199 0.014 0.015 1.000
MVMR-Egger 0.100 0.021 0.020 0.988 0.199 0.017 0.017 1.000
MVMR-IVW 0.100 0.019 0.018 0.994 0.199 0.014 0.013 1.000
MVMR-Lasso 0.100 0.023 0.045 0.729 0.199 0.017 0.031 0.980
MVMR-median 0.101 0.023 0.025 0.980 0.199 0.017 0.018 1.000
MVMR-robust 0.100 0.020 0.019 0.994 0.199 0.014 0.014 1.000
UVMR-cML-DP 0.298 0.045 0.019 1.000 NA NA NA NA
UVMR-IVW 0.344 0.029 0.031 1.000 NA NA NA NA

The total causal effect of X1 is θ1T=0.2.

Note that we did not apply UVMR with exposure X2 because all of the Ivs were invalid for UVMR, although the total effect of X2 on the outcome was the same as its direct effect. This example illustrates an advantage of MVMR over UVMR: the valid IV assumption MV-A3 is weaker than the corresponding UV-A3. On the other hand, in the case of K=0, m1=20 and θ1=0 with a true causal graph of GiX1X2Y for all i, there is no valid IV for X2 in MVMR while all the IVs are valid in UVMR for either X1 or X2, illustrating that the valid IV assumption MV-A1 is stronger than UV-A1.

Real data application: The causal effects of cardio-metabolic risk factors on CAD

In this section, we studied the causal effects of the eight cardio-metabolic risk factors on CAD. The correlation matrix for the SNP-trait association estimates was estimated with bivariate LDSC as discussed in the section “new method: multivariable MRcML (MVMR-cML)” (with details provided in supplemental information, section S4). We first applied MVMR-cML (and MVMR-IVW as the standard method) by using the eight risk factors as exposures and CAD as the outcome. Then we applied UVMR-cML on each risk factor-CAD pair by using the set of IVs significantly associated (p value < 5e−8) with the corresponding risk factor (Figure 3). We calculated the conditional F-statistics17 for each of the eight exposures. As shown in Figure 3, all of the conditional F-statistics were larger than 10 except for BMI (8.15). This suggests that the weak instrument bias should not be severe in this analysis, and as shown in the previous simulations, our proposed method was expected to be robust. Furthermore, as discussed later, estimating the direct effect of BMI was potentially problematic and would not be a focus here.

Figure 3.

Figure 3

The estimated effects (and 95% confidence intervals) of each of the eight risk factors on CAD by various UVMR and MVMR methods

The conditional F-statistic is given in the parentheses following each exposure name.

Direct causal effects estimated by MVMR

The results are shown in Figure 3, and we summarize a few main findings here. First, MVMR methods suggested a null effect of HDL on CAD after adjustment for other seven risk factors, while UVMR-cML suggested a protective effect. Second, the positive effect of DBP on CAD diminished in MVMR-cML but stayed nearly significant in MVMR-IVW. There are several possible reasons for a risk factor to show an effect in UVMR analysis but not in MVMR. The first is that some genetic variants for that risk factor might have pleiotropic effects, leading to biased inference in UVMR. The second is that the effect of that risk factor on CAD is mediated through other risk factors in the model. The third is possible loss of power in MVMR. We will take a closer look at these possibilities in the following sections. Moreover, we note that the GWAS summary statistics for SBP and DBP were obtained after adjusting for BMI. While using covariate-adjusted SNP-trait associations may lead to bias in MR analyses,31 as discussed in Gilbody et al.,32 including the covariate, BMI in this case, as an additional exposure in an MVMR analysis can recover the direct causal effects of the exposures of interest, DBP and SBP in this case. However, the estimated direct effect of the covariate (BMI) on the outcome (CAD) was potentially biased and should be interpreted with caution.

MVMR-cML-BIC identified ten invalid IVs out of the 201 IVs used in the MVMR analysis. We calculated the conditional F-statistics on the basis of the selected set of valid IVs. The results were similar to those when using the whole set of 201 IVs: only BMI had a conditional F-statistic smaller than 10 (Table S19), suggesting no severe issue of multicollinearity. Lastly, to detect possible outlying or influential IVs, Cook’s distance for each IV in the MVMR-IVW model was calculated. None of them exceeded the recommended threshold of the median of the corresponding F-distribution,33,34 suggesting that there was no influential point in the analysis (Figure S2). We also applied a leave-one-out analysis with MVMR-cML-DP, reaching the same conclusion (Figures S3 and S4).

Causal effects of lipids on CAD

The causal effect of various lipid fractions, including LDL, HDL, and TG, on CAD is an important issue that has been studied widely. While increased LDL doubtlessly has a deleterious causal effect on CAD, the roles of TG and HDL are still under debate. Numerous MR analyses have been conducted to investigate this issue. In particular, in standard UVMR analyses using genome-wide significant variants for HDL, a protective role of HDL on CAD has been suggested by several UVMR methods.7 However, this could be due to the fact that many variants associated with HDL are also associated with other risk factors like LDL, TG, and BMI, suggesting possible pleiotropic effects. For example, Holmes et al.35 showed that, when an unrestricted allele score was used, a protective effect of HDL was identified, while when a restricted allele score (by removing any SNPs associated with either of the other two lipid traits) was used, no significant effect was found.

In our example here, we reached a similar conclusion. First, in the univariable analysis, UVMR-cML-BIC suggested that HDL had a protective effect on CAD while the result of UVMR-cML-DP was not significant. As noted in Xue et al.,7 model selection based on BIC might miss some invalid IVs due to their small effect sizes, leading to inflated type-I errors, especially when pleiotropic effects were weak, while data perturbation could help. We found that out of the 26 IVs associated with HDL, 17 were also associated with at least one of the three other likely causal risk factors, including LDL, TG, and BMI (at the significance level of 5e−8). However, UVMR-cML-BIC only identified eight of them, leading to possibly biased inference. On the other hand, for UVMR-cML-DP, 12 of them were identified as invalid at least 10 times out of the 100 perturbed datasets. Although UVMR-cML-DP performed more robustly than UVMR-cML-BIC in the presence of many invalid IVs, it also yielded a much wider confidence interval, even than that of MVMR-cML. We also applied UVMR-cML with only nine IVs for HDL by removing the 17 potentially invalid IVs. In this case, both UVMR-cML-BIC and UVMR-cML-DP suggested a null effect of HDL on CAD (though it could be due to the reduce power of using less SNPs). On the other hand, both MVMR-cML-BIC and MVMR-cML-DP suggested that both TG and LDL, but not HDL, had a causal effect on CAD after adjusting for other risk factors included in the model; this could be partly due to the relaxed non-pleiotropy assumption in MVMR.

Diminished causal effects of DBP on CAD after accounting for SBP

As shown in Figure 3, DBP had a significant positive effect on CAD in UVMR-cML, but the effect completely diminished in MVMR-cML. This was concordant with some previous studies finding that the effect of DBP on CAD disappeared after adjusting for SBP.36,37 First, there were 79 SNPs associated with DBP, and 63 of them were also associated with at least one of the other seven risk factors (at the significance level of 5e−8). We removed these 63 variants and performed UVMR-cML on DBP to CAD with the remaining set of 16 SNPs. Unlike what we observed for HDL, the UVMR-cML result still showed a significant positive effect, suggesting that the diminished effect of DBP in MVMR (but significant in UVMR) might be due to other reasons besides the presence of pleiotropic variants.

Next, we applied MVMR-cML-DP with a subset of exposures to investigate some potential mediating effects. We separately used each of the other seven risk factors along with DBP as exposures and CAD as the outcome. As shown in the left panel in Figure 4, the effect of DBP on CAD after adjusting for SBP changed from that in UVMR-cML-DP, while adjusting for any one of the other six risk factors (height, BMI, FG, HDL, LDL, and TG) did not lead to much change. Alternatively, we started from the "full" model as shown in Figure 3 with all eight risk factors and deleted each of the seven risk factors (other than DBP). The right panel in Figure 4 told a similar story: after SBP was removed from the full model, DBP still had an effect on CAD, although slightly smaller than that in UVMR, after adjusting for the other six remaining risk factors. This suggested that a mediating effect of DBP on CAD was likely via SBP. However, we should be cautious in interpreting any mediating effect, as it relies on the correct inference by both UVMR and MVMR. It is also possible that the total effect of DBP on CAD shown in UVMR was due to the fact that many of the IVs for DBP had pleiotropic effects on SBP (or a closely related trait), some of which may be too weak to be detectable, so the significant result with the restricted set of 16 IVs might still be biased, and the different results between UVMR-cML and MVMR-cML could be due to their differing adjustments for pleiotropy via SBP.

Figure 4.

Figure 4

The estimated effects (and 95% confidence intervals) of DBP on CAD with various sets of exposures by MVMR-cML-DP

Left corresponds to the sets of two exposures (DBP plus one of the other seven risk factors). Right corresponds to the sets of the six exposures after excluding one of seven risk factors marked out in the left. Results from UVMR-cML-DP and MVMR-cML-DP in Figure 3 are also added at bottom for comparison.

Lastly, we applied other robust MVMR methods. All methods gave results in line with that of MVMR-cML, suggesting a null direct effect of DBP on CAD, and their point estimates were also in the same direction (see Figure S1).

Discussion

We have proposed a robust and efficient multivariable Mendelian randomization method based on constrained maximum likelihood called MVMR-cML. It is an important and useful extension of the UVMR-cML.7 We have shown in both simulations and a real data application that, compared to its univariable counterpart (and other UVMR methods), MVMR-cML has two main advantages (while maintaining the major advantages of UVMR-cML, such as its estimation efficiency and robustness). First, MVMR estimates the direct effect of an exposure on the outcome after accounting for the other exposures included in the model, while UVMR only estimates the total effect. When there are causal pathways among the exposures, the direct effect of an exposure is in general different from its total effect on an outcome. Second, MVMR can account for some known pleiotropic effects through other exposures included in the MVMR model, making it more robust to pleiotropy than its univariable counterparts. This is important especially when (putative causal) risk factors, e.g., various lipids such as TG, LDL, and HDL, share some genetic associations. Although UVMR-cML and some other UVMR methods allow for invalid IVs under the plurality condition, it is still possible that some invalid IVs with weak pleiotropic effects cannot be selected out, leading to incorrect inference. On the other hand, an application of MVMR requires some assumptions beyond those of UVMR; in particular, MVMR requires at least one valid IV for each exposure; that is, the matrix of the marginal IV-exposure associations has a full column rank. We have given an example in mediation analysis in which UVMR, but not MVMR, would work. More generally, as in multiple regression, if the marginal association matrix is nearly singular, multicollinearity may lead to unstable estimates and thus inflated type-I errors and loss of power in MVMR; a conditional F-test has been proposed to detect such a case.17 In addition, all the current MVMR methods require the use of (nearly) independent IVs, leading to the use of fewer IVs (than in UVMR) and thus possibly exacerbating the issue of multicollinearity. An extreme example is when there is only one valid IV for one exposure in MVMR; removing this IV would lead to the non-identifiable MVMR model and thus unreliable estimates. These issues can be regarded as a price we pay for using MVMR.

There are a few advantages of our proposed method over other existing MVMR methods. First, MVMR-cML has nice statistical properties such as selection consistency, estimation consistency, and asymptotic normality with strong theoretical support. Second, as highlighted in our simulation studies, MVMR-cML was shown to have the smallest bias and MSE under various pleiotropic and/or weak IV settings among all MVMR methods being compared. In particular, MVMR-cML-DP consistently performed best, especially in controlling type-I errors. Compared to MVMR-cML-BIC, MVMR-cML-DP accounts for model selection uncertainties ignored by the former, often performs better for finite samples, and thus is recommended. Third, most of the MVMR methods are based on the two-sample MR setup, which means that all the exposure and outcome GWAS were performed in non-overlapping (and unrelated) samples. Our likelihood framework can account for overlapping samples by taking into account of the correlations among genetic associations with the exposures and outcome. This avoids sample splitting into non-overlapping subsets for analysis with reduced power (e.g., in Davies et al.38; Sanderson et al.20). Fourth, although in this paper we focused on testing for each direct effect separately, it is straightforward to test for a subset of multiple causal effects jointly in MVMR-cML. Furthermore, unlike some other MVMR methods, we do not need the assumption of no measurement errors of SNP-exposure associations (NOME) in our method by directly accounting for the variation of the estimated associations (βˆXli) in the likelihood framework. This contributes to its robustness to weak IV biases.14,17,39

There are some limitations with our method. First, as in other MVMR methods, we assume a linear and homogeneous effect of each exposure on the outcome.14,16 When a linear effect is absent, it does not necessarily imply no causal effects. Some exposures might have a non-linear effect on the outcome; for example, a U-shaped or J-shaped effect of DBP on CAD has been reported in previous studies.40,41 Second and importantly, as in multiple regression, if MVMR fails to detect a direct effect, it could be due to its low power with multicollinearity if many related exposures are included. Third, despite the two advantages of MVMR over UVMR mentioned earlier, when an estimated direct effect in MVMR differs from an estimated total effect in UVMR, it may be difficult to distinguish whether such a difference is due to mediating effects of other exposures or to pleiotropic effects of genetic variants on the outcome or both. Finally, though emerging as a powerful tool for causal inference with observational data, various (MV)MR methods depend on their own assumptions as well as on the quality of the genetic variants as IVs and, more generally, on the GWAS data being used. MR cannot completely replace traditional experimental studies, while triangulation through more applications of various MR methods to real data would be worthwhile and warranted.

Acknowledgments

The authors thank the reviewers for many insightful and constructive comments. This research was supported by NIH grants R01 AG065636, R01 AG069895, RF1 AG067924, U01 AG073079, R01 AG074858, R01 HL116720, and R01 GM126002 and by the Minnesota Supercomputing Institute at the University of Minnesota.

Declaration of interests

The authors declare no competing interests.

Published: March 21, 2023

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.ajhg.2023.02.014.

Web resources

Conditional F-test, https://github.com/WSpiller/MVMR

IEU open GWAS project, https://gwas.mrcieu.ac.uk

MVMR-IVW, MVMR-Egger, https://cran.r-project.org/web/packages/MendelianRandomization

MVMR-Median, MVMR-Robust, MVMR-Lasso, https://github.com/aj-grant/robust-mvmr

TwoSampleMR, https://mrcieu.github.io/TwoSampleMR/

UVMR-cML, https://github.com/xue-hr/MRcML

Supplemental information

Document S1. Figures S1–S5, Tables S1–S19, and supplemental methods
mmc1.pdf (538.7KB, pdf)
Document S2. Article plus supplemental information
mmc2.pdf (1.2MB, pdf)

Data and code availability

The GWAS summary datasets used in the real data analysis are all publicly available. Computer code for simulation studies and real data analysis is available at https://github.com/ZhaotongL/MVMR-paper. The software for MVMR-cML is publicly available on GitHub at https://github.com/ZhaotongL/MVMR-cML.

References

  • 1.Smith G.D., Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 2003;32:1–22. doi: 10.1093/ije/dyg070. [DOI] [PubMed] [Google Scholar]
  • 2.Zhu X. Mendelian randomization and pleiotropy analysis. Quant. Biol. 2021;9:122–132. doi: 10.1007/s40484-020-0216-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Boehm F.J., Zhou X. Statistical methods for mendelian randomization in genome-wide association studies: A review. Comput. Struct. Biotechnol. J. 2022;20:2338–2351. doi: 10.1016/j.csbj.2022.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sleiman P.M.A., Grant S.F.A. Mendelian randomization in the era of genomewide association studies. Clin. Chem. 2010;56:723–728. doi: 10.1373/clinchem.2009.141564. [DOI] [PubMed] [Google Scholar]
  • 5.Hemani G., Bowden J., Davey Smith G. Evaluating the potential role of pleiotropy in mendelian randomization studies. Hum. Mol. Genet. 2018;27:R195–R208. doi: 10.1093/hmg/ddy163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lin Z., Deng Y., Pan W. Combining the strengths of inverse-variance weighting and egger regression in mendelian randomization using a mixture of regressions model. PLoS Genet. 2021;17:e1009922. doi: 10.1371/journal.pgen.1009922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Xue H., Shen X., Pan W. Constrained maximum likelihood-based mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. Am. J. Hum. Genet. 2021;108:1251–1269. doi: 10.1016/j.ajhg.2021.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Morrison J., Knoblauch N., Marcus J.H., Stephens M., He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nat. Genet. 2020;52:740–747. doi: 10.1038/s41588-020-0631-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Qi G., Chatterjee N. Mendelian randomization analysis using mixture models for robust and efficient estimation of causal effects. Nat. Commun. 2019;10:1941. doi: 10.1038/s41467-019-09432-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cheng Q., Zhang X., Chen L.S., Liu J. Mendelian randomization accounting for complex correlated horizontal pleiotropy while elucidating shared genetic etiology. Nat. Commun. 2022;13:6490. doi: 10.1038/s41467-022-34164-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hu X., Zhao J., Lin Z., Wang Y., Peng H., Zhao H., Wan X., Yang C. Mendelian randomization for causal inference accounting for pleiotropy and sample structure using genome-wide summary statistics. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2106858119. e2106858119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Burgess S., Thompson S.G. Multivariable mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am. J. Epidemiol. 2015;181:251–260. doi: 10.1093/aje/kwu283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Waterworth D.M., Ricketts S.L., Song K., Chen L., Zhao J.H., Ripatti S., Aulchenko Y.S., Zhang W., Yuan X., Lim N., et al. Genetic variants influencing circulating lipid levels and risk of coronary artery disease. Arterioscler. Thromb. Vasc. Biol. 2010;30:2264–2276. doi: 10.1161/ATVBAHA.109.201020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Carter A.R., Sanderson E., Hammerton G., Richmond R.C., Davey Smith G., Heron J., Taylor A.E., Davies N.M., Howe L.D. Mendelian randomisation for mediation analysis: current methods and challenges for implementation. Eur. J. Epidemiol. 2021;36:465–478. doi: 10.1007/s10654-021-00757-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sanderson E. Multivariable mendelian randomization and mediation. Cold Spring Harb. Perspect. Med. 2021;11:a038984. doi: 10.1101/cshperspect.a038984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Grant A.J., Burgess S. Pleiotropy robust methods for multivariable mendelian randomization. Stat. Med. 2021;40:5813–5830. doi: 10.1002/sim.9156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sanderson E., Spiller W., Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable mendelian randomization. Stat. Med. 2021;40:5434–5452. doi: 10.1002/sim.9133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rees J.M.B., Wood A.M., Burgess S. Extending the mr-egger method for multivariable mendelian randomization to correct for both measured and unmeasured pleiotropy. Stat. Med. 2017;36:4705–4718. doi: 10.1002/sim.7492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Guo Z., Kang H., Tony Cai T., Small D.S. Confidence intervals for causal effects with invalid instruments by using two-stage hard thresholding with voting. J. Roy. Stat. Soc. B Stat. Methodol. 2018;80:793–815. [Google Scholar]
  • 20.Sanderson E., Davey Smith G., Windmeijer F., Bowden J. An examination of multivariable mendelian randomization in the single-sample and two-sample summary data settings. Int. J. Epidemiol. 2019;48:713–727. doi: 10.1093/ije/dyy262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bulik-Sullivan B., Finucane H.K., Anttila V., Gusev A., Day F.R., Loh P.-R., ReproGen Consortium. Psychiatric Genomics Consortium. Genetic Consortium for Anorexia Nervosa of the Wellcome Trust Case Control Consortium 3. Duncan L., et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 2015;47:1236–1241. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kim J., Bai Y., Pan W. An adaptive association test for multiple phenotypes with gwas summary statistics. Genet. Epidemiol. 2015;39:651–663. doi: 10.1002/gepi.21931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhao Q., Wang J., Hemani G., Bowden J., Small D.S. Statistical inference in two-sample summary-data mendelian randomization using robust adjusted profile score. Ann. Stat. 2020;48:1742–1769. [Google Scholar]
  • 24.van der Harst P., Verweij N. Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease. Circ. Res. 2018;122:433–443. doi: 10.1161/CIRCRESAHA.117.312086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Willer C.J., Schmidt E.M., Sengupta S., Peloso G.M., Gustafsson S., Kanoni S., Ganna A., Chen J., Buchkovich M.L., Mora S., et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 2013;45:1274–1283. doi: 10.1038/ng.2797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Locke A.E., Kahali B., I Berndt S., Anne E.J., Pers T.H., Day F.R., Powell C., Vedantam S., Buchkovich M.L., Yang J., et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206. doi: 10.1038/nature14177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wood A.R., Esko T., Yang J., Vedantam S., Pers T.H., Gustafsson S., Chu A.Y., Estrada K., Luan J., Kutalik Z., et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 2014;46:1173–1186. doi: 10.1038/ng.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Scott R.A., Lagou V., Welch R.P., Wheeler E., Montasser M.E., Luan Jian’an, Mägi R., Strawbridge R.J., Rehnberg E., Gustafsson S., et al. Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat. Genet. 2012;44:991–1005. doi: 10.1038/ng.2385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Evangelou E., Warren H.R., Mosen-Ansorena D., Mifsud B., Pazoki R., Gao H., Ntritsos G., Dimou N., Cabrera C.P., Karaman I., et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 2018;50:1412–1425. doi: 10.1038/s41588-018-0205-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hemani G., Zheng J., Elsworth B., Wade K.H., Haberland V., Baird D., Laurin C., Burgess S., Bowden J., Langdon R., et al. Base Collaboration The mr-base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. doi: 10.7554/eLife.34408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hartwig F.P., Tilling K., Davey Smith G., Lawlor D.A., Borges M.C. Bias in two-sample mendelian randomization when using heritable covariable-adjusted summary associations. Int. J. Epidemiol. 2021;50:1639–1650. doi: 10.1093/ije/dyaa266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Joseph G., Carolina Borges M., Smith George D., Sanderson E. Multivariable mr can mitigate bias in two-sample mr using covariable-adjusted summary associations. medRxiv. 2022 doi: 10.1101/2022.07.19.22277803. Preprint at. [DOI] [Google Scholar]
  • 33.Cook R.D. Detection of influential observation in linear regression. Technometrics. 1977;19:15–18. [Google Scholar]
  • 34.Zuber V., Colijn J.M., Klaver C., Burgess S. Selecting likely causal risk factors from high-throughput experiments using multivariable mendelian randomization. Nat. Commun. 2020;11:1–11. doi: 10.1038/s41467-019-13870-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Holmes M.V., Asselbergs F.W., Palmer T.M., Drenos F., Lanktree M.B., Nelson C.P., Dale C.E., Padmanabhan S., Finan C., Swerdlow D.I., et al. Mendelian randomization of blood lipids for coronary heart disease. Eur. Heart J. 2015;36:539–550. doi: 10.1093/eurheartj/eht571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Arvanitis M., Qi G., Bhatt D.L., Post W.S., Chatterjee N., Battle A., McEvoy J.W. Linear and nonlinear mendelian randomization analyses of the association between diastolic blood pressure and cardiovascular events: the j-curve revisited. Circulation. 2021;143:895–906. doi: 10.1161/CIRCULATIONAHA.120.049819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Levin M.G., Klarin D., Walker V.M., Gill D., Lynch J., Lee K.M., Assimes T.L., Natarajan P., Hung A.M., Edwards T., et al. Genetic variation in blood pressure and lifetime risk of peripheral artery disease: A mendelian randomization study. medRxiv. 2020 doi: 10.1101/2020.08.23.20180240. Preprint at. [DOI] [Google Scholar]
  • 38.Davies N.M., Hill W.D., Anderson E.L., Sanderson E., Deary I.J., Davey Smith G. Multivariable two-sample mendelian randomization estimates of the effects of intelligence and education on health. Elife. 2019;8:e43990. doi: 10.7554/eLife.43990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Burgess S., Thompson S.G. Bias in causal estimates from mendelian randomization studies with weak instruments. Stat. Med. 2011;30:1312–1323. doi: 10.1002/sim.4197. [DOI] [PubMed] [Google Scholar]
  • 40.Srinivasan B., Chertow G.M., Cheung A.K., Cushman W.C., Rahman M., Greene T., Guo W., Campbell R.C., Conroy M., Freedman B.I., et al. Influence of baseline diastolic blood pressure on effects of intensive compared with standard blood pressure control. Circulation. 2018;137:134–143. doi: 10.1161/CIRCULATIONAHA.117.030848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Liang F., Wang Y. Coronary heart disease and atrial fibrillation: A vicious cycle. Am. J. Physiol. Heart Circ. Physiol. 2021;320:H1–H12. doi: 10.1152/ajpheart.00702.2020. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S5, Tables S1–S19, and supplemental methods
mmc1.pdf (538.7KB, pdf)
Document S2. Article plus supplemental information
mmc2.pdf (1.2MB, pdf)

Data Availability Statement

The GWAS summary datasets used in the real data analysis are all publicly available. Computer code for simulation studies and real data analysis is available at https://github.com/ZhaotongL/MVMR-paper. The software for MVMR-cML is publicly available on GitHub at https://github.com/ZhaotongL/MVMR-cML.


Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES