Summary
Mendelian randomization (MR) is an instrumental variable approach used to infer causal relationships between exposures and outcomes, which is becoming increasingly popular because of its ability to handle summary statistics from genome-wide association studies. However, existing MR approaches often suffer the bias from weak instrumental variables, horizontal pleiotropy and sample overlap. We introduce MRBEE (MR using bias-corrected estimating equation), a multivariable MR method capable of simultaneously removing weak instrument and sample overlap bias and identifying horizontal pleiotropy. Our extensive simulations and real data analyses reveal that MRBEE provides nearly unbiased estimates of causal effects, well-controlled type I error rates and higher power than comparably robust methods and is computationally efficient. Our real data analyses result in consistent causal effect estimates and offer valuable guidance for conducting multivariable MR studies, elucidating the roles of pleiotropy, and identifying total 42 horizontal pleiotropic loci missed previously that are associated with myopia, schizophrenia, and coronary artery disease.
Keywords: Multivariable Mendelian Randomization, weak instrument bias, sample overlap, measurement error, causal effect, horizontal pleiotropy
We introduce a multivariable MR method MRBEE that is capable of simultaneously removing weak instrument and sample overlap bias and identifying horizontal pleiotropy. The real data analyses offer valuable guidance for conducting MR studies, and identifying 42 horizontal pleiotropic loci associated with myopia, schizophrenia, and coronary artery disease.
Introduction
Mendelian randomization (MR) is an instrumental variable (IV) approach used to infer causal relationships between exposures and outcomes and can apply to summary statistics from genome-wide association studies (GWASs), providing a cost-effective and generalizable alternative to randomized controlled trials.1 Inverse-variance weighting (IVW)2 is the fundamental approach to perform MR with GWAS summary statistics, and the validity of which relies heavily on three so-called valid IV assumptions: the genetic IVs are (IV1) strongly associated with the exposures; (IV2) not directly associated with the outcome conditional on the exposures; and (IV3) not associated with any confounders of the exposure-outcome relationships. Violations of the (IV1)–(IV3) assumptions will introduce weak instrument,3 unbalanced uncorrelated horizontal pleiotropy (UHP),4 and correlated horizontal pleiotropy (CHP)5 biases into the casual effect estimation, respectively. As for balanced UHP, which aligns with the instrument strength independent of direct effect (InSIDE) assumption,4 the causal effect estimation remains unbiased.
From a statistical perspective, both unbalanced UHP and CHP in an MR model exhibit characteristics similar to outliers in traditional regression analyses. Therefore, these issues can be addressed using robust statistical tools. In the literature, MR-PRESSO6 and IMRP7 identify and remove horizontal pleiotropic variants through hypothesis tests, while the MR-Lasso8 and MRcML9 methods detect horizontal pleiotropy through variable selection tools. On the other hand, approaches like MR-Median10 and MR-Robust11 employ robust loss functions to mitigate the horizontally pleiotropic effects. Furthermore, Gaussian mixture models are implemented in methods such as MRMix,12 MR-Conmix,13 CAUSE,5 MRAID,14 and MR-CUE.15 These models offer an advantage over traditional robust tools by utilizing fewer degrees of freedom to describe unbalanced UHP and CHP, thereby increasing efficiency when the mixture models are correctly specified.
While univariable MR (UVMR) methods allow some IVs to have horizontally pleiotropic effects, they generally assume that most IVs influence the outcome solely through the mediation of the exposure. However, this assumption can be problematic when traits share more than 50% causal variants. For instance, both systolic and diastolic blood pressure (SBP and DBP)16 are revealed to share substantial causal variants. When analyzing the causal effect of SBP on cardiovascular disease, it is often challenging to remove the effect through DBP. A more effective way to address this issue is multivariable MR (MVMR), which accounts for the majority of horizontally pleiotropic variants that are shared by multiple exposures.17 To date, the multivariable versions of IVW,18 MR-Egger,19 MR-Median,10 and MRcML20 have been developed. As demonstrated by Sanderson et al.,17 MVMR is reliable in estimating the direct causal effects of one or more exposures.
The issue of weak instrument bias, stemming from the violation of the (IV1) assumption, poses even more challenging to resolve in MVMR than in UVMR. Specifically, it is usually difficult to find a set of IVs that are strongly associated with all exposures under consideration. In contrast, IVs are generally selected if they are associated with at least one exposure.21 With the growing identification of causal variants for complex traits, the pool of IVs used in MVMR can easily reach the thousands due to this IV selection procedure, therefore worsen weak instrument bias. Traditional approaches to mitigate this bias involve discarding weak IVs whose F-statistic or conditional F-statistic is less than 10. This threshold is believed to keep the relative bias in causal effect estimates below 10%.3,22 However, the exclusion of IVs can lead to reduced statistical power and introduce a “winner’s curse,” thereby compromising the validity of the causal inference.23
We propose to resolve the weak instrument bias by using tools in measurement error analysis.24 Specifically, measurement error bias occurs when explanatory variables are measured with random error, leading to biased estimates of model parameters. Since current MR approaches are performed with GWAS summary statistics that always contain estimation errors, the causal effect estimates inevitably suffer from measurement error bias.25,26 Therefore, we view a weak instrument as a relatively large measurement error in effect size estimate based on finite sample size and is the primary reason for violating assumption (IV1) in IVW and other MR approaches. Furthermore, unlike traditional measurement error analyses that assume uncorrelated estimation errors in exposures and outcomes, overlapping individuals in exposure and outcome GWAS can result in correlated measurement errors, leading the direction of measurement error bias either toward or away from zero. As we observed in Figure 1, IVW estimates27 exhibit negative bias with small numbers of overlapping samples and positive bias with large numbers of overlapping samples, respectively.
We develop a computationally efficient MVMR method, MR using bias-corrected estimating equations (MRBEE), to eliminate weak instrument bias while simultaneously accounting for horizontal pleiotropy in the presence of weak IVs or sample overlap. In contrast, existing methods only address weak instrument bias in specific cases such as no sample overlap (debiased IVW)26 or no horizontal pleiotropy (MRlap).28 Although the multivariable MRcML methods20 generally provide unbiased causal estimates, they may be vulnerable to horizontal pleiotropy and computationally intensive. To underscore its practical significance, we apply MRBEE to three datasets, each targeting a unique disease, namely, myopia, schizophrenia (SCZ), and coronary artery disease (CAD), with the aim to unravel the distinct causal exposures associated with each. In addition, we extend the pleiotropy test to a genome-wide pleiotropy test (GWPT) for detecting novel loci. These empirical analyses offer valuable guides for conducting MVMR studies, elucidating the roles of pleiotropy and weak instrument bias, and illustrating how to identify novel loci through pleiotropy tests. The study was approved by the institutional review board (IRB number: STUDY20180592) at Case Western Reserve University.
Results
Overview of method
The detailed MRBEE is described in the material and methods section. Briefly, suppose that there are p exposures having causal effects on an outcome and m genetic variants as IVs. Let be a vector of length , representing the genetic effect sizes of IVs on the outcome, be an matrix with representing the genetic effect sizes of the th IV on the p exposures, be a vector of length representing the causal effects of the exposures on the outcome, and be a vector of length representing horizontal pleiotropy. We model the causal effects of the exposures on the outcome by
The goal in MR analysis is to estimate the causal effects unbiasedly. In the above equation, the true genetic effect sizes and are not observed but can be estimated through the GWAS of exposures and outcome and the pleiotropy effect is simply unknown. Let and be the effect size estimates of the th IV from the outcome and exposure GWASs. We have
where and represent the measurement errors because of finite sample sizes of the GWASs.
In general, an MVMR analysis is performed by the following linear regression:
where represents the residual. When we standardize and by their corresponding standard errors obtained from GWASs, the multivariable IVW (MV-IVW) estimates is
which is equivalent to solve the score equation = 0. However, the MV-IVW fails to consider the weak IVs and the correlations among and induced by sample overlap and assumes pleiotropy = 0 for all IVs. Thus, the MV-IVW is biased. To solve this problem, we propose MRBEE by solving the following estimating equation,
where and represent the covariance matrix among and between and (material and methods) in the set of the IVs. The score function in adds a corrected term, which corrects the bias because of weak IVs and sample overlap, meanwhile assumes there are no pleiotropic IVs ( The solution of the equation is
With the presence of pleiotropic IVs, we apply an iterative procedure7 with the pleiotropy test for multiple exposures and an outcome, which uses the following statistic for the th IV,
Thus, MRBEE estimates the causal effect , and identifies pleiotropic IVs with the current estimated causal effect iteratively. The entire pipeline from inputting summary statistics, estimating causal effects, and identifying pleiotropic IVs is illustrated in Figure 1. Note that after estimating the causal effect , we can further search pleiotropic variants across the entire genome.
Simulation
We compared MRBEE with the multivariable MR of IVW, MR-Egger, MR-Median, MR-Lasso, MRcML-DP, and MRcML-BIC. MRBEE is implemented with the R package MRBEE and the other methods are implemented through the R package MendelianRandomization.29 We call IVW, MR-Egger, MR-Median, and MR-Lasso the traditional MVMR methods, as they either do not account for estimation error of effect size or the sample overlap. Our simulation setting was adapted from the ones considered by Lin et al.,20 but with specific adjustments to better reflect real-world situations. Specifically, we set the heritability of both exposures and confounders at 0.1, introduced moderate genetic correlations among the exposures, and added correlations among random errors of exposure and outcome GWAS cohorts. In our analysis, we consider three scenarios: no pleiotropy, 30% unbalanced UHP, and 30% CHP. All exposures are assumed to come from the same GWAS sample, while the outcome may overlap completely (100% sample overlap), partially (50% and 77% overlap), or be entirely independent (0% sample overlap). In addition, the sample size was set at 50,000, the number of IVs was set at 50, 100, and 200, representing the increasing of weak IVs, the number of exposures was 4, and the causal effect was , which represents no causal effect, and positive, negative, and large causal effects, respectively. Simulation settings are fully presented in supplement 1 and the R code used to generate simulated data is available at the GitHub repository of this paper. The number of simulation replicates was 500, and additional simulations can be found in supplement 1.
Bias of causal effect estimates
Figures 2A, 2D, 2G, and 2J demonstrate that the bias in traditional MVMR methods (IVW, MR-Egger, MR-Median, MR-Lasso) is proportional to the number of IVs used, especially in the absence of horizontal pleiotropy. The direction of this bias is influenced by sample overlap: no overlap results in bias toward the null, while sample overlap leads to bias away from the null. On the contrary, MRBEE, MRcML-BIC, and MRcML-DP are unbiased under these conditions. The unbiasedness for MRcML methods is likely attributed to the fact that the objective function of MRcML methods20 accounts for the covariance of estimation errors. Our results suggest that incorporating the estimation error covariance matrix mitigates measurement error bias.
Figures 2B, 2E, 2H, and 2K demonstrate that when there was 30% unbalanced UHP, IVW, and MR-Egger generally incurred substantial bias. Moreover, there were inflated standard errors in the causal estimates due to the horizontal pleiotropy. MR-Median and MR-Lasso also incurred substantial bias, but the standard errors of their causal estimates were smaller than that from IVW and MR-Egger. These methods apply robust tools to estimate the causal effects in the presence of horizontal pleiotropy but are not able to remove the bias by weak instrument or sample overlap. MRcML-BIC and MRcML-DP generally provided unbiased causal estimates when there was no sample overlap. When the sample overlap percentage was 100%, both MRcML-BIC and MRcML-DP incurred biases in different directions. The magnitude of this bias was proportional to the number of IVs used. For example, for exposure 1 with true , MRcML-BIC and MRcML-DP had bias away from the null; for exposure 3 where , the two methods had bias toward, and even past, the null. In comparison, MRBEE was unbiased in all scenarios except when there were 200 IVs and 100% overlap. In this case, MRBEE still had a smaller upward bias for exposure 3 with than other methods.
Figures 2C, 2F, 2I, and 2L demonstrate that when there was 30% CHP, IVW, and MR-Egger had larger bias and standard errors in their causal estimates than the rest of methods. Both had bias away from the null for exposure 1, and the magnitude of which depended on the number of IVs used. MR-Median and MR-Lasso generally were less biased than IVW and MR-Egger, as they are more robust in handling of CHP IVs. The weak instrument bias of MR-Median and MR-Lasso followed the same bias patterns as no pleiotropy. MRcML-BIC and MRcML-DP were both biased when the sample overlapping percentage was 100% or 0%, potentially due to the instability of algorithm when horizontal pleiotropy is present. MRBEE was unbiased in all cases and generally had standard errors comparable to other methods excluding IVW and MR-Egger. Finally, when the number of IVs increased from 50 to 200, representing the increasing of weak IVs, MRBEE was always performing better than the comparing methods (Figure 2).
Type I error and power
Figures 3A–3C present the type I error rates for all the methods when the true causal effect , which corresponds to the first exposure in our simulations. When there was no sample overlap between exposures and the outcome, the type I error was well controlled for MRBEE in all three scenarios, i.e., no pleiotropy, 30% unbalanced UHP, and 30% unbalanced CHP. In comparison, MRcML-DP, MR-Median, MR-Egger, and IVW was generally conservative, while MRcML-BIC and MR-LASSO usually had inflated type I error rates. When 100% overlap between exposures and the outcome, MRBEE still controlled type I error rate well. The rest of the methods either had inflated or extremely conservative type I error rate.
Figures 3D–3L present power for different causal effects in the three scenarios. Overall, MRBEE has comparable power with the best of the other methods but maintains a type I error rate. We specifically compared MRBEE and MRcML-DP, where the latter controlled type I error rate well under all the simulation scenarios. We observed that MRBEE either had similar or better power than MRcML-DP. The power pattern across the seven methods does not align well with the type I error pattern, that is, high type I error rate corresponds to high power and vice visa. We observed that the reason is the bias direction in causal effect estimates as illuminated in Figure 2, i.e., the bias direction could be in opposite to the true causal effect.
Again, when the number of IVs increased from 50 to 200, the performance of type I error and power of MRBEE was either equal well or better than the comparing methods. We further evaluated these approaches in terms of the root-mean-square error (RMSE), standard error (SE) estimation, and coverage frequency of causal effects. MRBEE was again the best among the methods evaluated (Figures S1–S3, and Tables S1–S24 in supplement 1).
We have performed additional simulations in which the overlapping proportion takes values 0, 0.5, 0.77, and 1. In these scenarios MRBEE still performs well. The results are presented in supplement 1.
Computational efficiency
Figure 4 illuminates the computation efficient across seven methods. We observed that MRBEE is computationally as efficient as MR-Median, MR-Lasso, MR-Egger, and MR-IVW. We attribute the computational requirements of MRcML to two potential factors. First, MRcML methods utilize an algorithm similar to the best subset selection to identify the optimal subset of pleiotropic variants. This involves performing MVMR iteratively by considering numbers of pleiotropic variants ranging from 1 to (defaulting to ), and determining the optimal number based on the BIC criteria. In contrast, MRBEE automatically detects pleiotropic variants using a hypothesis test, and MR-Lasso utilizes lasso for pleiotropic variant selection, both of which are computationally efficient. Second, MRcML-DP relies on permutations to derive the SE, which further increases computational burden. Conversely, MRBEE uses the sandwich formula to estimate its SE, which appears to be accurate in our simulations (Equation 19 in material and methods).
Real data analysis
Data sources
To demonstrate the MRBEE performance in real data analysis, we analyzed three outcomes, including myopia, SCZ, and CAD. Myopia is known to be influenced by a combination of genetic and environmental factors, including educational attainment (EDU), near-work activity, and outdoor activities30 but their direct causality to myopia is not clear. In this MVMR analysis, we considered refractive error, the measure of myopia degree, as the outcome. The exposures include EDU, near-work activity measured by time spent watching TV and playing on the computer (TV and Computer), and outdoor activity measured by time spent driving (Driving).
Attention-deficit/hyperactivity disorder (ADHD), cannabis use disorder (CAN), EDU, intelligence (INT), left-handedness (LH), intelligence (INT), neuroticism (SESA), and sleep duration (SLP) have been reported as risk factors for SCZ. Of these risk factors, CAN arguably has the strongest evidence of causality with respect to SCZ, with studies reporting dose-response31 and strong temporal32 relationships for at-risk individuals. The direct causality of the risk factors on SCZ is also not clear.
Many studies have been published to understand the causal effects of risk factors on CAD. However, the findings in these studies have been inconsistent. For instance, Holmes et al. and Lin et al.20,33 found that HDL-C is not significant, while Zhu et al.,7 using the GWAS summary data with a much larger sample size (1.3M vs. 90K), found it to be significant. Besides, Wang et al.21 found that low-density lipoprotein cholesterol (LDL-C) is not significant in European populations, which seems unreasonable. In this data analysis, we investigated the causal relationships of these risk factors on CAD using the GWAS summary data with the largest sample sizes to date. We focus on the same eight factors studied in Lin et al.,20 i.e., body mass index (BMI), DBP, fasting plasma glucose (FPG), height, HDL-C, LDL-C, triglycerides (TG), and SBP.
Data preprocessing
For UVMR, we applied the C + T procedure with the linkage disequilibrium (LD) parameter in Kb window to SNPs with association p value at least as small as 5E−5. We performed this operation for each exposure separately and used the 1000 Genomes project (phase 3) data as the LD reference panel.34 For MVMR, we initially considered the union set of all exposure-specific IV sets from UVMR, then restricted this set to only include SNPs with a joint -test p value reaching genome-wide significance and again passing C + T using the same parameters as before. The joint -test for the exposures is presented in Equation 22 in material and methods and is used to assess the null hypothesis that an SNP is not associated with any exposure. We additionally standardized the GWAS effect size estimates so that their SEs were the inverse of the sample sizes. This procedure leads to comparable causal effect estimates across different exposures. We used false discovery rate (FDR) correction in MRBEE to identify and remove SNPs with evidence of horizontal pleiotropy (see Algorithm 1).
Algorithm 1. Pseudo-code of MRBEE + pleiotropy test.
1. Input: Initial estimates , Bias-correction terms and , , FDR q-value threshold , Tolerance , Full set of IVs
2. Output: Causal effect estimates , Set of non-UHP/CHP IVs
- 3. Pseudo-code:
- Initialize
- While
- 1. Calculate for all ,
- 2. Update ,
- 3. Update using Equation 14 and IVs in
- End While
Table 1 summarizes the information of GWAS data in this study. In Table 1, the last three columns present the SNP heritability estimated by the LD score regression (LDSC),35 the variances explained by the IVs in UVMR, and the variances explained by the IVs in MVMR. It is evident that for the trait with lower heritability and small sample sizes, the UVMR IVs account for about 1% of its SNP heritability, which may reduce power to detect causal effects using UVMR. However, for most traits, IVs in MVMR analyses explain a substantial portion of the variance, which will provide good power to detect causal effects. This is because the standard error of causal estimate(s) is inversely proportional to the variance(s) explained by the IVs. The last column of Table 1 shows the reliability ratios, a measure of IV strength, for exposures used in real data analysis. The estimation errors averagely account for ∼20% variance of the GWAS effect estimates.
Table 1.
Trait | Source | Sample size | Significant IVs | LDSC heritability | UVMR variancea | MVMR varianceb | Reliability ratioc | |
---|---|---|---|---|---|---|---|---|
Myopia | Driving | van De Vegte etal. 54 | 422K | 4 | 0.0365 | 0.00034 | 0.00400 | 0.705 |
Playing computer | Arns et al. 55 | 422K | 46 | 0.0719 | 0.00408 | 0.01154 | 0.873 | |
Watching TV | Rustad et al. 56 | 422K | 189 | 0.1321 | 0.01788 | 0.02775 | 0.943 | |
EDU | Okbay et al. 57 | 765K | 656 | 0.1352 | 0.03954 | 0.03683 | 0.976 | |
Joint 4 test | 707 | |||||||
Refractive error | Hysi et al. 58 | 246K | 420 | 0.2702 | 0.11079 | 0.01433 | 0.838 | |
Schizophrenia | ADHD | Demontis et al. 59 | 55K | 12 | 0.0956 | 0.00279 | 0.01871 | 0.832 |
CAN | Johnson et al. 60 | 384K | 5 | 0.0174 | 0.00033 | 0.00272 | 0.552 | |
EDU | Okbay et al.57 | 765K | 656 | 0.1222 | 0.03954 | 0.03728 | 0.973 | |
Intelligence | Neale’s Lab | 430K | 48 | 0.2326 | 0.01527 | 0.06023 | 0.900 | |
Left handedness | Cuellar-Partida et al. 61 | 205K | 4 | 0.0338 | 0.00086 | 0.00533 | 0.576 | |
Neuroticism (SESA) | Nagel et al. 62 | 450K | 42 | 0.0800 | 0.00476 | 0.01056 | 0.825 | |
Sleep duration | Dashti et al. 63 | 493K | 66 | 0.0649 | 0.00589 | 0.00998 | 0.850 | |
Joint 7 test | 1,227 | |||||||
SCZ | Trubetskoy et al.64 | 320K | 287 | 0.3380 | 0.06378 | 0.03570 | 0.855 | |
Coronary artery disease | BMI | Loh et al. 65 | 458K | 882 | 0.2076 | 0.09494 | 0.10612 | 0.918 |
DBP | Evangelou et al. 66+MVP | 1.00M | 942 | 0.1095 | 0.06022 | 0.04525 | 0.823 | |
FPG | Neale’s Lab | 361K | 115 | 0.0848 | 0.03729 | 0.05156 | 0.789 | |
Height | Loh et al. 65 | 458K | 2,728 | 0.6023 | 0.48986 | 0.49156 | 0.981 | |
HDL-C | Graham et al. 67 | 1.32M | 1,031 | 0.1779 | 0.09207 | 0.09745 | 0.965 | |
LDL-C | Graham et al. 67 | 1.32M | 754 | 0.1293 | 0.08435 | 0.08713 | 0.961 | |
TG | Graham et al. 67 | 1.32M | 900 | 0.1251 | 0.07298 | 0.08105 | 0.959 | |
SBP | Evangelou et al. 66+MVP | 1.00M | 895 | 0.1152 | 0.05626 | 0.04550 | 0.829 | |
Joint 8 test | 4,336 | |||||||
CAD | Aragam et al.68+MVP | 1.45M | 343 | 0.0500 | 0.01610 | 0.01712 | 0.850 |
Variance explained by the IVs in UVMR analysis.
Variance explained by the IVs in MVMR analysis.
Reliability ratios of exposures in MVMR analysis.
Myopia
All the MVMR methods consistently showed that EDU (MRBEE p = 9.3E−21) and Driving (p = 3.8E−11) are directly causal on myopia, but not TV (p = 0.136) or Computer (p = 0.972) (Figure 5A). The no direct causal effect of TV or Computer on myopia risk although all exposures were observed to have significant causal effects on myopia in the UVMR analysis (Figure 5B). The insignificance of both TV and Computer in MVMR analysis suggests their correlations with myopia could be attributed to the confounding with EDU and Driving time. MRBEE provided larger protective causal estimate of driving time than that by IVW (i.e., MRBEE odds ratio [OR] = 0.71 vs. IVW OR = 0.84), likely due to a correction for weak instrument bias given that the driving time variance explained by the IVs was less than 1%. The causal effect of driving time estimated by MRcML-BIC and MRcML-DP was 3–5 times larger in magnitude than those from other methods. (MRcML-BIC OR = 0.38 and MRcML-DP OR = 0.58, respectively). In the iterative pleiotropy test, we detected 31 IVs demonstrating pleiotropy of the exposures and myopia (Figure 5C). Figure 5D compares the computational efficiency of the MR methods. We observed that IVW was the fastest (<0.1 s), followed by MRBEE (0.1 s), MR-Lasso (0.9 s), MR-Median (1.4 s), MRcML-BIC (1 min), and MRcML-DP (107 min), which were consistent with the simulations.
Schizophrenia
All MVMR methods consistently estimated that CAN (MRBEE p = 3.7E−8), EDU (p = 3.6E−15), INT (p = 7.7E−12), and SESA (p = 1.8E−7) have direct causal contributions on schizophrenia (Figure 6A). MRBEE, MR-Lasso, and MR-Median suggested that SLP (p = 3.4E−4) has direct causal contribution on schizophrenia but not MRcML-BIC or MRcML-DP. It is not clear why both MRcML-DP and MRcML-BIC failed to detect this causal contribution given our simulations suggest MRcML-BIC could be more powerful although with inflated type I error. A potential reason could be the instability of MRcML, which may converge to local maximum. However, this requires additional investigation. MRBEE suggested no direct causal effect of ADHD (p = 0.510) or LH (p = 0.096), possibly due to the relatively low exposure variance explained by the IV set (i.e., 0.018 for ADHD and 0.005 for LH). We observed relatively larger odds ratios of EDU and CAN for MRBEE than MR-Median, MR-Lasso, and IVW, but less than MRcML-DP and MRcML-BIC. In comparison, UVMR analyses by all methods suggested evidence of total causal effects of CAN (MRBEE p = 1.6E−4), INT (p = 3.2E−7), SESA (p = 2.0E−10), ADHD (p = 0.017), and SLP (p = 1.4E−4), but not EDU (p = 0.542) or LH (p = 0.716) (Figure 6B). We did not observe any IVs with evidence of horizontal pleiotropy at the Bonferroni-corrected p = 0.05 level (Figure 6C), suggesting that the genetic association of the IVs with SCZ are strictly mediated by the five significant exposures. Again, we observed similar computational efficient for these methods as before (Figure 6D).
Coronary artery disease
Figure 7A presents MVMR causal estimates for the effects of BMI, DBP, SBP, FPG, height, HDL-C, LDL-C, TG, and SBP on CAD. Using MRBEE, we identified the following significant direct causal effects on CAD, including BMI (MRBEE p = 3.8E−39), FPG (p = 6.7E−10), HDL-C (p = 8.4E−21), LDL-C (p = 1.5E−87), TG (p = 1.9E−7), and SBP (p = 1.1E−25). We observed that MRBEE estimates were generally consistent with estimates from IVW, MR-Median, and MR-Lasso. Conversely, MRcML-BIC and MRcML-DP estimates diverged from all other methods for DBP and FPG. For example, MRcML-DP/BIC were the only methods that simultaneously produced significant causal effects for SBP (p = 1.6E−39) and DBP (p = 1.8E−45) on CAD, two traits that are highly genetically correlated. In comparison, UVMR analyses by all methods consistently suggested that all the exposures have causal contributions on CAD (Figure 7B). We also observed 173 IVs demonstrating horizontal pleiotropy by the horizontal pleiotropy test in MVMR (Figure 7C). Again, our proposed MRBEE is computationally efficient (Figure 7D).
Pleiotropic variants detected by GWPT
GWPT uses the statistic (Equation 21 in materials and methods) to test whether a genetic variant is associated with the outcome phenotype strictly through the mediation of a select group of exposures. In our GWPT analyses, these groups of exposures are those that were used in each MVMR. This test can be used to find these outcome-associated loci16 that do not reach the level of genome-wide significance in the original outcome phenotype GWAS but are genome-wide significant in GWPT. In these regions, it is possible that the local genetic correlations between the exposures and outcome are of different sign or magnitude than the genome-wide genetic correlations.36 To ensure that the loci identified in GWPT were not primarily influenced by other exposures, we excluded any loci that showed even a marginal association with any of the exposures at a genome-wide significance level (i.e., p < 5E−8). We also compared GWPT with cross-phenotype association analysis (CPASSOC),37 multi-trait analysis of GWAS (MTAG),38 which are joint tests of association between all exposures in the outcome.
Table 2 lists the variants detected by GWPT for myopia, SCZ, and CAD but missed in the original GWAS. For comparison, we listed the p values for association from the original outcome GWAS, cross-phenotype tests by CPASSOC and MTAG, and by GWPT, respectively. The GWPT identified 18 genome-wide significant loci for myopia, four for SCZ, and 20 for CAD, respectively. All these loci did not reach genome-wide significance level in the outcome GWASs, suggesting GWPT captures pleiotropic evidence and the standard GWAS does not. We also performed expression quantitative trait loci (eQTL) mapping for the identified loci using functional mapping of GWAS (FUMA GWAS).39 Each SNP that tagged a locus had marginal evidence of association with the expression of a gene in that locus in at least one tissue, where association p values ranged from 3.3E−310 to 6.7E−5. This suggests that these loci may have functionally relevant consequences in their conferred risk for myopia, SCZ, or CAD.
Table 2.
SNP information |
Association test |
eQTL mapping |
||||||||
---|---|---|---|---|---|---|---|---|---|---|
SNP | CHR:BP | GWAS | MTAG | CPASSOC | GWPT | Symbol | Tissue | Database | p | |
Myopia | rs55761633 | 1:20757820 | 9.9e−07 | 3.1e−05 | 5.6e−07 | 9.6e−09 | CAMK2N1 | Muscle Skeletal | GTEx/v8 | 3.9e−06 |
rs2419964 | 2:124252256 | 1.3e−07 | 1.2e−05 | 2.6e−10 | 6.0e−10 | NA | NA | NA | NA | |
rs7602460 | 2:182261869 | 4.9e−07 | 9.2e−06 | 6.6e−07 | 3.2e−08 | ITGA4 | Blood | eQTLgen | 2.0e−19 | |
rs61548163 | 2:184349492 | 9.2e−07 | 1.3e−05 | 3.3e−06 | 2.7e−08 | NA | NA | NA | NA | |
rs6764842 | 3:123106287 | 1.6e−06 | NA | 2.0e−07 | 3.1e−08 | ADCY5 | Artery Tibial | GTEx/v8 | 3.0e−17 | |
rs9761983 | 4:138482973 | 1.5e−07 | NA | 9.0e−06 | 4.4e−08 | RP11-714L20.1 | Cortex | GTEx/v8 | 2.2e−06 | |
rs2461726 | 6:166316838 | 5.1e−07 | 1.7e−05 | 1.2e−06 | 1.1e−08 | SDIM1 | Pituitary | GTEx/v7 | 2.8e−07 | |
rs12699288 | 7:11975557 | 3.8e−06 | 1.4e−04 | 3.4e−08 | 1.4e−08 | THSD7A | Nerve Tibial | GTEx/v8 | 6.7e−05 | |
rs2970498 | 7:30478056 | 1.1e−07 | 1.8e−06 | 7.0e−06 | 3.1e−08 | NOD1 | Blood | eQTLgen | 1.0e−07 | |
rs1532278 | 8:27466315 | 3.4e−07 | 1.1e−05 | 9.6e−07 | 6.7e−09 | CLU | Eye | EyeGEx | 1.1e−26 | |
rs7048915 | 9:4206388 | 1.0e−07 | 2.0e−06 | 6.2e−06 | 1.3e−08 | NA | NA | NA | NA | |
rs902997 | 10:105384262 | 1.9e−07 | 9.2e−07 | 1.2e−08 | 3.2e−09 | USMG5 | Blood | eQTLgen | 3.0e−37 | |
rs17065719 | 13:44925021 | 4.3e−07 | 1.1e−05 | 2.6e−05 | 3.7e−08 | SERP2 | Blood | eQTLgen | 7.7e−48 | |
rs1926715 | 13:111538590 | 8.6e−08 | 1.4e−05 | 2.4e−12 | 2.0e−09 | ANKRD10 | Eye | EyeGEx | 3.0e−48 | |
rs7141076 | 14:67922172 | 9.2e−08 | 1.8e−05 | 5.7e−06 | 1.7e−08 | TMEM229B | Pituitary | GTEx/v8 | 1.1e−08 | |
rs12889206 | 14:68769182 | 8.8e−08 | 1.5e−06 | 1.2e−06 | 3.9e−08 | NA | NA | NA | NA | |
rs7198357 | 16:67884619 | 2.5e−07 | 4.2e−06 | 2.1e−09 | 4.3e−09 | DUS2 | Blood | eQTLgen | 3.3e−310 | |
rs35594082 | 16:84796864 | 8.5e−07 | 1.8e−05 | 1.8e−06 | 3.1e−08 | USP10 | Eye | EyeGEx | 2.9e−09 | |
Schizophrenia | rs17672204 | 5:74946518 | 1.1e−06 | 8.2e−06 | 1.9e−06 | 2.4e−08 | COL4A3BP | Muscle Skeletal | GTEx/v8 | 5.8e−15 |
rs79650876 | 3:187997616 | 1.7e−07 | 4.3e−08 | 4.5e−07 | 3.2e−08 | AC022498.1 | Blood | eQTLGen | 5.7e−06 | |
rs2300921 | 3:185651001 | 8.0e−06 | 8.9e−06 | 4.9e−06 | 3.2e−08 | TRA2B | Breast | GTEx/v8 | 1.8e−06 | |
rs7225476 | 17:78561603 | 8.2e−07 | 2.1e−06 | 7.0e−05 | 3.3e−08 | RPTOR | Blood | eQTLGen | 4.0e−89 | |
Coronary artery disease | rs2045886 | 2:29010517 | 3.6e−07 | 4.6e−05 | 2.7e−21 | 7.7e−11 | PPP1CB | Blood | eQTLGen | 3.3e−310 |
rs6727524 | 2:238570309 | 8.9e−07 | 6.3e−05 | 4.4e−09 | 2.8e−08 | LRRFIP1 | Blood | eQTLGen | 3.4e−76 | |
rs1868217 | 3:98445534 | 2.1e−05 | 4.5e−04 | 1.3e−10 | 3.6e−08 | ST3GAL6 | Blood | eQTLGen | 2.0e−30 | |
rs73070809 | 3:186885760 | 1.1e−07 | NA | 3.7e−07 | 1.0e−08 | RPL39L | Adipose | GTEx/v8 | 4.0e−05 | |
rs12523133 | 5:86297919 | 8.5e−08 | 2.3e−05 | 3.2e−19 | 2.5e−10 | RP11-72L22.1 | Spinal Cord | GTEx/v8 | 6.6e−08 | |
rs6899197 | 5:111250597 | 8.8e−06 | NA | 1.2e−20 | 2.2e−09 | EPB41L4A | Esophagus | GTEx/v8 | 5.0e−05 | |
rs13202921 | 6:41687366 | 3.3e−07 | 2.9e−06 | 1.8e−08 | 3.8e−08 | CCDC77 | Artery Coronary | GTEx/v7 | 4.9e−11 | |
rs2073533 | 7:14029739 | 1.1e−07 | 2.6e−05 | 5.2e−14 | 3.2e−11 | NA | NA | NA | NA | |
rs7822979 | 8:106468592 | 6.0e−08 | NA | 1.5e−08 | 1.6e−10 | NA | NA | NA | NA | |
rs4734881 | 8:106587829 | 7.8e−08 | 3.1e−05 | 5.3e−06 | 1.3e−10 | ZFPM2 | Esophagus | GTEx/v8 | 3.8e−08 | |
rs12375254 | 8:125054365 | 3.2e−07 | 4.0e−06 | 2.5e−04 | 1.0e−08 | TRMT12 | Blood | eQTLGen | 4.3e−15 | |
rs12412313 | 10:134456762 | 2.4e−06 | 6.8e−03 | 2.7e−19 | 2.3e−10 | INPP5A | Blood | eQTLGen | 6.6e−13 | |
rs7113595 | 11:70236819 | 7.8e−07 | 2.2e−05 | 2.7e−08 | 2.4e−08 | PPFIA1 | Blood | eQTLGen | 5.3e−305 | |
rs7315852 | 12:417633 | 3.0e−07 | NA | 7.7e−19 | 2.1e−11 | CCDC77 | Blood | eQTLGen | 3.3e−310 | |
rs55893521 | 15:83955536 | 2.2e−07 | 1.7e−04 | 3.4e−20 | 2.9e−09 | BTBD1 | Brain | xQTLServer | 3.4e−75 | |
rs12918327 | 16:30626616 | 6.8e−06 | 9.7e−04 | 2.8e−32 | 5.2e−10 | STX4 | Blood | eQTLgen | 1.1e−85 | |
rs9958798 | 18:52769637 | 4.0e−06 | NA | 9.9e−10 | 4.2e−08 | RP11-99A1.2 | Testis | GTEx/v8 | 6.3e−08 | |
rs35496634 | 22:39147235 | 8.5e−06 | 1.2e−03 | 4.6e−17 | 3.4e−08 | SUN2 | Blood | eQTLGen | 7.0e−79 | |
rs5757949 | 22:40820151 | 2.9e−06 | 9.3e−06 | 5.2e−15 | 3.8e−08 | MKL1 | Eye | EyeGEx | 1.6e−07 |
Discussion
We proposed MRBEE to overcome the weak instrument, pleiotropy and sample overlap bias in MVMR analysis. We pointed out that weak instrument bias is essentially driven by measurement error of GWAS effect estimates, whose scale and bias direction are influenced by the degree of weakness of IVs and the GWAS sample overlap, respectively. An IV is not considered weak when the estimation error is negligible, which can be achieved with a sufficiently large GWAS sample size, no matter how large or small the effect size is. In genetics, Burgess et al.3 suggested using the F-statistics to define the strength of an IV, whereas we recommend the reliability ratio (material and methods, Equation 11), a commonly used statistic in measurement error analysis. Both metrics are equivalent and will be influenced by the GWAS sample size and the number of IVs, highlighting that the definition of a weak instrument is dynamic. MRBEE removes the measurement error bias by using an unbiased estimating function. Although this estimating function has a long history in the literature of measurement error analysis,40 it has not been utilized to modify the current MVMR approaches.
Our simulations suggested that MRBEE in general leads to equal or less bias of causal effect estimate than the comparing methods when weak IVs, pleiotropy, and sample overlap are present (Figure 2). Similarly, MRBEE also has equal or better type I error control and statistical power than robust comparing methods (Figure 3). MRcML-DP and MRcML-BIC were robust to weak IVs and consistently yield unbiased causal effect estimates under the “no pleiotropy” case. However, in the presence of horizontal pleiotropy, our simulations suggested that MRcML methods may produce local minimizers in some specific scenarios in which horizontal pleiotropy was not completely removed. MRcML employs the best subset selection for detecting pleiotropy and the algorithm’s stability and time consumption could be a challenge,41 as we observed in our simulations. MRBEE uses an iterative pleiotropy test, whose reliability has been validated in MR-PRESSO and IMRP.
In the myopia analysis, our detected causal effects for outdoor activities are consistent with the literature. For example, spending more time outdoors reducing incident myopia was confirmed by a randomized clinical trial.42 On the other hand, near-work activities such as time spent watching TV or using the computer have not been found to be associated with myopia risk.43 The potential biological mechanism is that outdoor activities increase the exposure time to natural light, which induces the release of dopamine and thereby inhibits axial elongation, thus suppressing the development of myopia.30 Moreover, MRBEE yields a relatively large causal estimate for time spent driving, likely correcting for weak instrument bias given the small variation of driving time explained by the IVs. Although MRcML-DP and MRcML-BIC can effectively reduce weak instrument bias in simulations, their estimates for the effects of driving time were 3–5 times larger than those from other methods.
We observed that cannabis use disorder and education have substantially larger causal effects on SCZ than other exposures we examined. For LH, the current GWAS has identified four genome-wide significant IVs together explaining its 0.086% variation. As a result, we did not have sufficient power to confirm their causal effect due to its relatively smaller variance of LH explained by IVs. MRBEE did not identify pleiotropic variants in these data, suggesting that our study may already include most of the direct causal risk factors for SCZ.
Our MVMR analysis seems to suggest that HDL-C is likely a protective factor against CAD but with a weaker effect size than that from UVMR analysis, aligning with recent pharmaceutical trial outcomes.44 The previously observed negative results20,33 are likely because they did not utilize the lipid GWAS summary data with the largest available sample sizes. When using the largest GWAS summary statistics of CAD as in this study, all methods including IVW, MRcML-DP, and MRBEE resulted in significant protective causal effect of HDL-C on CAD (Figure 7A). We noted that the estimated equal contributions of DBP and SBP on CAD risk by MRcML-BIC and MRcML-DP, which is in direct conflict with all other MR methods we tested and the literature.45 In addition, MRBEE identified 173 pleiotropic IVs, one of which (rs10757278) is strongly associated with CAD (p < 5E−300) but whose biological mechanism warrants for further investigation.
We introduced the GWPT using the statistic , which can be applied in UVMR or MVMR to identify specific IVs with evidence of horizontal pleiotropy. When was applied to the whole genome, we identified genetic loci associated with myopia, SCZ, and CAD that were missed in their original GWAS. These loci also reflect their direct association with the outcomes or through exposures not included in this study. Genes in these loci had genome-wide significant eQTLs across a range of tissues, suggesting that these genes might be functionally relevant in modifying disease risk. For example, we identified the RPTOR gene for SCZ, which has previously been found to be associated with BMI46 and blood pressure.47 This gene also has significant eQTLs (smallest p = 4E−89) in blood tissue. This and other examples highlight the potential utility of in identifying trait-associated loci and functionally relevant genes.
In our theoretical study (supplement 2), we consider the effect size of IVs to follow a normal distribution, representing a genomic random effect model.48 We observed that increasing the sample size of GWAS often yields more novel loci, hence more IVs with non-zero effects can be used in a corresponding MR analysis. Therefore, in our theoretical investigation, we allow the number of IVs to increase with the sample size and examine the outcomes of MRBEE and MV-IVW under different rates of and . Our conclusion can be summarized as follows: for scenarios like those in our myopia and CAD data, where GWAS sample sizes for exposures are approximately half a million or more, MRBEE and MV-IVW are equally efficient (supplement 2, Theorem 1.3 (i)). In this case, MRBEE’s inference is asymptotically valid, whereas MV-IVW may lead to incorrect inferences. For the SCZ data involving CUD, with GWAS sample sizes in the tens of thousands, MRBEE is less efficient than MV-IVW, but the inference made by MRBEE remains valid (supplement 2, Theorem 1.3 (ii)-(iii)). In these cases, the confidence intervals of MRBEE will be wider than MV-IVW but ensure the 95% coverage frequency. Although MRBEE can remove the weak instrument bias in general, we still recommend including the IVs with the association p values below a significance threshold. The reason is that weak IVs still require to be truly associated with an exposure although their effect sizes can be extremely small. Variants with the association p values above the threshold are likely to be false positive and including false positive IVs will lead to bias for MRBEE because of the violation of assumption (IV1). The purpose of developing MRBEE is to enhance existing methods, making causal effect estimation and inference more robust to weak IVs.
The comparison between MRBEE and MRcML in terms of statistical principle is as follows. MRBEE employs the unbiased estimating function method which constructs its unbiased score function from the score function of the MV-IVW method. In contrast, the MRcML method is a conditional score function method, characterized by first estimating the sufficient statistic containing parameters to be estimated and then estimating the parameter based on this sufficient statistic through an iterative method.40 Although Stefanski and Carroll24 demonstrated that the conditional score function possesses statistical efficiency, whether this conclusion can be directly applied to the MRcML method requires further investigation. In contrast, our investigation shows that MRBEE reaches statistical efficiency if where is the minimum GWAS sample size (supplement 2, Theorem 1.3 (i)). Furthermore, our simulations in Section S1.3 of supplement 1 suggest that MRcML-DP tends to overestimate its SD (i.e., SE > SD), MRcML-BIC underestimates its SD (i.e., SE < SD), and MRBEE estimates its SD well in most cases, suggesting MRBEE can achieve more efficiency than MRcML. The exact reason that MRcML does not estimate SD well warrants further investigation.
MRBEE also has some limitations. MRBEE is ineffective in handling exposures associated with significantly weaker IVs, such as CUD and LH in the SCZ data. This is also a challenge inherited from the field of measurement error analysis. In this case, MRBEE and analogous methods such as MRcML tend to produce causal effect estimates with relatively large SE. MRBEE is effective when the proportion of pleiotropic variants is relatively low (e.g., below 30%). Incorporating a Gaussian mixture model with MRBEE might improve the robustness for scenarios with a high proportion of pleiotropic variants. Finally, MRBEE is designed to handle a fixed number of exposures. Expanding its capability to a high-dimensional MR model is warranted in future research.49
Last, it is worth offering guidance on how to perform MVMR analysis from our perspective. First, rather than selecting the optimal number of IVs such that the F-statistics and conditional F-statistics are larger than 10,3,22 we suggest including all independent IVs that are genome-wide significantly associated with at least one exposure. The main purpose of doing this is to reduce the winner’s curse. Our simulations found that all methods, including MRBEE, were affected by the winner’s curse, and the only way to alleviate the winner’s curse was to include as many causal variants as possible (supplement 1, and Figure S10). Besides, our theory (supplement 2, Theorem 1.2 and 1.3) illustrates that the asymptotic variance of a causal effect estimate is related to the cumulative variance explained by all specified IVs instead of the average variance explained by each IV. Hence, including more IVs in the MR model can reduce the variance of the related causal effect estimate. Second, when performing MVMR analysis, it is not necessary to remove variants that are pleiotropic between the exposures. The reason why Wang et al.21 found that LDL-C was not significant in European populations is likely caused by this procedure. In contrast, simultaneously including all the relevant exposures and their IVs is recommended because the multivariable regression can automatically account for the pleiotropic variants shared by the specified exposures. Third, we suggest conducting a GWPT after performing the MR analysis, which represents an effective multi-trait approach for discovering loci with pleiotropy effect, beyond current methods such as CPASSOC and MTAG. In statistical principle, GWPT is likely to identify new loci associated with the outcome if the effect directions of pleiotropy and exposure mediation are opposite in these genome regions.
During the revision of this manuscript, we noted that a recent preprint53 claimed that MRBEE was biased with extremely large SD and SE for some of simulation scenarios. We observed that the reason was that the authors of the preprint did not perform the standardization for the instrument effects of both exposures and outcome, which was documented in in the MRBEE software. We present the reproduction of Table 1 in the preprint before and after the standardization in Table S25 in supplement 1. The result indicates MRBEE has reasonable SD and SE. We have updated MRBEE software on GitHub and it does not need the standardization now.
Material and methods
MR model
We describe MRBEE with details here. As in the main text, let be a vector of independent genetic variants where each variant is standardized with mean zero and variance one, be a vector of exposures, and be an outcome. Consider the following linear structural model:
(Equation 1) |
(Equation 2) |
where is an matrix of genetic effects on exposures with being a vector of length , is a vector of length representing the causal effects of the exposures on the outcome, is a vector of length representing horizontal pleiotropy, which may violate the (IV2) or (IV3) conditions, and and are noise terms. Substituting for in (2), we obtain the reduced-form model:
(Equation 3) |
where
(Equation 4) |
In practice, and are usually correlated, and hence traditional linear regression between and cannot obtain a consistent estimate of . In contrast, the genetic variant vector is assumed to be independent of the noise terms and because the genotypes of individuals are randomly inherited from their parents and do not change during their lifetime.50 Hence, can be used as IVs to remove the confounding effect of and .
We assume that the genetic effect () is a -dimensional random vector with zero-mean, covariance matrix , and cumulative covariance matrix :
The covariance matrix will vanish as , but the cumulative covariance matrix is still a constant matrix, representing the total genetic covariance contributed from the IVs. The genetic variant (, ) is standardized so that and , and all IVs are assumed to be in linkage equilibrium (LE), i.e., cov for . Next, the noise terms and have zero-means and joint covariance matrix:
Thus, the exposure and outcome have zero-means and joint covariance matrix:
, , and . Note that means the confounders affect both and .
Bias of multivariable IVW estimate
Since large individual-level data from GWAS are less publicly available, most of the current MR analyses are performed with summary statistics of IVs through the following linear regression:
(Equation 5) |
where and are respectively estimated from the outcome and exposure GWASs, is the horizontal pleiotropy, represents the residual of this regression model, and referring to the IVs. MV-IVW, which is the foundation of most existing MR methods, estimates by
(Equation 6) |
where is a diagonal matrix consisting of the variance of estimation errors of . In practice, it is routine to standardize and by and to remove the minor allele frequency effect.16 With this standardization, the MV-IVW estimates by
(Equation 7) |
However, the MV-IVW estimate is biased due to the estimation errors of and :
(Equation 8) |
(Equation 9) |
To see this, observe the estimating equation and Hessian matrix of :
That is, is the score function of Equation 7 and is estimated by solving , and is the 2nd derivative matrix of Equation 7. As shown in supplement 2, since , the bias of is approximately:
(Equation 10) |
where
Interpretation of weak instrument bias
Here, can be regarded as the average information carried by each IV, while can be regarded as the information carried by each estimation error. If is not substantially larger than , then the weak instrument will inflate the measurement error bias by the multiplier . This is the primary reason why violating assumption (IV1) introduces bias into causal effect estimates in IVW and other MR approaches.26
The covariance between the estimation errors of SNP-exposure and SNP-outcome associations can be affected by the fraction of overlapping samples of the exposures and outcome GWAS. If the exposures and outcome GWAS are independent of each other, then and hence the measurement error bias always shrinks toward the null. In contrast, if the exposures GWAS and outcome GWAS are estimated from the same cohorts, usually introduces bias toward the direction of , reflecting the degree of sample overlap between exposures and outcome. This is the reason why in some empirical studies,23,27 IVW cannot completely remove the confounding bias if the overlapping sample fraction is large.
If , there is additional pleiotropy bias due to the horizontal pleiotropy that violates the InSIDE assumption. In UVMR, it is challenging to guarantee or for all , resulting in a potentially biased IVW estimate. Traditional solutions to horizontal pleiotropy bias require that only a small proportion of IVs exhibit horizontally pleiotropic effects.5,7,12 However, for complex traits, it is plausible that a large portion of IVs (even possibly ) possess horizontally pleiotropic effects, leading to the failure of UVMR methods. MVMR can balance these pleiotropic effects shared by multiple exposures, significantly reducing the number of IVs with horizontal pleiotropy evidence when conditioned on specified exposures. In other words, it is more likely to guarantee that only few IVs violate the InSIDE assumption after accounting for multiple exposures, which can be then detected and removed using the robust tools such as a pleiotropy hypothesis test.
Reliability ratio
In practice, we suggest using the reliability ratio40:
(Equation 11) |
to measure the degree of bias in , which can be empirically estimated by
(Equation 12) |
reflects the proportion of variability in the estimated effects attributable to the underlying true genetic effects. For example, a reliability ratio of 0.6 indicates that 60% of the variance of the estimated effects is attributable to the true effects and the rest is attributable to their estimation errors. From the perspective of measurement error theory, the IVW estimate converges to in a univariable MR analysis when there is no sample overlap, where is equal to 40 Here is less than 1 and is viewed as a shrinkage coefficient for relative to the true effect . We adopt this reliability ratio to much broader contexts, such as multivariable MR and sample overlap. In our real data analysis, we found this reliability ratio works reasonably well although additional investigation is warranted. While the reliability ratio and the F-statistics3 are similar, the former has a simpler calculation and can more clearly reflect the proportion of weak IV bias than the latter.
MR using bias-corrected estimating equation
We propose MRBEE, which estimates causal effects by solving a new unbiased estimating equation of causal effects. The unbiased estimating equation of is
(Equation 13) |
where . Equation 13 states that the MRBEE estimating function is equal to the IVW estimating equation minus its bias. Unbiasedness of the MRBEE estimating equation implies unbiasedness of the MRBEE estimator. The solution such that is
(Equation 14) |
Note that unlike other optimizations such as generalized linear model in measurement error,40 the Hessian matrix does not involve and hence can be directly obtained from without any iterative approximation.
Bias-correction terms estimation
We estimate the bias-correction terms and from the insignificant and independent GWAS summary statistics.37 Let () be insignificant GWAS effect size estimates of outcome and exposures, where the insignificance means that the value of the genetic variants are larger than 0.05 for all exposures and outcome, and the independence means that they are not in LD. Because and follow the same distributions of and , can be estimated by
(Equation 15) |
Here, is the first sub-matrix of and consists of the first elements of the last column of . The intercept provided by LDSC35 is also a consistent estimate of . Each of these two estimators may be used by MRBEE and experience with real data suggests that they generally produce similar results. LDSC requires specification of an LD reference panel that is from an ancestrally similar population to that under study in MR. Differences in genetic architecture between the LD reference panel and the MR study population could introduce bias. Use of Equation 15 does not require an LD reference panel and so will not be biased for this reason. Additionally, use of Equation 15 is computationally simpler.
SE estimation
The covariance matrix of is yielded through the sandwich formula:
(Equation 16) |
where the outer matrix is the Fisher information matrix, i.e., the expectation of the Hessian matrix of , and the inner matrix is the covariance matrix of . A consistent estimate of is
(Equation 17) |
where and
When the number of IVs is small, the standard sandwich formula has been observed to underestimate the SE.51 We apply the MD correction52 to solve this problem. Consider the so-called hat matrix:
and is its th diagonal entries. The MD correction adjusts the inner matrix as
(Equation 18) |
Their theory shows that
and hence it can obtain a more reliable covariance matrix by adjusting when estimating with the moment method. When there is horizontal pleiotropy, we adjust Equation 18 as
(Equation 19) |
where is number of valid IVs and is the number of detected pleiotropies. Section S1.3 of supplement 1 compares the estimated and true standard errors of causal effect estimates for MRBEE and other MVMR estimators. These results demonstrate that the MD correction described above controls the Type I error rate well. It is also worth noting that the standard errors of the MRBEE causal estimates will generally become smaller as the degree of weak instrument bias becomes smaller.
Horizontal pleiotropy detection
We illustrate how to remove specific IVs with evidence of UHP or CHP effects with the pleiotropy test which tests the same null hypothesis for each SNP as MR-PRESSO and IMRP. The null hypothesis for the th IV not having any horizontally pleiotropic effects on the outcome is
(Equation 20) |
The statistic for the th IV is defined as
(Equation 21) |
which follows a distribution under . The only assumption here is that is asymptotically normal distributed. In fact, this test examines whether the outcome effect can be explained by the mediation effects through all exposures. In practice, we estimate using the delta method:
Other methods such as empirical variance and robust variance estimates of the residual can also be used here. We calculate for all candidate IVs and remove IVs with large values in an iterative manner. Algorithm 1 uses an FDR Q-value threshold to exclude IVs showing potential pleiotropy evidence. We suggest a threshold Q-value <0.05 in general. Additional simulation results presented in Section S2.5 of supplement 1 show that FDR correction generally performs well.
GWPT
Since tests a very general null hypothesis, we can also calculate for all SNPs across the genome after obtaining the causal effect estimates of exposures on the outcome. Results from these tests can be used to (1) find novel loci associated with the MR outcome and (2) draw inferences about pathways of genetic association with the MR outcome. Specifically, when an SNP has a negative effect on the exposure and a positive pleiotropic effect on the outcome , and simultaneously the causal effect is positive, then the total effect of this variant on the outcome is canceled and hence cannot be detected in the outcome GWAS. In contrast, the pleiotropy test directly tests the effect and therefore can detect novel loci. For example, Zhu et al.16 successfully detected many blood pressure loci missed previously by using this GWPT with IMRP as the estimator of the causal effect. The results indicated that most detected pleiotropic variants influenced SBP and DBP in opposite directions, providing support for the principle of the GWPT.
Joint -test for IVs selection
We applied the joint -test to select a set of IVs that are strongly associated with multiple exposures. Let be the p-length vector of standardized associations between the jth SNP and the p exposures. We performed the following hypothesis test:
(Equation 22) |
The test statistic is
(Equation 23) |
which follows a distribution when the null hypothesis holds, where is the estimated matrix of covariances between estimation errors. We only considered variants as IVs if they are genome-wide significant in the joint -test.
Estimation of variance explained by instrument variables
Assume that we intend to estimate the SNP heritability of a trait using a set of m IVs in the m-length vector with corresponding associations with in the vector . If the variance of is 1 and , we can estimate the variance in explained by the m IVs using the following equation:
(Equation 24) |
where is the minor allele frequency of . We used Equation 24 to produce the heritability estimates in Table 1.
Asymptotic results
We assume that both total number of IVs and the minimum sample size among the exposure and outcome GWAS can approach infinity, while the number of exposures and the -dimensional causal effect vector are fixed and bounded. Our goal is to identify the scenarios when MV-IVW outperforms MRBEE, when they perform equally well, and when MRBEE outperforms MV-IVW in terms of unbiased estimation of causal effects and the asymptotic validity of causal inference. We demonstrate the related theorems and the related regularity conditions and lemmas in supplement 2.
Data and code availability
The data referenced in this study can be accessed through the GWAS Catalog (https://www.ebi.ac.uk/gwas/home), with the corresponding GWAS summary data available for download in the “data availability” section of the respective papers. Some of the GWAS summary data are exclusive of the Million Veteran Program (MVP) summary results, which are available through dbGAP under the accession number phs001672.v3.p1.
The MRBEE R package generated during this study is available at https://github.com/noahlorinczcomi/MRBEE. Simulation codes generated during this study are available at https://github.com/harryyiheyang/MRBEE.Simulation.
Acknowledgments
This work was supported by grant HG011052 and HG011052-03S1 (to X.Z.) from the National Human Genome Research Institute (NHGRI), USA. N.L.-C. was partially supported by grant T32 HL007567 from the National Heart, Lung, and Blood Institute (NHLBI).
Declaration of interests
The authors declare no competing interests.
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.xhgg.2024.100290.
Supplemental information
References
- 1.Sanderson E., Glymour M.M., Holmes M.V., Kang H., Morrison J., Munafò M.R., Palmer T., Schooling C.M., Wallace C., Zhao Q., Davey Smith G. Mendelian Randomization. Nat. Rev. Methods Primers. 2022;2:6–21. doi: 10.1038/s43586-021-00092-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Burgess S., Butterworth A., Thompson S.G. Mendelian Randomization Analysis with Multiple Genetic Variants Using Summarized Data. Genet. Epidemiol. 2013;37:658–665. doi: 10.1002/gepi.21758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Burgess S., Thompson S.G., CRP CHD Genetics Collaboration Avoiding Bias from Weak Instruments in Mendelian Randomization Studies. Int. J. Epidemiol. 2011;40:755–764. doi: 10.1093/ije/dyr036. [DOI] [PubMed] [Google Scholar]
- 4.Bowden J., Davey Smith G., Burgess S. Mendelian Randomization with Invalid Instruments: Effect Estimation and Bias Detection Through Egger Regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Morrison J., Knoblauch N., Marcus J.H., Stephens M., He X. Mendelian Randomization Accounting for Correlated and Uncorrelated Pleiotropic Effects Using Genome-Wide Summary Statistics. Nat. Genet. 2020;52:740–747. doi: 10.1038/s41588-020-0631-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Verbanck M., Chen C.-Y., Neale B., Do R. Detection of Widespread Horizontal Pleiotropy in Causal Relationships Inferred from Mendelian Randomization Between Complex Traits and Diseases. Nat. Genet. 2018;50:693–698. doi: 10.1038/s41588-018-0099-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zhu X., Li X., Xu R., Wang T. An Iterative Approach to Detect Pleiotropy and Perform Mendelian Randomization Analysis Using GWAS Summary Statistics. Bioinformatics. 2021;37:1390–1400. doi: 10.1093/bioinformatics/btaa985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kang H., Zhang A., Cai T.T., Small D.S. Instrumental Variables Estimation with Some Invalid Instruments and Its Application to Mendelian Randomization. J. Am. Stat. Assoc. 2016;111:132–144. [Google Scholar]
- 9.Xue H., Shen X., Pan W. Constrained Maximum Likelihood-Based Mendelian Randomization Robust to Both Correlated and Uncorrelated Pleiotropic Effects. Am. J. Hum. Genet. 2021;108:1251–1269. doi: 10.1016/j.ajhg.2021.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bowden J., Davey Smith G., Haycock P.C., Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rees J.M.B., Wood A.M., Dudbridge F., Burgess S. Robust Methods in Mendelian Randomization via Penalization of Heterogeneous Causal Estimates. PLoS One. 2019;14 [Google Scholar]
- 12.Qi G., Chatterjee N. Mendelian Randomization Analysis Using Mixture Models for Robust and Efficient Estimation of Causal Effects. Nat. Commun. 2019;10:1941. doi: 10.1038/s41467-019-09432-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Burgess S., Foley C.N., Allara E., Staley J.R., Howson J.M.M. A Robust and Efficient Method for Mendelian Randomization with Hundreds of Genetic Variants. Nat. Commun. 2020;11:376. doi: 10.1038/s41467-019-14156-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Yuan Z., Liu L., Guo P., Yan R., Xue F., Zhou X. Likelihood-Based Mendelian Randomization Analysis with Automated Instrument Selection and Horizontal Pleiotropic Modeling. Sci. Adv. 2022;8 doi: 10.1126/sciadv.abl5744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cheng Q., Zhang X., Chen L.S., Liu J. Mendelian Randomization Accounting for Complex Correlated Horizontal Pleiotropy While Elucidating Shared Genetic Etiology. Nat. Commun. 2022;13:6490. doi: 10.1038/s41467-022-34164-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zhu X., Zhu L., Wang H., Cooper R.S., Chakravarti A. Genome-Wide Pleiotropy Analysis Identifies Novel Blood Pressure Variants and Improves Its Polygenic Risk Scores. Genet. Epidemiol. 2022;46:105–121. doi: 10.1002/gepi.22440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sanderson E., Davey Smith G., Windmeijer F., Bowden J. An Examination of Multivariable Mendelian Randomization in the Single-Sample and Two-Sample Summary Data Settings. Int. J. Epidemiol. 2019;48:713–727. doi: 10.1093/ije/dyy262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Burgess S., Thompson S.G. Multivariable Mendelian Randomization: The Use of Pleiotropic Genetic Variants to Estimate Causal Effects. Am. J. Epidemiol. 2015;181:251–260. doi: 10.1093/aje/kwu283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rees J.M.B., Wood A.M., Burgess S. Extending the MR-Egger Method for Multivariable Mendelian Randomization to Correct for Both Measured and Unmeasured Pleiotropy. Stat. Med. 2017;36:4705–4718. doi: 10.1002/sim.7492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lin Z., Xue H., Pan W. Robust Multivariable Mendelian Randomization Based on Constrained Maximum Likelihood. Am. J. Hum. Genet. 2023;110:592–605. doi: 10.1016/j.ajhg.2023.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang K., Shi X., Zhu Z., Hao X., Chen L., Cheng S., Foo R.S.Y., Wang C. Mendelian Randomization Analysis of 37 Clinical Factors and Coronary Artery Disease in East Asian and European Populations. Genome Med. 2022;14:63. doi: 10.1186/s13073-022-01067-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sanderson E., Spiller W., Bowden J. Testing and Correcting for Weak and Pleiotropic Instruments in Two-Sample Multivariable Mendelian Randomization. Stat. Med. 2021;40:5434–5452. doi: 10.1002/sim.9133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sadreev I.I., Elsworth B.L., Mitchell R.E., Paternoster L., Sanderson E., Davies N.M., Millard L.A.C., Davey Smith G., Haycock P.C., Bowden J., et al. Navigating sample overlap, winner’s curse and weak instrument bias in mendelian randomization studies using the UK Biobank. medRxiv. 2021 doi: 10.1101/2021.06.28.21259622. Preprint at. [DOI] [Google Scholar]
- 24.Carroll R.J., Ruppert D., Stefanski L.A., Crainiceanu C.M. Hall/CRC; 2006. Measurement Error in Nonlinear Models: A Modern Perspective. Chapman. [Google Scholar]
- 25.VanderWeele T.J., Tchetgen Tchetgen E.J., Cornelis M., Kraft P. Methodological Challenges in Mendelian Randomization. Epidemiology. 2014;25:427–435. doi: 10.1097/EDE.0000000000000081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ye T., Shao J., Kang H. Debiased Inverse-Variance Weighted Estimator in Two-Sample Summary-Data Mendelian Randomization. Ann. Stat. 2021;49:2079–2100. [Google Scholar]
- 27.Burgess S., Davies N.M., Thompson S.G. Bias Due to Participant Overlap in Two-Sample Mendelian Randomization. Genet. Epidemiol. 2016;40:597–608. doi: 10.1002/gepi.21998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mounier N., Kutalik Z. Bias Correction for Inverse Variance Weighting Mendelian Randomization. Genet. Epidemiol. 2023;47:314–333. doi: 10.1002/gepi.22522. [DOI] [PubMed] [Google Scholar]
- 29.Yavorska O.O., Burgess S. MendelianRandomization: An r Package for Performing Mendelian Randomization Analyses Using Summarized Data. Int. J. Epidemiol. 2017;46:1734–1739. doi: 10.1093/ije/dyx034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Morgan I.G., French A.N., Ashby R.S., Guo X., Ding X., He M., Rose K.A. The Epidemics of Myopia: Aetiology and Prevention. Prog. Retin. Eye Res. 2018;62:134–149. doi: 10.1016/j.preteyeres.2017.09.004. [DOI] [PubMed] [Google Scholar]
- 31.Marconi A., Di Forti M., Lewis C.M., Murray R.M., Vassos E. Meta-Analysis of the Association Between the Level of Cannabis Use and Risk of Psychosis. Schizophr. Bull. 2016;42:1262–1269. doi: 10.1093/schbul/sbw003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Corcoran C.M., Kimhy D., Stanford A., Khan S., Walsh J., Thompson J., Schobel S., Harkavy-Friedman J., Goetz R., Colibazzi T., et al. Temporal Association of Cannabis Use with Symptoms in Individuals at Clinical High Risk for Psychosis. Schizophr. Res. 2008;106:286–293. doi: 10.1016/j.schres.2008.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Holmes M.V., Asselbergs F.W., Palmer T.M., Drenos F., Lanktree M.B., Nelson C.P., Dale C.E., Padmanabhan S., Finan C., Swerdlow D.I., et al. Mendelian Randomization of Blood Lipids for Coronary Heart Disease. Eur. Heart J. 2015;36:539–550. doi: 10.1093/eurheartj/eht571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.1000 Genomes Project Consortium. Auton A., Brooks L.D., Durbin R.M., Garrison E.P., Kang H.M., Korbel J.O., Marchini J.L., McCarthy S., McVean G.A., Abecasis G.R. A Global Reference for Human Genetic Variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bulik-Sullivan B.K., Neale B.M., Loh P.-R., Finucane H.K., Ripke S., Yang J., Schizophrenia Working Group of the Psychiatric Genomics Consortium. Patterson N., Daly M.J., Price A.L. LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies. Nat. Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Smith C.J., Sinnott-Armstrong N., Cichońska A., Julkunen H., Fauman E.B., Würtz P., Pritchard J.K. Integrative Analysis of Metabolite GWAS Illuminates the Molecular Basis of Pleiotropy and Genetic Correlation. Elife. 2022;11 doi: 10.7554/eLife.79348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhu X., Feng T., Tayo B.O., Liang J., Young J.H., Franceschini N., Smith J.A., Yanek L.R., Sun Y.V., Edwards T.L., et al. Meta-Analysis of Correlated Traits via Summary Statistics from GWASs with an Application in Hypertension. Am. J. Hum. Genet. 2015;96:21–36. doi: 10.1016/j.ajhg.2014.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Turley P., Walters R.K., Maghzian O., Okbay A., Lee J.J., Fontana M.A., Nguyen-Viet T.A., Wedow R., Zacher M., Furlotte N.A., et al. Multi-Trait Analysis of Genome-Wide Association Summary Statistics Using MTAG. Nat. Genet. 2018;50:229–237. doi: 10.1038/s41588-017-0009-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Watanabe K., Taskesen E., Van Bochoven A., Posthuma D. Functional Mapping and Annotation of Genetic Associations with FUMA. Nat. Commun. 2017;8:1826. doi: 10.1038/s41467-017-01261-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yi G.Y. Springer; 2017. Statistical Analysis with Measurement Error or Misclassification: Strategy, Method and Application. [Google Scholar]
- 41.Breiman L. Heuristics of instability and stabilization in model selection. Ann. Statist. 1996;24:2350–2383. [Google Scholar]
- 42.He M., Xiang F., Zeng Y., Mai J., Chen Q., Zhang J., Smith W., Rose K., Morgan I.G. Effect of Time Spent Outdoors at School on the Development of Myopia Among Children in China: A Randomized Clinical Trial. JAMA. 2015;314:1142–1148. doi: 10.1001/jama.2015.10803. [DOI] [PubMed] [Google Scholar]
- 43.Lin Z., Vasudevan B., Jhanji V., Mao G.Y., Gao T.Y., Wang F.H., Rong S.S., Ciuffreda K.J., Liang Y.B. Near Work, Outdoor Activity, and Their Association with Refractive Error. Optom. Vis. Sci. 2014;91(4):376–382. doi: 10.1097/OPX.0000000000000219. [DOI] [PubMed] [Google Scholar]
- 44.The HPS3/TIMI55–REVEAL Collaborative Group Effects of Anacetrapib in Patients with Atherosclerotic Vascular Disease. N. Engl. J. Med. 2017;377:1217–1227. doi: 10.1056/NEJMoa1706444. [DOI] [PubMed] [Google Scholar]
- 45.Chobanian A.V., Bakris G.L., Black H.R., Cushman W.C., Green L.A., Izzo J.L., Jr, W Jones D., Materson B.J., Oparil S., Wright J.T., Jr, et al. The seventh report of the joint national committee on prevention, detection, evaluation, and treatment of high blood pressure. Hypertension. 2004;42:1206–1252. doi: 10.1161/01.HYP.0000107251.49515.c2. [DOI] [PubMed] [Google Scholar]
- 46.Locke A.E., Kahali B., Berndt S.I., Justice A.E., Pers T.H., Day F.R., Powell C., Vedantam S., Buchkovich M.L., Yang J., et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206. doi: 10.1038/nature14177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Takeuchi F., Akiyama M., Matoba N., Katsuya T., Nakatochi M., Tabara Y., Narita A., Saw W.Y., Moon S., Spracklen C.N., et al. Interethnic analyses of blood pressure loci in populations of East Asian and European descent. Nat. Commun. 2018;9:5052. doi: 10.1038/s41467-018-07345-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Yang J., Benyamin B., McEvoy B.P., Gordon S., Henders A.K., Nyholt D.R., Madden P.A., Heath A.C., Martin N.G., Montgomery G.W., et al. Common SNPs Explain a Large Proportion of the Heritability for Human Height. Nat. Genet. 2010;42:565–569. doi: 10.1038/ng.608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zuber V., Colijn J.M., Klaver C., Burgess S. Selecting likely causal risk factors from high-throughput experiments using multivariable Mendelian randomization. Nat. Commun. 2020;11:29. doi: 10.1038/s41467-019-13870-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Lawlor D.A., Harbord R.M., Sterne J.A.C., Timpson N., Davey Smith G. Mendelian Randomization: Using Genes as Instruments for Making Causal Inferences in Epidemiology. Stat. Med. 2008;27:1133–1163. doi: 10.1002/sim.3034. [DOI] [PubMed] [Google Scholar]
- 51.Wang M., Kong L., Li Z., Zhang L. Covariance Estimators for Generalized Estimating Equations (GEE) in Longitudinal Analysis with Small Samples. Stat. Med. 2016;35:5318–5319. doi: 10.1002/sim.7131. [DOI] [PubMed] [Google Scholar]
- 52.Mancl L.A., DeRouen T.A. A Covariance Estimator for GEE with Improved Small-Sample Properties. Biometrics. 2001;57:126–134. doi: 10.1111/j.0006-341x.2001.00126.x. [DOI] [PubMed] [Google Scholar]
- 53.Wu Y., Kang H., Ye T. Debiased Multivariable Mendelian Randomization. arXiv. 2024 doi: 10.48550/arXiv.2402.00307. Preprint at. [DOI] [Google Scholar]
- 54.van De Vegte Y.J., Said M.A., Rienstra M., van Der Harst P., Verweij N. Genome-wide association studies and Mendelian randomization analyses for leisure sedentary behaviours. Nature Commun. 2020;11:1770. doi: 10.1038/s41467-020-15553-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Arns A., Wahl T., Wolff C., Vafeidis A.T., Haigh I.D., Woodworth P., Niehüser S., Jensen J. Non-linear interaction modulates global extreme sea levels, coastal flood exposure, and impacts. Nature Commun. 2020;11:1918. doi: 10.1038/s41467-020-15752-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rustad E.H., Yellapantula V., Leongamornlert D., Bolli N., Ledergor G., Nadeu F., Angelopoulos N., Dawson K.J., Mitchell T.J., Osborne R.J., et al. Timing the initiation of multiple myeloma. Nature Commun. 2020;11:1917. doi: 10.1038/s41467-020-15740-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Okbay A., Wu Y., Wang N., Jayashankar H., Bennett M., Nehzati S.M., Sidorenko J., Kweon H., Goldman G., Gjorgjieva T., et al. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nature Genet. 2022;54:437–449. doi: 10.1038/s41588-022-01016-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hysi P.G., Choquet H., Khawaja A.P., Wojciechowski R., Tedja M.S., Yin J., Simcoe M.J., Patasova K., Mahroo O.A., Thai K.K., et al. Meta-analysis of 542,934 subjects of European ancestry identifies new genes and mechanisms predisposing to refractive error and myopia. Nature genetics. 2020;52:401–407. doi: 10.1038/s41588-020-0599-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Demontis D., Walters R.K., Martin J., Mattheisen M., Als T.D., Agerbo E., Baldursson G., Belliveau R., Bybjerg-Grauholm J., Bækvad-Hansen M., et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nature Genet. 2019;51:63–75. doi: 10.1038/s41588-018-0269-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Johnson E.C., Demontis D., Thorgeirsson T.E., Walters R.K., Polimanti R., Hatoum A.S., Sanchez-Roige S., Paul S.E., Wendt F.R., Clarke T.K., et al. A large-scale genome-wide association study meta-analysis of cannabis use disorder. The Lancet Psychiatry. 2020;7:1032–1045. doi: 10.1016/S2215-0366(20)30339-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Cuellar-Partida G., Tung J.Y., Eriksson N., Albrecht E., Aliev F., Andreassen O.A., Barroso I., Beckmann J.S., Boks M.P., Boomsma D.I., et al. Genome-wide association study identifies 48 common genetic variants associated with handedness. Nature Human Behav. 2021;5:59–70. doi: 10.1038/s41562-020-00956-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Nagel M., Jansen P.R., Stringer S., Watanabe K., De Leeuw C.A., Bryois J., Savage J.E., Hammerschlag A.R., Skene N.G., Muñoz-Manchado A.B. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nature Genet. 2018;50:920–927. doi: 10.1038/s41588-018-0151-7. [DOI] [PubMed] [Google Scholar]
- 63.Dashti H.S., Jones S.E., Wood A.R., Lane J.M., Van Hees V.T., Wang H., Rhodes J.A., Song Y., Patel K., Anderson S.G., et al. Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nature Commun. 2019;10:1100. doi: 10.1038/s41467-019-08917-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Trubetskoy V., Pardiñas A.F., Qi T., Panagiotaropoulou G., Awasthi S., Bigdeli T.B., Bryois J., Chen C.Y., Dennison C.A., Hall L.S., et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature. 2022;604:502–508. doi: 10.1038/s41586-022-04434-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Loh P.R., Kichaev G., Gazal S., Schoech A.P., Price A.L. Mixed-model association for biobank-scale datasets. Nature Genet. 2018;50:906–908. doi: 10.1038/s41588-018-0144-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Evangelou E., Warren H.R., Mosen-Ansorena D., Mifsud B., Pazoki R., Gao H., Ntritsos G., Dimou N., Cabrera C.P., Karaman I., et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nature Genet. 2018;50:1412–1425. doi: 10.1038/s41588-018-0205-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Graham S.E., Clarke S.L., Wu K.H.H., Kanoni S., Zajac G.J., Ramdas S., Surakka I., Ntalla I., Vedantam S., Winkler T.W., et al. The power of genetic diversity in genome-wide association studies of lipids. Nature. 2021;600:675–679. doi: 10.1038/s41586-021-04064-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Aragam K.G., Jiang T., Goel A., Kanoni S., Wolford B.N., Atri D.S., Weeks E.M., Wang M., Hindy G., Zhou W., et al. Discovery and systematic characterization of risk variants and genes for coronary artery disease in over a million participants. Nature Genet. 2022;54:1803–1815. doi: 10.1038/s41588-022-01233-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data referenced in this study can be accessed through the GWAS Catalog (https://www.ebi.ac.uk/gwas/home), with the corresponding GWAS summary data available for download in the “data availability” section of the respective papers. Some of the GWAS summary data are exclusive of the Million Veteran Program (MVP) summary results, which are available through dbGAP under the accession number phs001672.v3.p1.
The MRBEE R package generated during this study is available at https://github.com/noahlorinczcomi/MRBEE. Simulation codes generated during this study are available at https://github.com/harryyiheyang/MRBEE.Simulation.