Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2021 Apr 19;17(4):e1009455. doi: 10.1371/journal.pgen.1009455

MRLocus: Identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity

Anqi Zhu 1,#, Nana Matoba 2,3,#, Emma P Wilson 2, Amanda L Tapia 1, Yun Li 1,2,4, Joseph G Ibrahim 1, Jason L Stein 2,3, Michael I Love 1,2,*
Editor: Caroline Relton5
PMCID: PMC8084342  PMID: 33872308

Abstract

Expression quantitative trait loci (eQTL) studies are used to understand the regulatory function of non-coding genome-wide association study (GWAS) risk loci, but colocalization alone does not demonstrate a causal relationship of gene expression affecting a trait. Evidence for mediation, that perturbation of gene expression in a given tissue or developmental context will induce a change in the downstream GWAS trait, can be provided by two-sample Mendelian Randomization (MR). Here, we introduce a new statistical method, MRLocus, for Bayesian estimation of the gene-to-trait effect from eQTL and GWAS summary data for loci with evidence of allelic heterogeneity, that is, containing multiple causal variants. MRLocus makes use of a colocalization step applied to each nearly-LD-independent eQTL, followed by an MR analysis step across eQTLs. Additionally, our method involves estimation of the extent of allelic heterogeneity through a dispersion parameter, indicating variable mediation effects from each individual eQTL on the downstream trait. Our method is evaluated against other state-of-the-art methods for estimation of the gene-to-trait mediation effect, using an existing simulation framework. In simulation, MRLocus often has the highest accuracy among competing methods, and in each case provides more accurate estimation of uncertainty as assessed through interval coverage. MRLocus is then applied to five candidate causal genes for mediation of particular GWAS traits, where gene-to-trait effects are concordant with those previously reported. We find that MRLocus’s estimation of the causal effect across eQTLs within a locus provides useful information for determining how perturbation of gene expression or individual regulatory elements will affect downstream traits. The MRLocus method is implemented as an R package available at https://mikelove.github.io/mrlocus.

Author summary

Genome-wide association studies (GWAS) have identified many loci associated with complex traits and diseases. Expression quantitative trait loci (eQTL) may help to explain mechanisms of GWAS associations, if the gene has a role as a mediator of the trait or disease. Loci that exhibit allelic heterogeneity, that is, loci containing multiple causal variants, offer the opportunity to investigate whether effects are concordant and proportional across eQTL and GWAS; if the gene is a partial mediator of the trait, the sign and size of the effects across distinct eQTL variants should be reflected in GWAS associations. Such a Mendelian Randomization (MR) analysis of individual loci is complicated by moderate sample sizes in eQTL studies and linkage disequilibrium (LD), resulting in complex patterns of estimated effect sizes for eQTL and GWAS. We develop a statistical model, MRLocus, with two steps: selection of eQTL SNPs to act as instruments in the MR analysis of a genetic locus, and estimation of the gene-to-trait mediation effect taking instrument uncertainty into account. In simulation, the method has higher accuracy and better uncertainty measures compared to other competing methods, and we compare its estimates on candidate causal gene-trait pairs from literature.

Introduction

Genome-wide association studies (GWAS) have identified many loci associated with complex traits and diseases. A major goal now is to understand the mechanism by which non-coding genetic variation influences trait levels through changes in gene expression. This involves identifying the causal variants at a locus, determining if the same variants are associated with both gene expression and trait, and disambiguating mediation from pleiotropy [1]. Proposing mediating genes from existing expression quantitative trait loci (eQTL) and GWAS resources will lead to experiments that test whether modulating gene expression influences traits, and therefore inform further research and development of treatments.

Current efforts to identify the genes underlying GWAS risk often make use of either colocalization of GWAS signal with eQTLs, or expression imputation. In colocalization, statistical models are used to probabilistically assess if the same genetic variants within a locus are likely to be causally contributing to both eQTL and GWAS signals, taking into account the structure of linkage disequilibrium (LD) for a given population [28]. Expression imputation methods, as in transcriptome-wide association studies (TWAS), add additional information by including subthreshold signal for both GWAS and eQTL to identify which genes’ expression may have a non-zero local genetic correlation with a given GWAS trait [911]. Further refinements of TWAS statistical models have allowed for probabilistic fine-mapping within loci harboring multiple candidate genes by accounting for LD structure, as in FOCUS [11].

Though colocalization and expression imputation suggest genes involved in a trait, neither method is designed to disambiguate between pleiotropy and mediation. The latest generation of methods for determining those genes involved in mediating GWAS signal have combined the approaches of colocalization and expression imputation with statistical techniques from the field of Mendelian randomization (MR) [1214], as evaluated and reviewed recently [15,16]. Intuitively, these methods work by determining if those genetic variants which influence gene expression also influence a downstream trait in proportionate degrees. Evidence for mediation is provided by randomized genetic variation used to perturb gene expression and observing the expression effects propagated to traits. With access to genotype, expression, and trait data, classical mediation techniques can be employed, as in the methods CIT [17] and SMUT [18], while MR-link [19] makes use of individual-level data from a GWAS study and summary statistics for eQTL to perform MR analysis.

Other methods testing gene-to-trait mediation require only summary statistics from eQTL and GWAS studies, as it is rarely possible to have access to the per-participant genotypes, expression values, and GWAS trait values [20]. Methods such as CaMMEL [21], TWMR [22], LDA-MR-Egger [23], PMR-Summary-Egger [24], PTWAS [25], MESC [1], MR-Robin [26] allow for estimation of gene-to-trait effects from eQTL and GWAS summary statistics, integrating across multiple SNPs within a locus. The ability to accurately estimate the gene-to-trait mediation effect within a locus relies on having multiple independent “instruments”, SNPs which are found to be associated with the potential mediator (gene expression), and which plausibly only affect the downstream trait through the mediator. Across studies, between 29–50% of genes provide evidence for allelic heterogeneity, that is, having more than one independent cis-eQTL per gene, with estimates varying by tissue and study sample size [2729]. Some genes provide evidence for up to 13 independent cis-eQTL signals, detected by conditional analysis in peripheral blood [28]. A recent study integrating neonatal gene expression with GWAS of autoimmune and allergic disease performed MR analysis across 52 genes that had three or more cis-eQTLs [30]. Therefore, while not all genes display allelic heterogeneity at current eQTL study sample sizes, it is common enough to allow for mediation modeling of many candidate genes.

Existing methods for assessing whether expression of a particular gene in some context may mediate GWAS signal have primarily focused on their ability to perform genome-wide mediation scans across multiple tissues or cell types. This is a critical task in determining the genetic architecture and the most relevant molecular contexts for a trait (e.g. tissues, cell types, or developmental stage) which are often not known a priori. However, when considering functional follow-up experiments at an individual locus, investigators may weigh accurate quantification of the uncertainty regarding a potential gene-to-trait effect, as well as the heterogeneity of gene-to-trait effects across signal clusters within a locus.

Here, we propose MRLocus, a Bayesian model for estimating the gene-to-trait effect from multiple independent signal clusters for one gene, as well as for estimating the heterogeneity of the effects across clusters. We have designed our method for prioritization of genes in functional experiments, where the genes under study have already been identified as candidates for mediation, having emerged from one of the global mediation scanning methods described above, or from colocalization or TWAS. MRLocus performs estimation of the gene-to-trait effect itself, as our focus is on experimental follow-up, as opposed to estimation of the percent of mediated heritability. In comparisons with other recently developed methods for identifying mediating genes from eQTL and GWAS summary data and LD matrices, TWMR and PTWAS, MRLocus was often more accurate in estimation of the gene-to-trait effect across simulated eQTL and GWAS experiments, and had higher and closer to nominal coverage of the true effect when considering its credible intervals. Using existing eQTL and GWAS data, we also estimate mediation effects at previously reported and experimentally validated loci. The MRLocus method is implemented as an open source R package with full function documentation and a software vignette demonstrating its use, publicly available at https://mikelove.github.io/mrlocus.

Materials and methods

MRLocus method

MRLocus consists of two steps, (1) colocalization and (2) MR slope fitting, each of which use Bayesian hierarchical models specified in the Stan probabilistic programming language, and with posterior inference using the Stan and RStan software packages [31]. As in PTWAS [25], MRLocus estimates the gene-to-trait effect or slope by identifying "nearly-LD-independent" signal clusters and then assessing the strength of evidence of mediation and the heterogeneity of the allelic effects. We refer to "nearly-LD-independent" clusters meaning non-overlapping sets of SNPs with low LD: MRLocus first uses PLINK’s clumping algorithm [32] with r2 < 0.1 to identify signal clusters, sets of SNPs correlated with gene expression based on eQTL p-value (as discussed below), whereas PTWAS uses DAP [33] for identifying signal clusters. MRLocus additionally performs trimming of signal clusters to ensure r2 < 0.05 for cluster representative SNPs, first using across-cluster r2 based on index SNP, then later using the candidate causal SNP from the colocalization step. Here, "trimming clusters" refers to removing clusters, prioritizing those clusters with lower index SNP eQTL p-value. Alternatively, conditional analysis could be used as discussed in a recent coloc methods paper [34]. As with other gene mediation methods mentioned above, we focus here on common SNPs (using a minor allele frequency (MAF) filter on real and simulated data of 0.01). To the extent that a SNP or genetic variant gives rise to both the eQTL and GWAS signal and is in the set analyzed by MRLocus, then the model has a chance to find the "causal” SNP, although in general MRLocus may identify a SNP which is in high LD with the causal SNP.

In contrast to other methods for estimating gene-to-trait effects, MRLocus additionally performs a colocalization step prior to slope fitting, using eQTL and GWAS summary statistics (estimated coefficients and standard errors (SE)), based on LD matrices (either distinct matrices when eQTL/GWAS are performed in different populations, or a single shared matrix can be used when eQTL/GWAS are performed in the same population). The colocalization step attempts to identify a single candidate causal SNP per nearly-LD-independent signal cluster. Here “candidate causal SNP” refers to the hypothesis that the SNP gives rise to both the observed eQTL and GWAS signal, given the LD matrices. The colocalization step produces posterior estimates that assess the degree to which the summary statistics and LD matrices support the causal hypothesis per signal cluster (see S1 Methods for details on the statistical model). If the SNP is a strong candidate for causing both the eQTL and GWAS signal in a signal cluster, then the posterior estimates for the eQTL and GWAS effect sizes will be large (in absolute value) for the SNP, and near 0 for the other SNPs. If the SNP is a strong candidate for only the eQTL signal, but not for the GWAS signal, then the posterior estimate for the chosen SNP for the GWAS signal will be near 0. Finally, we note that prior to the colocalization step, MRLocus performs collapsing of highly correlated SNPs (threshold of 0.95 correlation), such that the final "per-SNP" results actually correspond to results for representatives from sets of highly correlated SNPs. MRLocus also performs allele flipping such that all SNPs are coded to be in positive LD correlation to the index SNP that has a positive estimated coefficient for eQTL (S1 Methods). This ensures that the statistical modeling and visualizations are always referring to the effect of an expression-increasing allele.

MRLocus’s colocalization step is motivated by the eCAVIAR model [5] as it formulates a generative model for the summary statistics based on true underlying signals, but is distinct from the eCAVIAR model in two respects. First, eCAVIAR models the z-scores from eQTL and GWAS, while MRLocus directly models the estimated coefficients, as our focus is on estimation of the gene-to-trait effect, which can be conceived as in other MR applications as a regression of coefficients from GWAS on eQTL. eCAVIAR does not output posterior effect sizes for the two studies, which is the originally intended input to MRLocus slope estimation. Second, eCAVIAR uses a multivariate normal distribution to model the vector of observed z-scores in a locus, while MRLocus uses a univariate distribution to model the estimated coefficients of the SNPs in each nearly-LD-independent signal cluster. The univariate distribution was chosen for its increased performance in accuracy and in efficiency in model fitting, as well as for flexibility in specification of prior distributions. As the methods have distinct but related functionality, we also assessed the use of eCAVIAR in lieu of MRLocus’s colocalization step, for choice of candidate causal SNPs to provide to the subsequent slope fitting step (referred to here as "eCAVIAR-MRLocus"). As with MRLocus colocalization, clusters are trimmed such that across-cluster r2 < 0.05 based on eCAVIAR chosen SNPs, but collapsing of highly correlated SNPs is not performed, as it is not required for eCAVIAR input. The SNP with largest colocalization posterior probability (CLPP) is chosen among those SNPs with p-value below the clumping threshold.

In its implementation of colocalization, MRLocus uses a horseshoe prior [35] on the true coefficients for eQTL and GWAS signal per signal cluster, which helps to induce sparsity in the posterior estimates of the coefficients prior to mediation analysis (S1 Methods). The proposed use of the horseshoe prior during colocalization to identify putative causal SNPs from eQTL and GWAS coefficients within a signal cluster is distinct from other Bayesian MR methods’ use of the horseshoe prior on pleiotropic effects or on the mediation slope [3638].

MRLocus’s slope fitting step involves estimation of the gene-to-trait effect across signal clusters (Fig 1). For slope estimation, the best candidate SNP per nearly-LD-independent signal cluster is chosen, based on which SNP has the largest posterior mean of the eQTL effect size from the colocalization step. A hierarchical model is used to perform inference on parameters of interest, in this case the slope (ɑ) of true GWAS coefficients on true eQTL coefficients. In contrast to a typical inverse-variance weighted MR slope estimation, MRLocus additionally includes uncertainty on the effect size of the instruments, which we find leads to better coverage properties of credible intervals in simulated datasets. For study of putative mediator A (e.g. eQTL) and downstream trait B (e.g. GWAS), the following hierarchical model is fit using estimated coefficients β^j and standard errors sej across signal clusters j ∈ 1,…,J:

β^jAN(βjA,sejA)
β^jBN(βjB,sejB)
βjAN(0,SDβ)
βjBN(αβjA,σ)

Fig 1.

Fig 1

MRLocus estimates the gene-to-trait effect (solid blue line) as the slope from paired eQTL and GWAS effect sizes from independent signal clusters (black points with standard error bars), here on simulated coefficients. The dispersion of allelic effects around the main gene-to-trait effect (light blue band) is also estimated. An 80% credible interval on the slope is indicated with dashed blue lines sloping above and below the solid blue line. Panels represent loci demonstrating (A) mediation with low dispersion, (B) mediation with high dispersion, and (C) colocalization of eQTL and GWAS signals but no evidence of mediation (slope credible interval overlaps 0). Investigators may wish to prioritize loci for experimental follow-up in which a typical “dosage” pattern is observed, such that alleles contributing small amounts to expression of a gene contribute small amounts to GWAS trait, and similarly for large effect alleles.

Details regarding setting of priors for the hyperparameters SDβ (the prior for the instrument effect sizes), α (the gene-to-trait slope), and σ (the dispersion of signal cluster effect sizes around the fitted line) are provided in the S1 Methods. If the signal cluster does not provide evidence of colocalization, the estimate of the GWAS coefficient from the previous step will be near 0, and this will bring the estimated slope toward 0 as well. If the eQTL and GWAS estimated coefficients are based on standardized expression and traits, then α2 can be interpreted as the mediated trait variance explained in the samples. Quantile-based credible intervals on the slope coefficient provide information regarding the uncertainty of the gene mediating the trait measured in the GWAS. Finally, whereas PTWAS makes use of an I2 statistic [39] for quantifying the heterogeneity of the allelic effects at the locus, MRLocus estimates the dispersion (σ) of the different allelic effects around the predicted values given by the slope. Therefore, MRLocus may have high certainty on the slope (narrow credible interval for ɑ not overlapping 0), while nevertheless estimating that the dispersion of allelic effects around the slope is large (σ). The tradeoff between uncertainty on the estimate of ɑ and the dispersion σ of allelic effects around fitted line naturally depends upon the number of independent signal clusters at the locus. For loci with no allelic heterogeneity, we do not recommend running MRLocus (the software will produce a warning), and do not evaluate MRLocus on such loci here.

Choice of methods for comparison

We chose to focus on TWMR [22] and PTWAS [25] in our comparisons, as these two methods had a focus on estimation of the gene-to-trait effect, were able to run on eQTL data for a single gene and a single tissue, and required only summary statistics and LD matrices. PTWAS adds to the analysis of gene-to-trait effects an upstream fine-mapping of cis-eQTL using DAP [33], and estimation of gene-to-trait effect heterogeneity in the case of multiple independent eQTL signal clusters, employing an I2 statistic that ranges between 0 and 1. The I2 statistic represents, in the gene-to-trait meditation case, the percent of variance in estimated effect sizes across signal clusters that arises from true effect heterogeneity.

We additionally compared MRLocus to LDA-MR-Egger [23] and PMR-Summary-Egger [24] on the first simulation setting. Other methods that likewise determine if one or more genes may mediate traits include SMR [40,41], CaMMEL [21], MESC [1], MR-Robin [26]. SMR uses the top cis-eQTL per gene to compute gene-to-trait effects, and was extended to SMR-multi [41] to perform null hypothesis testing across multiple SNPs per gene, but it does not offer an integrated estimate of the causal effect. We were not able to run CaMMEL using only LD matrices from the eQTL and/or GWAS cohort, as the fit.med.zqtl function takes genotype design matrices as input. We were not able to run MESC with less than 5 genes, while our focus with MRLocus is on single gene mediating effect estimation. MR-Robin also provides robust estimates of gene-to-trait effects but with a focus on multiple-tissue eQTL summary statistics as input.

Simulation

For simulation, we used the pre-existing TWAS simulation framework, twas_sim [42], which simulates eQTL and GWAS datasets using real genotype data (1KG EUR Phase3) and outputs summary statistics. This software has a number of options for simulation parameters including cis heritability of gene expression (referred to here as "h2g"), expression mediated trait heritability (referred to here as "h2med"), and the number of SNPs in a locus which are cis eQTL. A single gene’s expression was simulated per experiment, and both gene and trait were scaled to unit variance for slope estimation. The key twas_sim simulation equations follow. For a study with N individuals, concerning a locus with n SNPs, let the distribution of eQTL effect sizes for causal SNPs j ∈ C, |C| = ncausal < n follow a Normal distribution with mean and variance given by:

βjeQTLN(0,h2g/ncausal).

The generation of a vector of gene expression ygene for N individuals is given by:

sgene2=V^(ZβeQTL)
serr2=sgene2((1/h2g)1)
εindivN(0,serr2)
ygene=ZβeQTL+ε,

where V^ is a function for the unbiased sample variance, Z is an N x n matrix of zero-centered, unit variance genotypes, βeQTL is a column vector of sampled true eQTL effect sizes which is equal to 0 everywhere except for the causal SNPs βjeQTL, and ε is a column vector of errors for individuals 1 to N. A column vector of errors, ε, are drawn from a zero-centered normal with variance s2err. Finally, ygene is standardized to unit variance before eQTL coefficients are estimated. The simulated GWAS trait ytrait for a separate set of individuals is generated as above but using h2med in place of h2g, and where the same set of causal (non-zero effect size) SNPs are used for βeQTL and βGWAS:

strait2=V^(ZβGWAS)
serr,trait2=strait2((1/h2med)1)
εindiv,traitN(0,serr,trait2)
ytrait=ZβGWAS+εtrait.

This provides gene expression and trait with population-level heritability given by h2g and h2med (the simulated individuals are drawn from an infinite population and so the heritability in the finite sample may differ). Finally, the true gene-to-trait slope α is calculated as the ratio of βjGWAS / βjeQTL for causal SNPs j, after scaling coefficients by the sample standard deviation for ygene and ytrait. Therefore, in the simulations we have the approximate relationship (see also S1 Method):

h2medh2gα2

Simulations of paired eQTL and GWAS datasets were performed where the gene was a partial mediator of the trait, as well as null simulations where the gene expression was unrelated to the downstream trait and the locus contains multiple trait-only association signals (the same causal SNP percent as for eQTL). An additional simulation was performed in which the gene is a partial mediator of the trait, and three large effect trait-only association signals are added to the causal SNP set for simulating the GWAS study. This simulation was performed to assess the methods’ accuracy in the presence of horizontal pleiotropy, and code for this simulation can be found in the ‘hp’ branch of the forked repository: https://github.com/mikelove/twas_sim/tree/hp. Null simulations were obtained by running twas_sim with two different seeds on the same locus to produce a set of GWAS trait signals not mediated by gene expression: a distinct set of causal SNPs with unrelated effect sizes.

TWMR, PTWAS, MRLocus, eCAVIAR-MRLocus, LDA-MR-Egger, and PMR-Summary-Egger were all run on the same summary data from eQTL and GWAS simulations. The Snakemake software [43] was used for automation of simulation scripts including specification of random seed for each of the simulations, in order to assist with computational reproducibility of simulations. Simulation and analysis code is provided at https://github.com/mikelove/mrlocusPaper. The sample sizes for eQTL and GWAS were kept at their default values of NeQTL = 500 and NGWAS = 100,000, respectively, and an additional simulation was performed with NeQTL = 1,000. The percent of SNPs in a locus which are eQTLs was set to 1%. The parameter h2g was varied from its default value (0.1) to a higher value (0.2) and a lower value (0.05), and the expression mediated trait heritability (h2med) was varied from its default value (0.01) to two lower values (0.005, 0.001, as well as to 0 indicating a null simulation where gene expression does not explain variation in a GWAS trait). Each combination of 3 (for h2g) x 4 (for h2med) resulted in 12 simulations (with labels A-I, Null-0.1, Null-0.2, and Null-0.05) plus two additional simulations for partial mediation with horizontal pleiotropy (HP) and higher NeQTL (High-N), totaling 14 simulation types (Fig A in S1 Text). 20 iterations of each simulation were drawn, and if this did not produce at least 18 iterations with allelic heterogeneity (two or more index eSNPs with p-value < 0.001 and pairwise r2 < 0.05), then 20 more iterations were drawn. 40 iterations of the main (A) simulation, the horizontal pleiotropy (HP) simulation, and the higher NeQTL (High-N) simulation were drawn. If a method did not produce output for an iteration due to insufficient input data, e.g. no signal clusters with PIP > 0.5 for PTWAS, or < 2 nearly-LD-independent signal clusters for MRLocus, then a slope estimate of 0 is plotted in the simulation accuracy figures. The error rate and coverage for a method was calculated only over those iterations that the method produced an estimate. All methods except for PMR-Summary-Egger ran without error on all iterations.

LD-based clumping implemented in PLINK (v1.90b) [32] was performed on eQTL simulation un-adjusted p-values with the following settings:—clump-p1 0.001—clump-p2 1—clump-r2 0.1—clump-kb 500. All PLINK clumps were provided to TWMR (commit 62994ec) [22] and MRLocus (v0.0.22) for gene-to-trait effect estimation. PTWAS (v1.0) [25] was provided with output from DAP (DAP-G, commit ac38301) [33] with settings: -d_n NeQTL -d_syy NeQTL (as twas_sim scales expression to unit variance). PTWAS code was modified (at line 97 for commit b5714f3) to allow for input of estimated coefficients and their SE, such that it provided slope estimates on the original scale of coefficients, not z-scores. LDA-MR-Egger and PMR-Summary-Egger (v1.0) were supplied with the eQTL and GWAS summary statistics of the locus and the LD matrix, without clumping. For 1–2 simulation iterations, LDA-MR-Egger or PMR-Summary-Egger would output a very large estimate of ɑ (> 2 in absolute value), and these large estimates were removed to give more representative error rates for these two methods. PMR-Summary-Egger does not provide an SE of the causal effect in its output so was excluded from coverage evaluation. eCAVIAR (v2.2) was run per nearly-LD-independent signal cluster with -c 1 (maximum of one causal SNP), without collapsing of highly correlated SNPs, as an alternative to MRLocus colocalization step. For eCAVIAR-MRLocus, the SNP with highest CLPP among those with p-value less than the PLINK clumping threshold was provided from each signal cluster to the slope fitting step. For all simulations, if there were no SNPs in the simulated locus with eQTL un-adjusted p-value < 0.001 then a new seed was drawn. Furthermore, all methods were only evaluated on loci with evidence of allelic heterogeneity, as demonstrated by more than one nearly-LD-independent (r2 < 0.05) signal cluster with index eQTL p-value < 0.001. For simulation comparisons, MR with inverse variance weighted (IVW) regression with fixed effects [44] was computed using the true causal eQTLs ("causal") or across all SNPs ("all") using the mr_ivw_fe function in the TwoSampleMR R package (v0.5.5) [45] with eQTL as the exposure study and GWAS as the outcome study. The "causal" IVW MR analysis served as an "oracle" estimator in the simulations, as it was provided with information not available in a typical analysis and not provided to other methods. The number of true eQTL SNPs, PLINK clumps, DAP signal clusters, and clusters passing 1st and 2nd round of r2-based trimming are shown in Fig B in S1 Text. The distribution of the number of SNPs per clump before and after collapsing highly correlated SNPs (part of MRLocus pre-processing, described in S1 Methods) is shown in Fig C in S1 Text.

In order to assess whether trimming signal clusters to obtain pairwise r2 < 0.05 would be sufficient to protect against false positives for MRLocus slope fitting resulting from correlated instruments, a simulation was performed across a range of r2. Estimated eQTL and GWAS coefficients for between J = 4, 6, or 8 signal clusters were generated from a multivariate normal distribution, such that the population correlation between adjacent signal clusters took on a specific r2 value, and all other pairs had correlation of 0. The mean vector for the eQTL (A) study was centered on [10,…,10], while the mean vector for the GWAS (B) study was centered on the origin, such that the true gene-to-trait slope is α = 0. These coefficients and their standard errors were provided to the slope estimation step of MRLocus, and 80% credible intervals were calculated for 400 iterations per combination of J and r2. The code for this simulation is provided in the corr_instr_sim directory of the mrlocusPaper GitHub repository.

By producing signal clusters with PLINK clump using—clump-p1 0.001, this ensures that the index eSNP for each signal cluster has an absolute Z-score of > ~3.3, or equivalently an F-statistic > ~10.8. In order to assess whether even stronger criteria on the strength of the signal clusters provided to MRLocus would improve accuracy of the gene-to-trait effect, we re-ran PLINK clump on the main (A) simulation and the high NeQTL simulation using—clump-p1 0.0001 (F-statistic > ~15.1), and re-ran TWMR and MRLocus. Such estimates for TWMR and MRLocus are referred to using "_p1e-4" in the Results. As PTWAS did not always produce estimates on simulated loci due to insufficient signal clusters with posterior inclusion probability (PIP) > 0.5, we additionally tested PTWAS on the A and high NeQTL simulations using signal clusters with PIP > 0.1, referred to as "_t0.1" in the Results.

Two additional assessments of the simulation datasets were performed, to better understand underlying factors that may explain differences in method performance. For MRLocus and eCAVIAR colocalization steps, the accuracy in recovering the causal eSNPs was assessed across the different simulation settings. Specifically, for signal clusters containing a true causal eSNP, the SNP chosen as the candidate instrument from each signal cluster for slope fitting was evaluated as to whether it was equal to or in high correlation (r > 0.95) with the causal eSNP. The accuracy was then computed by averaging over all signal clusters from all iterations of the simulation. Additionally, the coverage of true values by confidence or credible intervals was assessed using information about the true simulated value. As insufficient coverage could occur due to estimation bias or badly calibrated standard errors, for three simulations with h2g of 10% (A, D, G), we calculated the bias for TWMR, PTWAS, MRLocus, and eCAVIAR-MRLocus, using the sample mean of the estimates and the population-level true alpha value of (h2med / h2g)-1/2. We then re-computed the interval coverage after subtracting the bias. While these "oracle" bias-adjusted coverage values could not be obtained in real settings, they help to answer whether correction of estimation bias alone would resolve problems with insufficient coverage.

Real data analysis

To evaluate the performance of methods on real data, we applied TWMR, PTWAS, and MRLocus to five candidate causal genes for mediation of particular GWAS traits, chosen based on literature review of previous connections between gene expression and downstream phenotype: SORT1 (liver)—low-density lipoproteins (LDL), MRAS (tibial artery)—coronary artery disease (CAD), PHACTR1 (tibial artery)—CAD, CETP (liver)—high-density lipoproteins (HDL), and LIPC (liver, as well as in blood)—HDL. eQTL and GWAS summary data were obtained as summarized in Table A in S1 Text. Briefly, eQTL data on tibial artery, blood, and liver was obtained from the Genotype-Tissue Expression (GTEx) project v8 [46], eQTLGen Consortium 2019-12-23 release [47], and directly from the authors of a liver meta-analysis study [48], respectively. GTEx v8 estimated eQTL coefficients were on the scale of unit variance expression values (following per-gene inverse normal transformation), eQTLGen estimated coefficients and standard errors were derived from z-scores and MAF as indicated by the README for SMR input, while the liver meta-analysis estimated coefficients were on the scale of log2 normalized expression, according to references. GWAS summary statistics on CAD were obtained from CARDioGRAMplusC4D (Coronary ARtery DIsease Genome wide Replication and Meta-analysis (CARDIoGRAM) plus The Coronary Artery Disease (C4D) Genetics) consortium [49], where coefficients represent log odds ratios (OR), and the UK Biobank (https://www.ukbiobank.ac.uk) obtained from http://www.nealelab.is/uk-biobank/, where coefficients are estimated with respect to unit variance, continuous scale HDL or LDL.

Prior to defining independent signal clusters, we filtered out SNPs with MAF < 0.01 from GWAS data, and p-values were corrected with lambda GC [50]. We used raw p-values for eQTL data and genomic control corrected p-values for GWAS data. As our task for downstream inference is estimation of the slope of GWAS coefficients over eQTL coefficients across signal clusters, it is not necessary that the eQTL signal clusters attain genome-wide significance before being provided to MRLocus. We first identified nearly-LD-independent signal clusters for eQTL by LD-based clumping implemented in PLINK (1.90b) [32] with the following settings:—clump-p1 0.001—clump-p2 1—clump-kb 500—clump-r2 0.1 (as in the simulation setting). LD was estimated in the European population from the 1000 Genome Project phase 3 (1KG EUR). Next we directly obtained test statistics of matched SNPs in each eQTL cluster from GWAS summary data.

Generating MRLocus input files

For each independent pair, we generated MRLocus input files (effect size tables) with estimated coefficients, SE, and reference and effect alleles from both eQTL and GWAS. Pairwise LD (Pearson’s r) between SNPs were generated by the PLINK—r function for all SNPs included in the corresponding effect size table using 1KG EUR. We also included major/minor allele information from the reference panel (PLINK bim files) in the effect size table for proper allele flipping for statistical modeling and visualization (S1 Methods).

Real data analysis with other methods

TWMR and PTWAS were used to compare estimates on real data loci. Both software packages require definition of eQTL signal clusters and we used DAP (DAP-G, commit ac38301) to estimate independent eQTL clusters for PTWAS (v1.0) as described in the original paper, while for TWMR (commit 62994ec) we ran PLINK clumping with the same parameters we used for MRLocus (v0.0.22) (c.f. the original TWMR paper performed approximate conditional analyses). DAP was applied to estimated coefficients and SE from eQTL summary statistics with default options except we set maximum models (-msize) to 20. For the CETP locus, there were not any signal clusters for PTWAS with sufficient posterior inclusion of probability (all signal cluster PIP < 0.5), so the threshold was lowered to 0.1 in order to visualize an estimate and interval.

Results

Simulation

In order to evaluate the accuracy in estimating the mediated gene effect, we used a GWAS and eQTL simulation. The twas_sim simulation framework used for evaluating methods has default values for cis eQTL gene expression heritability of 10% (h2g) and expression mediated trait heritability of 1% (h2med). As the eQTL-based gene expression heritability could feasibly be higher or lower than 10%, we investigated values of 20% and 5% as well, which are within the range of detection for eQTL studies with hundreds of samples [5155]. The default mediated trait variance explained value of 1% for a single gene is likely high, given that a recent publication using GTEx data has estimated the proportion of heritability mediated by expression across all genes to be around 11% (±2%) averaging over 42 traits, with the top mediated traits having around 30% of heritability mediated by expression [1]. We therefore considered even lower expression mediated heritability of the GWAS trait, h2med of 0.5% and 0.1%, as well as a null simulation where gene expression did not mediate the downstream trait in any way (0%). We note that the selection of simulated loci with two or more nearly-LD-independent eQTL with index eSNP p-value < 0.001 may result in Winner’s Curse for low-powered studies, where true values tend to be smaller than the estimated coefficients for the eSNPs passing a significance threshold. Over-estimation of instrument effect sizes would then result in under-estimation of the mediation effect. There was some over-estimation for the lowest gene heritability (h2g = 0.05), but a balance of both under- and overestimation of effect sizes for h2g of 0.1 or 0.2 (Fig D in S1 Text).

For the default values of 10% gene h2 and 1% h2med (simulation A), eCAVIAR-MRLocus had the highest accuracy in terms of relative mean absolute error (RMAE), dividing the error by the absolute value of the true slope, followed by MRLocus and PTWAS (Fig 2A). TWMR and PTWAS demonstrated the most negative bias, calculated using the sample mean of the estimator and the population-level α. The bias for TWMR, PTWAS, MRLocus, and eCAVIAR-MRLocus was -0.112, -0.079, -0.003, and -0.053, respectively. In addition, only eCAVIAR-MRLocus and MRLocus were able to achieve nominal coverage for 80% credible intervals over the true values, while other methods had lower observed coverage, having too narrow confidence intervals (Fig 2B). We additionally tested two other methods, LDA-MR-Egger and PMR-Summary-Egger at the default twas_sim settings (Fig E in S1 Text). These additional two methods had higher error compared to other methods at the default twas_sim settings (with LDA-MR-Egger having lower error of the two), and so we focused on the latter three methods for further evaluation. Here, and in all simulation settings considered, the evaluated methods had higher RMAE of gene-to-trait slope compared to an oracle method that uses only the true causal SNPs (which are generally not known) and their estimated coefficients (“causal”) in IVW regression with fixed effects [44].

Fig 2. Performance of methods on simulated eQTL and GWAS datasets.

Fig 2

(A) Estimates of each method over true (simulated) gene-to-trait values. The method denoted with “causal” indicates an inverse variance weighted slope estimation using the true causal SNPs but the estimated coefficients and SEs (an oracle estimate), and “all” indicates an inverse variance weighted slope estimation using all SNPs. (B) Observed coverage (abbreviated “cov.”) of 80% confidence or credible intervals from each method. If the interval contains the true effect size, it is colored black, otherwise red.

In the evaluation of pairwise r2 across signal clusters, for loci with 4, 6, or 8 signal clusters, increase in the r2 of adjacent signals was associated with an increase in the false positive rate for MRLocus (Fig F in S1 Text). False positive rate was measured by the rate of 80% credible intervals not covering the true value of α = 0, for 400 simulation iterations per combination of simulation parameters. At r2 < 0.05, the average rate of intervals not covering 0 was approximately 20%, so MRLocus was then achieving the nominal level. We therefore recommend trimming signal clusters to achieve r2 < 0.05 across pairs of clusters before running MRLocus slope estimation, because allowing higher pairwise r2 across instruments could lead to loss of nominal interval coverage. A convenience function in the MRLocus package, trimClusters, can be used to prioritize signal clusters with higher strength of eQTL signal. Violin plots of the pairwise r2 across clusters, for all instruments passed to MRLocus slope fitting is provided in Fig G in S1 Text.

We modified simulation A in two ways, to assess how horizontal pleiotropy or higher sample size would worsen or improve estimation of partial mediation, respectively. The effect of horizontal pleiotropy, i.e. signals associated separately with gene expression and trait, on estimation of gene-to-trait mediation was assessed by adding non-expression associated GWAS signals to a loci exhibiting partial mediation. An example region plot for one of the HP simulation iterations is provided in Fig H in S1 Text. TWMR, PTWAS, MRLocus, and eCAVIAR-MRLocus had higher error and lower interval coverage for the horizontal pleiotropy simulation compared to their performance in simulation A, as expected (Fig I in S1 Text, with additional methods provided in Fig J in S1 Text). MRLocus and eCAVIAR-MRLocus had nearly the same RMAE, which was the lowest among the methods tested. LDA-MR-Egger was less affected by inclusion of trait-only signals, although still having higher error than all other methods except for PMR-Summary-Egger. MRLocus and eCAVIAR-MRLocus obtained close to 80% coverage (86% and 76%, respectively) despite the addition of coincident trait-only GWAS signals. Finally, for the same default h2g and h2med settings as simulation A, we increased the sample size to NeQTL = 1,000 to see how higher power in detection of instruments may translate to better estimation of mediation. In this higher power setting, all methods had lower error in estimating the gene-to-trait effect (Fig K in S1 Text). Here, MRLocus had the lowest RMAE, slightly below that of eCAVIAR-MRLocus, and both obtained nominal interval coverage. PTWAS had improved confidence interval coverage (from 28% to 50%) while other methods had similar coverage as with NeQTL = 500.

For simulations using other parameter settings of gene expression heritability h2g and expression mediated trait heritability h2med, MRLocus had the second lowest RMAE in 6 of the remaining 8 non-null simulation settings with non-default values of h2g and h2med (Figs L-S in S1 Text). In simulation G, with 10% h2g and 0.1% h2med, PTWAS had lower RMAE than MRLocus, and in simulation H, with 20% h2g and 0.1% h2med, eCAVIAR-MRLocus had lower RMAE than MRLocus and PTWAS. Overall, considering the non-null simulations performed, MRLocus or eCAVIAR-MRLocus had the lowest RMAE in estimating the gene-to-trait effect compared to other methods for 10 out of 11 of the settings (Fig 2, and Figs I, K-P, R-S in S1 Text). When expression heritability and mediated heritability were moderate to high (h2g ≥ 10%, h2med ≥ 0.5%), these two methods would come close to the RMAE of the oracle estimate ("causal").

In terms of credible or confidence interval coverage, MRLocus and eCAVIAR-MRLocus always had better or equal coverage of the true values compared to all other methods within the non-null simulation settings, though it did not always reach the nominal level (Figs T-AA in S1 Text). When expression heritability was low (h2g = 5%), or the mediated trait heritability low (h2med = 0.1%) MRLocus and eCAVIAR-MRLocus had coverage roughly in the range 50–70%; exceptions to this trend were that MRLocus achieved >80% coverage for simulation F, and both methods had >80% coverage for simulation H. MRLocus always had higher coverage than eCAVIAR-MRLocus when they differed by more than 5%. PTWAS tended to have higher coverage than TWMR, but they each had a maximal coverage of 39% and 26%, respectively; TWMR and PTWAS had the best coverage on the h2g = 20%, h2med = 0.1% simulation. The oracle method tended to have narrow confidence intervals, and MRLocus had higher coverage than the oracle method for all simulations. The maximal coverage of the oracle was simulation F with 40%. We believe the lower-than-nominal coverage seen here for the oracle method is likely from insufficient propagation of error during slope estimation. As the twas_sim framework does not include heterogeneity of effects from different signal clusters, an additional simulation was performed to assess MRLocus’s estimation of the dispersion of effects around the gene-to-trait fitted line (Fig AB in S1 Text). Here, MRLocus was accurate both in estimation of the dispersion and quantification of uncertainty (attaining nominal credible interval coverage), with higher accuracy and smaller intervals as the number of nearly-LD-independent clusters increased, as expected.

In the 3 null simulation settings with h2g of 5%, 10%, 20% but no mediated heritability, and trait-only GWAS signals, MRLocus always had the highest interval coverage of the true slope value of 0 (average of 79%, close to the nominal 80%), followed by eCAVIAR-MRLocus (average of 63%) (Fig AC in S1 Text). Here, TWMR had average coverage of 35% and PTWAS had average coverage of 38%. The oracle method with the true eQTL SNPs had average coverage of 14%. The oracle method again had too narrow intervals as in the non-null simulations. TWMR and PTWAS had lower mean absolute error (MAE) for estimating ɑ = 0 than MRLocus and eCAVIAR-MRLocus, which was expected due to the bias toward 0 seen in the non-null simulations.

We performed sensitivity analysis for the two thresholds involved in the formation of signal clusters: the eQTL p-value used with PLINK clump (default of p-value < 0.001), and the PIP threshold for inclusion of a signal cluster to PTWAS causal effect estimation (default of PIP > 0.5). While p-value < 0.001 corresponds to an F-statistic > ~10.8, use of stronger criteria for the eQTL instruments (p < 0.0001, F-statistic > 15.1) did not result in higher accuracy or better interval coverage for MRLocus in simulation A or with higher NeQTL = 1,000 (Fig AD in S1 Text). However, in both simulation settings, accuracy and coverage were improved slightly for TWMR using the stricter p-value for clumping. Running PTWAS with PIP threshold lowered from the default value of 0.5 to 0.1 did not improve accuracy or interval coverage in these simulation settings.

We assessed whether MRLocus colocalization step or eCAVIAR was more accurate at identifying the causal eSNP in non-null and null simulations. eCAVIAR outperformed MRLocus on the non-null simulations in its ability to detect the true causal SNP, for all 11 non-null simulations (Fig AE in S1 Text). We note that eCAVIAR does not provide posterior estimates of eQTL and GWAS effect sizes, the originally designed input to the MRLocus slope fitting step. When providing eCAVIAR-chosen SNPs and the original estimated coefficients from eQTL and GWAS, we find that eCAVIAR-MRLocus is more accurate for only 2 out of 11 non-null simulations (simulation A and H). Furthermore, MRLocus generally had higher coverage of the gene-to-trait effect compared to eCAVIAR-MRLocus in the more difficult simulation settings (simulations C, F, and I, while eCAVIAR-MRLocus had higher coverage for simulation G). For the null simulations, in which eQTL eSNPs and GWAS trait-associated SNPs are distinct and expression does not mediate trait variance, MRLocus colocalization was always more accurate at identifying the true causal eSNP, and the MRLocus-chosen eSNPs were better for providing intervals that cover the true gene-to-trait effect (ɑ = 0).

Insufficient confidence or credible interval coverage could result from estimator bias, and TWMR and PTWAS exhibited downward bias in the simulations. We further examined if calculating and removing the estimator bias (using the true, population-level slope ɑ) improved interval coverage for TWMR, PTWAS, MRLocus and eCAVIAR-MRLocus, for the simulations with h2g of 10% (simulation A, D, and G). In some cases, bias removal helped substantially, for example TWMR’s coverage rising from 14% to 67% in simulation A and from 8% to 59% in simulation D, or PTWAS’s coverage rising from 35% to 76% in simulation G (Fig AF in S1 Text). Still, even with this "oracle" bias removal, TWMR and PTWAS still had cases of insufficient coverage, for example PTWAS with 32% coverage in simulation D, and TWMR with 44% coverage in simulation G. This indicated that the insufficient coverage observed for these methods was not only due to bias, but also from too small standard errors on the causal effect estimate.

We assessed method timing across the 80 main (A) and high NeQTL simulations. MRLocus was approximately 17 times slower than TWMR and 32 times slower than PTWAS, with a mean running time per locus of 75 seconds compared to 4.3 seconds for TWMR and 2.3 seconds for PTWAS (Fig AG in S1 Text). The additional elapsed time is because MRLocus involves a colocalization step with model fitting per signal cluster using MCMC. MRLocus colocalization step was provided with four cores for running four MCMC chains, while other methods and MRLocus slope fitting used a single core. Performing colocalization with eCAVIAR reduced the running time by about two-thirds for eCAVIAR-MRLocus compared to MRLocus (3.4 seconds for eCAVIAR colocalization and 22 seconds for MRLocus slope fitting, so in total 6 times slower than TWMR and 11 times slower than PTWAS). One reason why MRLocus colocalization is slower than eCAVIAR is that MRLocus’s model for colocalization adapts its priors for eQTL and GWAS effect sizes to the data in each signal cluster in terms of the scale and sparsity of causal variants. Runtime for PLINK clumping and DAP were not included in the times presented above.

Real data analysis

We compared TWMR, PTWAS, MRLocus, and eCAVIAR-MRLocus using eQTL and GWAS summary statistics for five gene-trait pairs in which there is strong evidence that the gene mediates the trait, and the direction of the effect has also been reported, such that we could compare our estimates against the literature. For one gene-trait pair (LIPC and HDL), we examined the effect in liver, the tissue reported in literature as the relevant gene expression context, as well as using eQTL in blood, where the expression association signal may or may not provide strong evidence of mediation of the GWAS trait. LocusZoom-style plots [56,57] for the eQTL and GWAS tracks are provided in Figs AH-AL in S1 Text. Additionally, the LD patterns for all trimmed signal clusters are provided in Figs AM-AR in S1 Text.

On the six eQTL-GWAS dataset pairs, all four methods had consistent sign of the mediating effect (Fig 3). MRLocus and eCAVIAR-MRLocus generally had larger credible intervals compared to confidence intervals from the other two methods, as was seen in the simulation datasets where MRLocus and eCAVIAR-MRLocus had improved coverage of the true effect size. In all cases except LIPC in blood, the sign of the mediating effect was in concordance with literature: higher SORT1 (liver) expression decreasing LDL levels [19,58,59], higher MRAS (artery) being hazardous for CAD [6062], higher PHACTR1 (artery) being protective for CAD [63,64], higher CETP (liver) decreasing HDL levels [65], and higher LIPC (liver) decreasing HDL levels [66]. For LIPC assayed in blood, TWMR and PTWAS have positive effect sizes with narrow confidence intervals (opposite of the mediating effect for LIPC in liver), while MRLocus and eCAVIAR-MRLocus have large credible intervals, reflecting the conflicting evidence across blood eQTL signal clusters (Figs AS and AT in S1 Text). We note that MRLocus also has large credible intervals for LIPC in liver, due to a limited number of nearly-LD-independent signal clusters (n = 3) and posterior GWAS effect sizes close to 0 for two signal clusters (Fig AS in S1 Text). For CETP (liver) paired with HDL, PTWAS did not have any DAP signal clusters with sufficient PIP (threshold of 0.5) to estimate an effect, so the threshold was lowered to 0.1, which produced an estimate close to 0, while MRLocus and eCAVIAR-MRLocus likewise displayed high uncertainty of mediation given the limited number of signal clusters (n = 2 for both, Figs AS and AT in S1 Text).

Fig 3. Estimated gene-to-trait effects for TWMR, PTWAS, and MRLocus on five eQTL and GWAS datasets for strong candidate genes for mediation of the GWAS trait and one auxiliary tissue example (LIPC in blood).

Fig 3

80% confidence or credible intervals are shown (MRLocus provides quantile-based credible intervals). The artery eQTL were estimated from inverse normal transformed expression data from GTEx, transformed z-scores for eQTLGen, while the liver meta-analysis eQTL were estimated from log2 expression data, thus the estimated slopes represent changes in SD of artery or blood gene expression on log odds for CAD risk and on SDs of lipid levels, and doubling of liver gene expression on SDs of lipid levels.

MRLocus had the strongest evidence for mediation with SORT1 (liver) and its effect on LDL, with LocusZoom-style plot of the region in Fig 4A and MRLocus gene-to-trait estimate plot in Fig 4B. The number of signal clusters used for MRLocus for SORT1 (liver), MRAS (artery), PHACTR1 (artery), CETP (liver), LIPC (liver), and LIPC (blood) were: {6, 9, 5, 2, 3, 31}, and for eCAVIAR-MRLocus were: {5, 5, 6, 2, 3, 28}. The number of nearly-LD-independent signal clusters for blood eQTL for LIPC (n = 31 and 28 for MRLocus and eCAVIAR-MRLocus, respectively) is likely an overestimate of the number of true causal SNPs at this locus, and new methods for clumping or summary-based conditional analysis large-sample-size meta-analysis eQTL studies may refine these signals. MRLocus and eCAVIAR-MRLocus also had strong evidence for mediation with PHACTR1 (artery) and its effect on CAD. Two loci where MRLocus and eCAVIAR-MRLocus have slightly differing results were MRAS (artery) on CAD and LIPC (liver) on HDL, where MRLocus-selected instruments tended to have more scatter (Fig AS in S1 Text), and thus more uncertainty reflected in the credible interval of the gene-to-trait effect.

Fig 4. Colocalization and MRLocus estimation for SORT1.

Fig 4

(A) Colocalized signals in the SORT1 region. From top panel to bottom, gene model (NCBI Refseq), eQTL for SORT1 in liver (N = 588) (Strunz et al., 2018) and LDL association within UKBB (N = 343,621). Dashed line indicates a significance threshold at p = 0.001 or p = 5x10-8 for eQTL and GWAS respectively. Colored labels indicate eSNPs used for slope fitting with both methods, eCAVIAR-MRLocus, or MRLocus. (B) MRLocus plot of the gene-to-trait effect for SORT1 expression in liver on LDL levels. The signal clusters all provide consistent evidence for a gene-to-trait effect of -0.042, meaning that doubling of gene expression level in liver should reduce LDL by 4.2% of its population standard deviation. An 80% credible interval for the slope is indicated by dashed blue lines around the solid blue slope, while a range of heterogeneity of allelic effects is indicated by the light blue band.

Estimates of heterogeneity and dispersion for PTWAS and MRLocus are shown in Table 1. PTWAS indicates heterogeneity of instruments using an I2 statistic that ranges from 0 to 1 (low to high heterogeneity). MRLocus estimates the dispersion of instruments around the fitted line on the scale of the mediation effect; the estimate can be compared across loci by calculating the mean mediated effect: (estimate ɑ) x (mean of estimated coefficients for βjA). Visually, the mean mediated effect is the y-axis value from the fitted slope in Fig 4B corresponding to the middle values of estimated eQTL coefficients. The ratio of the estimate of dispersion σ over the absolute value of mean mediated effect indicates relatively how much the effects from nearly-LD-independent signal clusters vary around the conditional mean. The lowest I2 for PTWAS was for MRAS and LIPC (liver), which each had an estimate of 0 for heterogeneity. The lowest dispersion relative to the mean mediated effect for MRLocus was for SORT1, followed by PHACTR1 and MRAS.

Table 1. I2 statistic for PTWAS and dispersion estimate σ from MRLocus for eQTL-GWAS pairs.

Gene (tissue)—trait PTWAS I2 MRLocus σ Mean mediated σ / |MM|
SORT1 (liver)—LDL 0.85 1.47 -0.062 0.17
MRAS (art.)—CAD 0.00 0.16 0.014 0.91
PHACTR1 (art.)—CAD 0.76 0.12 -0.044 0.71
CETP (liver)—HDL 0.92 0.17 -0.12 1.43
LIPC (liver)—HDL 0.00 0.14 -0.028 1.30
LIPC (blood)—HDL 0.29 0.12 0.0051 7.16

Additional columns provide the mean mediated effect for MRLocus, and the ratio of the dispersion to the absolute value of the mean mediated effect.

On the six gene-tissue-trait combinations, eCAVIAR colocalization took an average of 6 seconds per locus, MRLocus slope fitting following eCAVIAR took an average of 18 seconds per locus, and MRLocus (both steps) took an average of ~4 minutes per locus, using 4 cores for MRLocus colocalization step.

Discussion

Here, we introduce MRLocus, a two-step Bayesian statistical procedure for estimation of gene-to-trait effects from eQTL and GWAS summary statistics. We find that MRLocus tends to have high accuracy in estimating the gene-to-trait effect across a variety of simulation settings, often higher than existing methods, and always had better credible interval coverage of true values, whether in partial mediation or null simulations. On real data analyses, MRLocus and eCAVIAR-MRLocus (using eCAVIAR for colocalization followed by MRLocus slope fitting) had consistent sign of estimates and comparable effect size compared to TWMR and PTWAS, but larger credible intervals compared to the other methods’ confidence intervals. While the effect sizes of alleles detected by GWAS on downstream traits examined here may be moderate, MRLocus’s estimation of the causal effect from perturbation of gene expression can be helpful in assessing the impact of therapeutic effects modulating expression on downstream traits [67]. For various systems, different downstream trait effect sizes qualify as practically meaningful increases or decreases, and MRLocus provides a framework for assessing what level of gene expression perturbation may be needed to obtain such changes in a downstream trait.

MRLocus was used to estimate the gene-to-trait effect following colocalization with MRLocus’s own Bayesian model of eQTL and GWAS coefficients per signal cluster, and alternatively using the existing eCAVIAR method. Both approaches were found to perform well in simulations, and better than other methods in terms of accuracy and interval coverage of the true slope. Use of eCAVIAR sped up the pipeline substantially (by at least an order of magnitude after accounting for use of multiple cores), as colocalization in MRLocus is the slowest step. eCAVIAR was better on average at identifying the true causal eSNP for non-null simulations, although MRLocus was better at identifying true causal eSNP in the null simulations. In simulation settings when expression heritability was high (h2g > 5%), passing original estimated coefficients for eCAVIAR-identified SNPs to slope fitting performed just as well as passing MRLocus’s posterior estimates for MRLocus-identified SNPs. However, when expression heritability was lower (h2g = 5%), MRLocus had a more substantial gain in accuracy in estimating the gene-to-trait effect over eCAVIAR-MRLocus. This advantage may be due to the shrinkage of the eQTL/GWAS effect sizes that occurs in the MRLocus colocalization step (eCAVIAR does not provide posterior estimates of effect size). Also, in the h2g = 5% simulations, eCAVIAR-MRLocus had moderate coverage of the true effect (48–58%), while MRLocus had closer to nominal coverage (68–87%). In the real data loci, the two alternatives for colocalization input to MRLocus slope fitting had similar output, except for the case of MRAS (artery) and LIPC (liver) where MRLocus had a larger interval, intersecting with 0. Overall, we recommend MRLocus over eCAVIAR-MRLocus for general purpose estimation of gene-to-trait effects, but both are provided as options in the MRLocus software guide.

While the mediator evaluated by MRLocus in this work was gene expression effects via eQTL, the methods are generic, and protein abundance effects via pQTL could be used instead of eQTL. Two-sample MR linking pQTL and GWAS has already uncovered 30 metabolite features with evidence of causal effects on at least one disease [68], and a recent pQTL study of hepatic proteins reported a median of 4.5 local pQTL variants per protein [69], suggesting that there are loci with sufficient number of nearly-LD-independent clusters for MR analysis. Additionally, large scale pQTL studies of similar sample size to eQTLGen (>30,000 individuals) have uncovered secondary pQTL signals upon conditional analysis for hundreds of loci [70]. Alternatively, pQTL could be used in place of the downstream GWAS trait in order to study mediation from gene expression to protein abundance [71], as previous work has found pQTL effect size to be positively correlated with eQTL effect size for variants ascertained through eQTL in human [72,73], and that colocalized eQTL and pQTL signal leads to higher observed RNA-protein correlations in mice [74].

Given MRLocus’s improved performance with respect to interval coverage in the simulation, we feel that accurate estimation of uncertainty is an advantage to MRLocus, and the focus in developing a new method was on specificity for prioritization of gene targets for functional follow-up experiments. Additionally, the MRLocus model is extensible. The slope-fitting model could easily be generalized to use an alternative monotonic function, as long as there are sufficient nearly-LD-independent signal clusters to support fitting. On the other hand, MRLocus’s slower speed means it is likely not the best choice for a global scan of the transcriptome for mediating genes, while the other methods examined and discussed here have been successfully used to scan across all genes and across multiple tissues. Furthermore, MRLocus is designed for investigating loci with strong causal gene candidates, whereas other methods that estimate gene-to-trait effects for many genes in a locus simultaneously may have less biased estimation of the effect, when a strong gene candidate is not present. Future work on the MRLocus model may involve estimation of the mediating effect of candidate causal genes in the context of other relevant genes in a pathway and a polygenic background [75].

There are a number of important limitations of the current study. First, the 1000 Genome Project phase 3 (1KG EUR) was used for LD calculations and clumping on the real data loci, although this reference panel is now a small sample size compared to other available resources such as gnomAD, TOPMed, and UK Biobank [7678]. In addition, new tools such as TopLD (http://topld.genetics.unc.edu/topld/) allow for easy access to LD based on TOPMed whole genome sequencing data. MRLocus and other methods for estimating mediation effects would benefit from these larger reference panels for LD calculations. Another important limitation is that, as eQTL study sample sizes increase, the threshold of eQTL p-value < 0.001 for clumping and inclusion of eQTL instruments may prove to be too lenient, as could be seen in the large number of nearly-LD-independent loci with small eQTL effect size for LIPC in blood from eQTLGen (Figs AS and AT in S1 Text). More sophisticated statistical methods for performing conditional analyses from summary statistics would benefit downstream analyses such as MRLocus that make use of nearly-LD-independent signal clusters, and stricter instrument inclusion criteria may be needed for large studies and meta-analyses with ancestry heterogeneity across studies.

We note that our method focuses on common variation (MAF > 0.01), and that we collapse highly correlated SNPs to a single representative SNP during the pre-processing, such that we cannot determine if the final selected SNP is the true "causal" SNP. Future development of MRLocus could involve upstream use of methods defining credible sets [7984] or modeling based on a posterior inclusion probability as in LLARRMA or DAP [33,85]. Therefore, the current implementation of MRLocus can perform fine-mapping to the level of a highly correlated set of SNPs, which may be sufficient for identifying one or more regulatory elements (RE) to prioritize for functional follow-up experiments. The current implementation of MRLocus assumes that the mediation slope passes through the origin, and therefore that eQTL signal clusters do not affect the downstream trait through genes other than the eGene. Estimation was nevertheless shown to be robust when loci harbored large trait-only association signals. Further iterations of MRLocus could relax this assumption through the addition of an intercept term accounting for invalid instruments as in MR-Egger [86,87]. Complementary information linking RE to genes, e.g. as measured by Hi-C, was not examined here, but have been proven successful elsewhere (HUGIN [88], H-MAGMA [89]), and we envision that prioritization of signal clusters in MRLocus that are supported by Hi-C would increase its power to detect causal genes.

As part of the MR analysis, MRLocus provides an estimate of the dispersion of effects around the estimated slope from nearly-LD-independent signal clusters, analogous to PTWAS’ use of the I2 statistic for effect size heterogeneity. The combined information from PTWAS and MRLocus regarding both uncertainty in estimation of the gene-to-trait slope, and estimated dispersion or heterogeneity of effects is critical when modeling context-specific (e.g. relevant tissue, cell type, or developmental stage) gene expression as a mediator for downstream traits. Different combinations of eQTL and GWAS SE (primarily influenced by sample size), extent of heterogeneity of effects, and the number of nearly-LD-independent signal clusters within a locus all may give rise to the same gene-to-trait effect and SE, but disentangling these sources of variance is important for experimental planning. For example, consider experimental follow-up for endophenotype downstream traits that could be feasibly measured in vitro. An investigator could choose between modulating gene expression directly or modulating the activity of an RE harboring candidate causal SNPs. A nonzero gene-to-trait effect with narrow credible interval estimated by MRLocus (as in Figs 1A and 4B) would suggest that modulating the gene directly or via the activity of an RE should affect the downstream trait, and the predicted effect could be assessed experimentally. However, high dispersion around the gene-to-trait slope (as in Fig 1B) suggests that perturbation of an RE implicated by candidate causal SNPs may induce an effect on the trait that is far from the effect size indicated by the fitted line, such that modulation of the gene directly may prove more successful. MRLocus provides a band around the predominant gene-to-trait slope, such that functional experiments per RE can therefore be prioritized. In all, here we demonstrate the MRLocus method and software utilizing summary statistics from eQTL and GWAS to identify genes mediating traits that can be prioritized for experimental follow-up.

Supporting information

S1 Methods. Description of statistical methods used in MRLocus.

(PDF)

S1 Text. Supplementary Table and Figures.

(PDF)

Acknowledgments

The authors would like to acknowledge the following individuals for helpful comments and suggestions on the work: Karen Mohlke, Laura Raffield, Alaine Broadaway, Robert Gentleman, Steven Munger, Eric Fauman, and Rob Moccia. The authors would like to thank Nicholas Mancuso for assistance in maintaining the ’twas_sim’ simulation framework. The authors would like to thank Tobias Strunz and Bernhard Weber for their help in providing summary statistics for the three genes from their liver meta-analysis eQTL study.

The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS.

Data Availability

All real and simulated preprocessed data used to run the software in the article can be found at the GitHub repository:https://github.com/mikelove/mrlocusPaper eQTL data on tibial artery and liver was obtained from the Genotype-Tissue Expression (GTEx) project v8 (GTEx Consortium et al., 2017). Bernhard Weber, corresponding author on Strunz et al. 2018 can be contacted at bweb@klinik.uni-regensburg.de.

Funding Statement

AZ and JGI were supported by NIH National Institute of General Medical Sciences R01 GM070335. Additional support to AZ was from NIH National Cancer Institute P01 CA142538. NM, ALT, JLS, and MIL were supported by NIH National Institute of Mental Health R01 MH118349. YL was supported by NIH National Heart, Lung, and Blood Institute R01 HL129132, NIH National Institute of General Medical Sciences R01 GM105785, and NIH National Institute of Child Health and Human Development U54 HD079124. Additional support to JLS was from NIH National Institute of Mental Health R01 MH121433 and R01 MH118349 (URL for the NIH funders:https://www.nih.gov/institutes-nih/list-nih-institutes-centers-offices). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Yao DW, O’Connor LJ, Price AL, Gusev A. Quantifying genetic effects on disease mediated by assayed gene expression levels. Nat Genet. 2020;52: 626–633. 10.1038/s41588-020-0625-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Plagnol V, Smyth DJ, Todd JA, Clayton DG. Statistical independence of the colocalized association signals for type 1 diabetes and RPS26 gene expression on chromosome 12q13. Biostatistics. 2009;10: 327–334. 10.1093/biostatistics/kxn039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10: e1004383. 10.1371/journal.pgen.1004383 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wallace C, Rotival M, Cooper JD, Rice CM, Yang JHM, McNeill M, et al. Statistical colocalization of monocyte gene expression and genetic risk variants for type 1 diabetes. Hum Mol Genet. 2012;21: 2815–2824. 10.1093/hmg/dds098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hormozdiari F, van de Bunt M, Segrè AV, Li X, Joo JWJ, Bilow M, et al. Colocalization of GWAS and eQTL Signals Detects Target Genes. Am J Hum Genet. 2016;99: 1245–1260. 10.1016/j.ajhg.2016.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wen X, Pique-Regi R, Luca F. Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization. PLoS Genet. 2017;13: e1006646. 10.1371/journal.pgen.1006646 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ongen H, Brown AA, Delaneau O, Panousis NI, Nica AC, Dermitzakis ET. Estimating the causal tissues for complex traits and diseases. Nat Genet. 2017;49: 1676–1683. 10.1038/ng.3981 [DOI] [PubMed] [Google Scholar]
  • 8.Gleason KJ, Yang F, Pierce BL, He X, Chen LS. Primo: integration of multiple GWAS and omics QTL summary statistics for elucidation of molecular mechanisms of trait-associated SNPs and detection of pleiotropy in complex traits. Genome Biol. 2020;21: 236. 10.1186/s13059-020-02125-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, et al. A gene-based association method for mapping traits using reference transcriptome data. Nat Genet. 2015;47: 1091–1098. 10.1038/ng.3367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48: 245–252. 10.1038/ng.3506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mancuso N, Freund MK, Johnson R, Shi H, Kichaev G, Gusev A, et al. Probabilistic fine-mapping of transcriptome-wide association studies. Nat Genet. 2019;51: 675–682. 10.1038/s41588-019-0367-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Smith GD, Ebrahim S. “Mendelian randomization”: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32: 1–22. 10.1093/ije/dyg070 [DOI] [PubMed] [Google Scholar]
  • 13.Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet. 2014;23: R89–98. 10.1093/hmg/ddu328 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Richardson TG, Hemani G, Gaunt TR, Relton CL, Davey Smith G. A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome. Nat Commun. 2020;11: 185. 10.1038/s41467-019-13921-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Barbeira AN, Bonazzola R, Gamazon ER, Liang Y, Park Y, Kim-Hellmuth S, et al. Exploiting the GTEx resources to decipher the mechanisms at GWAS loci. Genome Biol. 2021;22: 49. 10.1186/s13059-020-02252-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Broekema RV, Bakker OB, Jonkers IH. A practical view of fine-mapping and gene prioritization in the post-genome-wide association era. Open Biol. 2020;10: 190221. 10.1098/rsob.190221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Millstein J, Zhang B, Zhu J, Schadt EE. Disentangling molecular relationships with a causal inference test. BMC Genet. 2009;10: 23. 10.1186/1471-2156-10-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhong W, Spracklen CN, Mohlke KL, Zheng X, Fine J, Li Y. Multi-SNP mediation intersection-union test. Bioinformatics. 2019;35: 4724–4729. 10.1093/bioinformatics/btz285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.van der Graaf A, Claringbould A, Rimbert A, BIOS consortium, Westra H-J, Li Y, et al. A novel Mendelian randomization method identifies causal relationships between gene expression and low-density lipoprotein cholesterol levels. Nat Commun. 2020;11: 4930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pasaniuc B, Price AL. Dissecting the genetics of complex traits using summary association statistics. Nat Rev Genet. 2017;18: 117–127. 10.1038/nrg.2016.142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Park Y, Sarkar AK, He L, Davila-Velderrain J, De Jager PL, Kellis M. A Bayesian approach to mediation analysis predicts 206 causal target genes in Alzheimer’s disease. bioRxiv [Preprint]. 2017. bioRxiv 219428 [posted 2017 Nov 14; revised 2017 Nov 18; revised 2017 Dec 1; cited 2021 Mar 9]: [23 p.]. Available from: https://www.biorxiv.org/content/10.1101/219428v3 [Google Scholar]
  • 22.Porcu E, Rüeger S, Lepik K, eQTLGen Consortium, BIOS Consortium, Santoni FA, et al. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun. 2019;10: 3300. 10.1038/s41467-019-10936-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Barfield R, Feng H, Gusev A, Wu L, Zheng W, Pasaniuc B, et al. Transcriptome-wide association studies accounting for colocalization using Egger regression. Genet Epidemiol. 2018;42: 418–433. 10.1002/gepi.22131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Yuan Z, Zhu H, Zeng P, Yang S, Sun S, Yang C, et al. Testing and controlling for horizontal pleiotropy with probabilistic Mendelian randomization in transcriptome-wide association studies. Nat Commun. 2020;11: 3861. 10.1038/s41467-020-17668-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Zhang Y, Quick C, Yu K, Barbeira A, The GTEx Consortium, Luca F, et al. Investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis. Genome Biol. 2020;21: 232 10.1186/s13059-020-02026-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Gleason KJ, Yang F, Chen LS. A robust two-sample Mendelian Randomization method integrating GWAS with multi-tissue eQTL summary statistics. bioRxiv [Preprint]. 2020. bioRxiv 135541 [posted 2020 Jun 5; cited 2021 Mar 9]: [31 p.]. Available from: https://www.biorxiv.org/content/10.1101/2020.06.04.135541v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Brown CD, Mangravite LM, Engelhardt BE. Integrative modeling of eQTLs and cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs. PLoS Genet. 2013;9: e1003649. 10.1371/journal.pgen.1003649 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Jansen R, Hottenga J-J, Nivard MG, Abdellaoui A, Laport B, de Geus EJ, et al. Conditional eQTL analysis reveals allelic heterogeneity of gene expression. Hum Mol Genet. 2017;26: 1444–1451. 10.1093/hmg/ddx043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Consortium GTEx. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369: 1318–1330. 10.1126/science.aaz1776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Huang QQ, Tang HHF, Teo SM, Mok D, Ritchie SC, Nath AP, et al. Neonatal genetics of gene expression reveal potential origins of autoimmune and allergic disease risk. Nat Commun. 2020;11: 3761. 10.1038/s41467-020-17477-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A Probabilistic Programming Language. J Stat Softw. 2017;76. 10.18637/jss.v076.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81: 559–575. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wen X, Lee Y, Luca F, Pique-Regi R. Efficient Integrative Multi-SNP Association Analysis via Deterministic Approximation of Posteriors. Am J Hum Genet. 2016;98: 1114–1129. 10.1016/j.ajhg.2016.03.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wallace C. Eliciting priors and relaxing the single causal variant assumption in colocalisation analyses. PLoS Genet. 2020;16: e1008720. 10.1371/journal.pgen.1008720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Carvalho CM, Polson NG, Scott JG. The horseshoe estimator for sparse signals. Biometrika. 2010. pp. 465–480. 10.1093/biomet/asq017 [DOI] [Google Scholar]
  • 36.Berzuini C, Guo H, Burgess S, Bernardinelli L. A Bayesian approach to Mendelian randomization with multiple pleiotropic variants. Biostatistics. 2020;21: 86–101. 10.1093/biostatistics/kxy027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Fazia T, Egidi L, Ayoglu B, Beecham A, Bitti PP, Ticca A, et al. Bayesian Mendelian Randomization identifies disease causing proteins via pedigree data, partially observed exposures and correlated instruments. 2019. Available: http://arxiv.org/abs/1903.00682 [Google Scholar]
  • 38.Uche-Ikonne OO, Dondelinger F, Palmer T. Bayesian estimation of IVW and MR-Egger models for two-sample Mendelian randomization studies. Epidemiology. medRxiv; 2019. [Google Scholar]
  • 39.Higgins JPT, Thompson SG. Quantifying heterogeneity in a meta-analysis. Statistics in Medicine. 2002. pp. 1539–1558. 10.1002/sim.1186 [DOI] [PubMed] [Google Scholar]
  • 40.Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48: 481–487. 10.1038/ng.3538 [DOI] [PubMed] [Google Scholar]
  • 41.Wu Y, Zeng J, Zhang F, Zhu Z, Qi T, Zheng Z, et al. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat Commun. 2018;9: 918. 10.1038/s41467-018-03371-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mancuso, N. twas_sim repository; 2021 [cited 2021 Mar 9]. Database: GitHub [Internet] Available from: https://github.com/mancusolab/twas_sim
  • 43.Koster J, Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics. 2012. pp. 2520–2522. 10.1093/bioinformatics/bts480 [DOI] [PubMed] [Google Scholar]
  • 44.Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37: 658–665. 10.1002/gepi.21758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7. 10.7554/eLife.34408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Consortium GTEx, Laboratory Data Analysis &Coordinating Center (LDACC)—Analysis Working Group, Statistical Methods groups—Analysis Working Group, Enhancing GTEx (eGTEx) groups, NIH Common Fund, NIH/NCI, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550: 204–213. 10.1038/nature24277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Võsa U, Claringbould A, Westra H-J, Bonder MJ, Deelen P, Zeng B, et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. bioRxiv [Preprint]. 2018. bioRxiv 447367 [posted 2018 Oct 19; cited 2021 Mar 9]: [57 p.]. Available from: https://www.biorxiv.org/content/10.1101/447367v1 [Google Scholar]
  • 48.Strunz T, Grassmann F, Gayán J, Nahkuri S, Souza-Costa D, Maugeais C, et al. A mega-analysis of expression quantitative trait loci (eQTL) provides insight into the regulatory architecture of gene expression variation in liver. Sci Rep. 2018;8: 5865. 10.1038/s41598-018-24219-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Nikpay M, Goel A, Won H-H, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1000 Genomes—based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47: 1121. 10.1038/ng.3396 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Bacanu SA, Devlin B, Roeder K. The power of genomic control. Am J Hum Genet. 2000;66: 1933–1944. 10.1086/302929 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, et al. Population genomics of human gene expression. Nat Genet. 2007;39: 1217–1224. 10.1038/ng2142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Wright FA, Sullivan PF, Brooks AI, Zou F, Sun W, Xia K, et al. Heritability and genomics of gene expression in peripheral blood. Nat Genet. 2014;46: 430–437. 10.1038/ng.2951 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lloyd-Jones LR, Holloway A, McRae A, Yang J, Small K, Zhao J, et al. The Genetic Architecture of Gene Expression in Peripheral Blood. Am J Hum Genet. 2017;100: 228–237. 10.1016/j.ajhg.2016.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ouwens KG, Jansen R, Nivard MG, van Dongen J, Frieser MJ, Hottenga J-J, et al. A characterization of cis- and trans-heritability of RNA-Seq-based gene expression. Eur J Hum Genet. 2020;28: 253–263. 10.1038/s41431-019-0511-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wheeler HE, Shah KP, Brenner J, Garcia T, Aquino-Michaels K, GTEx Consortium, et al. Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues. PLoS Genet. 2016;12: e1006423. 10.1371/journal.pgen.1006423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, Gliedt TP, et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26: 2336–2337. 10.1093/bioinformatics/btq419 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hahne F, Ivanek R. Visualizing Genomic Data Using Gviz and Bioconductor. In: Mathé E, Davis S, editors. Statistical Genomics. New York, NY: Springer New York; 2016. pp. 335–351. [DOI] [PubMed] [Google Scholar]
  • 58.Strong A, Patel K, Rader DJ. Sortilin and lipoprotein metabolism: making sense out of complexity. Curr Opin Lipidol. 2014;25: 350–357. 10.1097/MOL.0000000000000110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Musunuru K, Strong A, Frank-Kamenetsky M, Lee NE, Ahfeldt T, Sachs KV, et al. From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature. 2010;466: 714–719. 10.1038/nature09266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Alshahid M, Wakil SM, Al-Najai M, Muiya NP, Elhawari S, Gueco D, et al. New susceptibility locus for obesity and dyslipidaemia on chromosome 3q22.3. Hum Genomics. 2013;7: 15. 10.1186/1479-7364-7-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Song Y, Ma R, Zhang H. The influence of MRAS gene variants on ischemic stroke and serum lipid levels in Chinese Han population. Medicine. 2019;98: e18065. 10.1097/MD.0000000000018065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Wu J, Yin R-X, Guo T, Lin Q-Z, Shi G-Y, Sun J-Q, et al. Association between the MARS rs6782181 polymorphism and serum lipid levels. Int J Clin Exp Pathol. 2015;8: 1855–1866. [PMC free article] [PubMed] [Google Scholar]
  • 63.Codina-Fauteux V-A, Beaudoin M, Lalonde S, Lo KS, Lettre G. PHACTR1 splicing isoforms and eQTLs in atherosclerosis-relevant human cells. BMC Med Genet. 2018;19: 97. 10.1186/s12881-018-0616-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Chen L, Qian H, Luo Z, Li D, Xu H, Chen J, et al. PHACTR1 gene polymorphism with the risk of coronary artery disease in Chinese Han population. Postgrad Med J. 2019;95: 67–71. 10.1136/postgradmedj-2018-136298 [DOI] [PubMed] [Google Scholar]
  • 65.Tall AR. Functions of cholesterol ester transfer protein and relationship to coronary artery disease risk. J Clin Lipidol. 2010;4: 389–393. 10.1016/j.jacl.2010.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Guerra R, Wang J, Grundy SM, Cohen JC. A hepatic lipase (LIPC) allele associated with high plasma concentrations of high density lipoprotein cholesterol. Proc Natl Acad Sci U S A. 1997;94: 4532–4537. 10.1073/pnas.94.9.4532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, et al. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet. 2017;101: 5–22. 10.1016/j.ajhg.2017.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Qin Y, Meric G, Long T, Watrous J, Burgess S, Havulinna A, et al. Genome-wide association and Mendelian randomization analysis prioritizes bioactive metabolites with putative causal effects on common diseases. Genetic and Genomic Medicine. medRxiv; 2020. [Google Scholar]
  • 69.He B, Shi J, Wang X, Jiang H, Zhu H-J. Genome-wide pQTL analysis of protein expression regulatory networks in the human liver. BMC Biol. 2020;18: 97. 10.1186/s12915-020-00830-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Folkersen L, Gustafsson S, Wang Q, Hansen DH, Hedman ÅK, Schork A, et al. Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals. Nat Metab. 2020;2: 1135–1148. 10.1038/s42255-020-00287-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Buccitelli C, Selbach M. mRNAs, proteins and the emerging principles of gene expression control. Nat Rev Genet. 2020. 10.1038/s41576-020-0258-4 [DOI] [PubMed] [Google Scholar]
  • 72.Battle A, Khan Z, Wang SH, Mitrano A, Ford MJ, Pritchard JK, et al. Genomic variation. Impact of regulatory variation from RNA to protein. Science. 2015;347: 664–667. 10.1126/science.1260793 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Li YI, van de Geijn B, Raj A, Knowles DA, Petti AA, Golan D, et al. RNA splicing is a primary link between genetic variation and disease. Science. 2016;352: 600–604. 10.1126/science.aad9417 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Chick JM, Munger SC, Simecek P, Huttlin EL, Choi K, Gatti DM, et al. Defining the consequences of genetic variation on a proteome-wide scale. Nature. 2016;534: 500–505. 10.1038/nature18270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Sinnott-Armstrong N, Naqvi S, Rivas M, Pritchard JK. GWAS of three molecular traits highlights core genes and pathways alongside a highly polygenic background. eLife. 2021;10: e58615. 10.7554/eLife.58615 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581: 434–443. 10.1038/s41586-020-2308-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Kowalski MH, Qian H, Hou Z, Rosen JD, Tapia AL, Shan Y, et al. Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations. PLoS Genet. 2019;15: e1008500. 10.1371/journal.pgen.1008500 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12: e1001779. 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Servin B, Stephens M. Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genetics. 2005. p. e114. 10.1371/journal.pgen.0030114.eor [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Wellcome Trust Case Control Consortium, Maller JB, McVean G, Byrnes J, Vukcevic D, Palin K, et al. Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat Genet. 2012;44: 1294–1301. 10.1038/ng.2435 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Kichaev G, Yang W-Y, Lindstrom S, Hormozdiari F, Eskin E, Price AL, et al. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genet. 2014;10: e1004722. 10.1371/journal.pgen.1004722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Hormozdiari F, Kostem E, Kang EY, Pasaniuc B, Eskin E. Identifying causal variants at loci with multiple signals of association. Genetics. 2014;198: 497–508. 10.1534/genetics.114.167908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Hutchinson A, Watson H, Wallace C. Improving the coverage of credible sets in Bayesian genetic fine-mapping. PLoS Comput Biol. 2020;16: e1007829. 10.1371/journal.pcbi.1007829 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Wang G, Sarkar A, Carbonetto P, Stephens M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J R Stat Soc Series B Stat Methodol. 2020;25: 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Valdar W, Sabourin J, Nobel A, Holmes CC. Reprioritizing genetic associations in hit regions using LASSO-based resample model averaging. Genet Epidemiol. 2012;36: 451–462. 10.1002/gepi.21639 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44: 512–525. 10.1093/ije/dyv080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Burgess S, Bowden J, Fall T, Ingelsson E, Thompson SG. Sensitivity Analyses for Robust Causal Inference from Mendelian Randomization Analyses with Multiple Genetic Variants. Epidemiology. 2017;28: 30–42. 10.1097/EDE.0000000000000559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Martin JS, Xu Z, Reiner AP, Mohlke KL, Sullivan P, Ren B, et al. HUGIn: Hi-C Unifying Genomic Interrogator. Bioinformatics. 2017;33: 3793–3795. 10.1093/bioinformatics/btx359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Sey NYA, Hu B, Mah W, Fauni H, McAfee JC, Rajarajan P, et al. A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles. Nat Neurosci. 2020;23: 583–593. 10.1038/s41593-020-0603-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

David Balding, Caroline Relton

5 Nov 2020

Dear Dr Love,

Thank you very much for submitting your Research Article entitled 'MRLocus: identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Caroline Relton, PhD

Associate Editor

PLOS Genetics

David Balding

Section Editor: Methods

PLOS Genetics

Reviewer's Comments to the Authors:

Reviewer #1: Thank you for the opportunity to review this manuscript. The authors have developed a new method to integrate eQTL data with findings from GWAS to help develop insight into the underlying biology responsible for trait-associated genetic variants. This area of research has been extremely busy in the last few years with new methods seemingly appearing every month or so. That said, I believe the MRLocus method devised by these authors is one of the better and innovative approaches in this space and am interested to apply it myself in future work. However, I do have some comments, mainly concerning choices of parameters and requests for further simulations, which I would kindly ask the authors to address before I believe it is suitable for publication.

Comments:

1) The authors state in the introduction that MRLocus estimates gene-to-trait effects at loci containing ‘multiple LD-independent’ eQTL. However, in both their simulations and applied analyses they apply a threshold of r2 < 0.2 (based on the 1KG ref panel). I think this is slightly misleading to the reader, as in the applied analysis of SORT1 it suggests there are 12 ‘independent’ eQTL in liver tissue. However, based on this threshold I think that these 12 variants may well be weakly correlated with each other. In a conventional MR analysis I would be very skeptical of an analysis using this threshold as weakly correlated instruments are likely to result in ‘double counting’ which can bias findings.

I therefore think it is important that the authors run simulations to investigate this, using a more stringent threshold i.e. r2<0.001 to compare how their method compares to other approaches using truly independent instruments. Furthermore, I would like to see how false positive rates are affected when the r2 threshold is varied. My concern is that MRLocus may be prone to false positive findings using an r2 < 0.2, although it will be interesting to see how this compares to other methods in the field. Additionally, the instances of ‘independent’ eQTL should be changes if the authors can demonstrate that their method performs well under the default setting of r2<0.2.

2) Related to my previous point, I think it should be made clear in the applied analysis how many instruments are used at each locus. Although it is available to the reader in the supplementary material, making this clear in the main text means that the reader doesn’t have to seek this information out. The authors highlight SORT1 which had n=12 instruments in their analysis, although for example MRAS only had a single eQTL instrument. Can the authors comment on whether their method provides added value when this is the case (i.e. single instrument MR) over say the eCAVIAR method which their method is built on?

3) I’m also concerned by the P<0.001 threshold applied to identify eQTL as instruments. Can the authors please evaluate how a more stringent threshold influences the false positive rates of their method compared to those it is benchmarked against? In a conventional MR setting an F-stat < 10 is typically used to indicate weak instruments, which will likely be the case when eQTL have P=10-03. This issue is not highlighted in this work as the authors have selected 5 genes which have strong prior knowledge that they are involved in disease risk/trait variation. However, if readers are interested in applying MRLocus to detect novel genes responsible for GWAS signals (which should be its primary purpose I would’ve thought?) then the choice of p-value threshold will likely be important in mitigating false positive rates.

In my opinion applying conventional MR instrument criteria of P<5x10-08 and r2<0.001 is not necessary for cis-QTL analyses, however I think the authors should simulate this to investigate how these parameters influence the performance of their method. Some discussion regarding this would also be welcome for readers interested in applying their approach but perhaps have not studied MR extensively.

4) Although the authors have used the 1KG reference panel is their study for LD calculations and clumping, this dataset consisting of 503 European individuals which based on modern standards is severely outdated. Although I won’t ask the authors to re-run their analyses, I would ask that they mention in their discussion that their method (as with others in the field) would really benefit from larger reference panel for more accurate LD calculations. Even though they have restricted variants to MAF > 0.01, n=503 is going to be unreliable for low frequency SNPs in my experience.

5) The authors do an excellent job of providing a summary of genetic colocalization & TWAS methodologies in their introduction, as well as why introducing MR to address the challenges of deciphering GWAS signals is important. This is a minor suggestion, but I felt that paragraphs 4-6 of the introduction slightly got away from the core message of this paper and may be slightly overwhelming for readers not primarily involved in this topic of research. Perhaps these paragraphs could therefore be streamline, focusing on the methods which are used to benchmark MRLocus, rather than the plethora of other approaches in this very busy area of research.

Reviewer #2: The authors present a new approach to summary statistics-based causal inference tailored to gene expression-to-trait settings, which includes a preprocessing step of colocalization analysis. They have shown across a wide range of simulation settings that their newly proposed method has lower error, less bias and better coverage of the confidence interval than state-of-the-art competing methods. They also apply their method to real data and replicate well-established causal links between certain expression levels and complex traits. While the method seems solid and the simulation results are promising, I suspect that some of the modelling (and simulation) assumptions are unrealistic and may give an unfair advantage to MRclust. The real data application could be more thorough, but it is acceptable for a mainly method paper. Below I list my comments and questions for the authors that may improve the m/s.

Major comments:

It would be clearer if the authors stated the exact mathematical model (in equations) for the simulation process instead of referring to another publication and software (twas_sim). It is not clear for example how much horizontal pleiotropy was added to the signals (i.e. violation of the MR assumption)?

I feel there is a slight confusion between two concepts: expression mediated heritability [A] (which is the fraction of trait heritability mediated by gene expression) vs mediated trait variance explained [B] (as far as I understand from the authors’ definition: the squared standardized (SD/SD) causal effect). For me the relationship between them is A = h2-expression x B, do I understand well? Would be great if the terminology could be clarified in the paper.

While I see the difference between the colocalization step of MRlocus and eCAVIAR, I’m not sure how different results they would give. (Z-scores can be easily transformed back to effect sizes.) Have the authors compared head-to-head their approach to eCAVIAR when applied to the simulated data set to show superior performance?

“For all simulations, if there were no SNPs in the simulated locus with eQTL un-adjusted p-value < 0.001 then a new seed was drawn.” – This leads to Winner’s curse bias, since you only use summary statistics that (by chance) pass the significance threshold, i.e. their effects will be overestimated and hence lead to an underestimation of the causal effect.

Low coverage can be a consequence of two issues: (unaccounted) bias and/or badly calibrated SE. It would be helpful if the authors could explore what is the main reason for low coverage of all methods. To me it stems mostly from bias – can the authors show how methods compare in terms of bias?

Fig 2A shows that MRlocus outperforms the oracle (known causal SNPs) both in terms of RMAE/MAE. The authors argue that it is due to the re-estimation of the effects using colocalization. This however would only work if there is no pleiotropy present at all. We know from real data, that there is almost always pleiotropy present, so simulation setting must be extended to more realistic situations. (E.g. TWMR has an outlier removal step, analogous to MR-PRESSO to protect against such situations) In addition: it seems to me that in the simulations there were no trait-only effects simulated, which is again a very important omission. Gene->trait causal effects do not mean that all trait-associated SNP at the locus must be expression associated, traits can have additional genetic components. Adding this more realistic setting would show the weakness of MRlocus, because it would artificially pull up the effect of those SNPs on expression. Both these situations should be incorporated into the simulations, otherwise the settings are too artificial and unrealistic.

While it is reassuring to see that previous findings are confirmed by MRlocus, but would be nice to see examples of gene-trait associations missed by other methods and only picked up by MRlocus or potentially spurious causal effects (by TWMR or PTWAS) disproved by MRclust.

Since the authors point out the importance of allelic heterogeneity, larger eQTL data sets, such as the eQTL-Gen summary statistics (whole blood, n=32K) would be more ideal to demonstrate the advantages of their method. Have the authors tried MRclust on such data?

I’d move quite some part of the supplementary methods (especially the method description) to the main text Methods – especially given that PLoS Gen has no strict page limit.

Minor comments:

“the original TWMR paper performed conditional analyses” – TWMR performs approximate conditional analysis, which is designed for summary statistics.

“We therefore considered even lower heritability of trait on gene expression of 0.5% and 0.1%” – Please rephrase this sentence, it is currently not understandable, how a trait can have a heritability on expression?

Fig 4B vertical whiskers are missing to indicate SE.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Decision Letter 1

David Balding, Caroline Relton

26 Feb 2021

Dear Dr Love,

Thank you for attending to the reviewer's comments so thoroughly. There are a few very minor issues outstanding that have been highlighted by one reviewer (see below). Please address these when preparing your final submission but we are confident that you will address these easily so that we can already decide that your manuscript entitled "MRLocus: identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Caroline Relton, PhD

Associate Editor

PLOS Genetics

David Balding

Section Editor: Methods

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Thank you for addressing my comments and congratulations on an excellent study.

Reviewer #2: The authors have done an excellent job addressing my comments. Only a few minor ones I’d like to flag up:

1. “βjeQTL ~ N(0, (h2g / ncausal )^{-1/2} ).” – I’m not sure why the per SNP variance needs to be raised to the power -1/2?

2. “ygene = Z βeQTL + ε,” Define epsilon in the equations, not only buried in the text. Otherwise the reader doesn’t understand why you introduced s_{err}^2.

3. Please provide the exact equation for y_trait instead (or at least on top) of the ambiguous description in the text.

4. The figures are of very poor resolution, pls change.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Tom Richardson

Reviewer #2: No

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01297R1

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

David Balding, Caroline Relton

7 Apr 2021

PGENETICS-D-20-01297R1

MRLocus: identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity

Dear Dr Love,

We are pleased to inform you that your manuscript entitled "MRLocus: identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Alice Ellingham

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Methods. Description of statistical methods used in MRLocus.

    (PDF)

    S1 Text. Supplementary Table and Figures.

    (PDF)

    Attachment

    Submitted filename: Response for MRLocus.pdf

    Data Availability Statement

    All real and simulated preprocessed data used to run the software in the article can be found at the GitHub repository:https://github.com/mikelove/mrlocusPaper eQTL data on tibial artery and liver was obtained from the Genotype-Tissue Expression (GTEx) project v8 (GTEx Consortium et al., 2017). Bernhard Weber, corresponding author on Strunz et al. 2018 can be contacted at bweb@klinik.uni-regensburg.de.


    Articles from PLoS Genetics are provided here courtesy of PLOS

    RESOURCES