Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2018 Aug 1;21(1):86–101. doi: 10.1093/biostatistics/kxy027

A Bayesian approach to Mendelian randomization with multiple pleiotropic variants

Carlo Berzuini 1,, Hui Guo 1, Stephen Burgess 2, Luisa Bernardinelli 3
PMCID: PMC6920542  EMSID: EMS82527  PMID: 30084873

Summary

We propose a Bayesian approach to Mendelian randomization (MR), where instruments are allowed to exert pleiotropic (i.e. not mediated by the exposure) effects on the outcome. By having these effects represented in the model by unknown parameters, and by imposing a shrinkage prior distribution that assumes an unspecified subset of the effects to be zero, we obtain a proper posterior distribution for the causal effect of interest. This posterior can be sampled via Markov chain Monte Carlo methods of inference to obtain point and interval estimates. The model priors require a minimal input from the user. We explore the performance of our method by means of a simulation experiment. Our results show that the method is reasonably robust to the presence of directional pleiotropy and moderate correlation between the instruments. One section of the article elaborates the model to deal with two exposures, and illustrates the possibility of using MR to estimate direct and indirect effects in this situation. A main objective of the article is to create a basis for developments in MR that exploit the potential offered by a Bayesian approach to the problem, in relation with the possibility of incorporating external information in the prior, handling multiple sources of uncertainty, and flexibly elaborating the basic model.

Keywords: Correlated instruments, Egger regression, Instrumental variable, Mediation, Median estimator, Metabolomics, Shrinkage, Sparsity prior

1. Introduction

Many statistical studies aim to assess the causal effect of a phenotype or exposure (Inline graphic) on an outcome (Inline graphic). In many such studies, an experimental design is unfeasible, and the only remaining option is to work on the basis of observational data. Unfortunately in this case, no matter how impeccable is the study design, how accurate are the observations and smart the inference algorithm, there is no guarantee that the result will not be biased due to unobserved confounding or reverse causation. A useful approach to this situation is Mendelian randomization (MR) (Katan, 1986; Davey Smith and Ebrahim, 2003; Lawlor and others, 2008). The bare bones of the idea are that, under certain assumptions, for a phenotype Inline graphic to be a causal influence on an outcome Inline graphic, we expect a genetic variant Inline graphic that modulates Inline graphic to likewise affect Inline graphic. Information about Inline graphic can then be used as an instrument to assess the causal effect of Inline graphic on Inline graphic, despite confounding.

The potential impact of MR on science cannot be underestimated (Robinson and others, 2017). In various occasions, an MR study based on observational data has predicted the outcome of a clinical trial, thereby supporting or casting doubt on the motivating causal hypothesis (Voight and others, 2012). Furthermore, MR can help biologists reconstruct a disease process from its molecular causes to its phenotypic manifestations, and unravel causal relationships of pharmacological relevance through the analysis of biobank data.

Early implementations of MR used a single or a handful of instrumental variants, under the untestable assumption that these variants are not pleiotropic, i.e. that they affect the outcome only through the changes they induce in the exposure. Recent developments, mostly in the frequentist realm, have focused on methods that use multiple instruments, while allowing for “cryptic” pleiotropy, i.e. allowing an unspecified subset of the instruments to affect the outcome directly. Examples of multi-instrument Mendelian randomization (MIMR) methods that allow for cryptic pleiotropy are the Egger regression (ER) and the weighted median estimator (WME) method of Bowden and others (2016).

The existing frequentist approaches to MR do not coherently account for important sources of uncertainty, such as the uncertainty arising from the estimation of the instrument-exposure (i-e) associations. Hence our concern that these methods may yield over-optimistic results. We attempt to remedy this by proposing a Bayesian approach to MR (see Didelez and Sheehan, 2007; Didelez and Sheehan, 2008; Burgess and Thompson, 2010; Burgess and Thompson, 2012; Jones and others, 2012), which deals with cryptic pleiotropy and can in principle acknowledge uncertainty at all levels of the model. Our method allows the user to shape the prior on the basis of external information, for stronger and more accurate inferences, although it will work with vague prior specifications, too. It combines the power of Bayesian analysis with that of Markov chain Monte Carlo (MCMC) inference, for an exceptional freedom in elaborating the basic model. Extensions of our method to deal with non-linear dependencies and model uncertainty are under investigation, but remain outside the scope of this article. We restrict our attention to continuous Inline graphic and Inline graphic variables.

In Section 2, we review past related work, and place our method in that context. In Section 3, we introduce our MIMR framework in a simple setting. The idea here is that the pleiotropic effects are represented in the model by unknown parameters, with an independence sparsity prior that assumes an unspecified subset of these parameters to be zero. Incorporating this prior yields a proper posterior for the causal effect, which we MCMC-sample to obtain point and interval estimates. Also discussed in this section is the use of external information to shape an informative prior. In Section 4, we assess the performance of our method in relation to the number of instruments, the amount and direction of pleiotropy, and the degree of linkage disequilibrium (LD) between the instrumental variants, taking the performance of the WME method as a reference. Thanks to the explicit modeling of the direct instrument-outcome (i-o) effects, our approach bears relationships with mediation analysis. This connection is explored in depth in Section 5, where we consider a problem involving two exposures (instead of one), and use our method to estimate direct and indirect effects. This is further illustrated in Section 6 with the aid of a study in metabolomics. This article is based on the decision-theoretic causality framework proposed by Dawid (2000) and described in Chapter 4 of Berzuini and others (2012).

2. Background

Let Inline graphic denote a set of imperfectly observed exposure-outcome (e-o) confounders, responsible for the correlation between Inline graphic and Inline graphic being not totally attributable to a causal relationship. In order for a scalar variable Inline graphic to qualify as an instrument for estimating the causal effect of Inline graphic on Inline graphic, we generally require it to satisfy the following three conditions, where we use the notation Inline graphic for “Inline graphic is independent of Inline graphic given Inline graphic” (Dawid, 1979), and Inline graphic for the negation of the same sentence:

Condition 1

(marginal relevance) Inline graphic is associated with the exposure, formally Inline graphic.

Condition 2

(confounder independence) Inline graphic is independent of the e-o confounders, formally Inline graphic.

Condition 3

(exclusion restriction) Inline graphic is independent of Inline graphic, given Inline graphic and Inline graphic, formally Inline graphic.

The last two conditions are not testable on the basis of the usually available Inline graphic data. Three examples of MR problem are graphically represented in Figure 1, where the Inline graphic arrow represents the causal effect of inferential interest, the Inline graphic arrow a pleiotropic effect, and a Inline graphic arrow an i-e association, which none of the methods discussed assumes to be causal. We regard the graphs of Figure 1 as expressing sets of conditional independence relationships, which can be read off them with the aid of the Inline graphic-separation criterion of Geiger and others (1990). Conditions 1–3 are satisfied in Figure 1a. Condition 3 is violated in Figure 1b by the presence of the Inline graphic arrow.

Fig. 1.

Fig. 1.

Conditional independence graph representations of a Mendelian randomization problem. In (a) the graph represents a set of conditional independence assumptions that do not violate Conditions 1–3 of Section 2. In (b), the arrow from Inline graphic to Inline graphic violates Condition 3. In (c) the graph represents a class of problems with multiple instruments, where Conditions 1 and 2 are not violated.

With reference to Figure 1a, if we assume linear additive dependencies between the variables in the graph, and let Inline graphic and Inline graphic denote the estimated slopes in the regressions of Inline graphic on Inline graphic and Inline graphic on Inline graphic, respectively, then the instrumental variable (IV) estimator of the causal effect of Inline graphic on Inline graphic is Inline graphic. A small sample size and/or weak i-e associations may cause the data to deviate from Condition 2 (Nelson and Startz, 1990), and consequently the IV estimate to be affected by the so-called weak instruments bias.

Existing frequentist methods admit a collection of independent instruments, Inline graphic, and they require Conditions 1 and 2 to hold for all instruments, formally Inline graphic and Inline graphic, for Inline graphic, as in Figure 1c. In these methods, each Inline graphicth instrument contributes a separate IV estimate Inline graphic of the causal effect of Inline graphic on Inline graphic. When the IV estimates of several instruments show reasonable concordance, it would appear that a causal conclusion is defensible, pleiotropy notwithstanding. This idea is developed by Egger and others (1997), who suggest that concordance can be tested by regressing Inline graphic on Inline graphic. Under the assumption that the i-e associations (or instrument strengths) are independent of the direct effects (pleiotropic associations), usually referred to as the INSIDE assumption (Kolesar and others, 2014), evidence of a linear relationship between Inline graphic and Inline graphic will support (and provide an estimate of) the causal effect of interest, whether or not the instruments satisfy Condition 3. For finite numbers of instruments, the frequentist interpretation of the INSIDE is that the correlation between pleiotropic and i-e associations is zero. This is an untestable property, although some indirect empirical evidence (Pickrell and others, 2015) can be summoned in its support. The Egger method requires the instrumental SNPs to be recoded to ensure that the i-e associations have the same sign, although, unfortunately, INSIDE is sensitive to changes such. Moreover, by treating the Inline graphic as fixed quantities, the Egger method ignores the imprecision introduced by their estimation.

Another popular approach to MIMR is the median estimator. If Conditions 1 and 2 are valid, the instruments are independent and at least half of them satisfy Condition 3, then the median of their corresponding IV estimates will be a consistent estimate of the causal effect (Han, 2008). Bowden and others (2016) proposed a widely used weighted version of this estimator—the WME of the causal effect.

In this article, we propose a Bayesian approach to MR that allows an unspecified subset of the instruments to be pleiotropic, provided that Condition 2 and a Bayesian version of the INSIDE assumption (see Condition 4 in the next section) are satisfied. The proposed approach has the following distinguishing features. It allows for (moderate) instrument–instrument correlation, and does not require the signs of the instrument effects to be manipulated. It treats the i-e associations as random quantities, which we can learn about via prior-to-posterior updating. Once the posterior distributions (e.g. for the i-e associations and for the causal effect, etc.) have been calculated, they can be used as priors in future studies, in what can be regarded as a sequential learning process. Finally, while the aforementioned frequentist methods emphasize the construction of estimators for specific situations, our combined use of Bayesian inference and MCMC computation allows the researcher to focus on model choice, to better explore the possibility of tackling elaborated versions of the basic model.

We conclude this section with a note on the decision-theoretic formulation of causality proposed by Dawid (2000), and on the corresponding definition of causal effect, which we adopt in the present work. In accord with this formulation, we define the causal effect of Inline graphic on Inline graphic as the difference between the expected values of Inline graphic under a (hypothetical) intervention that imposes on Inline graphic a reference value Inline graphic and another intervention that imposes a generic value Inline graphic. To express this, let the symbol Inline graphic label the regime under which the value of Inline graphic is generated, with Inline graphic indicating that Inline graphic is fixed to value Inline graphic by an intervention of the relevant type, and Inline graphic denoting the observational regime under which the data have actually been obtained. Then the average causal effect (ACE) of Inline graphic on the continuous outcome Inline graphic is defined by Inline graphic. Based on our observational data (obtained exclusively under regime Inline graphic) we can estimate ACE under the (bold) assumption Inline graphic, that the conditional distribution of Inline graphic given Inline graphic in the generic individual characterized by a specific value of Inline graphic, depends on Inline graphic, but not further on whether the value Inline graphic has arisen by passive observation or through the intervention of interest. The implications of this condition in a MR context, and, more in general, in the context of IV analysis, are examined in Chapter 4 of Berzuini and others (2012)

3. Methods

We shall now introduce our approach to MR with reference to a one-sample setting, where each individual is characterized by a complete set of observed values for Inline graphic, Inline graphic and Inline graphic. We assume linear additive dependencies and write

graphic file with name M88.gif (3.1)
graphic file with name M89.gif (3.2)
graphic file with name M90.gif (3.3)

where Inline graphic stands for a normal distribution with mean Inline graphic and variance Inline graphic, the symbol Inline graphic denotes the i-e associations and Inline graphic are the pleiotropic effects. The causal effect of interest, denoted as Inline graphic, represents the change in Inline graphic caused by an interventional unit change in Inline graphic. We may equivalently write

graphic file with name M99.gif (3.4)
graphic file with name M100.gif (3.5)

with Inline graphic, Inline graphic and Inline graphic. Equations (3.43.5) involve a vector of parameters Inline graphic, with Inline graphic. The model is not completely identifiable, in the sense that the information contained in the observed covariances does not lead to a unique solution for Inline graphic or any subset of Inline graphic containing the parameter of inferential interest, Inline graphic. In fact, parameters Inline graphicare identified by the Inline graphic conditions provided by equalities Inline graphic and Inline graphic. Unfortunately, the remaining Inline graphic parameters, including the causal effect of interest, Inline graphic, remain unidentified. This is because the equality Inline graphic (with Inline graphic and Inline graphic) provides additional Inline graphic conditions, and a further condition is obtained from the equation Inline graphic, for a total of additional Inline graphic conditions, which are not sufficient to identify Inline graphic parameters.

From a Bayesian point of view, non-identifiability can be negotiated by using a scientifically plausible prior that induces a proper posterior on Inline graphic. Formally, if Inline graphic denotes data, the posterior can always be written in the product form:

graphic file with name M124.gif

Because the last term above is the conditional posterior of an unidentifiable parameter, it reduces to the conditional prior: Inline graphic, which leads to

graphic file with name M126.gif

from which it follows that we may make the full posterior distribution proper by allowing the last term of the above product to take the form of a proper distribution. To proceed, we introduce the following Bayesian interpretation and generalization of INSIDE:

Condition 4

(instrument effects orthogonality (IEO)) Each component of Inline graphic is a priori independent of the parameters of the exposure model, Inline graphic, and we specify a proper and scientifically plausible prior Inline graphic. One option is to impose Inline graphic, as in a standard IV analysis, which however will often be unrealistic. A second option is to impose that the effect exerted by each instrument on the outcome through the mediation of Inline graphic is greater in magnitude than the corresponding pleiotropic (unmediated) effect. We use none of these. In the following section, we construct Inline graphic from our belief that some of the components of Inline graphic are zero.

3.1. The prior

We shall now discuss the prior specifications for model (3.13.3). In many applications, it will be reasonable to assume that some components of vector Inline graphic are zero, i.e. that an unspecified subset of the set of instruments have no pleiotropic effect. This justifies imposing on Inline graphic a shrinkage prior, e.g. by taking each Inline graphic to be a priori independently drawn from a Laplace (double exponential) distribution with mean Inline graphic and unknown variance Inline graphic, with Inline graphic distributed a priori as Inline graphic, where Inline graphic denotes the half-Cauchy density on the positive reals, with scale parameter Inline graphic. An alternative choice is to impose on each Inline graphicth component of Inline graphic the horseshoe shrinkage prior proposed by Carvalho and others (2010), which has the hierarchical structure Inline graphic, where the degree of shrinking of each Inline graphicth component of Inline graphic is controlled by an unknown parameter Inline graphic. A high value of Inline graphic corresponds to a near-zero value of the shrinkage weight, Inline graphic, in which case this prior leaves the magnitude of Inline graphic almost unaffected. In contrast, a near-zero value of Inline graphic corresponds to a near-unit shrinkage weight, which will result in the estimate of Inline graphic being heavily shrunk towards zero. Under the horseshoe prior, each Inline graphic is mixed over its own Inline graphic, with Inline graphic drawn from a Inline graphic distribution governed by an unknown parameter Inline graphic. Both the Inline graphic parameters, which are in charge of controlling the local degrees of shrinking, and parameter Inline graphic, which controls the global degree of shrinking, are inferred from the data, with minimal input from the user. With Inline graphic, the horseshoe specifications induce on Inline graphic a horseshoe-shaped Inline graphic distribution with one peak at Inline graphic and another at Inline graphic. The two peaks may be interpreted in terms of the horseshoe prior inducing sparsity in a selective fashion. The lower peak of the distribution of Inline graphic accounts for the small components of Inline graphic, which our model recognizes as noise and heavily shrinks towards zero. The upper peak of the distribution accounts for the large components of Inline graphic, which our model recognizes as pleiotropic, and leaves almost unaffected, thereby reducing the influence of the pleiotropic instruments on the estimate of Inline graphic.

In our experience, assigning the remaining parameters uniform priors does not cause numerical problems, thanks to the ability of the Stan toolbox (STAN Development Team, 2014) to determine a sensible bounding of the search space via variational algorithms. However, we shall often wish to make our priors informative, for stronger inferences. In future studies, we speculate that it will be possible to shape informative priors on the basis of data collected in previous studies (provided these satisfy the necessary conditions). For example, Inline graphic data from past studies can be used to construct a prior for Inline graphic, in such a way to reduce the weak instruments bias.

Consider also that mathematical relationships between parameters may be used to derive sensible local priors. For example, parameters Inline graphic and Inline graphic are not identifiable, but the model links them to Inline graphic through the identity Inline graphic, which justifies the inequalities Inline graphic and Inline graphic. Because we are able to learn about Inline graphic from external data, we can use this information, in conjunction with the above inequalities, to derive joint prior bounds for Inline graphic and Inline graphic (not illustrated in this article). Alternatively, we may establish an upper bound for Inline graphic, denoted by Inline graphic, and impose the prior bound Inline graphic. In some situations, a posterior distribution for the causal effect might become available from previous studies, and be used, under assumptions, as our prior for Inline graphic. Prior information about Inline graphic might become available with the development of web repositories containing lists of instruments for specific exposures. Finally, in certain situations it might be reasonable to assume a priori that each direct effect Inline graphic is smaller in magnitude than the corresponding indirect effect, Inline graphic.

In our analyses of real and simulated data, we assigned Inline graphic and Inline graphic uniform prior distributions with positive support. We assigned Inline graphic, Inline graphic, Inline graphic and Inline graphic independent uniform priors, and we took each Inline graphic, for Inline graphic, to be independently drawn from a normal Inline graphic prior, with hyperparameters Inline graphic and Inline graphic subject to uniform priors.

4. Simulation experiment

We performed a simulation experiment to evaluate our model’s performance in relation to the number of instruments and individuals, the direction and amount of pleiotropy, and the degree of correlation between the instruments. Although performance comparisons are not a primary objective of this article, we shall compare our method’s performance with the WME in terms of bias, coverage and power.

Our simulations were based on sequences of SNPs of real individuals, with each SNP expressed on an interval scale as an allele dose (0,1,2). We considered the 21 simulation scenarios described in Table 1. In each of these, we simulated 800 datasets with the causal effect Inline graphic set to zero, and further 800 datasets with Inline graphic set to 0.35, which allowed us to assess each method’s performance under the null and under the alternative hypothesis. The SNP sequences changed from one individual to the next, but they were kept fixed across scenarios and simulations, except for scenarios 14 to 21, where they changed from one scenario to the next to represent different degrees of LD between the SNPs.

Table 1.

Comparative assessment of the proposed method (with a horseshoe prior for Inline graphic) and of the WME, in relation to the mean pleiotropy, the number of instruments, the degree of linkage disequilibrium (Inline graphic) between instruments and the dispersion of the Inline graphic instrument-exposure associations (column 4)

Scenario Number of individuals Mean pleiotropy Standard deviation Inline graphic No. of instruments Linkage disequilibrium (Inline graphic) Coverage under the null Coverage under the alternative Power Bias under the null Bias under the alternative Coverage under the null Coverage under the alternative Power Bias under the null Bias under the alternative
          Our method Weighted median estimator
1 500 0.012 0.02 60 0 79 86 93 Inline graphic0.04 Inline graphic0.06 70 78 88 Inline graphic0.04 Inline graphic0.04
2 500 0.006 0.02 60 0 89 86 95 Inline graphic0.02 Inline graphic0.03 78 79 92 0.01 Inline graphic0.03
3 500 0 0.02 60 0 91 94 99 Inline graphic0.003 0.02 80 81 94 0.01 0.01
4 500 Inline graphic0.006 0.02 60 0 90 88 99 0.03 0.03 75 80 98 0.04 0.06
5 500 Inline graphic0.012 0.02 60 0 85 81 99 0.06 0.06 73 73 98 0.07 0.08
6 500 0.012 0.02 20 0 90 88 62 Inline graphic0.03 0.02 82 80 60 Inline graphic0.02 0.08
7 500 0 0.02 20 0 90 93 73 0.01 0.01 80 87 71 0.03 0.01
8 500 Inline graphic0.012 0.02 20 0 86 89 80 0.06 0.09 81 82 78 0.07 0.12
9 500 0.012 1.0 60 0 95 94 99 0.00 0.00 83 97 99 0.00 0.00
10 500 0.006 1.0 60 0 96 94 99 0.00 0.00 89 98 99 0.00 0.00
11 500 0 1.0 60 0 96 95 99 0.00 0.00 90 99 99 0.00 0.00
12 500 Inline graphic0.006 1.0 60 0 95 94 99 0.00 0.00 88 99 99 0.00 0.00
13 500 Inline graphic0.012 1.0 60 0 95 93 99 0.00 0.00 85 99 99 0.00 0.00
14 500 (Inline graphic0.012 to 0.012) 0.02 60 0.33 96 96 97 0.05 0.04 12 11 98 0.09 0.08
15 500 (Inline graphic0.012 to 0.012) 0.02 60 0.54 96 95 94 Inline graphic0.05 Inline graphic0.06 12 12 98 0.07 0.1
16 500 (Inline graphic0.012 to 0.012) 0.02 60 0.63 81 70 95 0.09 0.09 8 10 97 0.13 0.11
17 500 (Inline graphic0.012 to 0.012) 0.02 60 0.70 82 72 94 0.07 0.08 2 3 96 0.08 0.09
18 300 (Inline graphic0.012 to 0.012) 0.02 60 0.33 98 96 72 Inline graphic0.05 Inline graphic0.05 19 24 93 Inline graphic0.14 Inline graphic0.13
19 300 (Inline graphic0.012 to 0.012) 0.02 60 0.53 98 96 66 Inline graphic0.04 Inline graphic0.04 11 12 85 Inline graphic0.17 Inline graphic0.16
20 300 (Inline graphic0.012 to 0.012) 0.02 60 0.62 96 95 48 0.06 0.07 8 10 84 Inline graphic0.18 Inline graphic0.17
21 300 (Inline graphic0.012 to 0.012) 0.02 60 0.70 85 90 12 Inline graphic0.15 Inline graphic0.18 4 9 83 Inline graphic0.19 Inline graphic0.17

Coverage and power are expressed as percentages.

Each of Scenarios 1 to 13 uses independent SNPs, and is characterized by (i) the sample size reported in column 2 of Table 1, (ii) the value of Inline graphic, the mean pleiotropic effect, reported in column 3, and (iii) the value of Inline graphic reported in column 4, which controls the variability of the strength, Inline graphic, from one instrument to the next. Note that by varying Inline graphic, we explore different types of pleiotropy: balanced (Inline graphic), negative (Inline graphic) and positive (Inline graphic). In particular, by allowing Inline graphic to take values Inline graphic, we have included situations where the pleiotropic component of the effect of the instrument on the outcome is on average stronger than the component mediated by the exposure (indirect component). At each new simulation, new values for the model parameters were generated. In particular, in Scenarios 1 to 13, each component of Inline graphic was independently drawn from Inline graphic. A randomly selected subset (40%) of the components of Inline graphic were independently drawn from Inline graphic, the remaining components being set to 0. The proportion of instruments with a significant (Inline graphic) marginal association with the exposure varied between 70% and 100% across the simulations. Also, parameters Inline graphic and Inline graphic were drawn from Inline graphic and Inline graphic, respectively, and Inline graphic and Inline graphic were independently drawn from Inline graphic, so as to have a positive average correlation between Inline graphic-errors and Inline graphic-errors. Parameters Inline graphic and Inline graphic were sampled from sharp inverse-gamma distributions with means 0.1 and 0.3, respectively. Conditional on the generated parameter values, at each new simulation we generated values for variables Inline graphic, for each individual, on the basis of Equations (3.13.3) and in conformity with the IEO condition.

Scenarios 14 to 21 involve instrumental SNPs with increasing degrees of mutual correlation (average Inline graphic reported in column 6). These scenarios were generated in the same way as the previous ones, except for vector Inline graphic. After the elements of this vector were simulated, the majority of them were set to zero, so as to mimic the situation where only a small number of instruments have a non-null causal or conditional effect on the exposure. Also, the components of Inline graphic were independently drawn from Inline graphic, with Inline graphic uniformly distributed between Inline graphic, so as to embrace situations where the pleiotropic effect is on average stronger than the i-o indirect effect.

Each simulated dataset was analyzed via WME to obtain a point and a bootstrapped 95% confidence interval for Inline graphic, and then via our model (with a horseshoe prior for Inline graphic) to obtain a posterior mean and a 95% credible interval for Inline graphic. On the basis of these results, we assessed performance in terms of bias, coverage and power. The analysis with our model was performed by using the Hamiltonian MCMC methods (Metropolis and others, 1953; Neal, 2011) provided by the program Stan (STAN Development Team, 2014). Stan employs a combination of variational (Wainwright and Jordan, 2008) and MCMC methods. The former are used to generate an approximation of the posterior distribution of the model parameters. The approximation is then used to guide the MCMC exploration of the posterior. No major Markov chain mixing problems were encountered.

We shall now briefly discuss the results of the simulations. Scenarios 1 to 8 were based on independent instruments. Table 1 tells us that, in these scenarios, (i) in both methods an increase in the number of instruments corresponds to an increase in power, (ii) in both methods an increase in the number of instruments corresponds to a drop in coverage under the null, the drop being modulated by the amount of directional pleiotropy, and (iii) in both methods, positive pleiotropy reduces power. In our case, a positive pleiotropy corresponds to the direct and indirect effects of the instruments’ effects on the outcome having on average the opposite sign.

A comparison between the results of Scenarios 1–5 and Scenarios 9–13, all of which involve independent instruments, suggests that in both methods a higher value of Inline graphic, which means a higher number of strong instruments, improves power and coverage under the null. In our method, this was sufficient to bring coverage under the null into the nominal range. This did not happen with WME, although in Scenarios 9–13 WME slightly outperforms our method in terms of coverage under the alternative.

In Scenarios 14 to 17, and in both methods, the progressively increasing degree of LD between SNPs causes a marked drop in coverage and a slight drop in power. In the presence of LD, the gap in performance between the two methods is dramatic. This is unsurprising, because WME was developed with independent instruments in mind. This pattern is confirmed in Scenarios 18 to 21, where, in addition, we observe the effect of reducing the number of individuals from 500 to 300. The reduction makes power more vulnerable to presence of LD between the instruments.

Our method appears to outperform WME in terms of coverage under the null (in all scenarios), and in terms of power (in all scenarios with independent instruments).

5. Incorporating mediation

This section extends our approach to deal with two (instead of one) exposures or intermediate phenotypes, Inline graphic and Inline graphic. Within this more general setting, we shall use our method to estimate direct and indirect effects, and to combine, albeit under strong parametric assumptions, the capabilities of MR and mediation analysis.

Figure 2a portrays a problem where Inline graphic is a putative cause of Inline graphic. Suppose, we accept the assumptions represented in the figure. Suppose, we are interested in the direct causal effect of Inline graphic on Inline graphic (controlling for Inline graphic), and in the indirect effect of Inline graphic on Inline graphic (via Inline graphic). Suppose, we are also interested in the causal effects of Inline graphic on Inline graphic and of Inline graphic on Inline graphic. Let the set of instruments, Inline graphic, consist of two non-overlapping subsets, Inline graphic and Inline graphic, with Inline graphic and Inline graphic, with Inline graphic. Assume for simplicity that Inline graphic. Let Inline graphic. We elaborate (3.13.3) into:

Fig. 2.

Fig. 2.

(a) Graphical model for the class of problems discussed in Section 5, (b) Application of the graphical model to our study in Section 6.

graphic file with name M303.gif (5.1)

with Inline graphic, Inline graphic, Inline graphic, Inline graphic, Inline graphic. The causal effect of Inline graphic on Inline graphic is represented by parameter Inline graphic, whereas the direct causal effect of Inline graphic on Inline graphic (controlling for Inline graphic) is represented by parameter Inline graphic, and the indirect causal effect of Inline graphic on Inline graphic is represented by Inline graphic. The model equations satisfy Condition 2. When all components of Inline graphic and Inline graphic differ from zero, they satisfy also:

Condition 5

(sequential relevance) Each component of Inline graphic is associated with Inline graphic, conditional on the remaining instruments, and each component of Inline graphic is associated with Inline graphic, conditional on Inline graphic and the remaining instruments. This condition is formally expressed by Inline graphic for Inline graphic, and Inline graphic, for Inline graphic.

In the following, we show that, under the above conditions, and in the special case where Inline graphic, all the parameters of model (5.1) and, in particular, the causal effects Inline graphic, are identified.

First, we need to introduce the concept of “unblocked” path of a causal diagram. A path (Inline graphic sequence of adjacent edges) in a causal diagram is said to be unblocked if it involves one or more colliders (Geiger and others, 1990), i.e., if at least one pair of arrows point to a common node, Inline graphic (not the most general definition, but sufficient for our purposes).

We are now ready to show that Inline graphic) are identifiable, provided (i)Inline graphic, (ii)Inline graphic and (iii)Inline graphic. To see this, consider Figure 2a in the simple case where Inline graphic and Inline graphic are scalar. Assume all variables represented in the graph have zero mean. Then the correlation Inline graphic between two nodes of the graph, Inline graphic and Inline graphic say, is given by a sum of terms over all the unblocked paths that connect Inline graphic and Inline graphic, with each term of the sum consisting of the product of the effects along the path (Wright, 1934). By using Figure 2a and Condition (iii), we obtain Inline graphic and Inline graphic. The two equalities uniquely identify parameters Inline graphic and Inline graphic, conditional on which we may then consider the system formed by equations Inline graphic and Inline graphic, which can be solved for Inline graphic by virtue of Condition (i), as its determinant does not vanish. This means that causal parameter Inline graphic is identified. Next, note that, under Condition (iii), nodes Inline graphic and Inline graphic are connected via two unblocked paths, and Inline graphic and Inline graphic via further three unblocked paths, which leads to the two equations Inline graphic and Inline graphic. These can be solved for Inline graphic and Inline graphic, conditional on the identifiable parameters Inline graphic, because Conditions (i) and (ii) prevent the determinant of the system, Inline graphic, from vanishing, which completes the proof.

We deal with the more general situation where the Inline graphic-vectors depart from zero in the same way as in Section 3.1, i.e. by imposing on each of these vectors a sparsity prior that makes the posterior distribution of the causal effects proper. Simple averages of the MCMC samples generated from this posterior will give simulation-consistent point and interval estimates for any function of interest of parameters Inline graphic, such as, for example, the indirect effect Inline graphic exerted by Inline graphic on Inline graphic. This is illustrated in the next section.

6. Illustrative application

Past decades have witnessed an unprecedented worldwide rise in obesity. Excess body fat, as measured by body mass index (BMI Inline graphic weight in kilograms divided by the square of the height in meters) is a major risk factor for cardiovascular disease (CVD), among other disorders. The increased incidence of CVD associated with adiposity is believed to be mediated both by abnormalities in carbohydrate metabolism and by an increase in blood pressure. As far as the latter is concerned, various authors have found evidence of BMI being a causal factor for hypertension, and in this section, we shall corroborate this hypothesis by applying our MR approach to data from the general population, by using a recently proposed measure of blood pressure burden defined as the sum of the diastolic and systolic arterial pressures, hereafter, denoted as PRES (Nair, 2016). Part of our analysis is motivated by recent metabolite profiling studies, that have highlighted deviations in molecular signatures of BMI. Many of these studies compared small groups of individuals with large differences in adiposity, and it remains unclear whether those deviations are also observed in the general population. One putative molecular signature of obesity is the Inline graphic-aminoacid phenylalanine (PHE) (Jones, 1996; Droyvold and others, 2005; Shah and others, 2012; Moore and others, 2014; Wuertz and others, 2015; Hao and others, 2016). Recent research also highlights PHE as a putative mediator of the causal effect of body fat on blood pressure.

In the following analysis, we shall put these hypotheses under scrutiny by using our MR approach. We shall first use MR to assess the putative causal effect of BMI on PHE. In a subsequent stage of the analysis, we shall assess the causal effect of BMI on PRES, in terms of a direct causal effect, and of an indirect causal effect mediated by PHE.

We analyzed a dataset of 520 unrelated individuals (aged 25–74) from a population-based Finnish cohort—the DILGOM (Dietary, Lifestyle and Genetic determinants of Obesity and Metabolic Syndrome) study (Inouye and others, 2010). Each individual in this study had serum metabonome information, a genome-wide genetic profile and measures of BMI, blood pressure and sex. The eighty instruments used in the analysis, Inline graphic, were SNPs with a significant (Inline graphic) BMI marginal association, and in negligible LD (Inline graphic). These SNPs we treated as counts Inline graphic of minor alleles at the corresponding locus. Let Inline graphic (the exposure) represent the logarithm of BMI. Let Inline graphic represent the log-concentration of PHE, and Inline graphic take value Inline graphic for female and Inline graphic for male.

WME gave an estimated causal effect of log BMI on log PHE of 0.25, with a 95% confidence interval of 0.18–0.31. Our analysis based on model (3.13.3) gave a posterior mean of 0.3, with a 95% credible interval of 0.19–0.42, representing a higher degree of uncertainty about the causal effect with respect to the WME estimate.

A number of studies (Kaplan and others, 2014, see) stress the differential prognostic significance of BMI across genders. This motivated our interest in incorporating an interaction between the effects of sex and BMI on the outcome. Recall that sex is denoted as Inline graphic, with Inline graphic indicating female, and Inline graphic male. For purposes of illustration, we made the following simplifying assumptions. First, we assumed that sex is independent of the confounders Inline graphic. Second, we assumed the effect of sex on either BMI or PHE not to interact with the effect of the instrumental variants on the same variable. The latter assumption is delicate, which invites caution in the interpretation of the results. To include the interaction, we extended model (3.13.3) as follows:

graphic file with name M383.gif (6.1)

with Inline graphic. The causal effects of log BMI on log PHE are represented in the model equations by Inline graphic (in the females) and Inline graphic (in the males), with Inline graphic representing the interaction between sex and BMI. We used a horseshoe prior for Inline graphic, and uniform priors for the remaining parameters. We ran 10 000 iterations of a Markov chain, and used the values generated during the second half of the chain to compute the estimates. Parameter Inline graphic had a posterior mean of Inline graphic and a 95% credible interval of Inline graphic to Inline graphic, representing fair evidence of an interaction between BMI and sex in their causal effects on PHE. The causal effect of log BMI on log PHE had a posterior mean of 0.34 with a 95% credible interval of 0.21–0.47 in the females, and a posterior mean of 0.2 with a 95% credible interval of 0.098–0.3 in the males.

In the scatter diagrams of Figure 3, each instrumental SNP is represented by a black dot with Inline graphic-coordinate (respectively, Inline graphic-coordinate) given by the coefficient of the least-squares regression of log BMI (respectively, log PHE) on that SNP, as obtained from an analysis of the male (left plot) and female (right plot) subsamples. The linearity of the relationship in both plots provides visual evidence of a causal effect of BMI on PHE, whereas the difference between the two slopes provides evidence of that causal effect interacting with sex.

Fig. 3.

Fig. 3.

With reference to our analysis of Section 6, each Inline graphicth instrument is represented in each of these plots by a black dot with co-ordinates (Inline graphic (see Section 2 for a definition of these symbols), as obtained from an analysis of the male (left plot) and female (right plot) individuals in the sample. The slope of the regression line is the Egger regression estimate of the causal effect.

The second stage of our analysis embraced variables BMI, PHE, and PRES. Our assumptions in this analysis are depicted in Figure 2b, where the effect of BMI on PRES has two putative components: a direct one and an indirect component mediated by PHE. We analyzed the data by using model (5.1), with Inline graphic, Inline graphic and Inline graphic representing log BMI, log PHE, and PRES, respectively. We used a set of 98 instruments, Inline graphic, partitioned into a subset Inline graphic consisting of 80 BMI-associated instrumental SNPs (the same as in the preceding part of the analysis), and a subset Inline graphic, consisting of 18 instruments associated with PHE but not BMI. We assumed almost all the parameters to be a priori uniformly distributed. We sampled the model posterior distribution by running six Markov chains, of 100 000 iterations each, with initial values spanning the approximate 95% confidence intervals for Inline graphic and for the quantity Inline graphic, as obtained by a traditional MR analysis. We checked convergence of the six chains to the same posterior. The second half of each chain was used to approximate the posterior means and credible intervals for the parameters of interest. Figure 4 shows the marginal posterior distributions for the main quantities of interest. One of the plots shows the posterior distribution for the total causal effect of log BMI on PRES, Inline graphic. Figure 4 suggests that BMI might exert a causal effect on both PHE and PRES, although there appears to be little evidence of an effect of PHE on PRES. These results discredit the hypothesis of PHE acting as a mediator of the deleterious effect of body mass on blood pressure. The total effect of log BMI on PRES, represented by parameter Inline graphic, was re-estimated in the traditional way, by using the instruments contained in Inline graphic. This yielded an estimated total effect of 32.3, and a 95% confidence interval of 19.1–46.6, which corresponds to a lower uncertainty relative to the estimate obtained by our method.

Fig. 4.

Fig. 4.

This figure summarizes results from our analysis of the illustrative problem of Section 6. Shown in this figure are the posterior distributions for key parameters of model (5.1), as obtained by applying the model to the DILGOM data, under the assumptions of Figure 2b. Parameter Inline graphic represents the controlled direct effect of BMI on PRES, controlling for PHE. Parameter Inline graphic represents the direct effect of BMI on PHE, and Inline graphic represents the causal effect of PHE on PRES. Also included are the posterior distributions for two nonlinear functions of the above parameters, namely Inline graphic, which represents the indirect component of the effect of BMI on PRES, mediated by PHE, and Inline graphic, which represents the total effect of BMI on PRES. From a substantive point of view, these results can be interpreted to provide evidence of a causal effect of the body mass index on blood pressure and phenylalanine concentration, but no evidence that this latter influences blood pressure.

In consideration of the relatively small size of the sample, and of the cross-sectional nature of the study, the results of our analysis deserve future independent validation.

7. Discussion

Thanks to its holistic approach to uncertainty, a Bayesian approach to MR may represent a safeguard from over-optimistic conclusions. The results of our simulation study are consistent with this expectation, while also suggesting that our method behaves well in the presence of moderate LD between the variants—a welcome feature when the choice of the instruments is confined to a narrow region of DNA.

Much work remains to be done. It might be interesting to assess the extent to which our approach can mimic existing frequentist methods, such as the one proposed by Kang and others (2016) and further elaborated by Windmeijer and others (2016), where LASSO-type procedures are used to identify the valid instruments from within a set of candidate variables.

A variety of future developments of the approach are envisaged. One of these is to incorporate advances in Bayesian sparsity modeling, for example, in relation to the design of shrinkage priors that deal with high-dimensional vectors of possibly correlated instruments. Of equal importance is to extend the method to deal with nonlinearities and selection effects, and perhaps to incorporate principles of Bayesian model averaging. Such efforts will encounter theoretical difficulties, such as problems of collapsibility of the causal effect parameters. Finally, we may use our framework in a simulation mode, for generating extended datasets from limited data, for purposes of power calculation.

8. Software

R software to implement analyses by means of the proposed method is available from Github (https://github.com/carloberzuini/BMR).

Acknowledgments

We thank Stijn Vansteelandt for his insight into the mediation analysis aspects of the approach. Our analysis of the DILGOM data has benefited from discussions with Drs. Xiaoguang Xu and Susana Conde. Conflict of Interest: None declared.

Funding

Carlo Berzuini, Hui Guo and Luisa Bernardinelli were supported by the European Union within the Seventh Framework Programme FP7-Health-2012-INNOVATION [305280 to C.B.]. The DILGOM data resource exploited in Section 6 has been funded by the Sigrid Juselius and Yrjõ Jahnsson Foundations and by the Finnish Academy [255935 and 269517]. Stephen Burgess is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society [204623/Z/16/Z] and by core funding to the Medical Research Council Biostatistics Unit [MC_UU_00002/7].

References

  1. Berzuini C., Dawid A. P. and Bernardinelli L. (editors) (2012). Causality: Statistical Perspectives and Applications. Chichester: John Wiley and Sons. [Google Scholar]
  2. Bowden J., Davey Smith G., Haycock P. C. and Burgess S. (2016). Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genetic Epidemiology 40, 304–314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Burgess S. and Thompson S. G. (2010). Bayesian methods for meta-analysis of causal relationships estimated using genetic instrumental variables. Statistics in Medicine 29, 1298–1311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Burgess S. and Thompson S. (2012). Improving bias and coverage in instrumental variable analysis with weak instruments for continuous and binary outcomes. Statistics in Medicine 31, 1582–1600. [DOI] [PubMed] [Google Scholar]
  5. Carvalho C. M., Polson N. G. and Scott J. G. (2010). The horseshoe estimator for sparse signals. Biometrika 97, 465–480. [Google Scholar]
  6. Davey Smith G. and Ebrahim S. (2003). Mendelian randomization: can genetic epidemiology contribute to understanding environmental determinants of disease? International Journal of Epidemiology 32, 1–22. [DOI] [PubMed] [Google Scholar]
  7. Dawid A. P. (1979). Conditional independence in statistical theory (with Discussion). Journal of the Royal Statistical Society B 41, 1–31. [Google Scholar]
  8. Dawid A. P. (2000). Causal inference without counterfactuals (with Discussion). Journal of the American Statistical Association 95, 407–448. [Google Scholar]
  9. Didelez V. and Sheehan N. A. (2007). Mendelian randomisation as an instrumental variable approach to causal inference. Statistical Methods in Medical Research 16, 309–330. [DOI] [PubMed] [Google Scholar]
  10. Didelez V. and Sheehan N. A. (2008). Mendelian Randomisation: Why Epidemiology Needs a Formal Language for Causality, Volume 5 London: College Publications. [Google Scholar]
  11. Droyvold W. B., Midthjell K., Nilsen T. I. L. and Holmen J. (2005). Change in body mass index and its impact on blood pressure: a prospective population study. International Journal of Obesity 29, 650–655. [DOI] [PubMed] [Google Scholar]
  12. Egger M., Smith G. D., Schneider M. and Minder C. (1997). Bias in meta-analysis detected by a simple, graphical test. British Medical Journal 315, 629–634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Geiger D., Verma T. and Pearl J. (1990). Identifying independence in Bayesian networks. Networks 20, 507–534. [Google Scholar]
  14. Han C. (2008). Detecting invalid instruments using L1-GMM. Economics Letters 101, 285–287. [Google Scholar]
  15. Hao Y., Wang Y., Xi L., Li G., Zhao F., Qi Y., Liu J. and Zhao D. (2016). A nested case-control study of association between metabolome and hypertension risk. BioMed Research International 10.1155/2016/7646979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Inouye M., Kettunen J., Soininen P., Silander K., Ripatti S., Kumpula L. S., Hämäläinen E., Jousilahti P., Kangas A. J., Männistö S.. and others (2010). Metabonomic, transcriptomic, and genomic variation of a population cohort. Molecular Systems Biology 6, 502–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Jones D. W. (1996). Body weight and blood pressure. Effects of weight reduction on hypertension. American Journal of Hypertension 9, 50–54. [DOI] [PubMed] [Google Scholar]
  18. Jones E. M., Thompson J. R., Didelez V. and Sheehan N. A. (2012). On the choice of parameterisation and priors for the Bayesian analyses of Mendelian randomisation studies. Statistics in Medicine 31, 1483–1501. [DOI] [PubMed] [Google Scholar]
  19. Kang H., Zhang A., Cai T. and Small D. S. (2016). Instrumental variables estimation with some invalid instruments and its application to Mendelian randomization. Journal of the American Statistical Association 111, 132–144. [Google Scholar]
  20. Kaplan R. C., Avilés-Santa M. L., Parrinello C. M., Hanna D. B., Jung M., Castañeda S. F., Hankinson A. L., Isasi C. R., Birnbaum-Weitzman O., Kim R. S.. and others (2014). Body Mass Index, Sex, and Cardiovascular Disease Risk Factors Among Hispanic/Latino Adults. Journal of the American Heart Association 3, 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Katan M. B. (1986). Apoupoprotein E isoforms, serum cholesterol, and cancer. The Lancet 327(8479), 507–508. [DOI] [PubMed] [Google Scholar]
  22. Kolesar M., Chetty R., Friedman J., Glaeser E. and Imbens G. (2014). Identification and inference with many invalid instruments. Journal of Business and Economic Statistics 3, 1–11. [Google Scholar]
  23. Lawlor D. A, Harbord R. M., Sterne J. A., Timpson N. and Davey Smith G. (2008). Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Statistics in Medicine 27, 1133–1163. [DOI] [PubMed] [Google Scholar]
  24. Metropolis N., Rosenbluth A., Rosenbluth M., Teller M. and Teller E. (1953). Equations of state calculations by fast computing machines. Journal of Chemical Physics 21, 1087–1092. [Google Scholar]
  25. Moore S., E. Matthews C., Sampson J., Stolzenberg-Solomon R., Zheng W., Cai Q., Ting Tan Y., Chow W.-H., Ji B.-T., Ke Liu D.. and others (2014). Human metabolic correlates of body mass index. Metabolomics 10, 259–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Nair T. (2016). Systolic and diastolic blood pressure: do we add or subtract to estimate the blood pressure burden? Hypertension Journal 2, 22–24. [Google Scholar]
  27. Neal R. (2011). MCMC using Hamiltonian dynamics. In: Brooks S., Gelman A., Jones G. L. and Meng X. L. (editors), Handbook of Markov Chain Monte Carlo. New York: Chapman and Hall/CRC, pp. 116–162. [Google Scholar]
  28. Nelson C. and Startz R. (1990). The distribution of the instrumental variables estimator and its t-ratio when the instrument is a poor one. The Journal of Business 63, 125–140. [Google Scholar]
  29. Pickrell J., Berisa T., Segurel L., Tung J. Y. and Hinds D. (2015). Detection and interpretation of shared genetic influences on 40 human traits. Technical Report, bioRxiv2015, 10.1101/019885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Robinson P. C., Choi H. K., Do R. and Merriman T. R. (2017). Insight into rheumatological cause and effect through the use of Mendelian randomization. Nature Reviews Rheumatology 13, 486–496. [DOI] [PubMed] [Google Scholar]
  31. Shah S. H., Kraus W. E. and Newgard C. B. (2012). Metabolomic profiling for the identification of novel biomarkers and mechanisms related to common cardiovascular diseases: form and function. Circulation 126, 1110–1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. STAN Development Team. (2014). STAN: A C++ library for probability and sampling, version 2.2. http://mc-stan.org/.
  33. Voight B. F., Ardissino D.. and colleagues (2012). Plasma HDL cholesterol and risk of myocardial infarction: a Mendelian randomisation study. Lancet 380, 572–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Wainwright M. J. and Jordan M. I. (2008). Graphical models, exponential families, and variational inference. Foundations and Trends in Machine Learning 1, 1–305. [Google Scholar]
  35. Windmeijer F., Farbmacher H., Davies N. and Davey Smith G. (2016). On the use of the Lasso for Instrumental Variables Estimation with Some Invalid Instruments. Bristol Economics Discussion Papers. Department of Economics, University of Bristol, UK. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Wright S. (1934). The method of path coefficients. Annals of Mathematical Statistics 5, 161–215. [Google Scholar]
  37. Würtz P., Havulinna A. S., Soininen P., Tynkkynen T., Prieto-Merino D., Tillin T., Ghorbani A., Artati A., Wang Q., Tiainen M.. and others (2015). Metabolite profiling and cardiovascular event risk a prospective study of three population-based cohorts. Circulation 131, 774–785. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES