Skip to main content
International Journal of Epidemiology logoLink to International Journal of Epidemiology
. 2022 Jul 15;51(6):1899–1909. doi: 10.1093/ije/dyac136

Interpretation of Mendelian randomization using a single measure of an exposure that varies over time

Tim T Morris 1,2,, Jon Heron 3,4, Eleanor C M Sanderson 5,6, George Davey Smith 7,8, Vanessa Didelez 9,10, Kate Tilling 11,12
PMCID: PMC9749705  PMID: 35848950

Abstract

Background

Mendelian randomization (MR) is a powerful tool through which the causal effects of modifiable exposures on outcomes can be estimated from observational data. Most exposures vary throughout the life course, but MR is commonly applied to one measurement of an exposure (e.g. weight measured once between ages 40 and 60 years). It has been argued that MR provides biased causal effect estimates when applied to one measure of an exposure that varies over time.

Methods

We propose an approach that emphasizes the liability that causes the entire exposure trajectory. We demonstrate this approach using simulations and an applied example.

Results

We show that rather than estimating the direct or total causal effect of changing the exposure value at a given time, MR estimates the causal effect of changing the underlying liability for the exposure, scaled to the effect of the liability on the exposure at that time. As such, results from MR conducted at different time points are expected to differ (unless the effect of the liability on exposure is constant over time), as we illustrate by estimating the effect of body mass index measured at different ages on systolic blood pressure.

Conclusion

Univariable MR results should not be interpreted as time-point-specific direct or total causal effects, but as the effect of changing the liability for the exposure. Estimates of how the effects of a genetic variant on an exposure vary over time, together with biological knowledge that provides evidence regarding likely effective exposure periods, are required to interpret time-point-specific causal effects.

Keywords: Mendelian randomization, causal inference, longitudinal, simulation


Key Messages.

  • Many exposures of interest vary over time, yet Mendelian randomization (MR) is commonly applied to one measurement of an exposure.

  • MR does not estimate direct or total causal effect of changing an exposure value at a given time.

  • MR estimates the causal effect of changing the exposure liability that gives rise to an exposure value at a given time.

  • MR results from time-varying exposures should be interpreted with respect to the underlying liability for an exposure.

Introduction

Mendelian randomization

Mendelian randomization (MR) is a powerful tool through which the causal effects of modifiable exposures (risk factors) can be estimated from observational data under assumptions that in some circumstances may be more plausible than the unmeasured confounding and no measurement error assumptions required by conventional methods.1 MR is generally implemented within an instrumental variables (IV) framework that exploits the randomization inherent in the allocation of genotypes at conception and gamete cell formation, using this random variation in alleles to instrument differences in observed exposures between individuals.2 Reverse and residual confounding is reduced because the formation of the genotype occurs prior to the phenotypic development and is generally not related to environmental factors.3,4

Three assumptions are required for MR analyses to test the null hypothesis that an exposure X does not cause an outcome Y for any individuals. These are (i) relevance: that genotype is associated with the exposure of interest; (ii) independence: that there is no common cause of genotype and outcome; (iii) exclusion: that genotype does not affect the outcome through any path other than the exposure.5,6 In order to estimate an average treatment effect, extra assumptions are needed: first, that a difference in average exposure between populations with and without a risk allele would result in the same difference in outcome as if an environmental factor increased average exposure in a population by the same amount (gene–environment equivalence) and, second, that the structural model relating the exposure and outcome is linear and additive with a homogeneous effect of the exposure on the outcome.7–9

MR studies have largely leveraged information from a single measurement of the exposure and outcome, often due to limited availability of repeatedly measured data. Many exposures of interest vary over time,10,11 being subject to both between-individual and within-individual variation. Within-individual variation may be largely a function of measurement error (e.g. height in adulthood12), longitudinal within-individual phenotypic variability (BMI13), monotonic change (myopia14) or, likely, a mixture of these. Time-varying genetic associations have been reported for a range of phenotypes,15–21 thus consistent effect sizes may not be estimated when applying MR to exposures measured at different time points across the lifecourse, regardless of sampling variation and measurement error.18,22

MR applied to one measure of an exposure that varies over time

It has long been recognized that MR estimates relate to exposures that generally act over a considerable period of time, often since birth.1,23 Many single-nucleotide polymorphism (SNP)-phenotype associations have a consistent direction of effect and similar effect sizes throughout the lifecourse, though this pattern is not uniform.24 More recently, it has been questioned how appropriate MR is when applied to exposures that vary over time.25–27 Labrecque and Swanson propose one possible definition of a lifetime effect that might be of interest: the effect of increasing the exposure by one unit at each time point throughout the lifecourse.25 In order to estimate this effect using MR, the association between genotype and exposure needs to be constant over time, giving rise to parallel exposure trajectories (a one-unit change in the genotype will give rise to the same difference in exposure at time t as at time t + 1). They demonstrated that estimates of this causal effect from MR differ over time in the presence of time-varying genotype–exposure associations, concluding that MR provides a biased estimate of the causal effect of increasing the exposure by one unit at each time point. Concerns have also been raised that MR with time-varying exposures may be biased if a feedback mechanism exists in which genetic factors influence predisposition to an outcome, which in turn influences the exposure at a subsequent time point.26 For example, where instruments for coronary heart disease (CHD) relate to C-reactive protein (CRP) because the instruments for CHD relate to developing atheroma, which in turn increases CRP.

We propose an approach that uses MR to assess the effect on the outcome of the change in the entire exposure history that would be induced by a change in genotype. That is, we are not estimating the causal effect of an exposure as it manifests at a given time point, but the effect of the underlying exposure liability. That is, we assume that there is some unobserved (latent) variable L, which is caused by the genotype G, and in turn causes the exposure at every instance across the lifecourse. Although the effect of liability on outcome is the estimand of interest, the liability is unobserved, so we must estimate its effect via the measured exposure(s). Here, we consider the simplified case with one genetic instrument (G), a time-varying continuous exposure that only occurs on two occasions (X0 and X1), one of which is measured, an outcome that occurs at one time point (Y) and an unmeasured confounder U (Figure 1). Thus, a change in genotype changes L, which changes both X0 and X1. The case in which X occurs in continuous time is described in the Supplementary text (available as Supplementary data at IJE online).

Figure 1.

Figure 1

Causal diagram showing two exposures and one outcome

G , genetic instrument; L, liability; X0, exposure measured at time 0; X1, exposure measured at time 1; Y, outcome; U, confounder. Other sources of variability in the liability, exposures and outcome are not shown in this diagram. There is a problem of under-identification here in that the direct effects of X0 or X1 on Y cannot be estimated with a single liability (L).

Our approach overcomes two problems with the interpretation of MR with time-varying exposures. First, if G changes, both X0 and X1 must be changed together; a one-unit change in G (e.g. an increase of one risk allele) cannot change one of the exposures in isolation. Where time-varying genetic effects exist, the change in genotype G required to raise a given exposure trajectory by one unit at time t (e.g. raising the weight by 1 kg at birth) may be quite different to the change in genotype required to change the exposure by one unit at time t + k (e.g. raising the weight by 1 kg at age 50 years). Second, a one-unit change in G cannot have an arbitrary effect on the exposure trajectory (e.g. increasing exposure by exactly one unit at all times). Thus, univariable MR with one genetic instrument that acts on exposure over a period of time cannot be used to recover the effect of a change in exposure at a single time, nor of any arbitrary change to the trajectory of exposure. Instead, we argue that MR with a time-varying continuous exposure can be used to examine the effect of a specific change in the trajectory of that exposure, depending on how the genotype impacts the trajectory. Here, the effect refers to the liability L, i.e. we are estimating the effect on the outcome of changing L. It should be noted that there will be no information about the shape of the typical trajectory from a study with only one measurement of a time-varying exposure taken at the same time point for all participants. In this case external information would be needed (e.g. from a separate longitudinal study).

This interpretation of the MR estimate can be clearly seen from the terms in the Wald estimator. Here, the numerator is the effect of a change in SNP on the outcome. The denominator is the effect of a change in SNP on the exposure at a given time point. This denominator is equal to the effect of a one-unit change in SNP on the liability, multiplied by the total effect of the liability on the exposure at the given time point. Thus, the Wald estimate is the effect of a change in the liability on the outcome (effect of SNP on outcome divided by effect of SNP on exposure liability), scaled such that the liability causes a one-unit change in exposure at that time point.

For simplicity, our example has just one SNP causing L. However, L may be instrumented by multiple SNPs. The emphasis here is that the effects of X0 and X1 cannot be separated in the case where our instrument(s) act through a 1D liability (L). The effects of X0 and X1 could potentially be separately estimated within a multivariable MR framework if two or more different liabilities have been identified that have different effects on X0 and X1, and if there were really only two exposures.28 In more realistic scenarios, however, it is unlikely that the exposure will only occur at a small number of time points—it is likely to occur over continuous time (see Supplementary material, available as Supplementary data at IJE online).

In this paper, we clarify the causal quantities that are estimated by MR when applied to time-varying exposures with time-varying genetic effects assuming a liability structure and how they should be interpreted.

Methods

Effects of interest

We define two estimands of interest: the total effect of a one-unit change in an exposure Xk (i.e. exposure X measured at a specific time point tk) on an outcome Y (βTk); and a ‘liability effect’—the causal effect on Y of a change in the liability L, such that Xk increases in expectation by one unit (βLk). We derive algebraic expressions for these estimands under a linear model in the case of a time-varying exposure that occurs at two time points and one outcome, with more general derivations given in the Supplementary material (available as Supplementary data at IJE online).

Total effect

We define βTk to be the total effect of Xk on the outcome Y, i.e. the change in Y from increasing Xk by one unit. This includes the direct effect of Xk on Y, and the indirect effect via the effect of Xk on subsequent occurrences of the exposure Xm where m > k.

Lifetime liability effect for a specific liability

We define the liability effect (βL) as the causal effect on Y of increasing the liability L by one unit. We define the liability effect at time k (βLk) as the causal effect of changing the liability L such that the expected value of the exposure measured at time k is increased by one unit. This can be thought of as the effect of moving all individuals from the liability L giving rise to E(X) = x at time k, to a liability L1 that would give rise to E(X) = x + 1 at time k.

We now derive expressions for the total and liability causal effect in the situation with an outcome Y that is caused by a genetically influenced liability L for an exposure X occurring at two time points (X0 and X1) (Figure 2). The genetic instrument G can have a non-linear relationship with the underlying liability L, but for simplicity we assume here that L is a linear function of G and U. We assume linearity and additivity from L to the exposure measurements Xk and from exposures to outcome Y. The effect of L on exposure measures is allowed to change with time/age, thus the shape of the trajectory of X with age can be non-linear.

Figure 2.

Figure 2

Directed acyclic graph showing the exposure liability in the context of two exposures and one outcome

G , genetic instrument; L, liability; X0, exposure measured at time 0; X1, exposure measured at time 1; Y, outcome. Note that confounders or other causes of variability in liability, exposures or outcome are not shown.

We assume (Figure 2) that:

L=γ1G+αLU+εL
X0=γ2L+α0U+ε0
X1=γ3L+γ5X0+α1U+ε1
Y=γ6X1+γ4X0+αYU+εY

where all errors (εL,ε0, ε1 and εY) are distributed with mean zero and are mutually independent.

The total effect of a one-unit change in X0 on Y (βT0) is given by:

βT0=γ4+γ5γ6

The total effect of a one-unit change in X1 on Y is given by:

βT1=γ6

The liability effect is the causal effect of a one-unit increase in liability, which is given by:

βL=γ2γ4+γ2γ5γ6+γ3γ6

Turning to the liability effect at time 0, βL0 (the effect of increasing the liability such that X0 increases in expectation by one unit), a one-unit increase in E(X0) occurs because there is an increase in L from l10 to L= l10+1γ2.

If L = l10, then:

EYdol10=y00=l10γ2γ4+γ2γ5γ6+γ3γ6

If L=l10+1γ2, then:

E(Ydol10+1γ2)=y10=l10+1γ2γ2γ4+γ2γ5γ6+γ3γ6

The effect on Y of changing the liability L such that it raises X0 by one unit is therefore given by:

βL0=y10-y00=γ2γ4+γ2γ5γ6+γ3γ6γ2 (1)

A one-unit increase in expectation in X1 would occur because there is an increase in L from l11 to l11+1γ2γ5+γ3

If L=l11 then

EYdol11=y01=l11γ2γ4+γ2γ5γ6+γ3γ6

If L=l11+1γ2γ5+γ3 then

E(Ydol11+1γ2γ5+γ3)=y11=l11+1γ2γ5+γ3γ2γ4+γ2γ5γ6+γ3γ6

The effect on Y of changing L such that X1 is increased by one unit in expectation is given by:

βL1=y11-y01=γ2γ4+γ2γ5γ6+γ3γ6γ2γ5+γ3 (2)

MR

To investigate what the Wald Ratio (β^MRk) using Xk as exposure is estimating in our setting, we need to calculate the effect of G on Y, and the effect of G on Xk. In our example, we only have two occurrences of the exposure, so k =0 or 1. For examples with the exposure in continuous time, see the Supplementary material (available as Supplementary data at IJE online). We now derive the expressions for what is estimated with exposures X0 and X1 using the usual Wald Ratio.

The effect of G on Y is:

βGY=γ1γ2γ4+γ2γ5γ6+γ3γ6 (3)

The effect of G on X0 is:

βGX0=γ1γ2 (4)

The effect of G on X1 is:

βGX1=γ1γ2γ4+γ2γ5+γ1γ3 (5)

The Wald Ratio MR estimand with X0 as a single exposure is given by Equation (3)/Equation (4):

βMR0=γ1γ2γ4+γ2γ5γ6+γ3γ6γ1γ2=(γ2γ4+γ2γ5γ6+γ3γ6)(γ2) (6)

The Wald Ratio MR estimand βMR0 in Equation (6) is equal to the effect on Y of changing the liability L such that it raises X0 by one unit in expectation in Equation (1), and hence estimates βL0, the liability effect of X0 on Y.

The Wald Ratio MR estimand with X1 as a single exposure is given by Equation (3)/Equation (5):

βMR1=γ1γ2γ4+γ2γ5γ6+γ3γ6γ1γ2γ5+γ3=γ2γ4+γ2γ5γ6+γ3γ6γ2γ5+γ3 (7)

The Wald Ratio MR estimand in Equation (7) is equal to the effect on Y of changing the liability L resulting in a one-unit change in X1 in Equation (2), and hence estimates βL1 the liability effect of X1 on Y. Importantly, both liability effects include effects through the other exposure X.

MR with a single liability L can therefore only examine whether there is evidence for a causal effect of some measure of the exposure (at some time point in the period in which the liability L operates) on the outcome, not which part of the exposure trajectory is causal. This is because the liability L affects all the exposures jointly in the period in which it operates, so estimating the effect on Y of changing the liability does not give information about which of the exposures caused by the liability cause(s) the outcome. It does not matter whether genotype–exposure associations are time-varying or time-invariant; the null hypothesis tested using MR (which does not require the parametric assumptions, only the structural assumptions) is that the liability L does not cause the outcome, i.e. there is no part of the trajectory that causes the outcome. If the liability does not cause the outcome, a null effect will be correctly detected using MR.6 The Wald Ratio MR estimand based on the single exposure measurement Xk, the liability effect, is the effect of increasing L such that Xk increases by one unit without holding the other X fixed. We extend this to an outcome measured at multiple time points in the Supplementary material (available as Supplementary data at IJE online).

Simulation approach

We describe our simulation approach within the ADEMP framework.29

(A)ims

The aim of the simulation was to illustrate the causal effect estimates described above.

(D)ata-generating mechanisms

We simulated data for 10 000 hypothetical individuals (nobs = 10000), representing a genotyped cohort sample with exposure occurring at two time points (t0,t1). Let G represent the genotype of individuals simulated as a single variant (effect alleles=0,1,2) with minor allele frequency set to 0.2 and genotype drawn from this with a binomial distribution. We simulate a liability L underlying a time-varying exposure (Xk) that occurs on two occasions k (k =0,1), an outcome that occurs once (Y) and a time-invariant confounder (U) of exposure and outcome variables. Random measurement error was simulated for all variables except the genetic instrument. Associations of the exposure and outcome with the unobserved confounder (α0, α1, αY) were set at 0.3. For simplicity, the association between the liability and the confounder (αL) was set to 0. Base parameters were set as follows: γ1 =0.5 γ2 =0.5; γ3 =0.5; γ4 =0.4; γ5 =0.3; and γ6 =0.4 (Figures 2 and 3). One by one all base parameters except γ1 were set to zero to investigate the change in coefficient estimated. This allowed us to interrogate differential (i) strength of the genetic instrument; (ii) time-varying genetic associations; (iii) exposure effects on the outcome(s); and (iv) confounding effects. Note that the value of the liability effects for the exposures will not remain constant but will change depending on the base parameters. Results are presented for 1000 replications of each simulation. All data were generated within Stata. The programme code used to run the simulations is available at https://github.com/timtmorris/time-varying-MR and can be used to vary all parameters.

Figure 3.

Figure 3

Simulated parameters

G, genotype; L , liability; X0, exposure measured at time 0; X1, exposure measured at time 1; Y, outcome; U, confounder.

(E)stimands

We estimate the liability effect βLk, the effect of increasing L such that Xk increases by one unit, on Y by MR using the Wald Ratio and the standard error (SE) of this estimate in our simulations.

(M)odel

We assume the linear structural equation model in Figure 3.

(P)erformance measures

We used three performance measures to assess the estimates in our simulations: the mean of the estimates of the liability effects for X0 and X1, the mean of the SEs of the liability effects for X0 and X1 across 1000 replications, and the deviation (and SE) of the estimates of the liability effects for X0 and X1 from their expectation given the model parameters.

Results

Simulations

Simulations illustrate that the Wald Ratio MR estimator correctly recovers the liability causal effect in all scenarios with a time-varying exposure, even where time-varying genetic associations existed (Table 1). The estimate of the liability effect of X0 on Y is different to that of X1 on Y. This is because MR is estimating the effect of L on Y rather than the effect of Xk itself, i.e. the change in L required to raise X0 by one unit is 2 (=1/0.5), whereas the change in L required to raise X1 by one unit is 1.54 (=1/0.65). Non-zero estimates are recovered for the liability effect of X0 on Y even when there is no direct path from X0 to Y (i.e. γ4 = 0). This non-zero coefficient arises because MR estimates the causal effect of changing the liability such that the exposure measured at time t0 is one unit higher, not the effect of a one-unit change in X0 whilst holding X1 constant. Similarly, there is a non-zero estimate for the liability effect of X1 on Y even if there is no direct effect from X1 to Y (i.e. γ6 = 0), because the change in liability causing the change in X1 will have also changed X0. It does not matter when the exposures are measured with respect to the outcome (provided that earlier exposures influence the outcome); non-zero effects of X1 on Y will be correctly estimated even if the exposure is measured after the outcome. Note that the MR estimates in Table 1 do not correspond to the estimates of the direct effects of X0 or X1 on Y except for the cases in which the liability has no effect on X0 or either X0 or X1 has no direct effect on Y. Where there are more than two exposure measurements, all direct effects on the outcome except for the exposure under consideration would have to be zero for MR to provide an unbiased estimate of the direct effect of that exposure.

Table 1.

Estimates, standard errors and bias of the liability effect of a time-varying exposure on an outcome using Mendelian randomization given the parameters in Figure 3

Parameter values Liability effect of:
X0 on Y  X1 on Y
[estimate (SE)] [estimate (SE)]
[Bias (SE)] [Bias (SE)]
Base parameters in directed acyclic graph 0.92 (0.042) 0.71 (0.031)
γ2=0 –6.93 (57 811.605) 0.4 (0.041)
7.448 (24.217) 0.001 (0.001)
γ3=0 0.52 (0.041) 1.76 (0.207)
–0.001 (0.001) –0.026 (0.006)
γ4=0 0.52 (0.042) 0.4 (0.028)
0.002 (0.001) –0.001 (0.001)
γ5=0 0.8 (0.042) 0.8 (0.042)
0 (0.001) 0.001 (0.001)
γ6=0 0.4 (0.037) 0.31 (0.031)
0.002 (0.001) 0.001 (0.001)
U=0 0.92 (0.041) 0.71 (0.03)
0 (0.001) 0 (0.001)

Bias presented as ‘0.000’ where –0.001 < mean bias < 0.001. SE, standard error.

MR recovers liability effects even where simulations are extended to include outcome–exposure feedback effects or reverse confounding (Supplementary material, available as Supplementary data at IJE online). Where the liability does not cause exposure during a specific time period (e.g. a genotype may only cause weight gain after puberty), weak instrument bias may affect estimates of the effect of the change in liability required to increase exposure during that period by one unit (Table 1, where γ2 = 0).30 This bias is smaller for later measures of exposure (Table 1, where γ3 = 0) because genetic effects here can operate via earlier occurrences.

Cross-sectional total effects estimated using linear regression are biased even where unobserved confounding from U is absent due to confounding by the liability L that underlies the repeat measures of exposure (Supplementary material, available as Supplementary data at IJE online).

MR of body mass index measured at different ages on systolic blood pressure

We used two-sample MR to estimate the liability effect of body mass index (BMI) at different ages on systolic blood pressure (SBP) using the SNP rs9939609 located in the fat mass and obesity-associated gene (FTO). Note that this single SNP approach prohibited standard two-sample MR sensitivity analyses but provided a suitable proof of concept. We estimated FTO–BMI associations from a study using data from the 1958 National Survey of Health and Development British cohort in which BMI was measured on 11 occasions between ages 2 and 53years (n = 2479) by Hardy et al.21 We estimated FTO–hypertension associations from a study of Danish individuals in the Copenhagen General Population Study with mean age 57.6years [standard deviation (SD): 13.49] by Timpson et al. (n = 37 027), thus ensuring no sample overlap.31 All associations were consistent with the study of individuals in the Rotterdam Study (n = 5123) by Labrecque and Swanson.25

MR results from these SNP–exposure and SNP–outcome associations varied greatly depending on when the exposure was measured (Table 2). This variation in results does not invalidate MR,25 but is expected because the effect of genotype on exposure varied over age (Figure 4). The interpretation of the MR estimate is with respect to the underlying liability. So, from Table 2, the effect of changing the liability such that BMI increases by one SD unit in expectation at age 11 years would be to increase mid-life blood pressure by 6.08 mmHg (SE: 2.32 mmHg). The effect of changing the liability such that BMI increases in expectation by one SD unit at age 53 years would be to increase mid-life blood pressure by 12.78 mmHg (SE: 7.77 mmHg). Although these are different, their consistency can be verified by examining Figure 4, and the effect of genotype on BMI at different ages shown in Table 2—the effect of a one-unit change in genotype (which would equate to a change in liability) on measured BMI z-score is twice as large at age 11 years as at age 53 years.

Table 2.

Results from Mendelian randomization with time-varying exposures for the causal effect of body mass index z-score on systolic blood pressure using rs9939609

Age (years) SNP–exposure association (SE) MR estimate (SE) Association of G with a one-unit SD difference in BMI (SE) Liability exposure difference equivalent to a one-unit SD difference in exposure at age 20 years (SE)
2 0.013 (0.033) 49.56 (127.47) 78.7 (201.4) 0.09 (0.23)
4 0.019 (0.030) 32.69 (51.68) 51.9 (81.1) 0.14 (0.22)
6 0.0035 (0.031) 178.63 (1546.5) 283.5 (2453.9) 0.025 (0.22)
7 0.043 (0.030) 14.79 (11.18) 23.5 (16.8) 0.30 (0.23)
11 0.104 (0.031) 6.08 (2.32) 9.7 (2.85) 0.73 (0.27)
15 0.107 (0.031) 5.88 (2.23) 9.3 (2.71) 0.76 (0.28)
20 0.141 (0.031) 4.46 (1.46) 7.1 (1.56) Reference
26 0.097 (0.030) 6.46 (2.52) 10.3 (3.13) 0.69 (0.26)
36 0.070 (0.029) 9.0 (4.30) 14.3 (5.87) 0.50 (0.23)
43 0.043 (0.028) 14.73 (10.30) 23.4 (15.33) 0.30 (0.21)
53 0.049 (0.027) 12.78 (7.77) 20.3 (11.31) 0.35 (0.21)

SNP–exposure associations taken from Hardy et al.21; SNP–outcome association taken from Timpson et al.31 Standard errors for ratio estimates were computed using the formula in Burgess et al.32 ignoring covariance between SNP–exposure effects at different ages.

SNP, single-nucleotide polymorphism; SE, standard error; MR, Mendelian randomization; BMI, body mass index. The SNP–outcome association from Timpson et al.31 was 0.63 (SE: 0.153).

Figure 4.

Figure 4

Linear prediction of mean body mass index at different ages from additive genetic models by FTO rs9939609 genotype

FTO, fat mass and obesity-associated gene. AA: two risk variant; AT: one risk variant; TT: zero risk variants.

MR estimates obtained on different occasions can be converted to the same liability scale provided that SNP–exposure associations on these occasions can be estimated. This conversion can be made by multiplying the Wald Ratio estimate by the SNP–exposure association at its age divided by the SNP association at another (target) age. Taking FTO–BMI and FTO–hypertension associations from the Hardy et al. and the Timpson et al. studies (Table 2), multiplying the age 53 years estimate (12.78 mmHg per SD of BMI) by the SNP–exposure association at age 20 years (0.1412) divided by the SNP–exposure association at age 53 years (0.0493) gives us the liability effect of BMI on SBP at age 20 years of 4.46 4.46 = 12.78 × 0.04930.1412. The third column of Table 2 shows the association of G with a one-unit SD change in BMI at each age. This varies due to variation in SNP–exposure associations at different ages. The last column shows the exposure difference at specific ages that corresponds to a liability-induced increase in BMI of one SD unit at age 20 years. This information helps to interpret the differing MR estimates and could be used to conduct a Genomewide Association Study (GWAS) meta-analysis in which the exposure was measured at different ages in different studies.

Discussion

Here we have clarified that MR (using the Wald Ratio estimator) using only one measure of an exposure that varies over time estimates the causal effect of the liability that underlies the exposure. That is, MR applied to exposures that vary over time estimates the causal effect of the underlying liability rather than the causal effect of the exposure as it manifests at a given measurement occasion. Although the effect of liability on outcome is the estimand that can be estimated using MR, as the genotype acts on the liability rather than a specific exposure, the liability is unobserved, so we must estimate its effect via the measured exposures. The MR estimate of the liability effect does not require time-invariant genotype–exposure associations under the assumptions that the structural model is linear and additive, providing that the instruments are valid instruments for the underlying liability.11,25 There is also no assumption that the liability L should have the same direction of effect on exposure at all time points.33 We do not suggest that the construct of a 1D liability is suitable for every situation with time-varying exposure, but when it is, then the present paper gives an interpretation.

We have demonstrated that MR estimates the causal effect of a change in liability L that results in an expected one-unit change in exposure Xt. MR with a single genetic proxy of liability cannot in general be used to infer the direct or total effect of an exposure at a specific point in time, or to draw inferences about changes in exposure trajectories different to the one that L causes (e.g. the effect of increasing exposure by one unit at all time points).25 Results from MR conducted at different exposure time points will necessarily differ where time-varying gene–exposure associations exist. However, this does not invalidate MR as previously argued,25 but highlights that it is testing the effect of the liability L on outcome.34 Estimation of time-specific causal effects (i.e. what is the direct or total effect of the exposure at a given timepoint on the outcome) will usually not be possible with instruments for one liability that affects all periods of exposure, in the absence of other information.

We proposed a new definition of the causal effect estimated by MR using one measure of a time-varying exposure and one liability, the ‘liability effect’, as the causal effect of changing the liability such that one particular measurement of exposure would be one unit higher at a given time, and all other occurrences of the exposure would change accordingly. The estimated liability causal effect will differ in size if the exposure is measured at a different time point, but the estimates will be consistent with the underlying trajectory of exposure induced by the SNP as shown in Table 2. Although the FTO trajectories from the study by Hardy et al.21 study will likely differ from those in larger studies,35 these have been used for illustrative purposes as they cover a broad range of ages. Our interpretation differs from that previously suggested by Labrecque and Swanson25 in that it rests upon a liability caused by a specific genotype. Labrecque and Swanson demonstrated that MR is sensitive to age-related variation in SNP–exposure associations and we have demonstrated that these differences are a necessary component of time-varying exposures. Our assumption that genotype may act, through liability L, upon the whole lifecourse exposure trajectory36 rather than a single exposure measurement is supported by studies demonstrating time-varying genetic associations.15–20

It may seem counter-intuitive to use the instrument to describe the causal effect to be estimated, e.g. we are estimating the effect of the liability that is associated with a given SNP. So if a one-unit increase in SNP causes an increase in exposure at time t0 of one unit and an increase in exposure at time t1 of two units, then the gene–environment equivalence means that the MR estimate of the effect of this liability on outcome is the same as the effect of any intervention that raised the exposure by one unit at t0 and two units at time t1. The underlying point is that we can only examine the effect of a liability that has an instrument associated with it. For example, should an analyst wish to estimate the effect of increasing X by one unit at all time points, an instrument that has a constant effect on exposure over the lifecourse would be required. If the interest is in a liability that causes X to double every 10 years, then an instrument that has this (or a proportional) effect is required. Genetic variants may better relate to relative than absolute differences in some phenotypes and where this is the case, transformations of phenotypes may be considered to demonstrate more consistent effect sizes throughout the lifecourse. Here, MR effects would be consistent over time.

There are thus two consequences of our results. First, if the aim is to estimate the effect of a specific liability, then the researcher needs to find an instrument for that liability. This is no different to any other situation in which some desired exposures cannot be instrumented genetically (e.g. it is hard to imagine a valid genetic instrument for cycling to work). Second, interpretation of an MR of an exposure that varies over time is with respect to the liability for that exposure that is induced by the given genotype. Thus, interpretation of an MR estimate of a time-varying exposure requires knowledge of the liability induced by the genotype. This in turn means that if several genotypes have different effects on the liability for exposure (e.g. one increases exposure during early ages and has a constant effect after age 20 years and another decreases exposure during early ages and has a constant effect after age 20 years) but the same effect on outcome, this would imply that it was the exposure after age 20 years that was important for this outcome. If there are SNPs associated with different liabilities in this way, then multivariable MR (MVMR)28 could be formally used to estimate the effects of the liability for exposure before and after age 20 years.

Our simulations also illustrate that MR is not biased with respect to the liability effect by longitudinal exposure mediation, where earlier exposure measures cause later exposure measures. Again, it is not possible to draw inferences on the timing of causal effects because it is the effect of the liability that is being estimated, which goes through exposure at all time points whether measured or not. An effect of the exposure on the outcome could still be observed even if the exposure is measured later than the outcome, as it is the estimated effect of the liability to the exposure on the outcome that will be estimated. Similarly, if an outcome affects a later measurement of exposure, an investigator will not incorrectly conclude that the outcome causes the exposure, but they may incorrectly conclude that exposure at a given age causes an outcome. With one liability for an exposure, a cumulative effect of exposure will be indistinguishable from an effect of exposure only during specific time periods; as stated before, one can only say that some part of the exposure trajectory is causal, but not which part.6 The lack of the ability to determine causal effects at specific time points complicates comparisons between MR and randomized–controlled trials (RCTs). In an RCT, the timing of exposure (intervention) can be modified (e.g. an intervention can be given at any age, or any stage in a disease time course, as specified by the design of the study) and an intervention will have a specific effect on the subsequent pattern of exposure, e.g. randomization to a weight-loss intervention at age 40 years might lower BMI by one unit at age 43 years, but by age 53 years the two arms of the trial could have the same average BMI. This RCT would thus estimate a different effect on outcome than the MR estimate of the liability effect at age 43 years using the FTO genotype shown in Table 2.

Although we investigated a single SNP, this interpretation of time-varying MR can in principle be extended to multiple SNPs if they all act on the same underlying liability to exposure (L, in Figure 1). It is however highly unlikely that any two SNPs will induce the same liability and thus the same exposure trajectory (non-causal SNPs that tag the same causal variant would not be seen to produce the trajectory). If different SNPs have differing time-varying associations with the exposure, then their estimated liability effects of exposure on outcome will differ. Future studies should assess heterogeneity between groups of SNPs with repeat measures of exposure to assess the consistency of trajectories of exposure.11 Assuming multiple liabilities through MVMR may allow investigators to more reliably test hypotheses about exposures during different time periods.37,38 For example, a recent MR study using multiple instruments with different effects on early (age 10 years) and later life (age 57 years) BMI could draw inferences about the different contributions of liability for BMI at ages 10 and 57 years.15 An MR study with a single instrument (or multiple instruments acting on the same liability) could only draw inferences on the whole time period acted on by that liability.

Our liability effect and the Wald Ratio estimator rely on a linear, additive structural model, but do not make any assumptions about the timing of how exposure affects the outcome. For example, the exposure may act cumulatively on the outcome, may have sensitive or critical periods39 or may have different effects depending on its proximity to the outcome window. If the mechanism of exposure is known, then the appropriate summary of exposure could be derived and used in MR, e.g. using cumulative exposure or functional principal component analysis to summarize trajectories of exposure.40 Further longitudinal genetic studies that investigate time-varying genetic associations with exposures are therefore required to better triangulate causal evidence.41

This framework of an effect of an underlying liability can be extended from multiple measures of the same phenotype to measures of different phenotypes. However, in univariable MR, the assumption would be that instruments only act on a common liability and do not have any direct effects on different phenotypes. To estimate separate direct effects on two phenotypes, one would need to use separate liabilities within an MVMR framework.28

The key aspect when interpreting MR results from time-varying exposures is to consider the underlying liability for a specific exposure trajectory. We have demonstrated this using two exposure occasions for simplicity, but the result holds when the liability is extended across measures of X in continuous time (Supplementary material, available as Supplementary data at IJE online). MR with a genetic instrument using an exposure measured at a single time point provides an unbiased estimate of the causal effect of moving the liability L (as induced by the instrument) such that the exposure at the single time point would be expected to increase by one unit. Care must be taken in interpretation of the results of MR analyses using a single measure of a time-varying exposure, as temporal effects cannot be inferred in the presence of a genetic instrument obtained from a single time point. It is important for future research to consider the exposure trajectories for every genetic instrument used.25

Ethics approval

Ethics approval was not required for this study; only publicly available summary data were used.

Supplementary Material

dyac136_Supplementary_Data

Acknowledgements

We thank Tom Palmer for commenting on earlier drafts of this work. Vanessa Didelez would like to acknowledge financial support from the Simons Institute for the Theory of Computing, Berkeley, CA, USA.

Conflict of interest

None declared.

Contributor Information

Tim T Morris, MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.

Jon Heron, MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.

Eleanor C M Sanderson, MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.

George Davey Smith, MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.

Vanessa Didelez, Leibniz Institute for Prevention Research and Epidemiology—BIPS, Bremen, Germany; Department of Mathematics and Computer Science, University of Bremen, Bremen, Germany.

Kate Tilling, MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.

Data availability

The summary data and all code used in this paper are available at https://github.com/timtmorris/time-varying-MR.

Supplementary data

Supplementary data are available at IJE online.

Author contributions

T.T.M., J.H. and K.T. conceived the study. T.T.M. and K.T. carried out all analyses. All authors contributed to interpretation of the results. T.T.M. drafted the original manuscript. All authors contributed to editing of the manuscript. T.T.M. will act as guarantor for the paper.

Funding

All authors except V.D. are part of the MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, which is supported by the UK Medical Research Council (MC_UU_00011/1 and MC_UU_00011/3).

References

  • 1. Davey Smith G, Ebrahim S.. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 2003;32:1–22. [DOI] [PubMed] [Google Scholar]
  • 2. Davey Smith G, Holmes MV, Davies NM, Ebrahim S.. Mendel’s laws, Mendelian randomization and causal inference in observational data: substantive and nomenclatural issues. Eur J Epidemiol 2020;35:99–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Davey Smith G, Hemani G.. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet 2014;23:89–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Richmond RC, Davey Smith G.. Mendelian randomization: concepts and scope. Cold Spring Harb Perspect Med 2022;12:a040501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Davies NM, Holmes MV, Davey Smith G.. Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ 2018;362:k601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Didelez V, Sheehan N.. Mendelian randomization as an instrumental variable approach to causal inference. Stat Methods Med Res 2007;16:309–30. [DOI] [PubMed] [Google Scholar]
  • 7. Sheehan NA, Didelez V.. Epidemiology, genetic epidemiology and Mendelian randomisation: more need than ever to attend to detail. Hum Genet 2020;139:121–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Labrecque J, Swanson SA.. Understanding the assumptions underlying instrumental variable analyses: a brief review of falsification strategies and related tools. Curr Epidemiol Rep 2018;5:214–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Vansteelandt S, Didelez V.. Improving the robustness and efficiency of covariate-adjusted linear instrumental variable estimators. Scand J Statist 2018;45:941–61. [Google Scholar]
  • 10. Labrecque JA, Swanson SA.. Mendelian randomization with multiple exposures: the importance of thinking about time. Int J Epidemiol 2020;49:1158–62. [DOI] [PubMed] [Google Scholar]
  • 11. Labrecque JA, Swanson SA.. Age-varying genetic associations and implications for bias in Mendelian randomization. medRxiv; doi:10.1101/2021.04.28.21256235, 30 April 2021, preprint: not peer reviewed. [Google Scholar]
  • 12. Leon DA, Davey Smith G, Shipley M, Strachan D.. Adult height and mortality in London: early life, socioeconomic confounding, or shrinkage? J Epidemiol Community Health 1995;49:5–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Tyrrell J, Jones SE, Beaumont R. et al. Height, body mass index, and socioeconomic status: Mendelian randomisation study in UK Biobank. BMJ 2016;352:i582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Mountjoy E, Davies NM, Plotnikov D. et al. Education and myopia: assessing the direction of causality by Mendelian randomisation. BMJ 2018;361:k2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Richardson TG, Sanderson E, Elsworth B, Tilling K, Davey Smith G.. Use of genetic variation to separate the effects of early and later life adiposity on disease risk: Mendelian randomisation study. BMJ 2020;369:m1203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Mostafavi H, Harpak A, Agarwal I, Conley D, Pritchard JK, Przeworski M.. Variable prediction accuracy of polygenic scores within an ancestry group. Elife 2020;9:e48376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Simino J, Shi G, Bis JC. et al. ; LifeLines Cohort Study. Gene-age interactions in blood pressure regulation: a large-scale investigation with the CHARGE, global BPgen, and ICBP consortia. Am J Hum Genet 2014;95:24–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Kwong ASF, Morris TT, Pearson RM. et al. Polygenic risk for depression, anxiety and neuroticism are associated with the severity and rate of change in depressive symptoms across adolescence. J Child Psychol Psychiatry 2021;62:1462–74. [DOI] [PubMed] [Google Scholar]
  • 19. Warrington NM, Howe LD, Paternoster L. et al. A genome-wide association study of body mass index across early life and childhood. Int J Epidemiol 2015;44:700–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Jiang X, Holmes C, McVean G.. The impact of age on genetic risk for common diseases. PLoS Genet 2021;17:e1009723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Hardy R, Wills AK, Wong A. et al. Life course variations in the associations between FTO and MC4R gene variants and body size. Hum Mol Genet 2010;19:545–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Kwong ASF, López-López JA, Hammerton G. et al. Genetic and environmental risk factors associated with trajectories of depression symptoms from adolescence to young adulthood. JAMA Netw Open 2019;2:e196587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Davey Smith G, Ebrahim S.. Mendelian randomization: prospects, potentials, and limitations. Int J Epidemiol 2004;33:30–42. [DOI] [PubMed] [Google Scholar]
  • 24. Kivimäki M, Lawlor DA, Davey Smith G. et al. Variants in the CRP gene as a measure of lifelong differences in average C-reactive protein levels: the cardiovascular risk in young Finns study, 1980–2001. Am J Epidemiol 2007;166:760–64. [DOI] [PubMed] [Google Scholar]
  • 25. Labrecque JA, Swanson SA.. Interpretation and potential biases of Mendelian randomization estimates with time-varying exposures. Am J Epidemiol 2019;188:231–38. [DOI] [PubMed] [Google Scholar]
  • 26. Burgess S, Swanson SA, Labrecque JA.. Are Mendelian randomization investigations immune from bias due to reverse causation? Eur J Epidemiol 2021;36:253–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Swanson SA, Labrecque J, Hernán MA.. Causal null hypotheses of sustained treatment strategies: what can be tested with an instrumental variable? Eur J Epidemiol 2018;33:723–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Sanderson E, Richardson TG, Morris TT, Tilling K, Davey Smith G.. Estimation of causal effects of a time-varying exposure at multiple time points through Multivariable Mendelian randomization. PLoS Genetics 2022;in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Morris TP, White IR, Crowther MJ.. Using simulation studies to evaluate statistical methods. Stat Med 2019;38:2074–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Sanderson E, Spiller W, Bowden J.. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Stat Med 2021;40:5434–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Timpson NJ, Harbord R, Davey Smith G, Zacho J, Tybjaerg-Hansen A, Nordestgaard BG.. Does greater adiposity increase blood pressure and hypertension risk? Mendelian randomization using the FTO/MC4R genotype. Hypertension 2009;54:84–90. [DOI] [PubMed] [Google Scholar]
  • 32. Burgess S, Small DS, Thompson SG.. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res 2017;26:2333–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Xue A, Jiang L, Zhu Z. et al. Genome-wide analyses of behavioural traits are subject to bias by misreports and longitudinal changes. Nat Commun 2021;12:1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Burgess S; CRP CHD Genetics Collaboration. Identifying the odds ratio estimated by a two-stage instrumental variable analysis with a logistic regression model. Stat Med 2013;32:4726–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Helgeland Ø, Vaudel M, Juliusson PB. et al. Genome-wide association study reveals dynamic role of genetic variation in infant and early childhood growth. Nat Commun 2019;10:4448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Evans DM, Davey Smith G.. Mendelian randomization: new applications in the coming age of hypothesis-free causality. Annu Rev Genomics Hum Genet 2015;16:327–50. [DOI] [PubMed] [Google Scholar]
  • 37. Sanderson E, Davey Smith G, Windmeijer F, Bowden J.. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int J Epidemiol 2019;48:713–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Sanderson E. Multivariable Mendelian randomization and mediation. Cold Spring Harb Perspect Med 2021;11:a038984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Smith ADAC, Hardy R, Heron J. et al. A structured approach to hypotheses involving continuous exposures over the life course. Int J Epidemiol 2016;45:1271–79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Cao Y, Rajan SS, Wei P.. Mendelian randomization analysis of a time-varying exposure for binary disease outcomes using functional data analysis methods. Genet Epidemiol 2016;40:744–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Lawlor DA, Tilling K, Davey Smith G .. Triangulation in aetiological epidemiology. Int J Epidemiol 2016;45:1866–86. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

dyac136_Supplementary_Data

Data Availability Statement

The summary data and all code used in this paper are available at https://github.com/timtmorris/time-varying-MR.


Articles from International Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES