Abstract
Mediation analysis helps researchers assess whether part or all of an exposure’s effect on an outcome is due to an intermediate variable. The indirect effect can help in designing interventions on the mediator as opposed to the exposure and better understanding the outcome’s mechanisms. Mediation analysis has seen increased use in genome-wide epidemiological studies to test for an exposure of interest being mediated through a genomic measure such as gene expression or DNA methylation. Testing for the indirect effect is challenged by the fact that the null hypothesis is composite. We examined the performance of commonly used mediation testing methods for the indirect effect in genome-wide mediation studies. When there is no association between the exposure and the mediator and no association between the mediator and the outcome, we show that these common tests are overly conservative. This is a case that will arise frequently in genome-wide mediation studies. Caution is hence needed when applying the commonly used mediation tests in genome-wide mediation studies. We evaluated the performance of these methods using simulation studies, and performed an epigenome-wide mediation association study in the Normative Aging Study, analyzing DNA methylation as a mediator of the effect of pack-years on FEV1.
Keywords: Composite null, DNA methylation, Epigenetics, Hypothesis testing, Type I error
Introduction
Mediation analysis is an attractive tool to study disease causal pathways. It allows researchers to partition an exposure’s effect on an outcome into a direct effect and an indirect effect through an intermediate variable (the mediator). The natural indirect effect (NIE) provides evidence of whether the mediator is on a pathway from the exposure to the outcome and is often of primary interest [Pearl 2001; Robins and Greenland 1992]. This can have important policy/intervention consequences as it may not be possible to change an exposure, but we could intervene to disable the mechanism from the exposure to the outcome.
There has been a rise in genomic studies that have incorporated some form of mediation analysis in recent years [Fasanelli, et al. 2015; Huan, et al. 2016; Liu, et al. 2013; VanderWeele, et al. 2012]. This could be in form of a single nucleotide polymorphism (SNP) acting on gene expression which then acts on a phenotype further downstream, or that a genomic measure such as DNA methylation being a mediator for an environmental exposure’s effect on disease status. Mediation in genomic studies is an ongoing area of research that will continue to grow as more data are collected and pathways are better understood. Given this rise, the statistical problems inherent to genomic mediation studies need to be better understood.
In this paper, we examine the performance of testing the natural indirect effect in epigenome wide association studies where we test the null of no NIE for a single mediator. We examine the performance of four common methods for testing the NIE in simulation studies under various null scenarios. We examine the joint significant method (causal steps method) [MacKinnon, et al. 2002], the product method (Sobel’s method) [Baron and Kenny 1986; Sobel 1982], the bootstrap method [Fritz and MacKinnon 2007] and the difference method [Mackinnon, et al. 1995]. There are other methods not covered here but the message extends to those approaches [Baron and Kenny 1986; Bayarri and Berger 2000; Biesanz, et al. 2010; Judd and Kenny 1981; MacKinnon and Fairchild 2009; MacKinnon, et al. 2007; Yuan and MacKinnon 2009].
The empirical performance of these tests under continuous outcome and mediator has been analyzed in detail in the Nursing, Psychology, and Sociology fields [Biesanz, et al. 2010; Krause, et al. 2010; MacKinnon, et al. 2007; MacKinnon, et al. 2002]. To the best of our knowledge, it has not been brought to the attention of the public health and genetic epidemiology fields, nor has any literature illustrated the effects in the genome-wide multiple testing context. There have been some methodology developments of new approaches for high-dimensional testing in genomic studies [Huang, et al. 2015; Huang, et al. 2014; Huang and Pan 2016; Zhang, et al. 2016], but these developments did not rigorously discuss the limitations of standard testing approaches.
We first demonstrate that the null hypothesis of no NIE is a composite null that occurs under three cases. In genome-wide epigenetic studies, the genomic marker is likely not associated with both the exposure and the outcome. We show that the existing tests for the NIE are conservative in this case. It follows that then applying these commonly used tests in genome-wide epigenetic studies will result in loss of power. We also extend the discussion of the performance of these tests under the null hypothesis to different data types.
The structure of this paper is as follows, we first put forth the methods and notation that will be used in testing. We next perform a simulation analysis to assess how well each testing procedure performs under various null scenarios. We demonstrate the performance of existing methods in assessing the NIE and issues with test sizes (α) under scenarios of the null hypothesis. We then finally perform a mediation analysis in the Normative Aging Study, with DNA methylation as the mediator, pack years as the exposure, and forced expiratory volume in 1 second (FEV1).
Methods
Base Models
We first focus on when we have a continuous outcome (Y) and mediator (M). We have a primary exposure of interest (A) and baseline covariates (X). We consider the following simple model (though this discussion is extendable so as to include interactions [VanderWeele and Vansteelandt 2009]) with zero mean residuals ( , , ) and finite variance:
(1) |
(2) |
(3) |
We assume the baseline covariates (X) suffice to control for exposure-outcome, mediator-outcome, and exposure-mediator confounding, with no mediator-outcome confounders affected by the exposure [Pearl 2001; VanderWeele 2015], assumptions which are needed for a causal interpretation. Note that the covariates in the outcome model and the mediator model do not need to coincide, but for ease of notation, we let X denote all baseline covariates, which include confounders for both M and Y models. The NIE of A on Y through M conditional on X is given by θ2β1 or ϕ1-θ1, as there is no interaction between A and M on Y. A directed acyclic graph of the various causal effects highlighting the NIE is given in Figure 1. The composite null hypothesis of no NIE (H0: θ2β1 = 0) occurs when:
Case 1: No association between A and M but association between M and Y given A and X: β1=0; θ2≠0.
Case 2: Association between A and M but no association between M and Y given A and X: β1≠ 0; θ2=0.
Case 3: No association between A and M and no association between M and Y given A and X: β1=θ2=0.
The majority of mediation applications in traditional epidemiological studies where there is no NIE likely consist of Cases 1 or 2, where researchers have a-priori information and a meaningful reason to investigate a mediator. Case 3 would then not be frequently encountered. However, with increasing genome-wide epidemiological studies, Case 3 arises much more often as researchers test hundreds of thousands of potential genomic mediators across the genome, such as gene expression or methylation.
Statistical Methods for NIE test
We examine four common approaches as mentioned in the introduction. The joint significant test (causal steps method) for NIE is based on the path specific p-values and does not provide an estimate. The p-value is the maximum of the two path specific p-values. If it is less than level α, the null is rejected. Under Case 3, it will have type I error rate equal to α2. For the other two cases, it will converge to α as evidence for the non-null path increases. The product of coefficients method, focuses on the outcome model (1) and mediator model (3) and estimates the NIE by . In terms of estimating the variance, the most commonly used is the Sobel estimator [Sobel 1982]:
where and . The Sobel variance estimate is based on the multivariate delta method, which requires that the gradient of the expression evaluated at the truth be non-zero. The Sobel variance is estimated by substituting the above terms with the corresponding estimators. Testing is conducted by dividing the product by the square root of this estimate and compared with standard normal distribution. Under Case 3, evaluating the Sobel estimator with the true parameters yields 0 (θ2 =β1= 0), and the multivariate delta method does not hold, as the gradient is 0. This is because the product estimator follows a normal product distribution. The difference in coefficients method, considers the outcome model (1) and a reduced outcome model (2). The mediation effect of A on Y through M is ϕ1-θ1. A test for the NIE can be developed by comparing with a standard normal distribution. Under Case 3, the difference method does not follow a normal distribution, but a normal product distribution as the point estimate is equal to [Mackinnon, et al. 1995]. Hence the normality based difference method also fails.
Finally, the bootstrap approximates the empirical distribution of the product method and then derives an empirical p-value by the number of bootstrap sample estimates greater than the observed [Fritz and MacKinnon 2007; Mackinnon, et al. 2004]. the bootstrap method attempts to account for departures from normality, as it uses the empirical distribution.
Here we note that the mediation test statistic is a product of two normals, which in itself is not normal and under Case 3 is symmetric around zero with excess kurtosis of 6 [Craig 1936; Mackinnon, et al. 2004]. Under Case 1 and Case 2 the excess kurtosis will go to 0 as more evidence of the non-null relationship. Given the non-normality of the parameter we are interested in under Case 3 ( ) we expect that the Bootstrap will also fail [Bickel, et al. 2012; Efron 1992].
Binary Mediator or Outcome
We next briefly discuss the product method for the NIE under three other scenarios which are commonly seen in epidemiologic studies. Consider a binary outcome (such as disease status) and the following models for mediation analysis:
Under sufficiently rare outcome and the necessary causal assumptions [VanderWeele and Vansteelandt 2010], we can approximate the log of the NIE as:
We then obtain a NIE estimate by substituting in and . The three cases of no NIE, will thus apply again, and similarly, under Case 3 the product method will fail due to having zero gradient. Note, for this scenario the NIE is defined on the scale of odds ratio as opposed to the mean of a continuous outcome. The null is then NIE=1 or log(NIE)=0.
Next, if the mediator is also binary, we have the following models and expression for the NIE (NIE is measured on the odds ratio) [Valeri and VanderWeele 2013]:
Under Case 3, the gradient of this term evaluated at the truth will be 0. Finally, for a continuous outcome and a binary mediator, under the necessary causal assumptions [Valeri and VanderWeele 2013]:
The gradient of this term evaluated at the truth will be 0 for both Case 1 and Case 3. For these mediation studies above, Case 3 will provide invalid results for the product method.
In mediation analysis with the above mediator and outcome combinations, the primary interest may lie in estimation as opposed to testing of NIE. Regardless, we felt it would be prudent to present these data types under testing. Note that the three cases under the null of no NIE affect statistical inference for these data types as well. Confidence intervals calculated using the normality assumption, may not be at the desired level. This is primarily an issue with a binary outcome and continuous mediator, as the delta method will likely be used in a similar manner as the continuous setting. Bootstrap method used to construct the CI’s for the other scenarios may result in similar issues, due to the non-normality of the estimates under certain scenarios [Bickel, et al. 2012; Efron 1992].
Simulation Study
We performed a simulation study to assess the Type I Error (TIE). Without loss of generality, we considered scalar confounder X for both outcome and mediator models. Let θ2, β1 ∈ (0, 0.14, 0.39), and other parameters set as fixed θ0=θ1=β0=θ3=β2=0.14 (Equations 1, 3). These values were taken as a subset of the values from MacKinnon et al [MacKinnon, et al. 2002]. We considered all nine combinations of the values of and to examine the sizes and powers of the four commonly used methods for testing the NIE: joint, product, difference and bootstrap methods. If only one of and equals to 0, it corresponds to Case 1 or 2 under the composite null of no NIE Case 3 corresponds to If both and are not equal to 0, we study the power.
We generated X and A from independent standard normal and then sequentially generated M and Y with standard normal errors. The sample size was examined at 50, 100, 500, 1,000 and 5,000. For each set, 50,000 replications were performed. The type I error and power were evaluated at α=0.05. For each setting, we applied the four methods for testing for the NIE.
When , A explains approximately 2% of the variation in M conditional on X and when , it is 13%. This is when all errors are standard normal. Break downs of the total variation in Y due to A conditional on X are given in Table 1. This also gives how much of the total variation in Y that is due to the indirect effect of A conditional on X and the direct effect (Supplementary Table 1). When , A explains 2.4% of the variation in Y conditional on X with .3% of that variation coming from the NIE. If , 6.9% of the variation in Y is due to A, with 3.6% of the variation is due to the NIE.
Table 1. Simulations results for Empirical Type I Error and Power.
β1 | θ2 | N | Joint | Product | Difference | Bootstrap |
---|---|---|---|---|---|---|
50 | 0.0024 | 0.0002 | 0.0003 | 0.0024 | ||
100 | 0.0021 | 0.0001 | 0.0001 | 0.0016 | ||
0 | 0 | 500 | 0.0024 | 0.0001 | 0.0002 | 0.0016 |
1000 | 0.0026 | 0.0001 | 0.0002 | 0.0015 | ||
5000 | 0.0027 | 0.0001 | 0.0002 | 0.0014 | ||
| ||||||
50 | 0.0074 | 0.0006 | 0.0010 | 0.0073 | ||
100 | 0.0131 | 0.0011 | 0.0018 | 0.0113 | ||
0 | 0.14 | 500 | 0.0452 | 0.0138 | 0.0175 | 0.0420 |
1000 | 0.0510 | 0.0280 | 0.0321 | 0.0513 | ||
5000 | 0.0514 | 0.0466 | 0.0480 | 0.0524 | ||
| ||||||
50 | 0.0075 | 0.0009 | 0.0013 | 0.0072 | ||
100 | 0.0140 | 0.0017 | 0.0026 | 0.0125 | ||
0.14 | 0 | 500 | 0.0445 | 0.0134 | 0.0173 | 0.0418 |
1000 | 0.0499 | 0.0277 | 0.0317 | 0.0511 | ||
5000 | 0.0503 | 0.0459 | 0.0471 | 0.0516 | ||
| ||||||
50 | 0.0338 | 0.0094 | 0.0120 | 0.0373 | ||
100 | 0.0489 | 0.0212 | 0.0262 | 0.0541 | ||
0 | 0.39 | 500 | 0.0511 | 0.0457 | 0.0473 | 0.0539 |
1000 | 0.0515 | 0.0492 | 0.0499 | 0.0531 | ||
5000 | 0.0514 | 0.0511 | 0.0512 | 0.0524 | ||
| ||||||
50 | 0.0366 | 0.0114 | 0.0145 | 0.0397 | ||
100 | 0.0479 | 0.0233 | 0.0277 | 0.0541 | ||
0.39 | 0 | 500 | 0.0508 | 0.0454 | 0.0471 | 0.0535 |
1000 | 0.0503 | 0.0477 | 0.0484 | 0.0525 | ||
5000 | 0.0503 | 0.0497 | 0.0498 | 0.0516 | ||
| ||||||
50 | 0.0232 | 0.0041 | 0.0056 | 0.0215 | ||
100 | 0.0744 | 0.0158 | 0.0210 | 0.0638 | ||
0.14 | 0.14 | 500 | 0.7663 | 0.5744 | 0.6154 | 0.7483 |
1000 | 0.9855 | 0.9674 | 0.9724 | 0.9842 | ||
5000 | 1 | 1 | 1 | 1 | ||
| ||||||
50 | 0.1103 | 0.0405 | 0.0499 | 0.1101 | ||
100 | 0.2690 | 0.1659 | 0.1859 | 0.2717 | ||
0.14 | 0.39 | 500 | 0.8757 | 0.8648 | 0.8676 | 0.8736 |
1000 | 0.9926 | 0.9920 | 0.9923 | 0.9920 | ||
5000 | 1 | 1 | 1 | 1 | ||
| ||||||
50 | 0.1133 | 0.0442 | 0.0531 | 0.1115 | ||
100 | 0.2636 | 0.1642 | 0.1841 | 0.2663 | ||
0.39 | 0.14 | 500 | 0.8750 | 0.8645 | 0.8676 | 0.8744 |
1000 | 0.9928 | 0.9921 | 0.9923 | 0.9924 | ||
5000 | 1 | 1 | 1 | 1 | ||
| ||||||
50 | 0.5356 | 0.3403 | 0.3780 | 0.5153 | ||
100 | 0.9251 | 0.8594 | 0.8777 | 0.9198 | ||
0.39 | 0.39 | 500 | 1 | 1 | 1 | 1 |
1000 | 1 | 1 | 1 | 1 | ||
5000 | 1 | 1 | 1 | 1 |
We also considered the case when the outcome Y is binary, which is also commonly seen in epigenomic studies. We generated data according to mediator model (3) and a logit outcome model (4), with similar parameter settings as for continuous Y. In detail, we let θ2, β1 ∈ (0, 0.14, 0.39), and set θ1=β0=θ3=β2=0.14. We set θ0=−5 to mimic the situation when the outcome is the incidence of a rare disease (about 1%). We examined a cohort with sample size N=5,000. For each set of parameters (θ2, β1), 50,000 replications were performed, joint method, product method and bootstrap method were employed to evaluate the type I error and/or power at α=0.05.
(4) |
Application to the Normative Aging Study
We next performed an epigenome-wide mediation analysis using DNA methylation (DNAm), the most well studied epigenetic mechanism. This biological process occurs when a methyl group has been added to a cytosine followed by a guanine (CpG site) based pair on the genome and is measured across the genome. It has been found to be associated with a variety of diseases and exposures [Ciernia and LaSalle 2016; Panni, et al. 2016; Teschendorff, et al. 2015]. We performed an epigenome-wide Mediation Study to assess mediation of smoking behavior (pack years) on FEV1 through logit transformed DNAm probes [Du, et al. 2010]. Pack years is a measure of how much an individual has smoked over multiple years. It is defined as the number of packs of cigarettes smoked per day times the years the individual has smoked. FEV1 measures the forced expiratory volume in 1 second of an individual. FEV1 is affected by airway narrowing and is often used to diagnose and measure severity of cases of Chronic Obstructive Pulmonary Disease (COPD) and cystic fibrosis [Taylor-Robinson, et al. 2012; Weiss 2010]. The analysis was performed on FEV1 in the Normative Aging Study, a cohort study in the Greater-Boston metropolitan area [Bell, et al. 1972]. We focused on the data consisting of 575 Caucasian men, with DNAm measured on the Illumina 450k array.
Briefly, DNAm was collected in whole blood in study participants between 1999 and 2007. We removed any probes on the sex chromosomes, non-CG probes and probes within 10 bp of a known SNP within the 1000 genomes population [Butcher 2013]. More information about the preprocessing and collection of the data has been published elsewhere [Panni, et al. 2016]. We adjusted for: age, log of height, visit information, cell type proportion [Houseman, et al. 2012], health history, smoking status, top ten surrogate variables (SV) [Leek and Storey 2007] and technical variables. The SV’s were calculated on a subset of the DNAm probes with an interdecile range greater than 0.1 (111,189 or approximately 25% of probes). This was to decrease computational time and to use more informative probes. SV analysis was performed to control for additional batch effects, given our design matrix of covariates X and pack-years. Testing of the NIE was done using the joint significance and the product using the Sobel variance method. We did not use the bootstrap approach due to the computational cost of performing the number of bootstrap samples for the empirical p-value to be significant at the genome-wide scale. The analysis was done on a total of 449,547 probes.
Results
Simulation Study
When either θ2 is non-zero (Case 1) or β1 is non-zero (Case 2), but not both, the type I error of the four methods for testing NIE is controlled under 0.05 if the sample size is large and the effects size of the non-zero β1 or θ2 is not small (Table 1). If the nonzero coefficient β1 or θ2 is small and the sample size is not sufficiently large, the four tests are conservative, especially the product estimator and the difference estimator. For β1=θ2= 0 (Case 3), the type I error is extremely low even for large sample sizes, indicating these commonly used tests are conservative. This is due to the fact that the tests do not follow the standard normal distribution but a normal product distribution in Case 3. Using non-Sobel variance estimators [Aroian 1947; Goodman 1960] leads to similar results (data not shown) [MacKinnon, et al. 2002].
In genome-wide epigenetic/epidemiological studies, the effect sizes associated with single CpG sites, such as β1 and θ2 tend to be small and a vast majority of probes are Case 3, thus the genome-wide type I error control is likely to be a problem when applying mediation tests over the CpG markers genome-wide. Further, we do not know which case (Cases 1, 2, 3) a probe belongs to, we would not be able to know what null distribution to use as a reference distribution, e.g., using a normal distribution or a product normal distribution for the product method. Among all settings the joint and bootstrap procedures have sizes closer to the null and the highest power (last nine rows of Table 1).
When the outcome is binary, deflated type I error were also observed for all methods under consideration for β1=θ2= 0 (Case 3), as well as for the Case 1 and Case 2 when the true effect size of the non-zero parameter (β1 or θ2) is small (Table 2). Thus existing methods also tend to be conservative for problems with binary outcomes. Notice here the bootstrap method may tend to have slight inflation on type I error and power in some of the cases, which is likely to be related to unbalanced data (smaller number of cases than controls), thus in practice, it may not be recommended to use bootstrap method unless the sample size is large, or special bootstrap techniques that are designed for unbalanced samples may be employed to overcome such issues.
Table 2. Simulations results for Empirical Type I Error and Power when the outcome is binary.
β1 | θ2 | Joint | Product | Bootstrap |
---|---|---|---|---|
0 | 0 | 0.0026 | 0.0001 | 0.0017 |
0 | 0.14 | 0.0065 | 0.0005 | 0.0043 |
0.14 | 0 | 0.0509 | 0.0463 | 0.0592 |
0 | 0.39 | 0.0351 | 0.0064 | 0.0286 |
0.39 | 0 | 0.0509 | 0.0505 | 0.0592 |
0.14 | 0.14 | 0.1310 | 0.1226 | 0.1398 |
0.14 | 0.39 | 0.6795 | 0.6652 | 0.6730 |
0.39 | 0.14 | 0.1322 | 0.1309 | 0.1397 |
0.39 | 0.39 | 0.6884 | 0.6869 | 0.6839 |
Application to the Epigenome-Wide Normative Aging Study
We examine the QQ-plots for the associations between smoking and DNA methylations and the associations between DNA methylations and FEV1. We see little to no inflation in the distribution of p-values (Figures 2A and 2B). Under the null hypothesis of no NIE, we also expect a test to yield uniformly distributed p-values, which does not occur here (Figure 2C). The observed deflation is likely due to most probes existing in Case 3. This deflation is not likely just due to any unmeasured confounders, as the p-values from exposure to mediator and mediator to outcome given exposure appear to follow a uniform distribution (Figures 2A and 2B). Under Case 3, the joint p-value, which is the maximum of the path-specific p-values, does not follow a uniform distribution. This deflation is also seen if we used the product method with Sobel’s estimator (Figure 2D).
One probe was found to be genome wide significant: cg05575921, at the AHRR (Aryl-Hydrocarbon Receptor Repressor) gene (p-value 1.92 e-10, NIE estimate −0.0036, product method p-value 5.62e-09). This probe has been found to be associated with smoking in several previous studies [Fasanelli, et al. 2015; Zeilinger, et al. 2013]. It is located at chr 5:373378 on the north shore of AHRR [Fasanelli, et al. 2015]. The aryl-hydrocarbon receptor helps to regulate the effects of stressors from the outside environment, and AHRR prevents that mechanism from occurring [Fasanelli, et al. 2015]. There was evidence that this probe (with a collection of other probes) potentially mediates the effect of maternal smoking on offspring birthweight [Kupers, et al. 2015]. It has also been suggested that this probe is a risk factor for lung cancer [Baglietto, et al. 2017]. The association with smoking in DNAm at this probe is one of the more frequently reported associations in the epigenome wide field so far. This significant NIE suggests a further association of interest, with pack years’ effect on FEV1 being mediated by DNAm.
Conclusion
In addition to the traditional considerations of whether a variable meets the criteria of a mediator, it is important to consider the statistical test for the NIE. We have demonstrated that the classical mediation analysis methods, such as Sobel’s product, are not statistically appropriate under Case 3 (Table 1). Other approaches also fail to have the proper type I error under Case 3 [Biesanz, et al. 2010; MacKinnon, et al. 2002]. We have shown that this could lead to severe problem in epigenome-wide studies for mediation, where the task is to test a large number of potential mediators and pathways for NIEs and very commonly there is a mixture of Case 1, 2 and 3, with Case 3 likely occurring most often. These studies could result in severe deflation in the p-value distribution. Case 3 needs to be further studied to derive proper testing procedures for these new challenges.
We demonstrated how Case 3 can affect real data analysis where a mediation analysis equivalent of an epigenome-wide association study was performed. The deflation we observe in the tests of no NIE is likely due to the statistical properties of the tests as we did not see deflation in the association specific p-value distribution. We also provide more evidence for the AHRR gene being an important mediator of smoking history and lung function.
We note that in practice the genomic mediators are likely correlated. This could lead to biased estimates of the natural indirect effect due to failure to account for unmeasured confounders in the form of nearby DNAm markers. Methods for testing the joint mediated effect should be further developed. There have been approaches for this multiple mediator case in the epidemiology field [Daniel, et al. 2015; VanderWeele and Vansteelandt 2014], as well as the higher dimensionality of the genomic case [Huang and Pan 2016; Zhang, et al. 2016]. However, we feel that it is important to derive proper testing size under the single mediator case which can then guide testing under the multiple mediator situation for Case 3.
In most epidemiological studies, where there is prior knowledge of being in Case 1 or 2 and sample sizes are large, the product and difference methods are likely appropriate. However, in genome-wide genomic studies, Case 3 is common and it is unknown in advance which of the three cases a probe falls in when scanning the genome. For now, if Case 3 is thought to be a reasonable possibility, we suggest that the product and difference methods be avoided as the conditions for the multivariate delta method do not hold. We suggest the joint significance or bootstrap might be considered with the understanding that these are conservative under Case 3. Future research is needed to develop tests for the NIE in genome-wide genetic/epigenetic studies by accounting for the fact that null hypothesis is a composite null.
Supplementary Material
Acknowledgments
This work was supported by NIH grants R01 ES015172; R01 ES017876, R01 ES021733; R01 ES025225, R01 HL113338, R35 CA197449, P01 CA134294, T32 GM074897, T32 ES007142, U01 HG009088 and by the Clinical Science R&D Service, US Department of Veterans Affairs (AS). The United States Department of Veterans Affairs (VA) Normative Aging Study (NAS) is supported by the Cooperative Studies Program/Epidemiology Research and Information Center and is a research component of the Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC). The views expressed in this paper are those of the authors and do not necessarily represent the views of the US Department of VA. The computations in this paper were run on the Odyssey cluster supported by the FAS Division of Science, Research Computing Group at Harvard University.
References
- Aroian LA. The probability function of the product of two normally distributed variables. The Annals of Mathematical Statistics. 1947:265–271. [Google Scholar]
- Baglietto L, Ponzi E, Haycock P, Hodge A, Bianca Assumma M, Jung CH, Chung J, Fasanelli F, Guida F, Campanella G, et al. DNA methylation changes measured in pre-diagnostic peripheral blood samples are associated with smoking and lung cancer risk. Int J Cancer. 2017;140(1):50–61. doi: 10.1002/ijc.30431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of personality and social psychology. 1986;51(6):1173. doi: 10.1037//0022-3514.51.6.1173. [DOI] [PubMed] [Google Scholar]
- Bayarri MJ, Berger JO. P values for composite null models. Journal of the American Statistical Association. 2000;95(452):1127–1142. [Google Scholar]
- Bell B, Rose CL, Damon A. The Normative Aging Study: an interdisciplinary and longitudinal study of health and aging. The International Journal of Aging and Human Development. 1972;3(1):5–17. [Google Scholar]
- Bickel PJ, Götze F, van Zwet WR. Resampling Fewer Than n Observations: Gains, Losses, and Remedies for Losses. In: van de Geer S, Wegkamp M, editors. Selected Works of Willem van Zwet. New York, NY: Springer New York; 2012. pp. 267–297. [Google Scholar]
- Biesanz JC, Falk CF, Savalei V. Assessing mediational models: Testing and interval estimation for indirect effects. Multivariate Behavioral Research. 2010;45(4):661–701. doi: 10.1080/00273171.2010.498292. [DOI] [PubMed] [Google Scholar]
- Butcher L. Illumina450ProbeVariants.db: Annotation Package combining variant data from 1000 Genomes Project for Illumina HumanMethylation450 Bead Chip probes 2013 [Google Scholar]
- Ciernia AV, LaSalle J. The landscape of DNA methylation amid a perfect storm of autism aetiologies. Nat Rev Neurosci. 2016;17(7):411–23. doi: 10.1038/nrn.2016.41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Craig CC. On the Frequency Function of xy. The Annals of Mathematical Statistics. 1936;7(1):1–15. [Google Scholar]
- Daniel RM, De Stavola BL, Cousens SN, Vansteelandt S. Causal mediation analysis with multiple mediators. Biometrics. 2015;71(1):1–14. doi: 10.1111/biom.12248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Du P, Zhang X, Huang C-C, Jafari N, Kibbe WA, Hou L, Lin SM. Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis. BMC bioinformatics. 2010;11(1):587. doi: 10.1186/1471-2105-11-587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Efron B. Bootstrap Methods: Another Look at the Jackknife. In: Kotz S, Johnson NL, editors. Breakthroughs in Statistics: Methodology and Distribution. New York, NY: Springer New York; 1992. pp. 569–593. [Google Scholar]
- Fasanelli F, Baglietto L, Ponzi E, Guida F, Campanella G, Johansson M, Grankvist K, Johansson M, Assumma MB, Naccarati A, et al. Hypomethylation of smoking-related genes is associated with future lung cancer in four prospective cohorts. Nat Commun. 2015;6:10192. doi: 10.1038/ncomms10192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fritz MS, MacKinnon DP. Required sample size to detect the mediated effect. Psychological science. 2007;18(3):233–239. doi: 10.1111/j.1467-9280.2007.01882.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodman LA. On the exact variance of products. Journal of the American Statistical Association. 1960;55(292):708–713. [Google Scholar]
- Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, Wiencke JK, Kelsey KT. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC bioinformatics. 2012;13(1):1. doi: 10.1186/1471-2105-13-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huan T, Joehanes R, Schurmann C, Schramm K, Pilling LC, Peters MJ, Magi R, DeMeo D, O’Connor GT, Ferrucci L, et al. A Whole-Blood Transcriptome Meta-Analysis Identifies Gene Expression Signatures of Cigarette Smoking. Hum Mol Genet. 2016 doi: 10.1093/hmg/ddw288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang Y-T, Liang L, Moffatt MF, Cookson WOCM, Lin X. iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis. Genetic epidemiology. 2015;39(5):347–356. doi: 10.1002/gepi.21905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang Y-T, VanderWeele TJ, Lin X. Joint analysis of SNP and gene expression data in genetic association studies of complex diseases. The annals of applied statistics. 2014;8(1):352. doi: 10.1214/13-AOAS690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang YT, Pan WC. Hypothesis test of mediation effect in causal mediation model with high-dimensional continuous mediators. Biometrics. 2016;72(2):402–13. doi: 10.1111/biom.12421. [DOI] [PubMed] [Google Scholar]
- Judd CM, Kenny DA. Process analysis estimating mediation in treatment evaluations. Evaluation review. 1981;5(5):602–619. [Google Scholar]
- Krause MR, Serlin RC, Ward SE, Rony RYZ, Ezenwa MO, Naab F. Testing mediation in nursing research: Beyond Baron and Kenny. Nursing research. 2010;59(4):288. doi: 10.1097/NNR.0b013e3181dd26b3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kupers LK, Xu X, Jankipersadsing SA, Vaez A, la Bastide-van Gemert S, Scholtens S, Nolte IM, Richmond RC, Relton CL, Felix JF, et al. DNA methylation mediates the effect of maternal smoking during pregnancy on birthweight of the offspring. Int J Epidemiol. 2015;44(4):1224–37. doi: 10.1093/ije/dyv048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leek JT, Storey JD. Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 2007;3(9):1724–35. doi: 10.1371/journal.pgen.0030161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y, Aryee MJ, Padyukov L, Fallin MD, Hesselberg E, Runarsson A, Reinius L, Acevedo N, Taub M, Ronninger M, et al. Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. Nat Biotechnol. 2013;31(2):142–7. doi: 10.1038/nbt.2487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Fairchild AJ. Current directions in mediation analysis. Current Directions in Psychological Science. 2009;18(1):16–20. doi: 10.1111/j.1467-8721.2009.01598.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Fritz MS, Williams J, Lockwood CM. Distribution of the product confidence limits for the indirect effect: Program PRODCLIN. Behavior research methods. 2007;39(3):384–389. doi: 10.3758/bf03193007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Lockwood CM, Hoffman JM, West SG, Sheets V. A comparison of methods to test mediation and other intervening variable effects. Psychological methods. 2002;7(1):83. doi: 10.1037/1082-989x.7.1.83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackinnon DP, Lockwood CM, Williams J. Confidence Limits for the Indirect Effect: Distribution of the Product and Resampling Methods. Multivariate Behav Res. 2004;39(1):99. doi: 10.1207/s15327906mbr3901_4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackinnon DP, Warsi G, Dwyer JH. A Simulation Study of Mediated Effect Measures. Multivariate Behav Res. 1995;30(1):41. doi: 10.1207/s15327906mbr3001_3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Panni T, Mehta AJ, Schwartz JD, Baccarelli AA, Just AC, Wolf K, Wahl S, Cyrys J, Kunze S, Strauch K, et al. A Genome-Wide Analysis of DNA Methylation and Fine Particulate Matter Air Pollution in Three Study Populations: KORA F3, KORA F4, and the Normative Aging Study. Environmental health perspectives. 2016 doi: 10.1289/ehp.1509966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pearl J. Direct and indirect effects. 2001:411–420. [Google Scholar]
- Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology. 1992:143–155. doi: 10.1097/00001648-199203000-00013. [DOI] [PubMed] [Google Scholar]
- Sobel ME. Asymptotic confidence intervals for indirect effects in structural equation models. Sociological methodology. 1982;13(1982):290–312. [Google Scholar]
- Taylor-Robinson D, Whitehead M, Diderichsen F, Olesen HV, Pressler T, Smyth RL, Diggle P. Understanding the natural progression in %FEV1 decline in patients with cystic fibrosis: a longitudinal study. Thorax. 2012;67(10):860–6. doi: 10.1136/thoraxjnl-2011-200953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teschendorff AE, Lee SH, Jones A, Fiegl H, Kalwa M, Wagner W, Chindera K, Evans I, Dubeau L, Orjalo A, et al. HOTAIR and its surrogate DNA methylation signature indicate carboplatin resistance in ovarian cancer. Genome Med. 2015;7:108. doi: 10.1186/s13073-015-0233-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valeri L, VanderWeele TJ. Mediation analysis allowing for exposure-mediator interactions and causal interpretation: theoretical assumptions and implementation with SAS and SPSS macros. Psychol Methods. 2013;18(2):137–50. doi: 10.1037/a0031034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele T. Explanation in causal inference: methods for mediation and interaction. Oxford University Press; 2015. [Google Scholar]
- VanderWeele T, Vansteelandt S. Conceptual issues concerning mediation, interventions and composition. Statistics and its Interface. 2009;2:457–468. [Google Scholar]
- VanderWeele TJ, Asomaning K, Tchetgen Tchetgen EJ, Han Y, Spitz MR, Shete S, Wu X, Gaborieau V, Wang Y, McLaughlin J, et al. Genetic variants on 15q25.1, smoking, and lung cancer: an assessment of mediation and interaction. Am J Epidemiol. 2012;175(10):1013–20. doi: 10.1093/aje/kwr467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele TJ, Vansteelandt S. Odds ratios for mediation analysis for a dichotomous outcome. Am J Epidemiol. 2010;172(12):1339–48. doi: 10.1093/aje/kwq332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele TJ, Vansteelandt S. Mediation Analysis with Multiple Mediators. Epidemiol Methods. 2014;2(1):95–115. doi: 10.1515/em-2012-0010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weiss ST. Lung function and airway diseases. Nat Genet. 2010;42(1):14–6. doi: 10.1038/ng0110-14. [DOI] [PubMed] [Google Scholar]
- Yuan Y, MacKinnon DP. Bayesian mediation analysis. Psychological methods. 2009;14(4):301. doi: 10.1037/a0016972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeilinger S, Kuhnel B, Klopp N, Baurecht H, Kleinschmidt A, Gieger C, Weidinger S, Lattka E, Adamski J, Peters A, et al. Tobacco smoking leads to extensive genome-wide changes in DNA methylation. PLoS One. 2013;8(5):e63812. doi: 10.1371/journal.pone.0063812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H, Zheng Y, Zhang Z, Gao T, Joyce B, Yoon G, Zhang W, Schwartz J, Just A, Colicino E, et al. Estimating and testing high-dimensional mediation effects in epigenetic studies. Bioinformatics. 2016;32(20):3150–3154. doi: 10.1093/bioinformatics/btw351. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.