Thompson1 noted in 1991 that, although more than a decade had passed since the first discussion of ‘interaction’ in the epidemiological literature, debate had by then subsided and few clear conclusions had emerged. He concluded:
Unfortunately, choice among theories of pathogenesis is enhanced hardly at all by the epidemiological assessment of interaction … What few causal systems can be rejected on the basis of observed results would provide decidedly limited etiological insight.
This conclusion probably represented the consensus of opinion at that time. Yet 20 years later, discussion and reports of interactions pervade the literature. In this journal alone, two papers giving guidelines and recommendations for reporting and assessing interactions have appeared within weeks of one another. Knol and VanderWeele2 considered only the reporting of interactions. Boffetta et al.3 primarily discuss the assessment of evidence, but, since they are much concerned with the ‘cumulative’ evidence following meta-analysis or systematic review, they inevitably touch on the reporting of interactions.
One could point to several developments that have contributed to this reawakening of interest. The first of these has arisen from recent work by the statisticians in ‘causal modelling’, which has led to new, deterministic, definitions of mechanistic interaction. One such approach derives from the earlier work of Rothman4 on component causes, whereas the other is based on the currently popular focus on counterfactual outcomes. In the former approach, interaction between two causal factors is defined as the presence of a sufficient cause, which involves both factors as component causes, whereas from the latter viewpoint, interaction implies the existence of people in the population who would not have developed disease unless they had been exposed to both factors. These viewpoints have been shown to be mutually consistent5–7 and, subject to certain assumptions, are also consistent with statistical definition of interaction as deviation from additivity of effects on risk. Broadly, the same conclusions follow from stochastic models for independent causes, originally discussed by Rothman8 and Miettinen9 and recently revisited from the standpoint of directed acyclical graphical models by VanderWeele and Robins.10
In common with many discussions of interaction and mechanism, Boffetta et al.3 and Knol and VanderWeele2 ignored the role of time (particularly age). For chronic degenerative diseases, this must be considered in any analysis, but it is not straightforward. Greenland and Poole5 briefly discussed this problem in the context of stochastic sufficient cause models, pointing out that most commonly used additive models impose rigid constraints on the form of cause-specific hazard functions. Such constraints are avoided in the non-parametric additive hazards model of Aalen11 (extended to case–control data by Borgan and Langholz12). However, the time-dependent aspect of the problem seems to have been ignored in deterministic causal theories.
Although recent work has established a more rigorous basis for the idea of mechanistic interaction, its identification in many circumstances with non-additivity for effects on risk had already been recognized for many years before Thompson came to his pessimistic conclusion. We can only assume that he regarded establishment of the existence of such interaction as providing ‘decidedly limited etiological insight’. So, what has changed to re-ignite epidemiologists’ enthusiasm for interaction? The answer must surely be the greater availability of genetic data. Thus, discussion of gene–gene interaction often identified with the earlier (and better defined) concept of epistasis pervades the recent genetic epidemiology literature, and few epidemiological grant applications now fail to identify the establishment of ‘gene–environment interaction’ as a primary aim. Yet much of this discussion is as careless in its use of terms as the early epidemiological literature that first prompted debate about the topic 40 years ago; an interaction trumpeted in the title of a paper more often than not turns out to represent a significant interaction term in a logistic regression model, despite the fact that this, in general, has few implications for mechanism. Statistical interaction between two genes in the logistic regression model is often described as epistasis, although the multiplicative model for joint effects of two genes, which is approximately equivalent to a ‘main effects’ logistic model, has previously been used as model for epistasis.13,14 Against this background, perhaps there is a place for the publication of some guidelines.
Perhaps, the most important service that could be provided by the issuing of guidelines would be to foster greater clarity in presentation of precisely ‘why’ the ‘interaction’ reported is of interest. As is argued by Knol and VanderWeele2 this, in turn, has implications for ‘how’ it should be reported. In this respect, Boffetta et al.3 contribute little, after the now familiar agonizing about how difficult it is to define ‘interaction’, finishing with a conclusion remarkably similar to Thompson's of 20 years before:
It is not always clear whether or not sensible biological conclusions can come from a particular statistical formula of interaction.
The discussion continues as if this warning had never been issued. Thus, there is detailed discussion of bias in relation to the interaction parameter in the multiplicative model, and of designs, such as the case-only design in which ‘only’ this parameter can be estimated, despite the admission that interpretation of this parameter is problematic (inference concerning sufficient cause interaction in the latter case has been discussed by VanderWeele et al.15). One must question the rationale for the proposal of prescriptive guidelines for reporting and assessing a phenomenon whose scientific meaning is unclear. If a paper is to report an interaction as a major finding, then it must make clear precisely what sort of interaction is reported and what can be legitimately concluded. We now have a rigorous theory which allows us, at least in some circumstances, to test a well-defined concept of no mechanistic interaction in the sufficient cause framework. But the fact remains that this will often be of ‘decidedly limited interest’. Although it would be rash to claim that the hypothesis of truly independent sufficient causes will never be of interest, one suspects that it will only rarely be plausible. Indeed, it would usually be more interesting if one could show that two causes did indeed operate through independent mechanisms. But, of course, absence of statistically significant interaction does not allow such a conclusion, any more than its presence, on whatever scale, implies that the factors operate on the same biological pathway.
It is often argued that, regardless of any direct causal interpretation, interaction on the additive scale is important from a public health point of view. Even this is over-stated; as pointed out by Clayton and McKeigue,16 the argument for targeted intervention does not only depend on non-additivity of effects—the ‘prevention paradox’17 remains under multiplicative models for accrual of risk.
Despite these reservations, presence or absence of statistical interaction is an important part of any data analysis as it is related to the goodness of fit of the model adopted for risk (either explicitly or implicitly) and, therefore, to the accurate assessment of risks for different risk factor profiles. This brings us to the question of ‘when’ statistical interactions should be reported. The usual criterion for this is Occam's razor, applied either by a classical significance test for interaction or by some optimal prediction criterion such as the Akaike Information Criterion.18 As has been widely recognized, interaction parameters are estimated precisely much less than the main effects so that they are only rarely significant in a single study. However, Boffetta et al.3 are concerned with systematic review, which requires that studies are reported in such a way that evidence can be combined across studies. This raises the possibility that interaction could be insufficiently convincing to report in an individual study, but statistically significant when evidence is combined over studies. Boffetta et al.3 recommend publication of web supplementary tables of data on gene–environment joint effects and, since this recommendation comes at the end of a discussion of the dangers of selective reporting, we can only assume that they are proposing that Occam's razor should not apply in this context. But, clearly reporting of all non-significant interactions in the expectation of later meta-analysis is not feasible. Although not explicitly suggested, it is implied that data on interactions should be presented regardless of their significance when there is sufficient prior expectation that interaction is present. However, in the absence of clear relationship between plausible mechanisms and statistical interaction in a given risk model, how should such prior expectations be informed? Ultimately, there would seem to be no solution to this problem; meta-analysis with an emphasis on joint effects of two or more factors will, of necessity, require assembly of the relevant raw data.
The problem of how to report significant interactions remains, and was discussed in some detail by Knol and VanderWeele.2 This essentially concerns the choice of parametrization of an interaction term in a (generalized) linear model. In the case of two binary risk factors, Knol and VanderWeele distinguish between an ‘interaction’ parametrization, which considers the effect of each risk factor combination with one combination taken as baseline, and an ‘effect modification’ parametrization, which reports the effect of one factor for each level of the other. Both of these parametrizations include one or both of the main effects, and address slightly different questions. It is arguable whether it is necessary to publish guidelines that suggest how authors should lay out their tables to best make their point but, nevertheless, this article contains some useful insights. However, it is worth noting that the ‘interaction’ parametrization results in parameter estimates, which are strongly intercorrelated as a result of the use of a shared baseline. This can be misleading and seriously limits subsequent use of results in meta-analyses. In this context, perhaps the idea of ‘floating absolute risk’19deserves mention. Although this has been criticized,20 its limitations are now better understood and presentation of the ‘quasi-variances’21 will often be helpful.
Although reporting and assessment of evidence for interaction are not the same thing, they are clearly linked. Opinions will, no doubt, differ as to whether or not the publication of guidelines to aid either process is necessary or even useful. In addressing reporting, Knol and VanderWeele2 have at least clearly defined what they mean by ‘interaction’ and have not made unrealistic claims for its interpretation. In contrast, the guidelines for assessment of evidence suggested by Boffetta et al.3 are in danger of adding to the continuing confusion.
Funding
Wellcome Trust Principal Research Fellowship (to D.C.); Juvenile Diabetes Research Foundation (to D.C.). The Cambridge Institute for Medical Research is in receipt of a Wellcome Trust Strategic Award (079895).
Conflict of interest: None declared.
References
- 1.Thompson W. Effect modification and the limits of biological inference from epidemiologic data. J Clin Epidemiol. 1991;44:221–32. doi: 10.1016/0895-4356(91)90033-6. [DOI] [PubMed] [Google Scholar]
- 2.Knol MJ, VanderWeele TJ. Recommendations for presenting analyses of effect modification and interaction. Int J Epidemiol. 2012;41:514–20. doi: 10.1093/ije/dyr218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Boffetta P, Winn DM, Ioannidis JP, et al. Recommendations and proposed guidelines for assessing the cumulative evidence on joint effects of genes and environments on cancer occurrence in humans. Int J Epidemiol. 2012;41:686–704. doi: 10.1093/ije/dys010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rothman KJ. Causes. Am J Epidemiol. 1976;104:587–92. doi: 10.1093/oxfordjournals.aje.a112335. [DOI] [PubMed] [Google Scholar]
- 5.Greenland S, Poole C. Invariants and noninvariants in the concept of independent effects. Scand J Work Environ Health. 1988;14:125–29. doi: 10.5271/sjweh.1945. [DOI] [PubMed] [Google Scholar]
- 6.Flanders WD. On the relationship of sufficient cause models with potential outcome (counterfactual) models. Eur J Epidemiol. 2006;21:847–53. doi: 10.1007/s10654-006-9048-3. [DOI] [PubMed] [Google Scholar]
- 7.VanderWeele TJ, Robins JM. Empirical and counterfactual conditions for sufficient cause interactions. Biometrika. 2008;95:49–61. [Google Scholar]
- 8.Rothman KJ. Synergy and antagonism in cause-effect relationships. Am J Epidemiol. 1974;99:385–88. doi: 10.1093/oxfordjournals.aje.a121626. [DOI] [PubMed] [Google Scholar]
- 9.Miettinen O. Causal and preventive independence: elementary principles. Scand J Work Environ Health. 1982;8:159–68. doi: 10.5271/sjweh.2479. [DOI] [PubMed] [Google Scholar]
- 10.VanderWeele TJ, Robins JM. Directed acyclic graphs, sufficient causes, and the properties of conditioning on a common effect. Am J Epidemiol. 2007;166:1096–104. doi: 10.1093/aje/kwm179. [DOI] [PubMed] [Google Scholar]
- 11.Aalen O. A linear regression model for the analysis of life times. Stat Med. 1989;8:907–25. doi: 10.1002/sim.4780080803. [DOI] [PubMed] [Google Scholar]
- 12.Borgan O, Langholz B. Estimation of excess risk from case-control data using Aalen's linear regression model. Biometrics. 1997;53:690–97. [PubMed] [Google Scholar]
- 13.Risch N. Linkage strategies for genetically complex traits. I. Multilocus models. Am J Hum Genet. 1990;46:222–28. [PMC free article] [PubMed] [Google Scholar]
- 14.Clayton D. Prediction and interaction in complex disease genetics: The experience in type 1 diabetes. PLoS Genet. 2009;5:e1000540. doi: 10.1371/journal.pgen.1000540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.VanderWeele TJ, Hernádez-Díaz S, Hernán MA. Case-only gene-environment interaction studies: When does association imply mechanistic interaction? Genet Epidemiol. 2010;34:327–34. doi: 10.1002/gepi.20484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Clayton D, McKeigue P. Epidemiological methods for studying genes and environmental factors in complex diseases. Lancet. 2001;358:1357–60. doi: 10.1016/S0140-6736(01)06418-2. [DOI] [PubMed] [Google Scholar]
- 17.Rose G. Sick individuals and sick populations. Int J Epidemiol. 1985;14:32–38. doi: 10.1093/ije/14.1.32. [DOI] [PubMed] [Google Scholar]
- 18.Akaike H. A new look at the statistical model identification. IEEE Trans Auto Control. 1974;19:716–23. [Google Scholar]
- 19.Easton D, Peto J, Babiker A. Floating absolute risk: an alternative to relative risk in survival and case-control analysis avoiding an arbitrary reference group. Stat Med. 1991;10:1025–35. doi: 10.1002/sim.4780100703. [DOI] [PubMed] [Google Scholar]
- 20.Greenland S, Michels KB, Robins JM, Poole C, Willett WC. Presenting statistical uncertainty in trends and dose-response relations. Am J Epidemiol. 1999;149:1077–86. doi: 10.1093/oxfordjournals.aje.a009761. [DOI] [PubMed] [Google Scholar]
- 21.Firth D, De Menezes R. Quasi-variances. Biometrika. 2004;91:65–80. [Google Scholar]