Misclassification and measurement error are very common in studies of effects of exposures and are often ignored. The problem of exposure misclassification for interaction measures in the context of both multiplicative interaction (1–4) and additive interaction (5) has been discussed previously. Other investigators have discussed problems of outcome misclassification for a single exposure (6, 7). However, little is known about how interaction measures perform when the outcome is misclassified. In the present letter, we consider additive interactions with a mismeasured outcome. All the proofs are given in Web Appendix 1, available at http://aje.oxfordjournals.org/.
Let X1 and X2 denote 2 binary exposures and let D denote a binary outcome. Let . The measure of interaction for risks on the linear additive scale is as follows:
If p11 − p10 − p01 – p00 > 0, the additive interaction is said to be positive. If p11 − p10 − p01 – p00 < 0, the additive interaction is said to be negative. Now suppose that D is subject to misclassification, and let D* denote the observed outcome. We assume that the misclassification is nondifferential, that is, . The probability of misclassification can be characterized by sensitivity and specificity . We assume that SN + SP > 1, which is plausible because the observed outcome is more likely to be 1 (or 0) if the true outcome is 1 (or 0). If we let for x1, x2 = 0,1, we get the following result:
| (1) |
Because SN + SP – 1 ≤ 1, the naive estimator that is determined using the observed outcome will always be biased towards the null. We can furthermore get corrected estimates and confidence intervals by dividing the estimate and both limits of the confidence interval of the naive estimator by (SN + SP –1). For corrected estimates, sensitivity and specificity will often be unknown but can be varied in a sensitivity analysis. The precise values of sensitivity and specificity, however, do not need to be known to obtain conservative estimates.
Sometimes, instead of using risk differences to measure interaction, we might use risk ratios or odds ratios. In general, measures of interaction on the odds ratio and risk ratio scale will be very close to one another whenever the outcome is rare (6). Thus, we will focus only on the risk ratio. Let be the risk ratio effect measures. A measure of interaction on the additive scale for risk ratios (6) is as follows:
If we suppose the outcome is measured with error, we could use the naive estimator using the observed outcome:
Unfortunately, we cannot obtain a simple relation between and RERIRR. Instead, we have the following qualitative result:
| (2) |
Equation 2 shows that the naive estimator for the additive interaction on the risk ratio scale will always estimate the true interaction towards the null. The interaction measure using the observed misclassified data will thus always be conservative, and tests using will thus be valid in the sense of having a type I error less than 5%. This conclusion enables us to draw qualitative conclusions in practice even when we cannot observe the true outcome. For example, if we estimate the interaction to be positive using the observed misclassified data, we can conclude that true additive interaction is positive. Further details on the mathematical relations between RERIRR and are given in Web Appendix 1.
Another 2 measures of additive interaction that use risk ratios are called the attributable proportion and the synergy index. Rothman et al. (6) first defined the attributable proportion as
which measures the proportion of the risk in the doubly exposed group that is due to the interaction. When the outcome is measured with error, the naive estimator using the misclassified data is
and we can obtain the following result:
| (3) |
Equation 3 has the same form as equation 2 in that the measure using the observed data is conservative for the true measure. An alternative definition of attributable proportion discussed by VanderWeele and Tchetgen Tchetgen (8) is:
This measure considers the proportion of the joint effects of both exposures together that is due to interaction. When the outcome is measured with error, the naive estimator using the observed outcome is
and we can obtain the following result:
| (4) |
Equation 4 shows that the attributable proportion discussed by VanderWeele and Tchetgen Tchetgen (8) remains the same when using a nondifferentially misclassified outcome. Therefore, using this measure, we can still obtain the magnitude of interaction without knowing the sensitivity and specificity parameters.
Rothman et al. (6) gave the definition of the synergy index as
which measures the extent to which the risk ratio for both exposures together exceeds 1 and whether this is greater than the sum of the extent to which each of the risk ratios considered separately exceeds 1. When the outcome is measured with error, we use the observed outcome to calculate the synergy index,
and we can obtain the following result:
| (5) |
This shows that the synergy index does not change when estimated using the misclassified data. VanderWeele (9, 10) discussed how constraints of the form p11 − p10 − p01 > 0 and p11 − p10 − p01 – p00 > 0 can be used to make inferences about causal sufficient cause interaction. We can use observed data with misclassified outcome to also make such inferences as stated in the following results:
| (6) |
| (7) |
Details are provided in Web Appendix 1. By similar arguments, analogous results also hold for n-way sufficient cause interactions (11) and for sufficient cause interactions with categorical and ordinal exposures.
Unfortunately, similar simple relations do not seem to hold for interaction on the multiplicative scale, and biases due to outcome misclassification can be in either direction. An exception occurs if there is perfect specificity (SP = 1) for the outcome, in which case measures of multiplicative interaction are unbiased even with imperfect sensitivity (Web Appendix 1). Further research could examine correction strategies and evaluate these through simulations.
Supplementary Material
Acknowledgments
The research was supported National Institutes of Health grant ES017876 and the China Scholarship Council.
Conflict of interest: none declared.
References
- 1.García-Closas M, Thompson WD, Robins JM. Differential misclassification and the assessment of gene-environment interactions in case-control studies. Am J Epidemiol. 1998;147(5):426–433. doi: 10.1093/oxfordjournals.aje.a009467. [DOI] [PubMed] [Google Scholar]
- 2.Zhang L, Mukherjee B, Ghosh M, et al. Accounting for error due to misclassification of exposures in case-control studies of gene-environment interaction. Stat Med. 2008;27(15):2756–2783. doi: 10.1002/sim.3044. [DOI] [PubMed] [Google Scholar]
- 3.Cheng KF, Lin WJ. The effects of misclassification in studies of gene-environment interactions. Hum Hered. 2009;67(2):77–87. doi: 10.1159/000179556. [DOI] [PubMed] [Google Scholar]
- 4.Lindström S, Yen YC, Spiegelman D, et al. The impact of gene-environment dependence and misclassification in genetic association studies incorporating gene-environment interactions. Hum Hered. 2009;68(3):171–181. doi: 10.1159/000224637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Vanderweele TJ. Inference for additive interaction under exposure misclassification. Biometrika. 2012;99(2):502–508. doi: 10.1093/biomet/ass012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. Philadelphia, PA: Lippincott Williams & Wilkins; 2008. [Google Scholar]
- 7.Lash TL, Fox MP, Fink AK. Applying Quantitative Bias Analysis to Epidemiologic Data. New York, NY: Springer; 2011. [Google Scholar]
- 8.VanderWeele TJ, Tchetgen Tchetgen EJ. Attributing effects to interactions. Epidemiology. 2014;25(5):711–722. doi: 10.1097/EDE.0000000000000096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.VanderWeele TJ. Sufficient cause interactions and statistical interactions. Epidemiology. 2009;20(1):6–13. doi: 10.1097/EDE.0b013e31818f69e7. [DOI] [PubMed] [Google Scholar]
- 10.VanderWeele TJ. Epistatic interactions. Stat Appl Genet Mol Biol. 2010;9(1) doi: 10.2202/1544-6115.1517. Article 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.VanderWeele TJ, Richardson TS. General theory for interactions in sufficient cause models with dichotomous exposures. Ann Stat. 2012;40(4):2128–2161. doi: 10.1214/12-aos1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
