Investigators have several design, measurement, and analytic tools to detect and reduce bias in epidemiological studies. One such approach, “negative controls,” has been used on an ad hoc basis for decades. A formal approach has recently been suggested for its use to detect confounding, selection, and measurement bias in epidemiological studies.1,2 Negative controls in epidemiological studies are analogous to negative controls in laboratory experiments, in which investigators test for problems with the experimental method by leaving out an essential ingredient, inactivating the hypothesize dactive ingredient, or checking for an effect that would be impossible by the hypothesized mechanism.1 A placebo treatment group in a randomized trial is an example of a negative control exposure (leaving out an essential ingredient) that helps remove bias that can result from participant or practitioner knowledge of an individual’s treatment assignment—the placebo treatment is susceptible to the same bias structure as the actual treatment but is causally unrelated to the outcome of interest.
Negative control outcomes are conceptually similar but are subtly different because, unlike exposures in a randomized trial, they are not under investigator control. The formal definition of a negative control outcome is one that shares the same potential sources of bias with the primary outcome but cannot plausibly be related to the treatment of interest. For example, early screening echocardiography for patent ductus arteriosus in extremely preterm infants was associated with a 4.3% absolute reduction of in-hospital mortality in a propensity-score matched analysis of a population-based cohort.3 To help check for residual bias from unmeasured confounding, investigators repeated the analysis using late-onset infections as a negative control outcome under the assumption that any sources of uncontrolled confounding in the mortality analysis would similarly lead to lower incidence of late-onset infection (an effect that would be impossible by the hypothesized mechanism). The finding of no association between echocardiography screening and the negative control outcome provided additional support for the conclusion from the primary analysis.
To date, nearly all discussions and applications of negative control outcomes have focused on use in observational studies to detect unmeasured confounding.4,5 With sufficient sample size and proper allocation, randomized trials are protected from confounding bias when estimating an intention-to-treat effect; however, confounding, selection, and measurement bias can still threaten the validity of trials in many circumstances that regularly occur. For example, even masked trials with aplacebo control can be vulnerable to bias if the treatment has adverse effects (leading to selection bias from differential attrition or measurement bias from unblinding participants or practitioners). In this Viewpoint, we suggest that negative control outcomes can be a valuable addition to detect residual bias in randomized trials.
Confounding and selection bias are of greatest concern in clinical trials that report analyses beyond intention-to-treat. For example, trials with imperfect adherence often include as-treated and per-protocol analyses. As-treated analyses can be vulnerable to confounding bias because participants are analyzed according to the treatment regimen they actually followed irrespective of their randomized assignment, which may be confounded by prognostic factors. Per-protocol analyses restrict the analysis to participants who were adherent with their randomized assignment and can be vulnerable to selection bias because participants who are adherent are usually different from those who are not. If both treatment assignment and prognostic characteristics affect adherence, excluding those who are not adherent from the analysis induces selection bias. Comparison of observable characteristics between study participants who are adherent to the assigned intervention and those who are not adherent can help provide clues about the potential for bias. Controlling for these in as-treated and per-protocol analyses may remove the bias.
However, such analyses (ie, as treated and per protocol) no longer rely solely on randomization for inference and are effectively observational analyses. Accordingly, there is always concern that statistical adjustments are imperfect because they can only control for bias from measurable factors. A negative control outcome analysis goes one step further to help identify the presence of residual bias: if an effect is observed between the treatment and negative control outcome that is impossible by the hypothesized mechanism, this suggests that unmeasured or unmeasurable sources of bias are influencing the results.1,2 A trial to measure the effect of flexible sigmoidoscopy screening on colorectal cancer mortality provides an illustrative example,6 whereby per-protocol analyses could have overestimated the benefits of screening on mortality due to “healthy screenee” selection bias if individuals assigned to regular screening were more health conscious than those who were not adherent. A negative control outcome that was affected by health consciousness but not influenced by flexible sigmoidoscopy screening, such as mortality due to noncolorectal cancers, could have been used to detect this bias. Selection bias can also threaten trial validity in other ways such as differential inclusion or exclusion protocols or differential loss to follow-up (attrition). Negative control outcomes could provide similarly useful diagnostics for the presence of selection bias from these other mechanisms.2
Measurement bias from differential outcome misclassification by treatment status is another concern in randomized trials. A large systematic review of clinical trials found evidence of systematically larger effects among unblinded trials with subjective outcomes (either patient reported or investigator assessed), likely owing to differential measurement bias from knowledge of treatment status.7 Negative control outcomes can be useful here as well. For example, trials of in-home water treatment ordinarily measure child diarrhea outcomes based on caregiver-reported symptoms because of the cost and logistical difficulties of collecting stool specimens and testing them for enteric pathogens. Because such trials are rarely blinded, there is concern that caregivers who receive in-home water treatment could underreport diarrhea, leading to a biased effect away from the null. To test for this potential source of bias, a trial of in-home chlorination and safe storage also asked care-givers about skin rash and ear infection symptoms along with diarrhea, with the hypothesis that these symptoms could be subject to the same source of potential reporting bias but could not be improved by drinking water treatment.8 A large reduction in diarrhea but not skin rash or ear infections added credibility to the trial’s primary results using reported diarrhea.
Selecting a good control outcome at the design stage of a trial that captures the bias structure of concern but is unequivocally unrelated to the treatment requires subject-matter expertise. Nevertheless, a deep understanding of the science underlies most substantive elements of epidemiological study design and analysis, so negative control outcomes are a natural addition to the approaches trialists could use. A second caveat is that negative controls typically identify the presence of bias but not necessarily its magnitude without further assumptions.1 This is an active area of research, and it is likely that methodologic advances will enable investigators not only to detect but also to minimize bias using negative control outcomes in a similar way that a placebo group removes the placebo effect. In addition, prespecification of negative control outcomes could prevent the selective presentation of favorable results.4
Negative controls are a simple and powerful tool with potential for broad application. Trials have used negative control exposures (placebos) for decades to reduce bias when estimating the effects of treatment. Trialists should similarly add negative control outcomes to their approaches for study design. In particular, the use of prespecified negative control outcomes could potentially improve the quality of evidence from trials that report additional analyses beyond intention-to-treat effects and those that have weaknesses (inescapable in many settings) such as lack of blinding, subjective outcomes, or differential attrition.
Acknowledgments
Funding/Support: National Institute of Allergy and Infectious Diseases grant K01-AI119180 (Dr Arnold) and National Institute of Child Health and Human Development grant R01-HD078912 (Dr Ercumen).
Footnotes
Conflict of Interest Disclosures: Both authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest and none were reported.
Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Contributor Information
Benjamin F. Arnold, Division of Epidemiology, School of Public Health, University of California-Berkeley.
Ayse Ercumen, Division of Epidemiology, School of Public Health, University of California-Berkeley.
References
- 1.Lipsitch M, Tchetgen Tchetgen E, Cohen T. Negative controls: a tool for detecting confounding and bias in observational studies. Epidemiology. 2010;21(3):383–388. doi: 10.1097/EDE.0b013e3181d61eeb. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Arnold BF, Ercumen A, Benjamin-Chung J, Colford JM., Jr Negative controls to detect selection bias and measurement bias in epidemiologic studies. Epidemiology. 2016;27(5):637–641. doi: 10.1097/EDE.0000000000000504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Rozé J-C, Cambonie G, Marchand-Martin L, et al. Hemodynamic EPIPAGE 2 Study Group. Association between early screening for patent ductus arteriosus and in-hospital mortality among extremely preterm infants. JAMA. 2015;313(24):2441–2448. doi: 10.1001/jama.2015.6734. [DOI] [PubMed] [Google Scholar]
- 4.Prasad V, Jena AB. Prespecified falsification end points: can they validate true observational associations? JAMA. 2013;309(3):241–242. doi: 10.1001/jama.2012.96867. [DOI] [PubMed] [Google Scholar]
- 5.Dusetzina SB, Brookhart MA, Maciejewski ML. Control outcomes and exposures for improving internal validity of nonrandomized studies. Health Serv Res. 2015;50(5):1432–1451. doi: 10.1111/1475-6773.12279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hoff G, Grotmol T, Skovlund E, Bretthauer M Norwegian Colorectal Cancer Prevention Study Group. Risk of colorectal cancer seven years after flexible sigmoidoscopy screening: randomised controlled trial. BMJ. 2009;338:b1846. doi: 10.1136/bmj.b1846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wood L, Egger M, Gluud LL, et al. Empirical evidence of bias in treatment effect estimates in controlled trials with different interventions and outcomes: meta-epidemiological study. BMJ. 2008;336(7644):601–605. doi: 10.1136/bmj.39465.451748.AD. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ercumen A, Naser AM, Unicomb L, Arnold BF, Colford JM, Jr, Luby SP. Effects of source- versus household contamination of tubewell water on child diarrhea in rural Bangladesh: a randomized controlled trial. PLoS One. 2015;10(3):e0121907. doi: 10.1371/journal.pone.0121907. [DOI] [PMC free article] [PubMed] [Google Scholar]