Currently, there is great interest in identifying safe and effective existing drug therapies for the prevention of coronavirus-2019 (COVID-19). Recently, there is a surge of observational studies, using big data, examining safety and effectiveness of proton pump inhibitors (PPIs) or histamine-2 receptor antagonists (H2RAs) on the risk of new onset COVID-19.1, 2, 3, 4, 5, 6 The nature of the disease, limitations of large databases, and use of different methodologic approaches have created unique challenges that might have contributed to contradictory studies addressing this question.
Although some recent studies, especially those that tightly controlled for confounding bias, have not found an association,1 , 2 , 5 others have shown either a protective3 , 4 or a harmful6 association with PPIs or H2RAs in COVID-19. These contradictory results can be misleading to clinicians, policymakers, and patients, leading to unnecessary prescribing of PPIs or H2RAs for COVID-19 prevention. Here we discuss potential biases that might have led to these contradictory findings including confounding bias, immortal time bias, selection bias, reverse causality bias, and sparse data bias using examples from published studies and causal directed acyclic graphs (cDAGs). cDAGs are graphical tools that can enable clinician researchers to visualize and better understand the structure of epidemiological biases and allow them to critique the methodologic rigor of observational studies addressing COVID-19 in the area of gastroenterology.7, 8, 9
Specific Biases in Epidemiologic Studies of Acid Suppressing Drug and COVID-19 Studies
The structure of a number of biases that might have affected the studies of gastric-suppressing drugs in COVID-19 can be demonstrated using cDAGs. A more comprehensive discussion on cDAGs is beyond the scope of this review, but can be found elsewhere.10 , 11 In brief, cDAGs provide a blueprint of all pertinent variables that play a role in causal questions. Two variables, A and B, are connected with an arrow tail that starts from A and ends on variable B, meaning that A causes B. Variable A cannot cause itself; that is, cDAGs are acyclic. Adjacent arrows, regardless of direction, form paths in a cDAG. Finally, conditioning on a variable means restricting that to a subset of its values. Conditioning can be done at the data analysis stage (eg, regression adjustment) or study design stage (eg, restriction).
Confounding bias
Confounding bias is one of the most prevalent types of bias that can affect the validity of observational studies. In a recent retrospective cohort study, Freedberg et al3 demonstrated a protective effect of famotidine use on COVID-19 death or intubation (hazard ratio, 0.42; 95% confidence interval [CI], 0.21–0.85). The study attempted to control for confounding bias through propensity score (PS) matching, a statistical technique that attempts to balance measured confounders between famotidine users and nonusers. However, PS matching can only adjust for measured confounders and its usefulness depends on how well the PS model can balance measured confounders between the 2 groups.
The authors matched users and nonusers using a 5:1 nearest-neighbor matching strategy and a caliper of 0.2, which means that the PS scores for the 2 members of a matched set could have varied by 20%. They then assessed balance of confounding variables between famotidine users and nonusers in the original and matched sample using hypothesis testing where failure to reject the null hypothesis (P > .05) was used as a metric for good balance of confounders. This approach has been shown to introduce bias9 because hypothesis testing is a function of sample size, and with only 84 famotine users, a large P value can erroneously indicate optimal balance of confounders. In fact, in the matched sample, the percentage of the following variables—old age, diabetes, hypertension, and coronary artery disease—among famotidine users remained higher compared with nonusers (at 48%, 29%, 35%, and 11% vs. 43%, 25%, 30%, and 9%, respectively). Standardized mean differences (SMDs)12 are a useful and accepted metric for examining covariate balance after PS matching. As a general rule, a SMD of ≤10% is indicative of a good balance between variables.13 The percentage difference for variables age and hypertension in the matched sample in the Freedberg study was 5%, which translates into a SMD of 10.03% and 10.66%, respectively, which are both slightly >10%. The cohort study by Lee et al,5 which did not find an association between PPI use and COVID-19 also used a PS matching analysis and demonstrated a SMD of <10% for all the 14 covariates included in the PS analysis.
Moreover, a caliper of ±0.2 for PSs between matched users and nonusers can lead to residual confounding owing to the substantial variation between the PSs allowed by this approach. A caliper of 0.01–0.05 on the original scale14 or 0.2 of the standard deviation on the logit scale has been recommended. Residual confounding may explain the protective effect seen with famotidine use in this study.
Residual confounding in this study might have been further accentuated owing to a small ratio of the number of matched exposed to the number of PS model parameters, also referred to as the number of events per variable (EPV). It is recommended EPV to be ≥10 in logistic regression modelling,15 which for this specific case looks at the number of the exposed as the response variable in the PS model is the exposure. In this study of 84 famotidine users, the investigators included 21 variables as indicators of potential confounders in the estimation of PS model: 84/22 (21 covariates+1, the model intercept) which yields an EPV of 3.8. This number is much less than 10, suggesting the possibility of sparse data bias16 , 17 in the PS model coefficients, which can affect the validity of the PS estimation and yield sub-optimal balance of confounders (between famotidine users and nonusers), potentially leading to residual confounding.
Perhaps a more consequential effect of sparse data bias is on the magnitude of the effect size of epidemiologic studies. For example, in the study by Almario et al,6 those taking PPIs twice daily experienced an approximately 3.5 higher risk of developing COVID-19 compared with nonusers (odds ratio [OR], 3.67; 95% CI, 2.93–4.60). In this study, the EPV is computed as 198/32 = 6.2 (31 covariates+1) indicative of a low EPV resulting in possibly inflated ORs. Statistical techniques can be used to correct for sparse data bias and shrink the effect size.18
In another study, Elmunzer et al19 examined the preadmission exposure to acid-suppressing drugs and the odds of death or need for mechanical ventilation in patients who contract COVID-19. No benefit was observed in the adjusted analysis for H2RA or PPI use with respect to death. Similar to Freedberg et al,3 Elmunzer et al also used a PS-matched analysis using a 0.2 caliper which resulted in a risk difference of 9.72% (95% CI, 1.26%–18.2%). Again, a caliper of 0.2 might mean that the PS matched analysis might not have fully corrected for confounding bias.
Confounding bias might have also explained harmful association shown between PPI use and COVID-19. Almario et al6 undertook a survey in patients with gastrointestinal symptoms who had also contracted COVID-19. The study found that subjects using a PPI once a day had double the odds of contracting COVID-19 compared with nonusers (OR, 2.15; 95% CI, 1.90–2.44). The increase in risk observed might be due to unmeasured confounding such as chronic obstructive pulmonary disease (COPD) severity. This is reflected in Figure 1 A, where subjects with more severe COPD might experience more severe GI symptoms.20 Consequently, those with more severe GI symptoms are more likely to take PPIs. COPD severity is also a recognized risk factor for COVID-19 making it a classic confounder.
Immortal Time Bias
In a recent cohort, Mather et al22 followed 878 famotidine users and assessed the risk of COVID-19 mortality between this group and nonusers. By design, famotidine users could have received the drug within 7 days of COVID-19 screening or hospital admission. This means users had to be alive for ≤7 days (immortal period) before being eligible to receive the drug while nonusers could have incurred the outcome (death) at any time during the study period. This type of conditioning can introduce immortal time bias,21 which can lead to a spurious protective effect of a drug as shown by the study which reported an OR of 0.37 (95% CI, 0.16–0.86) consistent with a profound protective effect. The structure of immortal time bias in this example resembles selection bias. Selection bias generally occurs in cohort studies where 1 group of subjects (eg, famotidine users) are treated differently than the other group (nonusers). In this example, users were allowed to be alive ≤7 days before COVID-19 diagnosis, whereas nonusers could have died at any time. Thus, famotidine users with less severe comorbid conditions were more likely to be included in the study, leading to an observed protective effect secondary to selection bias as depicted in Figure 1 B.
In Figure 1 B, variable S represents having been selected into the study based on the 7-day conditioning step defined by the investigators. The arrows from famotidine use and variable U (which represents the unmeasured risk factor COPD severity) to variable S indicate famotidine users with less comorbid conditions (eg, less severe COPD) were more likely to be included in the study. The square around S reflects that the analysis is limited to famotidine users and nonusers included in the sample. Thus, S is a common effect of famotidine and U, also referred to a collider. Unlike in our confounding example where conditioning (through adjustment) on a confounder blocks its effect, conditioning on a collider actually introduces bias by opening a path that was previously closed by variable S. For example, the conditioning step in the study design (variable S) will open the path famotidine → S ← U → mortality. Through this path, famotidine use will be associated with COVID-19 mortality as a result of selection bias, whereas without this conditioning, according to the cDAG, no association exists between famotidine use and COVID-19 mortality. A time-dependent Cox regression analysis should have been used to avoid immortal time bias.
Selection bias might have also affected the study by Almario et al.6 In this study, 264,058 potential participants were identified, but only 48.8% patients agreed to participate. It is possible that patients who agreed to participate in the survey did so owing to monetary incentives, a proxy for low socioeconomic status, which can also affect their risk of contracting COVID-19.
Reverse causality bias
Reverse causality bias might explain the protective effect observed with PPI use in the study by Blanc et al.4 This was a case-control study among elderly hospitalized patients with an average age of 84 years. PPI use was assessed 15 days before COVID-19 diagnosis. The study found a protective effect with PPI use (OR, 0.43; 95% CI, 0.23-0.81) compared with non-PPI users. This profound protective effect might be explained by reverse causality bias. This bias refers to a situation where symptoms of the disease (which in the case of COVID-19 can start days or weeks before diagnosis) can affect drug use. For example, elderly patients in this study might have decided to discontinue using a PPI, possibly to decrease polypharmacy and maximize adherence, upon experiencing early COVID-19 symptoms and resorted to taking more essential medications such as antidiabetics or antihypertensives. This is reflected in Figure 1 C, where the arrow from COVID-19 to PPI use suggests that early COVID-19 symptoms can affect PPI use, representing reverse causality. Figure 1 C illustrates that a consequence of reverse causality is a confounded association between PPI use and COVID-19 diagnosis.
Summary and Conclusions
We have identified a number of biases including confounding bias, immortal time bias, selection bias and reverse causality bias that might have affected the results of PPI/H2Ras and COVID-19 diagnosis. These biases were identified based on the information available in the studies we have reviewed without having access to the study protocols or data. Thus, we cannot state with certainty if these biases actually affected the results.
In studies of acid-suppressing drugs in COVID-19 where confounding bias was appropriately controlled for, no benefit with these drugs was found. Clinicians should be cognizant of these biases when reviewing future studies on this topic.
Footnotes
Conflicts of interest The authors have made the following disclosures: Mr Sodhi and Drs Mansournia and Nazemipour have no conflicts to report. Dr Etminan has consulted in the ranitidine and cancer litigation.
References
- 1.Fan X., Liu Z., Miyata T. Effect of acid suppressants on the risk of COVID-19: a propensity score matched study using UK Biobank Acid Suppressants and the risk of COVID-19. Gastroenterology. 2021;160:155–158.e5. doi: 10.1053/j.gastro.2020.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yeramaneni S., Doshi P., Sands K. Famotidine use is not associated with 30-day mortality: a coarsened exact match study in 7158 hospitalized COVID-19 patients from a large healthcare system. Gastroenterology. 2021;160:919–921.e3. doi: 10.1053/j.gastro.2020.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Freedberg D.E., Conigliaro J., Wang T.C. Famotidine use is associated with improved clinical outcomes in hospitalized COVID-19 patients: a propensity score matched retrospective cohort study. Gastroenterology. 2020;159:373–375.e2. doi: 10.1053/j.gastro.2020.05.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Blanc F., Waechter C., Vogel T. Interest of proton pump inhibitors in reducing the occurrence of COVID-19: a case-control study. Preprints. 2020 2020050016. [Google Scholar]
- 5.Lee S.W., Ha E.K., Yeniova A. Severe clinical outcomes of COVID-19 associated with proton pump inhibitors: a nationwide cohort study with propensity score matching. Gut. 2021;70:76–84. doi: 10.1136/gutjnl-2020-322248. [DOI] [PubMed] [Google Scholar]
- 6.Almario C.V., Chey W.D., Spiegel B.M.R. Increased risk of COVID-19 among users of proton pump inhibitors. Am J Gastroenterol. 2020;115:1707–1715. doi: 10.14309/ajg.0000000000000798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mansournia M.A., Collins G.S., Nielsen R.O. CHecklist for statistical Assessment of Medical Papers: the CHAMP statement. Br J Sports Med. 2021 doi: 10.1136/bjsports-2020-103651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mansournia M.A., Collins G.S., Nielsen R.O. A CHecklist for statistical Assessment of Medical Papers (the CHAMP statement): explanation and elaboration. Br J Sports Med. 2021 doi: 10.1136/bjsports-2020-103652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Etminan M., Brophy J.M., Collins G.S. To adjust or not to adjust: the role of different covariates in cardiovascular observational studies. Am Heart J. 2021 Mar 12 doi: 10.1016/j.ahj.2021.03.008. [DOI] [PubMed] [Google Scholar]
- 10.Etminan M., Collins G.S., Mansournia M.A. Using causal diagrams to improve the design and interpretation of medical research. Chest. 2020;158:S21–S28. doi: 10.1016/j.chest.2020.03.011. [DOI] [PubMed] [Google Scholar]
- 11.Williamson E.J., Aitken Z., Lawrie J. Introduction to causal diagrams for confounder selection. Respirology. 2014;19:303–311. doi: 10.1111/resp.12238. [DOI] [PubMed] [Google Scholar]
- 12.Austin P.C. Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies. Pharm Stat. 2011;10:150–161. doi: 10.1002/pst.433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Austin P.C. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate Behav Res. 2011;46:399–424. doi: 10.1080/00273171.2011.568786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hernan M.A., Robins J.M. Chapman & Hall/CRC; Boca Raton, FL: 2020. Causal inference: what if. [Google Scholar]
- 15.Peduzzi P., Concato J., Kemper E. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49:1373–1379. doi: 10.1016/s0895-4356(96)00236-3. [DOI] [PubMed] [Google Scholar]
- 16.Greenland S., Mansournia M.A. Penalization, bias reduction, and default priors in logistic and related categorical and survival regressions. Stat Med. 2015;34:3133–3143. doi: 10.1002/sim.6537. [DOI] [PubMed] [Google Scholar]
- 17.Mansournia M.A., Geroldinger A., Greenland S. Separation in logistic regression: causes, consequences, and control. Am J Epidemiol. 2018;187:864–870. doi: 10.1093/aje/kwx299. [DOI] [PubMed] [Google Scholar]
- 18.Greenland S., Mansournia M.A., Altman D.G. Sparse data bias: a problem hiding in plain sight. BMJ. 2016;352:i1981. doi: 10.1136/bmj.i1981. [DOI] [PubMed] [Google Scholar]
- 19.Elmunzer B.J., Wolf B.J., Scheiman J.M. Association between pre-admission acid suppressive medication exposure and severity of illness in patients hospitalized with COVID-19. Gastroenterology. 2020 Nov 20 doi: 10.1053/j.gastro.2020.11.007. [Epub ahead of print] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lee A.L., Goldstein R.S. Gastroesophageal reflux disease in COPD: links and risks. Int J Chron Obstruct Pulmon Dis. 2015;10:1935. doi: 10.2147/COPD.S77562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mather J.F., Seip R.L., McKay R.G. Impact of famotidine use on clinical outcomes of hospitalized patients with COVID-19. Am J Gastroenterol. 2020;115:1617–1623. doi: 10.14309/ajg.0000000000000832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Suissa S. Immortal time bias in pharmacoepidemiology. Am J Epidemiol. 2008;167:492–499. doi: 10.1093/aje/kwm324. [DOI] [PubMed] [Google Scholar]