Abstract
Rosenbaum and Rubin’s (1983) propensity score revolutionized the field of causal inference and has emerged as a standard tool when researchers reason about cause-and-effect relationship across many disciplines. This discussion centers around the key “no interference” assumption in Rosenbaum and Rubin’s original development of the propensity score and reviews some recent advances in extending the propensity score to studies involving dependent happenings.
Keywords: Causal inference, Cluster-level propensity score, Interference
Four decades ago, Paul Rosenbaum and Donald Rubin wrote one of the most consequential articles about drawing causal inference from non-randomized observational data. Rosenbaum and Rubin examined the key role of “assignment mechanism” in non-randomized studies and developed a suite of propensity-score-based tools in Rosenbaum and Rubin (1983) and subsequent works, including propensity score matching (Rosenbaum and Rubin, 1983, 1985), propensity score subclassification (Rosenbaum and Rubin, 1984), and weighting/direct adjustment based on the propensity score (Rosenbaum, 1987). Propensity score is also a key element in doubly-robust estimators, such as augmented inverse probability weighted estimators (Robins and Rotnitzky, 1995) and targeted maximum likelihood estimation (Van der Laan and Rose, 2011). Propensity-score-based methods have now become standard tools in many branches of medical research, epidemiology, and social science, and have contributed to many important scientific endeavors. For instance, a propensity score based analysis was recently used to assess the effectiveness of the mRNA vaccines against SARS-CoV-2 infection requiring hospitalization, ICU admission, or an emergency department or urgent care clinic visit using electronic health records data (Thompson et al., 2021).
A key assumption made in Rosenbaum and Rubin (1983) is the no interference assumption. According to Cox (1958), there is no interference if “the observation on one unit [is] unaffected by the particular assignment of treatments to the other units.” This no interference assumption is a key element in Rubin’s “stable unit-treatment value assumption” or SUTVA (Rubin, 1980). The no interference assumption is often not plausible in the infectious disease context; for instance, Rubin (1990, Page 475) wrote that “interference between units can be a major issue when studying medical treatments for infectious disease.” In a series of articles, Halloran and Struchiner (1995), Sobel (2006) and Hong and Raudenbush (2006) discussed how to relax the no interference assumption under the potential outcomes framework. Rosenbaum (2007) distinguished between two sharp null hypotheses under interference, the null hypothesis of no primary effect and that of no effect. Rosenbaum (2007) also developed randomization-based inferential procedures for them. Hudgens and Halloran (2008) formally defined different kinds of causal effects of interest, including direct, indirect (spillover), total, and overall effects, under a partial interference context (Sobel, 2006) where individuals are partitioned into clusters, each with size , such that there is no interference between individuals in different clusters. Hudgens and Halloran (2008) also proposed unbiased estimators for these estimands under an experimental design using two-stage randomization.
A natural question then emerged: Can researchers still identify these scientifically meaningful causal effects (e.g., direct and indirect effects) in a non-experimental setting using observational data, and if so, under what assumptions? In the absence of two-stage randomization, researchers, not surprisingly, turned their attention to the propensity score. Tchetgen Tchetgen and VanderWeele (2012) first discussed how to estimate the direct, indirect, total, and overall effects using data from observational studies in a partial interference setting. Their key identification assumption is a cluster-level generalization of Rosenbaum and Rubin’s (1983) strong ignorability assumption:
| (1) |
where is the treatment assignment of all individuals in cluster is one realization in the set of all possible length- treatment assignments is the observed pre-treatment covariates including both cluster- and individual-level covariates, and is the collection of all possible potential outcomes of individuals under all possible treatment allocations in . The cluster-level propensity score describes the relationship between observed treatment assignments and covariates, and is the probability of the joint treatment statuses of all individuals within the cluster. The cluster-level propensity score collapses to Rosenbaum and Rubin’s (1983) original propensity score when ; it also collapses to the original propensity score when , i.e., when there are only two possible treatment allocations for each cluster: assigning each individual to 1 or each to 0 (Zubizarreta and Keele, 2017).
The cluster-level propensity score may be modeled using a mixed effects model that takes into account non-null associations among individuals in the same cluster as follows (Tchetgen Tchetgen and VanderWeele, 2012; Perez-Heydrich et al., 2014):
| (2) |
where is a cluster-level random effect with density function , and is an individual-level propensity score, i.e., probability that individual in cluster receives the treatment conditional on her covariates (possibly including cluster-level covariates) and the cluster-level random effect . The cluster-level propensity score is parameterized by , and is estimated by using maximum likelihood.
Under Assumption (1), a cluster-level analogue of the positivity assumption for any , and assuming model (2) is correctly specified, Tchetgen Tchetgen and VanderWeele (2012) showed that
| (3) |
are unbiased estimators of the cluster-level average potential outcome for and the cluster-level marginal average potential outcome, respectively. In expression (3), denotes the conditional probability and denotes the probability , both under a “stochastic policy” parameterized by . The conditional probability equals the unconditional probability when the stochastic policy treats participants independently but not otherwise (VanderWeele and Tchetgen Tchetgen, 2011). For instance, if a policy under evaluation treats individuals in a cluster independently from each other and with the same probability , then . Averaging and over clusters, one obtains and , from which one may readily construct estimators for the population average direct effect of the treatment policy parameterized by , indirect effect that compares two policies parameterized by and holding an individual’s treatment status fixed at , total effect , and overall effect .
Perez-Heydrich et al. (2014) applied these estimators to a study of the effectiveness of cholera vaccine in Matlab, Bangladesh. While vaccine was individually randomly assigned in the cholera vaccine trial, individuals self-selected into participation in the vaccine trial, making the actual vaccination status observational and subject to confounding bias. Moreover, the formulation of clusters was post hoc, so the vaccine coverage within clusters was non-randomized. Given the observational nature of the data, Perez-Heydrich et al. (2014) assumed individuals’ vaccination statuses in each neighborhood were “ignorable” conditional on age and distance to the nearest river. Perez-Heydrich et al. (2014) found that the estimated direct effect in general decreased as vaccine coverage increased, and that the indirect effect comparing and seemed to increase as increased.
There are many possible directions to extend the framework of Tchetgen Tchetgen and VanderWeele (2012) and Perez-Heydrich et al. (2014) using inverse probability weighted estimators under interference. Liu et al. (2016) extended IPW estimators from a partial interference regime to a general interference regime where each individual is allowed to have their own interference set. Liu et al. (2016) proposed IPW and stabilized IPW estimators, and established their asymptotic normality in a partial interference setting. In a subsequent article, Liu et al. (2019) further proposed doubly-robust estimators for various causal estimands under interference. Liu et al. (2019)’s doubly-robust estimators combine IPW and outcome-regression-based estimators and are consistent and asymptotically normal if either model, but not necessarily both, is correctly specified. Park and Kang (2020) showed one of Liu et al.’s doubly robust estimators is locally efficient. Park and Kang (2020) also considered non-parametric data-adaptive estimators for the partial interference setting which utilize machine learning methods to estimate the propensity score and outcome regression models. Papadogeorgou et al. (2019) and Barkley et al. (2020) considered causal estimands that allow for within-cluster dependence in the individual treatment selection and developed corresponding IPW estimators. Chakladar et al. (2022) developed IPW estimators that accommodate partial interference and right censoring. Zigler and Papadogeorgou (2021) considered IPW estimators that allow for interference in the bipartite causal inference setting where treatments are assigned to units that are distinct from those on which outcomes are measured.
Causal inference with interference is a rapidly growing research area. Above we have summarized some extensions of the propensity score to the interference setting, primarily focusing on inverse probability weighted estimators. Extensions of propensity score-based subclassification estimators which allow for interference have also been developed (e.g., Hong and Raudenbush, 2006, Forastiere et al., 2021). Likewise, estimators that utilize propensity score matching have also been proposed for the interference setting (e.g., Forastiere et al., 2018, Zirkle et al., 2021). While much progress has been made in relaxing the no interference assumption, there is still much room for future research that extends the use of the propensity score proposed by Rosenbaum and Rubin (1983) to the setting of interference.
Acknowledgments
This work was supported in part by NIH grant R01 AI085073. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
Contributor Information
Bo Zhang, Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington.
Michael G. Hudgens, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
M. Elizabeth Halloran, Department of Biostatistics, University of Washington, and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington.
References
- Barkley Brian G, Hudgens Michael G, Clemens John D, Ali Mohammad, and Emch Michael E. Causal inference from observational studies with clustered interference, with application to a cholera vaccine study. The Annals of Applied Statistics, 14(3):1432–1448, 2020. [Google Scholar]
- Chakladar Sujatro, Rosin Samuel, Hudgens Michael G., Halloran M. Elizabeth, Clemens John D., Ali Mohammad, and Emch Michael E.. Inverse probability weighted estimators of vaccine effects accommodating partial interference and censoring. Biometrics, 78(2):777–788, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cox David Roxbee. Planning of Experiments. Wiley, 1958. [Google Scholar]
- Forastiere Laura, Mealli Fabrizia, Wu Albert, and Airoldi Edoardo. Estimating causal effects under interference using bayesian generalized propensity scores. arXiv preprint arXiv:1807.11038, 2018. [Google Scholar]
- Forastiere Laura, Airoldi Edoardo M, and Mealli Fabrizia. Identification and estimation of treatment and interference effects in observational studies on networks. Journal of the American Statistical Association, 116(534):901–918, 2021. [Google Scholar]
- Halloran M Elizabeth and Struchiner Claudio J. Causal inference in infectious diseases. Epidemiology, pages 142–151, 1995. [DOI] [PubMed] [Google Scholar]
- Hong Guanglei and Raudenbush Stephen W. Evaluating kindergarten retention policy: A case study of causal inference for multilevel observational data. Journal of the American Statistical Association, 101(475):901–910, 2006. [Google Scholar]
- Hudgens Michael G and Halloran M Elizabeth. Toward causal inference with interference. Journal of the American Statistical Association, 103(482):832–842, 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Lan, Hudgens Michael G, and Becker-Dreps Sylvia. On inverse probability-weighted estimators in the presence of interference. Biometrika, 103(4):829–842, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Lan, Hudgens Michael G, Saul Bradley, Clemens John D, Ali Mohammad, and Emch Michael E. Doubly robust estimation in observational studies with partial interference. Stat, 8(1):e214, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Papadogeorgou Georgia, Mealli Fabrizia, and Zigler Corwin M. Causal inference with interfering units for cluster and population level treatment allocation programs. Biometrics, 75(3):778–787, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park Chan and Kang Hyunseung. Efficient semiparametric estimation of network treatment effects under partial interference. arXiv preprint arXiv:2004.08950, 2020. [Google Scholar]
- Perez-Heydrich Carolina, Hudgens Michael G, Halloran M Elizabeth, Clemens John D, Ali Mohammad, and Emch Michael E. Assessing effects of cholera vaccination in the presence of interference. Biometrics, 70(3):731–741, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robins James M and Rotnitzky Andrea. Semiparametric efficiency in multivariate regression models with missing data. Journal of the American Statistical Association, 90(429):122–129, 1995. [Google Scholar]
- Rosenbaum Paul R. Model-based direct adjustment. Journal of the American statistical Association, 82(398):387–394, 1987. [Google Scholar]
- Rosenbaum Paul R. Interference between units in randomized experiments. Journal of the American Statistical Association, 102(477):191–200, 2007. [Google Scholar]
- Rosenbaum Paul R and Rubin Donald B. The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1):41–55, 1983. [Google Scholar]
- Rosenbaum Paul R and Rubin Donald B. Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association, 79(387):516–524, 1984. [Google Scholar]
- Rosenbaum Paul R and Rubin Donald B. Constructing a control group using multivariate matched sampling methods that incorporate the propensity score. The American Statistician, 39(1):33–38, 1985. [Google Scholar]
- Rubin Donald. Discussion of “Randomization analysis of experimental data in the Fisher randomization test” by D. Basu. Journal of the American Statistical Association, 75:591–593, 1980. [Google Scholar]
- Rubin Donald B. Comment: Neyman (1923) and causal inference in experiments and observational studies. Statistical Science, 5(4):472–480, 1990. [Google Scholar]
- Sobel Michael E. What do randomized studies of housing mobility demonstrate? Causal inference in the face of interference. Journal of the American Statistical Association, 101(476):1398–1407, 2006. [Google Scholar]
- Tchetgen Eric J Tchetgen and VanderWeele Tyler J. On causal inference in the presence of interference. Statistical Methods in Medical Research, 21(1):55–75, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thompson Mark G, Stenehjem Edward, Grannis Shaun, Ball Sarah W, Naleway Allison L, Ong Toan C, DeSilva Malini B, Natarajan Karthik, Bozio Catherine H, Lewis Ned, et al. Effectiveness of Covid-19 vaccines in ambulatory and inpatient care settings. New England Journal of Medicine, 385(15):1355–1371, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van der Laan Mark J and Rose Sherri. Targeted Learning: Causal Inference for Observational and Experimental Data, volume 10. Springer, 2011. [Google Scholar]
- VanderWeele Tyler J and Tchetgen Eric J Tchetgen. Effect partitioning under interference in two-stage randomized vaccine trials. Statistics & Probability Letters, 81(7):861–869, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zigler Corwin M and Papadogeorgou Georgia. Bipartite causal inference with interference. Statistical Science, 36(1):109, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zirkle Keith W, Bind Marie-Abele, Swall Jenise L, and Wheeler David C. Addressing spatially structured interference in causal analysis using propensity scores. arXiv preprint arXiv:2101.09297, 2021. [Google Scholar]
- Zubizarreta José R. and Keele Luke. Optimal multilevel matching in clustered observational studies: A case study of the effectiveness of private schools under a large-scale voucher system. Journal of the American Statistical Association, 112(518):547–560, 2017. [Google Scholar]
