Abstract
Scholars are often interested in evaluating the causal effects of a recurring exposure (e.g., family violence) on behavioral and psychological outcomes. However, causal inference of recurring exposures is challenging. Conventional analytic approaches target causal quantities lacking practical relevance, such as mandating everyone to uniformly always be exposed or unexposed to family violence. Estimation further relies on everyone having a non-zero probability of being either exposed or unexposed at each occurrence, which is frequently unrealistic when past exposures perfectly predict future exposures. In this paper, we introduce a novel approach from the causal inference literature for drawing causal conclusions about recurring exposures: the incremental propensity score intervention (IPSI). IPSI frames causal questions more realistically by assessing how changing the propensity of a recurring exposure may influence an outcome. To facilitate the adoption of IPSI for recurring exposures, we develop an estimation procedure using lavaan, a widely used structural equation modeling software in R. We demonstrate the application of IPSI with a real-world dataset investigating the impact of recurring family violence on adolescent depression. IPSI requires fewer assumptions than existing approaches while offering more meaningful insights into the causal effects of recurring exposures.
Keywords: Causal inference, Incremental propensity score intervention (IPSI), Longitudinal data, Potential outcomes, Time-varying or treatment-dependent confounding
In the psychological and behavioral sciences, researchers often seek to evaluate the causal effects of exposures that occur repeatedly. Examples include, but are not limited to, individual behaviors such as social media use (Valkenburg, 2022) and socializing (Ren et al., 2022); lifestyle factors such as physical activity (Smith & Merwin, 2021) and sleep quality (Scott et al., 2021); and social stressors such as school bullying (Hymel & Swearer, 2015) and workplace aggression (Zhong et al., 2023). Common to these exposures is that they rarely occur only once. In reality, such exposures occur intermittently or periodically on multiple occasions and are innately recurring.
Various methods have been developed to assess the causal effects of recurring exposures. These include marginal structural models using inverse probability weighting (Cole & Hernán, 2008; Robins et al., 2000), parametric g-formula (Robins, 1986; Westreich et al., 2012), and structural nested mean models using g-estimation (Robins, 1994; Vansteelandt & Sjölander, 2016). These methods are widely applied in epidemiology and medical sciences (Clare et al., 2019; Mansournia et al., 2017) and are gaining traction in psychological and behavioral sciences (Kennedy et al., 2022; Thoemmes & Ong, 2015). These methods have recently been introduced to the psychological and behavioral research literature, such as g-estimation (Loh & Ren, 2023a) and parametric g-formula (Loh et al., 2024).
Common to these methods is their focus on causal effects comparing counterfactual scenarios where at each time point, everyone uniformly experiences the same exposure (Bonvini et al., 2023). While such effects may be precisely what researchers intend in specific contexts – for example, taking vs. not taking a prophylactic medication daily – their practical relevance can be limited in behavioral research. This is especially true when it is unrealistic for everyone’s exposure to be identical, such as the examples we provided above: social media use, socializing, and being the victim of bullying or aggression, among others. Therefore, while these methods are valuable tools for causal inference, their applicability in behavioral research warrants careful consideration of the context and implications of the causal effects.
In this article, we put forth a novel approach – unencumbered by this drawback – to evaluate the causal effects of recurring exposures: the incremental propensity score intervention (IPSI; Kennedy (2019)). IPSI frames causal queries using a realistic and practical question: How would changing the chances of a recurring exposure by a given amount lead to different average outcomes? IPSI has been applied in criminology to investigate the effect of homelessness on recidivism (Jacobs et al., 2024), and in epidemiology to investigate the effects of consuming vegetables on preeclampsia (Naimi et al., 2021) and of aspirin on the incidence of pregnancy (Rudolph et al., 2022). Loh and Ren (2024) offer an accessible introduction to IPSIs with a time-fixed exposure (i.e., measured at a single time point) for a behavioral sciences audience.
Our paper proceeds in three parts. First, we describe what the IPSI for recurring exposures is, and why this approach offers realistic and meaningful evaluations of the causal effects of recurring exposures. Second, we present the causal conditions that suffice to estimate the IPSI. Crucially, to make IPSI accessible to a broad audience of behavioral and psychological researchers, we implement the procedure using lavaan (Rosseel, 2012), a widely used and freely accessible structural equation modeling software in R (R Core Team, 2021). Finally, we illustrate IPSI for recurring exposures using a publicly available real-world dataset. To enhance readers’ understanding of IPSI, throughout the paper, we use a running example from psychological research: the causal effect of recurring family violence on depression. We explain how IPSI offers unique insights into the accumulative impact of exposure to family conflicts over time on the development of depression among adolescents. All the R scripts implementing the procedure are freely available online (https://osf.io/8ksfx/).
Using the IPSI to assess causal effects of recurring exposures
We will use a running example from psychological research to ease the exposition of IPSI. Suppose researchers are investigating the causal effect of family violence. Family violence is a recurring exposure because it occurs repeatedly on multiple occasions. Suppose the research interest is in whether reducing family violence (binary exposure ) at consecutive time points leads to lower depression (outcome Y) among adolescents. (To simplify our introduction of the IPSI, we first consider a single end-of-study outcome, and then generalize to repeatedly measured outcomes later.) The researchers seek to answer the causal query: How would depression change, on average, among adolescents if the chances of family violence over time could be hypothetically lowered?
To answer this question, we formally define what it means to lower the probability of family violence. Let denote an individual’s propensity score (Rosenbaum & Rubin, 1983b; Thoemmes & Ong, 2015; West et al., 2014) at time t, which is the conditional probability of being exposed to family violence given pre-exposure covariates (we elaborate on later). Now, suppose the researchers can shift the propensity of family violence from the naturally occurring quantity to a lower probability instead, e.g., through a hypothetical education program or legislative policy to prevent or mitigate domestic violence. Denote this shifted probability by , which can be expressed in terms of as:
| 1 |
This specific form was used because solving for yields the relationship:
| 2 |
Hence, , termed a shift parameter, encodes the odds ratio between the shifted propensity score and the naturally occurring propensity score . In other words, at time t, researchers can posit a value of , then multiply it with the odds for the (naturally occurring) propensity score to obtain the shifted odds in (1) for each individual. For example, when , the shifted risk of family violence will be lower than the naturally occurring risk. We emphasize that while the exposure (family violence) is binary, the propensity for it to occur is distributed along a continuum. In general, , hereafter referred to simply as IPSI, is termed an incremental propensity score intervention (Kennedy, 2019) because it characterizes the shifted propensity score following a hypothetical intervention that changes the naturally occurring odds by .
The accumulation of a recurring exposure over time can affect later outcomes that cannot be captured by a snapshot at a single time. Therefore, the causal query focuses on whether shifting the propensity for a recurring exposure throughout the population can change the average outcome. Continuing our example, researchers can conceive of hypothetically lowering the recurring propensity of family violence by a given amount and then evaluate the average potential depression following this reduction. Denote the IPSI for the recurring exposure under a posited by . Denote the potential outcome1 (Rubin, 1990; Splawa-Neyman et al., 1990) an individual would have had, possibly counter to fact, after experiencing the recurring exposures by .
For example, the potential depression following recurring family violence at time points is . We can assess the impact of an IPSI on the average depression using the following causal quantity:
| 3 |
Here, and denote the conditional expectation and probability function of a random variable A among those with a specific value(s) of the random variable(s) , respectively. Uppercase letters represent random variables or potential outcomes, while lowercase letters indicate specific or realized values. The support for the time-invariant (C) and the time-varying () covariates are denoted by and , respectively, and are assumed to be discrete. When a covariate is continuous, the summation and probability mass function may be replaced by an integral and probability density function, respectively; see Kennedy (2019) for more general settings.
In words, the IPSI causal estimand in (3), hereafter referred to as IPSICE for simplicity, is a cumulative weighted average of the potential outcomes over the recurring exposure, with weights at each time corresponding to the cumulative product of IPSI (up to that time) under given . Continuing our example, the causal quantity in (3) can be interpreted as the average potential depression when each individual’s odds of family violence at both times are multiplied by . Put another way, the causal quantity in (3) answers the questions: Would raising the propensity of being exposed to recurring family violence engender a subsequent increase in depression? Conversely, would reducing the propensity of being exposed to recurring family violence engender a subsequent decrease in depression?
More generally, the average potential outcome under an IPSI is:
| 4 |
Continuing our example, the causal quantity in (4) can be interpreted as the average potential depression when each individual’s odds of family violence at all times are multiplied by . Hence, setting in (4) renders the causal quantity in (3). Note that when , the IPSI reverts to the naturally occurring PS (i.e., ) and (4) is just the average observed outcome.
We briefly compare IPSICE with prevailing average causal effects (ACEs; Rubin (1974)) that are (implicitly) targeted in conventional analyses. First, an ACE necessarily compares different counterfactual scenarios mandating everyone at the same exposure level(s). Continuing our example, an ACE may compare the average potential outcomes where, throughout the study, everyone is subject to family violence vs. no one is subject to family violence. Such extreme, uniform scenarios are untethered from reality because neither corresponds to what was actually observed. Recurring exposures, such as family violence, are intrinsically heterogeneous, with some individuals being exposed periodically while others might be exposed only sporadically.
Moreover, because interventions that can realistically achieve such counterfactual scenarios are rarely feasible at best or impossible at worst, the ACE’s interpretation has limited or no practical implications. In contrast, IPSICE integrates individual heterogeneity in the propensity of being exposed to family violence in its definition, and its interpretation is characterized by shifting each individual’s propensity score.
Second, everyone must have a non-zero probability of being exposed or not exposed to family violence at each time t; i.e., for all . This so-called positivity assumption (Westreich & Cole, 2010), is violated when certain combinations of the pre-exposure variables perfectly predict the exposure (e.g., producing the problem of complete or perfect separation when fitting a logistic regression model for the propensity score). This is particularly salient for recurring exposures in behavioral research because, frequently in naturalistic real-world settings, earlier exposures overwhelmingly influence future exposure. Continuing our example, prior exposure to family violence can systematically induce vulnerability to subsequent occurrences (e.g., all individuals with have a predicted probability of being precisely one). Violations of the positivity assumption are challenging to diagnose, but they can result in extrapolation for the ACE and potentially biased estimates at best and ill-defined effects in non-existent covariate strata at worst (Rudolph et al., 2022). In contrast, IPSI is not bound by this assumption because, for those whose probability of (non-)exposure is zero, their potential outcomes do not contribute to the average outcome.
The average potential outcomes defined in (3) and (4) are identifiable under certain assumptions common to almost all causal inferences from observational data. The first is causal or counterfactual consistency, stated as Assumption 1 of Kennedy (2019), which assumes that the observed treatments reveal the potential outcome as the observed outcome; i.e., . The second is no unmeasured confounding (Imbens & Rubin, 2015), where within strata defined by the unique values of the pre-exposure variables recorded in , the exposure and potential outcomes are conditionally independent. This can be formally stated, following Assumption 2 of Kennedy (2019), as:
| 5 |
for all possible values , where denotes the conditional independence of A and B given C. When both Assumptions 1 and 2 of Kennedy (2019) are met, the causal quantity can be consistently estimated from observed non-causal data.
Because is a recurring exposure, the variables in causally preceding include time-invariant covariates (C), past exposures () and the time-varying covariate history (). Therefore, . In practice, outcomes are also repeatedly measured over time in longitudinal designs with recurring exposures. With repeatedly measured outcomes, the definitions and results above readily extend by considering the outcome at each time point as the final outcome one at a time. However, now includes the outcome and exposure history (Loh & Ren, 2023b), as well as the pre-exposure covariates. For example, if within each time point t, causally precedes , then . Alternatively, if within each time point t, causally precedes , then .
Consistent estimation of the IPSI causal quantities is contingent on no unmeasured confounding (5) being met. Although no sensitivity analyses specific to IPSI are currently available, existing methods already developed for other causal effect estimands may be extended for IPSI. One promising possibility is to adapt the sensitivity analysis of Bonvini and Kennedy (2022) by permitting a proportion of the population to be subject to unmeasured confounding and then reanalyzing the data under that scenario. Here, we propose another approach previously put forth in other contexts (Harring et al., 2017; Loh, 2024) that is amenable to use in lavaan. Because unmeasured confounding manifests in correlated exposure and outcome residuals, we can encode their correlation as a sensitivity parameter (Lin et al., 1998; Rosenbaum & Rubin, 1983a). Stronger correlations indicate more severe violations of unmeasured confounding, whereas a zero correlation corresponds to no violations. Hence, we leverage lavaan to mimic unmeasured confounding using correlated residuals as follows: First, fix the correlation between each exposure and a causally downstream outcome at a given value as part of the lavaan model. (Residual covariances are readily specified in lavaan using two adjoining tilde symbols, i.e., “”.) Next, fit the model with this fixed correlation to the observed data and estimate the IPSICE under this constraint. Repeating both steps using different fixed values of the sensitivity parameter permits systematically judging the extent to which unmeasured confounding alters the results. We illustrate its implementation using real-world data in the next section.
Illustration using real-world data
We illustrate IPSI using data from a real-world study: the Flint Adolescent Study. This longitudinal study, beginning in 1994, followed ninth graders from four public high schools in Flint, Michigan (Zimmerman, 2014). For illustration, we will use data from three study waves (hereafter time points ) to investigate whether reducing family violence leads to decreased depression over time. Both variables were repeatedly recorded at each time point.
We utilized three waves of the survey: , and . The exposure (family violence) was a composite of five dichotomous variables measuring students’ responses to whether the following family situations occurred at least once in a while or hardly ever: (i) “We fight in our family,” (ii) “Family members get so angry they throw things,” (iii) “Family members lose their tempers,” (iv) “Family members criticize each other,” and (v) “Family members hit each other in anger.” The exposure took on a value of if any situation occurred at least once in a while around time t or a value of if all these situations hardly ever occurred around time t. The outcome (depression) was measured using the mean score for five items on how much the student felt uncomfortable because of the following problems during the past week (including the day of the interview): (i) “had thoughts of ending your life,” (ii) “felt blue,” (iii) “felt no interest in things,” (iv) “felt hopeless about the future,” and (v) “felt worthless.” Responses were on a five-point Likert scale (1=Not at all uncomfortable; 5=Extremely uncomfortable).
To simplify our illustration, we considered a single time-varying covariate : stress levels, measured using the mean score of four items on how much stress they felt in different areas of their life (at school, at work, with friends, with personal issues or concerns) during the past 6 months. Responses were on a five-point Likert scale (1=Very low stress; 5=Very high stress). The baseline covariates C were the following self-reported variables: age cohort (1 if their birth year was 1980, 0 if earlier than 1980), gender (1 for female, 0 for male), race (1 for Black or African American, 0 otherwise), and family socioeconomic status (1 if their family received income support, 0 otherwise). A snapshot of the data for six students is presented in Table 1. To simplify the illustration, we considered the 753 students with complete data on these variables.
Table 1.
Observed data for six students in the illustration
| ID | Gender | Race | Age | Income | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8 | 1 | 1 | 1 | 1 | 1.12 | 1 | 1.40 | 0.26 | 1 | 1.40 | -1.37 | 1 | 1.20 |
| 16 | 0 | 0 | 0 | 1 | 0.46 | 1 | 1.00 | 1.51 | 1 | 1.20 | 0.38 | 1 | 1.00 |
| 24 | 1 | 1 | 0 | 1 | -1.21 | 0 | 1.00 | 0.76 | 0 | 1.60 | -1.12 | 1 | 3.40 |
| 32 | 0 | 0 | 0 | 1 | -0.54 | 1 | 1.00 | -0.74 | 1 | 1.40 | 0.30 | 1 | 1.80 |
| 40 | 0 | 1 | 1 | 1 | 0.04 | 1 | 1.40 | 0.76 | 1 | 3.20 | -0.37 | 1 | 1.60 |
| 48 | 1 | 1 | 0 | 0 | 0.54 | 1 | 3.20 | -1.24 | 1 | 1.20 | 0.63 | 1 | 3.20 |
Note. “Income” = family socioeconomic status; whether family violence occurred at least once in a while (1) or hardly ever occurred (0) at time t; stress at time t (mean-centered); recent depression at time t. The definitions of each variable are in the main text. Values for continuous and were rounded to two decimal places
Ninety percent of the students were subject to repeated family violence (in at least two time points), with 49% being subject throughout the study. This emphasizes that exposure to family violence was innately recurring, with each additional occurrence adding to the cascade of future occurrences. Crucially, the recurring exposures to family violence are so tightly interlinked that isolating the effect at a single time point would be artificial and pointless.
To check whether positivity was empirically violated, we plotted the predicted propensity scores for both exposure groups ( and ) at each time and , in Figs. 1 and 2, respectively. We stratified the propensity scores based on previous exposure(s) to better illustrate the recurring nature of family violence. Note how some students exposed to family violence () had propensity scores very close to one, more so than those unexposed to family violence () who tended to have propensity scores closer to zero. The lack of overlap in interquartile ranges between the two exposure groups was most pronounced at among those previously exposed to family violence but not earlier (); see the bottom left panel of Fig. 2.
Fig. 1.
Estimates of the naturally occurring propensity scores at for each exposure group, stratified by the previous exposure , in the family violence example. The proportions of those observed with either or across strata defined by are stated in brackets on the horizontal axis
Fig. 2.
Estimates of the naturally occurring propensity score at for each exposure group, stratified by the previous exposures and , in the family violence example. Note. . The proportions of those observed with either or across strata defined by are stated in brackets on the horizontal axis
We carried out the estimation procedure described in Appendix A. Briefly, for the propensity scores , we fitted logistic models with main effects and two-way interaction terms for all the predictors in . An example of the specified logistic regression model for is formally stated in (A.1) of the Appendix. For the outcomes , we fitted regression models with main effects for and two-way interaction terms between each exposure instance , and each variable in excluding that exposure, i.e., . Examples of the specified regression models for and are formally stated in (A.4) and (A.5), respectively, of the Appendix. The regression models for repeated outcomes were fitted jointly using lavaan. For the IPSI, we posited a discrete grid of . Estimates of the average outcomes at each of the three time points in terms of are displayed in Fig. 3.
Fig. 3.
Estimates of the incremental propensity score intervention causal estimand (IPSICE) for the outcome at each time point in the family violence example. Note. Each circle corresponds to an IPSICE estimate, and each vertical line corresponds to the 95% confidence interval (CI), for a given value of on the horizontal axis. A logarithmic horizontal axis was used for the evenly spaced posited values of on a binary logarithm scale. The horizontal dashed and dotted lines corresponded to the average observed outcome and 95% CI, respectively, under . Values of whose CIs did not overlap the latter were indicated in red
There was no effect of family violence on average depression in the first two times. However, there was an effect of recurring family violence on average depression at the third time point. Multiplying the odds of family violence by a factor of at all three times reduced subsequent average observed depression by 0.44 points on a five-point scale. We further illustrated a sensitivity analysis of unmeasured confounding for IPSI, leveraging lavaan as described in the preceding section. To simplify the analysis, we set the correlations between the residuals for family violence and depression to be greater than 0.3, allowing for mild to strong residual confounding. The results are plotted in Fig. 5 of the Appendix, and suggest that the odds of family violence must be lowered over time to reduce average depression, albeit not statistically significantly, even after accounting for unmeasured confounding. There is no single time point at which family violence can be shifted to reduce average depression; only when the full array of the recurring exposure is shifted can average depression be reduced. These results suggested that to reduce average depression among adolescents, it is inadequate to merely lower family violence at a single time point because enduring effort is required for its impact to accumulate over time and emerge.
Fig. 5.
Estimates of the incremental propensity score intervention causal estimand (IPSICE) for the outcome at each time point in the family violence example, after using correlated exposure and outcome residuals to mimic unmeasured confounding. Note. Descriptions of each panel are provided as part of Fig. 3 in the main text
Discussion
In this paper, we introduced the IPSICE for recurring exposures. We implemented an estimator that leverages lavaan, a software widely used by psychological and behavioral researchers.2 A major advantage of using lavaan is access to its various features when fitting a set of outcome models jointly (see step 4a of the estimation procedure in the Appendix). We describe two examples relevant to IPSICE. First, parameter constraints can be easily introduced in the lavaan model syntax. For example, the effect of on can be readily constrained to be constant across times t; similarly, the autoregressive effects of on can also be constrained to be stable over time. Second, intermittently missing data can be straightforwardly handled within lavaan. For example, “full information” maximum likelihood estimation (Lee & Shi, 2021) can be readily utilized (by using the argument missing = “ML”) after establishing the missing mechanism is missing at random (MAR; Yuan and Bentler (2010)).
In practice, the posited values of should be based on prior empirical results, established theory, or policy guidelines or recommendations. We offer two practical suggestions. First, researchers should consider a broad range of values to produce IPSIs with substantively different magnitudes. Second, researchers should be aware that very small or large values can lead to individual chances of exposure being close to zero or one, respectively. This may produce counterfactual scenarios where almost everyone is unexposed or exposed. To illustrate this point, we plotted the IPSIs for the family violence example in Fig. 4. Note now at , the average propensity score was close to zero; under this IPSI, only about 4–6% of individuals had chances of being exposed to family violence above 0.05. Therefore, researchers should use their domain or subject matter knowledge to evaluate whether or not such scenarios are realistic in the specific context of the exposure.
Fig. 4.
Estimates of the IPSIs at in the family violence example. Note. For a given value of on the horizontal axis, the estimated average is denoted by a cross, with each vertical line corresponding to a symmetric 80-percentile interval. A logarithmic horizontal axis was used for the evenly spaced posited values of on a binary logarithm scale
Conclusion
In conclusion, the IPSI approach is ideally suited for evaluating the causal effects of recurring exposures. By effectively adopting the IPSI in empirical analyses, we seek to contribute to more realistic and meaningful assessments of recurring exposures in psychological and behavioral research.
Acknowledgements
The authors thank the Editor, Dr. Sacha Epskamp, for constructive suggestions and valuable advice guiding the manuscript through the review process. The authors thank two anonymous referees for their insightful comments that helped improve the manuscript.
Appendix A: Estimating the IPSICE for recurring exposures
In this section, we describe how to empirically estimate the IPSICEs using observed data. The estimation procedure extends the algorithm in Web Appendix Section 1.3 of Kennedy (2019) for a single outcome to handle repeatedly measured outcomes simultaneously.
- For each exposure time point , in turn, fit a model for the naturally occurring propensity score to the observed data. A simple example of a propensity score model is a logistic regression model for given main effects and two-way interaction terms for the observed pre-exposure history in ; e.g., for and ,
The model in (A.1) may be fitted using, e.g., maximum likelihood estimation. Then, using the fitted model, obtain the predicted propensity score at time t for each individual, denoted by for simplicity. The logistic models specified in the illustration are included as R model formulae in the online supplemental materials.A.1 - Calculate the weights for all exposure time points for a given :
A.2 - Calculate the cumulative weight for :
A.3 - We now describe how to parsimoniously fit a joint model simultaneously for the repeated outcomes at all time points by leveraging lavaan (Rosseel, 2012). For notational simplicity, let denote the outcome at the first time point immediately following the exposure at time t. If within each time point t, causally precedes , then set ; otherwise if within each time point t, causally precedes , then set . For :
- Regress the (pseudo-)outcomes on and for . When , use the observed outcomes . For example, regression models with main effects for and two-way interaction terms between each exposure instance , and each variable in , for , are:
A.4
This set or system of regression models for the repeated outcomes can be readily fitted jointly or simultaneously with, e.g., maximum likelihood estimation, in lavaan (Rosseel, 2012). The regression models specified in the illustration are also included as R model formulae in the online supplemental materials.A.5 - Predict the individual potential outcomes for , under both exposure levels and :
The predictions can be computed using the jointly fitted models in the previous step, after setting the exposure levels to the respective hypothetical values of and retaining the observed values of pre-exposure covariates , with the lavPredictY() function in lavaan.A.6 - Calculate the individual pseudo-outcomes for :
A.7
- For the outcome at time , calculate for each individual, indexed by the superscript i, the following quantity:
The estimator of the IPSICE for the outcome at time is then the sample average over all N individuals:A.8
Let denote the percentile of a standard normal distribution; e.g., . An asymptotic Wald confidence interval (CI) can be constructed as:A.9
where .A.10
To evaluate whether the average potential outcome under a posited value of is statistically significantly different from the average observed outcome, compare the CIs for the IPSICEs under and , respectively. If the endpoints of the CIs are disjointed, it indicates that shifting the propensity score odds for the recurring exposure by produces a change in the average outcome.
Researchers should generally investigate the IPSICEs under different values for . For example, because parame-trizes an odds ratio, an evenly spaced grid on the binary logarithm (Weisstein, 2024) scale may be . Then, for each posited value of in turn, repeat steps 2 through 5 to estimate the IPSICE under the given . The estimates can then be plotted as a function of to visualize the pointwise trajectory as the IPSI parameterized by changes. Simultaneous confidence bands can be constructed by replacing in (A.10) with an estimate of the quantile of using the multiplier bootstrap (Bonvini et al., 2023; Kennedy, 2019). There is a significant effect (at level ) if the endpoints of the confidence bands for any two different are disjointed (Bonvini et al., 2023; Kennedy, 2019).
Simulation study
We conducted a simulation study to empirically evaluate the operating characteristics of the IPSI estimator using lavaan. Each observed dataset was simulated using the following data-generating process:
The true ACEs of the recurring exposure on outcomes and were and , respectively. The unobserved confounder was used to generate an association between and within each time point . The observed pre-exposure history for and for sufficed for no unmeasured confounding in (5) to hold at each time point. To align the overall chances of the exposure across both time points, the value of the intercept was determined by minimizing the absolute difference between the sample marginal or average propensity scores, i.e., . We used to encode the strength of dependence of on for a recurring exposure. We considered three possible values of and three sample sizes.
For the IPSI estimator, we specified the same propensity score as in the illustration. We assumed a logistic regression model for given all main and two-way interaction effects for the observed pre-exposure history in ; i.e., for , (A.1) simplified to:
and for ,
For the outcome models, we assumed linear regression models with only main effects for for simplicity; i.e.,
We extracted estimators of the ACEs from those of the IPSIs by taking the difference in the latter for and because these values corresponded to counterfactual scenarios where almost everyone is unexposed or exposed (Bonvini et al., 2023). For comparison, we fitted both outcome models jointly in lavaan, then used the maximum likelihood estimators of the linear regression coefficients and as estimators of the ACEs of X on and , respectively. The results for 1000 replications under each setting are shown in Table 2. First, both linear regression and IPSI estimators of the ACE on were unbiased in all settings. Second, the linear regression estimators of the ACE on were biased in all settings. While the IPSI estimator suffered from finite sample biases, the biases decreased as the sample size increased. Third, the IPSI estimators of the ACE on had smaller root mean squared errors than the linear regression estimators as the sample size increased. All the R scripts for replicating the results are available online (https://osf.io/8ksfx/).
Table 2.
Results of the simulation study comparing estimators of the average causal effects of a recurring exposure on and
| Bias | ESE | RMSE | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LR | IPSI | LR | IPSI | LR | IPSI | ||||||||
| N | |||||||||||||
| 400 | 0 | -0.00 | 0.57 | -0.01 | -0.06 | 0.19 | 0.25 | 0.33 | 2.03 | 0.19 | 0.62 | 0.33 | 2.03 |
| 6400 | 0 | -0.00 | 0.56 | 0.00 | -0.03 | 0.05 | 0.06 | 0.09 | 1.02 | 0.05 | 0.56 | 0.09 | 1.02 |
| 14,400 | 0 | -0.00 | 0.56 | -0.00 | -0.01 | 0.03 | 0.04 | 0.06 | 0.46 | 0.03 | 0.56 | 0.06 | 0.46 |
| 400 | 2 | 0.01 | 0.61 | 0.01 | -0.00 | 0.19 | 0.29 | 0.32 | 1.63 | 0.19 | 0.68 | 0.32 | 1.63 |
| 6400 | 2 | 0.00 | 0.62 | 0.00 | -0.02 | 0.05 | 0.07 | 0.09 | 0.58 | 0.05 | 0.63 | 0.09 | 0.58 |
| 14,400 | 2 | 0.00 | 0.62 | 0.00 | -0.01 | 0.03 | 0.05 | 0.06 | 0.53 | 0.03 | 0.62 | 0.06 | 0.53 |
| 400 | 4 | 0.00 | 0.66 | -0.01 | 0.08 | 0.19 | 0.30 | 0.31 | 0.97 | 0.19 | 0.73 | 0.31 | 0.98 |
| 6400 | 4 | 0.00 | 0.66 | 0.00 | 0.02 | 0.05 | 0.08 | 0.09 | 0.77 | 0.05 | 0.66 | 0.09 | 0.77 |
| 14,400 | 4 | -0.00 | 0.66 | -0.00 | 0.01 | 0.03 | 0.05 | 0.06 | 0.40 | 0.03 | 0.66 | 0.06 | 0.40 |
Note. ESE = empirical standard error; RMSE = root mean squared error; LR = linear regression; IPSI = incremental propensity score intervention
Appendix B: Sensitivity analysis of unmeasured confounding for IPSI in the illustrating example
Funding
Wen Wei Loh and Dongning Ren are partially supported by The Netherlands Organisation for Health Research and Development (ZonMw): 10430382410014. However, funding from the grant was not used for this project.
Data Availability
The data used in the illustration is publicly available at https://doi.org/10.3886/ICPSR34598.v1.
Code availability
The R scripts implementing the estimation procedure are publicly available online at https://osf.io/8ksfx/.
Declarations
Conflicts of Interest/Competing Interests
Not applicable.
Ethics Approval
Not applicable.
Consent to Participate
Not applicable.
Consent for Publication
Not applicable.
Open Practices Statements
The data used in the illustration is publicly available at https://doi.org/10.3886/ICPSR34598.v1. The R scripts implementing the estimation procedure are publicly available online at https://osf.io/8ksfx/.
Footnotes
A fundamental assumption for well-defined individual potential outcomes is the stable unit treatment value assumption (SUTVA; Rubin (1986)). SUTVA comprises two components (Imbens & Rubin, 2015): (i) no hidden treatment variations and (ii) no treatment interference. The former requires the (absence of) treatment received must be the same for everyone, and the latter requires that an individual’s potential outcome must be unaffected by treatments received by other individuals.
In principle, IPSI can be implemented with other structural equation modeling software, such as Mplus (Muthén & Muthén, 1998-2017) and OpenMx (Boker et al., 2011), but the R functions we developed to estimate the IPSICE and generate the resulting plots were designed for compatibility with lavaan.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Boker, S., Neale, M., Maes, H., Wilde, M., Spiegel, M., Brick, T., Spies, J., Estabrook, R., Kenny, S., Bates, T., Mehta, P., & Fox, J. (2011). OpenMx: An open source extended structural equation modeling framework. Psychometrika,76(2), 306–317. 10.1007/s11336-010-9200-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonvini, M., & Kennedy, E. H. (2022). Sensitivity analysis via the proportion of unmeasured confounding. Journal of the American Statistical Association,117(539), 1540–1550. 10.1080/01621459.2020.1864382 [Google Scholar]
- Bonvini, M., McClean, A., Branson, Z., & Kennedy, E. H. (2023). Incremental causal effects: An introduction and review. In J. R. Zubizarreta, E. A. Stuart, D. S. Small, & P. R. Rosenbaum (Eds.), Handbook of matching and weighting adjustments for causal inference (1st, pp. 349–372). Chapman; Hall/CRC. 10.1201/9781003102670
- Clare, P. J., Dobbins, T. A., & Mattick, R. P. (2019). Causal models adjusting for time-varying confounding-a systematic review of the literature. International Journal of Epidemiology,48(1), 254–265. 10.1093/ije/dyy218 [DOI] [PubMed] [Google Scholar]
- Cole, S. R., & Hernán, M. A. (2008). Constructing inverse probability weights for marginal structural models. American Journal of Epidemiology,168(6), 656–664. 10.1093/aje/kwn164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harring, J. R., McNeish, D. M., & Hancock, G. R. (2017). Using phantom variables in structural equation modeling to assess model sensitivity to external misspecification. Psychological Methods,22(4), 616–631. 10.1037/met0000103 [DOI] [PubMed] [Google Scholar]
- Hymel, S., & Swearer, S. M. (2015). Four decades of research on school bullying: An introduction. American Psychologist,70(4), 293–299. 10.1037/a0038928 [DOI] [PubMed] [Google Scholar]
- Imbens, G. W., & Rubin, D. B. (2015). Causal inference in statistics, social, and biomedical sciences. Cambridge: Cambridge University Press. [Google Scholar]
- Jacobs, L. A., McClean, A., Branson, Z., Kennedy, E. H., & Fixler, A. (2024). Incremental propensity score effects for criminology: An application assessing the relationship between homelessness, behavioral health problems, and recidivism. Journal of Quantitative Criminology. 10.1007/s10940-024-09582-7 [Google Scholar]
- Kennedy, E. H. (2019). Nonparametric causal effects based on incremental propensity score interventions. Journal of the American Statistical Association,114(526), 645–656. 10.1080/01621459.2017.1422737 [Google Scholar]
- Kennedy, T. M., Kennedy, E. H., & Ceballo, R. (2022). Marginal structural models for estimating the longitudinal effects of community violence exposure on youths’internalizing and externalizing symptoms. Psychological Trauma: Theory, Research, Practice, and Policy,15(6), 906–916. 10.1037/tra0001398 [DOI] [PubMed] [Google Scholar]
- Lee, T., & Shi, D. (2021). A comparison of full information maximum likelihood and multiple imputation in structural equation modeling with missing data. Psychological Methods,26(4), 466–485. 10.1037/met0000381 [DOI] [PubMed] [Google Scholar]
- Lin, D. Y., Psaty, B. M., & Kronmal, R. A. (1998). Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Biometrics,54(3), 948–963. 10.2307/2533848 [PubMed] [Google Scholar]
- Loh, W. W. (2024). Unmeasured mediator-outcome confounding distorts both indirect and direct effects. Proceedings of the National Academy of Sciences,121(47), Article e2418979121. 10.1073/pnas.2418979121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loh, W. W., & Ren, D. (2023). A tutorial on causal inference in longitudinal data with time-varying confounding using g-estimation. Advances in Methods and Practices in Psychological Science,6(3), 25152459231174028. 10.1177/25152459231174029 [Google Scholar]
- Loh, W. W., & Ren, D. (2023b). The unfulfilled promise of longitudinal designs for causal inference. Collabra: Psychology, 9(1), 89142. 10.1525/collabra.89142
- Loh, W. W., & Ren, D. (2024). The incremental propensity score approach for diversity science. Advances in Methods and Practices in Psychological Science,7(2), 25152459241240680. 10.1177/25152459241240681 [Google Scholar]
- Loh, W. W., Ren, D., & West, S. G. (2024). Parametric g-formula for testing time-varying causal effects: What it is, why it matters, and how to implement it in lavaan. Multivariate Behavioral Research,59(5), 995–1018. 10.1080/00273171.2024.2354228 [DOI] [PubMed] [Google Scholar]
- Mansournia, M. A., Etminan, M., Danaei, G., Kaufman, J. S., & Collins, G. (2017). Handling time varying confounding in observational research. BMJ,359,. 10.1136/bmj.j4587 [DOI] [PubMed]
- Muthén, L. K., & Muthén, B. (1998–2017). Mplus user’s guide : Statistical analysis with latent variables (Eighth edition). Muthén & Muthén.
- Naimi, A. I., Rudolph, J. E., Kennedy, E. H., Cartus, A., Kirkpatrick, S. I., Haas, D. M., Simhan, H., & Bodnar, L. M. (2021). Incremental propensity score effects for time-fixed exposures. Epidemiology,32(2), 202–208. 10.1097/EDE.0000000000001315 [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team. (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. https://www.R-project.org/
- Ren, D., Stavrova, O., & Loh, W. W. (2022). Nonlinear effect of social interaction quantity on psychological well-being: Diminishing returns or inverted U? Journal of Personality and Social Psychology,122(6), 1056–1074. 10.1037/pspi0000373 [DOI] [PubMed] [Google Scholar]
- Robins, J. M. (1986). A new approach to causal inference in mortality studies with a sustained exposure period-application to control of the healthy worker survivor effect. Mathematical Modelling,7(9), 1393–1512. 10.1016/0270-0255(86)90088-6 [Google Scholar]
- Robins, J. M. (1994). Correcting for non-compliance in randomized trials using structural nested mean models. Communications in Statistics-Theory and methods,23(8), 2379–2412. 10.1080/03610929408831393 [Google Scholar]
- Robins, J. M., Hernán, M. A., & Brumback, B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology, 11(5). https://journals.lww.com/epidem/Fulltext/2000/09000/Marginal_Structural_ Models_and_Causal_Inference_in.11.aspx [DOI] [PubMed]
- Rosenbaum, P. R., & Rubin, D. B. (1983). Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. Journal of the Royal Statistical Society: Series B (Methodological),45(2), 212–218. 10.1111/j.2517-6161.1983.tb01242.x [Google Scholar]
- Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika,70(1), 41–55. 10.1093/biomet/70.1.41 [Google Scholar]
- Rosseel, Y. (2012). Lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. 10.18637/jss.v048.i02
- Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology,66(5), 688–701. 10.1037/h0037350 [Google Scholar]
- Rubin, D. B. (1986). Comment: Which ifs have causal answers. Journal of the American Statistical Association,81(396), 961–962. 10.1080/01621459.1986.10478355 [Google Scholar]
- Rubin, D. B. (1990). [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies. Statistical Science, 5(4), 472–480. 10.1214/ss/1177012032
- Rudolph, J. E., Benkeser, D., Kennedy, E. H., Schisterman, E. F., & Naimi, A. I. (2022). Estimation of the average causal effect in longitudinal data with time-varying exposures: The challenge of nonpositivity and the impact of model flexibility. American Journal of Epidemiology. 10.1093/aje/kwac136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott, A. J., Webb, T. L., Martyn-St James, M., Rowse, G., & Weich, S. (2021). Improving sleep quality leads to better mental health: A meta-analysis of randomised controlled trials. Sleep Medicine Reviews,60, Article 101556. 10.1016/j.smrv.2021.101556 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith, P. J., & Merwin, R. M. (2021). The role of exercise in management of mental health disorders: An integrative review. Annual Review of Medicine, 72(Volume 72, 2021), 45–62. https://doi.org/10.1146/annurev-med-060619-022943 [DOI] [PMC free article] [PubMed]
- Splawa-Neyman, J., Dabrowska, D. M., & Speed, T. P. (1990). On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9. Statistical Science, 5(4), 465–472. 10.1214/ss/1177012031
- Thoemmes, F., & Ong, A. D. (2015). A primer on inverse probability of treatment weighting and marginal structural models. Emerging Adulthood,4(1), 40–59. 10.1177/2167696815621645 [Google Scholar]
- Valkenburg, P. M. (2022). Social media use and well-being: What we know and what we need to know. Current Opinion in Psychology,45, Article 101294. 10.1016/j.copsyc.2021.12.006 [DOI] [PubMed] [Google Scholar]
- Vansteelandt, S., & Sjölander, A. (2016). Revisiting G-estimation of the effect of a time-varying exposure subject to time-varying confounding. Epidemiologic Methods,5(1), 37–56. 10.1515/em-2015-0005 [Google Scholar]
- Weisstein, E. W. (2024). Binary logarithm. https://mathworld.wolfram.com/BinaryLogarithm.html
- West, S. G., Cham, H., Thoemmes, F., Renneberg, B., Schulze, J., & Weiler, M. (2014). Propensity scores as a basis for equating groups: Basic principles and application in clinical treatment outcome research. Journal of Consulting and Clinical Psychology,82(5), 906–919. 10.1037/a0036387 [DOI] [PubMed] [Google Scholar]
- Westreich, D., & Cole, S. R. (2010). Invited commentary: Positivity in practice. American Journal of Epidemiology,171(6), 674–677. 10.1093/aje/kwp436 [DOI] [PMC free article] [PubMed]
- Westreich, D., Cole, S. R., Young, J. G., Palella, F., Tien, P. C., Kingsley, L., Gange, S. J., & Hernán, M. A. (2012). The parametric g-formula to estimate the effect of highly active antiretroviral therapy on incident AIDS or death. Statistics in medicine,31(18), 2000–2009. 10.1002/sim.5316 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan, K.-H., & Bentler, P. M. (2010). Consistency of normal-distribution-based pseudo maximum likelihood estimates when data are missing at random. The American Statistician,64(3), 263–267. 10.1198/tast.2010.09203 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhong, R., Lian, H., Hershcovis, M. S., & Robinson, S. L. (2023). Mitigating or magnifying the harmful influence of workplace aggression: An integrative review. Academy of Management Annals,17(2), 516–545. 10.5465/annals.2021.0144
- Zimmerman, M. A. (2014). Flint Michigan adolescent study (FAS): A longitudinal study of school dropout and substance use, 1994-1997. Inter-university Consortium for Political; Social Research [distributor]. https://doi.org/10.3886/ICPSR34598.v1
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data used in the illustration is publicly available at https://doi.org/10.3886/ICPSR34598.v1.
The R scripts implementing the estimation procedure are publicly available online at https://osf.io/8ksfx/.





