Skip to main content
BMC Medical Research Methodology logoLink to BMC Medical Research Methodology
. 2025 Aug 27;25:202. doi: 10.1186/s12874-025-02633-y

Causal estimation of time-varying treatments in observational studies: a scoping review of methods, applications, and missing data practices

Mercy Rop 1,, Innocent Maposa 1,2, Taryn Young 1, Rhoderick Machekano 1
PMCID: PMC12382241  PMID: 40859121

Abstract

Background

Estimating causal effects of time-varying treatments or exposures in observational studies is challenging due to time-dependent confounding and missing data, necessitating advanced statistical approaches for accurate inference. Previous reviews indicate that singly robust methods are prevalent in epidemiological studies despite the availability of more robust alternatives that better handle time-varying confounding. Although common in longitudinal studies, missing data are often inadequately reported and addressed, potentially compromising the validity of estimates. Whether this dependence on less robust methods and inadequate handling of missing data persists in time-varying treatment settings remains unclear. This review aimed to identify current practices, methodological trends, and gaps in the causal estimation of time-varying treatments.

Methods

We conducted a scoping review to map causal methodologies for time-varying treatments in epidemiological studies and identify trends and gaps. To capture the most recent developments, we searched PubMed, Scopus, and Web of Science for articles published between 2023 and 2024. A structured questionnaire was used to extract key methodological aspects, and findings were summarized using descriptive statistics.

Results

Of the 424 articles, 63 met the eligibility criteria, with five added from citations and references, totalling 68 for analysis. Among these, 78% addressed epidemiological questions, 13% included methodological illustrations, and 9% focused solely on methods. Singly robust methods dominated, with inverse probability of treatment weighting (IPTW) being the most common (64.3%), followed by targeted maximum likelihood estimation (TMLE) (14.3%). The emergence of new estimation approaches was also noted. Missing data handling remained inadequate; 33% did not report the extent of missingness, 95.2% lacked assumptions, and sensitivity analysis was performed in only 14.5% of the articles. Multiple imputation (MI) was more prevalent (29%), while complete case analysis (11.3%) was likely underreported, given 33.9% omitted strategy details.

Conclusion

Persistent reliance on singly robust methods, underutilization of doubly robust approaches, and inadequate missing data handling highlight ongoing gaps in evaluating time-varying treatments. While newer estimation approaches are emerging, their adoption remains limited. These trends, alongside the growing complexity of real-world data and the demand for evidence-driven care, call for greater methodological rigor, wider adoption of robust approaches, and enhanced reporting transparency.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-025-02633-y.

Keywords: Time-varying treatments, Time-dependent confounding, Causal inference, Observational studies, Real-world data, Doubly robust, Singly robust, Missing data, Scoping review

Introduction

Time-varying treatment plans are increasingly utilized in clinical settings to deliver personalized care to patients with chronic and progressive medical conditions [1]. These plans involve continually monitoring and evaluating a patient’s evolving disease state and treatment responses, enabling dynamic adjustments to treatment decisions for optimized care. Similarly, many epidemiological exposures of interest are often time-varying and may influence clinically relevant outcomes differently over time [2].

When treatments or exposures change over time, treatment-affected time-varying confounding can occur because of evolving covariates and treatment history. This phenomenon occurs when earlier treatments influence the patient’s disease state, which in turn influences subsequent treatments and outcomes of interest [3, 4]. For instance, before the implementation of the universal test-and-treat policy in HIV management in 2016 [5], antiretroviral therapy (ART) was initiated only once CD4 counts dropped below a predefined threshold. However, throughout the follow-up period, earlier ART influenced CD4 counts, as successful treatment led to immunological recovery. This interplay creates a feedback loop in which CD4 counts dictate treatment decisions, treatments alter CD4 counts, and both factors ultimately affect clinical outcomes. This dynamic interplay is illustrated by a directed acyclic graph (Fig. 1).

Fig. 1.

Fig. 1

Treatment-affected time-varying confounding over three time points

Standard regression-based adjustment (RBA) analyses often fail to address these feedback loops adequately because they condition confounders affected by prior treatment, which can introduce bias and result in misleading conclusions [3]. Additionally, missing data, ubiquitous in longitudinal observational studies due to attrition, dropouts, non-adherence, and data collection challenges, further complicate accurate causal estimation in time-varying treatment settings [5, 6].

Unlike randomized controlled trials (RCTs), treatment assignments in observational settings are non-random and usually influenced by patient characteristics and physician decisions. Consequently, the probability of receiving a specific treatment or treatment sequence is not known a priori and must be estimated. Causal inference in these settings is commonly approached within the potential outcomes framework [7], which posits the existence of multiple potential outcomes for each treatment option (level). Within this framework, specialized confounding adjustment methods are used, and key identifiability assumptions—such as non-interference, consistency, positivity, and exchangeability—must be made and satisfied for the estimated effects to be interpreted causally [2, 8].

Over the past three decades, several statistical methods have been developed to adjust for treatment-affected time-varying confounding and estimate the average treatment effects (ATE). These methods include the g-formula, g-estimation with Structural Nested Mean Models (SNMM), and Inverse Probability of Treatment Weighting (IPTW) of Marginal Structural Models (MSMs), collectively developed by Robins et al. [4] and referred to as the g-methods, as well as doubly robust methods, such as Augmented Inverse Probability of Treatment Weighting (AIPTW) [9] and Targeted Maximum Likelihood Estimators (TMLE) [10]. A comprehensive review of these methods is provided elsewhere [8, 1013].

Briefly, the g-methods rely on correctly estimating either the outcome or treatment model for unbiased estimation of ATE. However, incorrectly specified models can lead to biased estimates and misleading conclusions [9]. Doubly robust (DR) methods aim to address this by requiring the specification of both the treatment and outcome models, and unbiased estimates can still be obtained when either model is misspecified but not both. These DR methods are also efficient when both models are consistently estimated [9].

While IPTW methods used in Marginal Structural Models (MSMs) effectively address time-varying confounding, standard implementations may not explicitly model interactions between treatments and time-varying covariates without additional specification in the MSM model [4]. Nonetheless, their resemblance to standard statistical procedures makes them intuitive and widely used in applied epidemiological research [10, 14]. Moreover, accurate specification of the treatment model is critical to avoid bias, yet traditional g-methods often rely solely on domain knowledge for this specification, which may not guarantee accurate estimation in high-dimensional observational settings with several covariates varying over time [8, 14]. To address these limitations, efforts to enhance g-methods by incorporating machine learning techniques are ongoing [1518]. These advancements aim to make the methods more data-adaptive, flexible, and accurate, enabling better capture of complex relationships among variables.

Although AIPTW enhances the IPTW methodology by incorporating an augmentation term that corrects the estimator when the treatment model is misspecified, the estimator can perform poorly in the presence of extreme propensity scores [19]. Additionally, challenges related to non-unique solutions in its estimating equations and not restricting the ATE estimate to the parameter space make it difficult to implement and interpret, limiting its widespread use [20].

In contrast, some doubly robust methods, such as TMLE, protect against model misspecification and include a targeting step that reduces the bias-variance trade-off, enabling more accurate estimates. Additionally, TMLE can seamlessly integrate Super Learner, an ensemble machine learning algorithm that combines multiple learners to identify and leverage the best-fitting model to improve efficiency [21].

Although TMLE can be implemented without Super Learner, it is generally considered best practice to use it to enhance performance [22]. In high-dimensional data with numerous time-varying covariates, this combination can uncover complex patterns and relationships among variables in the data [19], including non-linear, interaction, and higher-order relationships, thus improving model estimation. Previous comparative assessments have also demonstrated the superiority of TMLE methods over single robust alternatives in terms of efficiency and ability to simultaneously reduce bias and variance [23].

Since its introduction in 2006 [10], the TMLE methodology has undergone tremendous growth and advancement [24], broadening its scope and application to diverse settings, including longitudinal observational studies [19, 23, 24], time-to-event outcomes [25], clustered data [22] and, more recently, to dynamic treatment regimens or adaptive strategies [26].

Similarly, causal machine learning (CML) methods are gaining prominence for estimating treatment effect heterogeneity (HTE) [24, 2729]. By using machine learning algorithms to uncover variations in treatment response, CML approaches such as causal forests provide insights into treatment efficacy across patient subgroups, enabling more refined personalized strategies and policies [30, 31].

Additionally, the increasing availability of observational data from large electronic databases, advances in causal inference and computing methodologies, and the growing need for evidence-based healthcare have driven efforts to develop methods that combine findings from RCTs and observational studies [29, 32, 33]. This integration seeks to leverage the strengths of both designs to generate high-quality evidence for better health care.

Methodological trends and challenges in causal inference in epidemiological studies

In a systematic review conducted in 2019 [27], it was found that many applied researchers continue to rely on older, simpler, and less robust methods for evaluating time-varying treatments, despite the availability of more sophisticated alternatives, such as TMLE and AIPTW. A subsequent review in 2021 [34] pointed out a similar reluctance to embrace newer methods and the continued dominance of familiar traditional approaches. However, by 2022 [24], a systematic review noted a gradual increase in the use of TMLE in epidemiological studies, with further uptake anticipated as accessible tutorials and software packages become more widely available beyond the methodological community.

Furthermore, the 2021 review [34] identified gaps in the methodological quality of published studies that evaluated optimal time-varying treatment plans in observational studies. One prevalent issue was the inadequate handling of missing data, with approximately 51% of the studies failing to transparently report the strategies used to address missing data, contrary to reporting guidelines for observational studies [34, 35]. Similarly, Carroll et al. [36] explored how missing data are handled in time-to-event analyses for observational oncology studies. They found that simple strategies such as complete case analysis (CCA) were still the most used (53%), while more sophisticated methods such as multiple imputation (MI) were employed in only 22% of the studies. Missing data assumptions were rarely reported (14%), and sensitivity analyses were conducted in just 7% of cases, limiting the reliability of the study findings.

Similarly, a more recent systematic review [37] assessed gaps in the reporting and use of MI when addressing causal questions in observational studies with missing data across multiple variables. They noted that most studies did not report missing data, associated assumptions, or analysis decisions with sufficient clarity; in cases where missing data assumptions were provided, proper justifications were lacking. For instance, the common justification for MI implementation was the Missing at Random (MAR) assumption, often without thorough assessments or sufficient rationale to support the claim [37].

This underappreciation and underreporting of missing data have implications for the validity of conclusions, as selection bias and information loss can be introduced, obscuring the true effects of treatments on clinically relevant outcomes. Therefore, appropriate handling of missing data is essential for unbiased and accurate causal estimates.

Strategies for handling missing data

Various strategies are available for analyzing incomplete data, each suited to different missing data mechanisms or assumptions regarding the reasons for the missing data [3841]. Missing data are commonly classified according to Little and Rubin’s framework [42] as Missing Completely at Random (MCAR), Missing at Random (MAR), or Missing Not at Random (MNAR).

Under the assumption of MCAR, CCA yields unbiased estimates because the subsample remains representative of the study population [43]. However, when MCAR does not hold, CCA may introduce selection bias, as complete cases may differ systematically from those with missing data, leading to non-representativeness.

Multiple imputations are generally recommended when the MAR assumption is plausible [36, 41]. It generates multiple complete datasets by imputing missing values using a predictive probability distribution derived from observed data. These datasets are then analyzed separately, and the results are pooled using Rubin’s rule, which ensures that both between- and within-imputation variations are accounted for in estimating the standard errors.

Multiple imputation is increasingly used in epidemiological studies [37] and is often considered the'gold standard'because it mitigates information loss by using all available data and accounts for the uncertainty around missing data, yielding more precise estimates [39, 42, 43]. However, consistent MI estimation depends on the correct specification of imputation models [44]. Essentially, the models need to have all the variables that are intended to be part of the analysis incorporated in the model in a manner that depicts the actual relationship between the variables; failure to do so may lead to incompatibility between the imputation model and the analysis model, resulting in biased estimates [45, 46]. Fortunately, flexible data-adaptive MI approaches that combine machine learning algorithms are currently available to improve the chance that imputation models are appropriately specified [4749].

Single imputation approaches, such as using the mean or median of observed data or carrying forward the last observed value (LOCF), are prevalent in epidemiological studies. These methods impute a single value for each missing observation, assuming that the imputed value accurately represents the true missing values. Hence, failing to account for the uncertainty in missing data and underestimating standard errors can lead to biased estimates and incorrect inferences [43]. Consequently, these methods are widely discouraged as primary strategies for handling missing data [5053]. However, LOCF may be considered reasonable under certain strong justifications that the last observed value reflects the most recent information guiding treatment decisions, particularly when the condition is expected to remain stable or the treatment effect is unlikely to change substantially [54].

Missing data challenges can also be addressed using inverse probability weighting (IPW) or its extension for censoring in time-to-event analyses, known as the inverse probability of censoring weighting (IPCW). These techniques create a pseudo-population in which the observed data accurately reflect the entire target population, particularly when all relevant population characteristics or conditioning variables are included in the observed data. They achieve this by assigning greater weight to individuals who are underrepresented because of missing data or censoring. Each individual receives a weight that is the inverse of the estimated probability that their data are observed given their covariates and history. This method reduces bias under the MAR assumption by ensuring that the analysis properly considers the differential probabilities of observation [6, 49].

Additionally, within the TMLE framework, there is growing literature on handling missing covariate data by incorporating a missing data indicator (1 if observed, 0 if missing) into the nuisance parameter models [5557]. In this context, the target parameter is redefined under a hypothetical intervention where everyone is observed, allowing estimation under complete data scenarios. Combined with Super Learner, this approach enables a flexible, data-adaptive approach for handling missing data. This extension is explicitly implemented in Longitudinal TMLE via the ltmle package in R [58].

Finally, in situations where missing data are assumed to be MNAR, more advanced approaches, such as selection models [59], pattern-mixture models [60] and Bayesian approaches [43, 51], are recommended.

There is no universally best method for handling missing data in all situations; instead, the choice of approach should be guided by the specific nature and context of the missing data [51]. After handling missing data, it is important to conduct sensitivity analyses to assess the robustness of the findings to potential deviations from the assumed missing data mechanism in the primary analysis, mainly because the assumptions made cannot be verified from the observed data [6, 43, 53, 54]. This ensures that research findings provide a more transparent picture of the uncertainty surrounding the results.

The effective use of robust causal methods, in conjunction with appropriate handling of missing data, can significantly improve the accuracy of insights into the personalized management of chronic and progressive diseases. However, the extent to which applied researchers have embraced these advanced methods and whether appropriate reporting and optimal handling of missing data have been adequately incorporated into evaluating treatment effects of time-varying treatments in epidemiological studies remains unclear.

Objectives of the review

This scoping review builds on prior reviews [14, 22, 61, 62], evaluating the utilization of causal methods and progress in adopting advanced causal techniques in epidemiological studies. It focuses specifically on their application in time-varying settings and identifies the remaining gaps in methodological practice, particularly concerning the reporting and handling of missing data in observational studies. The specific objectives were to:

  • Identify and describe methodologies used to estimate the average treatment effects of time-varying treatments and exposures.

  • Assess the extent to which the amount of missing data was reported.

  • Evaluate the extent to which missing data assumptions were explored and reported.

  • Identify strategies employed to handle missing data.

  • Determine whether sensitivity analyses were conducted to assess the robustness of missing data assumptions.

Methods

This scoping review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) checklist [63] and adhered to relevant guidelines [54, 64]. The literature reviewed in this study included articles published between January 1, 2023, and December 31, 2024. The inclusion of more recent literature allows this review to update and extend prior reviews [14, 22, 61], capturing the most current trends, gaps, and practices in the epidemiological field.

Search strategy

We systematically searched PubMed, Scopus, and Web of Science databases on January 5, 2025, to identify primary research articles published within the defined period. The search strategy was developed in collaboration with a skilled librarian, who assisted in constructing the search terms. The search terms consisted of three main components: time-varying treatments, adjustment methods, and observational studies. The primary search terms were built as an ordered combination of the three key components, with synonyms and related phrases to capture all relevant variations (Table 1).

Table 1.

Search terms

Key terms Synonyms
Time-varying treatments Adaptive interventions, dynamic treatment strategies, time-dependent confounding, treatment policy/treatment policies
Causal methods Adjustment methods, TMLE, IPTW, AIPTW, SNMM, G-estimation, g-formula, g-computation formula, Marginal structural models (MSMs), causal machine learning, causal AI, causal deep learning
Observational studies Non-randomized, non-experimental, real-world data, electronic medical records, electronic health records

The search terms were tailored for each of the three databases. The full PubMed search strategy is shown in Table 2, with strategies for the other databases provided in Additional file 1.

Table 2.

PubMed search terms

Search Search combination used
#1 “Dynamic treatment regimens” OR “Dynamic treatment regime*” OR “Dynamic treatment strategies” OR “Adaptive intervention*” OR “adaptive strateg*” OR “adaptive treatment*” OR “time-varying” OR “time-dependent” OR “treatment policy” OR “treatment policies” OR “Sequential treatment decisions”
#2 “Causal” OR “Causal inference” OR “Targeted Maximum Likelihood Estimation” OR “TMLE” OR “targeted maximum likelihood estimation”) OR “targeted minimum loss based estimation” OR “targeted minimum loss-based estimation” OR “targeted machine learning” OR “targeted learning” OR “targeted machine-learning” OR “Inverse Probability of treatment weighting” OR “IPTW” OR “Augmented Inverse Probability of Treatment Weighting” OR “AIPTW” OR “Structural Nested Mean Models” OR “SNMM” OR “G-Estimation” OR “g-formula” OR “g-computation formula” OR “Marginal structural model” OR “Causal AI” OR"causal ML"OR"Causal deep learning"OR"Bayesian causal"
#3 “Observational” OR “Non-randomized” OR “non-randomised” OR “real-world*” OR “Electronic medical records” OR “EMR” OR “Electronic health records” OR “EHR”
#4 #1 AND #2 AND #3

Eligibility criteria

We included articles proposing or applying statistical methods for time-varying treatments in observational settings, including peer-reviewed articles and conference proceedings published between 2023 and 2024, as well as grey literature such as technical reports and preprints not formally published in peer-reviewed journals. The inclusion criteria required articles to address causal questions related to time-varying treatments in observational settings, or they propose, illustrate, or implement methods for adjusting for time-varying confounding. Additionally, only publications in the English language, with both abstract and full-text available, were included. Articles were excluded if they were review papers, opinion articles or commentaries, study protocols, simulations, or comparative studies.

Selection of relevant articles

The search output was imported into the Rayyan software [65] to organize and manage the literature and facilitate team collaboration while selecting relevant articles. The initial screening was conducted based on titles and abstracts, followed by a detailed assessment of full articles for eligibility. Additional articles were selected based on the citations and references of the included studies to ensure thorough coverage. Two authors independently reviewed the articles.

Data on the fields listed in Table 2 were extracted using a standardized form (in Microsoft Excel) for each article. Data extraction was performed by MR and confirmed by RM (Table 3).

Table 3.

Data extraction

Item Item definition
Complete reference Title, Journal name, year of publication
Paper focus The paper's focus: methodology or clinical questions
The clinical focus The medical condition of the paper, e.g., HIV, cancer
Causal question Whether the research question is causal or associational
Statistical adjustment methods used The methods used to adjust for confounding, e.g., IPTW
The extent of missingness described Amount of missing data reported either in text or in a table/figure
Missing data mechanism Missing data assumptions explored and reported
Handling of missing data Strategies employed to address missing data and their justifications
Sensitivity analysis Whether sensitivity analyses were done to assess the robustness of missing data assumptions. If yes, which methods were used

Data analysis and reporting

The extracted data were analyzed using descriptive statistics (frequencies and percentages) in Stata 18 [66], narratively synthesized and reported following established guidelines [63].

Results

The initial search yielded 424 articles. After removing duplicates, 344 studies were screened by titles and abstracts, and 99 advanced to full-text reviews. Of these, 63 studies met the inclusion criteria [67129], with five additional studies [130134] identified through citation tracking and reference lists. In total, 68 articles were included (Fig. 2), with 36 (53%) published in 2023 and 32 (42%) in 2024.

Fig. 2.

Fig. 2

Flowchart outlining article identification and inclusion in the scoping review

Most studies (53, 78%) focused on answering clinical and epidemiological questions, while nine (13%) focused primarily on methodology alongside illustrative analyses using real-world data, and six (9%) focused on methodological concepts only (Table 4).

Table 4.

Characteristics of reviewed papers

Characteristics Total (n = 68)
Year of publication
 2023 36 (52.9%)
 2024 32 (47.1%)
Focus of the paper
 Answer clinical questions 53 (77.9%)
 Methodology + illustrative analysis 9 (13.2%)
 Methodology only 6 (8.8%)
Clinical focus area
 COVID-19 6 (9.7%)
 Dentition 5 (8.1%)
 Mental health disorders 6 (9.7%)
 Mortality 5 (8.1%)
 Cancer 4 (6.5%)
 HIV 4 (6.5%)
 Diabetes 3 (4.8%)
 Kidney issues 4 (6.5%)
 Other 25 (40.3%)
Disease category
 Chronic & progressive conditions 31 (54.4%)
 Other 26 (45.6%)
Methods used
 Propensity score methods 39 (59.1%)
 TMLE 6 (9.1%)
 G-formula 6 (9.1%)
 G-estimation 4 (6.1%)
 Bayesian approaches 3 (4.5%)
 Instrumental variables 2 (3.0%)
 ML 3 (4.5%)
 Other 3 (4.5%)
Causal method category (Application papers)
 Singly robust 56 (90.3%)
 Doubly robust 6 (9.7%)
Extent of missingness reported
 No 21 (33.9%)
 Yes, in a table/figure 19 (30.6%)
 Yes, in text 22 (35.5%)
Missing data assumptions reported
 No 59 (95.2%)
 No, but implied by text as MAR 1 (1.6%)
 Yes, MAR 1 (1.6%)
 Yes, MNAR 1 (1.6%)
Missing data strategy
 MI 18 (29.0%)
 CCA 7 (11.3%)
 SI 10 (16.1%)
 Multiple approaches 6 (9.7%)
 Not indicated 21 (33.9%)
Sensitivity analysis done
 No 53 (85.5%)
 Yes 9 (14.5%)

The estimation of the causal effects of dynamic treatments and time-varying exposures spanned various health conditions. Key focuses included COVID-19 (6, 9%), mental health disorders (6, 9%), dentition (5, 7.4%), and cancer and HIV (4 studies each, 6%).

Mortality (n = 5, 7.4%) was a frequent outcome of interest. Although COVID-19 accounted for the largest proportion, studies addressing chronic and progressive conditions collectively comprised a significant share (31 studies, 54%), underscoring the growing importance of dynamic treatment regimens and time-varying exposures in epidemiological research.

Causal methods

Among the methodological papers, the emphasis was on demonstrating the theory and application of emerging concepts, such as machine learning [56, 58, 88], proximal approaches to address unmeasured confounding [71], and the use of high-dimensional propensity scores [81, 123], an extension of the traditional propensity score that enables an automated, data-driven approach to covariate selection. This trend highlights the increasing role of data-driven methodologies in causal estimations.

In the application and methodology illustrative papers, propensity score methods (n = 39, 59%), particularly IPTW, were the most frequently used, appearing in 27 studies (44%). TMLE was used in six studies (9.7%), followed by the g-formula in five studies (8.1%), and G-estimation in three studies (5%). Bayesian approaches, along with instrumental variables, joint models, and machine learning techniques, were the least common. Regarding robustness, doubly robust methods were employed in only 6 studies (14.3%), whereas the majority (56 studies, 90.3%) relied on singly robust methods.

Missing data reporting and handling

In addition to the range of causal estimation methods, we observed substantial variability in reporting and handling missing data. Among the 62 application papers, 21 (33.9%) did not report the extent of missing data, while 19 (30.6%) provided details in tables and figures, and 22 (35.5%) merely mentioned missingness briefly in the text without offering precise details, sometimes using vague statements such as “ < 5% missingness in all variables,” and “there was small missingness in the covariates.” While these statements provide some indication of the extent of missing data, they lack specificity and leave questions regarding the underlying assumptions and potential bias unaddressed.

Regarding missing data assumptions, most studies (59, 95.2%) did not explicitly state any assumptions regarding missingness in the data. Only two studies (3.2%) assumed that missing data were MAR, and a single study (1.6%) clearly stated that the missing data were MNAR. About one-third of the studies did not indicate a missing data strategy utilized, which may suggest the use of CCA, as this is the default in most statistical packages. Among the reported strategies, MI was the most used, applied in 29% [17] of the studies.

Single Imputation (SI) approaches, including LOCF and mean or median imputation, were employed in ten studies (16.1%). LOCF was also frequently used in studies that employed multiple strategies (six studies, 9.7%). However, only 9 studies (14.5%) conducted sensitivity analyses to assess the robustness of the results to missing data assumptions, while the majority (53 studies, 85.5%) did not perform such analyses.

Discussion

In this scoping review, we provide an up-to-date overview of the methodological practices in estimating the average treatment effects of dynamic treatment regimens and time-varying exposures in observational epidemiological studies, focusing on causal methodologies and handling missing data. We observed significant variations in approaches used for causal estimation and the reporting and management of missing data across the 68 included studies, which covered a wide range of health conditions. We also noted an increase in the use of new analytical techniques.

Previous reviews [14, 61] had indicated a primary focus of time-varying regimens on HIV and chronic conditions such as cancer. While the current review highlights a continued emphasis on chronic diseases (31 studies, 54%), there was a noticeable shift towards acute conditions, particularly COVID-19, reflecting the field’s responsiveness to emerging public health crises.

Our findings also indicate that singly robust methods, particularly propensity score-based approaches such as IPTW, continue to dominate the evaluation of time-varying treatments, while doubly robust methods remain underutilized. This reflects a continued reliance on traditional singly robust approaches in current practice, consistent with previous reviews [14, 61], which also reported the predominance of IPTW methods, likely owing to their ease of implementation and interpretability [135], suggesting a reluctance to adopt newer methods. However, despite expectations of increased TMLE adoption [136], the current review noted a slow uptake. The limited use of doubly robust methods in real-life applications presents an opportunity to improve causal inference in complex data settings, such as those considered in this review.

Shifting to robust methods is likely perceived as challenging because of the complexity of their modeling approaches and computational demands. In particular, both the Super Learner and multiple imputation (MI) can be resource-intensive, especially when applied to large datasets. While specialized software options are steadily improving within open-source platforms such as R, comparable support remains limited in other widely used statistical packages such as Stata and SAS.

For new users, these tools can be intimidating because of their steep learning curves and limited troubleshooting support. Although efforts have been made to enhance accessibility through lay-language tutorials for applied researchers [19, 137] and the availability of freely available implementation packages in R, ongoing efforts are needed to develop and maintain user-friendly tools and documentation to support broader adoption in epidemiological research. For instance, the tlverse software suite for TMLE (GitHub: https://github.com/tlverse/tlverse), which has not been updated in the last four years, provides a platform that can be further enhanced for improved accessibility by non-specialist users. Additionally, the rapid pace of development in the causal inference landscape reinforces the need for continued collaboration between methodologists and applied scientists to bridge the gap between theory and practice [24], as methodological advancements alone, without practical implementation, cannot drive meaningful progress.

Historically, non-Bayesian approaches have dominated the theory and practice of causal estimation in epidemiological studies [72]. This trend will likely persist in the current practice due to theoretical, computational, and programming requirements that remain inaccessible to domain experts. Along with the growing interest in integrating machine learning techniques with frequentist causal methods [123, 126], similar efforts are gaining traction within Bayesian causal inference frameworks [72, 138].

The potential for unmeasured time-varying confounding is also a significant concern in time-varying observational settings, as it can bias estimates [139]. As a result, methodologies that use proximal approaches are being developed [71]. A related trend is the increasing use of high-dimensional propensity scores for covariate selection [93]. These newer innovative approaches are expected to increasingly shape the causal inference field, particularly with the increasing complexity of real-world clinical data, the rising demand for evidence-driven care, and the advancements in statistical computing.

Missing data remains a persistent challenge in longitudinal observational studies; however, it was not adequately reported in many studies. Notably, slightly more than one-third failed to disclose the extent of missing data for each variable, while the other third provided only limited details. This is consistent with the findings of a previous review [34], which noted that up to half of the studies provided no information regarding the amount of missingness in the data. The main issue arising from missing data is selection bias, which becomes more pronounced as the proportion of missing data increases. As a result, this lack of transparency regarding the extent of missing data limits the ability to assess the validity of the findings and complicates the comparison of results across studies. Despite the guidelines for reporting observational studies [62, 140] emphasizing clear and explicit reporting of missingness, adherence was mostly inconsistent.

Our findings also reveal that assumptions underlying missing data are rarely explored and stated. This was similar to previous reviews [62, 140], which noted infrequent reporting of missing data assumptions, with approximately 70–86% failing to provide a statement on assumptions made about the underlying missingness in the data. Assumptions regarding the mechanism of missing data can significantly influence the estimates obtained from statistical analyses because different approaches are appropriate for different assumptions [41]. Failure to consider them in the analysis may lead to incorrect inferences or underestimation of the uncertainty surrounding missing data.

This scoping review also highlights a varied approach to addressing missing data. The most common strategy was MI, which is an improvement from previous reviews [61, 62] where CCA was predominant. Although this review shows that MI is becoming more widely used, CCA may still be the preferred method in practice, particularly in studies that do not explicitly report their missing data strategies. Despite its simplicity and widespread use, CCA can potentially introduce selection bias when the missing data are not MCAR.

Interestingly, some studies employed multiple missing data strategies, often combining LOCF, a common bedside practice in clinical settings, with other methods. Although combining multiple approaches may reflect a more deliberate effort to address missing data, it can also complicate the interpretation of results because it introduces variability and inconsistency in how missing data are handled across the dataset.

Another notable concern is the limited use of sensitivity analyses to assess the robustness of missing data strategies. This raises questions regarding the validity and reliability of the conclusions drawn from such studies, as the potential impact of missing data may not have been assessed thoroughly.

The underappreciation and under-reporting of the extent of missing data, assumptions made, inadequate handling, and lack of sensitivity analyses point to a broader issue of insufficient reporting in observational research, a finding similar to that of a recent review by Mainzer et al. [37], where they called for increased concerted efforts to improve transparency in reporting missing data practices. Given the challenges of time-varying confounding, the prevalence of missing data in multiple covariates (potential confounders) in observational studies, and their potential impacts on estimates, more detailed and transparent reporting of both the methods used and the assumptions made is essential for enhancing the reproducibility and robustness of causal estimates.

The strength of this study lies in its thorough evaluation of causal methodologies across a large body of literature spanning three databases, offering a clear picture of current trends and practices in applied epidemiology over the past two years. Although potentially relevant articles may have been missed, we took extensive measures to identify all relevant studies by examining cited and citing articles, which were few due to the recent review period (2023–2024).

Overall, although time-dependent confounding and missing data are distinct methodological challenges, they often intersect in observational studies. Our findings highlight persistent gaps in the reporting and handling of missing data in these studies. These gaps are particularly concerning, given the frequent reliance on singly robust methods, which are more prone to bias due to their dependence on a single model. Combined with inadequate reporting and handling of missing data, the impact of this bias can be significantly amplified, potentially leading to misleading conclusions about treatment effects. With the rise in personalized medicine and the increasing volume and complexity of real-world healthcare data, the need for rigorous methodologies has never been greater.

Conclusion

Time-varying treatments and exposures of epidemiological interest are increasingly common, making proper adjustments for time-varying confounding and handling of missing data essential for accurate estimation. This review highlights the widespread reliance on singly robust methods in causal inference for time-varying treatment settings, despite the availability of more robust alternatives, alongside significant gaps in the reporting and management of missing data. Addressing these dual challenges by implementing appropriate, more modern, and rigorous methodologies and improving transparency, as recommended in guidelines for observational studies, will move causal inference for time-varying treatments in observational settings to greater validity and reliability.

Moreover, the growing integration of machine learning (ML) and artificial intelligence (AI) techniques offers innovative solutions for causal inference challenges, particularly in complex, high-dimensional settings and in estimating treatment effect heterogeneity. These techniques provide greater modeling flexibility and reveal intricate patterns and relationships that traditional methods may fail to detect. However, their application in time-varying settings is still evolving.

Finally, with the growing interest in developing methods that integrate findings from RCTs and observational studies, as well as those that account for residual confounding, it would be valuable to explore their implementation and relative performance in time-varying settings in the future.

Supplementary Information

Supplementary Material 1. (17.2KB, docx)

Acknowledgements

Research reported in this publication was supported by the Fogarty International Center of the National Institutes of Health under Award Number D43TW010547. The content is solely the authors'responsibility and does not necessarily represent the official views of the National Institutes of Health.

We also want to thank the faculty librarian, Ms. Alvina Matthee, for her invaluable assistance in formulating and piloting the search strategy for this study and for providing guidance.

Abbreviations

ATE

Average treatment effects; ART: Anti-retroviral therapy; CCA: Complete case analysis

CML

Causal Machine Learning; DR: Doubly robust

HTE

Heterogeneous Treatment Effects; SNMM: Structural nested mean model

IPTW

Inverse Probability of Treatment Weighting

IPW

Inverse Probability of Weighting; IPCW: Inverse Probability of Censoring Weighting

AIPTW

Augmented Inverse Probability of Treatment Weighting

TMLE

Targeted Maximum Likelihood Estimation/minimum-loss based estimation

MCAR

Missing data can be classified as Missing Completely at Random

MAR

Missing at Random; MNAR: Missing Not at Random

MI

Multiple Imputation; LOCF: Last observation carried forward

HIV

Human Immunodeficiency Virus; RBA: Regression-based adjustment

RCTs

Randomized Controlled Trials; SAS: Statistical Analysis System

Authors’ contributions

MR, TY, and RM conceived and designed the scoping review. MR conducted the review, with RM confirming the reviewed articles and subsets of data extraction. MR drafted the manuscript, incorporating inputs from all co-authors. All authors were responsible for the interpretation of results, input to the draft manuscript, revision, and approval of the final version.

Funding

Not applicable.

Data availability

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Chakraborty B, Murphy SA. Dynamic treatment regimes. Annu Rev Stat Appl. 2014;1:447–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Robins J, Hernan M. Estimation of the causal effects of time-varying exposures. In: Verbeke G, Davidian M, Fitzmaurice G, Molenberghs G, editors. Longitudinal Data Analysis [Internet]. Chapman and Hall/CRC; 2008. p. 553–99. (Chapman & Hall/CRC Handbooks of Modern Statistical Methods; vol. 20085746). Available from: http://www.crcnetbase.com/doi/abs/10.1201/9781420011579.ch23. Cited 2024 Mar 26.
  • 3.Hernán MA, Robins JM. Estimating causal effects from epidemiological data. J Epidemiol Community Health. 2006;60(7):578–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Robins JM, Hernán MÁ, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–60. [DOI] [PubMed] [Google Scholar]
  • 5.World Health Organization and others. Consolidated guidelines on the use of antiretroviral drugs for treating and preventing HIV infection: recommendations for a public health approach. World Health Organization. 2016. [PubMed]
  • 6.Tan FES, Jolani S, Verbeek H. Guidelines for multiple imputations in repeated measurements with time-dependent covariates: a case study. J Clin Epidemiol. 2018;102:107–14. [DOI] [PubMed] [Google Scholar]
  • 7.Hern´an MA, Robins JM. Causal Inference: What If. Chapman & Hall/CRC. 2020.
  • 8.Leyrat C, Carpenter JR, Bailly S, Williamson EJ. Common methods for handling missing data in marginal structural models: what works and why. Am J Epidemiol. 2021;190(4):663–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Funk MJ, Westreich D, Wiesen C, Stürmer T, Brookhart MA, Davidian M. Doubly robust estimation of causal effects. Am J Epidemiol. 2011;173(7):761–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Van Der Laan MJ, Rubin D. Targeted Maximum Likelihood Learning. Int J Biostat. 2006;2(1). Available from: https://www.degruyter.com/document/doi/10.2202/1557-4679.1043/html. Cited 2024 Apr 26.
  • 11.Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–60. [DOI] [PubMed] [Google Scholar]
  • 12.Daniel RM, Cousens SN, De Stavola BL, Kenward MG, Sterne JAC. Methods for dealing with time-dependent confounding. Stat Med. 2013;32(9):1584–618. [DOI] [PubMed] [Google Scholar]
  • 13.Robins J, Hernan M. Estimation of the causal effects of time-varying exposures. In: Verbeke G, Davidian M, Fitzmaurice G, Molenberghs G, editors. Longitudinal Data Analysis. Chapman and Hall/CRC; 2008. p. 553–99. (Chapman & Hall/CRC Handbooks of Modern Statistical Methods; vol. 20085746). Available from: http://www.crcnetbase.com/doi/abs/10.1201/9781420011579.ch23. Cited 2024 Dec 19.
  • 14.van der Laan MJ, Gruber S. Targeted minimum loss based estimation of causal effects of multiple time point interventions. Int J Biostat. 2012;8(1):Article 9. 10.1515/1557-4679.1370. [DOI] [PubMed]
  • 15.Lee BK, Lessler J, Stuart EA. Improving propensity score weighting using machine learning. Stat Med. 2010;29(3):337–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pirracchio R, Petersen ML, van der Laan M. Improving propensity score estimators’ robustness to model misspecification using super learner. Am J Epidemiol. 2015;181(2):108–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cui P, Shen Z, Li S, Yao L, Li Y, Chu Z, et al. Causal Inference Meets Machine Learning. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. Virtual Event CA USA: ACM; 2020. p. 3527–8. Available from: https://dl.acm.org/doi/10.1145/3394486.3406460. Cited 2024 Jun 21.
  • 18.Le Borgne F, Chatton A, Léger M, Lenain R, Foucher Y. G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes. Sci Rep. 2021;11(1):1435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Luque-Fernandez MA, Schomaker M, Rachet B, Schnitzer ME. Targeted maximum likelihood estimation for a binary treatment: a tutorial. Stat Med. 2018;37(16):2530–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Neugebauer R, Schmittdiel JA, Van Der Laan MJ. Targeted learning in real-world comparative effectiveness research with time-varying interventions. Stat Med. 2014;33(14):2480–520. [DOI] [PubMed] [Google Scholar]
  • 21.van der Laan MJ, Polley EC, Hubbard AE. Super learner. Stat Appl Genet Mol Biol. 2007;6: Article25. [DOI] [PubMed] [Google Scholar]
  • 22.Laan MJ van der, Rose S. Targeted Learning: Causal Inference for Observational and Experimental Data. New York, NY: Springer New York; 2011. 626 p. (Springer Series in Statistics).
  • 23.Tran L, Yiannoutsos C, Wools-Kaloustian K, Siika A, van der Laan M, Petersen M. Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study. Int J Biostat. 2019;15(2):/j/ijb.2019.15.issue-2/ijb-2017–0054/ijb-2017–0054.xml. [DOI] [PMC free article] [PubMed]
  • 24.Smith MJ, Phillips RV, Luque-Fernandez MA, Maringe C. Application of targeted maximum likelihood estimation in public health and epidemiological studies: a systematic review. Ann Epidemiol. 2023;86:34-48.e28. [DOI] [PubMed] [Google Scholar]
  • 25.Stitelman OM, van der Laan MJ. Collaborative targeted maximum likelihood for time to event data. Int J Biostat. 2010;6(1):21. [DOI] [PubMed] [Google Scholar]
  • 26.van der Laan M, Qiu S, van der Laan L. Adaptive-TMLE for the Average Treatment Effect based on Randomized Controlled Trial Augmented with Real-World Data. arXiv; 2024. Available from: http://arxiv.org/abs/2405.07186. Cited 2024 May 20.
  • 27.Schomaker M, Luque-Fernandez MA, Leroy V, Davies MA. Using longitudinal targeted maximum likelihood estimation in complex settings with dynamic interventions. Stat Med. 2019;38(24):4888–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Petersen M, Schwab J, Gruber S, Blaser N, Schomaker M, van der Laan M. Targeted maximum likelihood estimation for dynamic and static longitudinal marginal structural working models. J Causal Inference. 2014;2(2):147–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Clare PJ, Dobbins TA, Mattick RP. Causal models adjusting for time-varying confounding-a systematic review of the literature. Int J Epidemiol. 2019;48(1):254–65. [DOI] [PubMed] [Google Scholar]
  • 30.Athey S, Wager S. Estimating treatment effects with causal forests: an application. Observ Stud. 2019;5(2):37–51. [Google Scholar]
  • 31.Shiba K, Inoue K. Harnessing causal forests for epidemiologic research: key considerations. Am J Epidemiol. 2024;193(6):813–8. [DOI] [PubMed] [Google Scholar]
  • 32.Jawadekar N, Kezios K, Odden MC, Stingone JA, Calonico S, Rudolph K, et al. Practical Guide to Honest Causal Forests for Identifying Heterogeneous Treatment Effects. Am J Epidemiol. 2023;192(7):1155–65. [DOI] [PubMed] [Google Scholar]
  • 33.Caron A, Baio G, Manolopoulou I. Estimating individual treatment effects using non-parametric regression models: a review. J R Stat Soc Ser A Stat Soc. 2022;185(3):1115–49. [Google Scholar]
  • 34.Mahar RK, McGuinness MB, Chakraborty B, Carlin JB, IJzerman MJ, Simpson JA. A scoping review of studies using observational data to optimise dynamic treatment regimens. BMC Med Res Methodol. 2021;21(1): 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dahabreh IJ, Matthews A, Steingrimsson JA, Scharfstein DO, Stuart EA. Using trial and observational data to assess effectiveness: trial emulation, transportability, benchmarking, and joint analysis. Epidemiol Rev. 2024;46(1):1–16. [DOI] [PubMed] [Google Scholar]
  • 36.Carroll OU, Morris TP, Keogh RH. How are missing data in covariates handled in observational time-to-event studies in oncology? A systematic review. BMC Med Res Methodol. 2020;20(1):134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mainzer RM, Moreno-Betancur M, Nguyen CD, Simpson JA, Carlin JB, Lee KJ. Gaps in the usage and reporting of multiple imputation for incomplete data: findings from a scoping review of observational studies addressing causal questions. BMC Med Res Methodol. 2024;24(1):193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.White IR, Royston P, Wood AM. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30(4):377–99. [DOI] [PubMed] [Google Scholar]
  • 39.Bounthavong M, Watanabe JH, Sullivan KM. Approach to addressing missing data for electronic medical records and pharmacy claims data research. Pharmacotherapy. 2015;35(4):380–7. [DOI] [PubMed] [Google Scholar]
  • 40.Desai M, Montez-Rath ME, Kapphahn K, Joyce VR, Mathur MB, Garcia A, et al. Missing data strategies for time-varying confounders in comparative effectiveness studies of non-missing time-varying exposures and right-censored outcomes. Stat Med. 2019;38(17):3204–20. [DOI] [PubMed] [Google Scholar]
  • 41.Carpenter JR, Smuk M. Missing data: a statistical framework for practice. Biom J. 2021;63(5):915–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Little RJA, Rubin DB. Statistical analysis with missing data. New York: Wiley; 1987. p. 278 (Wiley series in probability and mathematical statistics). [Google Scholar]
  • 43.Austin PC, White IR, Lee DS, van Buuren S. Missing data in clinical research: a tutorial on multiple imputation. Can J Cardiol. 2021;37(9):1322–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bartlett JW, Seaman SR, White IR, Carpenter JR, Alzheimer’s Disease Neuroimaging Initiative. Multiple imputation of covariates by fully conditional specification: accommodating the substantive model. Stat Methods Med Res. 2015;24(4):462–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hayati Rezvan P, Lee KJ, Simpson JA. The rise of multiple imputation: a review of the reporting and implementation of the method in medical research. BMC Med Res Methodol. 2015;15(1): 30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Erler NS, Rizopoulos D, van Rosmalen J, Jaddoe VWV, Franco OH, Lesaffre EMEH. Dealing with missing covariates in epidemiologic studies: a comparison between multiple imputation and a full Bayesian approach. Stat Med. 2016;35(17):2955–74. [DOI] [PubMed] [Google Scholar]
  • 47.Laqueur HS, Shev AB, Kagawa RMC. SuperMICE: an ensemble machine learning approach to multiple imputation by chained equations. Am J Epidemiol. 2022;191(3):516–25. [DOI] [PubMed] [Google Scholar]
  • 48.Getz K, Hubbard RA, Linn KA. Performance of multiple imputation using modern machine learning methods in electronic health records data. Epidemiology. 2023;34(2):206–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Carpenito T, Manjourides J. MISL: multiple imputation by super learning. Stat Methods Med Res. 2022;31(10):1904–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sterne JAC, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;29(338): b2393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Mallinckrodt CH, Sanger TM, Dubé S, DeBrota DJ, Molenberghs G, Carroll RJ, et al. Assessing and interpreting treatment effects in longitudinal clinical trials with missing data. Biol Psychiatry. 2003;53(8):754–60. [DOI] [PubMed] [Google Scholar]
  • 52.Lachin JM. Fallacies of last observation carried forward analyses. Clin Trials. 2016;13(2):161–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.National Research Council. The Prevention and Treatment of Missing Data in Clinical Trials. Washington, D.C.: National Academies Press; 2010. Available from: http://www.nap.edu/catalog/12955. Cited 2025 May 28. [PubMed]
  • 54.European Medicines Agency. Guideline on missing data in confirmatory clinical trials. European Medicines Agency; 2010. Available from: https://www.ema.europa.eu/en/documents/scientific-guideline/guideline-missing-data-confirmatory-clinical-trials_en.pdf.
  • 55.Balzer LB, Ayieko J, Kwarisiima D, Chamie G, Charlebois ED, Schwab J, et al. Far from MCAR: obtaining population-level estimates of HIV viral suppression. Epidemiology. 2020;31(5):620–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ehrlich SF, Ferrara A, Hedderson MM, Feng J, Neugebauer R. Exercise during the first trimester of pregnancy and the risks of abnormal screening and gestational diabetes mellitus. Diabetes Care. 2021;44(2):425–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Dashti SG, Lee KJ, Simpson JA, White IR, Carlin JB, Moreno-Betancur M. Handling missing data when estimating causal effects with targeted maximum likelihood estimation. Am J Epidemiol. 2024. 10.1093/aje/kwae012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lendle SD, Schwab J, Petersen ML, Van Der Laan MJ. ltmle : An R Package Implementing Targeted Minimum Loss-Based Estimation for Longitudinal Data. J Stat Soft. 2017;81(1). Available from: http://www.jstatsoft.org/v81/i01/. Cited 2025 May 31.
  • 59.Little RJ. Selection Model (Missing Data). In: Kenett RS, Longford NT, Piegorsch WW, Ruggeri F, editors. Wiley StatsRef: Statistics Reference Online. 1st ed. Wiley; 2016. p. 1–5. Available from: https://onlinelibrary.wiley.com/doi/10.1002/9781118445112.stat07871. Cited 2025 Jan 30.
  • 60.Little RJ, Wang Y. Pattern-mixture models for multivariate incomplete data with covariates. Biometrics. 1996;52(1):98–111. [PubMed] [Google Scholar]
  • 61.Colnet B, Mayer I, Chen G, Dieng A, Li R, Varoquaux G, et al. Causal Inference Methods for Combining Randomized Trials and Observational Studies: A Review. Statist Sci. 2024;39(1). Available from: https://projecteuclid.org/journals/statistical-science/volume-39/issue-1/Causal-Inference-Methods-for-Combining-Randomized-Trials-and-Observational-Studies/10.1214/23-STS889.full. Cited 2025 Jan 31.
  • 62.Elm EV, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ. 2007;335(7624):806–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Tricco AC, Lillie E, Zarin W, O’Brien KK, Colquhoun H, Levac D, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Ann Intern Med. 2018;169(7):467–73. [DOI] [PubMed] [Google Scholar]
  • 64.Seaman SR, White IR. Review of inverse probability weighting for dealing with missing data. Stat Methods Med Res. 2013;22(3):278–95. [DOI] [PubMed] [Google Scholar]
  • 65.Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A. Rayyan—a web and mobile app for systematic reviews. Syst Rev. 2016;5(1): 210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.StataCorp. Stata Statistical Software: Release 18. StataCorp; 2023.
  • 67.Johansson SE, Jansåker F, Sundquist K, Bygren LO. A longitudinal study of the association between attending cultural events and coronary heart disease. Commun Med. 2023;3(1). Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203698118&doi=10.1038%2fs43856-023-00301-0&partnerID=40&md5=78631090598fe63f466ac89111801389. [DOI] [PMC free article] [PubMed]
  • 68.Koch B, Sainburg T, Bastias P, Jiang S, Sun Y, Foster J. A Primer on Deep Learning for Causal Inference. Sociological Methods & Research. 2024;54(2):397–447. 10.1177/00491241241234866.
  • 69.Ishii T, Seya N, Taguri M, Wakui H, Yoshimura A, Tamura K. Allopurinol, febuxostat, and nonuse of xanthine oxidoreductase inhibitor treatment in patients receiving hemodialysis: a longitudinal analysis. Kidney Med. 2024;6(11): 100896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Saghafian S. Ambiguous dynamic treatment regimes: a reinforcement learning approach. Manage Sci. 2024;70(9):5667–90. [Google Scholar]
  • 71.Tchetgen E, Ying A, Cui Y, Shi X, Miao W. An introduction to proximal causal inference. Statist Sci. 2024;39(3):375–90. [Google Scholar]
  • 72.Li F, Ding P, Mealli F. Bayesian causal inference: a critical review. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2023;381(2247). Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85150990496&doi=10.1098%2frsta.2022.0153&partnerID=40&md5=2dc515e3ac0ab03a1e761ec7b9fbe2da. [DOI] [PubMed]
  • 73.Ai B, Zhang J, Zhang S, Chen G, Tian F, Chen L, et al. Causal association between long-term exposure to air pollution and incident Parkinson’s disease. J Hazardous Mater. 2024;469. Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85187219968&doi=10.1016%2fj.jhazmat.2024.133944&partnerID=40&md5=4eb1c70a003c87ca47113e7ef7e45e18. [DOI] [PubMed]
  • 74.Sewak A, Lodi S, Li X, Shu D, Wen L, Mayer KH, et al. Causal effects of stochastic PrEP interventions on HIV incidence among men who have sex with men. Am J Epidemiol. 2024;193(1):6–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Keogh RH, Gran JM, Seaman SR, Davies G, Vansteelandt S. Causal inference in survival analysis using longitudinal observational data: sequential trials and marginal structural models. Stat Med. 2023;42(13):2191–225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Xu Y, Kim JS, Hummers LK, Shah AA, Zeger SL. Causal inference using multivariate generalized linear mixed-effects models. Biometrics. 2024;80(3). Available from: https://pubmed.ncbi.nlm.nih.gov/39319549/. Cited 7AD Jan 1. [DOI] [PMC free article] [PubMed]
  • 77.Petrich DM. Chronic exposure to community violence and criminal behavior: a marginal structural modeling approach. J Quant Criminol. 2024;40(4):671–705. [Google Scholar]
  • 78.Rodriguez CA, Lodi S, Horsburgh CR, Mitnick CD, Bastard M, Huerga H, et al. Comparative effectiveness of adding delamanid to a multidrug-resistant tuberculosis regimen comprised of three drugs likely to be effective. PLoS Glob Public Health. 2023. 10.1371/journal.pgph.0000818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Mussini C, Cozzi-Lepri A, Meschiari M, Franceschini E, Burastero G, Faltoni M, et al. Do All Critically Ill Patients with COVID-19 Disease Benefit from Adding Tocilizumab to Glucocorticoids? A Retrospective Cohort Study. Viruses. 2023;15(2). Available from: https://pubmed.ncbi.nlm.nih.gov/36851508/. Cited 1AD Jan 1. [DOI] [PMC free article] [PubMed]
  • 80.Huang S, Yang B, Peng Y, Wang L, Wang L, Wang J, et al. Early sodium bicarbonate infusion was associated with improved outcome in critically ill patients with acute moderate metabolic acidosis: a retrospective observational study. Emerg Crit Care Med. 2024;4(4):143–8. [Google Scholar]
  • 81.Lagerberg T, Matthews AA, Zhu N, Fazel S, Carrero JJ, Chang Z. Effect of selective serotonin reuptake inhibitor treatment following diagnosis of depression on suicidal behaviour risk: a target trial emulation. Neuropsychopharmacology. 2023;48(12):1760–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Kiuchi S, Cooray U, Aida J, Osaka K, Chan A, Malhotra R, et al. Effect of tooth loss on cognitive function among older adults in Singapore. J Dent Res. 2023;102(8):871–8. [DOI] [PubMed] [Google Scholar]
  • 83.Pipa J, Daniel JR, Peixoto F. Effects of grade retention in lower secondary education on students’ self-concept, self-esteem, goal orientations, and school career. Psychol Sch. 2024;61(5):1897–921. [Google Scholar]
  • 84.Seib C, Ganesan C, Furst A, Pao A, Chertow G, Leppert J, et al. Estimated effect of parathyroidectomy on long-term kidney function in adults with primary hyperparathyroidism. Ann Intern Med. 2023;176(5):624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Chen S, Zhang B. Estimating and improving dynamic treatment regimes with a time-varying instrumental variable. J R Stat Soc Series B Stat Methodol. 2023;85(2):427–53. [Google Scholar]
  • 86.Hu L, Ji J, Joshi H, Scott ER, Li F. Estimating the causal effects of multiple intermittent treatments with application to COVID-19. ArXiv. 2023. Available from: https://pubmed.ncbi.nlm.nih.gov/34981032/. Cited 8AD Jan 1.
  • 87.Qiu S, Hubbard AE, Gutiérrez JP, Pimpale G, Juárez-Flores A, Ghosh R, et al. Estimating the effect of realistic improvements of metformin adherence on COVID-19 mortality using targeted machine learning. Global Epidemiology. 2024;7: 100142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Loh WW, Ren D. Estimating Time-Varying Treatment Effects in Longitudinal Studies. Psychological Methods. 2023; Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168847273&doi=10.1037%2fmet0000574&partnerID=40&md5=6b520ec7624387ce8142f8b0b5c7744d. [DOI] [PubMed]
  • 89.Schäfer Z, Mathisen A, Thomsen TR, Rossing P, Kirketerp-Møller K. Glucagon-like peptide-1 treatment reduces the risk of diabetes-type 2 related amputations: A cohort study in Denmark. Diabetes Research and Clinical Practice. 2023;202. Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85165006540&doi=10.1016%2fj.diabres.2023.110799&partnerID=40&md5=d827654dbb400186674f3352dc41fc61. [DOI] [PubMed]
  • 90.Barbulescu A, Sjölander A, Delcoigne B, Askling J, Frisell T. Glucocorticoid exposure and the risk of serious infections in rheumatoid arthritis: a marginal structural model application. Rheumatology (Oxford). 2023;62(10):3391–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Marschner N, Haug N, Hegewisch-Becker S, Reiser M, Dörfel S, Lerchenmüller C, et al. Head-to-head comparison of treatment sequences in advanced pancreatic cancer—real-world data from the prospective German TPK clinical cohort study. Int J Cancer. 2024;155(9):1629–40. [DOI] [PubMed] [Google Scholar]
  • 92.Edmonds A, Haley D, Edwards J, Ramirez C, French A, Tien P, et al. Health insurance and initiation of direct-acting antivirals for hepatitis C in US women with human immunodeficiency virus. Clin Infect Dis. 2023;77(2):258–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Rassen JA, Blin P, Kloss S, Neugebauer RS, Platt RW, Pottegård A, et al. High-dimensional propensity scores for empirical covariate selection in secondary database studies: planning, implementation, and reporting. Pharmacoepidemiol Drug Saf. 2023;32(2):93–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Peng C, Peng L, Yang F, Yu H, Wang P, Cheng C, et al. Impact of early tracheostomy on clinical outcomes in trauma patients admitted to the intensive care unit: a retrospective causal analysis. J Cardiothorac Vasc Anesth. 2023;37(12):2584–91. [DOI] [PubMed] [Google Scholar]
  • 95.Bonnesen K, Pedersen L, Ehrenstein V, Sørensen HT, Lash TL, Schmidt M. Impact of hemoglobin A1c level on the association between non-steroidal anti-inflammatory drug use and cardiovascular events in patients with type 2 diabetes: a population-based cohort study. Pharmacoepidemiol Drug Saf. 2023;32(11):1233–43. [DOI] [PubMed] [Google Scholar]
  • 96.Cooray U, Tsakos G, Heilmann A, Watt RG, Takeuchi K, Kondo K, et al. Impact of teeth on social participation: modified treatment policy approach. J Dent Res. 2023;102(8):887–94. [DOI] [PubMed] [Google Scholar]
  • 97.Shimizu S, Niihata K, Nishiwaki H, Shibagaki Y, Yamamoto R, Nitta K, et al. Initiation of renin–angiotensin system inhibitors and first complete remission in patients with primary nephrotic syndrome: a nationwide cohort study. Clin Exp Nephrol. 2023;27(5):480–9. [DOI] [PubMed] [Google Scholar]
  • 98.Cui Y, Michael H, Tanser F, Tchetgen Tchetgen E. Instrumental variable estimation of the marginal structural Cox model for time-varying treatments. Biometrika. 2023;110(1):101–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Rein SM, Lodi S, Logan RW, Touloumi G, Antoniadou A, Wittkop L, et al. Integrase strand-transfer inhibitor use and cardiovascular events in adults with HIV: an emulation of target trials in the HIV-CAUSAL Collaboration and the Antiretroviral Therapy Cohort Collaboration. Lancet HIV. 2023;10(11):e723–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Han Y, Lam J, Li V, Crowcroft J. Interpretable AI-driven causal inference to uncover the time-varying effects of PM2.5 and public health interventions on COVID-19 infection rates. Humanit Soc Sci Commun. 2024. 10.1057/s41599-024-04202-y. [Google Scholar]
  • 101.Peres MA, Peres KG, Chan A, Wu B, Mittinty M. Investigating the causal effect of cognition on the self-reported loss of functional dentition using marginal structural models: the panel on health and ageing of Singaporean elderly study. J Clin Periodontol. 2023;50(4):408–17. [DOI] [PubMed] [Google Scholar]
  • 102.Mulder JD, Usami S, Hamaker EL. Joint Effects in Cross-Lagged Panel Research Using Structural Nested Mean Models. Struct Equation Model. 2024; Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196269444&doi=10.1080%2f10705511.2024.2355579&partnerID=40&md5=590611331ea837bc75edff0e76f0bb01.
  • 103.Urner M, Jüni P, Rojas-Saunero LP, Hansen B, Brochard LJ, Ferguson ND, et al. Limiting dynamic driving pressure in patients requiring mechanical ventilation. Crit Care Med. 2023;51(7):861–71. [DOI] [PubMed] [Google Scholar]
  • 104.Takahashi Y, Minagawa K, Nagashima T, Hayakawa T, Akimoto H, Asai S. Long-term benefit of SGLT2 inhibitors to prevent heart failure hospitalization in patients with diabetes, with potential time-varying benefit. Clin Transl Sci. 2024;17(12). Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85212400253&doi=10.1111%2fcts.70088&partnerID=40&md5=b5b5912ab8f5f2cbdf8efc59a99588cf. [DOI] [PMC free article] [PubMed]
  • 105.Kennedy TM, Kennedy EH, Ceballo R. Marginal structural models for estimating the longitudinal effects of community violence exposure on youths’ internalizing and externalizing symptoms. Psychol Trauma. 2023;15(6):906–16. [DOI] [PubMed] [Google Scholar]
  • 106.Stulberg EL, Delic A, Zheutlin AR, Steinberg BA, Yaghi S, Sharma R, et al. Modelling anticoagulation and health-related quality of life in those with atrial fibrillation: a secondary analysis of AFFIRM. Clin Res Cardiol. 2024;113(8):1200–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Visontay R, Mewton L, Slade T, Aris IM, Sunderland M. Moderate alcohol consumption and depression: a marginal structural model approach promoting causal inference. Am J Psychiatry. 2023;180(3):209–17. [DOI] [PubMed] [Google Scholar]
  • 108.Weir DL, Bai YQ, Thavorn K, Guilcher S, Kanji S, Mulpuru S, et al. Non-adherence to COPD medications and its association with adverse events: a longitudinal population based cohort study of older adults. Ann Epidemiol. 2024;96:88–96. [DOI] [PubMed] [Google Scholar]
  • 109.Jun I, Cohen SA, Ser SE, Marini S, Lucero RJ, Bian J, et al. Optimizing Dynamic Antibiotic Treatment Strategies against Invasive Methicillin-Resistant Staphylococcus Aureus Infections using Causal Survival Forests and G-Formula on Statewide Electronic Health Record Data. Proc Mach Learn Res. 2023;218:98–115. [PMC free article] [PubMed] [Google Scholar]
  • 110.Loh WW, Ren D, West SG. Parametric g-formula for testing time-varying causal effects: what it is, why it matters, and how to implement it in Lavaan. Multivar Behav Res. 2024;59(5):995–1018. [DOI] [PubMed] [Google Scholar]
  • 111.Matsuyama Y, Isumi A, Doi S, Fujiwara T. Persistent poverty and child dental caries: time-varying exposure analysis. J Epidemiol Community Health. 2023;77(10):670–5. [DOI] [PubMed] [Google Scholar]
  • 112.Adesuyan M, Jani YH, Alsugeir D, Howard R, Ju C, Wei L, et al. Phosphodiesterase type 5 inhibitors in men with erectile dysfunction and the risk of Alzheimer disease: a cohort study. Neurology. 2024;102(4): e209131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Zhang Y, Wang Y, Du Z, Chen S, Qu Y, Hao C, et al. Potential causal links between long-term ambient particulate matter exposure and cardiovascular mortality: new evidence from a large community-based cohort in South China. Ecotoxicol Environ Saf. 2023;254: 114730. [DOI] [PubMed] [Google Scholar]
  • 114.Chen S, Lin X, Du Z, Zhang Y, Zheng L, Ju X, et al. Potential causal links between long-term ambient particulate matter exposure and cerebrovascular mortality: Insights from a large cohort in southern China. Environmental Pollution. 2023;328. Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85152745373&doi=10.1016%2fj.envpol.2023.121336&partnerID=40&md5=dce9d2780c04d2be7026dc7827fb78f9. [DOI] [PubMed]
  • 115.Pierce M, Bai Y, Nevriana A, Dalman C, Hope HF, Kosidou K, et al. Prevention of childhood adversities and children’s common mental disorders and school grades. JAMA Netw Open. 2023;6(10):e2336408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Raittio E, Lopez R, Baelum V. Restricting Periodontal Treatment Frequency: Impact on Tooth Loss in Danish Adults. Community Dentistry and Oral Epidemiology. 2024; Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85212779648&doi=10.1111%2fcdoe.13022&partnerID=40&md5=eb7a300afb9519704f3f1feb90772838. [DOI] [PMC free article] [PubMed]
  • 117.Yun H, Chen L, Roy JA, Greenberg J, Harrold LR, George MD, et al. Rheumatoid arthritis disease activity and hospitalized infection in a large US registry. Arthritis Care Res. 2023;75(8):1639–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Hoisnard L, Meyer A, Dray-Spira R, Weill A, Zureik M, Sbidian E. Risk of gastrointestinal perforation in patients with rheumatic diseases exposed to Janus kinase inhibitors versus adalimumab: a nationwide cohort study. Arthritis Rheumatol. 2024;76(9):1364–76. [DOI] [PubMed] [Google Scholar]
  • 119.Chen S, Zhang Y, Wei J, Hao C, Wu W, Li Z, et al. Risk of stroke admission after long-term exposure to PM(1): evidence from a large cohort in South China. Ecotoxicol Environ Saf. 2024;283: 116720. [DOI] [PubMed] [Google Scholar]
  • 120.Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. Semiparametric Bayesian inference for optimal dynamic treatment regimes via dynamic marginal structural models. Biostatistics. 2023;24(3):708–27. [DOI] [PubMed] [Google Scholar]
  • 121.Hoffman KL, Salazar-Barreto D, Williams NT, Rudolph KE, Díaz I. Studying continuous, time-varying, and/or complex exposures using longitudinal modified treatment policies. Epidemiology. 2024;35(5):667–75. [DOI] [PubMed] [Google Scholar]
  • 122.Liu Y, Schnitzer ME, Herrera R, Díaz I, O’Loughlin J, Sylvestre MP. The application of target trials with longitudinal targeted maximum likelihood estimation to assess the effect of alcohol consumption in adolescence on depressive symptoms in adulthood. Am J Epidemiol. 2024;193(6):835–45. [DOI] [PubMed] [Google Scholar]
  • 123.Song J, Yang X, Wu J, Wu Z, Niu S, Zhuo L, et al. The association analysis between fatigue and body composition loss in patients with nasopharyngeal carcinoma during radiotherapy: an observational longitudinal study. Radiother Oncol. 2024;197: 110340. [DOI] [PubMed] [Google Scholar]
  • 124.Keen R, Chen JT, Slopen N, Sandel M, Copeland WE, Tiemeier H. The biological embedding of social adversity: how adolescent housing insecurity impacts inflammation over time. Brain Behav Immun. 2024;119:1008–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Wang Y, Wang Z, Jiang J, Guo T, Chen S, Li Z, et al. The effect of long-term particulate matter exposure on respiratory mortality: cohort study in China. JMIR Public Health Surveill. 2024. 10.2196/56059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Ahn N, Wawro N, Baumeister SE, Nolde M, Gerlach R, Tauscher M, et al. Time-varying use of proton pump inhibitors and cognitive impairment and dementia: a real-world analysis from Germany. Drugs Aging. 2023;40(7):653–63. [DOI] [PubMed] [Google Scholar]
  • 127.Narita Z, Devylder J, Yamasaki S, Ando S, Endo K, Miyashita M, et al. Uncovering associations between gender nonconformity, psychosocial factors, and mental health in adolescents: a birth cohort study. Psychol Med. 2024;54(5):921–30. [DOI] [PubMed] [Google Scholar]
  • 128.Rizopoulos D, Taylor JM, Papageorgiou G, Morgan TM. Using joint models for longitudinal and time-to-event data to investigate the causal effect of salvage therapy after prostatectomy. Stat Methods Med Res. 2024;33(5):894–908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Usami S. Within-person variability score-based causal inference: a two-step estimation for joint effects of time-varying treatments. Psychometrika. 2023;88(4):1466–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Börnhorst C, Pigeot I, De Henauw S, Formisano A, Lissner L, Molnár D, et al. The effects of hypothetical behavioral interventions on the 13-year incidence of overweight/obesity in children and adolescents. Int J Behav Nutr Phys Act. 2023;20(1). Available from: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168707470&doi=10.1186%2fs12966-023-01501-6&partnerID=40&md5=378fb8d6ad7e10611ebdc18f30ac46cf. [DOI] [PMC free article] [PubMed]
  • 131.Keen R, Chen JT, Slopen N, Sandel M, Copeland WE, Tiemeier H. The biological embedding of social adversity: how adolescent housing insecurity impacts inflammation over time. Brain Behav Immun. 2024;119:1008–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Birnie K, Tomson C, Caskey F, Ben-Shlomo Y, Nitsch D, Casula A, et al. Comparative effectiveness of dynamic treatment strategies for medication use and dosage: emulating a target trial using observational data. Epidemiology. 2023;34(6):879–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Tanios M, Wu TT, Nguyen MH, Smith L, Mahidhara R, Devlin JW. Comparing the impact of targeting limited driving pressure to low tidal volume ventilation on mortality in mechanically ventilated adults with COVID-19 ARDS: an exploratory target trial emulation. BMJ Open Respir Res. 2024;11(1): e002439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Ko Y, Howard SC, Golden AP, French B. Adjustment for duration of employment in occupational epidemiology. Ann Epidemiol. 2024;94:33–41. [DOI] [PubMed] [Google Scholar]
  • 135.Newsome SJ, Keogh RH, Daniel RM. Estimating long-term treatment effects in observational data: a comparison of the performance of different methods under real-world uncertainty. Stat Med. 2018;37(15):2367–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Smith MJ, Phillips RV, Luque-Fernandez MA, Maringe C. Application of targeted maximum likelihood estimation in public health and epidemiological studies: a systematic review. Ann Epidemiol. 2023;86:34-48.e28. [DOI] [PubMed] [Google Scholar]
  • 137.Talbot D, Diop A, Mésidor M, Chiu Y, Sirois C, Spieker AJ, et al. Guidelines and best practices for the use of targeted maximum likelihood and machine learning when estimating causal effects of exposures on time-to-event outcomes. Stat Med. 2025;44(6): e70034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Chen X, Harhay MO, Tong G, Li F. A bayesian machine learning approach for estimating heterogeneous survivor causal effects: applications to a critical care trial. Ann Appl Stat. 2024;18(1):350–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Streeter AJ, Lin NX, Crathorne L, Haasova M, Hyde C, Melzer D, et al. Adjusting for unmeasured confounding in nonrandomized longitudinal studies: a methodological review. J Clin Epidemiol. 2017;87:23–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Lee KJ, Tilling KM, Cornish RP, Little RJA, Bell ML, Goetghebeur E, et al. Framework for the treatment and reporting of missing data in observational studies: the treatment and reporting of missing data in observational studies framework. J Clin Epidemiol. 2021;134:79–88. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1. (17.2KB, docx)

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.


Articles from BMC Medical Research Methodology are provided here courtesy of BMC

RESOURCES