Skip to main content
PMC Canada Author Manuscripts logoLink to PMC Canada Author Manuscripts
. Author manuscript; available in PMC: 2018 Mar 21.
Published before final editing as: Stat Methods Med Res. 2016 Sep 21;27(6):1709–1722. doi: 10.1177/0962280216668554

Comparison of Statistical Approaches Dealing with Time-dependent Confounding in Drug Effectiveness Studies

Mohammad Ehsanul Karim 1,2,*, John Petkau 3, Paul Gustafson 3, Robert W Platt 1,2,4, Helen Tremlett 5; The BeAMS study group6
PMCID: PMC5770217  CAMSID: CAMS6817  PMID: 27659168

Abstract

In longitudinal studies, if the time-dependent covariates are affected by the past treatment, time-dependent confounding may be present. For a time-to-event response, marginal structural Cox models (MSCMs) are frequently used to deal with such confounding. To avoid some of the problems of fitting MSCM, the sequential Cox approach has been suggested as an alternative. Although the estimation mechanisms are different, both approaches claim to estimate the causal effect of treatment by appropriately adjusting for time-dependent confounding. We carry out simulation studies to assess the suitability of the sequential Cox approach for analyzing time-to-event data in the presence of a time-dependent covariate that may or may not be a time-dependent confounder. Results from these simulations revealed that the sequential Cox approach is not as effective as MSCM in addressing the time-dependent confounding. The sequential Cox approach was also found to be inadequate in the presence of a time-dependent covariate. We propose a modified version of the sequential Cox approach that correctly estimates the treatment effect in both of the above scenarios. All approaches are applied to investigate the impact of beta-interferon treatment in delaying disability progression in the British Columbia Multiple Sclerosis cohort (1995 – 2008).

Keywords: bias (epidemiology), causality, confounding factors (epidemiology), epidemiologic methods, inverse probability weighting, longitudinal studies, models, survival analysis

1 Introduction

Longitudinal studies can include regular measurements of clinical symptoms and disease activity as covariates, and it is natural that the values may change over time. Since the predictive ability of baseline covariates may decrease over the follow-up time, consideration of the full history of these time-dependent covariates, rather than just the baseline covariates would be preferable1. However, if these covariates are affected by previous treatment and predicts the future treatment decision and future outcome conditional on the past treatment exposure, then such covariates are popularly known as “time-dependent confounders”2,3. If the causal effect of treatment is of interest, the estimated hazard ratio may be biased whether or not the time-dependent confounders are included as covariates in a time-dependent Cox model analysis2,4. In the presence of time-dependent confounding, marginal structural Cox models (MSCM) are frequently used to estimate the causal effect of a time-dependent treatment exposure5,6. Sometimes MSCM estimates are unstable due to use of variable inverse probability of treatment weights (IPTW)79. The sequential Cox approach has been proposed as an alternative to the MSCM approach9.

In a previous simulation study, we showed that a simplified implementation of the sequential Cox approach (considering only baseline covariate adjustment) performed well compared to a Cox proportional hazards model fit with time-dependent exposure while adjusting for baseline confounders10. To the best of our knowledge, no attempt has been made to explore the appropriateness of the sequential Cox approach in adjusting for the effect of time-dependent confounding in a simulation setting.

To overcome the limitations of this approach revealed by our simulations, we also propose a modified version of the sequential Cox approach in this paper. The primary focus of this paper is to assess the performance of these sequential Cox approaches for dealing with a time-dependent confounder. A secondary aim of this paper is to examine how these methods perform in the presence of a time-dependent covariate which does not interact with the past treatment condition (i.e., is not a ‘time-dependent confounder’). To do this, we simulate survival data with time-dependent treatment exposure. Two different conditions are considered for simulation: (1) a time-dependent confounder is present, and (2) a time-dependent covariate is present along with a baseline covariate. To assess their suitability in an application, we apply these methods to investigate the impact of time-varying beta-interferon treatment in delaying disability progression in subjects from the British Columbia Multiple Sclerosis (MS) database (1995–2008)11,12.

The remainder of the paper is organized as follows. In the next section, we describe the notation and design of the simulation study, the methods used to address time-dependent confounding, and the metrics used to evaluate their performances. Then we summarize the simulation and the MS data analysis results. The paper concludes with a discussion of the results, and the implications and limitations of the current study.

2 Methods

2.1 Notation

Consider a hypothetical longitudinal study consisting of n subjects (i = 1, 2, …, n). Let t0 = 0 be the start of follow-up or the time of the baseline clinic visit. Baseline covariates L0 (binary or continuous) are recorded at baseline. Follow-up continues until the time of failure T or the time of censoring TC. Let that, [tm, tm+1) constitutes the m-th interval (say, m-th month in the follow-up). At intervals m = 0, 1, 2, …, K, regular measurements of the binary treatment status Am (= 1 for treated and 0 otherwise) are recorded. Let Cm be the binary indicator of censoring (= 1 if censored due to dropout or artificial censoring and 0 otherwise). Let ām = (a0, a1, …, am) be the observed realizations of the treatment history Ām up to interval m, and similarly, let m and m be the observed realizations of the covariate history m and the censoring history m up to interval m respectively. The binary indicator of failure by time tm+1 is defined as Ym+1 = I(Ttm+1). As the sequential Cox approach does not allow for treatment discontinuation, we assume that the subjects may initiate treatment at most once and that they continue on the treatment thereafter until the end of their follow-up. Let treatment initiation occur at time TA.

2.2 Analysis Approaches

Brief characteristics of the analysis approaches are shown in Table 1. We describe these methods in detail in the following sections using the notation defined above.

Table 1.

Description of the Cox models used in the approaches under consideration.

Approach Stratified Time-dependent covariate history Weight adjusted
TD-Cox No Full No
MSCM No Full Yes, IPTC#
Sequential Cox Yes Up to the new baseline Yes, IPC§
Modified Sequential Cox Yes New baseline and afterwards†† Yes, IPC§

TD-Cox, Cox model with time-dependent exposure; MSCM, Marginal structural Cox model; IPT, Inverse probability of treatment; IPC, Inverse probability of censoring; IPTC, Inverse probability of treatment and censoring.

For the sequential Cox approach, covariate values are collected at three time points for each mini-trial: at baseline, at the interval of treatment start and at the previous interval (the lagged value): m = (L0, Lm−1, Lm). Here, time-fixed covariates collected at the original baseline (i.e., L0) are included in the analysis.

††

For the modified sequential Cox approach, the time-dependent covariate values are collected at the new baseline and then at subsequent intervals (i.e., L⃗m = (Lm, Lm+1, …, LK)). Time-fixed covariates collected at the original baseline (i.e., L0) are also included in the analysis.

Robust (sandwich) estimate is used to obtain SEs.

#

Pooled logistic regression is used to estimate the IPTC weights.

§

Aalen’s additive regression model is used to estimate the IPCW.

2.2.1 Cox Model with Time-dependent Treatment and Covariates

In the presence of baseline confounders L0 and time-dependent covariates Lm, one way to express the hazard function through the time-dependent Cox model is as follows:

λ(mL0,Lm)=λ0(m)exp(ψ1Am+ψ2L0+ψ3Lm,) (1)

where m is the visit index, λ0(m) is the unspecified baseline hazard function, ψ̃1 is the log-hazard ratio (log-HR) of the time-dependent treatment status (Am), ψ2 and ψ3 are the vectors of log-HRs for the baseline covariates L0 and the time-dependent covariates Lm, respectively.

2.2.2 Marginal Structural Cox Models

If the time-dependent covariate Lm is influenced by past exposure, i.e., if Lm is a time-dependent confounder, playing a dual role as a confounder and an intermediate variable in the causal pathway between treatment and outcome, then ψ1 as estimated from equation (1) may be biased2 (i.e., may deviate from the target parameter ψ1; discussed in detail in Web-Appendix §A). Instead of using Lm as a covariate, the MSCM approach uses it to calculate IPTW that are person-time specific measures of the degree to which Lm confounds the treatment selection process.

Stabilized inverse probability of treatment and censoring (IPTC) weight, swm, can be obtained by multiplying stabilized IPTW, swmT, by stabilized inverse probability of censoring weights (IPCW), swmC5, where

swmT=j=0mpr(Aj=ajA¯j-1=a¯j-1,L0=l0)pr(Aj=ajA¯j-1=a¯j-1,L0=l0,L¯j=l¯j), (2)

and

swmC=j=0mpr(Cj=0C¯j-1=0,A¯j-1=a¯j-1,L0=l0)pr(Cj=0C¯j-1=0,A¯j-1=a¯j-1,L0=l0,L¯j-1=l¯j-1). (3)

The weights swm are used in the time-dependent Cox model with hazard function modelled as follows to weight the contribution of each person-time observation so that the confounding due to Lm is removed:

λ(mL0)=λ0(m)exp(ψ1Am+ψ2L0), (4)

where ψ1 is log-HR of the time-dependent treatment status (Am). Note that IPCW is used only if non-random censoring is present. When the numerators in equations (2) and (3) are replaced by 1, these become the unstabilized IPTC weights, wm. We used pooled logistic regression2,5 to estimate the IPTC weights. Estimation procedure details are included in Web-Appendix §B.

2.2.3 Sequential Cox Approach

Suppose that at least one subject initiates treatment in the m-th interval [tm, tm+1). We want to mimic a randomized clinical trial for each such interval. The mini-trial corresponding to the m-th interval (hereafter referred to as the m-th mini-trial) involves only subjects who have not previously received any treatment. Among the subjects at-risk at tm the subjects initiating treatment during the interval (tm < TAtm+1) are considered as the treated group, while the remaining subjects are considered as the control group. These control subjects are artificially censored at their times of later treatment initiation (TA > tm+1) to avoid confounding due to treatment. As these subjects are artificially censored, the analysis must be adjusted using IPCW.

In the analysis, we adjust for the baseline confounders L0 measured at inclusion or baseline, the time-dependent covariates Lm measured at the start of the interval when patient started the treatment and the lagged covariates Lm−1 consisting of the lagged value measured at the previous interval of the treatment start. Adjustment of these covariate values should help to reduce bias in the estimation of the treatment effect from the m-th mini-trial data9. Let us denote m = (L0, Lm−1, Lm). Here, L0 includes the covariate values measured at baseline (i.e., time-static baseline covariates as well as the time-varying covariate or confounder value at time-point m = 0).

We assume that the different mini-trials may have different baseline hazard functions but all subjects in the same mini-trial have the same baseline hazard function. Under this assumption, use of a stratified Cox model is appropriate. Therefore, one way to model the hazard function for the m-th mini-trial is9:

λm(jL0,L¯j)=λ0m(j)exp(ψˋ1Aj+ψˋ2Lm);jm (5)

where λ0m(j) is the unspecified baseline hazard function for stratum m, ψ`1 is log-HR of the time-dependent treatment status and ψ`2 is the vector of log-HRs for the time-dependent covariates m. This hazard function should be weighted by IPCW (equation (3)). It was suggested that the resulting estimate should bear a causal interpretation under the assumptions of no unmeasured confounders and correct model specification for the hazard ratio and the censoring weights9. We used Aalen’s additive regression model9,13 to estimate the IPCW.

We can fit a stratified Cox model to the combined data of all mini-trials (pseudo-data), stratified by the treatment initiation time. Inclusion of the same subject more than once invalidates the SE obtained from the stratified weighted Cox analysis. Computationally demanding resampling methods are suggested9,13 to obtain a correct SE. We used a robust (sandwich) estimate instead to save computational time similar to other simulation studies14,15. An illustrative data construction example is provided in Web-Appendix §C and the corresponding software implementation details are provided in Web-Appendix §D.

2.2.4 Modified Sequential Cox Approach

As each of the mini-trials mimics a clinical trial, we propose to analyze the mini-trial data accordingly. Each mini-trial is created based on a particular month of treatment initiation. These treatment initiation months are considered as the new baselines (new time-0) for the corresponding mini-trials. Note that the time-fixed covariates measured at the original baseline (i.e., L0) are included in the analysis. For covariates that vary over time, we consider all the information from the new baseline to the study end-point to analyze the data, as we would do in a clinical trial setting. For example, time-varying covariate or confounder values collected at the interval of treatment start and onward (i.e., L⃗m = (Lm, Lm+1, …, LK) for the m-th mini-trial) are included in the model for adjustment16. Unlike the original proposal9, we do not use any time-dependent confounder (or time-dependent covariate) values prior to the new baseline for adjustment. Therefore, the hazard function for the m-th mini-trial can be expressed as:

λm(jL0,L¯j)=λ0m(j)exp(ψ1Aj+ψ2Lm);jm (6)

where ψ1 is log-HR of the time-dependent treatment status and ψ2 is the vector of log-HRs for the time-dependent covariates L⃗m. However, the process of creating the pseudo-population remains the same. The proposed changes occur only in the analysis stage. Theoretically proving the equivalence of the target parameters ψ1 from equation (6) and ψ1 from equation (4) is not easy, but we have provided a heuristic justification of such equivalence later based on a Monte Carlo experiment. In some context, additional techniques, such as, matching17,18 or use of propensity scores19 or adherence-adjustments16 may be useful to make the subjects within a mini-trial more comparable, but those approaches are not considered here.

2.3 Design of Simulation

We adopt the data generation process of Young et al.20 to simulate survival times where time-dependent confounding is present. To simulate survival times with time-dependent covariates (none of which are time-dependent confounders), we adapt the permutation algorithm21. Descriptions of these algorithms are presented in Web-Appendices §E and §F respectively.

2.4 Simulation Specifications

In our Monte Carlo study, we generated N = 1, 000 datasets with n = 2, 500 subjects, each followed for up to m = 10 subsequent monthly visits for each setting under consideration. We set λ0 = 0.01 (on a monthly scale) to represent a rare disease condition and λ0 = 0.10 (on a monthly scale) for a more frequent disease condition. We discuss a brief description of the two simulations under consideration in Table 2. Below we provide the detailed specifications of the simulation scenarios.

Table 2.

Two simulation settings under consideration.

Simulation - I Simulation - II
Algorithm 20 21
Time-varying treatment Yes Yes
Baseline covariate No Yes
Time-varying covariate No Yes
Time-varying confounder Yes No

2.4.1 Simulation - I

In our implementation of the algorithm20, counterfactual failure time Ti ’s are sampled from an exponential distribution, with constant λ0 rate of monthly events throughout the follow-up. The binary time-dependent confounder, Lm, is modelled by the following covariates: a binary covariate I(Tc), previous treatment status Am−1, and the lagged variable Lm−1:

logit(pL)=logitPr(Lm=1Am-1,Lm-1,Ym=0;β)=β0+β1I(T0¯<c)+β2Am-1+β3Lm-1, (7)

with associated parameters β = (β0, β1, β2, β3) = (log(3/7), 2, log(1/2), log(3/2)), c = 30 and Ym = I(Ttm). Here, the time-dependent covariate Lm is moderately affected by prior treatment Am−1 (β2 = log(1/2) = −0.3).

We model binary treatment status at each stage Am with the factors current symptom Lm, past symptom Lm−1, and previous treatment status Am−1 as

logit(pA)=logitPr(Am=1Lm,Am-1,Lm-1,Ym=0;α)=α0+α1Lm+α2Lm-1+α3Am-1, (8)

with associated parameters α = (α0, α1, α2, α3) = (log(2/7), 1/2, 1/2, 10). Current treatment status Am is made heavily dependent on the previous treatment status Am−1 by setting a high parameter value (α3 = 10). That way, we emulate the situation where subjects switch to treatment at most once and keep on using the treatment without much interruption or discontinuation. The true causal effect parameter (i.e., treatment effect) is set to be ψ1 = 0.5 (in equation (4)).

The equations (7) and (8) define Lm as a time-dependent confounder affected by prior treatment2. In particular, past treatment exposure status Am−1 affects the time-dependent confounder Lm, which then predicts future treatment exposure Am. Lm is also associated with the future failure status Ym+1 via I(T0c). Here, T0 is the untreated counterfactual survival time and c is an arbitrary cut-point used to generate the binary variable I(T0c). The value of c affects the degree of variability in the indicator variable I(T0c)22. Without I(T0c), there would not be any confounding in the exposure-outcome relationship. The confounding here arises via the path: Ym+1I(T0c) → LmAm. This indicator variable I(T0c) therefore dictates the degree to which T0 affects Lm for a chosen value of c.

2.4.2 Simulation - II

We assume an exponential distribution for generating failure times T with constant λ0 = 0.01 rate of monthly events throughout the follow-up. A uniform distribution U(1, 60) months is assumed to generate censoring times TC; i.e., administrative censoring is set at 5 years of follow-up. Treatment initiation time TA is generated from a uniform distribution U(0, 10) (in months). Additionally, we consider sex as a baseline confounder in these data. A subject’s sex is generated based on a Bernoulli distribution where the probability of being male is 0.3. We also add one time-dependent confounder Lm, which could represent cumulative disease activity, for example, such that higher cumulative disease activity has a higher risk (a log-HR of ψ3 = log(1.5)). This time-dependent confounder Lm is generated based on a Bernoulli distribution where the probability of disease activity increment is 0.75, accumulating the disease activity over at most m = 10 periods of time.

The permutation algorithm21 is used to generate survival data where binary treatment Am is time-dependent but the confounder L0 is fixed at its baseline value. Arbitrarily, the effect parameters for treatment and sex on the survival outcome are set such that the treatment has a harmful effect (a log-HR of ψ1 = 0.5) and males are at a lower risk than females (a log-HR of ψ2 = −0.7).

2.5 Performance Metrics

We assessed the performance of the various approaches by the following measures:

  • Bias=i=1N(ψ^1i-ψ1)/N: The average difference between the true and N = 1, 000 estimated parameters (log-HR).

  • SD=i=1N(ψ^1i-ψ¯1)2/(N-1) where ψ¯1=i=1Nψ^1i/N

  • Model-based SE: The average of N = 1, 000 estimated standard errors of the estimated causal effect.

  • Coverage probabilities of model-based nominal 95% CIs: Proportion of N = 1, 000 datasets in which the true parameter is contained in the nominal 95% CI.

The above quantities were defined in terms of MSCM parameter ψ1 from equation (4). In order to define the same measures from other analysis approaches under consideration, we assumed that the true target parameters (defined in equations (1), (4) and (6) respectively) are the same for all approaches, e.g., ψ1=ψ1=ψ1=0.5. A heuristic justification of such assumption of equivalence is provided at the section ‘Marginal versus Conditional Interpretations’.

3 Simulation Results

3.1 Description of the Simulated Data

To describe the data obtained from the simulation settings under the rare event condition, we generated datasets with a larger number of subjects (25, 000) with up to 10 subsequent visits from each simulation algorithm. The characteristics of the treated, untreated and partially treated groups, their failure rates and average number of visits are listed in Table 3.

Table 3.

Characteristics of the simulation settings under consideration.

Rates Simulation-I Simulation-II
Failure 0.143 0.084
Always treated 0.261 0.051
Never treated 0.046 0.150
Partially treated 0.692 0.799
Discontinuation 0.001 -
Mean visits 9.367 8.943

Simulation-I allows a few exceptions (19 out of 25, 000) where there are discontinuations. However, the proportion of discontinuation in the simulation-I dataset is negligible (0.00076) and we do not expect any noticeable impact in the results due to this small number of exceptions.

3.2 Rare Event Condition

We present the results from the rare event condition (λ0 = 0.01 in a monthly time-scale) in the two simulation settings.

3.2.1 Results From Simulation-I

Results from simulation-I are reported in Table 4. MSCM with treatment status (Am) is fitted to validate the data generating algorithm. The corresponding stabilized weights are generated based on the relationship between treatment status (Am) and the time-dependent confounder Lm. The level of bias is negligible compared to other approaches under consideration and the average coverage probability of the model-based nominal 95% CIs is 0.942. These results are now considered as the ideal for comparison purposes for this simulation setting. The time-dependent Cox models provide biased estimates (assuming ψ̃1 = ψ1 = 0.5) in the presence of this time-dependent confounder. The average coverage probability of the model-based nominal 95% CIs for the method is very low. The bias of sequential Cox analysis is comparable to that with the time-dependent Cox analysis, while the corresponding effect estimates are considerably more variable. The properties of the modified sequential Cox approach, on the other hand, are comparable to that of MSCM. The corresponding average coverage probability of the model-based nominal 95% CIs for the method is comparable to that with MSCM.

Table 4.

Comparison of the analytical approaches to adjust for time-dependent confounding from simulation-I (one time-dependent confounder and time-dependent treatment exposure) of 1, 000 datasets, each containing 2, 500 subjects followed for up to 10 time-intervals.

Approach Bias SD se Coverage Probability
TD-Cox§ 0.438 0.168 0.169 0.251
Sequential Cox#, 0.451 0.280 0.267 0.636
Modified Sequential Cox*,@ 0.015 0.221 0.222 0.956
MSCM 0.029 0.201 0.205 0.942

TD-Cox, Cox model with time-dependent exposure; MSCM, Marginal structural Cox model.

§

Includes the time-dependent confounder Lm as a covariate. In the presence of a time-dependent confounder, the time-dependent Cox model is not appropriate but the results are retained for comparison purposes.

#

Adjusts for m.

For the stabilized IPCWs, the numerator model adjusts for Am, while the denominator model adjusts for Am and m via Aalen’s additive regression.

*

Adjusts for lagged values of Am, the time-dependent confounder L⃗m, and lagged values of L⃗m. Note that, baseline covariates are not present in this setting.

@

For the stabilized IPCWs, the numerator model adjusts for Am, while the denominator model adjusts for Am, L⃗m and lagged values of L⃗m via Aalen’s additive regression.

The stabilized IPTW numerator model adjusts for time index and lagged values of Am, while the denominator model additionally adjusts for current and lagged values of Lm to predict future treatment status via pooled logistic models.

3.2.2 Results From Simulation-II

Results from simulation-II are reported in Table 5. The time-dependent Cox model with treatment status (Am), baseline covariate (L0) and time-dependent covariate (Lm) is fitted to validate the data generating permutation algorithm. The level of bias is negligible and the average coverage probability of the model-based nominal 95% CIs is 0.952. These results are considered as the ideal for comparison purposes for this simulation setting. When the sequential Cox approach is used in this simulation setting, we observe some bias. We apply MSCM with Lm treated as a time-dependent confounder, even though Lm is only a time-dependent covariate that is not affected by the past treatment. The corresponding bias is negligible and the average coverage probability of the model-based nominal 95% CIs is 0.952. As Lm is not a time-dependent confounder in this simulation, the similarity between the estimates obtained from MSCM and the time-dependent Cox model is not surprising. The properties of the modified sequential Cox approach are again comparable to MSCM with reasonable average coverage probability of the model-based nominal 95% CIs.

Table 5.

Comparison of the analytical approaches to adjust for time-dependent covariate from simulation-II (one baseline covariate, one time-dependent covariate and time-dependent treatment exposure) of 1, 000 datasets, each containing 2, 500 subjects followed for up to 10 time-intervals.

Approach Bias SD se Coverage Probability
TD-Cox§ 0.000 0.164 0.162 0.952
Sequential Cox#, 0.271 0.189 0.184 0.688
Modified Sequential Cox*, @ −0.022 0.231 0.234 0.961
MSCM±, −0.001 0.163 0.162 0.952

TD-Cox, Cox model with time-dependent exposure; MSCM, Marginal structural Cox model.

§

The baseline covariate L0 and time-dependent covariate Lm are included.

#

Adjusts for L0 and m.

In the stabilized IPCW model, the numerator model adjusts for Am and L0, while the denominator model adjusts for Am, L0 and m via Aalen’s additive model.

*

Adjusts for baseline covariates L0, lagged values of Am, the time-dependent confounder L⃗m, and lagged values of L⃗m.

@

For the stabilized IPCWs, the numerator model adjusts for Am and baseline variable L0, while the denominator model adjusts for L0, Am, L⃗m and lagged values of L⃗m via Aalen’s additive regression.

±

Adjusts for only L0.

For the stabilized IPTWs, the numerator model adjusts for the time index, L0 and lagged values of Am, while the denominator model additionally adjusts for current and lagged values of Lm to predict future treatment status via pooled logistic models.

3.3 Marginal versus Conditional Interpretations

We need to take into account the different interpretations of the target quantities (ψ̃1, ψ1 and ψ1 from equations (1), (4) and (6) respectively) being estimated by the three approaches under consideration. The clinical context should dictate whether the target parameter should bear a marginal or conditional interpretation. Conditional interpretations are generally useful in deciding personalized drug choices, whereas marginal interpretations may be more useful in making generalized policy decisions for a heterogeneous group of patients15.

The modified sequential Cox approach and its proposed modification emulate a sequence of conditionally randomized treatment assignments. This is done by first reorganizing the observed data and then stratifying the combined data based on the month of treatment initiation, conditioning on the pre-treatment covariate values. The estimated ψ1 from a modified sequential Cox approach (as well as estimated ψ`1 from a sequential Cox approach), therefore, bears a conditional interpretation9,13, as does the estimated log-hazard ratio ψ̃1 from a time-dependent Cox model23,24.

In contrast, MSCM estimates the log-hazard ratio ψ1 between two counterfactual scenarios: all subjects are treated at a given time versus none of the same subjects are treated at that time. The target quantity of interest ψ1 estimated from MSCM is the causal effect of the treatment. Assuming the MSCM assumptions hold, this quantity should match the estimated treatment effect from a randomized clinical trial25,26. In that sense, the MSCM approach mimics randomized clinical trial data setting by appropriately weighting observational data. As the corresponding outcome or hazard model does not condition on any time-dependent covariates that affect future treatment, the log-hazard ratio estimated from a MSCM is a marginal or population-averaged quantity27. It is possible to extend the MSCM by incorporating baseline covariates in the hazard model28. When we use stabilized weights in a MSCM, the treatment effect is marginal with respect to the time-dependent confounders, but conditional with respect the baseline covariates11,29,30.

Comparing the MSCM target parameter estimate with any conditional treatment effect estimate (from time-dependent Cox model approach: which is considered as the standard for comparison in simulation - II) is not straightforward when noncollapsible measures, such as HR or OR, are employed3034. Establishing the equivalence of the target parameters (log-hazard ratios ψ̃1, ψ1 and ψ1 from equations (1), (4) and (6) respectively) from the approaches under consideration may not be easy. However, the difference between the conditional and marginal parameters is expected to be negligible when the event rate in the time-intervals under consideration is small14,33. In our simulations, the event rate was 1% in each month-interval. In this scenario, noncollapsibility of the HRs should not have any noticeable impact on the findings. We performed the following numerical experiment to support this proposition14,15. We generated a very large cohort (n = 100, 000) under the simulation - II settings. As expected, the time-dependent Cox model and MSCM yield almost identical values of the treatment effect estimates (log-hazard ratio of ψ^1=0.5084 versus ψ̂1 = 0.5091). This further provides some evidence that non-collapsibility of the HR does not affect our simulation - II findings.

Similarly, we performed another Monte Carlo experiment for the simulation – I setting with N = 100 larger cohorts (n = 100, 000) to investigate whether ψ1 and ψ1 quantities estimated via the modified sequential Cox and MSCM respectively (which is considered as the standard for comparison in simulation - I) differed systematically in the settings we investigated. Both approaches produce very similar values of the treatment effect estimates on average (see Table 6). For this setting, this shows that the target parameters for these two approaches are not materially different. To check the adequacy of the sample size n = 2, 500 chosen in our original simulation – I, this study is also repeated for other cohort sizes n = 5, 000, 10, 000 and 50, 000 (see Table 6). The results look very similar, and expectedly the SDs are decreasing with increment of sample sizes.

Table 6.

Series of Monte Carlo studies for the simulation – I setting, each simulation with larger cohorts.

Approach Bias (SD) from N = 100 cohorts

Cohort size (n) n = 5,000 n = 10,000 n = 50,000 n = 100,000

MSCM 0.013 (0.150) 0.007 (0.107) 0.011 (0.049) 0.003 (0.033)
Modified Sequential Cox 0.011 (0.157) 0.003 (0.100) 0.004 (0.048) −0.001 (0.037)

3.4 When More Events are Available

The trends in the bias from the more frequent event condition (λ0 = 0.1 in a monthly time-scale) are similar compared to those in the rare event condition (see Web-Tables H.1–H.2). As expected, the standard errors are much less than in the corresponding analyses when failure rates are rare. Bias is slightly lower in some cases. One noticeable difference is observed in simulation setting - I: in the presence of the time-dependent confounder, when the failure rate is more frequent, the bias of the time-dependent Cox and MSCM approaches is reduced to minimal levels, whereas considerable bias is still apparent with the sequential Cox approach. On the other hand, the average coverage probability of the model-based nominal 95% CIs from the time-dependent Cox approach is smaller than that of MSCM. The modified sequential Cox approach estimates are still associated with good statistical properties.

4 Application in Multiple Sclerosis

We apply these methodologies to the British Columbia (BC) MS cohort data (1995–2008)11. The dataset was used in previous studies11,12,3538 to estimate the effect of β-IFN on time to irreversible disability outcomes. As before, irreversible progression of disability is measured by sustained expanded disability status scale (EDSS) 6 which is confirmed after at least 150 days, with all subsequent EDSS scores being 6 or greater. Web-Appendix §G describes the baseline characteristics, eligibility and exclusion criteria of the MS cohort.

Potential baseline confounders L0 include age, sex, disease duration and EDSS score. Also, we consider the cumulative number of relapses in the previous 2 years (hereafter called ‘cumulative relapses’) as a time-dependent confounder Lm11. Once the subjects initiate β-IFN, we assume they continue taking the drug without any discontinuation until they develop the outcome or are censored, as is assumed in our simulations and previous pharmacoepidemiologic studies9,13,39. As found in the previous study11 using this cohort, we consider the MSCM estimates to be ideal in this time-dependent confounding context. Results are reported in Table 7.

Table 7.

Summary of the estimated parameters from the multiple sclerosis (MS) patients’ data from British Columbia, Canada (1995–2008).

Approach
HR^
se(HR^)
95% CI Weights
Average (SD) range
TD-Cox§ 1.29 0.23 0.91 – 1.82
Sequential Cox#, 1.23 0.32 0.74 – 2.07 1.00 ( 0.01 ) 0.74 – 1.63
Modified Sequential Cox*, @ 1.36 0.26 0.93 – 1.99 1.00 ( 0.01 ) 0.92 – 1.24
MSCM±, 1.31 0.23 0.92 – 1.84 1.00 ( 0.06 ) 0.37 – 1.60

TD-Cox, Cox model with time-dependent exposure.

§

Adjusts for baseline covariates L0 (sex, EDSS score, age and disease duration), and for the time-dependent confounder Lm ‘cumulative relapses’.

#

Adjusts for L0, Am and m.

The stabilized IPCW numerator model adjusts for Am and L0, while the denominator model additionally adjusts for Lm and lagged values of Lm via Aalen’s additive model.

*

Adjusts for baseline covariates L0, lagged values of Am, the time-dependent confounder L⃗m and lagged values of L⃗m.

@

For the stabilized IPCWs, the numerator model adjusts for Am and baseline variable L0, while the denominator model adjusts for L0, Am, L⃗m and lagged values of L⃗m via Aalen’s additive regression.

±

Adjusts for the potential baseline confounders L0.

The stabilized IPTW numerator model adjusts for a restricted cubic spline of the follow-up time-index, baseline confounders L0 and lagged values of Am to predict future treatment status. The denominator model additionally adjusts for the current and lagged values of cumulative relapses (Lm) via the pooled logistic models.

The IPCWs in the modified sequential Cox approach are less variable than the IPWs in MSCM. IPCWs are estimated separately for each mini-trial9. When they are estimated from the aggregated dataset instead8, or when IPCWs were estimated via pooled logistic regression models2,5,39, the HR estimates are very similar (see Web-Appendix §I). No matter how they are constructed, the IPCWs from the mini-trials are well-behaved, i.e., the averages are close to one and they have low variability (most are within the range of 0.9 to 1.1 and the distributions are unimodal and symmetric; see Web-Figures I.1 and I.2).

5 Discussion

In observational studies, the estimation of a treatment effect is challenging in the absence of randomization. In longitudinal studies, additional complexity arises in the presence of time-dependent confounding. MSCMs are popularly used to deal with this problem in the survival analysis setting. MSCMs handle time-dependent confounding by re-weighting the data in such a way that the confounding effect of the time-dependent confounder is removed. Then adjusting for the baseline confounders (but not the time-dependent confounder) in the re-weighted pseudo-population is adequate to obtain the counterfactual or causal effect of the treatment under the identifiability conditions8,28. Sometimes, the MSCM estimates may be unstable due to use of IPTWs and an alternate analysis or view of the data may be helpful.

The sequential Cox approach was proposed as an alternative method to the MSCMs for estimation of the treatment effect from complex observational data settings where the treatment is time-dependent and censoring may be non-random9. This approach restructures the data in such a way that a sequence of subsets of data (mini-trials) are created based on intervals of treatment initiation. Aggregation of all the mini-trial data produces the pseudo-population. In this pseudo-population, subjects initiating treatment at each interval are compared to those who do not initiate treatment, conditional on covariates at respective intervals as well as baseline covariates. Although IPTWs are avoided in the sequential Cox approach, IPCWs are still required. These weights are less variable and more stable than IPTWs7,9 and appropriately handle the artificial censoring at later treatment start dates. We proposed a modified version of this approach that deal with analyses differently than the original proposal.

The treatment effect that is estimated from a MSCM is a marginal estimate as it is obtained by averaging over subjects with different hazards. This estimate does not condition on the time-dependent confounder. Rather the time-dependent confounder plays a role in creating weights for the MSCM model fitting. These weights are used to create the pseudo-data which is free from time-dependent confounding and mimics a clinical trial situation. Similarly, the sequential Cox approach contrasts subjects within mini-trials, where each mini-trial includes subjects who did and did not initiate the treatment. Controlling for current values of the time-dependent covariates as well as baseline confounders should make the treated subjects conditionally exchangeable with control subjects at the time of treatment initiation. Within each mini-trial, treatment assignment could be considered as random among the comparable subjects.

While the MSCM approach provides a marginal estimate of the treatment effect, the sequential Cox approach and its modified version provide conditional estimates. Both approaches are equipped to adjust for baseline confounders. Although the mechanisms and interpretations behind the sequential Cox approach and MSCM are different, both claim to achieve the same goal of estimating the causal effect of treatment in the presence of time-dependent confounders. Generally marginal and conditional estimates may not be directly comparable due to non-collapsibility9,15. By examining whether these quantities differed systematically in the settings investigated, we showed that the use of non-collapsible measure has not been an issue in the specific simulation settings we considered.

To the best of our knowledge, ours is the first study to use simulation studies to investigate the characteristics of a sequential Cox approach that has been suggested as being suitable in the context of time-dependent confounding. The first simulation setting (simulation-I) deals with the situation where the time-dependent covariate is affected by the prior treatment (i.e., is a time-dependent confounder). When a time-dependent confounder is present, MSCM is known to be an appropriate method and hence results from this method are used as the standard for comparison in this simulation setting. In this simulation process, we generate data such that the time-dependent confounder dictates the treatment assignment in the following periods. Among the subjects selected for a mini-trial based on those initiating treatment or at risk in a given period, only current (and lagged) values of the time-dependent covariates are used as adjustments in the sequential Cox approach. We proposed a modified version of sequential Cox approach, based on controlling for the time-dependent variable (confounder or covariate) values after the treatment initiation. We investigated via simulation whether such adjustments are sufficient. In the second setting (simulation-II), we have a baseline covariate and a time-dependent covariate. As the time-dependent Cox model is appropriate for simulation setting II, we use these results as the standard for comparison.

Previously, the sequential Cox approach was shown to work very well in comparison to the time-dependent Cox approach in the absence of any time-varying covariate or confounder10. However, we do not find the sequential Cox approach to be as effective as MSCM when a time-dependent confounder is present (simulation-I) or even when a time-dependent covariate which is not a time-dependent confounder is present (simulation-II). The sequential Cox approach does not seem to remove the effects of time-dependent confounding adequately, especially when the event rate is small. Based on our simulation findings, when we need to consider time-dependent confounders in order to adequately model a disease process, we recommend the use of the MSCM approach or the modified sequential Cox approach, as both are capable of producing estimates of the treatment effect that are close to the MSCM target parameter ψ1.

We apply the methods under consideration to estimate the effect of β-IFN on disease progression. The modified sequential Cox approach produces effect estimate similar to MSCM. A sensitivity analysis of the sequential Cox approach without using IPC weights yielded very similar results, implying little impact of artificial censoring due to later treatment initiation.

The focus for the sequential Cox approach and its modified version is on recreating the covariate process at each treatment start using the mini-trial approach7. Such focused and detailed scrutiny could yield insights about the data which may be hard to extract using a MSCM approach. For example, the data associated with a given mini-trial can be extracted and separated quite easily from the combined mini-trial data (pseudo-population), it is straightforward to compare the effects of early versus late treatment initiation. It is also possible to estimate the treatment effect for patients with a specific level of a time-dependent covariate at treatment initiation. Variance estimation is a challenge in the sequential Cox and similar methods15. To account for possible multiple entry of the same control subjects in different mini-trials, we used a robust (sandwich) estimator. In our simulation studies, the average standard errors are slightly lower than the empirical standard deviations in most cases for the sequential Cox approach. However, for the modified version, these estimates are very close. For more accurate estimate of the standard errors, bootstrap or jackknife estimates could be used9,13. Unlike our simulation settings, if the time-dependent covariates are very strongly affected by the prior treatment, further adjustments1619 may be necessary. Using the same simulation scheme used in this study, future studies could assess the adequacy of the sequential Cox approaches and other similar methods17,27,40,41 under such extreme setting.

Similar to other simulation studies, we investigated a few possible scenarios. However, the assumptions underlying our data simulation are consistent with patterns typical in observational survival studies where associated covariates are measured regularly. For more complex disease scenarios where an investigator may wish to assess different treatment strategies (i.e., switching between therapies) over the course of time, our assumption of no discontinuations or interruptions in the treatment is restrictive and may not be suitable8. Further simulation studies are required to assess the effect on the precision of the estimates when varying the sample size of the simulated data as well as the number of simulated datasets generated from the algorithms considered in this study42. Future research could focus on analytical derivation of the effect estimates from the sequential Cox approach and its modified version in an effort to theoretically justify the proposed analysis roadmap.

Supplementary Material

Acknowledgments

Funding Information

This work was supported by a studentship from the Multiple sclerosis (MS) Society of Canada (M.E.K.), by a post-doctoral fellowship from the Canadian Network for Observational Drug Effect Studies (CNODES) (M.E.K.) and by a grant from the National Multiple Sclerosis Society (RG 4202-A-2, PI: H.T.) and the Canadian Institutes of Health Research (MOP-93646, PI: H.T.). J.P. holds research grants from the Canadian Institutes of Health Research, the National MS Society, and the Natural Sciences and Engineering Research Council of Canada. P.G. is supported by the Natural Sciences and Engineering Research Council of Canada. R.P. is supported in part by a National Scholar (Chercheur-national) of the Fonds de Récherche du Québec-Santé (FQR-S) and is a member of the Research Institute of the McGill University Health Centre, which is supported by core funds from FQR-S. H.T. is supported by the Canada Research Chair Program and currently receives research support from the National Multiple Sclerosis Society, the Canadian Institutes of Health Research, the Multiple Sclerosis Society of Canada and the Multiple Sclerosis Scientific Research Foundation. In addition, in the last five years H.T. has received research support from the Multiple Sclerosis Society of Canada (Don Paty Career Development Award); the Michael Smith Foundation for Health Research (Scholar Award) and the UK MS Trust. Y.Z. receives research funding from the Canadian Institutes of Health Research, the MS Society of Canada and the National MS Society. A.S. was funded through a Postdoctoral Fellowship from the MS Society of Canada, and grants from the Canadian Institutes of Health Research (MOP-93646; PI=H.T.) and the National MS Society (RG 4202-A-2; PI=H.T.). E.K. was supported by Postdoctoral Fellowships from the Michael Smith Foundation for Health Research and the MS Society of Canada. C.E. was funded through grants from the Canadian Institutes of Health Research (MOP-93646; PI=H.T.) and the National MS Society (RG 4202-A-2; PI=H.T.), and the Michael Smith Foundation for Health Research. M.L.v.d.K. is supported by a Canadian Institutes of Health Research Doctoral Award - Doctoral Foreign Study Award (October 2012), offered in partnership with the Canadian Institutes of Health Research (CIHR) Strategy for Patient-Oriented Research and the CIHR HIV/AIDS Research Initiative. J.O. receives support from the Christopher Foundation and the University of British Columbia (UBC). Over the past 5 years Dr Oger has received speaker honoraria, consulting fees, travel grants and/or research grants and/or educational grants of less than $5, 000 each from Bayer, Biogen-Idec, BioMS, Glaxo-Smith-Klein, Novartis, Merck-Serono, Talecris and Teva-neurosciences. Dr Oger has received fees for services from Bayer and from Biogen Idec to serve on advisory committees.

Thank you’s

We gratefully acknowledge the British Columbia Multiple sclerosis Clinic neurologists who contributed to the study through patient examination and data collection (current members listed here by primary clinic): UBC MS Clinic: Dr. A. Traboulsee, MD, FRCPC (UBC Hospital MS Clinic Director and Head of the UBC MS Programs); Dr. A-L. Sayao, MD, FRCPC; Dr. V. Devonshire, MD, FRCPC; Dr. S. Hashimoto, MD, FRCPC (UBC and Victoria MS Clinics); Dr. J. Hooge, MD, FRCPC (UBC and Prince George MS Clinics); Dr. L. Kastrukoff, MD, FRCPC (UBC and Prince George MS Clinics); Dr. J. Oger, MD, FRCPC. Kelowna MS Clinic: Dr. D. Adams, MD, FRCPC; D. Craig, MD, FRCPC; Dr. S. Meckling, MD, FRCPC. Prince George MS Clinic: Dr. L. Daly, MD, FRCPC. Victoria MS Clinic: Dr. O. Hrebicek, MD, FRCPC; Dr. D. Parton, MD, FRCPC; Dr. K Pope, MD, FRCPC. The views expressed in this paper do not necessarily reflect the views of each individual acknowledged. None have received compensation for their role in the study.

Footnotes

Conflict of Interest

M.E.K. is supported by a CNODES post-doctoral award and has had travel and accommodation costs covered from the endMS Research and Training Network (2011, 2012) to present at conferences, and from Pacific Institute for the Mathematical Sciences (2013) to attend a workshop. Over the past three years, J.P. has received consulting fees and/or fees for service on Data Safety Monitoring Boards from EMD Serono, Myelin Research Foundation and Novartis. P.G. has received consulting fees from Biogen to serve on an advisory board. H.T has received speaker honoraria and/or travel expenses to attend conferences from the Consortium of MS Centres (2013), the National MS Society (2012, 2014, 2016), Teva Pharmaceuticals (2011), ECTRIMS (2011, 2012, 2013, 2014, 2015, 2016), UK MS Trust (2011), the Chesapeake Health Education Program, US Veterans Affairs (2012), Novartis Canada (2012), Biogen Idec (2014), American Academy of Neurology (2013, 2014, 2015, 2016). All speaker honoraria are either declined or donated to an MS charity or to an unrestricted grant for use by her research group. Y.Z. declares no conflicts of interest. A.S. has received travel grants to present at and attend conferences from the endMS Research and Training Network (2010, 2011), ECTRIMS (2010, 2011), and the Consortium of MS Centres (2012). E.K. has had travel and accommodation costs covered to present at and attend conferences from the endMS Research and Training Network (2008, 2011), the International Society for Pharmacoepidemiology (2010, 2013) and Bayer Schering Pharma (2010). C.E. has received travel grants to present and attend conferences from the endMS Research and Training Network (2011), and the European Committee for Treatment and Research in MS (2011). M.L.v.d.K. declares no conflicts of interest. Over the past five years, J.O. has received speaker honoraria, consulting fees, travel grants and/or research grants and/or educational grants from Aventis, Bayer, Biogen-Idec, BioMS, Corixa, Genentech, Novartis, Serono, Shering, Talecris and Teva-neurosciences. J.O. receives fees for services from Bayer, from Novartis and from Biogen Idec to serve on advisory committees.

References

  • 1.Sylvestre MP, Abrahamowicz M. Comparison of algorithms to generate event times conditional on time-dependent covariates. Statistics in Medicine. 2008;27(14):2618–2634. doi: 10.1002/sim.3092. [DOI] [PubMed] [Google Scholar]
  • 2.Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology. 2000;11(5):561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]
  • 3.Fewell Z, Hernán MA, Wolfe F, Tilling K, Choi H, Sterne JA. Controlling for time-dependent confounding using marginal structural models. The Stata Journal. 2004;4(4):402–420. [Google Scholar]
  • 4.Robins JM. Latent variable modeling and applications to causality. Springer; 1997. Causal inference from complex longitudinal data; pp. 69–117. [Google Scholar]
  • 5.Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
  • 6.Austin PC, Platt RW. Survivor treatment bias, treatment selection bias, and propensity scores in observational research. Journal of Clinical Epidemiology. 2010;63(2):136–138. doi: 10.1016/j.jclinepi.2009.05.009. [DOI] [PubMed] [Google Scholar]
  • 7.Aalen OO. Armitage lecture 2010: understanding treatment effects: the value of integrating longitudinal data and survival analysis. Statistics in Medicine. 2012;31(18):1903–1917. doi: 10.1002/sim.5324. [DOI] [PubMed] [Google Scholar]
  • 8.Lange T, Rod NH. Handbook of Survival Analysis. CRC Press; 2013. Causal models; pp. 135–151. [Google Scholar]
  • 9.Gran JM, Røysland K, Wolbers M, Didelez V, Sterne JAC, Ledergerber B, Furrer H, von Wyl V, Aalen OO. A sequential Cox approach for estimating the causal effect of treatment in the presence of time-dependent confounding applied to data from the Swiss HIV Cohort Study. Statistics in Medicine. 2010;29(26):2757–2768. doi: 10.1002/sim.4048. [DOI] [PubMed] [Google Scholar]
  • 10.Karim ME, Gustafson P, Petkau J, Tremlett H BeAMS Group. Comparison of statistical approaches dealing with immortal time bias in drug effectiveness studies. American Journal of Epidemiology. 2016:kwv445. doi: 10.1093/aje/kwv445. First published online: July 25, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Karim ME, Gustafson P, Petkau J, Zhao Y, Shirani A, Kingwell E, Evans C, van der Kop M, Oger J, Tremlett H. Marginal structural Cox models for estimating the association between β-Interferon exposure and disease progression in a multiple sclerosis cohort. American Journal of Epidemiology. 2014;180(2):160–171. doi: 10.1093/aje/kwu125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Shirani A, Zhao Y, Karim ME, Evans C, Kingwell E, van der Kop M, Oger J, Gustafson P, Petkau J, Tremlett H. Association between use of interferon beta and progression of disability in patients with relapsing-remitting multiple sclerosis. Journal of American Medical Association. 2012;308(3):247–256. doi: 10.1001/jama.2012.7625. [DOI] [PubMed] [Google Scholar]
  • 13.Røysland K, Gran JM, Ledergerber B, Wyl V, Young J, Aalen OO. Analyzing direct and indirect effects of treatment using dynamic path analysis applied to data from the swiss hiv cohort study. Statistics in Medicine. 2011;30(24):2947–2958. doi: 10.1002/sim.4324. [DOI] [PubMed] [Google Scholar]
  • 14.Xiao Y, Abrahamowicz M, Moodie EEM. Accuracy of conventional and marginal structural Cox model estimators: A simulation study. The International Journal of Biostatistics. 2010;6(2):1–28. doi: 10.2202/1557-4679.1208. [DOI] [PubMed] [Google Scholar]
  • 15.Taylor JMG, Shen J, Kennedy EH, Wang L, Schaubel DE. Comparison of methods for estimating the effect of salvage therapy in prostate cancer when treatment is given by indication. Statistics in Medicine. 2014;33(2):257–274. doi: 10.1002/sim.5890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hernán MA, Alonso A, Logan R, Grodstein F, Michels KB, Stampfer MJ, Willett WC, Manson JE, Robins JM. Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology. 2008;19(6):766–779. doi: 10.1097/EDE.0b013e3181875e61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Schaubel DE, Wolfe RA, Port FK. A sequential stratification method for estimating the effect of a time-dependent experimental treatment in observational studies. Biometrics. 2006;62(3):910–917. doi: 10.1111/j.1541-0420.2006.00527.x. [DOI] [PubMed] [Google Scholar]
  • 18.Schaubel DE, Wolfe RA, Sima CS, Merion RM. Estimating the effect of a time-dependent treatment by levels of an internal time-dependent covariate: application to the contrast between liver wait-list and posttransplant mortality. Journal of the American Statistical Association. 2009;104(485):49–59. [Google Scholar]
  • 19.Lu B. Propensity score matching with time-dependent covariates. Biometrics. 2005;61(3):721–728. doi: 10.1111/j.1541-0420.2005.00356.x. [DOI] [PubMed] [Google Scholar]
  • 20.Young JG, Hernán MA, Picciotto S, Robins JM. Relation between three classes of structural models for the effect of a time-varying exposure on survival. Lifetime Data Analysis. 2010;16(1):71–84. doi: 10.1007/s10985-009-9135-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Abrahamowicz M, Mackenzie T, Esdaile JM. Time-dependent hazard ratio: modeling and hypothesis testing with application in lupus nephritis. Journal of the American Statistical Association. 1996;91(436):1432–1439. [Google Scholar]
  • 22.Moodie EEM, Stephens DA, Klein MB. A marginal structural model for multiple-outcome survival data: assessing the impact of injection drug use on several causes of death in the Canadian Co-infection Cohort. Statistics in Medicine. 2014;33(8):1409–1425. doi: 10.1002/sim.6043. [DOI] [PubMed] [Google Scholar]
  • 23.Cook NR, Cole SR, Hennekens CH. Use of a marginal structural model to determine the effect of aspirin on cardiovascular mortality in the Physicians’ Health Study. American Journal of Epidemiology. 2002;155(11):1045–1053. doi: 10.1093/aje/155.11.1045. [DOI] [PubMed] [Google Scholar]
  • 24.Gerhard T, Delaney JAC, Cooper-DeHoff RM, Shuster J, Brumback BA, Johnson JA, Pepine CJ, Winterstein AG. Comparing marginal structural models to standard methods for estimating treatment effects of antihypertensive combination therapy. BMC medical research methodology. 2012;12(1):1. doi: 10.1186/1471-2288-12-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Robins JM. Marginal structural models versus structural nested models as tools for causal inference. Statistical Models in Epidemiology, the Environment and Clinical Trials. 1999;116:95–134. [Google Scholar]
  • 26.Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. Journal of the American Statistical Association. 2001;96(454):440–448. [Google Scholar]
  • 27.Kennedy EH, Taylor JMG, Schaubel DE, Williams S. The effect of salvage therapy on survival in a longitudinal study with treatment by indication. Statistics in Medicine. 2010;29(25):2569–2580. doi: 10.1002/sim.4017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. American Journal of Epidemiology. 2008;168(6):656–664. doi: 10.1093/aje/kwn164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Westreich D, Cole SR, Tien PC, Chmiel JS, Kingsley L, Funk MJ, Anastos K, Jacobson LP. Time scale and adjusted survival curves for marginal structural Cox models. American Journal of Epidemiology. 2010;171(6):691–700. doi: 10.1093/aje/kwp418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kaufman JS. Marginalia: comparing adjusted effect measures. Epidemiology. 2010;21(4):490–493. doi: 10.1097/EDE.0b013e3181e00730. [DOI] [PubMed] [Google Scholar]
  • 31.Godin O, Elbejjani M, Kaufman JS. Body mass index, blood pressure, and risk of depression in the elderly: a marginal structural model. American journal of epidemiology. 2012:kws003. doi: 10.1093/aje/kws003. [DOI] [PubMed] [Google Scholar]
  • 32.Pang M, Kaufman JS, Platt RW. Studying noncollapsibility of the odds ratio with marginal structural and logistic regression models. Statistical Methods in Medical Research. 2013 doi: 10.1177/0962280213505804. 0962280213505804. [DOI] [PubMed] [Google Scholar]
  • 33.Xiao Y, Moodie EEM, Abrahamowicz M. Comparison of approaches to weight truncation for marginal structural Cox models. Epidemiologic Methods. 2012;2(1):1–20. [Google Scholar]
  • 34.Shrier I, Pang M. Confounding, effect modification, and the odds ratio: common misinterpretations. Journal of clinical epidemiology. 2015;68(4):470. doi: 10.1016/j.jclinepi.2014.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Shirani A, Zhao Y, Karim ME, Evans C, Kingwell E, van der Kop M, Oger J, Gustafson P, Petkau J, Tremlett H. Investigation of heterogeneity in the association between interferon beta and disability progression in multiple sclerosis: an observational study. European Journal of Neurology. 2014;21(6):835–844. doi: 10.1111/ene.12324. [DOI] [PubMed] [Google Scholar]
  • 36.Karim ME. PhD Thesis. University of British Columbia; 2015. Causal inference approaches for dealing with time-dependent confounding in longitudinal studies, with applications to multiple sclerosis research. [Google Scholar]
  • 37.Zhang T, Shirani A, Zhao Y, Karim ME, Gustafson P, Petkau J, Evans C, Kingwell E, van der Kop M, Zhu F, Oger J, Tremlett H. Beta-interferon exposure and onset of secondary progressive multiple sclerosis. European Journal of Neurology. 2015;22(6):990–1000. doi: 10.1111/ene.12698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Shirani A, Zhao Y, Petkau J, Gustafson P, Karim ME, Evans C, Kingwell E, van der Kop M, Oger J, Tremlett H. Multiple sclerosis in older adults: the clinical profile and impact of interferon beta treatment. BioMed Research International. 2015;2015(451912):1–11. doi: 10.1155/2015/451912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Sterne JAC, Hernán MA, Ledergerber B, Tilling K, Weber R, Sendi P, Rickenbach M, Robins JM, Egger M. Long-term effectiveness of potent antiretroviral therapy in preventing AIDS and death: a prospective cohort study. The Lancet. 2005;366(9483):378–384. doi: 10.1016/S0140-6736(05)67022-5. [DOI] [PubMed] [Google Scholar]
  • 40.Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics. 2007;34(1):70–85. [Google Scholar]
  • 41.Toh S, Hernán MA. Causal inference from longitudinal studies with baseline randomization. The International Journal of Biostatistics. 2008;4(1):1–30. doi: 10.2202/1557-4679.1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Burton A, Altman DG, Royston P, Holder RL. The design of simulation studies in medical statistics. Statistics in Medicine. 2006;25(24):4279–4292. doi: 10.1002/sim.2673. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

RESOURCES