Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Apr 1.
Published in final edited form as: Stat Med. 2021 Feb 10;40(9):2177–2196. doi: 10.1002/sim.8896

Estimation and modeling of the restricted mean time lost in the presence of competing risks

Sarah C Conner 1, Ludovic Trinquart 1
PMCID: PMC8889377  NIHMSID: NIHMS1778366  PMID: 33567477

Abstract

Survival data with competing or semi-competing risks are common in observational studies. As an alternative to cause-specific and subdistribution hazard ratios, the between-group difference in cause-specific restricted mean times lost (RMTL) gives the mean difference in life expectancy lost to a specific cause of death or in disease-free time lost, in the case of a nonfatal outcome, over a pre-specified period. To adjust for covariates, we introduce an inverse probability weighted estimator and its variance for the marginal difference in RMTL. We also introduce an inverse probability of censoring weighted regression model for the RMTL. In simulation studies, we examined the finite sample performance of the proposed methods under proportional and nonproportional subdistribution hazards scenarios. We illustrated both methods with competing risks data from the Framingham Heart Study. We estimated sex differences in atrial fibrillation (AF)-free times lost over 40 years. We also estimated sex differences in mean lifetime lost to cardiovascular disease (CVD) and non-CVD death over 10 years among individuals with AF.

Keywords: competing risks, cumulative incidence, restricted mean survival, restricted mean time lost, survival analysis

1 |. BACKGROUND

The difference in restricted mean survival times (RMST) has recently become of great interest as an alternative measure of association to the hazard ratio in time-to-event analyses.1,2 Marginal between-group differences in RMST up to a given timepoint τ can be estimated nonparametrically by integration of the survival function or, in the presence of covariates, of the inverse probability weighted (IPW) survival function.3 Moreover, regression models have been introduced to estimate the difference in RMST conditional on covariates.46

Competing risks time-to-event data arise commonly in biomedical research, when a nonterminal event may be of interest but its observation is subject to a terminal event. Or, time to death from a specific disease may be of interest, but individuals can die from other causes. In this context, between-group comparisons for a specific event have been studied using the cumulative incidence, cause-specific hazard, or subdistribution hazard.79

In the presence of competing risks, the difference in RMST is inadequate. Andersen10 and Zhao et al11 proposed instead to measure the difference in restricted mean times lost (RMTL). With a single cause of failure, RMTL is equal to τ minus RMST. With competing risks, the RMTL is the area under the cause-specific cumulative incidence function. As a consequence, the difference in RMTL can be interpreted as the increase or decrease in lost life expectancy due to a specific cause of death or, in the case of a nonfatal outcome, the mean difference in disease-free time, up to time τ.

Methods for inference on the difference in RMTL with competing risks data and adjustment for covariates are scarce. Andersen described a generalized linear model for the cause-specific RMTL using the pseudo-observation technique.10 In this article, we propose an IPW-adjusted estimator of the difference in cause-specific RMTL. We also present a regression model for the RMTL conditional on covariates estimated with inverse probability of censoring weighting (IPCW). We provide R code to apply our methods with working examples and to replicate our simulation study on GitHub at https://github.com/s-conner/rmtl.

In Section 2, we discuss estimation of the difference in cause-specific RMTL, without adjustment for covariates. In Section 3, we present estimation of the marginal RMTL adjusted for covariates with IPW and its variance. In Section 4, we describe an IPCW regression model for RMTL conditional on covariates. In Section 5, we present a simulation study to assess the IPW estimator’s finite sample performance. In Section 6, we apply the proposed methods to incident atrial fibrillation (AF) and cause-specific mortality after AF with data from the Framingham Heart Study.

2 |. DIFFERENCE IN CAUSE-SPECIFIC RMTL

Without loss of generality, we focus on a terminal or nonterminal event of interest ϵ = 1, whose occurrence is subject to a competing event ϵ = 2. Our notations extend to scenarios with k competing types of event. Let T be the time to event, C be the censoring time, X = TC be the observed time to event, and δ = I(T ≤ C) be an indicator of noncensoring. We assume that C is independent of T. We observe (Xi, δiϵi) for individuals i =1, …, n.

The cumulative incidence function is defined as F1(t)=P(Tt,ϵ=1)=0tS(u)α1(u)du where S(t)=exp(0tA(u)du) is the overall survival function, αϵ(u) is the cause-specific hazard function for event ϵ, and A(u)=0uα1(t)+α2(t)dt is the cumulative all-cause hazard function. Let R = Tτ be the restricted time to event. We define the cause-specific RMTL as the area under the cause-specific cumulative incidence function up to timepoint τ. In Appendix 1, we prove that μ1(τ)=E{(τR)I(ϵ=1)}=0τF1(t)dt. We estimate the RMTL by integrating the Aalen-Johansen estimator of the cumulative incidence function

μ^1(τ)=0τF^1(t)dt. (1)

We approximate Equation (1) with Riemann sums, μ^1(τ)=tj<τ(tj+1tj)F^1(tj) where tj : t1, …, tD are the ordered, unique event times prior to τ and tD+1 = τ. In Appendix 2, we show the variance of the RMTL can be estimated by

va^r{μ^1(τ)}=tj<τ(tj+1tj)2va^r{F^1(tj)}+2tj<τtk<tj(tj+1tj)(tk+1tk)co^v{F^1(tj),F^1(tk)}. (2)

With two groups defined by A =1 and A =0, the difference in cause-specific RMTL is given by v=μ^1(τA=1)μ^1(τA=0), with variance vâr(v)=vâr{μ^1(τA=1)}+vâr{μ^1(τA=0)}.

With a single group, the timepoint τ should be less than or equal to the greatest observed time, τ ≤ max(Xi), and may be chosen clinically or empirically, for example, the 95th percentile of observed times or the maximum observed time, max(Xi), which approximates infτ{P(Cτ) = 0}.12 When taking the difference in RMTL between groups, τ should be less than or equal to the minimum of the largest event time across groups, τ ≤ min{max(Xi|Ai = 1), max(Xi|Ai = 0)}. Tian et al12 demonstrated max(Xi) may be an inappropriate choice for τ if nearby observed times are sparse, as evidenced by a flat tail in the cumulative incidence curve, and should be chosen with caution.

3 |. ADJUSTED DIFFERENCE IN CAUSE-SPECIFIC RMTL

In observational studies, we must adjust for covariates. Here, we demonstrate how to estimate the between-group difference in cause-specific RMTL adjusted for covariates. We extend Equation (1) by considering the area under the cumulative incidence function adjusted for covariates with IPW.

We again consider two groups defined by A = 1 and A = 0. Let Z denote a n × p matrix of covariates and pia = P(Ai = a|Zi). So pi1 is the propensity score for exposure. The inverse probability weight is defined as wi=I(Ai=1)pi1+I(Ai=0)1pi1. Then, the IPW Aalen-Johansen estimator of the cumulative incidence function in exposure group A = a is given by

F˜1a(t)=tjtθ˜a(tj),

where tj : t1, …, tD are the ordered, unique event times prior to τ in group a, θ˜a(tj)=S˜a(tj1)ΔN˜1a(tj)Y˜a(tj), N˜ka(t)=i:A=awiI(Xit,ϵi=k)δi is the weighted number of events of cause k in group a, ΔN˜ka(t)=N˜ka(t)N˜ka(t) is the increment in weighted events in group a, Y˜a(t)=i:A=awiI(Xit) is the weighted number at risk in group a, and S˜a(t)=tjt(1ΔN˜1a(tj)+ΔN˜2a(tj)Y˜a(tj)) is the weighted overall survival function for being alive and event-free in group a.1315 We define the estimated variance and covariance of the IPW Aalen-Johansen estimator as

va^r{F˜1a(t)}=tjtva^r{θ˜a(tj)}+2tj<ttj<tktco^v{θ˜a(tj),θ˜a(tk)}

and

co^v{F˜1a(t),F˜1a(u)}=va^r{F˜1a(t)}+tjtt<tkuco^v{θ˜a(tj),θ˜a(tk)}fort<u,

where

va^r{θ˜a(tj)}={θ˜a(tj)}2{Y˜a(tj)ΔN˜1(tj)Ma(tj)ΔN˜1(tj)+tl<tjΔN˜a(tl)Ma(tl)(Y˜a(tl)ΔN˜a(tl))}

and, for tj < tk,

co^v{θ˜a(tj),θ˜a(tk)}=θ˜a(tj)θ˜a(tk){1Ma(tj)+tl<tjΔN˜a(tl)Ma(tl)(Y˜a(tl)ΔN˜a(tl))},

where Ma(tj)=Y˜a(j)2i:Ti>tj,A=awi2 is the effective sample size, and assuming 1/Ma(tj)p0.

We estimate the RMTL in exposure group a by integrating the IPW Aalen-Johansen estimator of the cumulative incidence function

μ˜1a(τ)=0τF˜1a(t)dt. (3)

We approximate Equation (3) as μ˜1a(τ)=tj<τ(tj+1tj)F˜1a(tj). We estimate the variance of the IPW estimator of the RMTL in exposure group a by

va^r{μ˜1a(τ)}=tj<τ(tj+1tj)2va^r{F˜1a(tj)}+2tj<τtk<tj(tj+1tj)(tk+1tk)co^v{F˜1a(tj),F˜1a(tk)}. (4)

Then, the adjusted difference in RMTL between exposure groups A =1 and A =0 is

v˜=μ˜1,1(τ)μ˜1,0(τ)

with variance

var^(v˜)=var^{μ˜1,1(τ)}+var^{μ˜1,0(τ)}

assuming the weights are known. In practice, we estimate pia when deriving the weights wi used in the quantities introduced above.

4 |. INVERSE PROBABILITY OF CENSORING WEIGHTED GENERALIZED LINEAR MODEL FOR THE RMTL

We describe an IPCW regression model to assess associations between covariates and the cause-specific RMTL.6 Let δ˜=I(RC) and Z* = (1, A, Z) denote the n × (p + 2) matrix of predictors allowing an intercept term. We can model the RMTL for event ϵ = 1 with a generalized linear model,

g{μ1(τZi*)}=g[E{(τRi)I(ϵi=1)Zi*}]=Zi*β

with link function g(·). Model coefficients β represent differences in cause-specific RMTL when using the identity link, and log ratios of cause-specific RMTL when using the log link. Tian et al6 also suggested a logit-like link function, g{μ1(τ)}=log[μ1(τ)/{τμ1(τ)}], which restricts the predicted RMTL to (0, τ).

Coefficients are estimated with the IPCW technique by solving the weighted estimating equation,

U(β)=i=1nδ˜iG^(Ri)Zi*{(τRi)I(ϵi=1)g1(Zi*β)}=0, (5)

where G^(t) is the Kaplan-Meier estimator of the noncensoring distribution G(t). The outcome for each uncensored individual is either τRi or 0 due to the event type indicator, I(ϵi = 1); essentially, individuals with a competing event do not lose any time to event ϵ = 1. The noncensoring indicator δ˜i equals 0 for censored observations and 1 for uncensored observations; thus, censored observations have a weight of 0 and do not contribute to the estimating equation. However, the censoring distribution G^(t) is estimated from the sample of all individuals. In Appendix 3, we show Equation (5) is based on an unbiased estimating equation. In Appendix 4, we show the estimated coefficients β^ are asymptotically normally distributed using counting process theory, similar to Tian et al6 and Bang and Tsiatis.16,17 Then, the estimated covariance matrix that accounts for the variability in estimating G^(t) can be obtained from the sandwich estimator var^(β^)=A^1B^A^1 where

A^=1ni=1nZi*2ddβ^g1(Zi*β^),
B^=1ni=1nδ˜iG^(Ri)[Zi*2{(τRi)I(ϵi=1)g1(Zi*β^)}2]+0τ1ni=1n(δ˜iG^(Ri)[Zi*{(τRi)I(ϵi=1)g1(Zi*β^)K^(u)}]2I(Riu))dΛ^c(u)G^(u),

and

K^(u)=1S^(u)ni=1n[Zi*{(τRi)I(ϵi=1)g1(Zi*β^)}I(Riu)].

In Appendix 7, we demonstrate that the model coefficients are collapsible when g(·) is the identity or log link. In other words, adjusting for a covariate that is independent of exposure does not affect the model coefficient of the exposure.

5 |. SIMULATION STUDY

We conducted Monte Carlo simulation studies to assess the performance of the IPW method in estimating the marginal difference in RMTL between exposure groups in the presence of covariates, and the IPCW regression method in estimating the difference in RMTL conditional on covariates. We generated data with proportional and nonproportional subdistribution hazards (crossing cumulative incidence functions).8,18,19 We estimated the mean bias, mean relative bias, root mean squared error, relative standard error (SE), and empirical coverage rate of both methods. We define the relative SE as the ratio of the mean estimated SE and the Monte Carlo empirical SE.

5.1 |. IPW adjusted difference in RMTL

5.1.1 |. Data generation

We first generated five independent covariates Z = (Z1, Z2, Z3, Z4, Z5), each N(0,1). Then we generated the binary exposure, A, from a Bernoulli distribution with p = expit(Zα), where α = {α0, ln(0.7), ln(0.6), ln(0.5), ln(0.9), ln(0.8)}T and Z = (1, Z). We set the intercept α0 to allow 25% and 50% exposure.

To generate event times, we let the baseline subdistribution hazard function λ10(t) = γ exp(ρt) follow a Gompertz distribution. We then defined the subdistribution hazard function as λ1(t;Z˜)=λ10(t)exp{Z˜ψ1+Z˜ψ2t}, where Z˜=(A,Z), ψ1 = {ψ11, ln(0.9), ln(0.8), ln(0.7), ln(0.6), ln(0.5)}T, and ψ2 = (ψ21, 0, 0, 0, 0, 0)T, in which ψ11 is the main effect of A and ψ21 is the time-varying effect of A. We then defined the cumulative incidence function for event ϵ = 1 as F1(t;Z˜)=P(Tt,ϵ=1Z˜)=1exp[0tγexp{Z˜ψ1+Z˜sψ2+ρs}ds], and the cumulative incidence function for event ϵ = 2 is F2(t;Z˜)=P(Tt,ϵ=2Z˜)=exp{γexp(Z˜ψ1)Z˜ψ2+ρ}[1exp{texp(Z˜ψ3)}], where ψ3 = {ln(0.67), ln(0.5), ln(0.6), ln(0.7), ln(0.8), ln(0.9)}T is the vector of effects for Z˜ on the subdistribution hazard of event 2.

We assigned the event type by generating a Bernoulli variable with probability P(ϵ=1Z˜)=1exp{γexp(Z˜ψ1)Z˜ψ2+ρ} where Z˜ψ2+ρ<0, which is the limit of F1(t;Z˜) with respect to t. We generate the event times by simulating a random variable U ∼Uniform(0, 1) and determining the inverse of the conditional cumulative incidence function with respect to t. We generate event times from the respective conditional cumulative incidence functions: for event ϵ = 1, P(Ttϵ=1,Z˜)=(1exp[0tγexp{Z˜ψ1+Z˜sψ2+ρs}ds])/[1exp{γexp(Z˜ψ1)Z˜ψ2+ρ}]; and for event ϵ=2,P(Ttϵ=2,Z˜)=1exp{texp(Z˜ψ3)}.

We applied independent right-censoring by generating censoring times C ~Uniform(0, c), where the parameter c was chosen to yield mild and heavy right censoring (10% and 25% proportion of individuals). Then, we determined if the event time was censored, δ = I(T ≤ C), and the final observed event type, δ · ϵ.

5.1.2 |. Scenarios and true values

We considered both proportional, (γ, ρ, ψ11, ψ21) = (1.2,−1.5,−0.3,0), and nonproportional, (γ, ρ, ψ11, ψ21) = (1.5,−1.5,0.5,−2) subdistribution hazards settings. In total, there were 16 unique scenarios, with varying censoring (10%, 25%), proportion of exposure (25%, 50%), pattern of cumulative incidence functions (proportional vs nonproportional subdistribution hazards), and sample size (500, 1000). We generated 6000 datasets for each scenario.

We determined the true marginal difference in RMTL at τ = 365 days by generating Z for a sample of size n = 1,000,000.20 We first set everyone to be exposed (A = 1), and generated the event type and corresponding event time as described in 5.1.1. Then, we repeated the process where everyone was unexposed (A = 0), yielding two event sets of outcomes per person (event type and time). The true marginal difference in RMTL for event ϵ = 1 at τ = 1 year is the unadjusted difference in RMTL for event ϵ = 1 when A = 1 and A = 0; it was equal to −22.63 days for the proportional subdistribution hazards setting and 8.03 days for the nonproportional subdistribution hazards setting. The true cumulative incidence functions are depicted in Figure 1.

FIGURE 1.

FIGURE 1

True cumulative incidence functions in simulation study of inverse probability weighted estimator

To apply the IPW method, we first derived the weights by estimating the probability of exposure with a logistic regression model, logit{P(A = 1 Z)} = Zα. We then estimated the IPW-adjusted RMTL in each exposure group, A = 1 and A = 0, and the adjusted difference between groups.

5.1.3 |. Results

Overall, the IPW-estimator demonstrated little bias in all settings (Table 1). The bias was slightly greater in the presence of nonproportional subdistribution hazards in scenarios with smaller sample size (n = 500). The mean squared error was smaller in scenarios with larger sample size (n = 1000) and equal exposure groups (50% exposed). The coverage proportion of 95% confidence intervals (CIs) was greater than the nominal value, indicating that the CIs tend to be conservative. The coverage proportion was sometimes slightly better in scenarios with balanced exposure groups (50% exposed) compared with settings with unbalanced exposure groups (25% exposed). The relative SE was greater than 1 in all scenarios, which indicates that the estimated SE was greater than the Monte Carlo empirical SE and likely explains the observed overcoverage.

TABLE 1.

Performance of IPW estimator of the difference in RMTL in simulation studies, τ = 365 days

Proportional subdistribution hazards, ν = −22.63 days at τ = 365 days
Sample size Exposed Censoring Bias, days Rel bias RMSE, days Rel SE Coverage
500 25% 10% −0.026 0.001 14.269 1.262 0.984
25% 0.231 −0.010 15.046 1.224 0.982
50% 10% 0.118 −0.005 12.568 1.173 0.976
25% 0.197 −0.009 13.179 1.146 0.974
1000 25% 10% 0.134 −0.006 10.153 1.248 0.984
25% 0.424 −0.019 10.455 1.237 0.981
50% 10% −0.319 0.014 8.949 1.158 0.977
25% −0.182 0.008 9.267 1.146 0.974
Nonproportional subdistribution hazards, ν = 8.03 days at τ = 365 days
Sample size Exposed Censoring Bias, days Rel bias RMSE, days Rel SE Coverage
500 25% 10% 0.721 0.090 16.702 1.215 0.980
25% 0.759 0.095 17.433 1.196 0.978
50% 10% 0.716 0.089 13.291 1.206 0.981
25% 0.616 0.077 14.163 1.163 0.980
1000 25% 10% 0.344 0.043 11.679 1.219 0.982
25% 0.264 0.033 12.070 1.213 0.980
50% 10% 0.359 0.045 9.495 1.186 0.979
25% 0.379 0.047 9.872 1.174 0.977

Note: To assess the IPW method, we generated competing risks data dependent on a binary covariate A and covariates Z. We obtained ν, the true adjusted difference in RMTL between A = 1 and A = 0 at τ = 365 days with a counterfactual approach in a sample of n = 1,000,000.

Abbreviations: IPW, inverse probability weighted; Rel bias, mean bias relative to true parameter; Rel SE, mean estimated standard error/Monte Carlo empirical error; RMSE, root of mean squared error; RMTL, restricted mean time lost.

5.2 |. Inverse probability of censoring weighted model for the RMTL

5.2.1 |. Data generation

To assess the IPCW model, we generated data according two binary covariates, Z = (Z1, Z2). We generated each binary covariate from independent Bernoulli distributions. We let the subdistribution hazard function follow a Gompertz distribution, λ1(t|Z) = γz exp(ρzt), where (γzz) were set according to the 4 strata of (Z1, Z2).

We defined the cumulative incidence function for event 1 as F1(tZ)=P(Tt,ϵ=1Z)=1exp{0tλ1(sZ)ds}, and the cumulative incidence function for event 2 as F2(tZ)=P(Tt,ϵ=2Z)=exp(γz/ρz){1exp(t)}. We assigned the event type by generating a Bernoulli variable with probability P(ϵ=1Z)=1exp(γz/ρz) where ρz < 0. We again generated times using the inversion method, with conditional cumulative incidence functions P(Ttϵ=1,Z)=[1exp{0tλ1(sZ)ds}]/[1exp{γzexp(γz/ρz)}] and P(Tt|ϵ = 2,Z) = 1 − exp(−t). We generated right censoring and determined the final observed event type as described in Section 5.1.1.

5.2.2 |. Scenarios and true values

Similar to 5.1.2, we included 16 unique scenarios with varying censoring (10%, 25%), proportion of exposure (both P(Z1 =1) and P(Z2 =1) equal to 25% or 50%), pattern of cumulative incidence functions (proportional vs nonproportional subdistribution hazards), and sample size (500, 1000). We generated 6000 datasets for each scenario.

We define (β1, β2) as the true adjusted differences in RMTL at τ = 1 year for covariates Z = (Z1, Z2). First we let μ1(τZ)=0τF1(tZ)dt. We set Gompertz parameters (ρ, γ) that yielded RMTLs for cause 1 of {μ1(τ|Z1 = 1, Z2 = 1), μ1(τ|Z1 = 1, Z2 =|0), μ1(τ|Z1 = 0, Z2 = 1), μ1(τ|Z1 = 0, Z2 = 0)} = (0.3,0.4,0.45,0.55) years. Then, the effect of| Z1 and Z2 on the RMTL for cause 1 is additive such that (β12) = (−0.15,−0.1) years, or (−54.75,−36.5) days. The Gompertz parameters are provided in Table A1, and the true cumulative incidence functions are depicted in Figure 2.

FIGURE 2.

FIGURE 2

True cumulative incidence functions in simulation study of inverse probability of censoring weighting regression model

We estimated IPCW regression models g{μ1(τ|Z)} = β0 + β1Z1 + β2Z2 with an identity link. We then evaluated the performance of (β^1,β^2) to estimate (β12).

5.2.3 |. Results

The IPCW model demonstrated very little bias in all settings (Table 2). The mean squared error decreased as sample size increased, censoring decreased, and when the proportion exposed was balanced compared with imbalanced. The coverage proportion of 95% CIs ranged from 93% to 95%, which indicates slight undercoverage. The relative SE was less than 1 in all scenarios, which indicates that the estimated SE was smaller than the Monte Carlo empirical SE and likely explains the observed undercoverage.

TABLE 2.

Performance of IPCW model of the difference in RMTL in simulation studies, τ = 365 days

β1 = −54.75 days
β2 = −36.5 days
PSH Sample size Exposed Censoring Bias, days Rel bias RMSE, days Rel SE Cov Bias, days Rel bias RMSE, days Rel SE Cov
Yes 500 50% 10% 0.076 −0.001 13.241 0.956 0.938 0.049 −0.001 13.251 0.956 0.938
25% 0.135 −0.002 14.340 0.955 0.939 0.088 −0.002 14.282 0.955 0.942
25% 10% 0.307 −0.006 15.096 0.977 0.942 0.116 −0.003 15.332 0.977 0.936
25% 0.035 −0.001 16.661 0.957 0.935 0.481 −0.013 17.037 0.957 0.930
1000 50% 10% −0.039 0.001 9.251 0.968 0.943 −0.055 0.002 9.106 0.968 0.945
25% −0.149 0.003 10.132 0.957 0.937 0.097 −0.003 10.095 0.957 0.940
25% 10% −0.254 0.005 10.772 0.969 0.942 0.068 −0.002 10.852 0.969 0.933
25% 0.006 0.000 11.811 0.954 0.937 −0.086 0.002 11.781 0.954 0.938
No 500 50% 10% −0.204 0.004 13.411 0.963 0.942 −0.133 0.004 13.327 0.963 0.943
25% 0.097 −0.002 14.458 0.967 0.943 −0.341 0.009 14.375 0.967 0.943
25% 10% 0.077 −0.001 15.833 0.963 0.941 −0.276 0.008 15.311 0.963 0.941
25% −0.152 0.003 17.372 0.958 0.938 0.175 −0.005 17.405 0.958 0.938
1000 50% 10% 0.189 −0.003 9.304 0.982 0.942 −0.087 0.002 9.386 0.982 0.944
25% −0.163 0.003 10.366 0.954 0.938 −0.134 0.004 10.363 0.954 0.937
25% 10% −0.056 0.001 11.058 0.974 0.944 0.016 0.000 10.818 0.974 0.941
25% 0.034 −0.001 12.289 0.957 0.937 −0.190 0.005 12.066 0.957 0.937

Note: To assess the IPCW model, we generated competing risks data dependent on two binary covariates (Z1, Z2). We set the true unadjusted difference in RMTL at τ = 365 days, (β1,β2)={μ1,1z2(τ)μ1,0z2(τ),(μ1,z11(τ)μ1,z10(τ)}.

Abbreviations: IPCW, inverse probability of censoring weighted; PSH, proportional subdistribution hazards; Rel SE, mean estimated standard error/Monte Carlo empirical error; RMSE, root of mean squared error; RMTL, restricted mean time lost.

6 |. ILLUSTRATIVE EXAMPLES

We present illustrative examples on AF in the Framingham Heart Study.2123 We first evaluated sex differences in incident AF. We also evaluated sex differences in cause-specific mortality after AF. In both examples, we apply the IPW and IPCW methods. We compared results with and without accounting for competing risks. The IPW method estimates the marginal RMTL in each sex and the difference in RMTL between sexes, adjusted for covariates, while the IPCW model estimates differences in RMTLs conditional on covariates. We also estimated subdistribution hazard ratios with Fine-Gray models. To protect the confidentiality of the Framingham Heart Study participants, the data from our illustrative examples are not publicly available. Participant level data from the Framingham Heart Study are available at the database of Genotypes and Phenotypes (https://www.ncbi.nlm.nih.gov/gap/) and BioLINCC (https://biolincc.nhlbi.nih.gov/home/).

6.1 |. Sex differences in incident AF

When assessing incident AF, individuals may die before developing AF; thus, death is considered a competing risk. Up to timepoint τ, the RMTL for AF is the mean AF-free time lost, or the mean time diagnosed with AF.

We included 6016 participants who were free of AF at index age 55 years and attended an examination between ages 50 and 60 years. Risk factors were measured at the examination closest to age 55 years. Participants were followed until AF or atrial flutter, death, loss to follow-up, the last visit or medical contact. We set τ = 40 years. We adjusted for age, height, weight, current smoking status, systolic and diastolic blood pressure, hypertensive medication, current diabetes status, prior heart failure, and prior myocardial infarction.22 To address highly influential observations, leading to the instability of the IPW estimator, we replaced weights above the 99th percentile of the distribution by the 99th percentile.24

In Figure 3, we show the IPW cumulative incidence curves and the complement of the IPW Kaplan-Meier curves (1-KM). In Table A2 (Appendix 5), we show the IPW-adjusted RMTL in males and females, separately.

FIGURE 3.

FIGURE 3

Inverse probability weighted cumulative incidence curves for incident atrial fibrillation in the Framingham Heart Study, with and without accounting for competing risks. CIF, cumulative incidence function; IPW, inverse probability weighted; KM, Kaplan-Meier. The Kaplan-Meier method does not account for competing risks, while the cumulative incidence function does

With the IPW method, we found evidence of a marginal difference between sexes in RMTL for AF over 40 years (Table 3). Men lost an additional 1.4 (95% CI: 0.5, 2.4) years of AF-free time as compared with women. With the IPCW model, there was no evidence of difference in the conditional difference in RMTL. On average females lost 0.4 (−0.92, 1.78) additional years of AF-free time as compared with males. When interpreting the results, it is important to note the two methods estimate different quantities. The IPW method estimates the population difference in RMTL between sexes marginalized over covariates, or the difference of the population being male or female.25 By contrast, the IPCW regression model estimates the individual difference in RMTL between sexes conditional on covariates, or the difference in an individual being male or female.25 The adjusted subdistribution hazard ratio for AF was 1.2 (1.0, 1.5), also indicating males have an increased incidence of AF compared with females. As for the competing event, death without AF, males lost more AF-free life expectancy than females(IPW: 2.3 [1.2, 3.3], IPCW: 1.2 [0, 2.5]). The adjusted subdistribution hazard ratio for death without AF was 1.5 (1.3, 1.8).

TABLE 3.

Adjusted differences in RMTL between males and females in 40-year incident atrial fibrillation

RMTL difference, years
Estimators accounting for competing risks AF Death without AF
IPW 1.41 (0.45, 2.38) 2.26 (1.21, 3.30)
IPCW regression model −0.43 (−1.78, 0.92) 1.24 (0.00, 2.48)
RMTL difference, years
Estimators not accounting for competing risks AF Death without AF
IPW 2.83 (1.64, 4.01) 2.93 (1.81, 4.04)
IPCW regression model −1.23 (−3.10, 0.64) 1.97 (0.57, 3.37)

Note: Data are differences in cause-specific RMTL (95% confidence intervals) for men vs women. The IPW method directly yields marginal differences in RMTL adjusting for covariates, while the IPCW regression models yield differences in RMTL conditional on covariates.

Abbreviations: AF, atrial fibrillation; IPW, inverse probability weighted; IPCW, inverse probability of censoring weighted; RMTL, restricted mean time lost.

When ignoring competing risks, both the IPW and IPCW methods provided estimates larger in magnitude for both events (Table 3). In addition, both methods provided larger estimates of the RMTL in each sex, particularly the IPCW method (Appendix 6, Table A2).

6.2 |. Sex differences in cause-specific mortality after AF

We included 1483 individuals with incident AF who attended an exam within 10 years of diagnosis with nonmissing covariates. We categorized death causes into cardiovascular disease (CVD), non-CVD, and unknown causes of death. We assessed sex differences in CVD and non-CVD mortality, while unknown cause of death was considered an additional competing risk. Individuals entered the analysis at AF diagnosis, and were followed until death or loss to follow-up. We estimated differences in cause-specific RMTL between men and women, adjusted for age at AF and risk factors measured at the closest examination prior to AF: body mass index, smoking status, systolic and diastolic blood pressure, hypertensive medication, and diabetes status. We also adjusted for prior heart failure and prior myocardial infarction, assessed at time of AF onset. We set τ = 10 years.

In Figure 4, we show the IPW cumulative incidence curves and the complement of the IPW Kaplan-Meier curves (1-KM). In Table A3 (Appendix 6), we show the IPW-adjusted RMTLs for CVD death and CVD death in males and females, separately.

FIGURE 4.

FIGURE 4

Inverse probability weighted cumulative incidence curves for cause-specific mortality among individuals with AF in the Framingham Heart Study, with and without accounting for competing risks. CIF, cumulative incidence function; IPW, inverse probability weighted; KM, Kaplan-Meier. The Kaplan-Meier method does not account for competing risks, while the cumulative incidence function does

We did not find evidence of differences between sexes in 10-year cause-specific mortality (Table 4). When accounting for competing risks, both methods gave similar estimates for the difference in RMTL. The IPCW method provided slightly larger CIs. On average, males had greater lost life expectancy to CVD (IPW: 2.2 [−1.9, 6.4] months) and non-CVD causes (IPW: 4.0 [−0.5, 8.5] months) as compared with females. The adjusted subdistribution hazard ratios were consistent in direction with the differences in RMTLs (CVD-death: 1.2 [1.0, 1.5], non-CVD death: 1.3 [1.1, 1.5]).

TABLE 4.

Adjusted differences in RMTL between males and females in 10-year cause-specific mortality after atrial fibrillation

RMTL difference, years
Estimators accounting for competing risks CVD death Non-CVD death
IPW 0.18 (−0.16, 0.53) 0.33 (−0.04, 0.71)
IPCW regression model 0.19 (−0.28, 0.66) 0.37 (−0.08, 0.81)
RMTL difference, years
Estimators not accounting for competing risks CVD death Non-CVD death
IPW 0.28 (−0.10, 0.67) 0.43 (0.02, 0.84)
IPCW regression model 0.34 (−0.02, 0.70) 0.52 (0.12, 0.92)

Note: Data are differences in cause-specific RMTL (95% confidence intervals) for men vs women. The IPW methods yield marginal differences in RMTL adjusting for covariates, while the IPCW regression models yield differences in RMTL conditional on covariates.

Abbreviations: CVD, cardiovascular disease; IPW, inverse probability weighted; IPCW, inverse probability of censoring weighted; RMTL, restricted mean time lost.

When ignoring competing risks, both methods provided slightly larger estimates of the difference in RMTL for both CVD and non-CVD causes. However, the change in estimates when accounting or not accounting for competing risks was much smaller than the previous example. Again, both methods also provided larger estimates of the RMTLs in each sex (Appendix 6, Table A3).

7 |. DISCUSSION

In summary, we introduced methods to estimate and model the difference in RMTL in the presence of competing risks. We derived an IPW adjusted estimator of the difference in RMTL between exposure groups and its variance, and a generalized linear model for the RMTL conditional on covariates estimated with IPCW. In our illustrative examples, we demonstrate how ignoring competing risks leads to exaggerated estimates.

Ignoring competing events will lead to biased inference. The complement of the Kaplan-Meier curve overestimates the cumulative incidence of an event and similarly, estimators of the RMTL must also account for the competing risk to avoid bias. In our illustrative examples, the estimated RMTLs and their differences were considerably larger when ignoring competing risks. The IPCW method was particularly prone to giving larger estimates when the competing risk was ignored. This may be due to the twofold bias of the IPCW model: if competing events are considered censored, the outcome is estimated only based on the event of interest rather than taking a value of 0 for competing events, and the censoring distribution G(t) used to weight individuals is actually the distribution of a composite endpoint of censoring and the competing event.

In analyses of competing risks data, subdistribution and cause-specific hazard ratios are frequently used to measure associations.26 However, these measures can be difficult to interpret. First, they rely on proportionality over time, which often does not hold. Second, the subdistribution hazard for a given cause is a function of all cause-specific hazards, which allows the possibility that an exposure can indirectly influence the subdistribution hazard ratio through a competing event.26 Moreover, the cause-specific hazard ratio does not have a one-to-one relationship with the cumulative incidence function. This point also applies to RMTL-based measures of association, as a covariate can affect the cumulative incidence curve through the competing event. This can present difficulty in interpretation if the competing events are not also analyzed. To address this, we recommended reporting RMTL-based measures for both the event of interest and the competing event. Third, the subdistribution and cause-specific hazard ratios are not collapsible.27,28 However, the IPCW regression model for the cause-specific RMTL yields collapsible coefficients with the identity or log link.

In a randomized experiment, Weir et al2 demonstrated how clinicians may reach different conclusions when hazard ratios or differences in RMST are reported. Similarly, reporting the difference in RMTL as a complement to subdistribution and cause-specific hazard ratios may give additional insight to the data and offer a clinically meaningful interpretation. Unlike either hazard ratio, the unadjusted, and IPW adjusted differences in RMTL directly give absolute measures in each group, which provides crucial background information.29 Furthermore, the difference in RMTL is not subject to any proportionality assumptions. The assumption of proportional hazards is complex in the competing risk setting, as subdistribution hazards and cause-specific hazards cannot be simultaneously proportional and both must be explicitly assessed.30

Other methods for the RMTL exist. In previous work, Calkins et al31 estimated differences in RMST adjusted with IPW, and the bootstrap for CIs. Andersen10 developed pseudo-observation model for cause-specific life years lost. Kipourou et al32 also used the pseudo-observation technique to model life-years lost to various causes when cause of death is unreliable or missing. Sabathé et al33 used the pseudo-observation approach to account for interval censoring in an illness-death model framework. Pan and Gastwirth34 modeled the restricted mean job duration, or “lifetime,” using Cox models for cause-specific event processes.

There are several limitations to our methods. First, if the IPW estimator weights are in fact estimated by estimating pia, the proposed variance of the RMTL does not account for variability in the estimated weights. In our simulation study, the coverage proportion of the IPW method was greater than 95%. Estimation of the weights can be accounted for using the bootstrap method.31 Second, when predicting the RMTL for a covariate profile using the IPCW model, one must be careful not to extrapolate outside the window of (0, τ), which may occur when using the identity or log link functions.10 By contrast, the logit-like link function, suggested by Tian et al, restricts the predicted RMTL to (0, τ).6 Third, both of our proposed methods assume the event time and censoring are independent, which may not always hold. The IPCW framework has been extended to accommodate dependent censoring in other settings.35 The IPCW model also exhibits a trade-off between estimation of the censoring distribution G(t) and the number of uncensored observations. Ideally, one would have a large sample with sufficient censoring. One could improve estimation of the censoring distribution and overall model by including an augmentation term in the estimating equation.36 Finally, we only considered covariates at baseline but it is possible that covariates are measured repeatedly over time. One could use g-computation and g-estimation to estimate the difference in RMTL while accounting for time-varying confounding.37,38

ACKNOWLEDGEMENTS

SCC received funding from the National Heart, Lung, and Blood Institute (NHLBI) (F31 HL145904-01). LT received funding from the American Heart Association (18SFRN34150007). The Framingham Heart Study is supported by NHLBI (N01-HC25195, HHSN268201500001I; 75N92019D00031) and Boston University School of Medicine. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors thank Katia Bulekova for her support with Boston University’s Shared Computing Cluster, the Framingham Heart Study participants for their commitment, Dr. Sangbum Choi for communication about the IPW Aalen-Johansen estimator, and two anonymous reviewers for their insightful comments.

APPENDIX A

A.1. Derivation of RMTL

Here, we show that E [(τR) · I(ϵ = 1)] is the area under the cause-specific cumulative incidence function, or cause-specific RMTL. While we cannot take the expectation of the subdistribution failure time, T · I(ϵ = 1), we can find the expectation of the random variable R · I(ϵ = 1),

E[RI(ϵ=1)]=0τ1F1(t)dt=τ0τF1(t)dt,

which is the time alive and free of cause 1, but still includes the expected time lost to cause 2. To isolate the time lost due to cause 1, we take the expectation of (τR) · I(ϵ = 1),

E[(τR)I(ϵ=1)]=E[(τR)I(ϵ=1)Tτ]P(Tτ)+E[(τR)I(ϵ=1)T>τ]P(T>τ)=E[(τT)I(ϵ=1)Tτ]P(Tτ).

Since P(τTt,ϵ = 1) = 1 − F1(τt) and f1,τt(t)=ddt(1F1(τt))=f1(τt),

E[(τR)I(ϵ=1)]=0τ(τT)P(Tτ)f1(τt)dtP(Tτ)=0ττf1(τt)dt0τTf1(τt)dt.

By integration by parts, it follows that

E[(τR)I(ϵ=1)]=τ(F1(0)F1(τ))(τF1(0)τF1(τ)0τF1(t)dt)=0τF1(t)dt.

This relationship can be extended to the cause-specific RMTL conditional on covariates Z, which we leverage in Appendix 3.

A.2. Variance of difference in RMTL

Let Nk(t)=i=1nI(Xit,ϵi=k)δi be the number of events of cause k by time t, N(t)= N1(t)+ N2(t) is the number of total events by time t, ΔNk(t) be the increment Nk(t)− Nk(t−), and Y(t)=i=1nI(Xit) be the number at risk just before t. Let S^(t) be the Kaplan Meier estimator of being alive and event-free at time t, S^(t)=tjt(1ΔN(tj)Y(tj)), and θ^(tj)=S^(tj1)ΔN1(tj)Y(tj). The cumulative incidence function is then estimated by the Aalen-Johansen estimator F^1(t)=tjtθ^(tj), where tj are the unique event times.39 The variance and covariance are estimated using Greenwood’s formula,40,41

var^{F^1(t)}=tjtvar^(θ^(tj))+2tj<ttj<tktcov^(θ^(tj),θ^(tk))

and

cov^{F^1(t),F^1(u)}=var^{F^1(t)}+tjtt<tkucov^{θ^(tj),θ^(tk)}fort<u,

where

var^{θ^(tj)}={θ^(tj)}2{Y(tj)ΔN1(tj)ΔN1(tj)Y(tj)+tl<tjΔN(tl)Y(tl)(Y(tl)ΔN(tl))}

and

cov^{θ^(tj),θ^(tk)}=θ^(tj)θ^(tk){1Y(tj)+tl<tjΔN(tl)Y(tl)(Y(tl)ΔN(tl))}

for tj < tk. Then, the variance of the cause-specific RMTL is

var^{μ^1(τ)}=tj<τ(tj+1tj)2var^{F^1(tj)}+2tj<τtk<tj(tj+1tj)(tk+1tk)cov^{F^1(tj),F^1(tk)},

which is Equation (2).

A.3. Unbiased IPCW estimating equation

Here, we show that the IPCW estimating equation (3) is unbiased. In the generalized linear model, we assume E{(τR)I(ϵ=1)Z*}=g1(βZ*). Following the conditional expectation arguments of Bang and Tsiatis16,17 and Scheike et al,42 we have

E(i=1nδ˜iG(Ri)Zi*{(τRi)I(ϵi=1)g1(βZi*)}Zi*)=nZ*E(E[δ˜G(R){(τR)I(ϵ=1)g1(βZ*)}R,Z*,ϵ]Z*)=nZ*E(E[δ˜R,Z*,ϵ]G(R){(τR)I(ϵ=1)g1(βZ*)}Z*)=nZ*E((τR)I(ϵ=1)g1(βZ*)Z*)=0.

It follows that

E(E[i=1nδ˜iG(Ri)Zi*{(τRi)I(ϵi=1)g1(Zi*β)}Zi*])=0.

Since G(t) is unknown, we propose estimating G(t) by the Kaplan-Meier estimator G^(t) and solving the following equation to estimate β,

i=1nδ˜iG^(Ri)Zi*{(τRi)I(ϵi=1)g1(Zi*β)}=0.

A.4. Asymptotic variance of IPCW estimating equation

Following Tian et al6 and Bang and Tsiatis,16,17 we leverage martingale identities

δiG(Xi)=10dMic(u)G(u),
G^(t)G(t)=10tG^(u)G(u)dMc(u)Y(u),

and

Y(u)/n=G^(u)S^(u).

The estimating equation evaluated at the truth β0 can be written as asymptotically linear,

n1/2U(β0)=n1/2i=1n{δ˜iG^(Ri)+δ˜iG(Ri)δ˜iG(Ri)}Zi*{(τRi)I(ϵi=1)g1(Zi*β)}=n1/2i=1n[δ˜iG(Ri)+δ˜iG^(Ri){G(Ri)G^(Ri)G(Ri)}]Zi*{(τRi)I(ϵi=1)g1(Zi*β)}=n1/2i=1n[10τdMic(u)G(u)+δ˜iG^(Ri){0RiG^(u)dMc(u)G(u)Y(u)}]Zi*{(τRi)I(ϵi=1)g1(Zi*β)}=n1/2i=1nZi*{(τRi)I(ϵi=1)g1(Zi*β0)}n1/2i=1n0τ[Zi*{(τRi)I(ϵi=1)g1(Zi*β0)}K^(β0,u)]dMiC(u)G(u)=n1/2i=1nZi*{(τRi)I(ϵi=1)g1(Zi*β0)}n1/2i=1n0τ[Zi*{(τRi)I(ϵi=1)g1(Zi*β0)}K(β0,u)]dMiC(u)G(u)+op(1),

where

K^(β0,u)=1nS^(u)i=1nδ˜iG^(Ri)[Zi*{(τRi)I(ϵi=1)g1(Zi*β0)}I(Riu)]

and

K(β0,u)=1S(u)E[Zi*{(τRi)I(ϵi=1)g1(Zi*β0)}I(Riu)].

The first two terms are independent due to martingale properties, so

var(n1/2U(β0))=E[Zi*2{(τRi)I(ϵi=1)g1(β0Zi*)}2]+0τE[(Zi*{(τRi)I(εi=1)g1(β0Zi*)}K(β0,u))2I(Riu)]dΛc(u)G(u). (A1)

Then, var{n1/2(β^β0)}=A1BA1 where B is the variance of the estimating equation (A1) and

A=E[Zi*2ddβ0g1(Zi*β0)].

This covariance matrix can be estimated by A^1B^A^1 in which

A^=1ni=1nZi*2ddβ^g1(Zi*β^)

and

B^=1ni=1nδ˜iG^(Ri)[Zi*2((τRi)I(ϵi=1)g1(Zi*β^))2]+0τ1ni=1n(δ˜iG^(Ri)[Zi*{(τRi)I(ϵi=1)g1(Zi*β^)K^(β^,u)}]2I(Riu))dΛ^c(u)G^(u),

where Λ^c(t)=0tdNc(u)π(u)du, Nc(u)=iI(Riu,δ˜i=0), and π(u)=iI(Riu).

A.4. Parameters for data generation in simulation study of IPCW regression model

TABLE A1.

Gompertz parameters for data generation in simulation study of IPCW regression model

Proportional subdistribution hazards, τ = 365 days
ρ γ Z 1 Z 2 RMTL, years RMTL, days
−2.8 1.546588 1 1 0.30 109.5
−2.9 2.291237 1 0 0.40 146.0
−1.4 1.952647 0 1 0.45 164.25
−1.7 2.879494 0 0 0.55 200.75
Nonproportional subdistribution hazards, τ = 365 days
ρ γ Z 1 Z 2 RMTL, years RMTL, days
−1.7 1.213444 1 1 0.30 109.5
−1.7 1.770851 1 0 0.40 146.0
−1.7 2.096025 0 1 0.45 164.25
−1.7 2.879494 0 0 0.55 200.75

Note: To assess the IPCW model, we generated competing risks data dependent on two binary covariates (Z1, Z2). We set the true conditional difference in RMTL at τ = 365 days, (β1, β2) = {μ1,1z2(τ) − μ1,0z2(τ), (μ1,z11(τ) − μ1,z10(τ)}.

Abbreviations: IPCW, inverse probability of censoring weighting; RMTL, restricted mean time lost.

A.5. Adjusted RMTL by sex from illustrative examples

For the IPCW models, we predicted the RMTL for each sex with a least squares means approach by summing the estimated model coefficients at the mean value for continuous covariates and the sample proportion for binary covariates.

A.6. Collapsibility of IPCW regression model

Non collapsibility is a disadvantage of the subdistribution hazard ratio.28 In other words, adjusting for a covariate may affect the subdistribution hazard ratio even if the covariate is independent of exposure. By contrast, an advantage of the IPCW regression model for the cause-specific RMTL is that it is a generalized linear model, and therefore yields collapsible coefficients when g(·) is the identity or log link. We demonstrate that the identity and log links yield collapsible differences and ratios, respectively, in cause-specific RMTL. Assume A is a binary exposure, Z is a covariate, A and Z are independent, and the true model is

g{μ1(τA,Z)}=g[E{(τR)I(ϵ=1)A,Z}]=β0+βAA+βZZ. (A2)

TABLE A2.

Adjusted RMTLs for 40-year incident atrial fibrillation by sex

RMTL, years
Estimators accounting for competing risks AF Death without AF
Males IPW 6.74 (5.98, 7.49) 9.74 (8.90, 10.57)
IPCW regression model 6.64 (5.78,7.51) 8.95 (8.17, 9.74)
Females IPW 5.33 (4.73, 5.93) 7.48 (6.86, 8.10)
IPCW regression model 7.07 (6.37, 7.78) 7.71 (7.06, 8.37)
RMTL, years
Estimators not accounting for competing risks AF Death without AF
Males IPW 9.50 (8.55, 10.45) 12.07 (11.18, 12.97)
IPCW regression model 10.85 (9.87,11.82) 12.80 (12.00, 13.60)
Females IPW 6.67 (5.96, 7.39) 9.15 (8.48, 9.81)
IPCW regression model 12.08 (10.86, 13.29) 10.83 (10.03, 11.63)

Abbreviations: AF, atrial fibrillation; IPW, inverse probability weighted; IPCW, inverse probability of censoring weighted; RMTL, restricted mean time lost.

TABLE A3.

Adjusted RMTLs for 10-year cause-specific death after atrial fibrillation by sex

RMTL, years
Estimators accounting for competing risks CVD death Non-CVD death
Males IPW 1.62 (1.40,1.84) 2.13 (1.88, 2.37)
IPCW regression model 1.58 (1.28, 1.89) 2.13 (1.85, 2.41)
Females IPW 1.44 (1.17, 1.70) 1.79 (1.51, 2.08)
IPCW regression model 1.39 (1.05, 1.73) 1.76 (1.44, 2.09)
RMTL, years
Estimators not accounting for competing risks CVD death Non-CVD death
Males IPW 1.95 (1.70, 2.19) 2.47 (2.20, 2.73)
IPCW regression model 2.50 (2.28, 2.72) 3.01 (2.76, 3.25)
Females IPW 1.66 (1.37, 1.95) 2.04 (1.73,2.35)
IPCW regression model 2.16 (1.89, 2.43) 2.49 (2.20, 2.78)

Abbreviations: CVD, cardiovascular disease; IPW, inverse probability weighted; IPCW, inverse probability of censoring; RMTL, restricted mean time lost.

When g(·) is the identity link, βA is the conditional difference in RMTL between exposure groups A = 1 and A = 0. Then, the marginal difference in RMTL is

μ1(τA=1)μ1(τA=0)=EZ{μ1(τA=1,Z)}EZ{μ1(τA=0,Z)}=EZ(E{(τR)I(ϵ=1)A=1,Z})EZ(E{(τR)I(ϵ=1)A=0,Z})={β0+βA+βZE(Z)}{β0+βZE(Z)}=βA,

which is the conditional difference in RMTL from (A2). Thus, the RMTL regression model with an identity link yields collapsible differences in RMTL. When g(·) is the log link, eβA is the conditional ratio of RMTL of exposure groups A =1 vs A =0. Similarly, the marginal ratio in RMTL is

μ1(τA=1)μ1(τA=0)=EZ{μ1(τA=1,Z)}EZ{μ1(τA=0,Z)}=EZ[eE{(τR)I(ϵ=1)A=1,Z}]EZ[eE{(τR)I(ϵ=1)A=0,Z}]=EZ(eβ0+βA+βZZ)EZ(eβ0+βZZ)=eβ0+βAEZ(eβZZ)eβ0EZ(eβZZ)=eβA,

which is the conditional ratio. Thus, the RMTL regression model with a log link is also collapsible. These arguments apply to the RMST regression model as well. However, similar to other generalized linear models, other link functions such as the logit link are prone to noncollapsible coefficients.27,43

DATA AVAILABILITY STATEMENT

We provide R code to apply our methods with working examples and to replicate our simulation studies on GitHub at https://github.com/s-conner/rmtl. To protect the confidentiality of the Framingham Heart Study participants, the data from our illustrative examples are not on our GitHub page. Participant level data from the Framingham Heart Study are available at the database of Genotypes and Phenotypes (https://www.ncbi.nlm.nih.gov/gap/) and BioLINCC (https://biolincc.nhlbi.nih.gov/home/).

REFERENCES

  • 1.Trinquart L, Jacot J, Conner SC, Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol. 2016;34(15):1813–1819. [DOI] [PubMed] [Google Scholar]
  • 2.Weir I, Marshall G, Schneider J, et al. Interpretation of time-to-event outcomes in randomized trials: an online randomized experiment. Ann Oncol. 2019;30(1):96–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Conner SC, Sullivan LM, Benjamin EJ, La Valley MP, Galea S, Trinquart L. Adjusted restricted mean survival times in observational studies. Stat Med. 2019;38(20):3832–3860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Royston P, Parmar MK. Flexible parametric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects. Stat Med. 2002;21(15):2175–2197. [DOI] [PubMed] [Google Scholar]
  • 5.Andersen PK, Hansen MG, Klein JP. Regression analysis of restricted mean survival time based on pseudo-observations. Lifetime Data Anal. 2004;10(4):335–350. [DOI] [PubMed] [Google Scholar]
  • 6.Tian L, Zhao L, Wei L. Predicting the restricted mean event time with the subject’s baseline covariates in survival analysis. Biostatistics. 2014;15(2):222–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gray RJ. A class of K-sample tests for comparing the cumulative incidence of a competing risk. Ann Stat. 1988;16(3):1141–1154. [Google Scholar]
  • 8.Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999;94(446):496–509. [Google Scholar]
  • 9.Kulathinal S, Gasbarra D. Testing equality of cause-specific hazard rates corresponding to m competing risks among K groups. Lifetime Data Anal. 2002;8(2):147–161. [DOI] [PubMed] [Google Scholar]
  • 10.Andersen PK. Decomposition of number of life years lost according to causes of death. Stat Med. 2013;32(30):5278–5285. [DOI] [PubMed] [Google Scholar]
  • 11.Zhao L, Tian L, Claggett B, et al. Estimating treatment effect with clinical interpretation from a comparative clinical trial with an endpoint subject to competing risks. JAMA Cardiol. 2018;3(4):357–358. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Tian L, Jin H, Uno H, et al. On the empirical choice of the time window for restricted mean survival time. Biometrics. 2020;76(4):1157–1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Cole SR, Lau B, Eron JJ, et al. Estimation of the standardized risk difference and ratio in a competing risks framework: application to injection drug use and progression to AIDS after initiation of antiretroviral therapy. Am J Epidemiol. 2015;181(4):238–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Neumann A, Billionnet C. Covariate adjustment of cumulative incidence functions for competing risks data using inverse probability of treatment weighting. Comput Methods Prog Biomed. 2016;129:63–70. [DOI] [PubMed] [Google Scholar]
  • 15.Choi S, Kim C, Zhong H, Ryu ES, Han SW. Adjusted-crude-incidence analysis of multiple treatments and unbalanced samples on competing risks. Stat Interface. 2019;12(3):423–437. [Google Scholar]
  • 16.Bang H, Tsiatis AA. Estimating medical costs with censored data. Biometrika. 2000;87(2):329–343. [Google Scholar]
  • 17.Bang H, Tsiatis AA. Median regression with censored cost data. Biometrics. 2002;58(3):643–649. [DOI] [PubMed] [Google Scholar]
  • 18.Li J, Scheike TH, Zhang MJ. Checking fine and gray subdistribution hazards model with cumulative sums of residuals. Lifetime Data Anal. 2015;21(2):197–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zhou B, Fine J, Laird G. Goodness-of-fit test for proportional subdistribution hazards model. Stat Med. 2013;32(22):3804–3811. [DOI] [PubMed] [Google Scholar]
  • 20.Austin PC, Fine JP. Propensity-score matching with competing risks in survival analysis. Stat Med. 2019;38(5):751–777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Staerk L, Wang B, Preis SR, et al. Lifetime risk of atrial fibrillation according to optimal, borderline, or elevated levels of risk factors: cohort study based on longitudinal data from the framingham heart study. British Med J. 2018;k1453:361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Staerk L, Preis SR, Lin H, et al. Novel risk modeling approach of atrial fibrillation with restricted mean survival times: application in the framingham heart study community-based cohort. Circ Cardiovasc Qual Outcomes. 2020;13(4):e005918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Vinter N, Huang Q, Fenger-Gron M, Frost L, Benjamin E, Trinquart L. 45-year trend in excess mortality associated with atrial fibrillation in the community-based Framingham heart study cohorts. BMJ. 2020;370:m2724. 10.1136/bmj.m2724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol. 2008;168(6):656–664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Austin PC. The performance of different propensity score methods for estimating marginal hazard ratios. Stat Med. 2013;32(16): 2837–2849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Latouche A, Allignol A, Beyersmann J, Labopin M, Fine JP. A competing risks analysis should report results on all cause-specific hazards and cumulative incidence functions. J Clin Epidemiol. 2013;66(6):648–653. [DOI] [PubMed] [Google Scholar]
  • 27.Greenland S, Robins JM, Pearl J. Confounding and collapsibility in causal inference. Stat Sci. 1999;14(1):29–46. [Google Scholar]
  • 28.Martinussen T, Vansteelandt S. On collapsibility and confounding bias in Cox and Aalen regression models. Lifetime Data Anal. 2013;19(3):279–296. [DOI] [PubMed] [Google Scholar]
  • 29.Trinquart L, Jacot J, Conner SC, Porcher R. Reply to KH Eng et al. J Clin Oncol. 2017;35(4):466. [DOI] [PubMed] [Google Scholar]
  • 30.Beyersmann J, Schumacher M. Misspecified regression model for the subdistribution hazard of a competing risk. Stat Med. 2007;26(7):1649. [DOI] [PubMed] [Google Scholar]
  • 31.Calkins KL, Canan CE, Moore RD, Lesko CR, Lau B. An application of restricted mean survival time in a competing risks setting: comparing time to ART initiation by injection drug use. BMC Med Res Methodol. 2018;18(1):27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kipourou DK, Perme MP, Rachet B, Belot A. Direct modeling of the crude probability of cancer death and the number of life years lost due to cancer without the need of cause of death: a pseudo-observation approach in the relative survival setting. Biostatistics. 2020;kxaa017. 10.1093/biostatistics/kxaa017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sabathé C, Andersen PK, Helmer C, Gerds TA, Jacqmin-Gadda H, Joly P. Regression analysis in an illness-death model with interval-censored data: a pseudo-value approach. Stat Methods Med Res. 2020;29(3):752–764. [DOI] [PubMed] [Google Scholar]
  • 34.Pan Q, Gastwirth JL. Estimating restricted mean job tenures in semi-competing risk data compensating victims of discrimination. Ann Appl Stat. 2013;7(3):1474–1496. [Google Scholar]
  • 35.Wang X, Schaubel DE. Modeling restricted mean survival time under general censoring mechanisms. Lifetime Data Anal. 2018;24(1):176–199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Steingrimsson JA, Diao L, Molinaro AM, Strawderman RL. Doubly robust survival trees. Stat Med. 2016;35(20):3595–3612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Conner SC, Lodi S, Lunetta KL, et al. Refining the association between body mass index and atrial fibrillation: G-formula and restricted mean survival times. J Am Heart Assoc. 2019;8(16):e013011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hagiwara Y, Shinozaki T, Matsuyama Y. G-estimation of structural nested restricted mean time lost models to estimate effects of time-varying treatments on a failure time outcome. Biometrics. 2020;76(3):799–810. [DOI] [PubMed] [Google Scholar]
  • 39.Aalen OO, Johansen S. An empirical transition matrix for non-homogeneous Markov chains based on censored observations. Scand J Stat. 1978;5(3):141–150. [Google Scholar]
  • 40.Dinse GE, Larson MG. A note on semi-Markov models for partially censored data. Biometrika. 1986;73(2):379–386. [Google Scholar]
  • 41.Gaynor JJ, Feuer EJ, Tan CC, et al. On the use of cause-specific failure and conditional failure probabilities: examples from clinical oncology data. J Am Stat Assoc. 1993;88(422):400–409. [Google Scholar]
  • 42.Scheike TH, Zhang MJ, Gerds TA. Predicting cumulative incidence probability by direct binomial regression. Biometrika. 2008;95(1):205–220. [Google Scholar]
  • 43.Neuhaus JM, Jewell NP. A geometric approach to assess bias due to omitted covariates in generalized linear models. Biometrika. 1993;80(4):807–815. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

We provide R code to apply our methods with working examples and to replicate our simulation studies on GitHub at https://github.com/s-conner/rmtl. To protect the confidentiality of the Framingham Heart Study participants, the data from our illustrative examples are not on our GitHub page. Participant level data from the Framingham Heart Study are available at the database of Genotypes and Phenotypes (https://www.ncbi.nlm.nih.gov/gap/) and BioLINCC (https://biolincc.nhlbi.nih.gov/home/).

RESOURCES