Estimating differences in restricted mean lifetime using observational data subject to dependent censoring

Min Zhang; Douglas E Schaubel

doi:10.1111/j.1541-0420.2010.01503.x

. Author manuscript; available in PMC: 2014 Oct 9.

Published in final edited form as: Biometrics. 2010 Oct 29;67(3):740–749. doi: 10.1111/j.1541-0420.2010.01503.x

Estimating differences in restricted mean lifetime using observational data subject to dependent censoring

Min Zhang ^1,^✉, Douglas E Schaubel ¹

PMCID: PMC4190616 NIHMSID: NIHMS630726 PMID: 21039400

Summary

In epidemiologic studies of time to an event, mean lifetime is often of direct interest. We propose methods to estimate group- (e.g., treatment-) specific differences in restricted mean lifetime for studies where treatment is not randomized and lifetimes are subject to both dependent and independent censoring. The proposed methods may be viewed as a hybrid of two general approaches to accounting for confounders. Specifically, treatment-specific proportional hazards models are employed to account for baseline covariates, while inverse probability of censoring weighting is used to accommodate time-dependent predictors of censoring. The average causal effect is then obtained by averaging over differences in fitted values based on the proportional hazards models. Large-sample properties of the proposed estimators are derived and simulation studies are conducted to assess their finite-sample applicability. We apply the proposed methods to liver wait list mortality data from the Scientific Registry of Transplant Recipients.

Keywords: Counterfactual, Cumulative treatment effect, Inverse weighting, Proportional hazards model

1. Introduction

Often in clinical and epidemiologic studies, groups of subjects are compared with respect to their survival times. Since any study is of finite duration, the time until the event of interest may be censored. Typically in observational studies, the factor of interest is not randomized (e.g., method of treatment) and may not even be assigned (e.g., race, gender, diagnosis), necessitating some form of covariate adjustment, such as that obtained through regression modeling. Since its development, the proportional hazards model (Cox, 1972) has dominated the biomedical literature as the method of choice for the regression modeling of censored data.

The popularity of the Cox model among practitioners and, by now, clinical investigators makes it an attractive means of comparing groups in observational studies. In Cox regression, the impact of each covariate is usually summarized by its effect on the hazard function. However, when comparing groups of subjects, investigators are often more interested in differences in mean lifetime than ratios of hazard functions. The survival time distribution may be heavily right skewed. Moreover, for semi- or non-parametric modeling, the mean is not well estimated; e.g., the estimated survival function need not drop td alternative is the restricted mean lifetime; i.e., for fixed L > 0, if T denotes survival time, then the restricted mean lifetime is defined as E{min(T, L)}. Restricted mean lifetime can also be expressed as $\int_{0}^{L} P (T > t) dt$ , the area under the survival curve over (0, L], a quantity easily understood by clinical investigators. For example, if L =5 years, one could interpret E{min(T, L)} as the average number of years lived out of the next 5. Restricted mean lifetime is typically of greater interest to clinicians than the usual Cox metric, the hazard ratio. In fact, in certain settings E{min(T, L)} may be of more interest than E(T) itself. For example, in the context of pediatric liver transplantation, it is almost always that case that a child receiving a liver transplant will need a second liver transplant in the next 10 years. Hence, using T to represent post-transplant survival time, survival after the first 10 years of post-transplant follow-up could not be realistically assumed to be due to the initial liver transplant; making E{min(T, 10)} of greater relevance than E(T).

This article is motivated by the desire to compare wait list survival among end-stage liver disease (ESLD) patients listed for liver transplantation. A frequent cause of chronic liver disease is Hepatitis C virus (HCV), the primary diagnosis for approximately 40% of ESLD cases. Liver transplantation is the preferred treatment for ESLD, but there are far more patients awaiting transplantation than there are available donor organs. The principle underlying the current system for allocating deceased-donor livers in the U.S. is that priority for transplantation should be based on a patients's death rate in the absence of liver transplantation. Specifically, the patients most likely to die on the wait list should get top priority for transplantation. Currently, patients on the liver transplant wait list are sequenced in decreasing order of Model of End Stage Liver Disease (MELD) score (Weisner et al, 2001). The MELD score is a function of three laboratory measures indicative of liver function, but does not consider underlying liver disease. It is suspected that HCV+ patients have lower wait list survival than HCV- patients. However, few studies have directly compared mean wait list survival time by diagnosis group. To our knowledge, no published analysis has compared mean wait list survival times (i.e., survival, in the absence of liver transplantation) between HCV+ and HCV- patients. Therefore, our objective is to estimate the difference between wait list lifetime between HCV+ and HCV- patients, adjusting for baseline (i.e., time 0) characteristics (e.g., age, gender, race, MELD score).

Comparison of liver wait list survival times is complicated by the potential for dependent censoring. Specifically, death on the liver wait list is censored by the receipt of a liver transplant, and such censoring is not independent of the survival time that would have been observed on the wait list, even conditional on the baseline adjustment covariates. A given patient's MELD score typically changes over time. The updating of MELD scores is mandatory, meaning that a longitudinal sequence of MELD scores is observed for each patient. As discussed by several previous authors in the context of causal inference (e.g., Robins, 2000; Hernan et al 2000, 2001), a comparison of survival time by HCV status should not adjust for internal time-dependent covariates, as defined by Kalbfleisch and Prentice (2002). The fact that time-dependent MELD strongly affects both wait list mortality and censoring (liver transplantation) means that lack of its adjustment (i.e., by adjusting for baseline values only) will result in the dependent censoring of wait list death time via liver transplantation.

Various authors have proposed methods for comparing restricted mean survival time in the context of Cox regression (e.g., Karrison, 1987; Zucker, 1998; Chen and Tsiatis, 2001). For example, the method of Chen and Tsiatis (2001) proposes fitting separate group-specific Cox models, then averaging over the fitted restricted mean lifetimes, with the averaging being with respect to the covariate distribution of the entire study sample. These approaches have several nice properties. First, it is not required that treatment-specific hazards be proportional. Second, an ‘overall’ treatment effect estimator is obtained, without assuming that treatment-specific adjustment covariate effects are equal. Third, the target treatment effect is interpretable as an average over a well-defined covariate distribution. However, each of the afore-listed methods assumes that censoring conditionally independent of the survival time, conditional on baseline adjustment covariates.

We propose methods for estimating group-specific differences in restricted mean lifetime, for the setting in which survival time is dependently censored. The structure of the hazard model we assume is very flexible, allowing for group-specific baseline hazards and regression coefficients. In its most general form, this amounts to fitting separate models for each treatment group. The dependent censoring is overcome through the well-established inverse probability of censoring weighting (IPCW); see Robins and Rotnitzky (1992), Robins (1993); Robins and Finkelstein (2000).

The remainder of this article is organized as follows. In Section 2 we set up the notation and formalize the problem of interest. The proposed methods are described in Section 3. Asymptotic properties are listed in Section 4, with corresponding proofs given in the Web Appendix. In Section 5, a simulation study is presented. The proposed methods are applied to national liver wait list data in Section 6. The article concludes with a discussion in Section 7.

2. Notation and Data Structure

Suppose we are interested in comparing two groups (A = 0, 1), which are not randomized, in terms of restricted mean lifetime up to time L. We denote the survival time by T. As in almost all studies involving time to an event, the event time T may be censored due to various reasons. Two types of censoring are considered in the following development. We let C₁ denote censoring which is independent conditional on baseline covariates, Z, and group indicator, A; e.g., censoring due to the end of study. We let C₂ denote dependent censoring; i.e., censoring which is not independent of T given (Z, A). For example, in the context of the data which motivated our work, a patient's wait-list mortality is censored if and when the patient receives a liver transplant and the transplant hazard and wait list mortality hazards may be correlated, even conditional on (Z, A), through mutual dependence on time-dependent covariates (e.g., MELD score). In notation, we assume that C₁╨T|Z, A, where ╨ denotes “independent of ”; while C₂ is not assumed to satisfy this condition. In practice, one observes the minimum of the survival time and time to censoring. We therefore let U = T ∧ C₁ ∧ C₂ represent the observation time and define indicators for observing the failure time and dependent censoring times: Δ₁ = I(T ≤ C₁ ∧ C₂) and Δ₂ = I(C₂ ≤ T ∧ C₁), respectively. We let X^†(t) represent the time-dependent covariate at time t. Note that X^†(0) would include the elements of Z and potentially other factors predictive of C₂. We let X̃(t) = {X^†(u); u ∈ [0, t)} denote the history of all baseline and time-dependent covariates up to just before time t. The observed data may be summarized as O_i = {A_i, U_i, Δ₁_i, Δ₂_i, Z_i, X̃_i(U_i)}, with the O_i assumed to be independent and identically distributed (iid) across i = 1,…, n. Note that the set of observed variates is redundant, in the sense that X̃(t) includes all baseline covariates Z; however, this representation is convenient for presentation purposes.

To define the parameter of interest, we follow the potential outcome framework studied by Rubin (1974 Rubin (1978) and adopted by Chen and Tsiatis (2001). Let T⁰ denote the potential (or counterfactual) lifetime for a randomly selected subject from the population if, possibly contrary to the fact, s/he were in group 0, and similarly T¹ the potential random variable corresponding to group 1. In reality, T⁰ and T¹ are never observed simultaneously for a subject and the (possibly) observed survival time T relates the two-dimensional potential outcomes (T⁰, T¹) through T = I(A = 0)T⁰ + I(A = 1)T¹. The group-specific difference in restricted mean lifetime is defined as

δ = E {min (T^{1}, L)} - E {min (T^{0}, L)} = \int_{0}^{L} {P (T^{1} > t) - P (T^{0} > t) dt .

(1)

Under the assumption that(T⁰, T¹)╨A|Z, it follows that P(T > t|A = j, Z) = P(T^j > t|A = j, Z) for j = 0, 1. Although defined through potential outcomes, the parameter of interest, δ, can then be expressed in terms of observable variates as

δ = \int_{0}^{L} E_{Z} {P (T > t | A = 1, Z) - P (T > t | A = 0, Z)} dt,

(2)

where the expectation E_Z is taken respect to the marginal distribution of Z. When A is an indicator of treatment, δ has the interpretation of average causal treatment effect. We set S_j(t|Z) = P(T > t|A = j, Z) and S_j(t) = E_Z{S_j(t|Z)} for j = 0, 1.

Estimators for δ are proposed by Chen and Tsiatis (2001) under the assumption of independent censoring. That is, in our notation, it was assumed that censoring C₂ does not exist. Using sample averages to estimate expectations, if one can obtain estimators for S₀(t|Z) andS₁(t|Z), say, Ŝ₀(t|Z) and Ŝ₁(t|Z) respectively, then a natural estimator for δ is given by

\hat{δ} = \int_{0}^{L} n^{- 1} \sum_{i = 1}^{n} {{\hat{S}}_{1} (t | Z_{i}) - {\hat{S}}_{0} (t | Z_{i})} dt .

(3)

3. Methods

As argued in the previous section, we wish to model the conditional survival function of T given baseline covariates and group indicator, S_j(t|Z), j = 0, 1. Because of its flexibility and popularity in practice, we adopt the Cox's proportional hazards model (Cox, 1972, 1975) and, in the following development, we work with the more general model where both baseline hazards functions and regression coefficients of Z are allowed to vary by group. Specifically, it is assumed that

\begin{matrix} λ_{i j} (t) \equiv λ (t | Z_{i}, A_{i} = j) = λ_{0 j} (t) exp (β_{j}^{T} Z_{i}), & j = 0, 1, \end{matrix}

(4)

where λ(t|Z_i, A_i = j) denotes the conditional hazard function given baseline covariates Z_i and membership in group A_i = j, while λ_0j(t) is the unspecified baseline hazard function for group A = j.

We now describe how to estimate parameters for model (4). To begin, suppose that C₂_i was not dependent censoring, but was instead another form of censoring that was conditionally independent of T given (Z, A). In this case, β_j could be consistently estimated by the maximum partial likelihood estimator, ${\hat{β}}_{j}^{*}$ , which could be computed as the root of the estimating equation,

\sum_{i = 1}^{n} \int_{0}^{T} {Z_{i} - \frac{\sum_{ℓ = 1}^{n} Z_{ℓ} exp (β_{j}^{T} Z_{ℓ}) Y_{ℓ j} (t)}{\sum_{ℓ = 1}^{n} exp (β_{j}^{T} Z_{ℓ}) Y_{ℓ j} (t)}} d N_{i j} (t) = 0,

(5)

where τ satisfies P(U ⩾ τ) > 0 and, in practice, can be set to the maximum observation time in the study; N_ij(t) = I(A_i = j)I(U_i ≤ t,Δ₁ _i = 1); and Y_ij(t) = I(A_i = j)I(U_i ⩾ t). Additionally, $Λ_{0 j} (t) = \int_{0}^{t} λ_{0 j} (u) d u$ could be consistently estimated by the Breslow estimator,

{\hat{Λ}}_{0 j}^{*} (t) = \int_{0}^{t} \frac{\sum_{i = 1}^{n} d N_{i j} (t)}{\sum_{i = 1}^{n} exp ({\hat{β}}_{j}^{* T} Z_{i}) Y_{i j} (t)} .

(6)

From a different perspective, ${\hat{Λ}}_{0 j}^{*} (t)$ and ${\hat{β}}_{j}^{*}$ are the solutions to the following estimating equations,

\sum_{i = 1}^{n} \int_{0}^{t} d M_{i j} (u; β, Λ) = 0

(7)

\sum_{i = 1}^{n} \int_{0}^{τ} Z_{i} d M_{i j} (t; β, Λ) = 0

(8)

respectively, where dM_ij(t;β,Λ) = dN_ij(t) − Y_ij(t)e^{β^T Z_i}dΛ(t) and dM_ij(t) ≡ dM_ij(t;β_j, Λ₀ _j). Under the assumption that (C₁ _i, C₂ _i)╨T_i|(A_i, Z_i), we have E{dM_ij(u)|Z_i, A_i} = 0 and E{Z_i dM_ij(u)|Z_i, A_i} = 0 such that (7) and (8) have mean zero at the true parameter values.

However, as stated previously, although C₁_i ╨T_i |(A_i, Z_i), it is not the case that C_2i╨T_i|(A_i, Z_i). As a result, neither (7) nor (8) have mean zero at the truth, meaning that consistent estimators of β_j and Λ₀_j(t) cannot be obtained from (5) and (6) respectively. We assume that the dependence of C_2i and T_i occurs through (and only through) the time-dependent process, X̃_i(U_i); i.e., observed data. That is, we assume that C_2i is conditionally independent of T_i given {Z_i, A_i, X̃_i(U_i)}; a condition we can express formally as follows,

\begin{array}{l} lim_{h \to 0} h^{- 1} P {t \leq U_{i} < t + h, Δ_{2 i} = 1 | U_{i} ⩾ t, A_{i}, {\tilde{X}}_{i} (t), T_{i}} \\ = & lim_{h \to 0} h^{- 1} P {t \leq U_{i} < t + h, Δ_{2 i} = 1 | U_{i} ⩾ t, A_{i}, {\tilde{X}}_{i} (t)} . \end{array}

(9)

Assumption (9) is the critical “no unmeasured confounders” (Rubin, 1977; Robins, 1993) for censoring assumption. In our setting, the assumption essentially states that the hazard of being censored by C₂ at time t depends only on observed data up to time t and not additionally on future possibly unobserved data. We define the hazard function for the dependent censoring time, C_2i for a subject in group j as

λ_{i j}^{C} (t) = lim_{h \to 0} h^{- 1} P {t \leq U_{i} < t + h, Δ_{2 i} = 1 | U_{i} ⩾ t, A_{i} = j, {\tilde{X}}_{i} (t)},

then set $Λ_{i j}^{C} (t) = \int_{0}^{t} λ_{i j}^{C} (u) d u$ .

We return now to the issue of estimating the parameters in (4). Reconsidering (7) and (8), although E{dM_ij(t)|Z_i, A_i} ≠ 0, under (9), it can be shown that $E [exp {Λ_{i j}^{C} (t)} d M_{i j} (t) | Z_{i}, A_{i}, {\tilde{X}}_{i} (t)] = 0$ (Robins and Finkelstein, 2000) and, after iterating the expectation, that $E [exp {Λ_{i j}^{C} (t)} d M_{i j} (t) | Z_{i}, A_{i}] = 0$ . More generally, it can be shown that E{W_ij(t)dM_ij(t)|Z_i, A_i} = 0, where $W_{i j} (t) = exp {Λ_{i j}^{C} (t)} κ (t; Z_{i}, A_{i})$ , where the function κ(t;Z_i, A_i) acts as a stabilization factor. Similarly, it can be shown that E{W_ij(t)Z_i dM_ij(t)|Z_i, A_i} = 0. Combining these zero-mean properties suggests the following set of inverse probability of censoring weighted (IPCW) estimating equations,

\sum_{i = 1}^{n} \int_{0}^{t} W_{i j} (u) d M_{i j} (u; β, Λ) = 0

(10)

\sum_{i = 1}^{n} \int_{0}^{τ} W_{i j} (t) Z_{i} d M_{i j} (t; β, Λ) = 0 .

(11)

Substituting the solution to (10) into (11) then re-organizing algebraically suggests that β_j be estimated by the solution to

\sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \frac{\sum_{ℓ = 1}^{n} W_{ℓ j} (t) Y_{ℓ j} (t) Z_{ℓ} exp (β_{j}^{T} Z_{ℓ})}{\sum_{ℓ = 1}^{n} W_{ℓ j} (t) Y_{ℓ j} (t) exp (β_{j}^{T} Z_{ℓ})}} W_{i j} (t) d N_{i j} (t) = 0,

(12)

and that the weighted Breslow estimator,

{\hat{Λ}}_{0 j} (t) = \int_{0}^{t} \frac{\sum_{i = 1}^{n} W_{i j} (u) d N_{i j} (u)}{\sum_{i = 1}^{n} W_{i j} (u) Y_{i j} (u) exp ({\hat{β}}_{j}^{T} Z_{i})},

(13)

be used to estimate Λ₀_j(t) for j = 0, 1.

The estimators in (12) and (13) are Inverse Probability of Censoring Weighting (IPCW) estimators (Robins and Rotnitzky, 1992; Robins, 1993; Robins and Finkelstein, 2000). The quantity $exp {Λ_{i j}^{C} (t)}$ can be thought of heuristically as the inverse of the probability of not having been dependently censored as of time t. Note that X̃_i(t) is an internal time-dependent covariate (Kalbfleisch & Prentice, 2002); i.e., a process generated by subject i (as opposed to an external time-dependent covariate such as temperature or air quality). Therefore, $exp {Λ_{i j}^{C} (t)}$ is not actually a probability, per se, but a product of conditional probabilities. Nonetheless, using Robins' Fundamental Identities (Robins & Rotnitzky, 1992; Robins and Finkelstein, 2000), it can be shown that the estimating function in (12) can be expressed as a (dependent censoring process) Martingale integral and hence has mean 0; a proof for which is outlined in Section 2 of the Web Appendix. The function κ(t; A_i, Z_i) can be any function of Z_i and A_i (since these are conditioned upon by model (4) anyway) and is intended to stabilize the weighted estimators. In particular, $exp {Λ_{i j}^{C} (t)}$ could be quite large towards the tail of the observation time distribution, which would result in weights which are quite large. One choice of κ which has been suggested (e.g., Robins and Finkelstein, 2000; Hernán, Brumback, and Robins, 2000) is $exp {Λ_{i j}^{C} (t | Z_{i}, A_{i})}$ . While $Λ_{i j}^{C} (t)$ would be based on a time-to-censoring model which used X̃_i(t−) as covariates, $Λ_{i j}^{C} (t | Z_{i}, A_{i})$ would only use the baseline values. If censoring was in fact independent, then W_ij(t) would tend towards 1. κ(t; A_i, Z_i) = 1, which may be appropriate if censoring is light or moderate, in which case W_ij(t) does not get unduly large. Hereafter, we refer to κ(t; A_i, Z_i) = 1 as the ‘unstabilized’ estimator. Stabilized estimators are intended to be more efficient than the unstabilized version, at the expense of additional modeling effort.

In practice, $Λ_{i j}^{C} (t)$ in the weight function is unknown to us and therefore has to be modeled and estimated. To fit models for the dependent censoring time C₂ _i, one uses U_i as the censored time variable but use Δ₂_i as the indicator for observing C₂_i. Again, due to its flexibility, the proportional hazards model is a natural choice for the dependent censoring time, C_2i, with the model being conditional on the group indicator and both baseline and time-dependent covariates. To allow for more flexibility in modeling and hence robustness in estimating the weight, one could fit group-specific Cox models,

λ_{i j}^{C} (t) = λ_{0 j}^{C} (t) exp {θ_{j}^{T} X_{i} (t)},

where $λ_{0 j}^{C} (t)$ , j = 0, 1, are unspecified group-specific baseline hazard functions for C₂_i and X_i(t) is a function of X̃_i(t) determined empirically (e.g., using standard model selection techniques, such as stepwise regression) to satisfy $λ_{i j}^{C} (t | {\tilde{X}}_{i} (t)) = λ_{i j}^{C} (t | X_{i} (t))$ . Based on the fitted model, one can estimate $Λ_{i j}^{C} (t)$ by ${\hat{Λ}}_{i j}^{C} (t)$ , which can then be used to compute the estimated weight function, Ŵ_ij(t).

Having estimated β_j and Λ₀ _j(t), one can correspondingly estimate S_ij(t) ≡ S_j(t|Z_i) by

{\hat{S}}_{j} (t | Z_{i}) \equiv {\hat{S}}_{i j} (t) = exp {- {\hat{Λ}}_{0 j} (t) exp ({\hat{β}}_{j}^{T} Z_{i})}, j = 0, 1 .

(14)

Finally, the proposed estimat or for difference in restricted mean lifetime δ is then given by

\hat{δ} = \int_{0}^{L} {{\hat{S}}_{1} (t) - {\hat{S}}_{0} (t)} dt,

(15)

where ${\hat{S}}_{j} (t) = n^{- 1} \sum_{i = 1}^{n} {\hat{S}}_{i j} (t)$ for j = 0, 1.

The key step in implementing the proposed method is to solve the weighted estimating equation (12), for which existing software may be exploited. For example, one can use proc phreg (SAS Institute; Cary, NC) with the counting process input format and the weight option. Correspondingly, Λ̂₀ _j(t) and Ŝ_ij(t) can be easily obtained. Estimating the variance is more involved, requiring additional programming (e.g., SAS's proc iml). For illustrative purpose, a SAS macro implementing the proposed methods is available at http://www.sph.umich.edu/mzhangst/.

4. Asymptotic Properties

In this section, we derive the asymptotic properties of the proposed estimators given by (15). To begin, we specify the regularity conditions, assumed to hold for i = 1,…, n and j = 0, 1.

{A_i, Z_i, U_i, Δ₁_i, Δ₂ _i, X̃_i(U_i)} are independent and identically distributed.
P(U_i ⩾ τ) > 0.
|Z_ik| < b_z and $\int_{0}^{τ} d | X_{i k} (t) | < b_{X}$ , where b_z < ∞ and b_X < ∞, and k denotes the kth element.
Λ₀_j(τ) < ∞; $Λ_{0 j}^{C} (τ) < \infty$ .
For d = 0, 1, 2,

$\begin{matrix} sup_{t \in [0, τ]} ‖ R_{j}^{(d)} (t; β) - r_{j}^{(d)} (t; β) ‖ & \overset{p}{⟶} 0 \\ sup_{t \in [0, τ]} ‖ R_{c j}^{(d)} (t; θ) - r_{C j}^{(d)} (t; θ) ‖ & \overset{p}{⟶} 0, \end{matrix}$

where we define

$\begin{matrix} R_{j}^{(d)} (t; β) = n^{- 1} \sum_{i = 1}^{n} Y_{i j} (t) W_{i j} (t) Z_{i}^{\otimes d} exp {β^{T} Z_{i}} \\ R_{C j}^{(d)} (t; β) = n^{- 1} \sum_{i = 1}^{n} Y_{i j} (t) X_{i}^{\otimes d} exp {θ^{T} X_{i} (t)} \end{matrix}$

with z^⊗0 = 1, z^⊗1 = z and z^⊗2 = zz^T.
The matrices Ω_j(β_j) and $Ω_{j}^{C} (θ_{j})$ are assumed to be positive-definite, where

$\begin{matrix} Ω_{j} (β) = E [\int_{0}^{τ} {\frac{r_{j}^{(2)} (t; β)}{r_{j}^{(0)} (t; β)} - {\bar{z}}_{j} {(t; β)}^{\otimes 2}} r_{j}^{(0)} (t; β) λ_{0 j} (t) dt] \\ Ω_{j}^{C} (θ) = E [\int_{0}^{τ} {\frac{r_{C j}^{(2)} (t; θ)}{r_{C j}^{(0)} (t; θ)} - {\bar{x}}_{j} {(t; θ)}^{\otimes 2}} r_{C j}^{(0)} (t; θ) λ_{0 j}^{C} (t) d t], \end{matrix}$

with $\bar{z} (t; β) = r_{j}^{(1)} (t; β) / r_{j}^{(0)} (t; β)$ and $\bar{x} (t; θ) = r_{C j}^{(1)} (t; θ) / r_{C j}^{(0)} (t; θ)$ .
P(A_i = j|Z_i) ∈ (0, 1).

Variations on Condition (a) are possible, although at the expense of additional technical (e.g., Lindeberg-type) conditions. Condition (b) is a standard identifiability criterion. The boundedness implied by Condition (c) helps ensure the convergence of the several stochastic integrals used in the proofs; the same can be said for Condition (d). The second-derivative matrices in condition (f) are at least non-negative definite and will be positive-definite under any sensible specification of the covariate vectors. Condition (g) is the well-known positivity requirement from the causal inference literature. If it fails, δ fails to have a causal interpretation.

We describe the primary asymptotic result for our proposed unstabilized group effect estimator in the following theorem.

Theorem 1: Under conditions (a) ‒ (g), as n → ∞, δ̂ converges in probability to δ, and for the unstabilized estimator, n^1/2(δ̂ ‒ δ) converges to a zero-mean Normal with variance E{(ϕ_i1 − ϕ_i0)²}, where

\begin{array}{l} ϕ_{i j} = & - E [Z_{i}^{T} \int_{0}^{L} {μ_{i j} (L) - μ_{i j} (t)} d Λ_{i j} (t)] Ω_{j}^{- 1} (β_{j}) U_{i j} (β_{j}) \\ - \int_{0}^{L} E [e^{β_{j}^{T} Z_{i}} {μ_{i j} (L) - μ_{i j} (t)}] d Φ_{i j} (t) + (μ_{i j} - μ_{j}), \end{array}

where $μ_{i j} (L) = μ_{i j}, μ_{i j} (t) = \int_{0}^{t} S_{i j} (u) d u, n^{\frac{1}{2}} {{\hat{Λ}}_{0 j} (t) - Λ_{0 j} (t)} = n^{- \frac{1}{2}} \sum_{i = 1}^{n} Φ_{i j} (t) + o_{p} (1)$ and $n^{\frac{1}{2}} ({\hat{β}}_{j} - β_{j}) = Ω_{j} {(β_{j})}^{- 1} n^{- \frac{1}{2}} \sum_{i = 1}^{n} U_{i j} (β_{j}) + o_{p} (1)$ with Φ_ij(t) and U_ij(β_j) defined in the Web Appendix.

The variance can be consistently estimated by $n^{- 1} {\sum_{i = 1}^{n} ({\hat{ϕ}}_{i 1} - {\hat{ϕ}}_{i 0})}^{2}$ , where ϕ̂_ij is obtained by replacing limiting values in ϕ_ij with their empirical counterparts. However, as shown in the Web Appendix, the computation of ϕ̂_ij is quite complicated owing to the complexity of Û_ij(β̂_j) and Φ̂_ij(t). As a result, estimating the variance through ϕ̂_ij is very inconvenient computationally. A computationally attractive alternative is to estimate Var(δ̂) by $n^{- 1} \sum_{i = 1}^{n} {({\hat{ϕ}}_{i 1}^{†} - {\hat{ϕ}}_{i 0}^{†})}^{2}$ , where ${\hat{ϕ}}_{i j}^{†}$ is obtained by replacing Û_ij(β̂_j) and Φ̂_ij(t) with ${\hat{U}}_{i j}^{†} ({\hat{β}}_{j})$ and ${\hat{Φ}}_{i j}^{†} (t)$ , respectively, where

\begin{array}{l} {\hat{U}}_{i j}^{†} (β) = \int_{0}^{τ} {Z_{i} - {\bar{Z}}_{j} (t; β, W)} {\hat{W}}_{i j} (t) d {\hat{M}}_{i j} (t) \\ {\hat{Φ}}_{i j}^{†} (t) = {\hat{h}}_{j}^{T} (t) {\hat{Ω}}_{j} {({\hat{β}}_{j})}^{- 1} {\hat{U}}_{i j}^{†} ({\hat{β}}_{j}) + \int_{0}^{t} {\hat{W}}_{i j} (s) R_{j}^{(0)} {(s; {\hat{β}}_{j}, \hat{W})}^{- 1} d {\hat{M}}_{i j} (s) . \end{array}

The key difference between ϕ̂_ij and

\begin{array}{l} {\hat{ϕ}}_{i j}^{†} = & - n^{- 1} \sum_{i = 1}^{n} [Z_{i}^{T} \int_{0}^{L} {{\hat{μ}}_{i j} (L) - {\hat{μ}}_{i j} (t)} d {\hat{Λ}}_{i j} (t)] {\hat{Ω}}_{j}^{- 1} ({\hat{β}}_{j}) {\hat{U}}_{i j}^{†} ({\hat{β}}_{j}) \\ - n^{- 1} \sum_{i = 1}^{n} \int_{0}^{L} [e^{{\hat{β}}_{j}^{T} z_{i}} {{\hat{μ}}_{i j} (L) - {\hat{μ}}_{i j} (t)}] d {\hat{Φ}}_{i j}^{†} (t) + ({\hat{μ}}_{i j} - {\hat{μ}}_{j}), \end{array}

is that the former accounts for the fact that Ŵ_ij(t) is estimated, while ${\hat{ϕ}}_{i j}^{†}$ is derived with Ŵ_ij(t) treated as fixed.

We refer to (ϕ_i₁ − ϕ_i0) in Theorem 1 as the influence function of δ̂, which satisfies $n^{\frac{1}{2}} (\hat{δ} - δ) = n^{- \frac{1}{2}} \sum_{i = 1}^{n} (ϕ_{i 1} - ϕ_{i 0}) + o_{p} (1)$ . The asymptotic results stated in Theorem 1 are for the unstabilized estimator with the stabilization factor κ(t; A_i, Z_i) = 1. As equations (12) and (13) are unbiased estimating equations for general κ(t; A_i, Z_i), by similar argument, it can be shown that consistency and asymptotic normality hold for estimators with κ(t; A_i, Z_i) ≠ 1 but the form of influence function will be different and even more complicated. Therefore, treating the weights as fixed would be a practical way to estimate the variance of the proposed estimators. As demonstrated in the Web Appendix, for general κ(t;| A_i, Z_i), Var(δ̂) can still be estimated by $n^{- 1} {\sum_{i = 1}^{n} ({\hat{ϕ}}_{i 1}^{†} - {\hat{ϕ}}_{i 0}^{†})}^{2}$ with ${\hat{ϕ}}_{i j}^{†}$ specified previously. According to Tsiatis (2006) Chapter 9.1, the influence function of δ̂, if the weight is estimated, is the projection of the influence function when weight is known and fixed onto the orthogonal complement of the nuisance tangent space; e.g., the spaces associated with the nuisance baseline hazard function and nuisance parameter θ_j in the model for C₂. As the true influence function is a projection, its variance is smaller than the influence function if weight is fixed and therefore the proposed variance estimator will be conservative in estimating the variance of δ̂ where weight is actually estimated. This point is discussed by several previous authors (Hernán et al. 2000 and 2001; Pan and Schaubel, 2008). The effect of estimating the weight is slight, as will be demonstrated empirically in the next section.

5. Simulation Study

We report on simulations to evaluate performance of the proposed methods. Results of two other methods are also reported: a naive method, which estimates δ by taking difference in areas under group-specific Kaplan-Meier curves from time 0 to L and consequently ignores all possible confounding, and the method proposed by Chen and Tsiatis (2001), which adjusts for baseline covariates but not time-dependent confounders for censoring.

Data were generated under three scenarios, corresponding to different confounding mechanisms and, in each scenario, two different percentages of censoring were considered, referred to as light censoring or heavy censoring cases. Specifically, in each scenario, for the light censoring case, about 20% subjects are censored by C₂ and about 5% are censored by C₁, and for the heavy censoring case, about 30% subjects are censored by C₂ and about 10% are censored by C₁. All reported results are based on 2000 Monte Carlo datasets, and L is chosen to be 15.

In the first scenario, data were generated such that both baseline and time-dependent confounders exist. For each Monte Carlo dataset, a single baseline confounder Z was generated as a truncated standard normal, truncated at -4 and 4 on each side, and group indicator A as Bernoulli with parameter exp(−0.6Z)/{1 + exp(−0.6Z)}. Survival time T was generated by transforming ε₁ ∼ Uniform (0, 1) using the inverse of the cumulative distribution function (cdf) of a Weibull distribution with shape parameter 1.25 and scale parameter exp(−0.3Z− 3.3) for group A = 1 or exp(−0.4Z−3) for A = 0. We then generated dependent censoring C₂ such that it depends both on baseline and time-dependent covariates as follows. In order for a time-dependent covariate to be a confounder, it should be correlated both with T and C₂ conditioning on (A, Z). To achieve this, we first generated X_t such that X_t = −5log{Aε₁ + (1 − A)(1 − ε₁)} + ε₂, where ε₂ ∼ Uniform(0, 1), independent of all other variables, and then let X(t) = I(X_t ⩾ t). Consequently, the time-dependent covariate X(t) is correlated with survival time T through their mutual relationships with ε₁ and such correlation exits even conditioning on (A, Z). Next, we generated dependent censoring time C₂ using a proportional hazards model with hazard rate exp{γ₁ + 0.2A + 0.2Z+γ₂X(t)}. This procedure ensures that X(t) is a time-dependent confounder and C₂ follows a proportional hazards model with Z and X(t) as our model assumes. Finally, censoring time C₁ was generated as Weibull with shape and scale parameters 3 and exp(γ₃) respectively. The coefficients (γ₁, γ₂, γ₃) are set to (-5.1, 1.5, -11) for light censoring case and to (-4.45, 1.5,-9.7) for heavy censoring case.

In the second scenario, data were generated such that, conditioning on (A, Z), timedependent covariate X(t) correlates only with survival time T but not with censoring time C₂ and therefore it is not a confounder if (A, Z) is properly adjusted for. We compare the proposed methods with the Chen and Tsiatis (2001) method, which should be consistent and asymptotically normal under this scenario. Data are generated similarly as before except for that γ₂ was set to 0, ensuring that X(t) does not affect C₂. Specifically, we chose (γ₁, γ₂, γ₃) equal to (-4, 0, -11) for light censoring case and equal to (-3.4, 0, -9.7) for heavy censoring case.

In the third scenario, we let the time-dependent covariateX(t) be conditionally independent of survival time T but still correlated with censoring time C₂ and, consequently, is not a confounder either. Data were generated the same as in scenario one except for that X_t = —51og{Aε₃ + (1 — A)(1 — ε₃)} + ε₂, where ε₃ ∼ Uniform(0, 1), independent of all other variables. We evaluate how the proposed methods compare with those of Chen and Tsiatis (2001), which should be unbiased under this scenario. All coefficients are set equal to those used in scenario 1.

Tables 1 and 2 list the results of our simulation study. The proposed estimators (unstabilized or stabilized) are consistent for the true parameter under all scenarios, whereas the Chen and Tsiatis (2001) method leads to biased estimators under scenario 1, where a time-dependent confounder exists. The naive estimator is biased under all three scenarios, where baseline or/and time-dependent confounders exist. Although the variance is estimated by treating the estimated weight as fixed, coverage probabilities of the proposed estimators achieve the nominal level. On the other hand, due to the non-ignorable bias, coverage probabilities of the other two estimators are very low, illustrating that the impact of confounders can be severe if they are not properly accounted for in the analysis.

Table 1.

Estimators for difference in restricted mean lifetime under scenario 1-3, light censoring case (2000 Monte-Carlo datasets; sample size: 1000; True is the true value of parameter; MC Bias is the Monte Carlo Bias; MC SD is the Monte Carlo standard deviation of estimates; Ave. SE is the Monte Carlo average of estimated standard errors; CP is the coverage probability of nominal 95% Wald confidence intervals.)

Method	True	MC Bias	MC SD	Ave. SE	CP
Light Censoring, Scenario 1
Unstabilized Inverse Weighting	1.172	0.021	0.319	0.327	0.950
Stabilized Inverse Weighting	1.172	0.015	0.317	0.327	0.952
Chen & Tsiatis' Method	1.172	0.316	0.321	0.324	0.840
Naive	1.172	-0.429	0.325	0.332	0.755
Light Censoring, Scenario 2
Unstabilized Inverse Weighting	1.172	0.005	0.321	0.330	0.954
Stabilized Inverse Weighting	1.172	0.002	0.321	0.330	0.953
Chen & Tsiatis' Method	1.172	0.005	0.323	0.329	0.953
Naive	1.172	-0.729	0.326	0.336	0.422
Light Censoring, Scenario 3
Unstabilized Inverse Weighting	1.172	-0.013	0.337	0.328	0.939
Stabilized Inverse Weighting	1.172	-0.016	0.337	0.327	0.936
Chen & Tsiatis' Method	1.172	-0.017	0.336	0.326	0.934
Naive	1.172	-0.746	0.341	0.332	0.392

Open in a new tab

Table 2. Estimators for difference in restricted mean lifetime under Scenario 1-3, heavy censoring case (entries are as in Table 1).

Method	True	MC Bias	MC SD	Ave. SE	CP
Heavy Censoring, Scenario 1
Unstabilized Inverse Weighting	1.172	0.040	0.331	0.342	0.948
Stabilized Inverse Weighting	1.172	0.050	0.334	0.344	0.947
Chen & Tsiatis' Method	1.172	0.593	0.334	0.337	0.584
Naive	1.172	-0.157	0.339	0.344	0.926
Heavy Censoring, Scenario 2
Unstabilized Inverse Weighting	1.172	0.006	0.329	0.329	0.959
Stabilized Inverse Weighting	1.172	0.003	0.328	0.346	0.960
Chen & Tsiatis' Method	1.172	0.004	0.334	0.345	0.956
Naive	1.172	-0.725	0.344	0.350	0.455
Heavy Censoring, Scenario 3
Unstabilized Inverse Weighting	1.172	-0.009	0.350	0.345	0.936
Stabilized Inverse Weighting	1.172	-0.014	0.348	0.343	0.941
Chen & Tsiatis' Method	1.172	-0.015	0.345	0.340	0.940
Naive	1.172	-0.743	0.348	0.344	0.424

Open in a new tab

Under scenarios 2 and 3, where there are no time-dependent confounders, the usual partial likelihood estimator is semiparametric efficient in estimating coefficients of a Cox proportional hazards model. Therefore, the Chen and Tsiatis (2001) method can be used as a benchmark to evaluate the loss of efficiency of the proposed methods due to inverse probability weighting. Our results demonstrate that stabilized version of the proposed estimator behaves very similarly to the estimator of Chen and Tsiatis (2001). This would be expected since the weights would tend towards 1 in this scenario, such that the loss of efficiency (corresponding to the unstabilized weighting) is only mild. In addition, simulation results suggest that the effect of the stabilization factor, κ(t; Z, A), on the efficiency is more pronounced for estimators of the Cox regression parameter, compared to the estimators of δ. Results in Table 3 show that, at least under the simulated scenarios, stabilization results in considerable efficiency gains for β̂_j but only mild increases in precision for δ̂.

Table 3.

Comparing efficiency of the unstabilized and stabilized inverse weighting methods in estimating δ in (1) and coefficients of Cox's proportional hazards models in (4) (δ̂ is weighted estimator for δ; β̂_j is weighted estimator for β_j in (4), j = 0, 1; RE is the relative efficiency compared to the unstabilized inverse weighting method, calculated as the square of the ratio of the Monte Carlo standard deviation for the unstabilized inverse weighting estimator over that for the indicated estimator)

Method	β̂₁		β̂₀		δ̂
Method	MC SD	RE	MC SD	RE	MC SD	RE
Light censoring, Scenario 1
Unstabilized Inverse Weighting	0.059	1	0.058	1	0.319	1
Stabilized Inverse Weighting	0.057	1.085	0.055	1.114	0.317	1.011
Light censoring, Scenario 2
Unstabilized Inverse Weighting	0.058	1	0.058	1	0.321	1
Stabilized Inverse Weighting	0.057	1.035	0.057	1.060	0.320	1.004
Light censoring, Scenario 3
Unstabilized Inverse Weighting	0.060	1	0.060	1	0.337	1
Stabilized Inverse Weighting	0.057	1.094	0.056	1.143	0.337	0.999
Heavy censoring, Scenario 1
Unstabilized Inverse Weighting	0.069	1	0.066	1	0.334	1
Stabilized Inverse Weighting	0.063	1.195	0.059	1.250	0.331	1.016
Heavy censoring, Scenario 2
Unstabilized Inverse Weighting	0.064	1	0.066	1	0.329	1
Stabilized Inverse Weighting	0.062	1.070	0.062	1.110	0.328	1.006
Heavy censoring, Scenario 3
Unstabilized Inverse Weighting	0.069	1	0.072	1	0.350	1
Stabilized Inverse Weighting	0.062	1.217	0.063	1.320	0.348	1.024

Open in a new tab

6. Application

Data were obtained from the Scientific Registry of Transplant Recipients (SRTR). The study population (n = 6, 371) included all chronic liver disease patients initially wait listed for deceased-donor liver transplantation in the U.S. at age ⩾ 18 between March 1, 2002 and February 28, 2003. For each patient, the time origin (t = 0) was the date of wait listing. Patients were followed from that date until the earliest of death, receipt of a liver transplant, loss to follow-up and the end of the observation period: December 31, 2008.

The event of interest was wait list mortality. Independent censoring consisted of random loss to follow-up and administrative censoring at the end of the observation period. Dependent censoring occurred through liver transplantation which, although not preventing the observation of death, does preclude wait list death.

The objective of the analysis was to compare 5-year mean wait list survival time between Hepatitis C positive (HCV+) versus HCV- patients. HCV is a leading cause of chronic liver disease. Baseline adjustment covariates included the following factors, as measured at the time of wait initial listing: age, gender, race, region, Model for End-stage Liver Disease (MELD) score, serum albumin, sodium, bodymass index, diabetes, hospitalization status, ascites, dialysis and encephalopathy. The MELD score is a log linear combination of serum creatinine, bilirubin and international normalized ratio for prothrombin time. MELD has been shown to be a very strong predictor of wait list mortality. Currently, patients are ordered on the wait list in decreasing order of MELD score, such that the higher the MELD score, the greater the liver transplant hazard.

MELD is time-dependent, since a patient's score will be updated regularly. Other time-dependent covariates include dialysis, serum albumin, sodium and active and removal status. In the time-until-transplant model, each of these factors was represented in the time-dependent covariate vector. The Cox model for transplant was given by

λ_{i}^{C} (t) = {1 - I_{i} (t)} {1 - R_{i} (t)} λ_{0}^{C} (t) exp {θ_{0}^{T} X_{i} (t)},

(16)

where I_i(t) is an indicator for being inactivated from the wait and R_i(t) is an indicator for being removed from the wait list at time t. It is not possible for a patient to receive a transplant while they are inactivated (usually temporary) or removed (typically permanent). To fit model (16), we deleted patient subintervals where either I_i(t) = 1 or R_i(t) = 1. After model (16) was fitted, the IPCW weight was then computed using $\int_{0}^{t} {1 - I_{i} (s)} {1 - R_{i} (s)} d {\hat{Λ}}_{0}^{C} (s)$ , such that the transplant hazard increment was set to 0 for each subinterval where the patient was either inactivated or removed.

The study population consisted of 2,754 HCV+ (j=1) and 3,617 HCV- (j=0) patients. There were a total of 1,849 wait list deaths; 3,194 liver transplants and 1,328 independently censored subjects. Average survival curves are presented in Figure 1. Average wait list survival probability was 51% for HCV- patients at 5 years, compared to 41% for HCV+ patients. In Table 4, we compute 1-, 3- and 5-year average restricted mean wait list lifetime for HCV+ and HCV- patients. For each of L = 1, L = 3 and L = 5 years, average restricted mean lifetime is significantly greater for HCV- compared to HCV+ patients. In Figure 2, we plot the point estimates and 95% confidence intervals for δ.

Average wait list survival probability for HCV- and HCV + patients at 5 years.

Table 4.

Average restricted mean wait list lifetime for HCV+ and HCV- patients restricted to L = 1, L = 3 and L = 5 years.

L	μ̂₁	μ̂₀	δ̂	SE (δ̂)	p
1	0.83	0.85	-0.02	0.01	0.047
3	2.10	2.21	-0.11	0.03	0.002
5	3.04	3.31	-0.26	0.06	<0.0001

Open in a new tab

Point estimates and 95% confidence intervals for differences in average restricted mean wait list lifetime between HCV+ and HCV- patients.

7. Discussion

In this article, we have proposed methods for estimating differences in restricted mean survival time between groups where group assignment is not randomized and, conditional on baseline covariates and group assignment, censoring may still be correlated with survival time. Differences in restricted mean lifetime may be of direct interest, and could also serve as a cumulative effect measure in settings where group-specific hazards are non-proportional. To be general, in our formulation, we considered that both conditionally independent and dependent censoring exist, which is often the case in practice; this formulation includes as a special case when only one type of censoring exits. The proposed methods employ two general approaches to account for two types of confounders. The proposed methods combine inverse probability of censoring weighting (e.g., Robins & Rotnitzky, 1992) and the procedure of explicitly averaging over the marginal covariate distribution (e.g., Chen & Tsiatis, 2001).

In our proposed procedure, computation is simplified by treating the IPCW weights as fixed. Since the inverse weights are actually estimated using the data, treating them as fixed should result in conservative confidence intervals and hypothesis tests, as reported by several previous authors (e.g., Pan & Schaubel, 2008; Hernán et al. 2000 and 2001). Our simulation results reveal the proposed standard error estimators and corresponding confidence intervals are quite accurate. An alternative to treating the weights as fixed would be to use the bootstrap, but this is much less convenient computationally.

The methods we propose require that the IPCW weight be correctly specified. The degree of bias introduced by misspecifying the Cox model for censoring would be expected to increase with increasing proportion of dependent (relative to independent) censoring; increasing strength of association (i) between the death hazard and time-dependent confounders and (ii) between the dependent censoring hazard and time-dependent confounders; and of course degree to which the IPCW model is misspecified. Fortunately, one can readily evaluate the fit of the proportional hazards model through well-established techniques and using standard software.

There are several alternatives to our proposed approach, one being joint modeling. For example, one could combine a mixed model for the longitudinal measurements with a hazard model which uses time-dependent covariates (e.g., Proust-Lima & Taylor, 2009). Or, one could also model mean residual survival time directly using pseudo-observations (Andersen, Hansen, and Klein, 2004). It could well be the case that the preferred method is a function of the data configuration. A detailed comparison of the three types of approaches would be very useful to practitioners.

Supplementary Material

Supp data

NIHMS630726-supplement-Supp_data.pdf^{(151.6KB, pdf)}

Acknowledgments

This research was supported in part by National Institutes of Health grant R01 DK-70869 (DES). The authors thank the Scientific Registry for Transplant Recipients (SRTR) and Organ Procurement and Transplantation Network (OPTN) for access to the organ failure database.The SRTR is funded by a contract from the Health Resources and Services Administration (HRSA), U.S. Department of Health and Human Services.

Footnotes

Supplementary Materials Web Appendices, referenced in Section 4 are available under the Paper Information link at the Biometrics website http://www.biometrics.tibs.org.

References

Andersen PK, Hansen MG, Klein JP. Regression Analysis of Restricted Mean Survival Time Based on Pseudo-Observations. Lifetime Data Analysis. 2004;10:335–350. doi: 10.1007/s10985-004-4771-0. [DOI] [PubMed] [Google Scholar]
Chen P, Tsiatis AA. Causal inference on the difference of the restricted mean life between two groups. Biometrics. 2001;57:1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]
Cox DR. Regression models and life tables (with Discussion) Journal of the Royal Statistical Society, Series B. 1972;34:187–200. [Google Scholar]
Cox DR. Partial likelihood. Biometrika. 1975;62:269–275. [Google Scholar]
Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect on the survival of HIV-positive men. Epidemiology. 2000;11:561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]
Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. Journal of the American Statistical Association – Applications and Case Studies. 2001;96:440–448. [Google Scholar]
Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data, 2nd Edition. New York: Wiley; 2002. [Google Scholar]
Karrison T. Restricted mean life with adjustment for covariates. Journal of the American Statistical Association. 1987;18:151–167. [Google Scholar]
Pan Q, Schaubel DE. Proportional hazards regression based on biased samples and estimated selection probabilities. Canadian Journal of Statistics. 2008;36:111–127. [Google Scholar]
Proust-Lima C, Taylor JM. Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment PSA: a joint modeling approach. Biostatistics. 2009;10:535–549. doi: 10.1093/biostatistics/kxp009. [DOI] [PMC free article] [PubMed] [Google Scholar]
Robins JM. Proceedings of the Biopharmaceutical Section, American Statistical Assocation. Alexander, Virginia: American Statistical Association; 1993. Information recovery and bias adjustment in proportional hazards regression analysis of randomized trials using surrogate markers; pp. 24–23. [Google Scholar]
Robins JM. Robust estimation in sequentially ignorable missing data and causal inference models. Proceedings of the American Statistical Association Section on Bayesian Statistical Science. 2000;1999:6–10. [Google Scholar]
Robins JM, Finkelstein D. Correcting for Non-compliance and Dependent Censoring in an AIDS Clinical Trial with Inverse Probability of Censoring Weighted (IPCW) Log-rank Tests. Biometrics. 2000;56:779–788. doi: 10.1111/j.0006-341x.2000.00779.x. [DOI] [PubMed] [Google Scholar]
Robins JM, Rotnitzky A. AIDS Epidemiology - Methodological Issues. Birkhäuser; Boston: 1992. Recovery of information and adjustment for dependent censoring using surrogate markers; pp. 297–331. [Google Scholar]
Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]
Rubin DB. Inference and missing data. Biometrika. 1977;63:581–592. [Google Scholar]
Rubin DB. Bayesian inference for causal effects: The role of randomization. Annals of Statistics. 1978;6:34–58. [Google Scholar]
SAS Institute Inc. SAS Online Doc 9.1.3.Cary. North Carolina: SAS Institute, Inc; 2006. [Google Scholar]
Tsiatis AA. Semiparametric Theory and Missing Data. New York: Springer; 2006. [Google Scholar]
Wiesner RH, McDiarmid SV, Kamath PS, Edwards EB, Malinchoc M, Kremers WK, Krom RA, Kim WR. MELD and PELD: Application of survival models to liver allocation. Liver Transplantation. 2001;7:567–580. doi: 10.1053/jlts.2001.25879. [DOI] [PubMed] [Google Scholar]
Zucker DM. Restricted mean life with covariates: modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93:702–709. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp data

NIHMS630726-supplement-Supp_data.pdf^{(151.6KB, pdf)}

[R1] Andersen PK, Hansen MG, Klein JP. Regression Analysis of Restricted Mean Survival Time Based on Pseudo-Observations. Lifetime Data Analysis. 2004;10:335–350. doi: 10.1007/s10985-004-4771-0. [DOI] [PubMed] [Google Scholar]

[R2] Chen P, Tsiatis AA. Causal inference on the difference of the restricted mean life between two groups. Biometrics. 2001;57:1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]

[R3] Cox DR. Regression models and life tables (with Discussion) Journal of the Royal Statistical Society, Series B. 1972;34:187–200. [Google Scholar]

[R4] Cox DR. Partial likelihood. Biometrika. 1975;62:269–275. [Google Scholar]

[R5] Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect on the survival of HIV-positive men. Epidemiology. 2000;11:561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]

[R6] Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. Journal of the American Statistical Association – Applications and Case Studies. 2001;96:440–448. [Google Scholar]

[R7] Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data, 2nd Edition. New York: Wiley; 2002. [Google Scholar]

[R8] Karrison T. Restricted mean life with adjustment for covariates. Journal of the American Statistical Association. 1987;18:151–167. [Google Scholar]

[R9] Pan Q, Schaubel DE. Proportional hazards regression based on biased samples and estimated selection probabilities. Canadian Journal of Statistics. 2008;36:111–127. [Google Scholar]

[R10] Proust-Lima C, Taylor JM. Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment PSA: a joint modeling approach. Biostatistics. 2009;10:535–549. doi: 10.1093/biostatistics/kxp009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Robins JM. Proceedings of the Biopharmaceutical Section, American Statistical Assocation. Alexander, Virginia: American Statistical Association; 1993. Information recovery and bias adjustment in proportional hazards regression analysis of randomized trials using surrogate markers; pp. 24–23. [Google Scholar]

[R12] Robins JM. Robust estimation in sequentially ignorable missing data and causal inference models. Proceedings of the American Statistical Association Section on Bayesian Statistical Science. 2000;1999:6–10. [Google Scholar]

[R13] Robins JM, Finkelstein D. Correcting for Non-compliance and Dependent Censoring in an AIDS Clinical Trial with Inverse Probability of Censoring Weighted (IPCW) Log-rank Tests. Biometrics. 2000;56:779–788. doi: 10.1111/j.0006-341x.2000.00779.x. [DOI] [PubMed] [Google Scholar]

[R14] Robins JM, Rotnitzky A. AIDS Epidemiology - Methodological Issues. Birkhäuser; Boston: 1992. Recovery of information and adjustment for dependent censoring using surrogate markers; pp. 297–331. [Google Scholar]

[R15] Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]

[R16] Rubin DB. Inference and missing data. Biometrika. 1977;63:581–592. [Google Scholar]

[R17] Rubin DB. Bayesian inference for causal effects: The role of randomization. Annals of Statistics. 1978;6:34–58. [Google Scholar]

[R18] SAS Institute Inc. SAS Online Doc 9.1.3.Cary. North Carolina: SAS Institute, Inc; 2006. [Google Scholar]

[R19] Tsiatis AA. Semiparametric Theory and Missing Data. New York: Springer; 2006. [Google Scholar]

[R20] Wiesner RH, McDiarmid SV, Kamath PS, Edwards EB, Malinchoc M, Kremers WK, Krom RA, Kim WR. MELD and PELD: Application of survival models to liver allocation. Liver Transplantation. 2001;7:567–580. doi: 10.1053/jlts.2001.25879. [DOI] [PubMed] [Google Scholar]

[R21] Zucker DM. Restricted mean life with covariates: modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93:702–709. [Google Scholar]

PERMALINK

Estimating differences in restricted mean lifetime using observational data subject to dependent censoring

Min Zhang

Douglas E Schaubel

Summary

1. Introduction

2. Notation and Data Structure

3. Methods

4. Asymptotic Properties

5. Simulation Study

Table 1.

Table 2. Estimators for difference in restricted mean lifetime under Scenario 1-3, heavy censoring case (entries are as in Table 1).

Table 3.

6. Application

Figure 1.

Table 4.

Figure 2.

7. Discussion

Supplementary Material

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Estimating differences in restricted mean lifetime using observational data subject to dependent censoring

Min Zhang

Douglas E Schaubel

Summary

1. Introduction

2. Notation and Data Structure

3. Methods

4. Asymptotic Properties

5. Simulation Study

Table 1.

Table 2. Estimators for difference in restricted mean lifetime under Scenario 1-3, heavy censoring case (entries are as in Table 1).

Table 3.

6. Application

Figure 1.

Table 4.

Figure 2.

7. Discussion

Supplementary Material

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases