Modeling Restricted Mean Survival Time under General Censoring Mechanisms

Xin Wang; Douglas E Sehaubel

doi:10.1007/s10985-017-9391-6

. Author manuscript; available in PMC: 2019 Jan 1.

Published in final edited form as: Lifetime Data Anal. 2017 Feb 21;24(1):176–199. doi: 10.1007/s10985-017-9391-6

Modeling Restricted Mean Survival Time under General Censoring Mechanisms

Xin Wang ¹, Douglas E Sehaubel ²

PMCID: PMC5565738 NIHMSID: NIHMS854625 PMID: 28224260

Abstract

Restricted mean survival time (RMST) is often of great clinical interest in practice. Several existing methods involve explicitly projecting out patient-specific survival curves using parameters estimated through Cox regression. However, it would often be preferable to directly model the restricted mean for convenience and to yield more directly interpretable covariate effects. We propose generalized estimating equation methods to model RMST as a function of baseline covariates. The proposed methods avoid potentially problematic distributional assumptions pertaining to restricted survival time. Unlike existing methods, we allow censoring to depend on both baseline and time-dependent factors. Large sample properties of the proposed estimators are derived and simulation studies are conducted to assess their finite sample performance. We apply the proposed methods to model RMST in the absence of liver transplantation among end-stage liver disease (ESLD) patients. This analysis requires accommodation for dependent censoring since pre-transplant mortality is dependently censored by the receipt of a liver transplant.

Keywords: Dependent censoring, Generalized linear model, Inverse weighting, Pre-treatment survival, Restricted mean lifetime, Transplantation

1 Introduction

The Cox proportional hazards model (Cox 1972, 1975) is the strong default for analyzing time to event data with covariate adjustment. A key motivation for the hazard ratio (HR) is its connection to the ordering of the survival functions, under the assumption of proportional hazards. However, when there are departures from proportional hazards, this connection is lost and it is then difficult to interpret the HR. A HR estimated by ignoring the non-proportionality will be a poorly specified mixture of the survival distribution and censoring distribution (Gillen and Emerson 2007), such that the resulting inference may then differ for studies with identical survival time distributions but different censoring patterns. In the presence of non-proportionality, alternatives to the HR include the ‘average effect’ (Xu and O’Quigley 2000), or applying a ‘stopped Cox model’ (Van Houwelingen 2007; Van Houwelingen and Putter 2015).

Covariate effects that are cumulative in nature are often of greater interest than instantaneous effects, especially in the presence of non-proportionality (Schaubel and Wei 2011). In particular, the contrast in restricted mean survival time (RMST) is a useful alternative. The RMST is defined as the average survival time up to a fixed point L and can be written as the area under the survival curve on [0, L]. RMST is an easily interpretable and clinically relevant measure for summarizing the mortality over a fixed follow-up time period of interest. Most existing methods estimate RMST indirectly through hazard regression (e.g., Zucker 1998; Chen and Tsiatis 2001; Zhang and Schaubel 2011). These approaches start by estimating the regression parameters and baseline hazard from a Cox model, calculating the cumulative baseline hazard, transforming it to obtain the survival function and, finally, integrating the survival function to obtain the RMST. Such indirect RMST estimation is inconvenient and computationally cumbersome, even for obtaining a point estimate, let alone its corresponding asymptotic standard error. Hence, it may be preferable to directly model RMST itself (Andersen et al. 2004; Tian et al. 2014).

The majority of existing methods for directly modeling RMST require assumptions regarding the censoring mechanism, which are often untenable. Censoring may result from multiple sources in an observational study. The simplest type would be covariate-independent censoring, which occurs independently of the death time and all the covariates. When this is the only type of censoring present, one can conduct regression analysis of RMST using imputed event times based on pseudo-observation methods (Andersen et al. 2004), or one can construct estimating equations for RMST based on Inverse Probability of Censoring Weighting (IPCW; Robins and Rotnitzky 1992; Robins 1993; Robins and Finkelstein 2000) as in Tian et al. (2014). However, in observational studies, censoring will often depend on the covariate vector. Censoring can depend on baseline covariates, but be conditionally independent of the event time given such covariates; this is referred to as covariate-dependent censoring. For example, it is common to have staggered entry in an observational study with a fixed calendar period, such that subjects who enter later would have a different censoring distribution than those who enter earlier; e.g., registration date on the wait-list for a liver transplant. Since mortality is often subject to calendar time trends, covariate-dependent censoring would be expected to be a frequent occurrence in observational studies. Andersen and Perme (2010) and Binder et al. (2014) conducted simulation studies to examine the bias and efficiency of the pseudo-observations approach for competing risks, in the presence of covariate dependent censoring. A third type of censoring is dependent censoring, which is often correlated with the event time through a mutual association with time varying covariates. Covariate-dependent and dependent censoring have been overcome in many applications by IPCW. Through pseudo-observations, Xiang and Murray (2012) modeled a standard linear regression of restricted survival time on the logarithm scale and handled dependent censoring through IPCW. Specifically, we connect the RMST and covariate vector through a user-specified link function, while Xiang and Murray (2012) model log restricted survival time through linear regression. In addition, being based on a pseudo-observation approach, their work has no systematic procedure for evaluating the asymptotic properties. To our knowledge, there is no existing method to directly model RMST in the presence of dependent censoring, or even covariate-dependent censoring.

The setting which motivated the proposed methods involves mortality in the absence of liver transplantation among End-Stage Liver Disease (ESLD) patients. Since the number of patients in need of liver transplantation is much greater than the number of available deceased-donor livers, medically suitable ESLD patients are placed on a wait-list. Priority for transplantation is then determined by medical urgency, as quantified by the Model for End-Stage Liver Disease (MELD) score. This score is calculated using bounded versions of serum bilirubin, serum creatinine, international normalized ratio for prothrombin time (INR) and dialysis status (Kamath et al. 2001; Wiesner et al. 2003). The MELD score has been shown to be strongly predictive of pre-transplant survival among chronic ESLD patients (Wiesner et al. 2001). For a given ESLD patient, the MELD score is updated frequently, such that MELD constitutes a time-varying covariate. Since wait-listed patients are sequenced on the wait-list in decreasing order of current MELD score, MELD is strongly associated with transplant rate. As the organ assignment is correlated with pre-transplant mortality through its mutual association with time varying MELD score, dependent censoring occurs through the receipt of a liver transplant, which precludes the observation of pre-transplant death. We are interested in the effect on pre-transplant mortality of prognostic factors observed at the time of wait-listing, as such information would be useful to hepatologists and transplant surgeons for counseling patients.

We propose semi-parametric regression methods for directly modeling RMST given baseline covariates in the presence of both covariate-dependent and dependent censoring. The proposed methods can be used to evaluate the cumulative effect of baseline covariates and to quantify treatment effects in terms of contrasts in RMST. Our proposed methods do not require any distributional assumption on the death variates and, analogous to generalized linear models, allow for different link functions.

The contribution of our proposed work, compared to Tian et al. (2014), is that the latter requires that censoring does not depend on the covariate vector. Although random censoring may be a reasonable assumption in clinical trials, it will often fail in observational studies. Our methods not only allow for covariate-dependent censoring, but also allow for dependent censoring (e.g., dependence between the death and censoring times not captured by the covariates used in the death model). In Tian et al. (2014), the weight function is the inverse of the Kaplan-Meier estimator. In the methods we propose, we distinguish between covariate dependent and dependent censoring; in particular, a double inverse weight is required and estimated through separate Cox models for the two types of censoring.

The remainder of this article is organized as follows. In Section 2, we formulate the data structure, define the necessary assumptions and then describe the proposed methods. Asymptotic properties are given in Section 3. In Section 4, we conduct simulation studies to evaluate the accuracy of the proposed procedures in finite samples. In Section 5, we apply our methods to the motivating ESLD data to determine the effect on pre-transplant mortality of several clinically meaningful variables. We conclude this paper with a brief discussion in Section 6. Derivation of the asymptotic properties and additional results for ESLD data analysis are provided in the Supplementary Materials.

2 Proposed Methods

We begin with the necessary notation. Let D_i be the treatment-free survival time for subject i from a cohort of sample size n. We consider two types of censoring. One potential censoring time, denoted as C_i, is independent of D_i conditional on the baseline covariates; this type of censoring includes loss to follow-up or administrative censoring on the day the database closes. The other potential censoring, denoted as T_i, is not conditionally independent of D_i given baseline covariates; one example would be treatment time, which may dependently censor pre-treatment mortality. The observation time for subject i is X_i = D_i ∧ T_i ∧ C_i, where a ∧ b = min{a, b}; and the indicators for at risk status, pre-transplant death, dependent and independent censoring are denoted by R_i(t) = I(X_i ≥ t), $Δ_{i}^{D} = I (D_{i} \leq T_{i} \land C_{i})$ , $Δ_{i}^{T} = I (T_{i} < D_{i} \land C_{i})$ and $Δ_{i}^{C} = I (C_{i} < D_{i} \land T_{i})$ respectively. We denote the covariates predicting D_i, T_i and C_i by $Z_{i}^{D} (t)$ , $Z_{i}^{T} (t)$ and $Z_{i}^{C}$ respectively. Although we have defined these notations to accommodate time varying covariates, some elements of each may be time constant; e.g., gender or race. In some practical studies, the investigators might want to use the same covariate set for censoring and death time; however, we will distinguish these covariate sets for the purpose of generality. Stacking these covariates together and removing any redundancy, we obtain Z_i(t) and the corresponding covariate history as ${\tilde{Z}}_{i} (t) = {Z_{i} (u) : 0 \leq u < t}$ . Our observed data are then given by {X_i, $Δ_{i}^{D}$ , $Δ_{i}^{T}$ , $Δ_{i}^{C}$ , $\tilde{Z} (X_{i})$ : i = 1, …, n}.

Let L be a pre-specified time point of interest, before the maximum follow-up time τ = max{X_i : i = 1, …, n}. Denote the restricted observation time as Y_i = X_i ∧ L and its corresponding indicator Δ_i = I(D_i ∧ L ≤ T_i ∧ C_i). We are interested in the average survival time up to L and will model this measure through baseline covariates $Z_{i}^{D} (0)$ :

μ_{i} (L) : = E {D_{i} \land L | Z_{i}^{D} (0)} .

Analogous to a generalized linear model, we assume a direct relationship between this RMST and baseline covariates as follows:

g [μ_{i} (L)] \equiv g [E {D_{i} \land L | Z_{i}^{D} (0)}] = {β^{'}}_{D} Z_{i}^{D} (0),

(1)

where g is a strictly monotone link function with a continuous derivative within an open neighborhood $ℬ_{D}$ of β_D. Examples of link functions include g(x) = x (identity link), g(x) = log(x) (log link) and g(x) = log(x/(L − x)) (logistic link). We choose to model the impact of baseline covariates for many reasons. First, our intention is to develop a model useful for application at the start of follow-up. For example, modeling (1) such modeling could be used in counseling ESLD patients regarding their prognosis in the absence of liver transplantation, given the information observed at the time of wait-list registration. Second, RMST predictions based on time-dependent covariates are difficult to interpret, at least for internal time varying factors (Kalbfleisch and Prentice, 2002). The role of Z_i(t) depends on the model being considered. In model (1), only baseline values are used and we average over the time-varying process. However, time-dependent values are needed to accommodate dependent censoring, as explained in the paragraphs below.

The choice of L requires careful thought. One would normally choose a time point of clinical relevance or, at least, of particular interest to the investigators, respecting the bound at the maximum follow-up time. If too small an L value is selected, D ∧ L = L for most subjects, leading to a largely uninformative analysis. Conversely, if too large an L value is selected, Ŝ(L) ≈ 0. Setting L too large or too small will generally result in attenuated covariate effects. The choice of link function also requires some consideration. The identity link is usually the most interesting because of its straightforward interpretation. However, it has the problem of unbounded predicted values. From this perspective, the logistic link may be a better choice, at least for the purposes of prediction. In addition, practitioners can conduct sensitivity analyses and model diagnostic procedures to test the performance of different link functions, as we demonstrate in Section 5.

Note that we do not make any assumption about the error structure, in the interest of flexibility and robustness. Although it might be difficult to envision an arbitrarily truncated variate having a well-behaved distribution, it is reasonable to assume that the corresponding mean has a convenient form. Framing the model in terms of g[E(D_i ∧ L)] instead of E[g(D_i ∧ L)] is very important in our settings. For example, if $\log {E (D_{i} \land L)} = {β^{'}}_{D} Z_{i}^{D} (0)$ , the parameters in β_D can be interpreted as the multiplicative effect on RMST per unit increase in the corresponding covariate. This is quite different from the model assumption $E {\log (D_{i} \land L)} = {β^{'}}_{D} Z_{i}^{D} (0)$ , where β_D equals the average change in logarithm of restricted survival time per unit increase in $Z_{i}^{D} (0)$ . The latter interpretation is much less intuitive, since back-transforming is invalid in the light of Jensen’s Inequality.

We now derive the estimating equation for the parameter of interest, β_D. In absence of censoring, based on (1), β_D could be estimated via the following estimating equation:

\frac{1}{n} \sum_{i = 1}^{n} Z_{i}^{D} (0) [D_{i} \land L - g^{- 1} {β^{'} Z_{i}^{D} (0)}] = 0 .

(2)

Although connected to generalized linear models, (2) is more accurately interpreted as a generalized estimating equation due to the absence of distributional assumptions on D_i ∧ L.

However, we will not observe D_i for all patients due to the occurrence of censoring. Instead we may observe either independent censoring time C_i or treatment time T_i. Provided that a rich set of variables is included in Z_i(0), we assume that C_i is conditionally independent of D_i, given the baseline covariate vector. Such an assumption is not assumed to hold for T_i; however, we can assume that the dependence of T_i and D_i occurs through (and only through) the time varying process ${\tilde{Z}}_{i} (t)$ . That is, conditional on ${\tilde{Z}}_{i} (t)$ , we assume “no unmeasured confounders”, which can be formulated as follows:

\begin{array}{l} \lim_{h \to 0} \frac{P {X_{i} \in [t, t + h), Δ_{i}^{T} = 1 | X_{i} \geq t, {\tilde{Z}}_{i} (t), D_{i}}}{h} \\ = \lim_{h \to 0} \frac{P {X_{i} \in [t, t + h), Δ_{i}^{T} = 1 | X_{i} \geq t, {\tilde{Z}}_{i} (t)}}{h} . \end{array}

This essentially assumes that the hazard of being censored by T_i at t depends only on the observed covariate history up to t and not additionally on future data. For example, based on the current liver allocation system, the receipt of a deceased-donor transplant depends only on the patient’s current prognostic factors, and not the future disease pathology.

Denote the hazard functions for C_i and T_i at time t as $λ_{i}^{C} (t)$ and $λ_{i}^{T} (t)$ , respectively; i.e.,

\begin{array}{l} λ_{i}^{C} (t) = \lim_{h \to 0} \frac{P {X_{i} \in [t, t + h), Δ_{i}^{C} = 1 | X_{i} \geq t, {\tilde{Z}}_{i} (t)}}{h}, \\ λ_{i}^{T} (t) = \lim_{h \to 0} \frac{P {X_{i} \in [t, t + h), Δ_{i}^{T} = 1 | X_{i} \geq t, {\tilde{Z}}_{i} (t)}}{h}, \end{array}

with corresponding cumulative hazards, $Λ_{i}^{C} (t) = \int_{0}^{t} λ_{i}^{C} (u) d u$ and $Λ_{i}^{T} (t) = \int_{0}^{t} λ_{i}^{T} (u) d u$ . Although $E (Z_{i}^{D} (0) [Y_{i} - g^{- 1} {{β^{'}}_{D} Z_{i}^{D} (0)}]) \neq 0$ in the presence of censoring, we can show that under our assumption the IPCW weighted expectation is still zero; i.e., $E (Z_{i}^{D} (0) W_{i} (Y_{i}) Δ_{i} [Y_{i} - g^{- 1} {{β^{'}}_{D} Z_{i}^{D} (0)}]) = 0$ , where $W_{i} (t) = W_{i}^{T} (t) W_{i}^{C} (t)$ , $W_{i}^{T} (t) = \exp {Λ_{i}^{T} (t)}$ and $W_{i}^{C} (t) = \exp {Λ_{i}^{C} (t)}$ . Therefore, the following equation is unbiased for β_D:

Φ^{*} (β) = \frac{1}{n} \sum_{i = 1}^{n} Z_{i}^{D} (0) W_{i} (Y_{i}) Δ_{i} [Y_{i} - g^{- 1} {{β^{'}}_{D} Z_{i}^{D} (0)}] = 0,

(3)

provided that the weight function $W_{i} (t) = \exp {Λ_{i}^{T} (t)} \exp {Λ_{i}^{C} (t)}$ is known. However, $Λ_{i}^{T} (t)$ and $Λ_{i}^{C} (t)$ are rarely known in practice and therefore must be estimated from the observed data. For this purpose, we assume Cox models for $λ_{i}^{T} (t)$ and $λ_{i}^{C} (t)$ . Cox regression is a natural choice for modeling censoring times since it is a well established approach, especially in the context of IPCW. Besides its computational convenience, Cox regression can flexibly accommodate both time constant and time varying covariates. We assume the following Cox models, for T_i through time-dependent covariates $Z_{i}^{T} (t)$ , and for C_i based on covariates $Z_{i}^{C}$ :

\begin{array}{l} λ_{i}^{T} (t) = λ_{0}^{T} (t) \exp {{β^{'}}_{T} Z_{i}^{T} (t)} . \\ λ_{i}^{C} (t) = λ_{0}^{C} (t) \exp ({β^{'}}_{C} Z_{i}^{C}) . \end{array}

Using partial likelihood (Cox 1975) and the Breslow estimator (Breslow 1972), we can estimate ${\hat{Λ}}_{0}^{T} (t)$ and ${\hat{β}}_{T}$ from data ${X_{i}, Δ_{i}^{T}, Z_{i}^{T} (t) : t \in [0, X_{i}), i = 1, \dots, n}$ , and ${\hat{Λ}}_{0}^{T} (t)$ and ${\hat{β}}_{C}$ from data ${X_{i}, Δ_{i}^{C}, Z_{i}^{C} : i = 1, \dots, n}$ respectively. Plugging ${\hat{Λ}}_{i}^{T} (t)$ , ${\hat{β}}_{T}$ , ${\hat{Λ}}_{i}^{C} (t)$ , ${\hat{β}}_{C}$ into (3), we obtain the following estimating equation,

Φ (β) = \frac{1}{n} \sum_{i = 1}^{n} Z_{i}^{D} (0) {\hat{W}}_{i} (Y_{i}) Δ_{i} [Y_{i} - g^{- 1} {{β^{'}}_{D} Z_{i}^{D} (0)}] = 0,

(4)

where ${\hat{W}}_{i} (t) = {\hat{W}}_{i}^{T} (t) {\hat{W}}_{i}^{C} (t)$ , ${\hat{W}}_{i}^{T} (t) = \exp {{\hat{Λ}}_{i}^{T} (t)}$ and ${\hat{W}}_{i}^{C} (t) = \exp {{\hat{Λ}}_{i}^{C} (t)}$ . The solution to (4) provides for consistent estimation of β_D, with its asymptotic properties discussed in Section 3. The use of a double inverse weight has some similarity to Schaubel and Wei (2011). However, unlike the methods we propose, the first weight in Schaubel and Wei (2011) is derived from Inverse Probability of Treatment Weighting (IPTW) and serves to balance treatment-specific covariate distributions.

3 Asymptotic Properties

Before presenting the asymptotic properties of our proposed estimators, we specify the following regularity conditions (a)–(g) for i = 1, …, n.

${X_{i}, Δ_{i}^{T}, Δ_{i}^{C}, {\tilde{Z}}_{i} (X_{i})}$ are independently and identically distributed.
P(R_i(t) = 1) > 0 for t ∈ (0, τ].
$| Z_{i k} (0) | + \int_{0}^{τ} d | Z_{i k} (t) | < M_{Z} < \infty$ for i = 1,…,n, where Z_ik(0);Z_ik(t) are the kth components of Z_i(0) and Z_i(t), respectively.
$Λ_{i}^{T} (τ) < \infty$ , $Λ_{i}^{C} (τ) < \infty$ and $Λ_{i}^{T} (t)$ , $Λ_{i}^{C} (t)$ are absolutely continuous for t ∈ (0; τ ].
There exist neighborhoods $ℬ_{T}$ of β_T and $ℬ_{C}$ of β_C such that for k = 0; 1; 2,
$\begin{array}{r} \sup_{t \in (0, τ], β \in ℬ_{T}} ‖ \frac{1}{n} \sum_{i = 1}^{n} \exp {β^{'} Z_{i}^{T} (t)} R_{i} (t) Z_{i}^{T} {(t)}^{\otimes k} - r_{T}^{(k)} (t; β) ‖ \overset{p}{\to} 0, \\ \sup_{t \in (0, τ], β \in ℬ_{C}} ‖ \frac{1}{n} \sum_{i = 1}^{n} \exp (β^{'} Z_{i}^{C}) R_{i} (t) Z_{i}^{C \otimes k} - r_{C}^{(k)} (t; β) ‖ \overset{p}{\to} 0, \end{array}$
where υ^⊗0 = 1, υ^⊗1 = υ, υ^⊗2 = υ′υ and
$\begin{array}{l} r_{T}^{(k)} (t; β) = E [\exp {β^{'} Z_{i}^{T} (t)} R_{i} (t) Z_{i}^{T} {(t)}^{\otimes k}], \\ r_{C}^{(k)} (t; β) = E {\exp (β^{'} Z_{i}^{C}) R_{i} (t) Z_{i}^{C \otimes k}} . \end{array}$
Define h(x) = ∂g⁻¹(x)/∂x, where h exists and is continuous in an open neighborhood $ℬ_{D}$ of β_D.
The matrices A(β_D), Ω_T (β_T), Ω_C(β_C) are each positive definite, where
$\begin{array}{l} A (β) = E [Z_{i}^{D} {(0)}^{\otimes 2} h {{β^{'}}_{D} Z_{i}^{D} (0)}], \\ Ω_{T} (β) = E [\int_{0}^{τ} {\frac{r_{T}^{(2)} (t; β)}{r_{T}^{(0)} (t; β)} - {\bar{z}}_{T} {(t; β)}^{\otimes 2}} d N_{i}^{T} (t)], \\ Ω_{C} (β) = E [\int_{0}^{τ} {\frac{r_{C}^{(2)} (t; β)}{r_{C}^{(0)} (t; β)} - {\bar{z}}_{C} {(t; β)}^{\otimes 2}} d N_{i}^{C} (t)], \end{array}$
and
$\begin{array}{l} {\bar{z}}_{T} (t; β) = \frac{r_{T}^{(1)} (t; β)}{r_{T}^{(0)} (t; β)}, \\ {\bar{z}}_{C} (t; β) = \frac{r_{C}^{(1)} (t; β)}{r_{C}^{(0)} (t; β)} . \end{array}$

Condition (a) can be relaxed at the expense of additional technical development. Condition (b) is needed for the purpose of identifiability. Conditions (c)–(f) ensure the convergence of several stochastic integrals used in the proofs. The matrices A(β_D), Ω_T (β_T), Ω_C(β_C) in condition (g) are at least non-negative definite and will be positive definite under any non-redundant specification of the respective covariate vectors. Our main asymptotic results are summarized in Theorems 1 and 2 below, with the proofs presented in Web Appendix A.

Theorem 1

Under regularity conditions (a)–(g), as n → ∞, $\sqrt{n} Φ (β_{D})$ converges to a zero-mean Normal with variance B(β_D) = E{B_i(β_D)^⊗2}, for any subject i = 1, …, n,

\begin{array}{l} B_{i} (β) = ε_{i} (β) + K_{T} (β) Ω_{T} {(β_{T})}^{- 1} U_{i}^{T} (β_{T}) + K_{C} (β) Ω_{C} {(β_{C})}^{- 1} U_{i}^{C} (β_{C}) \\ + \int_{0}^{τ} H_{T} (u; β) r_{T}^{(0)} {(u; β_{T})}^{- 1} d M_{i}^{T} (u) + \int_{0}^{τ} H_{C} (u; β) r_{C}^{(0)} {(u; β_{C})}^{- 1} d M_{i}^{C} (u), \end{array}

where we define

\begin{array}{l} ε_{i} (β) = Δ_{i} W_{i} (Y_{i}) [Y_{i} - g^{- 1} {β^{'} Z_{i}^{D} (0)}] Z_{i}^{D} (0), \\ U_{i}^{T} (t) = \int_{0}^{t} {Z_{i}^{T} (u) - {\bar{z}}_{T} (u; β_{T})} d M_{i}^{T} (u), \\ U_{i}^{C} (t) = \int_{0}^{t} {Z_{i}^{C} - {\bar{z}}_{C} (u; β_{C})} d M_{i}^{C} (u), \\ K_{T} (β) = E {ε_{i} (β) D_{i}^{T} {(Y_{i})}^{'}}, \\ K_{C} (β) = E {ε_{i} (β) D_{i}^{C} {(Y_{i})}^{'}}, \\ H_{T} (u; β) = E [ε_{i} (β) \exp {{β^{'}}_{T} Z_{i}^{T} (u)} R_{i} (u)], \\ H_{C} (u; β) = E {ε_{i} (β) \exp ({β^{'}}_{C} Z_{i}^{C}) R_{i} (u)}, \\ D_{i}^{T} (t) = \int_{0}^{t} {Z_{i}^{T} (u) - {\bar{z}}_{T} (u; β_{T})} d Λ_{i}^{T} (u), \\ D_{i}^{C} (t) = \int_{0}^{t} {Z_{i}^{C} (u) - {\bar{z}}_{C} (u; β_{C})} d Λ_{i}^{C} (u), \end{array}

with Ω_T (β), Ω_C(β) defined in Condition (g).

Here we use the usual counting process notations, where $N_{i}^{T} (t) = I (X_{i} \leq t, Δ_{i}^{T} = 1)$ and $N_{i}^{C} (t) = I (X_{i} \leq t, Δ_{i}^{C} = 1)$ are observed counting processes for T_i and C_i respectively, with $d M_{i}^{T} (t) = d N_{i}^{T} (t) - R_{i} (t) d Λ_{i}^{T} (t)$ and $d M_{i}^{C} (t) = d N_{i}^{C} (t) - R_{i} (t) d Λ_{i}^{C} (t)$ being the corresponding zero mean processes. The proof utilizes various results derived in Zhang and Schaubel (2011), primarily the asymptotic expression of the empirical weight in terms of the true weight. The main purpose of Theorem 1 is to set up Theorem 2.

Theorem 2

Under regularity conditions (a)–(g), as n → ∞, ${\hat{β}}_{D}$ converges in probability to β_D and $\sqrt{n} ({\hat{β}}_{D} - β_{D})$ converges to a zero-mean Normal with variance A(β_D)⁻¹B(β_D)A(β_D)⁻¹ with A(β) defined in condition (g) and B(β) defined in Theorem 1.

The proof of consistency of ${\hat{β}}_{D}$ follows primarily from the Inverse Function Theorem (Foutz 1977) while the proof of asymptotic normality follows through the combination of various Taylor series expansions and the Cramér-Wold Theorem.

We propose two versions of asymptotic standard error (ASE) estimators for our proposed estimator ${\hat{β}}_{D}$ . The first ASE is derived from (3) and, as such, treats the IPCW weights as known:

{ASE}_{1} = \sqrt{\frac{1}{n} Diag [\hat{A} {(β_{D})}^{- 1} {\hat{B}}^{*} (β_{D}) \hat{A} {(β_{D})}^{- 1}]},

where ${\hat{B}}^{*} (β) = \hat{E} {ε_{i} {(β)}^{\otimes 2}}$ . The second ASE is based on (4) and derived in Theorem 2:

{ASE}_{2} = \sqrt{\frac{1}{n} Diag [\hat{A} {(β_{D})}^{- 1} \hat{B} (β_{D}) \hat{A} {(β_{D})}^{- 1}]},

where $\hat{B} (β) = \hat{E} {B_{i} {(β)}^{\otimes 2}}$ . These two ASEs can be obtained by plugging all the undetermined terms with their respective estimators. More detail about the calculation procedures is provided in Web Appendix B. These two versions of sandwich ASEs share the same second derivative matrix, A(β), but differ with two different middle matrices. The first version, ASE₁, which results from the weight function being known as opposed to estimated, treats the weights as fixed and therefore its middle matrix involves ε_i only. The second version, ASE₂, contains several extra terms in its middle matrix, in order to account for the variation due to estimating the weights. Although ASE₂ should be closer to the truth, it adds the complexity of middle matrix and is usually more complicated to calculate in practice than ASE₁. In contrast, ASE₁ can be easily computed with built-in commands from many statistical software packages (e.g., SAS, R) and therefore serves as a useful approximation of ASE₂. We evaluate the performance of both ASE₁ and ASE₂ through simulations presented in the next section.

4 Simulation Study

We conducted simulations to evaluate the performance of the proposed methods in finite samples. Two different percentages of right censoring were considered, moderate and heavy censoring. For each simulated subject i = 1, …, n, two baseline covariates Z_i₁, Z_i₂ were generated from Bernoulli(0.5) distributions. The death time, D_i, was generated from D_i = g⁻¹ (α₀ + α₁Z_i₁ + α₂Z_i₂)+ε₁_i, where ε₁_i ~ Uniform(−σ, σ), α = [α₀, α₁, α₂]′ and σ were chosen in accordance with the particular link function. More specifically, α = [5.5, 0.25, 0.25]′ was tested for the linear link, and α = [−0.63, .08, .08]′ was tested and for log link. This death generator was used to induce the same mean structure for D_i and D_i ∧ L, as the mean structure of the former, g{E(D_i|Z_i₁, Z_i₂)} = α₀ + α₁Z_i₁ + α₂Z_i₂, is similar to that of the latter,

g {E (D_{i} \land L | Z_{i 1}, Z_{i 2})} = β_{D 0} + β_{D 1} Z_{i 1} + β_{D 2} Z_{i 2} + β_{D 3} Z_{i 1} Z_{i 2} .

(5)

Because the above model (5) is saturated, due to an extra interaction term Z_i₁Z_i₂, the true values of β_D = [β_D₀, β_D₁, β_D₂, β_D₃]′ can be determined computationally and are calculated using Monte Carlo methods with size 10 million. We set L = 10 to yield a reasonable range of P (D > L). We evaluated the linear and log links, since they would be the most popular choices in practice.

We generated the independent censoring time C_i from a Cox model with the following hazard,

λ_{i}^{C} (t) = λ_{0}^{C} \exp (β_{C 1} Z_{i 1}),

(6)

where $λ_{0}^{C}$ ranged from 1/36 to 1/21 and β_C₁ ranged from −log(3) to log(2). We generated a time-dependent covariate which was correlated with death time D_i and treatment time T_i even after conditioning on baseline covariates Z_i₁, Z_i₂. First, let V_i = −V₀ log{(ε_i₁ + σ)/(2σ)} + ε_i₂, where V₀ is a constant ranging from 40 to 100 and ε_i₂ ~ Uniform(0, 1). Then define a time-dependent variable V_i(t) = I(t ≤ V_i). Thus V_i(t) is still correlated with D_i, through ε_i₁, even after conditioning on Z_i₁, Z_i₂. Treatment time T_i was generated from a Cox model with the following hazard,

λ_{i}^{T} (t) = λ_{0}^{T} \exp {β_{T 1} Z_{i 2} + β_{T 2} V_{i} (t)},

(7)

where $λ_{0}^{T}$ ranged from 1/35 to 1/18, β_T₁ ranged from −log(4) to log(3) and β_T₂ ranged from log(2) to log(3). Therefore T_i is correlated with D_i conditional on Z_i₁, Z_i₂, through a mutual unobserved variable ε_i₁.

We present results for samples sizes n = 250 and n = 500, under moderate and heavy censoring. For the linear link, P (D > 10) ≈ 11%, approximately 10% C_i and 21% T_i are observed in the moderate censoring scenario, and 15% C_i and 36% T_i are observed in the heavy censoring scenario. For the log link, P (D > 10) ≈ 10%, approximately 8% C_i and 24% T_i are observed in the moderate censoring scenario, with 15% C_i and 35% T_i observed in the heavy censoring scenario. As displayed in Tables 1 and 2, the estimates for β_D are approximately unbiased, with the magnitudes of any bias generally decreasing with increasing sample size. The calculated ASE₁s and ASE₂s are very close to their corresponding empirical standard deviations (ESD), and therefore their empirical coverage probabilities CP₁ and CP₂ are close to 95%. The implications from our simulation studies are that, in moderate samples, the proposed methods result in unbiased estimation, and that the easily computed ASE₁ is a useful approximation to the more complicated ASE₂. In our real-data application presented in Section 5, the sample size far exceeds 1,000, which would render finite-sample bias in ASE₁ much less of an issue than in our simulation studies.

Table 1.

Simulation results: g(x) = x

Scenario	N	β_D	True	BIAS	ESD	ASE₁	CP₁(%)	ASE₂	CP₂(%)
Moderate Censoring	250	β₀	5.452	−0.008	0.505	0.519	94	0.515	94
		β₁	0.23	−0.037	0.714	0.72	95	0.735	96
		β₂	0.227	0.008	0.67	0.686	95	0.711	96
		β₃	−0.014	0.033	0.928	0.947	95	0.99	96
	500	β₀	5.452	0.002	0.346	0.368	96	0.364	96
		β₁	0.23	−0.016	0.503	0.508	95	0.519	95
		β₂	0.227	−0.018	0.469	0.487	96	0.505	96
		β₃	−0.014	0.012	0.674	0.671	95	0.701	97

Heavy Censoring	250	β₀	5.452	−0.006	0.467	0.486	96	0.577	98
		β₁	0.23	−0.042	0.704	0.725	95	0.858	98
		β₂	0.227	−0.037	0.821	0.83	95	0.991	98
		β₃	−0.014	−0.021	1.271	1.249	94	1.493	96
	500	β₀	5.452	0.01	0.328	0.345	96	0.398	98
		β₁	0.23	−0.024	0.512	0.512	94	0.589	96
		β₂	0.227	−0.043	0.591	0.594	95	0.694	97
		β₃	−0.014	−0.017	0.924	0.887	93	1.039	97

Open in a new tab

Table 2.

Simulation results: g(x) = log(x)

Scenario	N	β_D	True	BIAS	ESD	ASE₁	CP₁(%)	ASE₂	CP₂(%)
Moderate Censoring	250	β₀	−0.634	−0.008	0.104	0.104	95	0.102	94
		β₁	0.075	−0.007	0.143	0.139	95	0.142	96
		β₂	0.074	0.005	0.13	0.129	95	0.133	96
		β₃	−0.004	0.008	0.173	0.172	95	0.179	95
	500	β₀	−0.634	−0.004	0.068	0.073	97	0.072	96
		β₁	0.075	−0.001	0.095	0.097	96	0.099	96
		β₂	0.074	−0.001	0.088	0.091	96	0.094	97
		β₃	−0.004	0.001	0.12	0.121	96	0.126	97

Heavy Censoring	250	β₀	−0.634	−0.013	0.124	0.125	94	0.125	95
		β₁	0.075	−0.01	0.183	0.171	94	0.177	94
		β₂	0.074	0.007	0.157	0.157	95	0.167	96
		β₃	−0.004	0.01	0.222	0.212	95	0.227	96
	500	β₀	−0.634	−0.008	0.083	0.089	96	0.088	96
		β₁	0.075	0.001	0.121	0.12	95	0.124	95
		β₂	0.074	0.001	0.109	0.111	95	0.118	97
		β₃	−0.004	0.001	0.151	0.15	95	0.159	96

Open in a new tab

Mis-specification of the censoring model is a common issue in IPCW methods, in which case bias will generally exist in the estimation of the mortality model parameters. To evaluate how much bias is introduced by such mis-specification, we conducted additional numerical studies. In particular, we considered 3 different types of mis-specification:

Model for independent censoring, C, is mis-specified: add a covariate, Z_i₂, to the censoring hazard $λ_{i}^{C} (t)$ ; i.e., Z_i₂ was added to the generator, but not the model for C_i being fitted.
Model for dependent censoring, T, is mis-specified: add a covariate, Z_i₁, to censoring hazard $λ_{i}^{T} (t)$ , but not to the T_i model being fitted.
Models are mis-specified for both C and T simultaneously, per (i) and (ii) above.

Figure 1 displays the bias for sample size N = 250 calculated with 1, 000 replications for the 4 scenarios in Table 1 and 2, after introducing mis-specification (i)–(iii). In general, bias exists when the censoring model is not correctly specified, and is more pronounced when both types of censoring are incorrectly modeled.

Fig. 1 — Bias of [*β_D*₀*, β_D*₁*, β_D*₂*, β_D*₃]′ when censoring models are mis-specified

An interesting comparison is between our methods and those proposed by Tian et al. (2014). Methods of Tian et al. (2014) were developed in the context of covariate-independent censoring and, thus, proposed IPCW weights based on Kaplan-Meier estimators. When the censoring mechanism is more complicated than independent censoring (i.e., in presence of either covariate-dependent or dependent censoring), then our methods should perform better than Tian’s methods. We illustrated this phenomenon with additional simulations; specifically, we ran the simulation studies in Table 1 and 2 again, applied Tian et al. (2014), and then plotted the resulting bias along with the bias from our methods. As shown in Fig 2, Tian et al. (2014) has quite severe bias for N = 250 cases, with similar results obtained for the N = 500 case. This is expected, since we are testing Tian et al. (2014) outside the set-up for which the methods were developed.

Fig. 2 — Bias comparison between our and Tian’s methods in presence of dependent censoring

In presence of only covariate-independent censoring, both our methods and those of Tian et al. (2014) should apply. In evaluating our methods, we blindly added non-predictive covariates into the censoring model, in order to assess the degree of efficiency loss. We generated death time with D_i = g⁻¹ (5.25 + 0.5Z_i₁ + 0.5Z_i₂) + ε₁_i, where Z_i₁, Z_i₂ ~ Bernoulli(0.5) and ε₁_i ~ Uniform(−5.25, 5.25). We chose L = 9 and introduced a simple censoring pattern (Exponential, rate 0.05). Approximately 79% D’s are observed, with P (D > L) ≈ 19%. In Table 3, Bias₁ and ESD₁ are bias and empirical standard deviation (ESD) calculated by our methods with futile covariates Z_i₁, Z_i₂ in the censoring model, and Bias₂ and ESD₂ are calculated using Tian et al. (2014) with a correctly specified non-parametric censoring model. The last column is Empirical Relative Efficiency (ERE) between Tian et al. (2014) and our methods, computed as the ratio of mean square error. Examining various replicates, the estimated regression coefficients for the C model tends to be very close to 0, with large p values. In practice, users would feel inclined to drop them from the model, due to their negligible effect on C. Since these non-zero censoring coefficients are actually 0, our methods have a little bit greater bias than Tian et al. (2014). Regarding efficiency, our methods exhibited approximately the same efficiency as Tian et al. (2014), despite including the two irrelevant covariates in the C model.

Table 3.

Efficiency comparison between our and Tian’s methods in presence of only covariate-independent censoring: g(x) = x

N	β_D	True	Bias₁	Bias₂	ESD₁	ESD₂	ERE
250	β₀	5.148	0.023	< 0.001	0.344	0.351	0.963
	β₁	0.406	−0.017	−0.002	0.404	0.401	1.012
	β₂	0.403	−0.009	0.005	0.403	0.402	1.004
500	β₀	5.148	0.012	0.001	0.244	0.25	0.953
	β₁	0.406	−0.007	< 0.001	0.286	0.286	1.003
	β₂	0.403	−0.009	−0.002	0.286	0.286	1.002

Open in a new tab

5 Analysis of Liver Disease Data

We applied the proposed methods to directly estimate RMST among End-Stage Liver Disease (ESLD) patients. Of interest was survival in the absence of liver transplantation. We obtained data from the Scientific Registry of Transplant Recipients (SRTR). The SRTR data system includes data on all donors, wait-listed candidates, and transplant recipients in the U.S., as submitted by the members of the Organ Procurement and Transplantation Network (OPTN), and has been described elsewhere. The Health Resources and Services Administration (HRSA), U.S. Department of Health and Human Services provides oversight to the activities of the OPTN and SRTR contractors.

The study population consisted of all chronic ESLD patients initially wait-listed for deceased-donor liver transplantation in U.S. at age ≥ 18 between January 1, 2005 and December 31, 2012. For each patient, the time origin (t = 0) is the date of wait-listing, with each patient followed from that date until earliest of death, receipt of a liver transplant, loss to follow-up, or the end of the observation period on 12/31/2012. Independent censoring occurs through loss to follow-up, administrative censoring, or receipt of a living-donor liver transplant. Note that living-donor transplantation is usually carried out with a liver segment donated by a family member or a close friend, such that the process is not systematically influenced by MELD score trajectory conditional on baseline covariates. As described in Section 1, dependent censoring occurs through the receipt of a deceased-donor transplant, which is correlated with pre-transplant mortality through its mutual association with time varying MELD score. A total of n =55,651 patients were included in our study population. Among them, 13,640 (25%) died before receipt of a transplant, 23,335 (42%) received a liver transplant, and 18,676 (34%) were independently censored.

We constructed our independent censoring model and pre-transplant mortality model using baseline covariates historically reported to be important prognostic factors, including age, gender, race, blood type, United Network for Organ Sharing (UNOS) Region, calendar year of listing, underlying diagnosis, body mass index (BMI), dialysis, sodium, hospitalization status and MELD score at listing (t = 0). The liver transplant hazard model incorporated additional time-dependent covariates, including MELD score, dialysis, sodium, ascites and encephalopathy. UNOS has established 11 geographic Regions for administrative purposes. The availability of deceased-donor organs and the distribution of mortality is quite different across these 11 Regions. This therefore suggests the necessity of adjusting for UNOS Region in both our censoring and mortality models:

λ_{i j}^{T} (t) = A_{i} (t) λ_{0 j}^{T} (t) \exp {{β^{'}}_{T} Z_{i}^{T} (t)},

(8)

λ_{i j}^{C} (t) = λ_{0 j}^{C} (t) \exp ({β^{'}}_{C} Z_{i}^{C}),

(9)

where the subscript j = 1, 2, …, 11 stands for UNOS Region, while the indicator A_i(t) records whether the patient is active and not removed from the wait-list at time t. Patients generally start follow-up as active, such that A_i(0) = 1, but may be made temporarily inactive due to illness (A_i(t) = 0), in which case the patient retains his/her position on the wait-list but cannot receive deceased-donor liver offers. A patient whose health condition declines to the point where liver transplantation is considered futile may be permanently removed from the wait-list (A_i(u) = 0 for any time point u after the time of removal). Therefore A_i(t) serves as a time varying at-risk indicator for transplantation. Subintervals during which a given patient is in-active make no contribution to the fitting of model (8). We then compute cumulative hazard functions as ${\hat{Λ}}_{i j}^{C} (t) = \exp {- \int_{0}^{t} \exp ({\hat{β}}^{'}_{C} Z_{i}^{C}) d {\hat{Λ}}_{0 j}^{C} (u)}$ and ${\hat{Λ}}_{i j}^{T} (t) = \exp [- \int_{0}^{t} A_{i} (u) \exp {{\hat{β}}^{'}_{T} Z_{i}^{T} (u)} d {\hat{Λ}}_{0 j}^{T} (u)]$ and obtain the IPCW weight as ${\hat{W}}_{i j} (t) = \exp {{\hat{Λ}}_{i j}^{C} (t)} \exp {{\hat{Λ}}_{i j}^{T} (t)}$ . Since more than 99% of the estimated IPCW weights are below 10, we cap the weights by 10 in order to reduce variance.

We modeled restricted mean survival time at L = 1, L = 3, L = 5 years post wait-list registration, which are reasonable time windows in the ESLD setting. Overall crude survival probabilities are approximately 79%, 62% and 51% for the 3 time windows respectively. We present the parameter estimates for the pre-transplant mortality model using three link functions, including linear, log and logistic. As shown in Table 4, the covariate effects demonstrate the same trends across the different link functions. The average pre-transplant survival time out of the next 3 years is estimated as approximately 33, 35.8, and 33 months using the three link functions respectively, for a ‘reference’ patient; i.e., a white male wait-listed at age 50, registered in Region 5 (the Region with the largest population) during year 2005, diagnosed as none of the listed types, not hospitalized, not on dialysis, with blood Type O, BMI between 20 and 25, sodium level at 130 mmol/l and MELD score 6 (the minimum possible value). For another patient with the same profile but a different MELD score (e.g., MELD=30), the average pre-transplant survival time out of the next 3 years is estimated as approximately 11.6, 9.8, and 7.5 months respectively. This discrepancy in predicted RMST underscores the importance of the MELD score. The linear link does not always result in fitted RMST values within an admissible range (0, L]. This could perhaps be remedied by transforming various covariates, or by simply bounding estimated RMST. The remaining parameter estimates for the 1- and 5-year windows are provided in Web Appendix E.

Table 4.

Estimated covariate effects on RMST in the absence of liver transplantation (L = 36 months)

Linear

Log

Logistic

Z_{i}^{D} (0)

{\hat{β}}_{D}

ASE₁

{\hat{β}}_{D}

ASE₁

{\hat{β}}_{D}

ASE₁

Intercept

32.95

0.5

< 0.01

3.57

0.02

0.8

2.42

0.08

< 0.01

Year-2005

−1.02

0.05

< 0.01

−0.04

< 0.01

−0.16

0.01

< 0.01

Age-50 (Years)

−0.2

0.01

< 0.01

−0.01

< 0.01

−0.04

< 0.01

Sodium-130 (mmol/l)

0.47

0.02

< 0.01

0.02

< 0.01

0.08

< 0.01

MELD Score-6

−0.89

0.02

< 0.01

−0.05

< 0.01

−0.16

< 0.01

UNOS Region

Reference Group: 5

−1.52

0.49

< 0.01

−0.06

0.02

< 0.01

−0.27

0.07

< 0.01

−1.35

0.38

< 0.01

−0.06

0.01

< 0.01

−0.22

0.06

< 0.01

−3.26

0.43

< 0.01

−0.11

0.02

< 0.01

−0.49

0.07

< 0.01

0.23

0.34

0.5

0.02

0.01

0.09

0.04

0.06

0.46

−0.97

0.54

0.07

< 0.01

0.02

0.95

−0.12

0.09

0.21

−0.59

0.42

0.16

−0.02

0.02

0.32

−0.1

0.07

0.14

−0.9

0.42

0.03

< 0.01

0.02

0.94

−0.13

0.07

0.06

−1.41

0.36

< 0.01

−0.07

0.01

< 0.01

−0.21

0.06

< 0.01

−2.24

0.48

< 0.01

−0.09

0.02

< 0.01

−0.39

0.07

< 0.01

−2.75

0.44

< 0.01

−0.1

0.02

< 0.01

−0.44

0.07

< 0.01

Gender

Reference Group: Male

Female

0.49

0.22

0.02

< 0.01

0.01

0.95

0.06

0.03

0.1

Race

Reference Group: White

Black

0.34

0.41

0.02

0.23

0.06

0.31

Hispanic

0.23

0.27

0.4

0.01

0.21

0.04

0.05

0.33

Asian

1.76

0.55

< 0.01

0.05

0.02

0.01

0.33

0.11

< 0.01

Others

−0.57

0.89

0.52

−0.01

0.04

0.74

−0.11

0.15

0.48

Blood Type

Reference Group: O

−0.08

0.21

0.72

−0.01

0.01

0.26

−0.02

0.03

0.63

−0.02

0.36

0.95

−0.01

0.01

0.59

< 0.01

0.06

0.97

−0.59

0.78

0.44

−0.05

0.03

0.08

−0.19

0.11

0.1

Diagnosis

Reference Group: No or Yes

Hepatitis C

−1.33

0.33

< 0.01

−0.04

0.01

< 0.01

−0.26

0.05

< 0.01

Noncholestatic

0.76

0.33

0.02

0.06

0.01

< 0.01

0.12

0.05

0.02

Cholestatic

−1.26

0.45

0.01

−0.05

0.02

< 0.01

−0.28

0.07

< 0.01

Acute Hepatic Necrosis

2.27

0.81

0.01

0.06

0.03

0.04

0.56

0.18

< 0.01

Metastatic Disease

−2.95

0.66

< 0.01

−0.11

0.03

< 0.01

−0.53

0.11

< 0.01

Malignant Neoplasm

−7.24

0.37

< 0.01

−0.43

0.02

< 0.01

−1.23

0.06

< 0.01

BMI

Reference Group: (20, 25]

(0, 20]

−2.05

0.47

< 0.01

−0.07

0.02

< 0.01

−0.28

0.07

< 0.01

(25, 30]

0.02

0.27

0.95

0.01

0.25

−0.01

0.04

0.8

> 30

−0.09

0.27

0.74

< 0.01

0.01

0.98

−0.02

0.04

0.57

Hospitalized

Reference Group: Not Hospitalized

ICU

−1.83

0.65

< 0.01

−0.54

0.08

< 0.01

−0.82

0.14

< 0.01

not ICU

−2.41

0.42

< 0.01

−0.33

0.04

< 0.01

−0.49

0.08

< 0.01

Dialysis

Reference Group: No or Yes

Yes

1.69

0.54

< 0.01

0.13

0.04

< 0.01

0.49

0.09

< 0.01

Open in a new tab

An offset of L = 36 months is applied for log link.

Figure 3 plots fitted RMST values (L = 3 years) by MELD score (ranging from 6 to 40; i.e., for all possible MELD scores), for the above-described reference patient. For all the three link functions, RMST decreases strongly with increasing MELD score, as anticipated. The fitted values based on all the three links result in fitted values which tail off at higher MELD scores. Among the three link functions, the linear link may be most appealing in terms of its straightforward interpretation. For example, for per unit increase in a patient’s MELD score, the average survival time (capped at 3 years) will decrease approximately 0.9 months; for a 5-year increase in age at wait-listing, 3-year RMST decreases by approximately 1 month. Analogous trends are also observed in the models using the other two link functions. We will further compare the model adequacy using different link functions in terms of discrimination ability and prediction accuracy.

Fig. 3 — Fitted RMST (L = 36 months) by MELD score for a reference patient: white, male, age=50, Region=5, year=2005, not hospitalized, not on dialysis, blood Type=O, BMI ∈ (20, 25], sodium=130

To evaluate each model’s discrimination ability, we compute the Index of Concordance (IOC), also known as the C statistic (Harrell et al. 1996; Heagerty and Zheng 2005; Uno et al. 2011), denoted by:

IOC = \frac{\sum_{i = 1}^{n} \sum_{j = 1}^{n} Δ_{i} {\hat{W}}_{i} (Y_{i}) {\hat{W}}_{j} (Y_{i}) I [Y_{i} < Y_{j}, g^{- 1} {{\hat{β}}^{'}_{D} Z_{i}^{D} (0)} < g^{- 1} {{\hat{β}}^{'}_{D} Z_{j}^{D} (0)}]}{\sum_{i = 1}^{n} \sum_{j = 1}^{n} Δ_{i} {\hat{W}}_{i} (Y_{i}) {\hat{W}}_{j} (Y_{i}) I (Y_{i} < Y_{j})} .

This statistic converges to a censoring distribution free quantity, $P ({\hat{β}}^{'}_{D} Z_{i}^{D} (0) < {\hat{β}}^{'}_{D} Z_{j}^{D} (0) | D_{i} \land L < D_{j} \land L)$ , measuring the agreement of predictions with observed failure order. Quantities frequently used to evaluate model prediction accuracy include the mean absolute deviation (MAD) and mean squared deviation (MSD) (see Davison and Hinkley 1997; Tian et al. 2007), formulated as:

\begin{array}{l} MAD = \frac{1}{n} \sum_{i = 1}^{n} Δ_{i} {\hat{W}}_{i} (Y_{i}) | [Y_{i} - g^{- 1} {{\hat{β}}^{'}_{D} Z_{i}^{D} (0)} |, \\ MSD = \frac{1}{n} \sum_{i = 1}^{n} Δ_{i} {\hat{W}}_{i} (Y_{i}) {[Y_{i} - g^{- 1} {{\hat{β}}^{'}_{D} Z_{i}^{D} (0)}]}^{2}, \end{array}

which converge to $E | D_{i} \land L - g^{- 1} {{\hat{β}}^{'}_{D} Z_{i}^{D} (0)} |$ and $E {[D_{i} \land L - g^{- 1} {{\hat{β}}^{'}_{D} Z_{i}^{D} (0)}]}^{2}$ respectively, quantifying the “distance” between predicted and observed outcomes. Proof sketches of the convergence of IOC, MAD and MSD are provided in Web Appendix C.

In Table 5, we calculate these three statistics for the 9 models (3 link functions, with 3 values of L) through 2-fold cross validation. Specifically, we split our data by randomly selecting half of patients (n =27,825) into a “training” set (to which the models are fitted), and using the remaining half (n =27,826) as the “validation” set (to which the discrimination and predictive accuracy measures are applied). For L = 1, IOC=0.82 for all three link functions. For L = 3 and L = 5, IOC is largest for the log link, although not by a wide margin. As L increases, the IOCs decrease, for all link functions; this makes sense intuitively since covariate measurements at times more distant in the past should correspond to reduced discrimination. In terms of both MAD and MSD, the logistic link has the best prediction accuracy by a fair margin.

Table 5.

Index of Concordance (IOC), Mean Absolute Deviation (MAD) and Mean Squared Deviation (MSD): Comparison of link functions by L (years)

Measure	L	Linear	Log	Logistic
IOC	1	0.82	0.82	0.82
	3	0.77	0.78	0.77
	5	0.72	0.75	0.74

MAD	1	1.52	1.67	1.36
	3	5.08	5.05	4.58
	5	7.19	6.56	6.06

MSD	1	6.66	6.97	6.42
	3	70.10	69.51	66.02
	5	153.25	145.78	139.75

Open in a new tab

6 Discussion

We have proposed methods to model restricted mean survival time as a function of baseline covariates, using techniques that are valid under a wide variety of censoring mechanisms. RMST is often of inherent interest to investigators, especially in settings where cumulative covariate effects are appealing. RMST is also an attractive alternative when the proportional hazards assumption does not hold. We have constructed double IPCW weights to simultaneously account for independent and dependent censoring. This general setup is frequently necessary in applications, and failing to account for either type of censoring may result in biased estimation of the mortality model. In studies with only one type of censoring, one would need to calculate the corresponding IPCW weight and set the other to 1. In the interests of flexibility and robustness, our proposed approach does not assume a model for the death time D∧L but for its mean E(D ∧ L). This, however, sacrifices some efficiency in settings wherein a (correctly specified) parametric model is assumed.

Stemming from our current work are several interesting directions worth exploring in the future. One is to consider ‘residual’ RMST (Grand and Putter 2015); i.e. conditional RMST given that the patients is still alive at a later time point after being wait-listed. This could be achieved by applying our methods to landmark analysis, which incorporates time-varying covariates in the mortality model. For instance, results from this landmark analysis should be of interest to patients (already wait-listed for a period of time) who want to know the effect of MELD on residual survival time. It would also be interesting to contrast RMST to the analogous number, expectancy of life lost before time L, which is the area under the curve of cumulative incidence functions rather than marginal survival functions. Andersen (2013) has discussed decomposition of number of life years lost using pseudo-observations in the context of competing risk. Assuming the absence of time-varying covariates (as needed in Andersen 2013) and the use of a simple linear link, we expect the estimated covariate effects from modeling number of life years lost would remain the same magnitude but change the sign, and the intercept would change to L subtracted by the intercept in RMST model. Furthermore, one recent paper by Zhao et al. (2016) proposed to infer RMST curves as a function of L. Following this direction, extension of our methods to RMST curves might be worth further consideration.

We have applied our proposed method to ESLD data to study pre-transplant mortality. Such data requires consideration not only of independent censoring, but also of dependent censoring, where the receipt of a transplant precludes observation of wait-list mortality. This is the first paper to directly estimate RMST in the ESLD setting. The R code to implement our methods is available upon request to the first author.

Supplementary Material

10985_2017_9391_MOESM1_ESM

NIHMS854625-supplement-10985_2017_9391_MOESM1_ESM.pdf^{(330.1KB, pdf)}

Acknowledgments

This work was supported in part by National Institutes of Health Grant 5R01 DK070869. Data analyzed in this report were supplied by the Minneapolis Medical Research Foundation as the contractor for the Scientific Registry of Transplant Recipients. The interpretation and reporting of these data are the responsibility of the authors and in no way should be seen as an official policy of or interpretation by the Scientific Registry of Transplant Recipients or the U.S. Government.

Contributor Information

Xin Wang, Department of Biostatisties, University of Michigan, 1415 Washington Hts., Ann Arbor, MI, 48109-2029.

Douglas E. Sehaubel, Department of Biostatisties, University of Michigan, 1415 Washington Hts., Ann Arbor, MI, 48109-2029

References

Cox DR. Regression Models and Life-Tables. Journal of the Royal Statistical Society. Series B. 1972;34(2):187–220. [Google Scholar]
Andersen PK. Decomposition of number of life years lost according to causes of death. Statistics in medicine. 2013;32(30):5278–5285. doi: 10.1002/sim.5903. [DOI] [PubMed] [Google Scholar]
Andersen PK, Hansen MG, Klein JP. Regression analysis of restricted mean survival time based on pseudo-observations. Lifetime data analysis. 2004;10(4):335–350. doi: 10.1007/s10985-004-4771-0. [DOI] [PubMed] [Google Scholar]
Andersen PK, Perme MP. Pseudo-observations in survival analysis. Statistical methods in medical research. 2010;19(1):71–99. doi: 10.1177/0962280209105020. [DOI] [PubMed] [Google Scholar]
Binder N, Gerds TA, Andersen PK. Pseudo-observations for competing risks with covariate dependent censoring. Lifetime data analysis. 2014;20(2):303–315. doi: 10.1007/s10985-013-9247-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Breslow NE. Contribution to the discussion of the paper by D. R. Cox. Journal of the Royal Statistical Society, Series B. 1972;34(2):216–217. [Google Scholar]
Chen PY, Tsiatis AA. Causal inference on the difference of the restricted mean lifetime between two groups. Biometrics. 2001;57(4):1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]
Cox DR. Partial likelihood. Biometrika. 1975;62(2):269–276. [Google Scholar]
Davison AC, Hinkley DV. Bootstrap methods and their application, Volume 1 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press; 1997. [Google Scholar]
Foutz RV. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72(357):147–148. [Google Scholar]
Gillen DL, Emerson SS. Nontransitivity in a class of weighted logrank statistics under nonproportional hazards. Statistics & probability letters. 2007;77(2):123–130. [Google Scholar]
Grand MK, Putter H. Regression models for expected length of stay. Statistics in medicine. 2015;35(7):1178–1192. doi: 10.1002/sim.6771. [DOI] [PubMed] [Google Scholar]
Harrell FE, Lee KL, Mark DB. Tutorial in biostatistics multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in medicine. 1996;15(4):361–87. doi: 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
Heagerty PJ, Zheng Y. Survival model predictive accuracy and ROC curves. Biometrics. 2005;61(1):92–105. doi: 10.1111/j.0006-341X.2005.030814.x. [DOI] [PubMed] [Google Scholar]
Kalbfleisch J, Prentice R. The Statistical Analysis of Failure Time Data. Wiley; 2002. (Wiley Series in Probability and Statistics). [Google Scholar]
Kamath PS, Wiesner RH, Malinchoc M, Kremers W, Therneau TM, Kosberg CL, DAmico G, Dickson ER, Kim WR. A model to predict survival in patients with end-stage liver disease. Hepatology. 2001;33(2):464–470. doi: 10.1053/jhep.2001.22172. [DOI] [PubMed] [Google Scholar]
Robins J, Rotnitzky A. AIDS Epidemiology. Birkhuser; Boston: 1992. Recovery of information and adjustment for dependent censoring using surrogate markers; pp. 297–331. [Google Scholar]
Robins JM. Proceedings of the Biopharmaceutical Section, American Statistical Association. 3. Vol. 24. American Statistical Association; 1993. Information recovery and bias adjustment in proportional hazards regression analysis of randomized trials using surrogate markers. [Google Scholar]
Robins JM, Finkelstein DM. Correcting for noncompliance and dependent censoring in an aids clinical trial with inverse probability of censoring weighted (IPCW) log-rank tests. Biometrics. 2000;56(3):779–788. doi: 10.1111/j.0006-341x.2000.00779.x. [DOI] [PubMed] [Google Scholar]
Schaubel DE, Wei G. Double inverse-weighted estimation of cumulative treatment effects under nonproportional hazards and dependent censoring. Biometrics. 2011;67(1):29–38. doi: 10.1111/j.1541-0420.2010.01449.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tian L, Cai T, Goetghebeur E, Wei L. Model evaluation based on the sampling distribution of estimated absolute prediction error. Biometrika. 2007;94(2):297–311. [Google Scholar]
Tian L, Zhao L, Wei L. Predicting the restricted mean event time with the subjects baseline covariates in survival analysis. Biostatistics. 2014;15(2):222–233. doi: 10.1093/biostatistics/kxt050. [DOI] [PMC free article] [PubMed] [Google Scholar]
Uno H, Cai T, Pencina MJ, DAgostino RB, Wei L. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in medicine. 2011;30(10):1105–1117. doi: 10.1002/sim.4154. [DOI] [PMC free article] [PubMed] [Google Scholar]
Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics. 2007;34(1):70–85. [Google Scholar]
Van Houwelingen HC, Putter H. Comparison of stopped cox regression with direct methods such as pseudo-values and binomial regression. Lifetime data analysis. 2015;21(2):180–196. doi: 10.1007/s10985-014-9299-3. [DOI] [PubMed] [Google Scholar]
Wiesner R, Edwards E, Freeman R, Harper A, Kim R, Kamath P, Kremers W, Lake J, Howard T, Merion RM, et al. Model for end-stage liver disease (meld) and allocation of donor livers. Gastroenterology. 2003;124(1):91–96. doi: 10.1053/gast.2003.50016. [DOI] [PubMed] [Google Scholar]
Wiesner RH, McDiarmid SV, Kamath PS, Edwards EB, Malinchoc M, Kremers WK, Krom RA, Kim WR. MELD and PELD: application of survival models to liver allocation. Liver transplantation. 2001;7(7):567–580. doi: 10.1053/jlts.2001.25879. [DOI] [PubMed] [Google Scholar]
Xiang F, Murray S. Restricted mean models for transplant benefit and urgency. Statistics in medicine. 2012;31(6):561–576. doi: 10.1002/sim.4450. [DOI] [PubMed] [Google Scholar]
Xu R, O’Quigley J. Estimating average regression effect under non-proportional hazards. Biostatistics. 2000;1(4):423–439. doi: 10.1093/biostatistics/1.4.423. [DOI] [PubMed] [Google Scholar]
Zhang M, Schaubel DE. Estimating differences in restricted mean lifetime using observational data subject to dependent censoring. Biometrics. 2011;67(3):740–749. doi: 10.1111/j.1541-0420.2010.01503.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hao L, Claggett B, Tian L, Uno H, Pfeffer MA, Solomon SD, Wei LJ. On the restricted mean survival time curve in survival analysis. Biometrics. 2016;72(1):215–221. doi: 10.1111/biom.12384. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zucker DM. Restricted mean life with covariates: modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93(442):702–709. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

10985_2017_9391_MOESM1_ESM

NIHMS854625-supplement-10985_2017_9391_MOESM1_ESM.pdf^{(330.1KB, pdf)}

[R1] Cox DR. Regression Models and Life-Tables. Journal of the Royal Statistical Society. Series B. 1972;34(2):187–220. [Google Scholar]

[R2] Andersen PK. Decomposition of number of life years lost according to causes of death. Statistics in medicine. 2013;32(30):5278–5285. doi: 10.1002/sim.5903. [DOI] [PubMed] [Google Scholar]

[R3] Andersen PK, Hansen MG, Klein JP. Regression analysis of restricted mean survival time based on pseudo-observations. Lifetime data analysis. 2004;10(4):335–350. doi: 10.1007/s10985-004-4771-0. [DOI] [PubMed] [Google Scholar]

[R4] Andersen PK, Perme MP. Pseudo-observations in survival analysis. Statistical methods in medical research. 2010;19(1):71–99. doi: 10.1177/0962280209105020. [DOI] [PubMed] [Google Scholar]

[R5] Binder N, Gerds TA, Andersen PK. Pseudo-observations for competing risks with covariate dependent censoring. Lifetime data analysis. 2014;20(2):303–315. doi: 10.1007/s10985-013-9247-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Breslow NE. Contribution to the discussion of the paper by D. R. Cox. Journal of the Royal Statistical Society, Series B. 1972;34(2):216–217. [Google Scholar]

[R7] Chen PY, Tsiatis AA. Causal inference on the difference of the restricted mean lifetime between two groups. Biometrics. 2001;57(4):1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]

[R8] Cox DR. Partial likelihood. Biometrika. 1975;62(2):269–276. [Google Scholar]

[R9] Davison AC, Hinkley DV. Bootstrap methods and their application, Volume 1 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press; 1997. [Google Scholar]

[R10] Foutz RV. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72(357):147–148. [Google Scholar]

[R11] Gillen DL, Emerson SS. Nontransitivity in a class of weighted logrank statistics under nonproportional hazards. Statistics & probability letters. 2007;77(2):123–130. [Google Scholar]

[R12] Grand MK, Putter H. Regression models for expected length of stay. Statistics in medicine. 2015;35(7):1178–1192. doi: 10.1002/sim.6771. [DOI] [PubMed] [Google Scholar]

[R13] Harrell FE, Lee KL, Mark DB. Tutorial in biostatistics multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in medicine. 1996;15(4):361–87. doi: 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]

[R14] Heagerty PJ, Zheng Y. Survival model predictive accuracy and ROC curves. Biometrics. 2005;61(1):92–105. doi: 10.1111/j.0006-341X.2005.030814.x. [DOI] [PubMed] [Google Scholar]

[R15] Kalbfleisch J, Prentice R. The Statistical Analysis of Failure Time Data. Wiley; 2002. (Wiley Series in Probability and Statistics). [Google Scholar]

[R16] Kamath PS, Wiesner RH, Malinchoc M, Kremers W, Therneau TM, Kosberg CL, DAmico G, Dickson ER, Kim WR. A model to predict survival in patients with end-stage liver disease. Hepatology. 2001;33(2):464–470. doi: 10.1053/jhep.2001.22172. [DOI] [PubMed] [Google Scholar]

[R17] Robins J, Rotnitzky A. AIDS Epidemiology. Birkhuser; Boston: 1992. Recovery of information and adjustment for dependent censoring using surrogate markers; pp. 297–331. [Google Scholar]

[R18] Robins JM. Proceedings of the Biopharmaceutical Section, American Statistical Association. 3. Vol. 24. American Statistical Association; 1993. Information recovery and bias adjustment in proportional hazards regression analysis of randomized trials using surrogate markers. [Google Scholar]

[R19] Robins JM, Finkelstein DM. Correcting for noncompliance and dependent censoring in an aids clinical trial with inverse probability of censoring weighted (IPCW) log-rank tests. Biometrics. 2000;56(3):779–788. doi: 10.1111/j.0006-341x.2000.00779.x. [DOI] [PubMed] [Google Scholar]

[R20] Schaubel DE, Wei G. Double inverse-weighted estimation of cumulative treatment effects under nonproportional hazards and dependent censoring. Biometrics. 2011;67(1):29–38. doi: 10.1111/j.1541-0420.2010.01449.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Tian L, Cai T, Goetghebeur E, Wei L. Model evaluation based on the sampling distribution of estimated absolute prediction error. Biometrika. 2007;94(2):297–311. [Google Scholar]

[R22] Tian L, Zhao L, Wei L. Predicting the restricted mean event time with the subjects baseline covariates in survival analysis. Biostatistics. 2014;15(2):222–233. doi: 10.1093/biostatistics/kxt050. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] Uno H, Cai T, Pencina MJ, DAgostino RB, Wei L. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in medicine. 2011;30(10):1105–1117. doi: 10.1002/sim.4154. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics. 2007;34(1):70–85. [Google Scholar]

[R25] Van Houwelingen HC, Putter H. Comparison of stopped cox regression with direct methods such as pseudo-values and binomial regression. Lifetime data analysis. 2015;21(2):180–196. doi: 10.1007/s10985-014-9299-3. [DOI] [PubMed] [Google Scholar]

[R26] Wiesner R, Edwards E, Freeman R, Harper A, Kim R, Kamath P, Kremers W, Lake J, Howard T, Merion RM, et al. Model for end-stage liver disease (meld) and allocation of donor livers. Gastroenterology. 2003;124(1):91–96. doi: 10.1053/gast.2003.50016. [DOI] [PubMed] [Google Scholar]

[R27] Wiesner RH, McDiarmid SV, Kamath PS, Edwards EB, Malinchoc M, Kremers WK, Krom RA, Kim WR. MELD and PELD: application of survival models to liver allocation. Liver transplantation. 2001;7(7):567–580. doi: 10.1053/jlts.2001.25879. [DOI] [PubMed] [Google Scholar]

[R28] Xiang F, Murray S. Restricted mean models for transplant benefit and urgency. Statistics in medicine. 2012;31(6):561–576. doi: 10.1002/sim.4450. [DOI] [PubMed] [Google Scholar]

[R29] Xu R, O’Quigley J. Estimating average regression effect under non-proportional hazards. Biostatistics. 2000;1(4):423–439. doi: 10.1093/biostatistics/1.4.423. [DOI] [PubMed] [Google Scholar]

[R30] Zhang M, Schaubel DE. Estimating differences in restricted mean lifetime using observational data subject to dependent censoring. Biometrics. 2011;67(3):740–749. doi: 10.1111/j.1541-0420.2010.01503.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] Hao L, Claggett B, Tian L, Uno H, Pfeffer MA, Solomon SD, Wei LJ. On the restricted mean survival time curve in survival analysis. Biometrics. 2016;72(1):215–221. doi: 10.1111/biom.12384. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Zucker DM. Restricted mean life with covariates: modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93(442):702–709. [Google Scholar]

PERMALINK

Modeling Restricted Mean Survival Time under General Censoring Mechanisms

Xin Wang

Douglas E Sehaubel, Ph.D.

Abstract

1 Introduction

2 Proposed Methods

3 Asymptotic Properties

Theorem 1

Theorem 2

4 Simulation Study

Table 1.

Table 2.

Fig. 1.

Fig. 2.

Table 3.

5 Analysis of Liver Disease Data

Table 4.

Fig. 3.

Table 5.

6 Discussion

Supplementary Material

Acknowledgments

Contributor Information

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Modeling Restricted Mean Survival Time under General Censoring Mechanisms

Xin Wang

Douglas E Sehaubel, Ph.D.

Abstract

1 Introduction

2 Proposed Methods

3 Asymptotic Properties

Theorem 1

Theorem 2

4 Simulation Study

Table 1.

Table 2.

Fig. 1.

Fig. 2.

Table 3.

5 Analysis of Liver Disease Data

Table 4.

Fig. 3.

Table 5.

6 Discussion

Supplementary Material

Acknowledgments

Contributor Information

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases