Estimating the Average Treatment Effect on Survival Based on Observational Data and Using Partly Conditional Modeling

Qi Gong; Douglas E Schaubel

doi:10.1111/biom.12542

. Author manuscript; available in PMC: 2017 Mar 24.

Published in final edited form as: Biometrics. 2016 May 18;73(1):134–144. doi: 10.1111/biom.12542

Estimating the Average Treatment Effect on Survival Based on Observational Data and Using Partly Conditional Modeling

Qi Gong ¹, Douglas E Schaubel ²

PMCID: PMC5116003 NIHMSID: NIHMS794870 PMID: 27192660

Summary

Treatments are frequently evaluated in terms of their effect on patient survival. In settings where randomization of treatment is not feasible, observational data are employed, necessitating correction for covariate imbalances. Treatments are usually compared using a hazard ratio. Most existing methods which quantify the treatment effect through the survival function are applicable to treatments assigned at time 0. In the data structure of our interest, subjects typically begin follow-up untreated; time-until-treatment and the pre-treatment death hazard are both heavily influenced by longitudinal covariates; and subjects may experience periods of treatment ineligibility. We propose semiparametric methods for estimating the average difference in restricted mean survival time attributable to a time-dependent treatment, the average effect of treatment among the treated, under current treatment assignment patterns. The pre- and post-treatment models are partly conditional, in that they use the covariate history up to the time of treatment. The pre-treatment model is estimated through recently developed landmark analysis methods. For each treated patient, fitted pre- and post-treatment survival curves are projected out, then averaged in a manner which accounts for the censoring of treatment times. Asymptotic properties are derived and evaluated through simulation. The proposed methods are applied to liver transplant data in order to estimate the effect of liver transplantation on survival among transplant recipients under current practice patterns.

Keywords: Landmark analysis, Observational data, Partly conditional model, Proportional hazards regression, Time-varying covariates, Treatment effect

1. Introduction

It is often of interest in biomedical settings to evaluate the benefit of a treatment on survival. In many clinical settings, treatment is not administered right at the time of diagnosis, such that a period of waiting time occurs for some (or perhaps all) patients. In cases where treatment is not randomized, it is often useful to assess the benefit of treatment under current treatment assignment patterns. Through the average effect-of-treatment-on-the-treated (ETT; Pearl, 2009), one can evaluate the benefit of treatment as currently practiced.

Survival probabilities are easily understood by health care professionals, as is the area under the survival curve (restricted mean lifetime). Various authors have proposed using Cox regression with the primary goal not being to estimate hazard ratios, but to compare differences in survival and/or restricted mean lifetime. For example, Zucker (1998) and Chen and Tsiatis (2001) proposed methods that involved averaging over fitted values from Cox models. Zhang and Schaubel (2011) extended the methods of Chen and Tsiatis (2001) to accommodate dependent censoring, then subsequently developed double-robust methods (Zhang and Schaubel, 2012). Each of the afore-described methods applies to treatments assigned at baseline, as opposed to time-varying treatments.

In the data structure of interest in this report, all patients begin follow-up untreated, with some eventually receiving treatment and others dying beforehand. Pre-treatment mortality and treatment assignment rates are dependent on longitudinal covariates (including periods during which a subject is declared treatment-ineligible), such that a patient’s pre-treatment death is dependently censored by the receipt of treatment. Post-treatment survival is dependent on a subject’s condition at the time of treatment, and the duration of pre-treatment follow-up time. Our objective is to contrast two scenarios: (a) treatment is never assigned (b) treatment is assigned according to current practice patterns.

The proposed methods are motivated by the end-stage liver disease (ESLD) setting. The number of available deceased-donor livers is always less than the number of patients in need of liver transplantation. As a result, medically suitable patients are placed on a liver transplant waiting list. Patients typically begin follow-up on the wait list (‘untreated’; i.e., not transplanted), such that transplantation can be viewed as a time-dependent treatment. In the U.S., chronic end-stage liver disease patients are sequenced in decreasing order of Model for End-Stage Liver Disease (MELD) score, a very strong predictor of pre-transplant mortality. Transplantation results in the dependent censoring of pre-transplant death, since MELD scores predict both wait list mortality and transplant rates. Note that patients may be removed from the wait list (or made inactive) and, in such cases, are permanently (or temporarily) ineligible to receive a transplant. In the setting of our interest, the effect of treatment on the treated is of greater interest than the average causal effect, due to the implausibility of all patients receiving treatment.

Our analysis in Section 5 is different from that in Gong and Schaubel (2013) since (i) the former only looked at pre-transplant survival; (ii) did not compare post- versus pre-transplant survival; (iii) reported contrasts only in terms of the hazard ratio; and (iv) did not exclude Status 1 (acute liver failure) patients and, in fact, focused on contrasting them with chronic ESLD patients.

We develop semiparametric methods to estimate the average effect-of-treatment-on-the-treated through partly conditional modeling. The proposed method involves averaging over the observed instances of treatment initiation, with the averaging accounting for the various complexities in data structure; e.g., treatment initiation times are subject to right censoring; patients may die before treatment is received; and patients cannot initiate treatment while ineligible. For each treated patient, we use the accrued history (up to the time of treatment initiation) to project out a survival curve for post-treatment residual lifetime. Based on the same accrued pre-treatment history, we also project out the survival curve that would apply in the absence of treatment. This set-up lends itself well to partly conditional modeling (Zheng and Heagerty, 2005; Gong and Schaubel, 2013); see also the closely related concept of landmark analysis (Feuer et al., 1992; van Houwelingen, 2007; van Houwelingen and Putter, 2012; Parast, Tian and Cai, 2014). Gong and Schaubel (2013) developed methods for fitting partly conditional hazard regression models which apply to the absence-of-treatment setting in our set-up. We extend the ideas in Gong and Schaubel (2013) to estimate the average ETT through residual survival and restricted mean survival time. Although we focus on partly conditional modeling in this report, it should be noted that other pertinent methods exist, as described in Section 6.

The remainder of this article is organized as follows. In Section 2, we describe the proposed methods. Asymptotic properties are provided in Section 3 (for proofs, see Supplementary Materials), with finite-sample properties evaluated through simulation in Section 4. We apply the proposed methods to the motivating data set in Section 5. Concluding remarks are made in Section 6.

2. Proposed Methods

2.1 Set-up and Notation

We now formalize the ideas introduced in Section 1, in the absence of censoring. We remove subscripting, such that defined variates pertain to any hypothetical subject. We let T represent treatment time, with T > 0 since subjects begin follow-up untreated. Death time in the absence of treatment is denoted by D⁰. Note that, consistent with the intent-to-treat principle, patients that initiate treatment are considered to be ‘treated’ thereafter. Let ℰ(s) = 1 if the patient is treatment-eligible (i.e., eligible to initiate treatment) at time s, and 0 otherwise. A patient may oscillate between the eligible and ineligible states before time D⁰ ∧ T, where a ∧ b = min(a, b). In particular, ℰ(s) = 0 for s > D⁰ ∧ T, since a patient cannot initiate treatment more than once, and cannot initiate treatment after death. Note that a patient may only initiate treatment while eligible; i.e., dI(T ≤ s) = ℰ(s)dI(T ≤ s).

Under the above-listed Scenario (a), T = ∞. Under Scenario (b), treatment only occurs when T < D⁰, in which case D⁰ is considered latent; D⁰ serves as a competing risk for T. For a patient with treatment time T = s, D¹ is the death time, such that (D¹ − s)₊ is the residual post-treatment survival, with a₊ = a · I(a > 0) and I(·) being the familiar 0/1 indicator function. The quantity (D⁰ − s)₊ then represents the residual survival beyond s that would have been observed in the absence of treatment. Note that if D⁰ < T, then D¹ is undefined.

The covariate vector, which contains some time-varying elements, is denoted by Z*(s). The patient’s covariate and eligibility history up to time s is given by ℋ(s) = {Z*(u), ℰ(u); 0 ≤ u < s}. The above described set-up is illustrated in Figure 1. For a patient with treatment-initiation time T = s, we are interested in the average difference between (D¹ − s)₊ and (D⁰ − s)₊ given [ℋ(s), T = s], with the average being taken with respect to the subdistribution function for T.

History on [0, s) and residual survival beyond s under two scenarios. In each case, covariate (demographic, biological) and treatment-eligibility history ℋ(s) has accumulated, and the subject is treatment-eligible at time s, ℰ(s) = 1. Under Scenario (1), T = s and (D¹ − s)₊ represents residual survival post-treatment. Under Scenario (0), T = ∞ since treatment is never available, such that death time is given by D⁰ and residual survival beyond time s equals (D⁰ − s)₊. Under the proposed methods, for each treated subject, partly conditional modeling is used to project (D¹ − s)₊ and (D⁰ − s)₊ given [ℋ(s), T = s]. The proposed effect-of-treatment-on-the-treated is then obtained after averaging over [ℋ(T), T].

2.2 Treatment Effect: Conditional and Average

For a patient initiating treatment at time T = s, there are two death times of interest; the post-treatment residual death time, (D¹ − s)₊, and the residual death time that would have occurred in the absence of treatment, (D⁰ − s)₊. At the time of treatment (e.g., T = s), we observe ℋ(s), and ℰ(s) = 1. Conditional on [ℋ(s), T = s], we contrast

S_{1} (t; s | ℋ (s), T = s) = P {(D^{1} - s) > t | ℋ (s), T = s)

(1)

S_{0} (t; s | ℋ (s), T = s) = P {(D^{0} - s) > t | ℋ (s), T = s)

(2)

the survival functions pertaining to the counterfactual variates (D¹ − s)₊ and (D⁰ − s)₊, respectively. Note that, in both S₁(t; s|·) and S₀(t; s|·), the time index s represents conditioning time, while t refers to residual survival t time units beyond the conditioning time, s. That is, S_j(t; s|·) pertains to a gap of t units beyond time s, which equals total time (s + t). We assume strong ignorability (Rubin, 1978), permitting inference on the counterfactuals (D¹ − s)₊ and (D⁰ − s)₊, through observed data. The strong ignorability assumption is detailed in the Supplementary Materials. An implication this assumption is that S₀(t; s|ℋ(s), T = s) = S₀(t; s|ℋ(s), ℰ(s) = 1), consistent with the counterfactuals (D¹ − s)₊ and (D⁰ − s)₊ being independent of the receipt of treatment at time s.

For fixed L > 0, restricted mean residual survival times are given by

μ_{1} (L; s | ℋ (s), T = s) = \int_{0}^{L} S_{1} (t; s | ℋ (s), T = s) d t

(3)

μ_{0} (L; s | ℋ (s), T = s) = \int_{0}^{L} S_{0} (t; s | ℋ (s), T = s) d t .

(4)

Conditioning on [ℋ(s), T = s], a pertinent contrast in survival functions is then

δ (t; s | ℋ (s), T = s) = S_{1} (t; s | ℋ (s), T = s) - S_{0} (t; s | ℋ (s), T = s),

(5)

while a contrast in restricted mean residual lifetime is defined as

Δ (L; s | ℋ (s), T = s) = μ_{1} (L; s | ℋ (s), T = s) - μ_{0} (L; s | ℋ (s), T = s) .

(6)

Average survival functions are then defined as

S_{1} (t) = E [S_{1} (t; T | ℋ (T), T)]

S_{0} (t) = E [S_{0} (t; T | ℋ (T), T)],

(7)

where, in each case, the expectation is taken with respect to the joint distribution of [ℋ(T), T)] over the identifiable range of T which would in practice be capped by the maximum follow-up time. Correspondingly, average restricted mean residual lifetimes are:

μ_{1} (L) = E [μ_{1} (L; T | ℋ (T), T)] = \int_{0}^{L} S_{1} (t) d t

μ_{0} (L) = E [μ_{0} (L; T | ℋ (T), T)] = \int_{0}^{L} S_{0} (t) d t .

(8)

The ETT can then be defined in terms of mean difference in survival probability as

δ (t) = E [δ (t; T | ℋ (T), T)] = S_{1} (t) - S_{0} (t)

(9)

and in terms of average difference in residual mean survival time, by

Δ (L) = E [Δ (L | ℋ (T), T)] = μ_{1} (L) - μ_{0} (L) = \int_{0}^{L} δ (t) d t .

(10)

Having specified the treatment effect of interest, the remaining subsections in Section 2 describe the proposed methods for estimating δ(t) and Δ(L).

2.3 Observed data: Notation and set-up

We let D_i denote the death time for subject i (i = 1, …, n). The time of treatment is given by T_i, with T_i = ∞ when D_i < T_i. Treatment and death times are subject to independent right censoring, C_i, intended to represent the combination of administrative censoring and random loss to follow-up. Observation time is then given by X_i = D_i∧C_i. We define counting processes for death, treatment and censoring, as N_i(t) = I(D_i ≤ t ∧ C_i), $N_{i}^{T} (t) = I (T_{i} \leq t \land D_{i} \land C_{i})$ and $N_{i}^{C} (t) = I (C_{i} \leq t \land D_{i})$ , respectively. Recall that ℰ_i(u) equals 1 if patient i is eligible for treatment at time u, and 0 otherwise. Note that $N_{i}^{T} (t) = \int_{0}^{t} ℰ_{i} (u) d N_{i}^{T} (u)$ , since treatment can only be initiated for an eligible subject. The covariate vector, observed longitudinally, is denoted by $Z_{i}^{*} (t)$ . The covariate and treatment-eligibility history for subject i as of time t is denoted by $ℋ_{i} (t) = {Z_{i}^{*} (u), ℰ_{i} (u); u \in [0, t)}$ . Covariate information is assumed to not be available after treatment is assigned, such that the total observed history for subject i is given by ℋ_i(X_i ∧ T_i); such data are not required by the proposed methods.

2.4 Assumed Models and Estimation Methods

We now describe the assumed models for ${(D_{i}^{1} - T_{i})}_{+}, {(D_{i}^{0} - T_{i})}_{+}$ , T_i and C_i. As implied by (7) and (8), our target ETT implies averaging over the observed [T_i, ℋ_i(T_i)] distribution. Per (1) and (2), we achieve this by working with $[{(D_{i}^{1} - s)}_{+} | ℋ_{i} (s), T_{i} = s]$ and $[{(D_{i}^{0} - s)}_{+} | ℋ_{i} (s), T_{i} = s]$ directly, after which we will then average explicitly. We model the partly conditional hazard function for $[{(D_{i}^{1} - s)}_{+} | ℋ_{i} (s), T_{i} = s]$ , which uses in the covariate vector all pertinent information in the history prior to the time of treatment, ℋ_i(T_i). The model is partly conditional since the covariate is not updated after the time treatment is initiated. The covariate is not updated after time T_i since we want to project residual survival from T_i onward, and a survival projections based on traditional time-dependent model would require a model for ℋ_i(s + t). In many cases, a model for ℋ_i(s + t) is complicated to fit accurately, and is of little inherent interest to the investigators.

2.4.1 Post-Treatment Survival

We let λ₁(t; s|ℋ(s), T = s) denote the hazard function corresponding to S₁(t; s|ℋ(s), T = s) from (1). We assume the following proportional hazards model,

λ_{1} (t; s | ℋ_{i} (s), T_{i} = s) = λ_{01} (t) exp {β_{1}^{'} Z_{i 1} (s)},

(11)

where the covariate Z_i1(s) is chosen to summarize the pre-treatment history, {ℋ_i(s), T_i = s}, pertinent to predicting post-treatment survival. Typically, time until treatment, T_i, would be represented parametrically in the covariate vector, Z_i1(s). Note that the Z_i1(s) covariate is fixed at treatment time T_i = s, reflecting the partly conditional (Zheng and Heagerty, 2005; Gong and Schaubel, 2013) nature of (11), which uses time-dependent data ‘frozen’ at time of treatment. This could also be considered a ‘landmark’ analysis (e.g., van Houwelingen, 2007), with landmark times being customized to each subject and set to T_i.

We assume that treatment times are independently censored by C_i. Assuming that (D_i − T_i)₊ is independently censored by (C_i − T_i)₊ given [Z_i1(T_i), T_i], parameter estimation for model (11) proceeds through unweighted partial likelihood. We denote the resulting estimators for model (11) by β̂₁ and Λ̂₀₁(t), with the latter being the Breslow-Aalen (1972) estimator. We estimate S₁(t; s|ℋ_i(s), T_i = s) by Ŝ₁(t; s|Z_i1(s)) = exp{−Λ̂₁(t; s|Z_i1(s))}, where ${\hat{Λ}}_{1} (t; s | Z_{i 1} (s)) = {\hat{Λ}}_{01} (t) exp {{\hat{β}}_{1}^{'} Z_{i 1} (s))}$ , and μ₁(L; s|ℋ_i(s), T_i = s) by ${\hat{μ}}_{1} (L; s | Z_{i 1} (s)) = \int_{0}^{L} Ŝ_{1} (t; s | Z_{i 1} (s)) d t$ .

2.4.2 Survival in the Absence of Treatment

We begin by describing the assumed hazard model for survival in the absence of treatment. We then outline the proposed data augmentation, which involves selecting calendar date cross-sections. Next, we detail fitting the model through an inverse weighted and stratified log rank estimating function.

We let λ₀(t; s|ℋ(s), T = s) denote the hazard function corresponding to (2). Under strong ignorability, note that λ₀(t; s|ℋ_i(s), T_i = s) = λ₀(t; s|ℋ_i(s), ℰ_i(s) = 1), which we use in listing the assumed model,

λ_{0} (t; s | ℋ_{i} (s), ℰ_{i} (s) = 1) = λ_{00} (t) exp {β_{0}^{'} Z_{i 0} (s)},

(12)

where Z_i0(s) is chosen such that λ₀(t; s|ℋ_i(s), ℰ_i(s) = 1) = λ₀(t; s|Z_i0(s)), implying that Z_i0(s) contains all elements of the history pertinent to predicting ${(D_{i}^{0} - s)}_{+}$ , including all appropriate functions of time-already-survived, s. Model (12) is partly conditional (Zheng and Heagerty, 2005; Gong and Schaubel, 2013) since, although the hazard at time s + t is of interest, the model conditions on information which is ‘frozen’ at time s. In contrast, a typical (fully) conditional or ‘time-dependent’ model would condition on ℋ_i(s + t).

Partly Conditional Model

The motivation for using a partly conditional model is described at the start of Section 2.4. Generally, fitting a partly conditional model requires some form of data augmentation in which the records corresponding to each subject’s observed data are restructured in order to facilitate fitting the assumed model. After such augmentation, each input record has a prior time survived (e.g., s_i) and corresponding prior history ℋ_i(s_i), with residual survival in the absence of treatment, (D_i − s_i)₊ then being analyzed. In fitting the post-treatment residual survival model (11), there is an obvious choice for each treated subject’s conditioning time, namely s_i ≔ T_i. In accordance with (2), we actually need to project residual survival (in the absence of treatment) beyond this same conditioning time. Although the appropriate conditioning time for projecting (1) is clear, the nature of the data augmentation for fitting model (12) requires consideration.

Calendar Time Cross-sections

In landmark analysis, typically survival from a specific follow-up time point (or set of specific time points) is desired, with survival probability projected out after the chosen landmark time(s). In our case, since treatment can occur at any time point (e.g., T = s), we need to be able to project conditional survival forward from any conditioning time s. This suggests a partly conditional model which includes terms representing previous time survived, s. Variation in previous time survived is then required, which means that sampling component of the data augmentation should be based on something other then s itself. We choose to sample based on calendar time, since each calendar time cross-section will contain wide variation in previous time survived. As we later describe, we stratify the model by cross-section for computational savings, which is important in large data sets like that we analyze in Section 5. For instance, Gong and Schaubel (2013) developed a partly conditional model which chooses the conditioning times to be the s_i values observed on a randomly selected calendar date. For example, consider a particular calendar date (e.g., 07/01/2004); input records for fitting the model would consist of s_i (subject i’s prior follow-up time as of 07/01/2004), the corresponding ℋ_i(s_i), and (X_i − s_i)₊ among subjects who (as of 07/01/2004) were alive, uncensored, yet-untreated, but eligible to initiate treatment; i.e., {i : X_i > s_i, ℰ_i(s_i) = 1}.

Method of Gong and Schaubel (2013)

The estimation of β₀ from model (12) was developed by Gong and Schaubel (2013). The essential ideas are presented here for continuity, and because the authors only derived the properties of β̂₀, but not those of Ŝ₀(t; s|Z_i0(s)), μ̂₀(L; s|Z_i0(s)), Ŝ₀(t) or μ̂₀(L).

To begin, we choose a set of K calendar dates, {CS₁, …, CS_K}. Each cross-section date CS_k is intended to represent a calendar date at which a set of treatment-eligible patients (could have been but) was not treated; we model residual survival in the absence of treatment from this date forward. For calendar date CS_k, we select the cross-section of treatment-eligible patients who were not treated (on or before that day). For patient i, follow-up time (previous time survived) as of calendar date CS_k is denoted by s_ik. Hence, a patient selected into cross-section CS_k must, as follow-up time s_ik be: alive (D_i > s_ik), uncensored (C_i > s_ik), untreated (T_i > s_ik) and treatment-eligible ℰ_i(s_ik) = 1. Three remarks are important at this juncture. First, treatment-eligibility is a cross-section inclusion criterion, but not a censoring criterion; e.g., having been included in cross-section k and, hence, with ℰ_i(s_ik) = 1, patient i is not censored upon subsequently being deemed treatment-ineligible. Second, the covariate will be frozen at s_ik, such that the survival projection for the residual time ${(D_{i}^{0} - s_{i k})}_{+}$ is based on ℋ_i(s_ik). Third, a patient included in cross-section k is censored if treated; this induces dependent censoring. Each of these remarks is formalized shortly.

We now establish additional notation pertinent to model (12). Since survival time from cross-section is modeled, we define the following times-since-cross-section: D_ik = (D_i − s_ik)₊, T_ik = (T_i − s_ik)₊ and C_ik = (C_i − s_ik)₊ as the death, treatment and censoring time respectively corresponding to the ith patient and measured from the kth cross section date. Figure 2 provides an illustration of how the treatment-free observation time is transformed into time-since-cross-section times. A modified counting and at-risk processes are also defined as N_i0k(t) = N_i(s_ik + t)I(T_i > s_ik + t) and Y_i0k(t) = I(D_ik ∧ C_ik ≥ t), respectively.

Examples of the relationship between cross-section time and follow-up time. Four subjects (i = 1, …, i = 4) and two cross sections (k = 1, 2) are shown. The four subjects begin follow-up at different calendar dates. For subject i = 1, failure times D₁₁ and D₁₂ correspond to cross sections k = 1 and k = 2, respectively. Note subject i = 1 is not censored at the treatment-ineligible time after cross section k = 2. Subject i = 2 is treated and, hence, dependently censored at time T₂₂ following cross section k = 2. Subject i = 3 is excluded from cross-section k = 1 and k = 2 due to starting and finishing follow-up between CS₁ and CS₂. Subject i = 4 is included in cross section k = 1, but then becomes (and remains) treatment-ineligible until some a time after cross section k = 2. With respect to cross section k = 1, subject i = 4 is censored at treatment time T₄₁, as opposed to being censored earlier at the beginning of the treatment-ineligible period. Subject i = 4 is treatment-ineligible at cross section k = 2 and, hence, not included in CS₂.

Note: Vertical dashed lines denote cross-section dates, while horizontal dashed lines denote treatment-ineligible period.

Following Gong and Schaubel (2013), we estimate β₀ through the stratified model,

λ_{0 k} (t; s | ℋ_{i} (s_{i k}), ℰ_{i} (s_{i k}) = 1) = λ_{00 k} (t) exp {β_{0}^{'} Z_{i 0} (s_{i k})},

(13)

where β₀ is the same parameter in the unstratified model of interest, (12). Model (13) is quite flexible. Non-proportionality can be accommodated by replacing β₀ with β₀(t), a parametric function on t. The parameter vector could also be allowed to be a parametric function of previous time survived; e.g., β_0k, or β₀(s_ik). Moreover, interactions between s_i and elements of ℋ_i(s_i) are also possible. Alternatively, van Houwelingen and Putter (2015) suggested a stopped Cox model to avoid non-proportionality, with artificial censoring at t = L. By breaking the stratification on k, one could also model the effect of calendar time.

Inverse weighting

Model (13) conditions on ℋ_i(s_ik). However, we anticipate that ℋ_i(s_ik + t) would be predictive of both the treatment hazard and the pre-treatment death hazard at time (s_ik + t). The mutual association, even conditional on ℋ_i(s_ik), between pre-treatment death after s_ik, the probability of treatment after s_ik and ℋ_i(s_ik + t) sets up dependent censoring of (D_i − s_ik)₊ by (T_i − s_ik)₊. The potential bias due to such dependent censoring can be corrected through a variant of Inverse Probability of Censoring Weighting (IPCW; e.g., Robins and Rotnitzky, 1992) which requires a model for the treatment-initiation hazard. We fit the following two treatment hazard models:

λ_{i}^{T} (t | ℋ_{i} (t), ℰ_{i} (t)) = ℰ (s_{i k}) ℰ_{i} (t) λ_{0}^{T} (t) exp {θ_{0}^{'} Z_{i} (t)},

(14)

ℰ (s_{i k} λ_{i k}^{†} (t; s_{i k} | Z_{i 0} (s_{i k}), ℰ (s_{i k}) = ℰ (s_{i k}) λ_{0 k}^{†} (t) exp {θ_{1}^{'} Z_{i} (s_{i k}))},

(15)

with model (14) assumed to be the correct model; model (15) is expected to be misspecified, but is only used to provide a weight stabilizer. We assume no-unmeasured-confounders for treatment, $λ_{i}^{T} (t | ℋ_{i} (t)) = λ_{i}^{T} (t | ℋ_{i} (D_{i}), D_{i})$ , and that $λ_{i}^{T} (t | ℋ_{i} (t)) = λ_{i}^{T} (t | Z_{i} (t))$ . Note that $λ_{i}^{T} (t | ℋ_{i} (t), ℰ_{i} (t))$ in (14) uses (total) follow-up time t (measured from time 0) as the time axis, conditions on information on [0, t), while $λ_{i k}^{†} (t; s_{i k} | Z_{i 0} (s_{i k}), ℰ (s_{i k}) = 1))$ in (15) uses (residual) time since s_ik and conditions on the history over [0, s_ik] given [ℰ_i(s_ik) = 1]. Parameters in (14) and (15) are estimated through standard partial likelihood (Cox, 1975).

As derived in Gong and Schaubel (2013), an appropriate weight function is given by

W_{i k}^{A} (t) = Y_{i 0 k} (t) exp {Λ_{i}^{T} (s_{i k} + t) - Λ_{i}^{T} (s_{i k})},

(16)

where $Λ_{i}^{T} (t) = \int_{0}^{t} ℰ_{i} (u) λ_{0}^{T} (u) exp {θ_{0}^{'} Z_{i} (u)} d u$ . The quantity $W_{i k}^{A} (t)$ can be thought of as the inverse of the conditional probability of remaining untreated at time (s_ik + t), given that the subject was untreated and treatment-eligible at time s_ik. Gong and Schaubel (2013) suggest the following stabilized inverse weight,

W_{i k}^{B} (t) = Y_{i 0 k} (t) \frac{exp {Λ_{i}^{T} (s_{i k} + t) - Λ_{i}^{T} (s_{i k})}}{exp {Λ_{i k}^{†} (t)}} .

(17)

Note that artificially censoring subjects at t = L would be an alternative to the stabilizer.

Parameter Estimation for Model (12)

An estimator for β₀, denoted by β̂₀, is obtained through solving the following inverse-weighted score function,

U_{0} (β) = \sum_{i = 1}^{n} \sum_{k = 1}^{K} \int_{0}^{τ_{0 k}} ℰ_{i} (S_{i k}) {Z_{i 0} (s_{i k}) - {\bar{Z}}_{0 k} (t; β, W)} W_{i k}^{B} (t) d N_{i 0 k} (t),

(18)

with ${\bar{Z}}_{0 k} (t; β_{0}) = R_{0 k}^{(1)} (t; β_{0}) / R_{0 k}^{(0)} (t; β_{0})$ and $R_{0 k}^{(d)} (t; β_{0}) = n^{- 1} \sum_{i = 1}^{n} ℰ_{i} (s_{i k}) W_{i k} (t) Z_{i 0} {(s_{i k})}^{\otimes d} exp {β_{0}^{'} Z_{i 0} (s_{i k})}$ with d = 0, 1, 2 and where τ_0k satisfies P{Y_i0k(τ_0k) = 1} > 0, and can in practice be set to the largest X_ik among subjects with ℰ_i(s_ik) = 1. A Breslow-Aalen estimator pooled across strata is obtained as

{\hat{Λ}}_{00} (t; {\hat{β}}_{0}) = n^{- 1} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \int_{0}^{t} R_{0}^{(0)} {(u; {\hat{β}}_{0})}^{- 1} ℰ_{i} (s_{i k}) W_{i k}^{B} (u) d N_{i 0 k} (u)

(19)

for t ∈ (0, L], where $R_{0}^{(0)} (u; β_{0}) = \sum_{k = 1}^{K} R_{0 k}^{(0)} (u; β_{0})$ .

2.4.3 Conditional Treatment Effect

Consider patient i, treated at follow-up time T_i = s with covariate history ℋ_i(s). Post-treatment survival probability for this patient is predicted by Ŝ₁(t; s|ℋ_i(s), T_i = s), while predicted L−year restricted mean post-treatment lifetime is given by μ̂₁(L; s|ℋ_i(s), T_i = s). Correspondingly, in the absence of treatment, predicted survival and L-year restricted mean lifetime for subject i (from T_i onward) would be given by Ŝ₀(t; s|ℋ_i(s), T_i = s) and ${\hat{μ}}_{0} (L | ℋ_{i} (s), ℰ_{i} (s) = 1) = \int_{0}^{L} Ŝ_{0} (t | ℋ_{i} (s), ℰ_{i} (s) = 1) d t$ , respectively. The treatment effect corresponding to treatment initiation by subject i at follow-up time T_i can then be estimated by

\hat{δ} (t; T_{i} | ℋ_{i} (T_{i}), T_{i}) = Ŝ_{1} (t; T_{i} | ℋ_{i} (T_{i}), T_{i}) - Ŝ_{0} (t; T_{i} | ℋ_{i} (T_{i}), ℰ_{i} (T_{i}) = 1, T_{i}),

(20)

in terms of survival probability, and

\hat{Δ} (L; T_{i} | ℋ_{i} (T_{i}), T_{i}) = {\hat{μ}}_{1} (L; T_{i} | ℋ_{i} (T_{i}), T_{i}) - {\hat{μ}}_{0} (L; T_{i} | ℋ_{i} (T_{i}), ℰ_{i} (T_{i}) = 1, T_{i})

(21)

in terms of restricted residual mean survival time.

2.4.4 Average Treatment Effect

Having established how to estimate the treatment effect for a subject treated at T_i = s with covariate history ℋ_i(s), we now describe how to estimate the quantities of chief interest, namely δ(t) = E[δ(t|ℋ_i(s), T_i = s)] and $Δ (L) = \int_{0}^{L} δ (t) d t$ from (9). In the absence of censoring, we could average with respect to the empirical distribution of {T_i, ℋ_i(T_i)} values. Right censoring of T_i values rules out using the sample mean, since this averaging would then generally depend on the C_i distribution. This implies inverse weighting the observed treatment assignments, such that the inverse weighted distribution reflects that which would have been obtained in the absence of censoring. We use the result,

E [\int_{0}^{t} \frac{d N_{i}^{T} (u)}{G_{i} (u)} | ℋ_{i} (u)] = F_{i}^{T} (t | ℋ_{i} (t)),

(22)

where $F_{i}^{T} (t | ℋ_{i} (t)) = E [\int_{0}^{t} d I (T_{i} \leq u) | ℋ_{i} (u)]$ is analogous to the cumulative incidence function for T_i (with D_i serving as a competing risk) and with G_i(u) = P(C_i > u|Z_i(0)). We assume the following proportional hazards model for C_i,

λ_{i}^{C} (t) = λ_{0}^{C} (t) exp {α_{0}^{'} Z_{i} (0)} .

(23)

Observed data used to fit model (23) include {X_i, I(C_i < D_i), Z_i(0)}, with α₀ and $Λ_{0}^{C} (t) = \int_{0}^{t} λ_{0}^{C} (u) d u$ estimated through unweighted Cox regression. Note that C_i is viewed in this report as administrative censoring, in which case (23) may not even depend on Z_i(0). If in fact $λ_{i}^{C} (t)$ depended on the ℋ_i(t), model (23) could easily be enriched to accommodate such dependence, with little subsequent modification to the procedures next described.

Finally, estimators of δ(t) and Δ(L) are given by

\hat{δ} (t) = \frac{\sum_{i = 1}^{n} \int_{0}^{τ} \hat{δ} (t; u | ℋ_{i} (u), T_{i} = u) Ĝ_{i} {(u)}^{- 1} d N_{i}^{T} (u)}{\sum_{i = 1}^{n} \int_{0}^{τ} Ĝ_{i} {(u)}^{- 1} d N_{i}^{T} (u)},

(24)

\hat{Δ} (L) = \int_{0}^{L} \hat{δ} (t) d t

(25)

respectively, where $Ĝ_{i} (u) = exp {- {\hat{Λ}}_{i}^{C} (u)}$ , and with τ satisfying P(X_i ≥ τ) > 0 and typically chosen to be the maximum observed follow-up time.

3. Asymptotic Properties

We assume that the random vectors {X_i, N_i(X_i), $N_{i}^{T} (X_{i})$ , ℋ_i(X_i ∧T_i)} are independent and identically distributed for i = 1 … n, with all elements of ℋ_i(t) bounded for t ∈ (0, τ]. A complete list of regularity conditions is provided in the Supplementary Materials document.

Theorem 1

Under certain regularity conditions, n^1/2{δ̂(t) − δ(t)} and n^1/2{Δ̂(L) − Δ(L)} each converge asymptotically to zero-mean Gaussian processes with covariance functions E[ξ_j(t)²] and $E [η_{j}^{2}]$ , respectively, where {ξ₁(t), …, ξ_n(t)} and {η₁(L), …, η_n(L)} are i.i.d. with mean 0 asymptotically. Expressions for ξ_i(t) and $η_{i} (L) = \int_{0}^{L} ξ_{i} (t) d t$ , which are quite lengthly, are provided in the Supplementary Materials.

Variance estimators for δ̂(t) and Δ̂(L) are given by $n^{- 2} \sum_{i = 1}^{n} {\hat{ξ}}_{i} {(t)}^{2}$ and $n^{- 2} \sum_{i = 1}^{n} {\hat{η}}_{i} {(L)}^{2}$ , respectively; where η̂_i(L) and ξ̂_i(t) are computed by replacing all limiting values by their empirical counterparts. A proof of Theorem 1 is given in the Appendix. The essence of the proof is demonstrating that, asymptotically, $n^{1 / 2} {\hat{δ} (t) - δ (t)} = n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} (t) + o_{p} (1)$ through a sequence of Taylor series expansions and applications of empirical process results.

The proof is provided for the weight, $Ŵ_{i k}^{A} (t)$ . In practice, the stabilized weight, $Ŵ_{i k}^{B} (t)$ would often be preferred. As implied by Theorem 1, the computation of the variance is quite involved, and such computation becomes more complicated when a stabilizer is incorporated. Such concerns motivate a computationally simpler form for the variance estimator, resulting from taking Ĝ_i(t)⁻¹ and $Ŵ_{i k}^{A} (t)$ , or $Ŵ_{i k}^{B} (t)$ as the case may be, as fixed. Variance estimators for δ̂(t) and Δ̂(L) then simplify considerably. We evaluate the performance of these simplified variance estimators through simulation in Section 4.

4. Simulations

We generated treatment-free survival to follow the assumed partly conditional model using methods from Gong and Schaubel (2013). First, subject i enters the study on calendar date, B_i, which is generated from a Uniform(0, b) distribution. We then generate a single binary (0,1) group indicator Z_ia, taking the value 1 with probability 0.5. A longitudinal covariate, Z_i(s_ik), is then created and assumed to be measured at a common set of cross-section dates: CS₁, CS₂, …, CS_K. To generate data {D_i, Z_ia, Z_ib} where Z_ib = vec{Z_i(s_ik)}, we first let $Z_{i b 0} = b_{i} + \sum_{k = 1}^{K} log (V_{i k}) / γ_{2}$ , where b_i ~ N(μ, σ²) and V_ik ~ P(ρ), independent positive stable random variables with index ρ. A pre-treatment death time, $D_{i}^{0}$ , is then generated with hazard $λ_{i 0} (t) = V_{i 0}^{1 / ρ} λ_{0} (t) exp {γ_{1} Z_{i a} + γ_{2} Z_{i b 0}}$ , where V_i0 ~ P(ρ) and is independent of V_ik, with Λ₀(t) = (t/a)^1/ρ² and a is a constant. Setting Z_i(s_ik) = Z_ib0 − log(V_ik)/γ₂, the pre-treatment death hazard can then be written as $λ_{i 0} (t) = V_{i 0}^{1 / ρ} λ_{0} (t) exp {γ_{1} Z_{i a} + γ_{2} Z_{i} (s_{i k}) + log (V_{i k})}$ . Treatment time, T_i, is generated from the proportional hazards model, $λ_{i}^{T} (t) = λ_{0}^{T} (t) exp {θ_{01} Z_{i a} + θ_{02} I (R_{i} > t)}$ , where $λ_{0}^{T} (t) = d_{3}$ and $θ_{0}^{'} = (θ_{01}, θ_{02})$ and the time of treatment-ineligibility, R_i, is generated with hazard $λ_{i}^{R} (t) = λ_{0}^{R} (t) exp {d_{1} V_{i 0}}$ , where $λ_{0}^{R} (t) = d_{2}$ . Thus, R_i and D_i are positively correlated, which is consistent with the data which motivated the proposed methods. Independent censoring time, C_i, is generated from hazard $λ_{i}^{C} (t) = λ_{0}^{C} (t) exp {α_{0} Z_{i a}}$ , where $λ_{0}^{C} (t) = d_{4}$ . Note that treatment time and pre-treatment death time, T_i, and D_i are dependent since both depend on treatment-ineligibility time, R_i. However, the independent censoring time C_i is independent of D_i conditional on Z_ia.

After obtaining the pertinent survival function, transforming the time scale to represent time since cross-section (setting t_k = t − s_ik), then averaging, we obtain

λ_{i} (t_{k} | Z_{i a}, Z_{i} (s_{i k}), D_{i} > s_{i k}) = \frac{λ_{0} (t_{k} + s_{i k}) ρ^{2} {Λ_{0} (t_{k} + s_{i k})}^{(ρ^{2 - 1})}}{cos {(π ρ / 2)}^{(ρ + 1)}} exp {ρ^{2} γ_{1} Z_{i a} + ρ^{2} γ_{2} Z_{i} (s_{i k})} .

Setting Λ₀(t) = (t/a)^1/ρ² and λ₀(t_k + s_ik)ρ²{Λ₀(t_k + s_ik)}^(ρ²−1) = 1/a yields

λ_{i} (t_{k} | Z_{i a}, Z_{i} (s_{i k}), D_{i} > s_{i k}) = exp {ρ^{2} γ_{1} Z_{i a} + ρ^{2} γ_{2} Z_{i} (s_{i k})} / [a cos {(π ρ / 2)}^{(ρ + 1)}] .

If we define λ_i0k(t; s_ik) = λ_i(t_k|Z_ia, Z_i(s_ik), D_i > s_ik), λ_00k(t) = [a cos(πρ/2)^(ρ+1)]⁻¹ and β₀ = (β₀₁, β₀₂) = (ρ²γ₁, ρ²γ₂), then the proportional hazards model for pre-treatment death time is given by λ_i0k(t; s_ik) = λ_00k(t) exp{β₀₁Z_ia + β₀₂Z_i(s_ik)}.

For patients who received treatment prior to dying (D_i > T_i), a post-treatment death time ${(D_{i}^{1} - T_{i})}_{+}$ , is then generated via the hazard, λ_i1(t; T_i) = λ₀₁(t) exp{β₁₁Z_ia+β₁₂Z_i(T_i)}, where t represents time from treatment and $β_{1}^{'} = (β_{11}, β_{12}) = (ρ^{2} γ_{1}, ρ^{2} γ_{2})$ . We set λ₀₁(t) = a₁.

The complexity in the data generator is necessary to induce the partly conditional structure of the pre-treatment survival model. The positive stable frailty has become a common choice in the simulation of multivariate survival set-ups due to its preservation of the proportional hazards assumption both conditionally and marginally. Analogous set-ups were used by Zheng and Heagerty (2005) and Gong and Schaubel (2013).

We used K = 10 cross section dates, with CS_k = 100 × k. For the simulation results presented, parameter specifications were as follows: b = 500, (θ₀₁, θ₀₂) = (−1, −1), μ = 18, σ = 1, (γ₁, γ₂) = (−1, −0.5), d₁ = d₂ = d₃ = d₄ = 0.001, and ρ = 0.8, which implies (β₀₁, β₀₂) = (β₁₁, β₁₂) = (−0.64, −0.32); We varied a from a = 2000, to a = 5000 and a = 7000, which led to treatment initiation rates of 10%, 15% and 20%, respectively; with similar independent censoring rates in each case. Each data configuration was replicated 1000 times, with n = 500 subjects per replicate.

We present settings where treatment has no effect (δ(t) = Δ(L) = 0), for which a₁ = [a cos(πρ/2)^(ρ+1)]⁻¹. We also list results for a setting with a positive treatment effect (δ(t) > 0, Δ(L) > 0) induced by specifying a₁ = 0.5 × 10⁻⁴. In developing appropriate parameter settings, we conceptualized the time scale as representing days. For reporting purposes, time is recorded in years, with results presented for δ̂(1), δ̂(2), δ̂(3) and Δ̂(3). The weight $Ŵ_{i k}^{B} (t)$ was used throughout, with the simplified variance estimators applied.

Table 1 presents simulation results for settings with Δ(L) = 0 and Δ(L) > 0. The quantity Δ(L), with L = 3, can be interpreted as the difference of 3-year restricted mean survival time due to treatment, among the treated. The proposed estimators appear to be approximately unbiased, with coverage probabilities close to the nominal 95% level. Some degree of under-coverage is observed, which is due to the approximation of the results from Section 3 by treated the (random) weights as fixed. The under-coverage is not in unacceptable amounts, particularly relative to the great reduction in complexity and hence computational burden associated with the approximation.

Table 1.

Simulation results: n = 500, with weight function $W_{i k}^{B} (t)$

Setting

E [N_{i}^{C} (τ)]

E [N_{i}^{T} (τ)]

Parameter

True

BIAS

ESE

ASE

0.10

∆(3)

0.040

0.204

0.190

0.92

δ(1)

0.012

0.089

0.082

0.92

δ(2)

0.016

0.092

0.085

0.93

δ(3)

0.022

0.094

0.082

0.91

0.15

∆(3)

0.022

0.164

0.154

0.93

δ(1)

0.007

0.065

0.061

0.93

δ(2)

0.010

0.077

0.072

0.93

δ(3)

0.010

0.083

0.077

0.91

0.20

∆(3)

0.009

0.144

0.141

0.94

δ(1)

0.001

0.056

0.054

0.93

δ(2)

0.004

0.067

0.066

0.94

δ(3)

0.005

0.074

0.073

0.94

0.10

∆(3)

0.87

0.030

0.204

0.190

0.92

δ(1)

0.29

0.009

0.088

0.074

0.92

δ(2)

0.35

0.009

0.100

0.088

0.92

δ(3)

0.35

0.008

0.110

0.097

0.92

0.15

∆(3)

0.61

0.017

0.150

0.145

0.94

δ(1)

0.19

0.006

0.054

0.052

0.94

δ(2)

0.25

0.008

0.070

0.068

0.94

δ(3)

0.28

0.005

0.082

0.077

0.92

0.20

∆(3)

0.43

0.020

0.135

0.133

0.94

δ(1)

0.13

0.006

0.048

0.94

δ(2)

0.18

0.009

0.064

0.062

0.93

δ(3)

0.20

0.006

0.077

0.072

0.93

Open in a new tab

ESE = empirical standard error; ASE = asymptotic standard error CP = 95% coverage probability; $E [N_{i}^{C} (τ)]$ = proportion censored; $E [N_{i}^{T} (τ)]$ = proportion treated; δ(t) and Δ(L) are as defined in (9) and (10), respectively.

We examined the performance of the proposed methods under various degrees of model misspecification (see Supplementary Materials). The methods generally perform adequately, although some bias is introduced, and increases with increasing model misfit. The method appears to be most sensitive to misspecification of the treatment initiation hazard.

5. Application to Liver Transplant Data

We applied the proposed methods to estimate the average effect of liver transplantation among the transplanted, by Model for End-stage Liver Disease (MELD) score. This study used data from the Scientific Registry of Transplant Recipients (SRTR). The SRTR data system includes data on all donor, wait-listed candidates, and transplant recipients in the U.S., submitted by the members of the Organ Procurement and Transplantation Network (OPTN), and has been described elsewhere. The Health Resources and Services Administration (HRSA), U.S. Department of Health and Human Services provides oversight to the activities of the OPTN and SRTR contractors.

The study population included patients age ≥ 18 wait listed between 03/01/2002 and 12/31/2009. We excluded patients who were Status 1 (acute liver failure) or previously transplanted. Cross-section dates were chosen every 7 days, 30 days or 90 days from 03/01/2002 to 12/31/2009, which led to K=409, 96, or 32 cross sections respectively. The transplant hazard model, $λ_{i r}^{T} (t) = ℰ_{i} (t) λ_{0 r}^{T} (t) exp {θ_{0}^{'} Z_{i} (t)}$ , was stratified by United Network for Organ Sharing (UNOS) Region (r = 1, …, 11). The covariate, Z_i(t), included MELD score, albumin, age, gender, race, diagnosis of Hepatitis C, body mass index, diabetes, hospitalization, blood type, dialysis within prior week, encephalopathy, ascites and serum creatinine.

The pre-transplant death model, $λ_{i 0 k r} (t) = λ_{00 k r} (t) exp {β_{0}^{'} Z_{i} (s_{i k})}$ , was also stratified, where k = 1, …, K stands for cross section and r again denotes UNOS Region. The covariate, Z_i(s_ik), included MELD score, albumin, age, gender, race, diagnosis, body mass index, diabetes, hospitalization status at listing, previous dialysis, malignancy, time on wait-list (i.e., s_ik itself), slope of MELD score over [0, s_ik], slope of albumin, percentage of time spent in inactive status, and percent of time receiving dialysis. In the post-transplant death model, $λ_{i 1} (t; T_{i}) = λ_{01} (t) exp {β_{1}^{'} Z_{i 1} (T_{i})}$ , Z_i1(T_i) included terms for T_i, MELD score, albumin, age, gender, race, diagnosis, body mass index, diabetes, hospitalization status at listing, previous dialysis and malignancy and Donor Risk Index (DRI; Feng et al., 2006).

The pre-transplant study sample consisted of n = 66, 884 patients, of which 34,539 were observed to receive a deceased-donor liver transplant. For the MELD 30–40 subgroup, weekly cross section dates were chosen. For MELD 18–29 cross sections were drawn monthly. For MELD 6–17, cross sections were drawn every 3 months. Note that, we also tried weekly cross section dates for MELD 6–29 patients, which yielded almost identical results. The analysis was based on the weight, $W_{i k}^{B} (t)$ .

Figure 3 shows the estimated survival curves for MELD groups 6–8, 15–17, 20–22 and 36–40. Note that the MELD score categories refer to MELD at transplant. Within a MELD category, Ŝ₁(t) can be interpreted as the average survival probability, with t representing residual time post-transplant. Analogously, Ŝ₀(t) can be interpreted as the average survival that would have resulted in the absence of liver transplantation, among patients who received a liver transplant. For the MELD 6–8 group, survival in the absence-of-transplantation exceeds post-transplant survival until approximately t = 2 years post-transplant. However, Ŝ₁(t) > Ŝ₀(t) for t > 2 years, with the distance between the curves widening as t increases. The early survival advantage (absence-of-transplant versus with a transplant) for patients in the MELD 6–8 group is the combination of relatively mortality in this subgroup, combined with the risk of surgery-related mortality (not faced unless transplantation occurs). The early survival advantage without transplant is even observed in MELD 15–17 patients, but is much less pronounced and very short-lived. In fact, Ŝ₁(t) > Ŝ₀(t) for t > 0.25 years in this subgroup. For MELD 36–40 group, the absence-of-transplant survival curve drops dramatically during the first couple of months, then steadily declines thereafter. Note that Ŝ₁(t) curves are quite similar across MELD subgroups, with Ŝ₀(t) decreasing strongly as MELD increases.

Analysis of SRTR data: Estimated survival curves after with a liver transplant (solid line) and in the absence of liver transplantation (dashed line) among liver transplant recipients. The time axis t is years post-transplant.

In Table 2, we list estimates of the difference in survival probability, δ̂(t) for t = 1, 3, 5 years, as well as Δ̂(5), the difference in 5-year restricted mean residual lifetime. The group that benefits the most from liver transplantation is clearly MELD 36–40, with an average gain in residual survival time of Δ̂(5) ≈ 2.4 years. The next greatest gain is observed in the MELD 30–35 group, with Δ̂(5) = 1.4 years. For MELD scores between 15 and 30, there is little difference in the gain in 5-year restricted mean residual survival time, with Δ̂(5) fluctuating about 1 year across the MELD 26–29, 23–25, 20–22, 18–19 and 15–17 subgroups. Only for the MELD 6–8 group is H₀ : Δ(5) = 0 not rejected.

Table 2.

Analysis of SRTR data: Estimating the effect of liver transplantation on the transplanted (with 95% confidence interval in parentheses), by MELD score at transplant.

MELD Score	δ̂(1)	δ̂(3)	δ̂(5)	Δ̂(5)
6–8	−0.03 (−0.05, −0.01)	0.03 (−0.01, 0.05)	0.11 (0.07, 0.15)	0.11 (−0.03, 0.25)
9–11	−0.02 (−0.04, 0.00)	0.09 (0.07, 0.11)	0.17 (0.15, 0.19)	0.29 (0.15, 0.43)
12–14	0.02 (0.00, 0.04)	0.16 (0.12, 0.20)	0.23 (0.19, 0.27)	0.59 (0.43, 0.75)
15–17	0.09 (0.07, 0.11)	0.26 (0.22, 0.30)	0.32 (0.28, 0.36)	1.00 (0.80, 1.20)
18–19	0.15 (0.13, 0.17)	0.26 (0.24, 0.28)	0.27 (0.23, 0.31)	1.06 (0.90, 1.22)
20–22	0.19 (0.15, 0.23)	0.29 (0.23, 0.35)	0.30 (0.24, 0.36)	1.23 (0.95, 1.41)
23–25	0.19 (0.15, 0.23)	0.23 (0.19, 0.27)	0.26 (0.18, 0.34)	1.07 (0.79, 1.35)
26–29	0.25 (0.17, 0.33)	0.19 (0.11, 0.27)	0.16 (0.06, 0.26)	0.99 (0.59, 1.39)
30–35	0.33 (0.25, 0.41)	0.27 (0.07, 0.47)	0.25 (0.01, 0.49)	1.45 (0.05, 2.85)
36–40	0.48 (0.40, 0.56)	0.48 (0.36, 0.60)	0.45 (0.33, 0.57)	2.38 (1.70, 3.06)

Open in a new tab

δ̂(t) and Δ̂(L) are as defined in (24) and (25), respectively. The time scale represents years post-transplant.

In the Supplementary Materials, we provide results based on the Sequential Stratification method (Schaubel, Wolfe and Port, 2006; Schaubel et al., 2009), which features inverse weighted time-dependent stratification to create customized comparisons groups for each subject receiving the time-dependent treatment. Comparing our results in Table 2 to those based on Sequential Stratification, the main difference is in the MELD 6–8 group; the models from Sharma et al (2015) report a hazard ratio of 2.04 (p < 10⁻⁴), indicating that liver transplant is associated with a doubling of the mortality hazard in this subgroup. In the presence of non-proportionality (which is clear in Figure 3, particularly for this subgroup), the hazard ratio and difference in restricted mean do not have to agree.

Additional analysis is presented in the Supplementary Materials. For each MELD category, multiplying the number of transplants by the δ̂(5) yields the number of life-years saved via liver transplantation (considering only the first 5 post-transplant). The largest number of transplants was in the MELD 15–17 category (5,028), but the greatest number of life-years saved (7,649) was in the MELD 36–40 group. We estimate that 34,757 years of life were spared based on the liver transplants observed in this analysis. The Supplementary Materials also present plots of pre-transplant MELD profiles over time, the baseline pre-transplant mortality hazard, the liver transplant baseline hazard, and cumulative incidence of transplantation.

6. Discussion

In this report, we develop methods for estimating the average effect on the treated of a time-dependent treatment. The methods can be used to evaluate the benefit, in terms of patient survival, of a treatment under current treatment assignment practices. The methods were applied to quantify the survival benefit of deceased-donor liver transplantation among the transplanted, by Model for End-stage Liver Disease (MELD) score.

The proposed methods are not intended to guide treatment decisions. For example, the fact that we estimate a larger treatment effect for MELD 36–40 than for 30–35 does not imply that a patient with MELD=32 should wait until his/her MELD score increases to ≥ 36 before they agree to be transplanted. The proposed methods cannot generally be used to compare treatment effects, since each treatment effect is averaged differently. For example, the difference in the treatment effect between patients transplanted at MELD 15–17 (Δ̂(5) = 1.00) and MELD 12–14 (Δ̂(5) = 0.59) is partly attributable to the former group being transplanted with higher quality donor livers.

There are now many methods available for evaluating a time-dependent treatments. Marginal Structural Models (MSM; e.g., Hernán, Brumback and Robins, 2000; Robins, Hernán and Brumback, 2000) are not well-suited to our set-up due to the potential for treatment to interact with time-varying covariates. Structural Nested Failure Time Models (SNFTMs; e.g., Robins, 1988; Joffe et al., 1998; Keiding et al., 1999; Hernan et al., 2005; Taubman et al., 2009; Vock et al., 2013) are an alternative. In particular, the method of Vock et al. (2013) was motivated by the lung transplant setting. Versions of Sequential Stratification, which involves stratified and inverse weighted Cox regression, have been used to evaluate the benefit of kidney transplantation (Schaubel, Wolfe and Port, 2006) and liver transplantation (Schaubel et al., 2009). An advantage the proposed method over SNFTMs and Sequential Stratification is the avoidance of any parametric assumptions regarding the treatment effect. SNFTMs assume that treatment alters the time scale through a constant, while Sequential Stratification assumes proportionality of the pre- and post-treatment hazard functions. A further advantage of our proposed methods over SNFTMs relates to implementation. Although explicit coding would be required for either approach, the ‘core’ models in our method merely involve Cox regression and, therefore, can be fitted using standard statistical software (SAS, R) after modifying the input data appropriately.

In estimating the ETT, we consider the absence of treatment; i.e., T_i = ∞. In setting where this is found to be too ambitious a goal (e.g., lack of sufficiently long follow-up, in a setting where treatment is inevitable), one could change [T_i = ∞] to [T_i > L] in describing the absence-of-treatment scenario.

An alternative to the measures proposed in (9) and (10) would be to redefine S₁(t) to be the population average survival (i.e., averaging over the current treated and untreated experiences), with S₀(t) then representing the average population survival in the absence of treatment. Unless strong or unrealistic assumptions were made, the ‘core’ models for this approach would be quite similar to those in the proposed approach, except for the pre-treatment hazard model. The proposed averaging would be preferred in many practical settings (including the liver transplant setting which motivated our current work) since the absence of a treatment benefit among non-recipients is made explicit.

Supplementary Material

Supp Info

NIHMS794870-supplement-Supp_Info.pdf^{(885.9KB, pdf)}

Acknowledgments

This work was supported in part by National Institutes of Health Grant R01-DK070869. The data reported here have been supplied by the Minneapolis Medical Research Foundation (MMRF) as the contractor for the Scientific Registry of Transplant Recipients (SRTR). The interpretation and reporting of these data are the responsibility of the authors and in no way should be seen as an official policy of or interpretation by the SRTR or the U.S. Government. The authors wish to thank the Associate Editor and two Reviewers, whose comments and suggestions led to considerable improvement of the manuscript. They also thank Min Zhang for her many thoughtful suggestions.

Footnotes

Supplementary Materials

Supplementary Materials, referenced in Sections 3, 4 and 5, are available with this paper at the Biometrics website on Wiley Online Library.

References

Andersen PK, Gill RD. Cox’s regression model for counting processes: A large sample study. The Annals of Statistics. 1982;10:1100–1120. [Google Scholar]
Breslow NE. Contribution to the discussion of paper by D.R. Cox. Journal of the Royal Statistical Society Series B. 1972;34:216–217. [Google Scholar]
Chen P, Tsiatis AA. Causal inference on the difference of the restricted mean life between two groups. Biometrics. 2001;57:1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]
Cox DR. Regression models and life tables (with Discussion) Journal of the Royal Statistical Society, Series B. 1972;34:187–200. [Google Scholar]
Cox DR. Partial likelihood. Biometrika. 1975;62:269–275. [Google Scholar]
Feng S, Goodrich NP, Bragg-Gresham JL, Dykstra DM, Punch JD, DebRoy MA, Greenstein SM, Merion RM. Characteristics associated with liver graft failure: The concept of a donor risk index. American Journal of Transplantation. 2006;6:783–790. doi: 10.1111/j.1600-6143.2006.01242.x. [DOI] [PubMed] [Google Scholar]
Feuer EJ, Hankey BF, Gaynor JJ, Wesley MN, Baker SG, Meyer JS. Graphical representation of survival curves associated with a binary non-reversible time dependent covariate. Statistics in Medicine. 1992;11:455–474. doi: 10.1002/sim.4780110408. [DOI] [PubMed] [Google Scholar]
Gong Q, Schaubel DE. Partly conditional estimation of the effect of a time-dependent factor in the presence of dependent censoring. Biometrics. 2013;69:338–347. doi: 10.1111/biom.12023. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hernán MA, Cole SR, Margolick J, Cohen M, Robins JM. Structural accelerated failure time models for survival analysis in studies with time-varying treatments. Pharmacoepidemiology and drug safety. 2005;14:477–491. doi: 10.1002/pds.1064. [DOI] [PubMed] [Google Scholar]
Joffe MM, Hoover DR, Jacobson LP, Kingsley L, Chmiel JS, Visscher BR, Robins JM. Estimating the effect of Zidovudine on Karposi’s sarcoma form observational data using a rank preserving structural failure time model. Statistics in Medicine. 1998;17:1073–1102. doi: 10.1002/(sici)1097-0258(19980530)17:10<1073::aid-sim789>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. Journal of the American Statistical Association. 1958;282:457–481. [Google Scholar]
Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data: 2nd Edition. New York: Wiley; 2002. [Google Scholar]
Keiding N, Filiberti M, Esbjerg S, Robins JM, Jacobsen N. The graft versus leukemia effect after bone marrow transplantation: A case study using structural nested failure time models. Biometrics. 1999;57:23–28. doi: 10.1111/j.0006-341x.1999.00023.x. [DOI] [PubMed] [Google Scholar]
Parast L, Tian L, Cai T. Landmark estimation of survival and treatment effect in a randomized clinical trial. Journal of the American Statistical Association. 2014;109:383–394. doi: 10.1080/01621459.2013.842488. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pearl J. Causality: Models, Reasoning, and Inference. New York: Cambridge; 2009. [Google Scholar]
Robins JM. The control of confounding by intermediate variables. Statistics in Medicine. 1988;8:679–701. doi: 10.1002/sim.4780080608. [DOI] [PubMed] [Google Scholar]
Robins JM, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. In: Jewell N, Dietz K, Farewell B, editors. AIDS Epidemiology - Methodological Issues. Boston: Birkhäuser; 1992. pp. 297–331. [Google Scholar]
Rubin DB. Bayesian inference for causal effect: The role of randomization. Annals of Statistics. 1978;6:34–58. [Google Scholar]
Schaubel DE, Wolfe RA, Port FK. A sequential stratification method for estimating the effect of a time-dependent experimental treatment in observational studies. Biometrics. 2006;62:910–917. doi: 10.1111/j.1541-0420.2006.00527.x. [DOI] [PubMed] [Google Scholar]
Schaubel DE, Wolfe RA, Sima CS, Merion RM. Estimating the effect of a time-dependent treatment by levels of an internal time-dependent covariate. Journal of the American Statistical Association. 2009;104:49–59. [Google Scholar]
Sharma P, Schaubel DE, Goodrich NP, Merion RM. Serum sodium and the survival benefit of liver transplantation. Liver Transplantation. 2015;21:308–313. doi: 10.1002/lt.24063. [DOI] [PMC free article] [PubMed] [Google Scholar]
Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics. 2007;34:70–85. [Google Scholar]
Van Houwelingen HC, Putter H. Dynamic prediction in clinical survival analysis. New York: CRC Press; 2012. [Google Scholar]
Van Houwelingen HC, Putter H. Comparison of stopped Cox regression with direct methods such as pseudo-values and binomial regression. Lifetime Data Analysis. 2015;21:180–196. doi: 10.1007/s10985-014-9299-3. [DOI] [PubMed] [Google Scholar]
Vock DM, Tsiatis AA, Davidian M, Laber EB, Tsuang WM, Finlen Copeland CA, Palmer SM. Assessing the causal effect of organ transplantation on the distribution of residual lifetime. Biometrics. 2013;69:820–829. doi: 10.1111/biom.12084. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang M, Schaubel DE. Estimating differences in restricted mean lifetime using observational data subject to dependent censoring. Biometrics. 2011;67:740–749. doi: 10.1111/j.1541-0420.2010.01503.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang M, Schaubel DE. Double-robust semiparametric estimator for differences in restricted mean lifetimes in observational studies. Biometrics. 2012;68:999–1009. doi: 10.1111/j.1541-0420.2012.01759.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zheng YY, Heagerty PJ. Partly conditional survival models for longitudinal data. Biometrics. 2005;61:379–391. doi: 10.1111/j.1541-0420.2005.00323.x. [DOI] [PubMed] [Google Scholar]
Zucker DM. Restricted mean life with covariates: Modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93:702–709. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Info

NIHMS794870-supplement-Supp_Info.pdf^{(885.9KB, pdf)}

[R1] Andersen PK, Gill RD. Cox’s regression model for counting processes: A large sample study. The Annals of Statistics. 1982;10:1100–1120. [Google Scholar]

[R2] Breslow NE. Contribution to the discussion of paper by D.R. Cox. Journal of the Royal Statistical Society Series B. 1972;34:216–217. [Google Scholar]

[R3] Chen P, Tsiatis AA. Causal inference on the difference of the restricted mean life between two groups. Biometrics. 2001;57:1030–1038. doi: 10.1111/j.0006-341x.2001.01030.x. [DOI] [PubMed] [Google Scholar]

[R4] Cox DR. Regression models and life tables (with Discussion) Journal of the Royal Statistical Society, Series B. 1972;34:187–200. [Google Scholar]

[R5] Cox DR. Partial likelihood. Biometrika. 1975;62:269–275. [Google Scholar]

[R6] Feng S, Goodrich NP, Bragg-Gresham JL, Dykstra DM, Punch JD, DebRoy MA, Greenstein SM, Merion RM. Characteristics associated with liver graft failure: The concept of a donor risk index. American Journal of Transplantation. 2006;6:783–790. doi: 10.1111/j.1600-6143.2006.01242.x. [DOI] [PubMed] [Google Scholar]

[R7] Feuer EJ, Hankey BF, Gaynor JJ, Wesley MN, Baker SG, Meyer JS. Graphical representation of survival curves associated with a binary non-reversible time dependent covariate. Statistics in Medicine. 1992;11:455–474. doi: 10.1002/sim.4780110408. [DOI] [PubMed] [Google Scholar]

[R8] Gong Q, Schaubel DE. Partly conditional estimation of the effect of a time-dependent factor in the presence of dependent censoring. Biometrics. 2013;69:338–347. doi: 10.1111/biom.12023. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Hernán MA, Cole SR, Margolick J, Cohen M, Robins JM. Structural accelerated failure time models for survival analysis in studies with time-varying treatments. Pharmacoepidemiology and drug safety. 2005;14:477–491. doi: 10.1002/pds.1064. [DOI] [PubMed] [Google Scholar]

[R10] Joffe MM, Hoover DR, Jacobson LP, Kingsley L, Chmiel JS, Visscher BR, Robins JM. Estimating the effect of Zidovudine on Karposi’s sarcoma form observational data using a rank preserving structural failure time model. Statistics in Medicine. 1998;17:1073–1102. doi: 10.1002/(sici)1097-0258(19980530)17:10<1073::aid-sim789>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]

[R11] Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. Journal of the American Statistical Association. 1958;282:457–481. [Google Scholar]

[R12] Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data: 2nd Edition. New York: Wiley; 2002. [Google Scholar]

[R13] Keiding N, Filiberti M, Esbjerg S, Robins JM, Jacobsen N. The graft versus leukemia effect after bone marrow transplantation: A case study using structural nested failure time models. Biometrics. 1999;57:23–28. doi: 10.1111/j.0006-341x.1999.00023.x. [DOI] [PubMed] [Google Scholar]

[R14] Parast L, Tian L, Cai T. Landmark estimation of survival and treatment effect in a randomized clinical trial. Journal of the American Statistical Association. 2014;109:383–394. doi: 10.1080/01621459.2013.842488. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Pearl J. Causality: Models, Reasoning, and Inference. New York: Cambridge; 2009. [Google Scholar]

[R16] Robins JM. The control of confounding by intermediate variables. Statistics in Medicine. 1988;8:679–701. doi: 10.1002/sim.4780080608. [DOI] [PubMed] [Google Scholar]

[R17] Robins JM, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. In: Jewell N, Dietz K, Farewell B, editors. AIDS Epidemiology - Methodological Issues. Boston: Birkhäuser; 1992. pp. 297–331. [Google Scholar]

[R18] Rubin DB. Bayesian inference for causal effect: The role of randomization. Annals of Statistics. 1978;6:34–58. [Google Scholar]

[R19] Schaubel DE, Wolfe RA, Port FK. A sequential stratification method for estimating the effect of a time-dependent experimental treatment in observational studies. Biometrics. 2006;62:910–917. doi: 10.1111/j.1541-0420.2006.00527.x. [DOI] [PubMed] [Google Scholar]

[R20] Schaubel DE, Wolfe RA, Sima CS, Merion RM. Estimating the effect of a time-dependent treatment by levels of an internal time-dependent covariate. Journal of the American Statistical Association. 2009;104:49–59. [Google Scholar]

[R21] Sharma P, Schaubel DE, Goodrich NP, Merion RM. Serum sodium and the survival benefit of liver transplantation. Liver Transplantation. 2015;21:308–313. doi: 10.1002/lt.24063. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics. 2007;34:70–85. [Google Scholar]

[R23] Van Houwelingen HC, Putter H. Dynamic prediction in clinical survival analysis. New York: CRC Press; 2012. [Google Scholar]

[R24] Van Houwelingen HC, Putter H. Comparison of stopped Cox regression with direct methods such as pseudo-values and binomial regression. Lifetime Data Analysis. 2015;21:180–196. doi: 10.1007/s10985-014-9299-3. [DOI] [PubMed] [Google Scholar]

[R25] Vock DM, Tsiatis AA, Davidian M, Laber EB, Tsuang WM, Finlen Copeland CA, Palmer SM. Assessing the causal effect of organ transplantation on the distribution of residual lifetime. Biometrics. 2013;69:820–829. doi: 10.1111/biom.12084. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] Zhang M, Schaubel DE. Estimating differences in restricted mean lifetime using observational data subject to dependent censoring. Biometrics. 2011;67:740–749. doi: 10.1111/j.1541-0420.2010.01503.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] Zhang M, Schaubel DE. Double-robust semiparametric estimator for differences in restricted mean lifetimes in observational studies. Biometrics. 2012;68:999–1009. doi: 10.1111/j.1541-0420.2012.01759.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] Zheng YY, Heagerty PJ. Partly conditional survival models for longitudinal data. Biometrics. 2005;61:379–391. doi: 10.1111/j.1541-0420.2005.00323.x. [DOI] [PubMed] [Google Scholar]

[R29] Zucker DM. Restricted mean life with covariates: Modification and extension of a useful survival analysis method. Journal of the American Statistical Association. 1998;93:702–709. [Google Scholar]

PERMALINK

Estimating the Average Treatment Effect on Survival Based on Observational Data and Using Partly Conditional Modeling

Qi Gong

Douglas E Schaubel

Summary

1. Introduction

2. Proposed Methods

2.1 Set-up and Notation

Figure 1.

2.2 Treatment Effect: Conditional and Average

2.3 Observed data: Notation and set-up

2.4 Assumed Models and Estimation Methods

2.4.1 Post-Treatment Survival

2.4.2 Survival in the Absence of Treatment

Partly Conditional Model

Calendar Time Cross-sections

Method of Gong and Schaubel (2013)

Figure 2.

Inverse weighting

Parameter Estimation for Model (12)

2.4.3 Conditional Treatment Effect

2.4.4 Average Treatment Effect

3. Asymptotic Properties

Theorem 1

4. Simulations

Table 1.

5. Application to Liver Transplant Data

Figure 3.

Table 2.

6. Discussion

Supplementary Material

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases