Estimation of the Cumulative Incidence Function Under Multiple Dependent and Independent Censoring Mechanisms

Judith J Lok; Shu Yang; Brian Sharkey; Michael D Hughes

doi:10.1007/s10985-017-9393-4

. Author manuscript; available in PMC: 2019 Apr 1.

Published in final edited form as: Lifetime Data Anal. 2017 Feb 25;24(2):201–223. doi: 10.1007/s10985-017-9393-4

Estimation of the Cumulative Incidence Function Under Multiple Dependent and Independent Censoring Mechanisms

Judith J Lok ^1,^*, Shu Yang ², Brian Sharkey ³, Michael D Hughes ⁴

PMCID: PMC5572121 NIHMSID: NIHMS855630 PMID: 28238045

Abstract

Competing risks occur in a time-to-event analysis in which a patient can experience one of several types of events. Traditional methods for handling competing risks data presuppose one censoring process, which is assumed to be independent. In a controlled clinical trial, censoring can occur for several reasons: some independent, others dependent. We propose an estimator of the cumulative incidence function in the presence of both independent and dependent censoring mechanisms. We rely on semi-parametric theory to derive an augmented inverse probability of censoring weighted (AIPCW) estimator. We demonstrate the efficiency gained when using the AIPCW estimator compared to a non-augmented estimator via simulations. We then apply our method to evaluate the safety and efficacy of two anti-HIV regimens in a randomized trial conducted by the AIDS Clinical Trial Group, ACTG A5095.

Keywords: Competing risks, Cumulative incidence function, Dependent censoring, Inverse probability weighting

1 Introduction

Competing risks often arise in medical studies. In the competing risks setting, as opposed to the standard survival analysis setting, the failure event is classified into one of several mutually exclusive types, and occurrence of one type of event precludes the occurrence of an event of another type. For example, if interest is in death due to cardiovascular disease, a patient experiencing death due to cancer would be precluded from experiencing the event of interest. Standard statistical methods for the analysis of competing risks data are described in, for example, Andersen et al. (1993); Kalbfleisch and Prentice (1980); Pintilie (2006).

We focus our attention on the cumulative incidence function (CIF), defined as the probability of a particular type of failure by time t, in an environment where other causes of failure may occur. There have been significant developments in statistical inference based on the CIF. Gray (1988) developed a class of tests for comparing CIFs of a particular type of failure among different groups. Lin (1997) constructed confidence bands for the CIF. Fine and Gray (1999) proposed a semi-parametric proportional hazards model for the subdistribution of a competing risk. Other work has focused on modeling the CIF directly, see for example Fine (2001); Bryant and Dignam (2004); Jeong and Fine (2006).

Previous work has assumed that follow-up of patients is subject to only one censoring process, which is assumed to be independent. However, a patient’s follow-up time may be censored for one of many reasons, some of which may be independent and some may be dependent. For example, so-called administrative censoring occurs when patients reach the end of a study, often inducing independent censoring (although, as noted in e.g. Lok and Hughes (2016), this censoring may be dependent if, for example, patients with characteristics which suggest that they might be harder to follow, are under-represented in the patient population enrolling early, or if sites with distinctly different patient populations start enrollment at different times). On the other hand, patients may prematurely drop out of a study prior to the study’s planned end of follow-up, which may induce dependent censoring if the patients who dropped out are not representative of the entire sample (e.g. sicker patients drop out of the study with higher probability than healthier patients). Thus, dependent censoring may more accurately reflect situations that arise in clinical studies. If dependent censoring is present, use of methods which assume independent censoring can lead to biased estimates of parameters or functions of interest. Rotnitzky et al. (2007) and Rotnitzky et al. (2009) estimated survival curves in the presence of dependent censoring. The purpose of this article is to adapt these methods to estimate the CIF for competing risk data in the presence of multiple censoring mechanisms, some of which may be dependent.

This paper is organized as follows. In section 2 we introduce the AIDS Clinical Trial Group (ACTG) A5095 randomized trial, which motivated the methodological developments. In section 3, we introduce our notation and data structure and in section 4, our assumptions. We introduce our estimator of the cumulative incidence function in section 5. In section 6, a simulation study is conducted to evaluate the performance of our estimator in finite samples. In section 7, we illustrate the application of our methods to an analysis of the A5095 study. We end with a discussion in section 8.

2 The ACTG A5095 Study: A Motivating Example

ACTG A5095 was a multicenter, randomized, double-blind, placebo-controlled clinical trial designed to compare the safety and efficacy of two 3-drug regimens versus a 4-drug regimen for initial treatment of HIV-1 infection (Gulick et al., 2004; Gulick et al., 2006). One 3-drug regimen was discontinued early on the recommendation of the data and safety monitoring board. Our focus is therefore on the comparison of the remaining 3-drug regimen (zidovudine, lamivudine, and efavirenz) and the 4-drug regimen (zidovudine, lamivudine, abacavir, and efavirenz).

The primary efficacy outcome measure in A5095 was the time to virologic failure (VF), defined as the time to the first of two successive HIV-1 RNA levels of 200 copies/mL of plasma or greater at or 16 weeks of follow-up. This was analyzed using an intention-to-treat analysis ignoring the changes from the randomized regimens which occurred in a reasonable proportion of study participants, often due to treatment limiting adverse events (TLAEs), sometimes due to treatment limiting other events (TLOEs) such as pregnancy and death. Clinically, there is therefore also considerable interest in comparing regimens with respect to regimen failure, with the competing outcome types of VF, TLAE, and TLOE. These are competing risks in that discontinuation of treatment due to a TLAE or TLOE precludes follow-up for VF while on that randomized treatment. However, some participants discontinue randomized treatment prior to the planned administrative end of follow-up of the study for reasons other than VF, TLAE, or TLOE, and there is often concern that this censoring of follow-up on study treatment may be dependent (Dudley et al. (1995); Ioannidis et al. (1997); Arici et al. (2002); Lanoy et al. (2006); Andersen et al. (2007); Krishnan et al. (2010); Fleishman et al. (2012)). For example, if patients who feel bad on treatment discontinue treatment and therefore leave the trial, censoring due to dropout might be dependent. Developing statistical methods that allow for dependent censoring is therefore important, particularly for checking the sensitivity of study conclusions to the handling of such discontinuations.

3 Notation and Goal

We consider a study that has staggered entry and maximum follow-up time ν*. Let T * and C* be non-negative time to event and time to censoring random variables, respectively. Let J ∈ {1, 2, … j*} denote the type of failure and R ∈ {1, 2, … r*} denote the reason for censoring. In order for our estimator to converge properly (specifically, to ensure that regularity condition (2), defined below, holds), we will need to discard data that were recorded after time ν = ν* − ε, where ε is a small positive number. We then define the event time as T = min(T *, ν) and the censoring time as C = min(C*, ν). We assume that we observe n independent and identically distributed copies of $O = (X, Δ, Δ J, (1 - Δ) R, {\bar{V}}_{X})$ where X = min(T, C), ∆ = 1(T ≤ C), and ${\bar{V}}_{t} = (V_{s} : s \leq t)$ where 1(·) is the indicator function taking value 1 if its argument is true and 0 otherwise, and V_s is the vector of covariates measured at time s. Note that when X = T we observe a patient’s full covariate history, ${\bar{V}}_{T}$ . We assume that either the type of failure, J, or the reason for censoring, R, is observed, but not both. Going forward, we include the type of failure or reason for censoring in ${\bar{V}}_{X}$ . Our goal is to estimate the cumulative incidence function on the interval [0, ν), defined as

F_{j} (t) = P (T \leq t, J = j),

in the presence of multiple reasons for censoring, some (or all) of which may be dependent.

4 Assumptions

We shall assume that for r = 1, …, r*:

λ_{C, r} (t | {\bar{V}}_{T}, T, J, T > t) = λ_{C, r} (t | {\bar{V}}_{T}, T > t)

(1)

where $λ_{C, r} (t | {\bar{V}}_{T}, T, J, T > t) = \lim_{h \to 0} \frac{P (t \leq C < t + h, R = r | C \geq t, {\bar{V}}_{T}, T, J, T > t)}{h}$ . In words, we assume that the hazard of censoring at time t for reason r, depends only on the measured variables up to time t and not on any future observed or unobserved variables, failure time, or failure type. When assuming (1), we make the non-identifiable assumption that data on all time-dependent and time-independent covariates that are predictive of both failure and censoring are available and included in ${\bar{V}}_{t}$ . Equation (1) is equivalent to assuming that the data are coarsened at random (CAR) (Heitjan and Rubin, 1991) or missing at random (MAR) (Rubin, 1976). For ease of notation we will denote $λ_{C, r} (t | {\bar{V}}_{t}, T > t)$ by $λ_{C, r} (t | {\bar{V}}_{t})$ .

We impose the regularity condition that for some constant ξ,

λ_{C, r} (t | {\bar{V}}_{T}, T, J, T > t) < ξ

(2)

with probability 1, for t in the interval [0, ν). Condition (2) would be false if we took ν = ν* since, with probability 1, all patients who are uncensored just before the administrative end of study ν* will be censored when the study ends (Rotnitzky et al., 2007).

When ${\bar{V}}_{t}$ is high dimensional, we cannot estimate $λ_{C, r} (t | {\bar{V}}_{t})$ non-parametrically due to the curse of dimensionality. Thus, we specify a model for $λ_{C, r} (t | {\bar{V}}_{t})$ . In this paper, we use Cox’s proportional hazards model, of the form:

λ_{C, r} (t | {\bar{V}}_{t}) = λ_{0, r} (t) \exp [γ_{r}^{'} w_{r} (t, {\bar{V}}_{t})],

(3)

where λ₀_,r(t) is an unknown, non-negative function of t, $w_{r} (t, {\bar{V}}_{t})$ is a specified function of t and ${\bar{V}}_{t}$ , and γ_r is an unknown parameter vector.

5 Estimation

5.1 Inverse Probability of Censoring Weighted (IPCW) Estimator

In the absence of censoring, F_j(t) could be estimated non-parametrically by solving

\sum_{i = 1}^{n} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} = 0.

Due to censoring we must modify this expression. The main idea underlying our estimator of F_j(t) is that of “non-uniform-pseudo-redistribution” to the right via inverse probability weighting of uncensored patients (Robins et al., 1995; Rotnitzky et al., 2009). That is, when a patient is censored, our estimator redistributes his or her weight among “similar” remaining uncensored patients. Following Rotnitzky et al. (2007), we define the inverse weights, $π (t | {\bar{V}}_{t}; Λ_{0})$ , as follows:

\begin{array}{l} π (t | {\bar{V}}_{t}; Λ_{0}) = \exp (- \int_{0}^{t} λ_{C} (u | {\bar{V}}_{T}, T, J, T > u) d u) \\ = \exp (- \int_{0}^{t} \sum_{r = 1}^{r^{*}} λ_{0, r} (u) \exp [γ_{r}^{'} w_{r} (u, {\bar{V}}_{u})] d u) \\ = \prod_{r = 1}^{r^{*}} \prod_{0 \leq u \leq t}^{} [1 - \exp [γ_{r}^{'} w_{r} (u, {\bar{V}}_{u})] d Λ_{0, r} (u)], \end{array}

with the cumulative baseline hazard, Λ₀_,r(t), defined as $Λ_{0, r} (t) = \int_{0}^{t} λ_{0, r} (s) d s$ .

We can find an estimate of F_j(t) as the solution to the following equation:

\sum_{i = 1}^{n} \frac{{\tilde{Δ}}_{i}}{π_{i} ({\tilde{T}}_{i} | {\bar{V}}_{i, {\tilde{T}}_{i}}; Λ_{0})} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} = 0

(4)

where $\tilde{T}$ is the minimum time such that 1(T ≤ t, J = j) is observed, i.e. $\tilde{T} = min (T, t)$ , and $\tilde{Δ} = 1 (\tilde{T} < C)$ . As shown in the appendix, equation (4) is an unbiased estimating equation for F_j(t) since under CAR, $\Pr (\tilde{Δ} = 1 | V_{\tilde{T}}) = π (\tilde{T} | {\bar{V}}_{\tilde{T}}; Λ_{0})$ . Note that if regularity condition (2) were false, we would be dividing by 0; this is called a positivity violation, where some patients have probability 0 of remaining uncensored, and IPCW fails (Robins et al., 1995).

Note that using ∆ and T instead of $\tilde{Δ}$ and $\tilde{T}$ will result in a less efficient estimator of F_j(t). Intuitively this makes sense: by using ∆ and T, censored patients would contribute nothing to equation (4). However, for those patients who were censored after time t, we know the value of 1(T ≤ t, J = j). As a result, we can use this information to construct a more efficient estimator.

Estimation of the inverse weights $π (t | {\bar{V}}_{t}; Λ_{0})$ first requires estimation of $γ = (γ_{1}, γ_{2}, \dots, γ_{r^{*}})$ and $Λ_{0} (t) = (Λ_{0, 1} (t), Λ_{0, 2} (t), \dots, Λ_{0, r^{*}} (t))$ . Because of the missing at random assumption we can estimate γ_r, which is the unknown parameter vector in equation (3), using standard software via a Cox proportional hazards model with time dependent covariates. To estimate γ_r, treat censoring due to reason r as a “failure” in the time dependent Cox proportional hazards model. All events and censoring due to causes other than r are treated as “censored” observations. This process is repeated to estimate all γ_r’s.

Once we have an estimate of γ, $\hat{γ}$ , we can estimate the cumulative baseline hazard, $Λ_{0} (t) = (Λ_{0, 1} (t), \dots, Λ_{0, r^{*}} (t))$ , using Breslow’s estimator (Andersen et al., 1993),

{\hat{Λ}}_{0, r} (t) = \int_{0}^{t} \frac{\sum_{i = 1}^{n} d N_{C_{i}, r} (u)}{\sum_{i = 1}^{n} \exp [{\hat{γ}}_{r}^{'} w_{r} (u, {\bar{V}}_{i, u})] I (X_{i} \geq u)}

(5)

where N_C,r(u) ≡ 1(C ≤ u, R = r, C ≤ T) is the counting process of observing censoring of type r. Next, $π (t | {\bar{V}}_{t}; Λ_{0})$ can be estimated by

\hat{π} (t | {\bar{V}}_{t}; {\hat{Λ}}_{0}) = \prod_{r = 1}^{r^{*}} \prod_{0 \leq s \leq t} [1 - \exp [{\hat{γ}}_{r}^{'} w_{r} (s, {\bar{V}}_{s})] d {\hat{Λ}}_{0, r} (s)] .

(6)

We can now find an estimate of F_j(t) as the solution to

\sum_{i = 1}^{n} \frac{{\tilde{Δ}}_{i}}{{\hat{π}}_{i} ({\tilde{T}}_{i} | {\bar{V}}_{i, {\tilde{T}}_{i}}; {\hat{Λ}}_{0})} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} = 0.

(7)

Denote the estimator solving equation (7) as ${\hat{F}}_{j}^{N A} (t)$ . ${\hat{F}}_{j}^{N A} (t)$ is known as a (non-augmented) inverse probability of censoring weighted (IPCW) estimator. We can use the non-parametric bootstrap to estimate the variance of ${\hat{F}}_{j}^{N A} (t)$ .

5.2 Augmented Inverse Probability of Censoring Weighted (AIPCW) Estimator

We can improve the efficiency of ${\hat{F}}_{j}^{N A} (t)$ by introducing an augmentation term (Tsiatis, 2006; Rotnitzky and Robins, 2005). Consider the solution to the following equation

\sum_{i = 1}^{n} {\frac{{\tilde{Δ}}_{i}}{π_{i} ({\tilde{T}}_{i} | {\bar{V}}_{{\tilde{T}}_{i}}; Λ_{0})} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} - A_{i} {F_{j} (t), γ, b (\cdot)}} = 0

(8)

where A_i{F_j(t), γ, b(·)} is the augmentation term and is defined as

A {F_{j} (t), γ, b (\cdot)} \equiv \sum_{r = 1}^{r^{*}} \int \frac{b (u, {\bar{V}}_{u})}{π (u - | {\bar{V}}_{u}; Λ_{0} (u))} d M_{C, r} (u)

(9)

where $b (u, {\bar{V}}_{u})$ is a user specified, left-continuous function of u and ${\bar{V}}_{u}$ , where $π (u - | {\bar{V}}_{u}; Λ_{0} (u))$ indicates the left-continuous version of π, and where

M_{C, r} (u) = N_{C, r} (u) - \int_{0}^{u} 1 (X \geq s) \exp {γ_{r}^{'} w_{r} (s, {\bar{V}}_{s})} d Λ_{0, r} (s) .

(10)

The process M_C,r(u) is a mean zero martingale with respect to the filtration $ℱ (u)$ , where we define $ℱ (u)$ as the increasing sequence of sigma algebras generated by $σ {1 (C \leq x), {\bar{V}}_{x}, 0 \leq x \leq u}$ . In the appendix, we show that equation (8) is an unbiased estimating equation for F_j(t).

For efficiency reasons (to be discussed below), we choose $b (u, {\bar{V}}_{u})$ as follows:

b (u, {\bar{V}}_{u}) = - E [{1 (T \leq t, J_{i} = j) - F_{j} (t)} | {\bar{V}}_{u -}, T \geq u] .

(11)

If we can consistently estimate $b (u, {\bar{V}}_{u})$ as defined in equation (11), we can find an estimate of F_j(t) as the solution to

\sum_{i = 1}^{n} {\frac{{\tilde{Δ}}_{i}}{{\hat{π}}_{i} ({\tilde{T}}_{i} | {\bar{V}}_{{\tilde{T}}_{i}}; {\hat{Λ}}_{0})} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} - {\hat{A}}_{i} (F_{j} (t), \hat{b} (u, {\bar{V}}_{u}), \hat{γ})} = 0,

(12)

with

\hat{A} (F_{j} (t), \hat{b} (u, {\bar{V}}_{u}), \hat{γ}) = \sum_{r = 1}^{r^{*}} \int_{0}^{\tilde{T}} \frac{- \hat{P} [(T \leq t, J = j) | {\bar{V}}_{u -}, T \geq u] + F_{j} (t)}{\hat{π} (u - | {\bar{V}}_{u}; {\hat{Λ}}_{0} (u))} d {\hat{M}}_{C, r} (u)

and

{\hat{M}}_{C, r} (u) = N_{C, r} (u) - \int_{0}^{u} \exp {{\hat{γ}}_{r}^{'} w_{r} (s, {\bar{V}}_{s})} d {\hat{Λ}}_{0, r} (s) .

The estimator that solves equation (12) is denoted by ${\hat{F}}_{j}^{A} (t)$ and is an augmented inverse probability of censoring weight (AIPCW) estimator of F_j(t). Again, we can use the non-parametric bootstrap to estimate the variance of ${\hat{F}}_{j}^{A} (t)$ .

If we can consistently estimate $b (u, {\bar{V}}_{u})$ as defined in (11), then ${\hat{F}}_{j}^{A} (t)$ would be doubly robust (Rotnitzky and Robins, 2005). That is, ${\hat{F}}_{j}^{A} (t)$ is consistent and asymptotically normal if the model for the censoring process, $π (t | {\bar{V}}_{t}; Λ_{0})$ , is correctly specified or the conditional model, $E [1 (T \leq t, J = j) | {\bar{V}}_{u -}, T \geq u]$ is correctly specified. Also, if both the model for the censoring process and the conditional model are correctly specified then ${\hat{F}}_{j}^{A} (t)$ is locally semi-parametric efficient (Robins and Rotnitzky, 1992; Tsiatis, 2006). The function $b (u, {\bar{V}}_{u})$ is not arbitrary and is chosen to equal $- E [1 (T \leq t, J = j) - F_{j} (t) | {\bar{V}}_{u -}, T \geq u]$ in order to gain the greatest efficiency among estimators that solve an equation such as equation (8) (Rotnitzky et al., 2007).

In practice, estimating the conditional expectation $E [{1 (T \leq t, J = j) - F_{j} (t)} | {\bar{V}}_{u -}, T \geq u]$ can be difficult, because the information considered in ${\bar{V}}_{u -}$ in equation (11) is time-dependent. Thus, in order to make the problem more tractable, one can instead consider estimating

E [{1 (T \leq t, J = j) - F_{j} (t)} | {\bar{V}}_{0}, T \geq u],

(13)

where ${\bar{V}}_{0}$ are the baseline covariates. One way to estimate (13) is as follows:

\hat{P} [(T \leq t, J = j) | {\bar{V}}_{0}, T \geq u] = \frac{\int_{u}^{t} \hat{S} (a - | {\bar{V}}_{0}) d {\hat{Λ}}_{j} (a | {\bar{V}}_{0})}{\hat{S} (u - | {\bar{V}}_{0})},

(14)

where ${\hat{Λ}}_{j} (a | {\bar{V}}_{0}) = {\hat{Λ}}_{0, j} (a) \exp {{\hat{β}}_{j}^{'} f ({\bar{V}}_{0})}$ and ${\hat{β}}_{j}^{'}$ is the estimated parameter vector obtained from fitting a Cox proportional hazards model to the overall time T in each treatment group:

λ_{T, j} (t | {\bar{V}}_{0}) = λ_{0, j} (t) \exp {β_{j}^{'} f ({\bar{V}}_{0})},

(15)

where $λ_{T, j} (t | {\bar{V}}_{0}) = \lim_{h \to 0} \frac{P (t \leq T < t + h, J = j | {\bar{V}}_{0}, T > t)}{h}$ , and λ₀_,j(t) is an unknown non-negative function of t. The cumulative baseline hazard, Λ₀_,j(a), can be estimated using Breslow’s estimator, and $\hat{S} (u | {\bar{V}}_{0}) = \exp {- \sum_{j} {\hat{Λ}}_{j} (u | {\bar{V}}_{0})}$ .

The resulting estimator of F_j(t) will not be doubly robust or locally semi-parametric efficient. However, as shown in the appendix, it is still a consistent and asymptotically normal semi-parametric estimator for F_j(t), if the censoring models (3) are correctly specified.

We compared our augmented and non-augmented IPCW estimators with the standard estimator of the cumulative incidence function which assumes independent censoring (Andersen et al., 1993), the Aalen-Johansen estimator ${\hat{F}}_{j}^{0} (t)$ :

{\hat{F}}_{j}^{0} (t) = \sum_{i | t_{j i} < t} d_{j i} n_{j i}^{- 1} \hat{S} (t_{j i}),

where d_ji is the number of people with failure type j at time t_ji, t_j₁ < t_j₂ < … < t_jk_j are the failure times for failures of type j, n_ji is the number of people at risk at time t_ji, and $\hat{S} (t)$ is the Kaplan-Meier estimator of the overall survival function (i.e. for all failure types combined). We calculated the standard error for ${\hat{F}}_{j}^{0} (t)$ using the delta method (Pintilie, 2006).

6 Simulation Study

We conducted a simulation study, which is a modification of the simulation study in Rotnitzky et al. (2007), in order to evaluate the performance of our estimators in finite samples.

We generated T * according to an exponential distribution with mean equal to 1.25. We assumed there were 2 event types, with probabilities equal to 0.35 for event type 1, and 0.65 for event type 2. Here, the type of failure is independent of T *. We then generated two covariates, one time-independent (V_TI) and one time-dependent (V_TD). The time-independent covariate was generated from a Bernoulli distribution with mean equal to 0.55. The time dependent covariate was a 1 × 3 row vector generated from a multivariate normal distribution with mean equal to (T*, T*, T*)′ (so as to create a dependence between V_TD and T*) and covariance equal to 0.7^|ⁱ⁻^j^|, where i, j = 1, 2, 3. This vector represents the values of V_TD(t) at times t₁ = 0, t₂ = 0.5, and t₃ = 1. V_TD(t) at t = 0 represents a baseline measurement. We assumed that the time-dependent variable remains constant between measurements.

We assumed the maximum follow up time was ν* = 1.35. We generated the censoring times for the independent censoring process, $C_{2}^{*}$ , according to a uniform(.55,1.35) distribution, to represent an administrative censoring process. We chose a uniform(.55,1.35) distribution as opposed to a uniform(0,1.35) distribution because many HIV clinical trials are designed to follow all patients for a pre-specified duration after the last patient is enrolled.

Next, we generated the censoring times for the dependent censoring process, $C_{1}^{*}$ , according to the following hazard rate: $λ_{c} (t | {\bar{V}}_{t}) = λ_{0} (t) \exp [γ' w (t, {\bar{V}}_{t})]$ , where $γ' w (t, {\bar{V}}_{t}) = γ_{1} V_{T I} + γ_{2} V_{T D} (t)$ with λ₀(t) = 1.5, γ₁ = 0.15 and γ₂ = 0.8. Generating the censoring times according to the time-dependent model was done sequentially. This was done because the hazard of censoring in the time interval t₁ = 0 to t₂ = 0.5 differs from the hazard of censoring in the next interval. The algorithm to construct $C_{1}^{*}$ for each simulated patient is as follows:

–
Generate a censoring time, C₁, compatible with the hazard function for the first time interval, [t₁, t₂), (where t₁ = 0) using the method of Bender et al. (2005). Note that this step is simply generating a censoring time that is compatible with a Cox proportional hazards model with time-independent covariates and constant baseline hazard.
–
If C₁ is contained within the first time interval [t₁, t₂), then set $C_{1}^{*} = C_{1}$ .
–
If C₁ is not contained within the interval [t₁, t₂), generate a censoring time, C₂, compatible with the model for the second time interval,[t₂, t₃).
–
If C₂ is contained within the interval [0, t₃ − t₂), then set $C_{1}^{*} = C_{2} + t_{2}$ .
–
If C₂ is not contained within the time interval [0, t₃ − t₂), then repeat the previous two steps for the last time interval [t₃, ∞).

Finally, C* was defined as $min (C_{1}^{*}, C_{2}^{*})$ .

We repeated the simulations with the same setting, but with λ₀(t) = 2.5 and λ₀ = 0.04 instead of λ₀(t) = 1.5, so as to introduce varying levels of dependent censoring. Furthermore, in order to evaluate the performance of the augmented and non-augmented IPCW estimators when the Aalen-Johansen estimator is consistent, that is, when censoring is independent, we also simulated a scenario with only independent censoring. In this scenario, the distribution of the outcomes was the same as before, but $C_{1}^{*}$ was uniform(0,1.35) and $C_{2}^{*}$ was uniform(0.55,1.35).

Practically, in order to ensure the regularity condition that $λ_{C, r} (t | {\bar{V}}_{T}, T, J, T > t) < ξ$ , we can treat the last observation (or last x observations) in each dataset as a failure (Robins and Rotnitzky, 1992) and set T = T * and C = C*. Alternatively, we could have chosen an arbitrary ε, set ν = ν* − ε, T = min(T *, ν) and C = min(C*, ν). Both methods would ensure that $λ_{C, r} (t | {\bar{V}}_{T}, T, J, T > t) < ξ$ with probability 1. Here, we treated the last 5 observations as failures; this was chosen ad hoc. We also examined treating only the last observation as a failure, and then taking the last 10 observations as failures; our results were not sensitive to this condition. We generated 1000 datasets with 250 patients each. We estimated F_j(t) at 8 time points: 0.05, 0.2, 0.35, 0.5, 0.65, 0.8, 0.95, 1.1.

In the first simulation scenario, the average censoring rate for the 1000 simulations was 55%, and the average dependent censoring rate was 33%. The results are presented in Table 1A. In the second simulation scenario, the average censoring rate for the 1000 simulations was 58%, and the average dependent censoring rate was 43%. The results are presented in Table 1B. In the third simulation scenario, the average censoring rate for the 1000 simulations was 50%, and the average dependent censoring rate was 15%. The results are presented in Table 2A. In these three simulation scenarios, the augmented IPCW estimator had bias very close to zero for each failure type. As expected in these scenarios, the Aalen-Johansen estimator had substantially larger bias and Mean Squared Error. The non-augmented IPCW estimator had reduced bias compared with the Aalen-Johansen estimator but still appeared to show some bias, particularly at later follow-up times and the highest percentage of dependent censoring. At earlier follow-up times and lower percentage of dependent censoring, the bias of both IPCW estimators did not substantially contribute to the Mean Squared Error. Also, for most time points the augmented IPCW estimator had smaller Mean Squared Error and hence was more efficient than the IPCW estimator, even though (11) was misspecified here. The efficiency gains increased over time. The gain obtained by augmenting the IPCW estimator was larger in the scenario with more dependent censoring.

Table 1.

Simulation Results under Dependent Censoring: scenarios 1 and 2, with more dependent censoring. rMSE is the square root mean squared error, and % Dec. in rMSE is the percentage decrease in rMSE of the augmented estimator (superscript A) compared to the non-augmented estimator (superscript NA). Superscript AJ refers to the Aalen-Johansen estimator.

Estimator

Time (t)

0.05

0.20

0.35

0.50

0.65

0.80

0.95

1.10

F₁(t) (truth)

0.014

0.052

0.085

0.114

0.142

0.166

0.186

0.204

F₂(t) (truth)

0.025

0.096

0.159

0.215

0.264

0.306

0.346

0.380

Table 1A: 33% dependent censoring

{\hat{F}}_{1}^{A} (t)

Bias

0.000

0.001

0.002

0.007

0.015

0.018

0.022

0.024

0.027

0.031

0.037

rMSE

0.007

0.015

0.018

0.022

0.024

0.027

0.031

0.037

% Dec. in rMSE

0.48

2.31

5.44

8.52

14.54

17.55

22.51

26.60

{\hat{F}}_{1}^{N A} (t)

Bias

0.001

0.002

0.003

0.004

0.005

0.008

0.012

0.016

0.007

0.015

0.019

0.024

0.027

0.031

0.036

0.044

rMSE

0.007

0.015

0.019

0.024

0.027

0.032

0.038

0.047

{\hat{F}}_{1}^{A J} (t)

Bias

0.001

0.004

0.009

0.015

0.020

0.026

0.031

0.034

0.008

0.015

0.020

0.024

0.026

0.029

0.033

0.037

rMSE

0.008

0.016

0.022

0.028

0.033

0.039

0.045

0.050

{\hat{F}}_{2}^{A} (t)

Bias

0.000

0.001

0.000

0.001

0.000

0.001

0.000

0.010

0.018

0.023

0.027

0.030

0.035

0.042

0.049

rMSE

0.010

0.018

0.023

0.027

0.030

0.035

0.042

0.049

% Dec. in rMSE

0.68

2.77

7.04

12.71

23.58

37.06

36.82

47.35

{\hat{F}}_{2}^{N A} (t)

Bias

0.000

0.002

0.003

0.006

0.009

0.014

0.019

0.027

0.010

0.018

0.025

0.029

0.036

0.045

0.054

0.067

rMSE

0.010

0.018

0.025

0.030

0.037

0.047

0.057

0.072

{\hat{F}}_{2}^{A J} (t)

Bias

0.001

0.007

0.015

0.026

0.036

0.047

0.055

0.061

0.010

0.019

0.025

0.029

0.033

0.037

0.042

0.047

rMSE

0.010

0.020

0.029

0.039

0.049

0.060

0.069

0.077

Table 1B: 43% dependent censoring

{\hat{F}}_{1}^{A} (t)

Bias

0.000

0.001

0.002

0.003

0.007

0.015

0.019

0.022

0.025

0.029

0.034

0.045

rMSE

0.007

0.015

0.019

0.022

0.025

0.029

0.034

0.045

% Dec. in rMSE

0.64

3.50

8.52

12.67

16.91

22.22

26.21

27.80

{\hat{F}}_{1}^{N A} (t)

Bias

0.001

0.002

0.003

0.006

0.008

0.012

0.017

0.025

0.008

0.015

0.020

0.025

0.029

0.036

0.044

0.057

rMSE

0.008

0.015

0.020

0.026

0.030

0.038

0.047

0.062

{\hat{F}}_{1}^{A J} (t)

Bias

0.001

0.006

0.012

0.020

0.027

0.034

0.040

0.044

0.008

0.016

0.022

0.026

0.028

0.032

0.036

0.041

rMSE

0.008

0.017

0.025

0.033

0.039

0.047

0.054

0.060

{\hat{F}}_{2}^{A} (t)

Bias

0.000

0.001

0.000

0.001

0.002

0.003

0.010

0.018

0.024

0.028

0.032

0.038

0.036

0.045

rMSE

0.010

0.018

0.024

0.028

0.032

0.038

0.036

0.045

% Dec. in rMSE

0.68

5.78

12.11

20.42

38.02

51.02

52.63

53.60

{\hat{F}}_{2}^{N A} (t)

Bias

0.000

0.003

0.004

0.010

0.014

0.021

0.031

0.044

0.010

0.019

0.027

0.032

0.043

0.055

0.069

0.086

rMSE

0.010

0.019

0.027

0.034

0.045

0.059

0.076

0.097

{\hat{F}}_{2}^{A J} (t)

Bias

0.001

0.010

0.021

0.036

0.050

0.063

0.072

0.078

0.010

0.020

0.027

0.032

0.035

0.041

0.047

0.051

rMSE

0.010

0.022

0.034

0.048

0.061

0.075

0.086

0.093

Open in a new tab

Table 2.

Simulation Results under less or no Dependent Censoring. SD is the standard deviation, rMSE is the square root mean squared error, and % Dec. in rMSE is the percentage decrease in rMSE of the augmented estimator (superscript A) compared to the non-augmented estimator (superscript NA). Superscript AJ refers to the Aalen-Johansen estimator.

Estimator

Time (t)

0.05

0.20

0.35

0.50

0.65

0.80

0.95

1.10

F₁(t) (truth)

0.014

0.052

0.085

0.114

0.142

0.166

0.186

0.204

F₂(t) (truth)

0.025

0.096

0.159

0.215

0.264

0.306

0.346

0.380

Table 2A: 15% dependent censoring

{\hat{F}}_{1}^{A} (t)

Bias

0.000

0.001

0.002

0.008

0.014

0.018

0.021

0.022

0.025

0.027

0.030

rMSE

0.008

0.014

0.018

0.021

0.022

0.025

0.027

0.030

% Dec. in rMSE

0.12

0.56

1.43

2.00

2.61

4.38

8.17

10.02

{\hat{F}}_{1}^{N A} (t)

Bias

0.001

0.002

0.003

0.004

0.006

0.008

0.015

0.018

0.021

0.023

0.026

0.029

0.032

rMSE

0.008

0.015

0.018

0.021

0.023

0.026

0.029

0.033

{\hat{F}}_{1}^{A J} (t)

Bias

0.001

0.002

0.004

0.006

0.008

0.011

0.014

0.016

0.008

0.015

0.019

0.022

0.024

0.026

0.029

0.031

rMSE

0.008

0.015

0.019

0.023

0.025

0.028

0.032

0.035

{\hat{F}}_{2}^{A} (t)

Bias

0.000

−0.001

0.010

0.018

0.023

0.026

0.029

0.031

0.034

0.037

rMSE

0.010

0.018

0.023

0.026

0.029

0.031

0.034

0.037

% Dec. in rMSE

0.13

0.79

1.80

3.18

5.98

7.03

12.81

18.52

{\hat{F}}_{2}^{N A} (t)

Bias

0.000

0.001

0.000

0.002

0.004

0.005

0.007

0.010

0.018

0.023

0.027

0.030

0.034

0.039

0.043

rMSE

0.010

0.018

0.023

0.027

0.030

0.034

0.039

0.044

{\hat{F}}_{2}^{A J} (t)

Bias

0.000

0.002

0.005

0.009

0.014

0.019

0.022

0.025

0.010

0.019

0.023

0.028

0.029

0.033

0.036

0.040

rMSE

0.010

0.019

0.024

0.029

0.033

0.038

0.042

0.047

Table 2B: Independent censoring

{\hat{F}}_{1}^{A} (t)

Bias

0.000

0.007

0.014

0.018

0.022

0.025

0.028

0.035

0.044

rMSE

0.007

0.014

0.018

0.022

0.025

0.028

0.035

0.044

% Dec. in rMSE

0.07

−0.13

0.40

0.66

0.48

0.40

−0.91

6.23

{\hat{F}}_{1}^{N A} (t)

Bias

0.000

0.007

0.014

0.018

0.022

0.025

0.029

0.035

0.047

rMSE

0.007

0.014

0.018

0.022

0.025

0.029

0.035

0.047

{\hat{F}}_{1}^{A J} (t)

Bias

0.000

0.006

0.014

0.018

0.022

0.024

0.028

0.034

0.043

rMSE

0.006

0.014

0.018

0.022

0.024

0.028

0.034

0.043

{\hat{F}}_{2}^{A} (t)

Bias

0.000

0.010

0.019

0.024

0.028

0.033

0.038

0.046

0.060

rMSE

0.010

0.019

0.024

0.028

0.033

0.038

0.046

0.060

% Dec. in rMSE

−0.06

0.50

1.32

0.99

0.75

−0.12

0.45

9.23

{\hat{F}}_{2}^{N A} (t)

Bias

0.000

0.010

0.019

0.024

0.029

0.033

0.038

0.046

0.065

rMSE

0.010

0.019

0.024

0.029

0.033

0.038

0.046

0.065

{\hat{F}}_{2}^{A J} (t)

Bias

0.000

0.010

0.019

0.024

0.028

0.032

0.038

0.045

0.058

rMSE

0.010

0.019

0.024

0.028

0.032

0.038

0.045

0.058

Open in a new tab

Table 2B displays the results for the scenario where censoring was independent. In this simulation scenario, the average censoring rate for the 1000 simulations was 48%. In this scenario, all three estimators are consistent. As can be seen in Table 2B, IPCW and augmented IPCW hardly inflated the Mean Squared Errors. This indicates that adjusting for dependent censoring can be done without paying a price in the form of a substantial increase in precision.

7 Analysis of Competing Risks in ACTG A5095

Our event of interest is failure of the initial treatment regimen and can be classified as one of three types: 1) virologic failure (VF), 2) discontinuation of initial treatment due to treatment limiting adverse event (TLAE), or 3) discontinuation of initial treatment due to treatment limiting other event (TLOE). TLOEs included required discontinuation of study treatment because of the need for medications which could not be taken with study treatment, clinical events, pregnancy, and death. In addition to administrative censoring, arising when the study closes to follow-up, patients may discontinue randomized treatment for reasons other than VF, TLAE, or TLOE (for example, loss of follow-up). Supposing that discontinuing treatment for other reasons could in principle be avoided, our aim is to describe what might happen in the setting where treatment is only discontinued because of VF, TLAE, or TLOE. Therefore, we censor patients if they discontinue treatment for reasons other than VF, TLAE, or TLOE. This may lead to dependent censoring.

A total of 758 patients were randomized, including 382 patients who received the 4-drug regimen and 376 patients who received the 3-drug regimen. Of the 758 patients, 146 had failure of their initial randomized regimen due to virologic failure (VFs), 58 discontinued their initial regimen due to treatment-limiting adverse events (TLAEs), and 26 discontinued their initial treatment due to treatment limiting other events (TLOEs, including 5 deaths). Of the remaining 528 patients who were censored, 432 patients were still on their initial randomized regimen at completion of the study, and so were administratively censored. The remaining 96 patients were non-administratively censored, mainly due to loss of follow-up while on their initial randomized regimen. The types of failure among the two regimens as well as the types of censoring are presented in Table 3.

Table 3.

Types of Failure and Censoring

Regimen	VF	TLAE	Admin. Censoring	Non-Admin Censoring	TLOE
4-drug (N=382)	65	35	213	56	13
3-drug (N=376)	81	23	219	40	13

Open in a new tab

We based the model for non-administrative censoring on a literature review of variables that might predict losses to follow-up in HIV-infected patients (Dudley et al. (1995); Ioannidis et al. (1997); Arici et al. (2002); Lanoy et al. (2006); Andersen et al. (2007); Krishnan et al. (2010); Fleishman et al. (2012)), and used the same set of variables for administrative censoring. Many of these variables have also been associated with the competing outcomes of interest, and so dependent censoring is a reasonable concern. We therefore used the following variables in equation (3), for the hazard of censoring, $w_{r} (t, {\bar{V}}_{t})$ :

{CD4 {Count}_{t} {,Log Viral Load}_{0}, Sex, Age, IV drug use, Black, Hispanic},

for r = 1, 2 (administrative and non-administrative censoring), where CD4 count is a time dependent variable coded as 1 for counts ≤ 200 and 0 otherwise; Log Viral Load is the log₁₀ HIV viral load in the blood; Sex is coded as 1 for males, 0 for females; IV drug use is coded as 1 for patients who reported ever using illicit intravenous (IV) drugs, and 0 otherwise; Black and Hispanic are the indicator variables for patients of black non-Hispanic and Hispanic race/ethnicity, respectively, with reference category white non-Hispanic. Table 4 presents the parameter estimates for the two censoring models, one for administrative and one for non-administrative censoring. We found no significant predictors of administrative censoring though there was some evidence of an increased odds of administrative censoring among men (p=0.07) and Hispanic patients (p=0.06) for the 3-drug regimen. Given that, due to randomization, treatment and administrative censoring are unrelated, this could well be a statistical artifact. Reported use of IV drugs was highly predictive of non-administrative censoring in both treatment arms (p = 0.006 for the 3-drug regimen, p=0.02 for the 4-drug regimen). Hispanic race/ethnicity was marginally significantly associated with an increased odds of non-administrative censoring in those on the 3-drug regimen (p=0.08), and male sex was marginally significantly associated with a reduced odds of non-administrative censoring in those on the 4-drug regimen (p=0.06). Thus, the assumption of independent censoring is violated if the time or type of event depends on, for example, use of IV drugs.

Table 4.

Parameter Estimates (95%-Confidence Intervals) for the Models for Administrative and Non-Administrative Censoring.

Covariate	3-drug Regimen		4-drug Regimen
Covariate	Admin. Censoring	Non-Admin. Censoring	Admin. Censoring	Non-Admin. Censoring
Sex (male vs female)	0.33 (−0.03, 0.69) p=0.07	0.31 (−0.58, 1.19) p=0.50	−0.03 (−0.41, 0.35) p=0.87	−0.57 (−1.16, 0.03) p=0.06
IV Drug Use (ever vs never)	−0.47 (−1.18, 0.24) p=0.20	1.18 (0.34, 2.02) p=0.006	−0.05 (−0.50, 0.41) p=0.85	0.83 (0.14, 1.51) p=0.02
Age ≤ 30 (vs > 30 years)	0.18 (−0.16, 0.52) p=0.30	0.54 (−0.14, 1.21) p=0.12	−0.07 (−0.40, 0.25) p=0.65	0.28 (−0.31, 0.87) p=0.36
Hispanic (vs white, non-Hispanic)	0.36 (−0.003, 0.72) p=0.06	0.76 (−0.08, 1.59) p=0.08	0.13 (−0.23, 0.48) p=0.49	0.34 (−0.34, 1.02) p=0.33
Black, non-Hispanic (vs white, non-Hispanic)	−0.13 (−0.45, 0.19) p=0.42	0.56 (−0.20, 1.32) p=0.15	0.22 (−0.10, 0.55) p=0.18	0.10 (−0.54, 0.74) p=0.76
Log Viral Load (per 1 log₁₀ copies/ml)	−0.05 (−0.25, 0.15) p=0.63	−0.12 (−0.56, 0.32) p=0.59	−0.03 (−0.21, 0.16) p=0.79	−0.24 (−0.61, 0.13) p=0.21
Time-dependent CD4 count ≤ 200	−0.01 (−0.64, 0.61) p=0.97	−0.45 (−1.37, 0.47) p=0.34	0.19 (−0.43, 0.82) p=0.55	0.31 (−0.37, 0.99) p=0.37

Open in a new tab

The estimated cumulative incidence curves for VF and TLAE are shown in Figure 1. Since there were only 26 TLOEs, we do not present the cumulative incidence curve for TLOE. The non-augmented IPCW estimator was essentially identical to the augmented IPCW estimator and is not shown here. Despite the fact that there were strong predictors of censoring, and the concern that these might also be predictors of the competing outcomes of interest, there was little difference between the standard estimate, ${\hat{F}}_{j}^{0} (t)$ , and the augmented IPCW estimate. For this application, of considerable importance, the conclusions that might be drawn from the study have been shown to be not sensitive to potentially dependent censoring, a concern that was well motivated by the fact that some predictors of loss to follow-up are also predictors of the outcome of interest. Comparing treatments, there were general trends for higher rates of VF but lower rates of TLAE for the 3-drug versus the 4-drug regimen.

Cumulative Incidence Curves, by Regimen.

The standard errors of our non-augmented and augmented IPCW estimators as well as the standard error for ${\hat{F}}_{j}^{0} (t)$ are shown in Figure 2. The standard errors for ${\hat{F}}_{j}^{N A} (t)$ and ${\hat{F}}_{j}^{A} (t)$ were obtained using the non-parametric bootstrap with 500 bootstrap samples. The difference in standard errors between the augmented and non-augmented IPCW estimators is generally small, suggesting that for this application, there is only a slight efficiency advantage to using the more complicated augmented estimator. Furthermore, for this application, the standard error of the augmented IPCW estimator is very similar to that of the Aalen-Johansen estimator. This is particularly important, because our estimator remains valid in the presence of dependent censoring, which is not so for the Aalen-Johansen estimator. Thus, we can rely on the conclusions even though the assumption of independent censoring may be violated.

Standard Errors of ${\hat{F}}_{j} (t)$ , by Regimen and Type of Failure, using Bootstrap Variance Estimation.

8 Discussion

In this paper we have developed a method to estimate the cumulative incidence function with multiple types of censoring. The use of methods of analysis which more appropriately address the challenges of competing risk data and potentially dependent censoring may be very valuable in understanding the relative balance of safety outcomes (e.g. TLAE) and efficacy outcomes (e.g. VF), and how this balance evolves with time on treatment and compares among treatments. Such analyses will likely be important complements to analyses of composite outcome measures (e.g. time to first of TLAE or VF) which can be difficult to interpret because they are often complex mixes of efficacy and safety outcomes. In addition, being able to handle dependent censoring in statistical analyses is important, where assessment of the sensitivity of the conclusions to the handling of different reasons for censoring should be part of standard analyses.

We investigated four simulation scenarios. When censoring was dependent, the augmented IPCW estimator had substantially reduced bias as compared to the Aalen-Johansen estimator, which assumes independent censoring; the IPCW estimator was in between, with small bias for earlier time points and larger bias later, especially where there was more dependent censoring. The decrease in root Mean Squared Error obtained from augmenting the IPCW estimator (as opposed to using the non-augmented IPCW estimator) increased with time and with percentage of dependent censoring, and was substantial for later time points and larger percentages of dependent censoring. When there was less dependent censoring, the IPCW estimator had comparable standard errors as the Aalen-Johansen estimator, and less bias; and the augmented IPCW estimator outperformed the IPCW estimator in terms of root MSE by a smaller percentage. In the scenario with independent censoring, where all three estimators are consistent, IPCW and augmented IPCW did not substantially inflate standard errors, as compared to the Aalen-Johansen estimator.

In our application, the results were not appreciably changed by allowing for the possibility of dependent censoring, but there may be other applications where doing so is important. Furthermore, this analysis provides more confidence in the resulting estimates since it incorporates the possibility of dependent censoring. As shown, this does not need to be at the expense of larger standard errors.

Even if there are only baseline covariates which predict censoring, IPCW is sometimes preferable over basing the analysis on a Cox proportional hazards model for the cause specific hazards. The reason is that we don’t need to assume a semi-parametric Cox model for the cause specific hazards of the competing outcomes; we only need to specify (3), which is automatically correctly specified if independent censoring does turn out to hold.

One direction of possible future research is to relax the assumption that data on all time-dependent and independent covariates that are prognostic for both failure and censoring are recorded and available. This future research would rely on sensitivity analyses in order to handle the non-ignorable missingness.

Acknowledgments

The authors would like to thank Andrea Rotnitzky for her review and valuable comments on this paper. We are grateful to the ACTG for providing data used in the motivating application. This work was partially supported by grants AI024643, AI068634, and AI007358 from the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

A Appendix

Equation (3) is an unbiased estimating equation for F_j(t) since

\begin{array}{l} E (\frac{\tilde{Δ}}{π (\tilde{T} | {\bar{V}}_{\tilde{T}}; Λ_{0})} {1 (T \leq t, J = j) - F_{j} (t)}) \\ = E (E {\frac{\tilde{Δ}}{π (\tilde{T} | {\bar{V}}_{\tilde{T}}; Λ_{0})} {1 (T \leq t, J = j) - F_{j} (t)} | {\bar{V}}_{T}, T}) \\ = E (\frac{E [\tilde{Δ} | {\bar{V}}_{T}, T]}{π (\tilde{T} | {\bar{V}}_{\tilde{T}}; Λ_{0})} {1 (T \leq t, J = j) - F_{j} (t)}) \\ = E (1 \cdot {1 (T \leq t, J = j) - F_{j} (t)}) \\ = 0. \end{array}

The third equality follows from the fact that $E (\tilde{Δ} | {\bar{V}}_{T}, T) = P r [\tilde{Δ} = 1 | {\bar{V}}_{T}, T]$ which under our coarsening at random assumption equals $π (\tilde{T} | {\bar{V}}_{\tilde{T}}; Λ_{0})$ . From this it is clear that equation (3) is an unbiased estimating equation for F_j(t).

One can use the results in Robins and Rotnitzky (1992) to prove that with one type of censoring the solution to

\sum_{i = 1}^{n} {\frac{{\tilde{Δ}}_{i}}{π_{i} ({\tilde{T}}_{i} | {\bar{V}}_{{\tilde{T}}_{i}}; Λ_{0})} {1 (T_{i} \leq t, J_{i} = j) - F_{j} (t)} - A_{i} {F_{j} (t), γ, b (\cdot)}} = 0

(16)

with

A_{i} {F_{j} (t), γ, b (\cdot)} \equiv \int \frac{b (u, {\bar{V}}_{u})}{π (u - | {\bar{V}}_{u}; Λ_{0} (u))} d M_{C} (u)

(17)

and $b (u, {\bar{V}}_{u})$ the same as in equation (9), is a doubly robust, locally efficient estimator for F_j(t). Here, dM_C(u) = dN_C(u) − dΛ(u). In our situation, with multiple types of censoring, it is easy to show that $d M_{C} (u) = \sum_{r = 1}^{r *} d M_{C, r} (u)$ , which leads to our augmentation term in (8). Note that with one type of failure and one type of censoring, our method reduces to that of (Rotnitzky and Robins, 2005).

Now, it can be shown that for each r, since

\frac{b (u, {\bar{V}}_{u})}{π (u - | {\bar{V}}_{u}; Λ_{0} (u))}

is a bounded and predictable process, defined on the same filtration as M_C,r(u),

\int \frac{b (u, {\bar{V}}_{u})}{π (u - | {\bar{V}}_{u}; Λ_{0} (u))} d M_{C, r} (u)

is a mean zero martingale (Fleming and Harrington, 1991, Thm 1.5.1). Note that the left-continuous versions of $b (u, {\bar{V}}_{u})$ and $π (u | {\bar{V}}_{u}; Λ_{0} (u))$ are needed here. As a result

\sum_{r = 1}^{r^{*}} \int \frac{b (u, {\bar{V}}_{u})}{π (u - | {\bar{V}}_{u}; Λ_{0} (u))} d M_{C, r} (u)

is also a mean zero martingale. Thus, A_i{F_j(t), γ, b(·)} has mean zero. Since A_i{F_j(t), γ, b(·)} has mean zero, it also follows trivially that equation (7) is an unbiased estimating equation for F_j(t).

Footnotes

Ethical approval

Analysis of data from ACTG A5095 was approved by the Institutional Review Board of the Harvard School of Public Health. Informed consent was obtained from all individual participants included in the study.

Contributor Information

Judith J. Lok, Department of Biostatistics, Harvard School of Public Health.

Shu Yang, Department of Statistics, North Carolina State University.

Brian Sharkey, Incyte, Wilmington, USA.

Michael D. Hughes, Department of Biostatistics, Harvard School of Public Health

References

Andersen JW, Fass R, van der Horst C. Factors associated with early study discontinuation in AACTG studies, DACS 200. Contemporary Clinical Trials. 2007;28:583–592. doi: 10.1016/j.cct.2007.02.002. [DOI] [PubMed] [Google Scholar]
Andersen PK, Borgan Ø, Gill RD, Keiding N. Statistical models based on counting processes. New York: Springer–Verlag; 1993. (Springer series in statistics). [Google Scholar]
Arici C, Ripamonti D, Maggiolo F, Rizzi M, Finazzi MG, Pezzotti P, Suter F. Factors associated with the failure of HIV-positive persons to return for scheduled medical visits. HIV Clinical Trials. 2002;3(1):52–57. doi: 10.1310/2XAK-VBT8-9NU9-6VAK. [DOI] [PubMed] [Google Scholar]
Bender R, Augustin T, Blettner M. Generating survival times to simulate cox proportional hazards models. Statistics in Medicine. 2005;24(11):1713–1723. doi: 10.1002/sim.2059. [DOI] [PubMed] [Google Scholar]
Bryant J, Dignam JJ. Semiparametric models for cumulative incidence functions. Biometrics. 2004;60(1):182–190. doi: 10.1111/j.0006-341X.2004.00149.x. [DOI] [PubMed] [Google Scholar]
Dudley J, Jin S, Hoover D, Metz S, Thackeray R, Chmiel J. The Multicenter AIDS Cohort Study: Retention after 9 1/2 years. American Journal of Epidemiology. 1995;142(3):323–330. doi: 10.1093/oxfordjournals.aje.a117638. [DOI] [PubMed] [Google Scholar]
Fine JP. Regression modeling of competing crude failure probabilities. Biostatistics. 2001;2(1):85–97. doi: 10.1093/biostatistics/2.1.85. [DOI] [PubMed] [Google Scholar]
Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94(446):496–497. [Google Scholar]
Fleishman JA, Yehia BR, Moore RD, Korthuis PT, Gebo KA, for the HIV Reseach Network Establishment, retention, and loss to follow-up in outpatient HIV care. Journal of Acquired Immune Deficiency Syndrom. 2012;60(3):249–259. doi: 10.1097/QAI.0b013e318258c696. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fleming T, Harrington D. Counting processes and survival analysis. Vol. 8. Wiley Online Library; 1991. [Google Scholar]
Gray RJ. A class of k-sample tests for comparing the cumulative incidence of a competing risk. The Annals of Statistics. 1988;16(3):1141–1154. [Google Scholar]
Gulick R, Ribaudo H, Shikuma C, Lalama C, Schackman B, Meyer W, III, Acosta E, Schouten J, Squires K, Pilcher C, et al. Three-vs four-drug antiretroviral regimens for the initial treatment of hiv-1 infection. JAMA: the Journal of the American Medical Association. 2006;296(7):769–781. doi: 10.1001/jama.296.7.769. [DOI] [PubMed] [Google Scholar]
Gulick R, Ribaudo H, Shikuma C, Lustgarten S, Squires K, Meyer W, III, Acosta E, Schackman B, Pilcher C, Murphy R, et al. Triplenucleoside regimens versus efavirenz-containing regimens for the initial treatment of hiv-1 infection. New England Journal of Medicine. 2004;350(18):1850–1861. doi: 10.1056/NEJMoa031772. [DOI] [PubMed] [Google Scholar]
Heitjan DF, Rubin DB. Ignorability and coarse data. The Annals of Statistics. 1991;19(4):2244–2253. [Google Scholar]
Ioannidis JPA, Bassett R, Hughes MD, Volberding PA, Sacks HS, Lau J. Predictors and impact of patients lost to follow-up in a long-term randomized trial of immediate versus deferred antiretroviral treatment. Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology. 1997;16(1):22–30. doi: 10.1097/00042560-199709010-00004. [DOI] [PubMed] [Google Scholar]
Jeong JH, Fine J. Direct parametric inference for the cumulative incidence function. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2006;55(2):187–200. [Google Scholar]
Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. Vol. 5. Wiley; New York: 1980. [Google Scholar]
Krishnan S, Wu K, Smurzynski M, Bosch RJ, Benson CA, Collier AC, Klebert MK, Feinberg J, Koletar SL, for the ALLRT/A5001 team. Incidence rate of and factors associated with loss to follow-up in a longitudinal cohort of antiretroviral-treated HIV-infected persons: An AIDS Clinical Trials Group (ACTG) Longitudinal Linked Randomized Trials (ALLRT) analysis. HIV Clinical Trials. 2010;12(4):190–200. doi: 10.1310/HCT1204-190. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lanoy E, Mary-Krause M, Tattevin P, Dray-Spira R, Duvidier C, Fischer P, Obadia Y, Lert F, Costagliola D, the Clinical Epidemiology Group of the French Hospital Database on HIV infection. Predicators identified for losses to follow-up among HIV-seropositive patients. Journal of Clinical Epidemiology. 2006;59:829–835. doi: 10.1016/j.jclinepi.2005.11.024. [DOI] [PubMed] [Google Scholar]
Lin DY. Non-parametric inference for cumulative incidence functions in competing risks studies. Statistics in Medicine. 1997;16(8):901–910. doi: 10.1002/(sici)1097-0258(19970430)16:8<901::aid-sim543>3.0.co;2-m. [DOI] [PubMed] [Google Scholar]
Lok JJ, Hughes MD. Evaluating predictors of competing risk outcomes when censoring depends on time-dependent covariates, with application to safety and efficacy of HIV treatment. 2016 doi: 10.1002/sim.6852. Accepted by Statistics in Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pintilie M. Competing risks: a practical perspective. John Wiley & Sons; New York: 2006. [Google Scholar]
Robins J, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. Aids Epidemiology, Methodological issues. 1992:297–331. [Google Scholar]
Robins JM, Rotnitzky A, Zhao LP. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association. 1995;90(429):106–121. [Google Scholar]
Rotnitzky A, Bergesio A, Farall A. Analysis of quality-of-life adjusted failure time data in the presence of competing, possibly informative, censoring mechanisms. Lifetime Data Analysis. 2009;15(1):1–23. doi: 10.1007/s10985-008-9088-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rotnitzky A, Farall A, Bergesio A, Scharfstein D. Analysis of failure time data under competing censoring mechanisms. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2007;69(3):307–327. [Google Scholar]
Rotnitzky A, Robins J. Inverse probability weighting in survival analysis. Encyclopedia of Biostatistics. 2005;4 [Google Scholar]
Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–592. [Google Scholar]
Tsiatis AA. Semiparametric theory and missing data. Springer Verlag; 2006. [Google Scholar]

[R1] Andersen JW, Fass R, van der Horst C. Factors associated with early study discontinuation in AACTG studies, DACS 200. Contemporary Clinical Trials. 2007;28:583–592. doi: 10.1016/j.cct.2007.02.002. [DOI] [PubMed] [Google Scholar]

[R2] Andersen PK, Borgan Ø, Gill RD, Keiding N. Statistical models based on counting processes. New York: Springer–Verlag; 1993. (Springer series in statistics). [Google Scholar]

[R3] Arici C, Ripamonti D, Maggiolo F, Rizzi M, Finazzi MG, Pezzotti P, Suter F. Factors associated with the failure of HIV-positive persons to return for scheduled medical visits. HIV Clinical Trials. 2002;3(1):52–57. doi: 10.1310/2XAK-VBT8-9NU9-6VAK. [DOI] [PubMed] [Google Scholar]

[R4] Bender R, Augustin T, Blettner M. Generating survival times to simulate cox proportional hazards models. Statistics in Medicine. 2005;24(11):1713–1723. doi: 10.1002/sim.2059. [DOI] [PubMed] [Google Scholar]

[R5] Bryant J, Dignam JJ. Semiparametric models for cumulative incidence functions. Biometrics. 2004;60(1):182–190. doi: 10.1111/j.0006-341X.2004.00149.x. [DOI] [PubMed] [Google Scholar]

[R6] Dudley J, Jin S, Hoover D, Metz S, Thackeray R, Chmiel J. The Multicenter AIDS Cohort Study: Retention after 9 1/2 years. American Journal of Epidemiology. 1995;142(3):323–330. doi: 10.1093/oxfordjournals.aje.a117638. [DOI] [PubMed] [Google Scholar]

[R7] Fine JP. Regression modeling of competing crude failure probabilities. Biostatistics. 2001;2(1):85–97. doi: 10.1093/biostatistics/2.1.85. [DOI] [PubMed] [Google Scholar]

[R8] Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94(446):496–497. [Google Scholar]

[R9] Fleishman JA, Yehia BR, Moore RD, Korthuis PT, Gebo KA, for the HIV Reseach Network Establishment, retention, and loss to follow-up in outpatient HIV care. Journal of Acquired Immune Deficiency Syndrom. 2012;60(3):249–259. doi: 10.1097/QAI.0b013e318258c696. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] Fleming T, Harrington D. Counting processes and survival analysis. Vol. 8. Wiley Online Library; 1991. [Google Scholar]

[R11] Gray RJ. A class of k-sample tests for comparing the cumulative incidence of a competing risk. The Annals of Statistics. 1988;16(3):1141–1154. [Google Scholar]

[R12] Gulick R, Ribaudo H, Shikuma C, Lalama C, Schackman B, Meyer W, III, Acosta E, Schouten J, Squires K, Pilcher C, et al. Three-vs four-drug antiretroviral regimens for the initial treatment of hiv-1 infection. JAMA: the Journal of the American Medical Association. 2006;296(7):769–781. doi: 10.1001/jama.296.7.769. [DOI] [PubMed] [Google Scholar]

[R13] Gulick R, Ribaudo H, Shikuma C, Lustgarten S, Squires K, Meyer W, III, Acosta E, Schackman B, Pilcher C, Murphy R, et al. Triplenucleoside regimens versus efavirenz-containing regimens for the initial treatment of hiv-1 infection. New England Journal of Medicine. 2004;350(18):1850–1861. doi: 10.1056/NEJMoa031772. [DOI] [PubMed] [Google Scholar]

[R14] Heitjan DF, Rubin DB. Ignorability and coarse data. The Annals of Statistics. 1991;19(4):2244–2253. [Google Scholar]

[R15] Ioannidis JPA, Bassett R, Hughes MD, Volberding PA, Sacks HS, Lau J. Predictors and impact of patients lost to follow-up in a long-term randomized trial of immediate versus deferred antiretroviral treatment. Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology. 1997;16(1):22–30. doi: 10.1097/00042560-199709010-00004. [DOI] [PubMed] [Google Scholar]

[R16] Jeong JH, Fine J. Direct parametric inference for the cumulative incidence function. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2006;55(2):187–200. [Google Scholar]

[R17] Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. Vol. 5. Wiley; New York: 1980. [Google Scholar]

[R18] Krishnan S, Wu K, Smurzynski M, Bosch RJ, Benson CA, Collier AC, Klebert MK, Feinberg J, Koletar SL, for the ALLRT/A5001 team. Incidence rate of and factors associated with loss to follow-up in a longitudinal cohort of antiretroviral-treated HIV-infected persons: An AIDS Clinical Trials Group (ACTG) Longitudinal Linked Randomized Trials (ALLRT) analysis. HIV Clinical Trials. 2010;12(4):190–200. doi: 10.1310/HCT1204-190. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] Lanoy E, Mary-Krause M, Tattevin P, Dray-Spira R, Duvidier C, Fischer P, Obadia Y, Lert F, Costagliola D, the Clinical Epidemiology Group of the French Hospital Database on HIV infection. Predicators identified for losses to follow-up among HIV-seropositive patients. Journal of Clinical Epidemiology. 2006;59:829–835. doi: 10.1016/j.jclinepi.2005.11.024. [DOI] [PubMed] [Google Scholar]

[R20] Lin DY. Non-parametric inference for cumulative incidence functions in competing risks studies. Statistics in Medicine. 1997;16(8):901–910. doi: 10.1002/(sici)1097-0258(19970430)16:8<901::aid-sim543>3.0.co;2-m. [DOI] [PubMed] [Google Scholar]

[R21] Lok JJ, Hughes MD. Evaluating predictors of competing risk outcomes when censoring depends on time-dependent covariates, with application to safety and efficacy of HIV treatment. 2016 doi: 10.1002/sim.6852. Accepted by Statistics in Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Pintilie M. Competing risks: a practical perspective. John Wiley & Sons; New York: 2006. [Google Scholar]

[R23] Robins J, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. Aids Epidemiology, Methodological issues. 1992:297–331. [Google Scholar]

[R24] Robins JM, Rotnitzky A, Zhao LP. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association. 1995;90(429):106–121. [Google Scholar]

[R25] Rotnitzky A, Bergesio A, Farall A. Analysis of quality-of-life adjusted failure time data in the presence of competing, possibly informative, censoring mechanisms. Lifetime Data Analysis. 2009;15(1):1–23. doi: 10.1007/s10985-008-9088-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] Rotnitzky A, Farall A, Bergesio A, Scharfstein D. Analysis of failure time data under competing censoring mechanisms. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2007;69(3):307–327. [Google Scholar]

[R27] Rotnitzky A, Robins J. Inverse probability weighting in survival analysis. Encyclopedia of Biostatistics. 2005;4 [Google Scholar]

[R28] Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–592. [Google Scholar]

[R29] Tsiatis AA. Semiparametric theory and missing data. Springer Verlag; 2006. [Google Scholar]

PERMALINK

Estimation of the Cumulative Incidence Function Under Multiple Dependent and Independent Censoring Mechanisms

Judith J Lok

Shu Yang

Brian Sharkey

Michael D Hughes

Abstract

1 Introduction

2 The ACTG A5095 Study: A Motivating Example

3 Notation and Goal

4 Assumptions

5 Estimation

5.1 Inverse Probability of Censoring Weighted (IPCW) Estimator

5.2 Augmented Inverse Probability of Censoring Weighted (AIPCW) Estimator

6 Simulation Study

Table 1.

Table 2.

7 Analysis of Competing Risks in ACTG A5095

Table 3.

Table 4.

Figure 1.

Figure 2.

8 Discussion

Acknowledgments

A Appendix

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Estimation of the Cumulative Incidence Function Under Multiple Dependent and Independent Censoring Mechanisms

Judith J Lok

Shu Yang

Brian Sharkey

Michael D Hughes

Abstract

1 Introduction

2 The ACTG A5095 Study: A Motivating Example

3 Notation and Goal

4 Assumptions

5 Estimation

5.1 Inverse Probability of Censoring Weighted (IPCW) Estimator

5.2 Augmented Inverse Probability of Censoring Weighted (AIPCW) Estimator

6 Simulation Study

Table 1.

Table 2.

7 Analysis of Competing Risks in ACTG A5095

Table 3.

Table 4.

Figure 1.

Figure 2.

8 Discussion

Acknowledgments

A Appendix

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases