A Proportional Hazards Regression Model for the Sub-distribution with Covariates Adjusted Censoring Weight for Competing Risks Data

PENG HE; FRANK ERIKSSON; THOMAS H SCHEIKE; MEI-JIE ZHANG

doi:10.1111/sjos.12167

. Author manuscript; available in PMC: 2017 Mar 1.

Published in final edited form as: Scand Stat Theory Appl. 2015 Jun 5;43(1):103–122. doi: 10.1111/sjos.12167

A Proportional Hazards Regression Model for the Sub-distribution with Covariates Adjusted Censoring Weight for Competing Risks Data

PENG HE ¹, FRANK ERIKSSON ², THOMAS H SCHEIKE ², MEI-JIE ZHANG ³

PMCID: PMC4809648 NIHMSID: NIHMS703555 PMID: 27034534

Abstract

With competing risks data, one often needs to assess the treatment and covariate effects on the cumulative incidence function. Fine and Gray proposed a proportional hazards regression model for the subdistribution of a competing risk with the assumption that the censoring distribution and the covariates are independent. Covariate-dependent censoring sometimes occurs in medical studies. In this paper, we study the proportional hazards regression model for the subdistribution of a competing risk with proper adjustments for covariate-dependent censoring. We consider a covariate-adjusted weight function by fitting the Cox model for the censoring distribution and using the predictive probability for each individual. Our simulation study shows that the covariate-adjusted weight estimator is basically unbiased when the censoring time depends on the covariates, and the covariate-adjusted weight approach works well for the variance estimator as well. We illustrate our methods with bone marrow transplant data from the Center for International Blood and Marrow Transplant Research (CIBMTR). Here cancer relapse and death in complete remission are two competing risks.

Keywords: competing risks, cumulative incidence function, proportional hazards model, subdistribution, inverse probability of censoring weight

1 Introduction

Biomedical research often involves competing risks in which each subject is at risk of failure from K different causes. For competing risks data, one only observes the first event to occur and this precludes the occurrence of another event. Also, one often wishes to estimate and model the cumulative incidence function (CIF), which is the marginal probability of failure of a specific cause. The standard approach of modeling CIF is to model the cause-specific hazard functions for all causes. Let λ_k(t; Z) be the kth conditional cause-specific hazard (k = 1, 2 for simplicity), where Z is given set of covariates. The CIF of cause 1 given by Z is

F_{1} (t; Z) = P (\tilde{T} \leq t, ∊ = 1 ∣ Z) = \int_{0}^{t} λ_{1} (s; Z) \exp [- \int_{0}^{s^{-}} {λ_{1} (u; Z) + λ_{2} (u; Z)} d u] d s,

where $\tilde{T}$ is the failure time and ∊ indicates the type of failure. Here, all cause-specific hazards need to be modeled adequately and correctly. Note that the cumulative incidence function F₁(t; Z) is a subdistribution function since F₁(∞; Z) < 1. Prentice et al. (1978) and Cheng et al. (1998) proposed using Cox (1972) proportional hazards model for all causes. Alternatively, Shen and Cheng (1999) considered a special additive model, and Scheike and Zhang (2002, 2003) proposed and studied a flexible Cox-Aalen model, which allows some of the covariates to have time-varying effects. Since the cumulative incidence function of a specific cause is a function of cause-specific hazards for all causes, it is difficult to summarize the covariate effect (Zhang and Fine, 2008) and to identify the covariate effect on the cumulative incidence function. However, regression methods have been developed to directly model the cumulative incidence function. Fine and Gray (1999) (FG) developed a regression method to directly model the CIF by modeling the subdistribution hazard function through a Cox type regression model, $λ_{k}^{*} (t, Z) = - d \log {1 - F_{k} (t; Z)} ∕ d t = λ_{k 0}^{*} (t) \exp (β_{k}^{⊺} Z)$ based on early work by Gray (1988) and Pepe (1991). FG proposed using an inverse probability of the censoring weighting (IPCW) technique to estimate the regression parameter β_k and cumulative baseline subdistribution hazard function $Λ_{k 0}^{*} (t) = \int_{0}^{t} λ_{k 0}^{*} (s) d s$ . This approach has been implemented in an R-package, cmprsk. FG's model has been considered and used extensively in cancer studies, epidemiological studies, and many other biomedical studies (Scrucca et al., 2007; Wolbers et al., 2009; Kim, 2007; Lau et al., 2009). Let $r (t) = I {C \geq (\tilde{T} \land t)}$ and G_C(t) = P(C > t), where C is the censoring time. Fine and Gray's approach is based on the fact that $E [r (t) ∕ G_{C} {(\tilde{T} \land C) \land t} ∣ Data] = 1$ provided that censoring time is independent of the covariates, and FG proposed using the Kaplan-Meier estimator to estimate the unknown censoring distribution G_C. However, in biomedical research studies, the censoring time may depend on some of the covariates and the treatment group. In a clinical trial, patients may be more likely to drop out with some specific value of covariate characteristics, and one treatment group may have a higher dropout rate than the others (Mai, 2008). DiRienzo and Lagakos (2001a,b) showed that when the distribution of censoring depends on both the treatment group and the covariates, in general the null asymptotic distribution of the score test is not centered at zero when the model is misspecified, the tests of treatment group effect can be severely biased. Heinze et al. (2003) showed that if the censoring distributions are not similar in the two comparison groups, the log-rank test and fitting a regression model, such as fitting a proportional hazards model, may not be valid. For the competing risks data, one can show that $E [r (t) ∕ G_{C} {(\tilde{T} \land C) \land t ∣ Data} ∣ Data] = 1$ , where $G_{C} {(\tilde{T} \land C) \land t ∣ Data}$ is the conditional censoring distribution given by Data. Thus, parameter estimates using the inverse probability of censoring weighting approach with the Kaplan-Meier estimator may be biased when the censoring distribution depends on some of the covariates. To adjust the IPCW when censoring distribution depends on some of the covariates, Fine and Gray (1999) suggested using a stratified Kaplan-Meier estimator for the discrete covariates and assuming the Cox model for the continuous covariates. In this study, we considered a regression model for the censoring distribution, such as a Cox proportional hazards model, and using the predicted censoring probability for each individual subject for the weight function. With the Cox model adjusted weight, we derived an efficient variance estimator which includes variation contributed from estimated censoring distribution, and we performed a simulation study to examine the bias that would arise without adjusting covariates for estimating the censoring distribution, potential bias reduction and robustness of using the Cox model for the censoring distribution. Furthermore, Fine and Gray proposed using a stabilized factor ${\hat{G}}_{C} (t)$ with inverse weight $r (t) {\hat{G}}_{C} (t) ∕ {\hat{G}}_{C} {(\tilde{T} \land C) \land t}$ . Our simulation indicates that this stabilized weight improves the efficiency and reduces the bias, but not enough. With the Cox model adjusted weight function, we also considered using a stabilized weight $r (t) {\hat{G}}_{C} (t ∣ X = x) ∕ {\hat{G}}_{C} {(\tilde{T} \land C) \land t ∣ X = x}$ to improve efficiency and to reduce bias, where X is the covariates, which is associated with the censoring distribution and could be a subset covariates of Z.

The outline of the remainder of the paper is as follows. In Section 2 we describe the competing risks data structure. We introduce a regression-adjusted inverse weighted estimation for the proportional subdistribution hazards model and present the asymptotic results that can be used for inference. Simulation studies are provided in Section 3. In Section 4 we analyze two real data sets, which were originally studied by Kumar et al. (2012) and by Ringdén et al. (2012) using data from the Center for International Blood and Marrow Transplant Research (CIBMTR). Concluding remarks are provided in Section 5. The proof of the main asymptotic result and the simulation procedure are given in Appendix A and B, respectively.

2 Data and covariate adjusted censoring weight

Let ${\tilde{T}}_{i}$ and C_i be the event time and right censoring time for ith individual, respectively. ∊_i ∈ {1,…,K} indicates the cause of failure. For simplicity, we assume K = 2 in this study. Let $T_{i} = \min ({\tilde{T}}_{i}, C_{i})$ and $Δ_{i} = I ({\tilde{T}}_{i} \leq C_{i})$ . We observe n independent and identically distributed (i.i.d.) data {T_i, Δ_i, Δ_i∊_i, Z_i} for i = 1, …, n, where Z_i = (Z_i1,…, Z_iq)^T are associated covariates. We assume that ( ${\tilde{T}}_{i}, ∊_{i}$ ) are independent of C_i given covariates of Z_i. We are interested in modeling the cumulative incidence function of cause 1, F₁(t; Z). Based on Gray (1988) subdistribution hazard technique, Fine and Gray (1999) proposed a proportional subdistribution hazards model

λ_{1}^{*} (t; Z) = \frac{- d \log {1 - F_{1} (t; Z)}}{d t} = λ_{10}^{*} (t) \exp {β_{0}^{⊺} Z} .

(2.1)

There is a direct relationship between the CIF and subdistribution hazard function:

F_{1} (t; Z) = 1 - \exp {- (\int_{0}^{t} λ_{10}^{*} (u) d u) e^{β_{0}^{⊺} Z}} .

Let $N_{i}^{1} (t) = I ({\tilde{T}}_{i} \leq t, ∊_{i} = 1)$ be the underlying counting process associated with cause 1. For right censored competing risks data, $N_{i}^{1} (t)$ and $Y_{i}^{1} (t) = 1 - N_{i}^{1} (t^{-})$ are not fully observed. For a censored individual, they are only observed up to the censoring time C_i. Define $r_{i} (t) = I {C_{i} \geq ({\tilde{T}}_{i} \land t)}$ . Then $r_{i} (t) N_{i}^{1} (t)$ and $r_{i} (t) Y_{i}^{1} (t)$ are computable for all times t. Let G_C(t; Z) = P (C ≥ t|Z) be the conditional censoring distribution. Based on the assumption that given covariates Z the event time and censoring time are independent and the models are formulated as standard regression models conditional on Z, it then follows that given Z

\begin{matrix} E {\frac{r (t) N^{1} (t)}{G_{C} (T \land t; Z)} ∣ Z} & = E [E {\frac{r (t) N^{1} (t)}{G_{C} (T \land t; Z)} ∣ T, ∊, Z} ∣ Z] \\ = E [E {N^{1} (t) ∣ T, ∊, Z} \frac{E {r (t) ∣ T, ∊, Z}}{G_{C} (T \land t; Z)} ∣ Z] \\ = F_{1} (t; Z) . \end{matrix}

FG proposed using an inverse probability of the censoring weighting (IPCW) approach to fit the model (2.1) and proposed an IPCW weight function ${\hat{w}}_{i}^{KM} (t) = r_{i} (t) {\hat{G}}_{C}^{KM} (t) ∕ {\hat{G}}_{C}^{KM} (T_{i} \land t)$ , where ${\hat{G}}_{C}^{KM} (t)$ is the Kaplan-Meier estimator for the unknown censoring distribution. FG proposed estimating the unknown regression coefficient β by solving the score equation

U_{KM} (β) = \sum_{i} \int_{0}^{τ} {Z_{i} - \frac{Σ_{j} w_{j}^{KM} (u) Y_{j}^{1} (u) Z_{j} \exp {β^{⊺} Z_{j}}}{Σ_{j} w_{j}^{KM} (u) Y_{j}^{1} (u) \exp {β^{⊺} Z_{j}}}} w_{i}^{KM} (u) d N_{i}^{1} (u) = 0,

where τ is is end of the study time point, and denote the estimate as ${\hat{β}}_{KM}$ . FG showed that under regularity conditions and the condition that the censoring distribution is independent of covariates, ${\hat{β}}_{KM}$ is consistent for β₀ and derived large sample properties for $\sqrt{n} ({\hat{β}}_{KM} - β_{0})$ and $\sqrt{n} {{\hat{Λ}}_{10}^{KM} (t) - Λ_{10}^{*} (t)}$ , where the cumulative baseline subdistribution hazard $Λ_{10}^{*} (t) = \int_{0}^{t} λ_{10}^{*} (u)$ is estimated by

{\hat{Λ}}_{10}^{KM} (t) = \sum_{i} \int_{0}^{t} \frac{w_{i}^{KM} (u) d N_{i}^{1} (u)}{Σ_{j} w_{j}^{KM} (u) Y_{j}^{1} (u) \exp {{\hat{β}}_{KM}^{⊺} Z_{j}}} .

It has been shown that in biomedical research studies the censoring time may depend on some of the covariates and the treatment group. To make asymptotically unbiased inference, we needed to model the censoring distribution and to estimate the censoring survival probability, G_C(T ∧ t; Z_i), for each individual. In this study, as suggested by Fine and Gray (1999), we considered the commonly used Cox proportional hazards model for the censoring distribution,

λ_{C} (t; X_{i}) = λ_{C 0} (t) \exp {γ_{0}^{⊺} X_{i}},

where X_i is the covariates associated with the censoring distribution and can be a function or subset of Z_i. In practice one can use the standard model checking procedure to check the Cox model assumption for the censoring distribution and use the standard model building procedure to identify the risk factors which are associated with the censoring time. Let x_i be the fixed observed value for the ith individual's covariates, we estimate the predicted censoring survival probability $G_{COX}^{C} (t; x_{i}) = P (C > t ∣ X_{i} = x_{i}) = \exp {- Λ_{C 0} (t) \exp (γ_{0}^{⊺} x_{i})}$ by

{\hat{G}}_{C}^{COX} (t; x_{i}) = \exp {- {\hat{Λ}}_{C 0} (t) \exp ({\hat{γ}}^{⊺} x_{i})},

(2.2)

where $\hat{γ}$ is a maximum partial likelihood estimate for γ₀ and ${\hat{Λ}}_{C 0} (t)$ is the standard Breslow estimator for the cumulative baseline censoring hazard $Λ_{C 0} (t) = \int_{0}^{t} λ_{C 0} (u) d u$ . Note that, any administrative censoring events at time τ are not considered as events in the regression estimation. In this study, we considered a covariate-adjusted IPCW weight function

{\hat{w}}_{i}^{COX} (t) = r_{i} (t) {\hat{G}}_{C}^{COX} (t; x_{i}) ∕ {\hat{G}}_{C}^{COX} (T_{i} \land t; x_{i}) .

We estimated β in model (2.1) by solving the score equation

U_{COX} (β) = \sum_{i} \int_{0}^{τ} {Z_{i} - \frac{Σ_{j} {\hat{w}}_{j}^{COX} (u) Y_{j}^{1} (u) Z_{j} \exp {β^{⊺} Z_{j}}}{Σ_{j} {\hat{w}}_{j}^{COX} (u) Y_{j}^{1} (u) \exp {β^{⊺} Z_{j}}}} {\hat{w}}_{i}^{COX} (u) d N_{i}^{1} (u) = 0,

(2.3)

and denoted the estimate as ${\hat{β}}_{COX}$ . Then we estimated $Λ_{10}^{*} (t)$ by

{\hat{Λ}}_{10}^{COX} (t) = \sum_{i} \int_{0}^{t} \frac{{\hat{w}}_{i}^{COX} (u) d N_{i}^{1} (u)}{Σ_{j} {\hat{w}}_{j}^{COX} (u) Y_{j}^{1} (u) \exp {{\hat{β}}_{COX}^{⊺} Z_{j}}} .

Under regularity conditions (given in Section 6.1)), it can be shown that (see (6.6))

\sqrt{n} ({\hat{β}}_{COX} - β_{0}) = Ω^{- 1} {\frac{1}{\sqrt{n}} U_{COX} (β_{0})} + o_{P} (1),

where Ω = lim_n→∞ n⁻¹I_COX(β₀), I_COX(β) = −∂{U_COX(β)}/∂β, and Ω can be estimated by $\hat{Ω} = n^{- 1} I_{COX} (\hat{β})$ Furthermore (see(6.4)), $n^{- 1 ∕ 2} U_{COX} (β_{0}) = n^{- 1 ∕ 2} \sum_{i} (ξ_{i}^{COX} + ψ_{i}^{COX}) + o_{P} (1)$ , where explicit expressions for $ξ_{i}^{COX}$ and $ψ_{i}^{COX}$ are given in the Appendix A. The quantities can be estimated by plug-in estimators denoted by $ξ_{i}^{COX}$ and $ψ_{i}^{COX}$ , respectively. It follows that $\sqrt{n} ({\hat{β}}_{COX} - β_{0})$ converges in distribution to a mean zero Gaussian distribution with an asymptotic variance that can be estimated by

{\hat{Σ}}_{β}^{COX} = n \sum_{i} {({\hat{W}}_{β, i}^{COX})}^{\otimes 2} = n {I_{COX} ({\hat{β}}_{COX})}^{- 1} {\sum_{i} {({\hat{ξ}}_{i}^{COX} + {\hat{ψ}}_{i}^{COX})}^{\otimes 2}} {I_{COX} ({\hat{β}}_{COX})}^{- 1},

where a^⊗2 = aa^⊺ for a column vector a.

Similarly (see (6.10)), $\sqrt{n} {{\hat{Λ}}_{10}^{COX} (t) - Λ_{10}^{*} (t)} = n^{- 1 ∕ 2} \sum_{i} W_{Λ, i}^{COX} (t) + o_{P} (1)$ , which converges weakly to a mean zero Gaussian process with asymptotic variances, which can be estimated by

{\hat{Σ}}_{Λ_{10}}^{COX} (t) = \frac{1}{n} \sum_{i} {{\hat{W}}_{Λ, i}^{COX} (t)}^{2} .

Explicit expressions for $W_{Λ, i}^{COX} (t)$ and ${\hat{W}}_{Λ, i}^{COX} (t)$ can be found in the Appendix A.

For a given set value of covariates, z₀, the predicted CIF of cause 1 can be estimated by ${\hat{F}}_{1}^{KM} (t; z_{0}) = 1 - \exp {- {\hat{Λ}}_{10}^{KM} (t) \exp ({\hat{β}}_{KM}^{⊺} z_{0})}$ or ${\hat{F}}_{1}^{COX} (t; z_{0}) = 1 - \exp {- {\hat{Λ}}_{10}^{COX} (t) \exp ({\hat{β}}_{COX}^{⊺} z_{0})}$ respectively. Fine and Gray (1999) derived the large sample property for $\sqrt{n} {{\hat{F}}_{1}^{KM} (t; z_{0}) - F_{1} (t; z_{0})}$ when the censoring distribution is independent of the covariates. When the censoring distribution depends on the covariates through a Cox model, by the functional Delta method it follows that $\sqrt{n} {{\hat{F}}_{1}^{COX} (t; z_{0}) - F_{1} (t; z_{0})}$ converges in distribution to a Gaussian process with mean zero and asymptotic variances, which can be estimated by

n {1 - {\hat{F}}_{1}^{COX} (t; z_{0})}^{2} \sum_{i} {{\hat{W}}_{F_{1}, i}^{COX} (t; z_{0})}^{2},

where

{\hat{W}}_{F_{1, i}}^{COX} (t; z_{0}) = \exp ({\hat{β}}_{COX}^{⊺} z_{0}) {{\hat{Λ}}_{10}^{COX} (t) {({\hat{W}}_{β, i}^{COX})}^{⊺} z_{0} + {\hat{W}}_{Λ, i}^{COX} (t)} .

Resampling techniques can be used to construct confidence bands for $Λ_{10}^{*} (t)$ and F₁(t;z₀) (Lin et al., 1994; Scheike et al., 2008).

3 Simulations

We compared the finite-sample performance of the estimator using the covariate-adjusted censoring weight to the unadjusted estimator using the Kaplan-Meier estimator for the censoring distribution. Two simulation studies were considered to examine the potential bias reduction with the covariate-adjusted censoring weight estimator. For the first study, we had one binary covariate. For the second study, we considered one binary covariate and one continuous covariate. In both studies, we compared the performance of estimators using two weights, ${\hat{w}}_{i}^{KM} (t)$ and ${\hat{w}}_{i}^{COX} (t)$ , respectively.

3.1 Study 1

The regression model below has one binary covariate Z. Given Z, the cumulative incidence functions are given by

F_{1} (t; Z) = 1 - {1 - p (1 - e^{- t})}^{\exp (β Z)}

and

F_{2} (t; Z) = {(1 - p)}^{\exp (β Z)} {1 - e^{- t \exp (β Z)}},

where p = F₁(∞|Z = 0). We let p = 0.66 and Z be a Bernoulli random variable, with value 1 for half of the sample and 0 for the other half. For each setting, we simulated 10,000 replicates with sample size of n = 100 and 300, respectively (detailed simulating procedure is given in Appendix B). We set β = 1 and considered the following three simulation scenarios.


Scenario 1	Censoring times are independent of Z:
	Generate censoring times from an exponential distribution ~ exp(λ_C)
	Set λ_C = 0.556 for 30% censoring, λ_C = 1.342 for 50% censoring

Scenario 2	Censoring times depend on Z by a Cox model:
	Generate censoring times from a Cox model, λ_C(t\|Z) = λ_C exp(β_CZ)
	Set β_C = 2.5 and λ_C = 0.137 for 30% censoring
	Set β_C = 2.5 and λ_C = 0.391 for 50% censoring

Scenario 3	Censoring times depend on Z, not by a Cox model:
	C ~ U(0.25, 4.00), if Z = 0, C ~ U(0.07, 1.12), if Z = 1 for 30% censoring
	C ~ U(0.25, 2.00), if Z = 0, C ~ U(0.06, 0.46), if Z = 1 for 50% censoring

Open in a new tab

The regression coefficient β was estimated by the methods described in Section 2. We report the average of bias (Bias), the sample standard deviation of $\hat{β}$ (SD), the average of estimated standard error $(\bar{\hat{σ}})$ using formula given in Appendix A, average of standardized bias ( $Std-B = E {∣ \hat{β} - β ∣ ∕ \hat{σ}$ ), the coverage probability of β, and the mean squared error (MSE). Table 1 shows the simulation results. We also examined the potential bias of estimating the cumulative baseline subdistribution hazard, $Λ_{10}^{*} (t)$ , using both weights at a set of time points, t = (0.25, 0.5, 0.75, 1.00)^⊺. Figure 1 shows the simulation results.

Table 1.

Simulation results for bias reduction with a single binary covariate (β = 1).

				Unadjusted weight				Cox model adjusted weight
Scenario	N	Cens.	β	Bias (Std-B)	$\bar{\hat{σ}}$ (SD)	Coverage	MSE	Bias (Std-B)	$\bar{\hat{σ}}$ (SD)	Coverage	MSE
1	100	30%	β	0.0081 (0.0291)	0.2759 (0.2798)	0.9494	0.0783	0.0089 (0.0319)	0.2757 (0.2797)	0.9495	0.0783
		50%	β	0.0145 (0.0430)	0.3329 (0.3384)	0.9488	0.1147	0.0153 (0.0450)	0.3331 (0.3387)	0.9489	0.1149
	300	30%	β	0.0010 (0.0061)	0.1584 (0.1586)	0.9499	0.0252	0.0013 (0.0079)	0.1580 (0.1583)	0.9512	0.0251
		50%	β	0.0030 (0.0155)	0.1899 (0.1909)	0.9498	0.0365	0.0033 (0.0174)	0.1898 (0.1908)	0.9503	0.0364

2	100	30%	β	−.1119 (0.3592)	0.3041 (0.3116)	0.9278	0.1096	0.0050 (0.0160)	0.3080 (0.3123)	0.9487	0.0976
		50%	β	−.1162 (0.2719)	0.4075 (0.4271)	0.9359	0.1959	0.0175 (0.0398)	0.4217 (0.4388)	0.9501	0.1929
	300	30%	β	−.1244 (0.7109)	0.1741 (0.1750)	0.8865	0.0461	0.0042 (0.0236)	0.1765 (0.1762)	0.9503	0.0311
		50%	β	−.1336 (0.5596)	0.2306 (0.2346)	0.9063	0.0729	0.0055 (0.0226)	0.2393 (0.2409)	0.9511	0.0581

3	100	30%	β	−.1020 (0.3383)	0.2937 (0.3016)	0.9312	0.1013	0.0045 (0.0145)	0.3036 (0.3096)	0.9503	0.0958
		50%	β	−.0988 (0.2507)	0.3797 (0.3941)	0.9350	0.1650	0.0168 (0.0401)	0.4057 (0.4198)	0.9487	0.1765
	300	30%	β	−.1099 (0.6491)	0.1685 (0.1692)	0.8927	0.0407	0.0026 (0.0147)	0.1748 (0.1749)	0.9535	0.0306
		50%	β	−.1088 (0.5009)	0.2152 (0.2173)	0.9152	0.0591	0.0072 (0.0310)	0.2309 (0.2332)	0.9508	0.0544

Open in a new tab

Bias=Average of bias of $\hat{β}$ ; SD=Sample standard deviation of $\hat{β}$ ; $\bar{\hat{σ}}$ =Average of estimated standard error; Std-B = $E {∣ \hat{β} - β ∣ ∕ \hat{σ}}$ ; MSE=Mean squared error.

Simulation results (1 covariate) for biases of cumulative baseline subdistribution hazards at t = (0.25, 0.5, 0.75, 1)^T with 30% and 50% censoring, respectively.

The simulation results show that when the censoring time depends on the covariate (scenario 2 and 3), the unadjusted estimator produces significantly biased results, and the estimator using the covariate-adjusted censoring weight provides satisfactory results where the biases are all close to zero. Both estimators give satisfactory variance estimates and have almost identical sample standard deviations (see scenario 2 and 3 in Table 1). Regarding the cumulative subdistribution hazard estimators, estimates using the Cox model adjusted weights have smaller biases compared to those using the unadjusted Kaplan-Meier weight at almost all time points (see Figure 1). Simulation results also indicated that the estimator using the Cox model adjusted weight provides satisfactory results when the Cox model is not the true model for the censoring distribution (see scenario 3 in Table 1 and Figure 1). In scenario 1, where the censoring distribution is independent of the covariate Z, both estimators provide satisfactory results in estimating the covariate effect and cumulative baseline subdistribution hazard function. Both estimators also have almost identical sample standard deviation and similar MSE, which indicate that the potential efficiency losses from modeling the censoring distribution are minimal when using covariate-adjusted censoring weights.

3.2 Study 2

The regression models below have one binary covariate Z₁ and one continuous covariate Z₂. Given Z₁ and Z₂, the cumulative incidence functions are given by

F_{1} (t; Z_{1}; Z_{2}) = 1 - {1 - p (1 - e^{- t})}^{\exp (β_{1} Z_{1} + β_{2} Z_{2})}

and

F_{2} (t; Z_{1}, Z_{2}) = {(1 - p)}^{\exp (β_{1} Z_{1} + β_{2} Z_{2})} {1 - e^{- t \exp (β_{1} Z_{1} + β_{2} Z_{2})}} .

We let p = 0.66, and Z₁ is a Bernoulli random variable, with a value 1 for half of the sample and 0 for the other half. Z₂ is a N(0,1) random variable. We set β₁ = 1, β₂ = 0.5 and considered the following four scenarios.


Scenario 1	Censoring times are independent of Z₁ and Z₂
	Generate censoring times from an exponential distribution ~ exp(λ_C)
	Set λ_C = 0.547 for 30% censoring, λ_C = 1.352 for 50% censoring

Scenario 2	Censoring times depend on Z₁ by a Cox model
	Generate censoring times from λ_C(t\|Z) = λ_C exp(β_C1Z₁)
	Set β_C1 = 2.5. Set λ_C = 0.137 for 30% censoring,
	λ_C = 0.397 for 50% censoring

Scenario 3	Censoring times depend on Z₁ and Z₂ by a Cox model
	Generate censoring times from λ_C(t\|Z) = λ_C exp(β_C1Z₁ + β_C2Z₂)
	Set β_C1 = 2.5, β_C2 = 2.5. Set λ_C = 0.082 for 30% censoring,
	λ_C = 0.389 for 50% censoring

Scenario 4	Censoring times depend on Z₁, not by a Cox model
	C ~ U(0.25, 4.00), if Z₁ = 0, C ~ U(0.07, 1.14), if Z₁ = 1 for 30% censoring
	C ~ U(0.25, 2.00), if Z₁ = 0, C ~ U(0.06, 0.438), if Z₁ = 1 for 50% censoring

Open in a new tab

For each setting, we simulated 10,000 replicates with n = 100 and 300. The regression coefficients β₁ and β₂ were estimated by the methods described in Section 2. Table 2 shows the simulation results. We also examined the potential bias of estimating the cumulative baseline subdistribution hazard, $Λ_{10}^{*} (t)$ , using both weights at a set of time points t = (0.25, 0.5, 0.75, 1.00)^T for selected scenarios. Figure 2 shows the simulation results.

Table 2.

Simulation results for biases using 1 binary and 1 continuous covariate (β₁ = 1, β₂ = 0.5).

				Unadjusted weight				Cox model adjusted weight
Scenario	N	Cens.	β	Bias (Std-B)	$\hat{σ}$ (SD)	Coverage	MSE	Bias (Std-B)	$\hat{σ}$ (SD)	Coverage	MSE
1	100	30%	β ₁	0.0148 (0.0521)	0.2793 (0.2849)	0.9462	0.0814	0.0154 (0.0542)	0.2793 (0.2847)	0.9463	0.0813
			β ₂	0.0130 (0.0873)	0.1424 (0.1483)	0.9381	0.0222	0.0130 (0.0876)	0.1424 (0.1482)	0.9394	0.0221
		50%	β ₁	0.0293 (0.0847)	0.3353 (0.3463)	0.9474	0.1207	0.0299 (0.0863)	0.3355 (0.3464)	0.9468	0.1209
			β ₂	0.0143 (0.0812)	0.1676 (0.1763)	0.9362	0.0313	0.0146 (0.0827)	0.1676 (0.1764)	0.9362	0.0313
	300	30%	β ₁	0.0039 (0.0242)	0.1595 (0.1608)	0.9498	0.0259	0.0041 (0.0254)	0.1592 (0.1602)	0.9495	0.0257
			β ₂	0.0042 (0.0521)	0.0807 (0.0813)	0.9494	0.0066	0.0043 (0.0535)	0.0806 (0.0811)	0.9485	0.0066
		5%0	β ₁	0.0062 (0.0327)	0.1898 (0.1883)	0.9525	0.0355	0.0063 (0.0333)	0.1897 (0.1881)	0.9529	0.0354
			β ₂	0.0063 (0.0661)	0.0941 (0.0958)	0.9448	0.0092	0.0065 (0.0674)	0.0941 (0.0957)	0.9438	0.0092

2	100	30%	β ₁	−.0938 (0.2949)	0.3036 (0.3180)	0.9243	0.1099	0.0119 (0.0370)	0.3101 (0.3220)	0.9421	0.1038
			β ₂	0.0273 (0.1778)	0.1462 (0.1536)	0.9355	0.0243	0.0160 (0.1037)	0.1468 (0.1540)	0.9357	0.0240
		50%	β ₁	−.0955 (0.2241)	0.4031 (0.4262)	0.9325	0.1908	0.0179 (0.0409)	0.4181 (0.4385)	0.9414	0.1926
			β ₂	0.0247 (0.1349)	0.1740 (0.1831)	0.9333	0.0341	0.0142 (0.0771)	0.1748 (0.1839)	0.9354	0.0340
	300	30%	β ₁	−.1089 (0.6286)	0.1734 (0.1733)	0.9002	0.0419	0.0049 (0.0277)	0.1771 (0.1769)	0.9514	0.0313
			β ₂	0.0174 (0.2056)	0.0827 (0.0845)	0.9409	0.0074	0.0050 (0.0589)	0.0831 (0.0848)	0.9428	0.0072
		50%	β ₁	−.1136 (0.4956)	0.2271 (0.2291)	0.9153	0.0654	0.0049 (0.0205)	0.2363 (0.2380)	0.9516	0.0566
			β ₂	0.0142 (0.1418)	0.0979 (0.1002)	0.9407	0.0102	0.0027 (0.0273)	0.0984 (0.1004)	0.9434	0.0101

3	100	30%	β ₁	−.0299 (0.0945)	0.3061 (0.3168)	0.9418	0.1012	0.0145 (0.0459)	0.3070 (0.3149)	0.9466	0.0994
			β ₂	−.0527 (0.2665)	0.1898 (0.1979)	0.9224	0.0419	0.0014 (0.0069)	0.1934 (0.1992)	0.9422	0.0397
		50%	β ₁	−.0692 (0.1711)	0.3872 (0.4042)	0.9366	0.1682	0.0163 (0.0398)	0.3935 (0.4090)	0.9441	0.1676
			β ₂	−.0963 (0.3695)	0.2501 (0.2605)	0.9087	0.0771	−.0007 (0.0025)	0.2630 (0.2728)	0.9380	0.0744
	300	30%	β ₁	−.0473 (0.2687)	0.1750 (0.1762)	0.9363	0.0333	0.0018 (0.0102)	0.1749 (0.1758)	0.9486	0.0309
			β ₂	−.0631 (0.5767)	0.1079 (0.1094)	0.8987	0.0159	−.0016 (0.0148)	0.1097 (0.1101)	0.9494	0.0121
		50%	β ₁	−.0874 (0.3920)	0.2196 (0.2231)	0.9266	0.0574	0.0046 (0.0206)	0.2230 (0.2251)	0.9489	0.0507
			β ₂	−.1035 (0.7300)	0.1404 (0.1418)	0.8698	0.0308	0.0022 (0.0146)	0.1488 (0.1504)	0.9451	0.0226

4	100	30%	β ₁	−.0773 (0.2578)	0.2934 (0.2999)	0.9358	0.0959	0.0149 (0.0481)	0.3035 (0.3089)	0.9476	0.0956
			β ₂	0.0251 (0.1642)	0.1441 (0.1526)	0.9346	0.0239	0.0146 (0.0952)	0.1444 (0.1529)	0.9361	0.0236
		50%	β ₁	−.0731 (0.1868)	0.3799 (0.3913)	0.9402	0.1584	0.0265 (0.0636)	0.4040 (0.4163)	0.9494	0.1740
			β ₂	0.0248 (0.1390)	0.1708 (0.1783)	0.9361	0.0324	0.0150 (0.0840)	0.1713 (0.1787)	0.9366	0.0322
	300	30%	β ₁	−.0924 (0.5450)	0.1672 (0.1695)	0.9076	0.0372	0.0055 (0.0314)	0.1735 (0.1750)	0.9485	0.0306
			β ₂	0.0142 (0.1703)	0.0816 (0.0833)	0.9439	0.0071	0.0028 (0.0330)	0.0818 (0.0834)	0.9462	0.0070
		50%	β ₁	−.0877 (0.4128)	0.2149 (0.2124)	0.9327	0.0528	0.0101 (0.0446)	0.2290 (0.2269)	0.9544	0.0516
			β ₂	0.0123 (0.1255)	0.0960 (0.0977)	0.9434	0.0097	0.0023 (0.0232)	0.0963 (0.0982)	0.9455	0.0096

Open in a new tab

Simulation results (2 covariates) for biases of cumulative baseline subdistribution hazards at t = (0.25, 0.5, 0.75, 1.00)^T with 30% and 50% censoring, respectively.

This simulation study shows similar results as in study 1. The unadjusted estimator produces biased results when the censoring distribution depends on the covariates (scenario 2 to 4), and the estimator using the Cox model adjusted weight provides a good bias reduction. Both estimators give satisfactory variance estimates for both parameters. Regarding the cumulative baseline subdistribution hazard estimates, estimates using the Cox-adjusted weight have smaller biases at almost all points (see Figure 2).

Both simulation studies show that the unadjusted estimator produces significant biased results when the censoring time depends on the covariates and the proposed estimator using covariate adjusted weight works well in bias reduction.

4 Real data examples

4.1 Example 1

We considered data from multiple myeloma patients treated with allogeneic stem cell transplantation from the Center for International Blood and Marrow Transplantat Research (CIBMTR) (Kumar et al., 2012). The CIBMTR is comprised of clinical and basic scientists who share data on their blood and bone marrow transplant patients with the CIBMTR Data Collection Center located at the Medical College of Wisconsin. The CIBMTR has a repository of information regarding the results of transplants at more than 450 transplant centers worldwide. The data used in this paper consist of patients transplanted from 1995 to 2005, and we compared the outcomes between transplant periods: 2001-2005 (N=488) versus 1995-2000 (N=375) (Kumar et al., 2012). Two competing events are multiple myeloma relapse and treatment-related mortality (TRM) defined as death without relapse. The CIBMTR study (Kumar et al., 2012) identified that donor type and prior autologous transplantation were associated with relapse or TRM. The variables considered in this example are transplant time period (GP (main interest of the study): 1 for transplanted in 2001-2005 versus 0 for transplanted in 1995-2000), donor type (DNR: 1 for Unrelated or other related donor (N=280) versus 0 for HLA-identical sibling (N=584)), and prior autologous transplant (PREAUTO: 1 for Auto+Allo transplant (N=399) versus 0 for allogeneic transplant alone (N=465)).

First, we fit a Cox model for the censoring distribution where relapsed or dead individuals are considered as censoring subjects. The hazard ratios (HR) are: HR(GP)=6.42 (P < 0.0001); HR(DNR)=0.48 (p = 0.0018); HR(PREAUTO)=1.73 (p = 0.0013). These results indicate that the censoring distribution depends on the transplant time period, donor type and prior autologous transplantation. Next, we fit a proportional subdistribution hazards model (2.1) with the Kaplan-Meier estimated unadjusted weight and the Cox model adjusted weight, and we computed the predicted cumulative incidence probability for a patient who received an HLA-identical sibling donor allogeneic transplantation in 1995-2000 or in 2001-2005 (see results in Table 3–4 and Figure 3). Both weights give similar estimates for TRM. However, for cancer relapse, the regression estimate of the main treatment effect are $\hat{β} = 0.38$ and $\hat{β} = 0.54$ by unadjusted weight and Cox model adjusted weight, respectively. At three years after transplant, the differences in cumulative incidence of relapse between late and early transplant (TX) patients are 0.09 (CIF=0.34 for the late TX versus CIF=0.25 for the early TX) and 0.13 (CIF=0.35 for the late TX versus CIF=0.22 for the early TX) by unadjusted weight and Cox model adjusted weight, respectively. The unadjusted weight underestimates the effect size of CIF of relapse by 4% compared to the point estimate using the Cox model adjusted weight (Table 4). Underestimated effect size counts about 14% (0.04/((0.22+0.35)/2)) of estimated average CIF, which leads to quite a large relative bias.

Table 3.

Fit a proportional subdistribution hazards model.

	Unadjusted weight	Cox model adjusted weight
Variable	$\hat{β}$ ; exp(β) (95% CI); P	$\hat{β}$ ; exp(β) (95% CI); P
RELAPSE
GP	0.38; 1.47(1.16–1.86); 0.0017	0.54; 1.71(1.34–2.20); < 0.0001
DNR	0.39; 1.48(1.18–1.86); 0.0007	0.35; 1.42(1.13–1.78); 0.0027
PREAUTO	0.41; 1.51(1.19–1.91); 0.0007	0.42; 1.53(1.21–1.93); 0.0004
TRM
GP	−0.59; 0.55(0.42–0.73); < 0.0001	−0.56; 0.57(0.43–0.75); < 0.0001
DNR	0.57; 1.76(1.38–2.25); < 0.0001	0.55; 1.73(1.35–2.20); < 0.0001
PREAUTO	−0.38; 0.68(0.51–0.91); 0.0099	−0.37; 0.69(0.52–0.92); 0.0117

Open in a new tab

Table 4.

Predicted CIF of relapse and TRM for a patient who received an HLA-identical sibling donor and allogeneic along transplantation

Unadjusted Weight

Cox model adjusted Weight

1995–2000

2001–2005

1995–2000

2001–2005

Time

{\hat{F}}_{1}

(95% CI)

{\hat{F}}_{2}

(95% CI)

∣ {\hat{F}}_{1} - {\hat{F}}_{2} ∣

{\hat{F}}_{1}

(95% CI)

{\hat{F}}_{2}

(95% CI)

∣ {\hat{F}}_{1} - {\hat{F}}_{2} ∣

RELAPSE

1 Year

0.16 (0.13–0.19)

0.23 (0.18–0.27)

0.07

0.15 (0.13–0.17)

0.24 (0.18–0.30)

0.09

3 Year

0.25 (0.20–0.29)

0.34 (0.28–0.40)

0.09

0.22 (0.20–0.25)

0.35 (0.28–0.42)

0.13

5 Year

0.29 (0.24–0.34)

0.40 (0.33–0.46)

0.11

0.26 (0.24–0.30)

0.41 (0.33–0.49)

0.15

TRM

1 Year

0.38 (0.32–0.43)

0.23 (0.18–0.28)

0.15

0.37 (0.34–0.41)

0.23 (0.17–0.29)

0.14

3 Year

0.42 (0.37–0.48)

0.26 (0.20–0.32)

0.16

0.42 (0.38–0.46)

0.27 (0.20–0.33)

0.15

5 Year

0.44 (0.38–0.49)

0.27 (0.21–0.33)

0.17

0.43 (0.39–0.47)

0.27 (0.21–0.34)

0.16

Open in a new tab

Predicted cumulative incidence probability of relapse and TRM for a patient who received an HLA-identical sibling donor allogeneic transplantation.

4.2 Example 2

We considered another CIBMTR study data set (Ringdén et al., 2012) that consists of 177 myeloma patients who received a reduced-intensity conditioning allogeneic transplantation. Cancer relapse and TRM were two competing risks in this study. 105 patients received prior autologous transplant, and 72 patients received allogeneic transplant alone. We were interested in transplant type effect on relapse and TRM. Let PREAUTO be the indicator of transplant type (1 for Auto+Allo transplant versus 0 for Allogeneic transplant alone). Here the censoring distribution depends on the transplant type (p = 0.0047). We fit a proportional subdistribution hazards model (2.1) for PREAUTO with unadjusted weight and Cox model adjusted weight, respectively. For relapse, we have ${\hat{β}}_{cox} = - 0.34 (\hat{σ} = 0.25)$ ; $\exp ({\hat{β}}_{cox}) = 0.71$ and ${\hat{β}}_{KM} = - 0.41 (\hat{σ} = 0.25)$ ; $\exp ({\hat{β}}_{KM}) = 0.66$ . Here the Cox model adjusted weight reduces a relative bias of 21% ((0.41 − 0.34)/0.34).

5 Concluding remarks

We have shown that the competing risks regression based on IPCW techniques may be biased when the censoring distribution depends on the covariates and the biases could be significant for fixed sample sizes. We considered a regression model for the censoring distribution, and used the Cox proportional hazards model to predict the censoring weight for each individual.

Clearly, using the Cox model to estimate the censoring weights rely on this model fitting well. Our methodology may be adapted to deal with other regression models for the censoring distribution, for example additive hazards models that are more flexible. Efficient variance estimator, which includes variation contributed from estimated censoring distribution, needs to be derived for any alternative model-based weight function, and a computing package needs to be further developed as well.

The censoring time could depend on a time-dependent covariate, but in this case the predictions given the fixed covariates of the competing risks regression model may be hard to get, and this is generally not directly feasible. As Kalbfleisch and Prentice (2002) pointed out that the predicted survival probability is no longer feasible for a random (internal) time-dependent covariate. Further study will be needed.

Recently, the inverse probability of censoring weighting (IPCW) technique (Robins and Rotnitzky, 1992) has been used extensively for right-censored survival data and, specifically, for completing risks data. It has been shown that regression modeling of the censoring distribution can be used to improve the efficiency of the IPCW technique (Bickel et al., 1993; Van der Laan and Robins, 2003; Scheike et al., 2008) even if the censoring distribution is independent of the covariates. In this study, we showed that the covariate-adjusted IPCW technique can be used to reduce bias for modeling the subdistribution hazard function when censoring depends on the covariates. In general, the covariate-adjusted IPCW technique should be considered to improve efficiency and reduce bias.

We have developed an R-package, wtcrsk, which is available on CRAN.

6 Appendix A

Here we present regularity conditions and give detailed proofs for the asymptotic properties of ${\hat{β}}_{COX}$ and ${\hat{Λ}}_{10}^{COX} (t)$ . Similar arguments can been seen in Ghosh and Lin (2002). First, assuming the censoring distribution depends on covariates X through a Cox proportional hazards model where X could be a subset covariates of Z,

λ_{C} (t; X) = λ_{C 0} (t) \exp {γ_{0}^{⊺} X} .

Let $N_{i}^{C} (t) = I (T_{i} \leq t; Δ_{i} = 0)$ , Y_i(t) = I(T_i ≥ t), U_C(γ) be the partial likelihood for the censoring time, and

S_{C}^{(k)} (γ, t) = \frac{1}{n} \sum_{i} Y_{i} (t) X_{i}^{\otimes k} \exp {γ^{⊺} X_{i}}, for k = 0, 1, 2

s_{C}^{(k)} (γ, t) = E {Y_{1} (t) X_{1}^{\otimes k} e^{γ^{⊺} X_{1}}}, for k = 0, 1, 2 .

Further, let

S_{COX}^{(k)} (β, t) = \frac{1}{n} \sum_{i} {\hat{w}}_{i}^{COX} (t) Y_{i}^{1} (t) Z_{i}^{\otimes k} \exp {β^{⊺} Z_{i}}, for k = 0, 1, 2 .

s_{COX}^{(k)} (β, t) = E {Y_{1}^{1} (t) G_{C}^{COX} (t; X_{1}) Z_{1}^{\otimes k} e^{β^{⊺} Z_{1}}}, for k = 0, 1, 2,

where $G_{C}^{COX} (t; X) = \exp {- Λ_{C 0} (t) \exp (γ_{0}^{⊺} X)}$ and $Λ_{C 0} (t) = \int_{0}^{t} λ_{CO} (u) d u$ .

We assume the following regularity conditions to be hold throughout the appendix.

6.1 Assumptions

(A1)
{ ${\tilde{T}}_{i}$ , X_i, Z_i, $Y_{i}^{1}$ , Δ_i, ε_i}, i = 1, …, n, are i.i.d. instances of { $\tilde{T}$ , X, Z, Y¹, Δ, ε}.
(A2)
$\tilde{T}$ and C are independent conditional on X, Z.
(A3)
There is a maximum follow-up time τ < ∞ such that P(T > τ) > 0.
(B1)
The hazard for the right-censoring times is $λ_{C} (t; X) = λ_{C 0} (t) \exp {γ_{0}^{⊺} X}$ , where $\int_{0}^{τ} λ_{C 0} (t) d t < \infty$ and , for a compact .
(B2)
The covariates X are bounded almost surely.
(B3)
The matrix
$Ω_{C} = \int_{0}^{τ} [\frac{s_{C}^{(2)} (γ_{0}, t)}{s_{C}^{(0)} (γ_{0}, t)} - {\frac{s_{C}^{(1)} (γ_{0}, t)}{s_{C}^{(0)} (γ_{0}, t)}}^{\otimes 2}] s_{C}^{(0)} (γ_{0}, t) λ_{C 0} (t) d t$
is positive definite.
(C1)
The subdistribution hazards for cause one is $λ_{1}^{*} (t ∣ Z) = λ_{10}^{*} (t) \exp (β_{0}^{⊺} Z)$ , where $\int_{0}^{τ} λ_{10}^{*} (t) d t < \infty$ and , for a compact .
(C2)
The covariates Z are bounded almost surely.
(C3)
The matrix
$Ω = \int_{0}^{τ} [\frac{s_{COX}^{(2)} (β_{0}, t)}{s_{COX}^{(0)} (β_{0}, t)} - {\frac{s_{COX}^{(1)} (β_{0}, t)}{s_{COX}^{(0)} (β_{0}, t)}}^{\otimes 2}] s_{COX}^{(0)} (β_{0}, t) λ_{10}^{*} (t) d t$
is positive definite.

6.2 Preliminaries

To estimate the censoring distribution we use Cox regression with the roles of $\tilde{T}$ and C exchanged. Any administrative censoring events at time τ are not considered as events in the regression estimation. From assumption (B1) the censoring process N^C(t), for 0 ≤ t < τ, has intensity on the Cox model form. For this process, conditions (A1), (A2), (A3), (B1), (B2), and (B3), are sufficient to fulfill the conditions for the large sample results on the Cox model of Andersen and Gill (1982) (see their Theorem 4.1).

Let $\hat{γ}$ and ${\hat{Λ}}_{C 0}$ be the Cox estimates of γ₀ and Λ_C0, and let

{\hat{G}}_{C}^{COX} (t; X_{i}) = \exp {- {\hat{Λ}}_{C 0} (t) \exp ({\hat{γ}}^{⊺} X_{i})}

By the arguments given by Andersen and Gill (1982), see also Equation (2.1) of Lin et al. (1994), we have

{\hat{G}}_{C}^{COX} (t; X_{i}) - G_{C}^{COX} (t; X_{i}) = - \frac{G_{C}^{COX} (t; X_{i})}{n} \sum_{j = 1}^{n} \int_{u = 0}^{τ} [h^{⊺} (t, 0, X_{i}) Ω_{C}^{- 1} {X_{j} - \frac{s_{C}^{(1)} (γ_{0}, u)}{s_{C}^{(0)} (γ_{0}, u)}} + \frac{e^{(γ_{0}^{⊺} X_{i})} I (u \leq t)}{s_{C}^{0} (γ_{0}, u)}] d M_{COX, j}^{C} (u) + o_{P} (n^{- 1 ∕ 2}),

(6.1)

where Ω_C was defined in (B3), and

h (t, u, X) = e^{(γ_{0}^{⊺} X)} \int_{v = u}^{t} {X - \frac{s_{C}^{(1)} (γ_{0}, v)}{s_{C}^{(0)} (γ_{0}, v)}} d Λ_{C 0} (v),

and

M_{COX, i}^{C} (t) = N_{i}^{C} (t) - \int_{0}^{t} Y_{i} (u) \exp {γ_{0}^{⊺} X_{i}} d Λ_{C 0} (u),

is a martingale with respect to the censoring filtration. Note that by (6.1),

\begin{matrix} \frac{{\hat{G}}_{C}^{COX} (t; X_{i})}{{\hat{G}}_{C}^{COX} (T_{i} \land t; X_{i})} - \frac{G_{C}^{COX} (t; X_{i})}{G_{C}^{COX} (T_{i} \land t; X_{i})} \\ = I (T_{i} < t) \frac{G_{C}^{COX} (T_{i}; X_{i}) {{\hat{G}}_{C}^{COX} (t; X_{i}) - G_{C}^{COX} (t; X_{i})} - G_{C}^{COX} (t; X_{i}) {{\hat{G}}_{C}^{COX} (T_{i}; X_{i}) - G_{C}^{COX} (T_{i}; X_{i})}}{{\hat{G}}_{C}^{COX} (T_{i}; X_{i}) G_{C}^{COX} (T_{i}; X_{i})} \\ = - I (T_{i} < t) \frac{G_{C}^{COX} (t; X_{i})}{G_{C}^{COX} (T_{i}; X_{i})} \\ \times \frac{1}{n} \sum_{j = 1}^{n} \int_{u = 0}^{τ} [h^{⊺} (t, T_{i}, X_{i}) Ω_{C}^{- 1} {X_{j} - \frac{s_{C}^{(1)} (γ_{0}, u)}{s_{C}^{(0)} (γ_{0}, u)}} + \frac{e^{γ_{0}^{⊺} X_{i}} I (T_{i} < u \leq t)}{s_{C}^{(0)} (γ_{0}, u)}] d M_{COX, j}^{C} (u) \\ + o_{P} (n^{- 1 ∕ 2}) . \end{matrix}

(6.2)

Let $w_{i}^{COX} (t) = r_{i} (t) G_{C}^{COX} (t; X_{i}) ∕ G_{C}^{COX} (T_{i} Λ t; X_{i})$ . From the i.i.d. assumption (A1) and the boundedness implied by (A3), (B1), (B2), (C1) and (C2)

S_{COX}^{(k)} (β, t) = \frac{1}{n} \sum_{i = 1}^{n} Y_{i}^{1} (t) w_{i}^{COX} (t) Z_{i}^{\otimes k} e^{β^{⊺} Z_{i}} + \frac{1}{n} \sum_{i = 1}^{n} Y_{i}^{1} (t) r_{i} (t) {\frac{{\hat{G}}_{C}^{COX} (t; X_{i})}{{\hat{G}}_{C}^{COX} (T_{i} \land t; X_{i})} - \frac{G_{C}^{COX} (t; X_{i})}{G_{C}^{COX} (T_{i} \land t; X_{i})}} Z_{i}^{\otimes k} e^{β^{⊺} Z_{i}} \underset{\to}{P} s_{COX}^{(k)} (β, t),

uniformly in t ∊ [0, τ] and $β \in B$ , by (6.2) and the uniform weak law of large numbers. Also note that, by the same assumptions, the limit is bounded from above and bounded away from zero, and we may take derivatives by differentiating under the integral sign.

6.3 Asymptotic normality of n^−1/2U_COX(β₀)

The IPCW score function (2.3) for β evaluated at β₀ is

\begin{matrix} U_{COX} (β_{0}) & = \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \frac{S_{COX}^{(1)} (β_{0}, t)}{S_{COX}^{(0)} (β_{0}, t)}} {\hat{w}}_{i}^{COX} (t) d M_{i}^{1} (t) \\ = \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \frac{s_{COX}^{(1)} (β_{0}, t)}{s_{COX}^{(0)} (β_{0}, t)}} w_{i}^{COX} (t) d M_{i}^{1} (t) \\ + \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \frac{S_{COX}^{(1)} (β_{0}, t)}{S_{COX}^{(0)} (β_{0}, t)}} {{\hat{w}}_{i}^{COX} (t) - w_{i}^{COX} (t)} d M_{i}^{1} (t) + o_{P} (n^{1 ∕ 2}) . \end{matrix}

The first term on the right-hand side above is a sum of mean zero i.i.d. random variables. From (6.2), the second term on the right-hand side is

\sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \frac{S_{COX}^{(1)} (β_{0}, t)}{S_{COX}^{(0)} (β_{0}, t)}} r_{i} (t) {\frac{{\hat{G}}_{C}^{COX} (t; X_{i})}{{\hat{G}}_{C}^{COX} (T_{i}; X_{i})} - \frac{G_{C}^{COX} (t; X_{i})}{G_{C}^{COX} (T_{i}; X_{i})}} d M_{i}^{1} (t) = \sum_{i = 1}^{n} \int_{u = 0}^{τ} q_{i}^{(1)} (u) d M_{COX, i}^{C} (u) + o_{P} (n^{1 ∕ 2}),

(6.3)

where

q_{i}^{(1)} (u) = - lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{t = T_{j}}^{τ} {Z_{j} - \frac{s_{COX}^{(1)} (β_{0}, t)}{s_{COX}^{(0)} (β_{0}, t)}} w_{j}^{COX} (t) \times [h^{⊺} (t, T_{j}, X_{j}) Ω_{C}^{- 1} {X_{i} - \frac{s_{C}^{(1)} (γ_{0}, u)}{s_{C}^{(0)} (γ_{0}, u)}} + \frac{e^{(γ_{0}^{⊺} X_{j})} I (u \leq t)}{s_{C}^{(0)} (γ_{0}, u)}] d M_{j}^{1} (t) .

Thus,

\frac{1}{\sqrt{n}} U_{COX} (β_{0}) = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} (η_{i}^{COX} + ψ_{i}^{COX}) + o_{P} (1),

(6.4)

where

η_{i}^{COX} = \int_{0}^{τ} {Z_{i} - \frac{s_{COX}^{(1)} (β_{0}, t)}{s_{COX}^{(0)} (β_{0}, t)}} w_{i}^{COX} (t) d M_{i}^{1} (t),

ψ_{i}^{COX} = \int_{0}^{τ} q_{i}^{(1)} (t) d M_{COX, i}^{C} (t) .

and $(η_{i}^{COX} + ψ_{i}^{COX})$ are independent and identically distributed zero-mean variables.

6.4 Consistency and asymptotic normality ${\hat{β}}_{COX}$

Consider the derivative of U_COX(β) with respect to β. Let

I_{COX} (\tilde{β}) = {- \frac{\partial}{\partial β} U_{COX} (β) ∣}_{β = \tilde{β}} = \int_{0}^{τ} [\frac{S_{COX}^{(2)} (\tilde{β}, t)}{S_{COX}^{(0)} (\tilde{β}, t)} - {\frac{S_{COX}^{(1)} (\tilde{β}, t)}{S_{COX}^{(0)} (\tilde{β}, t)}}^{\otimes 2}] \sum_{i = 1}^{n} {\hat{w}}_{i}^{COX} (t) d N_{i}^{1} (t) .

By the same boundedness arguments as used in connection with (6.2), n⁻I(β) converges in probability uniformly in β to a continuous limit such that lim_n→∞ n⁻¹I_COX(β₀ = Ω). From Assumption (C3), Ω is positive definite, and from the previous section we know that n⁻¹U_COX(β₀) = o_P(1). Thus, by the argument in the proof of Theorem 2 of Foutz (1977), ${\hat{β}}_{COX}$ converges in probability to β₀.

Because $U_{COX} ({\hat{β}}_{COX}) = 0$ , it follows that

\begin{matrix} 0 & = \frac{1}{\sqrt{n}} U_{COX} ({\hat{β}}_{COX}) \\ = \frac{1}{\sqrt{n}} U_{COX} (β_{0}) - \frac{1}{n} \int_{0}^{1} I_{COX} {β_{0} + v ({\hat{β}}_{COX} - β_{0})} d v \sqrt{n} ({\hat{β}}_{COX} - β_{0}), \end{matrix}

(6.5)

Then by the consistency of ${\hat{β}}_{COX}, n^{- 1} \int_{0}^{1} I_{COX} {β_{0} + v ({\hat{β}}_{COX} - β_{0})} d v \overset{P}{\to} Ω$ . By (C3), Ω is invertible and (6.5) gives that

\begin{matrix} \sqrt{n} ({\hat{β}}_{COX} - β_{0}) & = Ω^{- 1} {\frac{1}{\sqrt{n}} U_{COX} (β_{0})} + 0_{P} (1) \\ = Ω^{- 1} {\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} (η_{i}^{COX} + ψ_{i}^{COX})} + 0_{P} (1), \end{matrix}

(6.6)

which is essentially a sum of bounded independent and identically distributed variables and thus by the central limit theorem it is asymptotically normally distributed with mean zero and variance matrix

Σ_{β}^{COX} = Ω^{- 1} E {(ξ_{1}^{COX} + ψ_{1}^{COX}) {(ξ_{1}^{COX} + ψ_{1}^{COX})}^{⊺}} Ω^{- 1} .

The variance matrix $\sum_{β}^{COX}$ can be estimated by a plug-in estimator

{\hat{Σ}}_{β}^{COX} = n \sum_{i} {({\hat{W}}_{β, i}^{COX})}^{\otimes 2} = n {I_{COX} ({\hat{β}}_{COX})}^{- 1} {\sum_{i} {({\hat{ξ}}_{i}^{COX} + {\hat{ψ}}_{i}^{COX})}^{\otimes 2}} {I_{COX} ({\hat{β}}_{COX})}^{- 1},

where

{\hat{ξ}}_{i}^{COX} = \int_{0}^{τ} {Z_{i} - \frac{S_{COX}^{(1)} ({\hat{β}}_{COX}, t)}{S_{COX}^{(0)} ({\hat{β}}_{COX}, t)}} {\hat{w}}_{i}^{COX} (t) d {\hat{M}}_{COX, i}^{1} (t)

{\hat{ψ}}_{i}^{COX} = \int_{u = 0}^{τ} {\hat{q}}_{i}^{(1)} (u) d {\hat{M}}_{COX, i}^{C} (u)

{\hat{q}}_{i}^{(1)} (u) = - \sum_{j = 1}^{n} \int_{t = T_{j}}^{τ} {Z_{j} - \frac{S_{COX}^{(1)} (β_{COX}, t)}{S_{COX}^{(0)} ({\hat{β}}_{COX}, t)}} {\hat{w}}_{j}^{COX} (t) \times [{\hat{h}}^{⊺} (t, T_{j}, X_{j}) {I_{C} (\hat{γ})}^{- 1} {X_{i} - \frac{S_{C}^{(1)} (\hat{γ}, u)}{S_{C}^{(0)} (\hat{γ}, u)}} + \frac{e^{({\hat{γ}}^{⊺} X_{j})} I (u \leq t)}{S_{C}^{(0)} (\hat{γ}, u)}] d {\hat{M}}_{COX, j}^{1} (t)

\hat{h} (t, T_{i}, X_{i}) = e^{({\hat{γ}}^{⊺} X_{i})} \int_{u = T_{i}}^{t} {X_{i} - \frac{S_{C}^{(1)} (\hat{γ}, u)}{S_{C}^{(0)} (\hat{γ}, u)}} d {\hat{Λ}}_{C 0} (u)

I_{C} (γ) = - \frac{\partial U_{C} (γ)}{\partial γ} = \sum_{i} \int_{0}^{τ} {\frac{S_{C}^{(2)} (γ, t)}{S_{C}^{(0)} (γ, t)} - {(\frac{S_{C}^{(2)} (γ, t)}{S_{C}^{(0)} (γ, t)})}^{\otimes 2}} d N_{i}^{C} (t)

{\hat{w}}_{i}^{COX} (t) d {\hat{M}}_{COX, i}^{t} (t) = {\hat{w}}_{i}^{COX} (t) d N_{i}^{1} (t) - {\hat{w}}_{i}^{COX} (t) Y_{i}^{1} (t) \exp ({\hat{β}}_{COX}^{⊺} Z_{i}) d {\hat{Λ}}_{10}^{COX} (t)

d {\hat{M}}_{COX, i}^{C} (t) = d N_{i}^{C} (t) - Y_{i} (t) \exp ({\hat{γ}}^{⊺} X_{i}) d {\hat{Λ}}_{C 0} (t) .

6.5 Consistency and weak convergence of ${\hat{Λ}}_{10}^{COX}$

We first note that ${\hat{Λ}}_{10}^{COX} (t)$ is uniformly consistent for $Λ_{10}^{*} (t) = \int_{0}^{t} λ^{10} (u) d u$ . To see this, write

{\hat{Λ}}_{10}^{COX} (t) - Λ_{10}^{*} (t) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{{\hat{w}}_{i}^{COX} (u) d M_{i}^{1} (u)}{S_{COX}^{(0)} (β_{0}, u)} + \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} {\frac{1}{S_{COX}^{(0)} ({\hat{β}}_{COX}, u)} - \frac{1}{S_{COX}^{(0)} (β_{0}, u)}} {\hat{w}}_{i}^{COX} (u) d N_{i}^{1} (u) .

(6.7)

By the arguments in Section 6.2, the consistency of ${\hat{β}}_{COX}$ and the boundedness away from zero and smoothness of $S_{COX}^{(0)}$ , the right-hand side of (6.7) converges to zero in probability, uniformly in t ∊ [0, τ].

Consider the first term on the right-hand side of (6.7). By the same argument as used for establishing (6.3),

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{{\hat{w}}_{i}^{COX} (u) d M_{i}^{1} (u)}{S_{COX}^{(0)} (β_{0}, u)} & = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{w_{i}^{COX} (u) d M_{i}^{1} (u)}{S_{COX}^{(0)} (β_{0}, u)} + \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{{\hat{w}}_{i}^{COX} (u) - w_{i}^{COX} (u)}{S_{COX}^{(0)} (β_{0}, u)} d M_{i}^{1} (s) \\ = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{w_{i}^{COX} (s) s M_{i}^{1} (s)}{s_{COX}^{(0)} (β_{0}, u)} + \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} q_{i}^{(2)} (u, t) d M_{COX, i}^{C} (u) + o_{P} (n^{- 1 ∕ 2}), \end{matrix}

(6.8)

where

q_{i}^{(2)} (u, t) = - \lim_{n \to \infty} \frac{1}{n} \sum_{j = 1}^{n} \int_{v = T_{j}}^{t} \frac{1}{s_{COX}^{(0)} (β_{0}, v)} \times [h^{⊺} (v, T_{j}, X_{j}) Ω_{C}^{- 1} {X_{i} - \frac{s_{C}^{(1)} (γ_{0}, u)}{s_{C}^{(0)} (γ_{0}, u)}} + \frac{e^{(γ_{0}^{⊺} X_{j})} I (u \leq v)}{s_{C}^{(0)} (γ_{0}, u)}] w_{j}^{COX} (v) d M_{j}^{1} (v) .

Similar to (6.5), by a first-order expansion, the last term on the right-hand side of (6.7), is

\begin{matrix} - \int_{0}^{t} {\frac{S_{COX}^{(1)} (β_{0}, u)}{S_{COX}^{(0)} (β_{0}, u)}}^{⊺} \frac{n^{- 1} Σ_{i = 1}^{n} {\hat{w}}_{i}^{COX} (u) d N_{i}^{1} (u)}{S_{COX}^{(0)} (β_{0}, u)} ({\hat{β}}_{COX} - β_{0}) + o_{P} (n^{- 1 ∕ 2}) \\ = h_{Λ}^{⊺} (t) Ω^{- 1} \frac{1}{n} \sum_{i = 1}^{n} (η_{i}^{COX} + ψ_{i}^{COX}) + o_{P} (n^{- 1 ∕ 2}), \end{matrix}

(6.9)

where

h_{Λ} (t) = - \int_{0}^{t} \frac{s_{COX}^{(1)} (β_{0}, u)}{s_{COX}^{(0)} (β_{0}, u)} λ_{10}^{*} (u) d u .

Combining (6.5), (6.8) and (6.9),

\sqrt{n} {{\hat{Λ}}_{10}^{COX} (t) - Λ_{10}^{*} (t)} = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} W_{Λ, i}^{COX} (t) + o_{P} (1),

(6.10)

where

W_{Λ, i}^{COX} (t) = \int_{0}^{t} \frac{w_{i}^{COX} (u) d M_{i}^{1} (u)}{s_{COX}^{(0)} (β_{0}, u)} + \int_{0}^{t} q_{i}^{(2)} (u, t) d M_{COX, i}^{C} (u) + h_{Λ}^{⊺} (t) Ω^{- 1} (η_{i}^{COX} + ψ_{i}^{COX}) .

The $W_{Λ, i}^{COX} (t)$ terms are i.i.d. stochastic processes forming a Donsker class indexed by t. To see this, note that $M_{i}^{1}$ , $M_{COX, i}^{C}$ and $w_{i}^{COX}$ are of bounded variation and thus Donsker and that sums of bounded Donsker classes are Donsker (van der Vaart, 1998, Example 19.20, Example 19.11). Thus, $\sqrt{n} {{\hat{Λ}}_{10}^{COX} (t) - Λ_{10}^{*} (t)}$ converges weakly to a mean-zero Gaussian process with variance $Σ_{Λ_{10}}^{COX} (t) = E {W_{Λ, i}^{COX} (t)}$ . The variance can be estimated by a plug in estimator, ${\hat{Σ}}_{Λ_{10}}^{COX} (t) = n^{- 1} \sum_{i} {{\hat{W}}_{Λ, i}^{COX} (t)}^{2}$ , where

{\hat{W}}_{Λ, i}^{COX} (t) = \int_{0}^{t} \frac{{\hat{w}}_{i}^{COX} (u) d {\hat{M}}_{i}^{1} (u)}{S_{COX}^{(0)} ({\hat{β}}_{COX}, u)} + \int_{0}^{τ} {\hat{q}}_{i}^{(2)} (u, t) d {\hat{M}}_{COX, i}^{C} (u) + {\hat{h}}_{Λ}^{⊺} (t) {\hat{Ω}}^{- 1} ({\hat{η}}_{i}^{COX} + {\hat{ψ}}_{i}^{COX})

{\hat{q}}_{i}^{(2)} (u, t) = - \frac{1}{n} \sum_{j = 1}^{n} \int_{j = 1}^{t} \frac{1}{S_{COX}^{(0)} ({\hat{β}}_{COX}, v)} \times [\hat{h} (v, T_{j}, X_{j}) {\hat{Ω}}_{C}^{- 1} {X_{i} - \frac{S_{C}^{(1)} (\hat{γ}, u)}{S_{C}^{(0)} (\hat{γ}, u)}} + \frac{e^{(\hat{γ} X_{j})} I (u \leq v)}{S_{C}^{(0)} (\hat{γ}, u)}] {\hat{w}}_{j}^{COX} (v) d {\hat{M}}_{j}^{1} (v)

{\hat{h}}_{Λ} (t) = - \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{S_{COX}^{(1)} ({\hat{β}}_{COX}, u)}{{S_{COX}^{(0)} ({\hat{β}}_{COX}, u)}^{2}} {\hat{w}}_{i}^{COX} (u) d N_{i}^{1} (u) .

7 Appendix B

Cumulative incidence functions for cause 1 and 2 are generated from

F_{1} (t) = 1 - {1 - p (1 - e^{- 1})}^{\exp (b e t a Z)}

F_{2} (t) = {(1 - p)}^{\exp (β Z)} {1 - e^{- t \exp (β Z)}}

where 0 < p < 1. Generating data step (need to set parameters p and β first):

Generate covariate Z
Based on p, β and Z, compute P₁ and P₂ (probability of type 1 and type 2 failures): P₁ = F₁(∞) = 1 – (1 – p)^exp(βZ) and P₂ = F₂(∞) = (1 – p)^exp(βZ)
Generate an uniform r.v. U₁ ~ U(0, 1). Generate cause of death indicator, δ, by
$δ = {\begin{matrix} 1, & if U_{1} \leq P_{1} \\ 2, & Otherwise \end{matrix}$
Based on δ = k, k = 1,2, compute the conditional probability
${\tilde{F}}_{k} (t) = P (T_{k} \leq t ∣ δ = k) = \frac{F_{k} (t)}{P (δ = k)} = \frac{F_{k} (t)}{F_{k} (\infty)} = \frac{F_{k} (t)}{P_{k}}$
Generate second uniform r.v. U₂ ~ U(0,1). Then use inverse distribution method to generate T_k based on ${\tilde{F}}_{k} (t)$ .

References

Andersen PK, Gill RD. Cox's regression model for counting processes: A large sample study. The Annals of Statistics. 1982;10:1100–1120. [Google Scholar]
Bickel P, Klassen C, Ritov Y, Wellner J. Efficient and Adaptive Estimation for Semiparametric Models. Springer-Verlag; New York: 1993. [Google Scholar]
Cheng SC, Fine JP, Wei LJ. Prediction of cumulative incidence function under the proportional hazards model. Biometrics. 1998;54:219–228. [PubMed] [Google Scholar]
Cox DR. Regression models and life tables (with discussion) J. Roy. Statist. Soc. Ser. B. 1972;34:187–220. [Google Scholar]
DiRienzo A, Lagakos S. Bias correction for score tests arising from misspecified proportional hazards regression models. Biometrika. 2001a;88:421–434. [Google Scholar]
DiRienzo A, Lagakos S. Effects of model misspecification on tests of no randomized treatment effect arising from Cox's proportional hazards model. J. R. Statist. Soc. B. 2001b;63:745–757. [Google Scholar]
Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94:496–509. [Google Scholar]
Foutz R. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72:147–148. [Google Scholar]
Ghosh D, Lin DY. Marginal regression models for recurrent and terminal events. Statistica Sinica. 2002;12:663–688. [Google Scholar]
Gray RJ. A class of k-sample tests for comparing the cumulative incidence of a competing risk. The Annals of Statistics. 1988;16:1141–1154. [Google Scholar]
Heinze G, Gnant M, Schemper M. Exact log-rank tests for unequal follow-up. Biometrics. 2003;59:1151–1157. doi: 10.1111/j.0006-341x.2003.00132.x. [DOI] [PubMed] [Google Scholar]
Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. Wiley; New York: 2002. [Google Scholar]
Kim H. Cumulative incidence in competing risks data and competing risks regression analysis. Clinical Cancer Research. 2007;13:559–565. doi: 10.1158/1078-0432.CCR-06-1210. [DOI] [PubMed] [Google Scholar]
Kumar S, Zhang M, Li P, Dispenzieri A, Milone G, Lonial S, Krishnan A, Maiolino A, Wirk B, Weiss B, Freytes C, Vogl D, Vesole D, Lazarus H, Meehan K, Hamadani M, Lill M, Callander N, Majhail N, Wiernik P, Nath R, Kamble R, Vij R, Kyle R, Gale R, Hari P. Trends in allogeneic stem cell transplantation for multiple myeloma: a CIBMTR analysis. Blood. 2012;118:1979–1988. doi: 10.1182/blood-2011-02-337329. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lau B, Cole S, Gange S. Competing risk regression models for epidemiologic data. American Journal of Epidemiology. 2009;170:244–256. doi: 10.1093/aje/kwp107. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lin DY, Fleming T, Wei L. Confidence bands for survival curves under the proportional hazards model. Biometrika. 1994;81:73–81. [Google Scholar]
Mai Y. PhD thesis. University of Maryland; 2008. Comparing Survival Distributions in the Presence of Dependent Censoring: Asymptotic Validity and Bias Corrections of the Logrank Test. [Google Scholar]
Pepe MS. Inference for events with dependent risks in multiple endpoint studies. J. Amer. Statist. Assoc. 1991;86:770–778. [Google Scholar]
Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy N, Farewell VT, Breslow N. The analysis of failure time data in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]
Ringdén O, Shrestha S, Tunes da Silva G, Zhang M, Dispenzieri A, Remberger M, Kamble R, Freytes C, O., Gale R, Gibson J, Gupta V, Holmberg L, Lazarus H, McCarthy P, Meehan K, Schouten H, Milone G, Lonial S, Hari P. Effect of acute and chronic GVHD on relapse and survival after reduced-intensity conditioning allogeneic transplantation for myeloma. Bone Marrow Transplantation. 2012;47:831–837. doi: 10.1038/bmt.2011.192. [DOI] [PMC free article] [PubMed] [Google Scholar]
Robins JM, Rotnitzky A. AIDS Epidemiology-Methodological Issues. Birkhäuser; Boston: 1992. Recovery of information and adjustment of dependent censoring using surrogate markers; pp. 24–33. [Google Scholar]
Scheike TH, Zhang M-J. An additive-multiplicative Cox-Aalen model. Scandinavian Journal of Statistics. 2002;28:75–88. [Google Scholar]
Scheike TH, Zhang M-J. Extensions and applications of the Cox-Aalen survival model. Biometrics. 2003;59:1033–1045. doi: 10.1111/j.0006-341x.2003.00119.x. [DOI] [PubMed] [Google Scholar]
Scheike TH, Zhang M-J, Gerds T. Predicting cumulative incidence probability by direct binomial regression. Biometrika. 2008;95:205–20. [Google Scholar]
Scrucca L, Santucci A, Aversa F. Competing risk analysis using R: an easy guide for clinicians. Bone Marrow Transplant. 2007;40(4):381–387. doi: 10.1038/sj.bmt.1705727. [DOI] [PubMed] [Google Scholar]
Shen Y, Cheng SC. Confidence bands for cumulative incidence curves under the additive risk model. Biometrics. 1999;55:1093–1100. doi: 10.1111/j.0006-341x.1999.01093.x. [DOI] [PubMed] [Google Scholar]
Van der Laan MJ, Robins JM. Unified Methods for Censored Longitudinal Data and Causality. Springer; 2003. [Google Scholar]
van der Vaart A. Asymptotic Statistics. Cambridge University Press; 1998. [Google Scholar]
Wolbers M, Koller M, Witteman J, Steyerberg E. Prognostic models with competing risks: methods and application to coronary risk prediction. Epidemiology. 2009;20(4):555–561. doi: 10.1097/EDE.0b013e3181a39056. [DOI] [PubMed] [Google Scholar]
Zhang M-J, Fine J. Summarizing differences in cumulative incidence functions. Statist. Med. 2008;27:4939–49. doi: 10.1002/sim.3339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] Andersen PK, Gill RD. Cox's regression model for counting processes: A large sample study. The Annals of Statistics. 1982;10:1100–1120. [Google Scholar]

[R2] Bickel P, Klassen C, Ritov Y, Wellner J. Efficient and Adaptive Estimation for Semiparametric Models. Springer-Verlag; New York: 1993. [Google Scholar]

[R3] Cheng SC, Fine JP, Wei LJ. Prediction of cumulative incidence function under the proportional hazards model. Biometrics. 1998;54:219–228. [PubMed] [Google Scholar]

[R4] Cox DR. Regression models and life tables (with discussion) J. Roy. Statist. Soc. Ser. B. 1972;34:187–220. [Google Scholar]

[R5] DiRienzo A, Lagakos S. Bias correction for score tests arising from misspecified proportional hazards regression models. Biometrika. 2001a;88:421–434. [Google Scholar]

[R6] DiRienzo A, Lagakos S. Effects of model misspecification on tests of no randomized treatment effect arising from Cox's proportional hazards model. J. R. Statist. Soc. B. 2001b;63:745–757. [Google Scholar]

[R7] Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94:496–509. [Google Scholar]

[R8] Foutz R. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72:147–148. [Google Scholar]

[R9] Ghosh D, Lin DY. Marginal regression models for recurrent and terminal events. Statistica Sinica. 2002;12:663–688. [Google Scholar]

[R10] Gray RJ. A class of k-sample tests for comparing the cumulative incidence of a competing risk. The Annals of Statistics. 1988;16:1141–1154. [Google Scholar]

[R11] Heinze G, Gnant M, Schemper M. Exact log-rank tests for unequal follow-up. Biometrics. 2003;59:1151–1157. doi: 10.1111/j.0006-341x.2003.00132.x. [DOI] [PubMed] [Google Scholar]

[R12] Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. Wiley; New York: 2002. [Google Scholar]

[R13] Kim H. Cumulative incidence in competing risks data and competing risks regression analysis. Clinical Cancer Research. 2007;13:559–565. doi: 10.1158/1078-0432.CCR-06-1210. [DOI] [PubMed] [Google Scholar]

[R14] Kumar S, Zhang M, Li P, Dispenzieri A, Milone G, Lonial S, Krishnan A, Maiolino A, Wirk B, Weiss B, Freytes C, Vogl D, Vesole D, Lazarus H, Meehan K, Hamadani M, Lill M, Callander N, Majhail N, Wiernik P, Nath R, Kamble R, Vij R, Kyle R, Gale R, Hari P. Trends in allogeneic stem cell transplantation for multiple myeloma: a CIBMTR analysis. Blood. 2012;118:1979–1988. doi: 10.1182/blood-2011-02-337329. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Lau B, Cole S, Gange S. Competing risk regression models for epidemiologic data. American Journal of Epidemiology. 2009;170:244–256. doi: 10.1093/aje/kwp107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Lin DY, Fleming T, Wei L. Confidence bands for survival curves under the proportional hazards model. Biometrika. 1994;81:73–81. [Google Scholar]

[R17] Mai Y. PhD thesis. University of Maryland; 2008. Comparing Survival Distributions in the Presence of Dependent Censoring: Asymptotic Validity and Bias Corrections of the Logrank Test. [Google Scholar]

[R18] Pepe MS. Inference for events with dependent risks in multiple endpoint studies. J. Amer. Statist. Assoc. 1991;86:770–778. [Google Scholar]

[R19] Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy N, Farewell VT, Breslow N. The analysis of failure time data in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]

[R20] Ringdén O, Shrestha S, Tunes da Silva G, Zhang M, Dispenzieri A, Remberger M, Kamble R, Freytes C, O., Gale R, Gibson J, Gupta V, Holmberg L, Lazarus H, McCarthy P, Meehan K, Schouten H, Milone G, Lonial S, Hari P. Effect of acute and chronic GVHD on relapse and survival after reduced-intensity conditioning allogeneic transplantation for myeloma. Bone Marrow Transplantation. 2012;47:831–837. doi: 10.1038/bmt.2011.192. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Robins JM, Rotnitzky A. AIDS Epidemiology-Methodological Issues. Birkhäuser; Boston: 1992. Recovery of information and adjustment of dependent censoring using surrogate markers; pp. 24–33. [Google Scholar]

[R22] Scheike TH, Zhang M-J. An additive-multiplicative Cox-Aalen model. Scandinavian Journal of Statistics. 2002;28:75–88. [Google Scholar]

[R23] Scheike TH, Zhang M-J. Extensions and applications of the Cox-Aalen survival model. Biometrics. 2003;59:1033–1045. doi: 10.1111/j.0006-341x.2003.00119.x. [DOI] [PubMed] [Google Scholar]

[R24] Scheike TH, Zhang M-J, Gerds T. Predicting cumulative incidence probability by direct binomial regression. Biometrika. 2008;95:205–20. [Google Scholar]

[R25] Scrucca L, Santucci A, Aversa F. Competing risk analysis using R: an easy guide for clinicians. Bone Marrow Transplant. 2007;40(4):381–387. doi: 10.1038/sj.bmt.1705727. [DOI] [PubMed] [Google Scholar]

[R26] Shen Y, Cheng SC. Confidence bands for cumulative incidence curves under the additive risk model. Biometrics. 1999;55:1093–1100. doi: 10.1111/j.0006-341x.1999.01093.x. [DOI] [PubMed] [Google Scholar]

[R27] Van der Laan MJ, Robins JM. Unified Methods for Censored Longitudinal Data and Causality. Springer; 2003. [Google Scholar]

[R28] van der Vaart A. Asymptotic Statistics. Cambridge University Press; 1998. [Google Scholar]

[R29] Wolbers M, Koller M, Witteman J, Steyerberg E. Prognostic models with competing risks: methods and application to coronary risk prediction. Epidemiology. 2009;20(4):555–561. doi: 10.1097/EDE.0b013e3181a39056. [DOI] [PubMed] [Google Scholar]

[R30] Zhang M-J, Fine J. Summarizing differences in cumulative incidence functions. Statist. Med. 2008;27:4939–49. doi: 10.1002/sim.3339. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

A Proportional Hazards Regression Model for the Sub-distribution with Covariates Adjusted Censoring Weight for Competing Risks Data

PENG HE

FRANK ERIKSSON

THOMAS H SCHEIKE

MEI-JIE ZHANG

Abstract

1 Introduction

2 Data and covariate adjusted censoring weight

3 Simulations

3.1 Study 1

Table 1.

Figure 1.

3.2 Study 2

Table 2.

Figure 2.

4 Real data examples

4.1 Example 1

Table 3.

Table 4.

Figure 3.

4.2 Example 2

5 Concluding remarks

6 Appendix A

6.1 Assumptions

6.2 Preliminaries

6.3 Asymptotic normality of n^−1/2U_COX(β₀)

6.4 Consistency and asymptotic normality ${\hat{β}}_{COX}$

6.5 Consistency and weak convergence of ${\hat{Λ}}_{10}^{COX}$

7 Appendix B

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A Proportional Hazards Regression Model for the Sub-distribution with Covariates Adjusted Censoring Weight for Competing Risks Data

PENG HE

FRANK ERIKSSON

THOMAS H SCHEIKE

MEI-JIE ZHANG

Abstract

1 Introduction

2 Data and covariate adjusted censoring weight

3 Simulations

3.1 Study 1

Table 1.

Figure 1.

3.2 Study 2

Table 2.

Figure 2.

4 Real data examples

4.1 Example 1

Table 3.

Table 4.

Figure 3.

4.2 Example 2

5 Concluding remarks

6 Appendix A

6.1 Assumptions

6.2 Preliminaries

6.3 Asymptotic normality of n−1/2UCOX(β0)

6.4 Consistency and asymptotic normality β^COX

6.5 Consistency and weak convergence of Λ^10COX

7 Appendix B

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

6.3 Asymptotic normality of n^−1/2U_COX(β₀)

6.4 Consistency and asymptotic normality ${\hat{β}}_{COX}$

6.5 Consistency and weak convergence of ${\hat{Λ}}_{10}^{COX}$