Joint Inference for Competing Risks Survival Data

Gang Li; Qing Yang

doi:10.1080/01621459.2015.1093942

Author manuscript; available in PMC: 2019 Nov 19.

Published in final edited form as: J Am Stat Assoc. 2016 Oct 18;111(515):1289–1300. doi: 10.1080/01621459.2015.1093942


PMCID: PMC6863485 NIHMSID: NIHMS986715 PMID: 31745375

Abstract

This article develops joint inferential methods for the cause-specific hazard function and the cumulative incidence function of a specific type of failure to assess the effects of a variable on the time to the type of failure of interest in the presence of competing risks. Joint inference for the two functions is needed in practice because (i) they describe different characteristics of a given type of failure, (ii) they do not uniquely determine each other, and (iii) the effects of a variable on the two functions can be different and one often does not know which effects are to be expected. We study both the group comparison problem and the regression problem. We also discuss joint inference for other related functions. Our simulations show that our joint tests can be considerably more powerful than the Bonferroni method, which has important practical implications for the analysis and design of clinical studies with competing risks data. We illustrate our method using a Hodgkin disease dataset and a lymphoma dataset. Supplementary materials for this article are available online.

Keywords: Cause-specific hazard, Censoring, Cox’s model, Cumulative incidence, Log-rank test, Subdistribution hazard

1. Introduction

Competing risks failure time data arise commonly in clinical trials, reliability testing, and other fields. For instance, in a clinical trial, one may be interested in time to death due to a particular disease, but a patient can also die from other competing diseases that are potentially positively correlated with the disease of interest. Competing risks can also be negatively correlated with the event time of interest. For example, in a kidney transplantation program, patients who are ineligible for transplantation for reasons such as being overweight are put on a waiting list until they become eligible (see, e.g., Sancho et al. 2007). An important outcome variable is the waiting time to become eligible for transplantation. In this case, death before becoming eligible for transplantation is a competing risk event that is potentially negatively correlated with the waiting time. More examples of competing risks failure time data can be found in Prentice et al. (1978), Pintilie (2006), Gichangi and Vach (2005), and Putter, Fiocco, and Geskus (2007), and the references therein. There is a broad literature on statistical methods for competing risks failure time data. Group comparison of a specific type of failure has been studied using either the cause-specific hazard (Prentice et al. 1978; Lindkvist and Belyaev 1998; Kulathinal and Gasbarra 2002) or the cumulative incidence (Gray 1988; Pepe and Mori 1993; Bajorunaite and Klein 2007). Methods to compare failures across failure types have been developed with respect to either the cause-specific hazard, or the cumulative incidence, or both (Aly, Kochar, and McKeague 1994; Sun and Tiwari 1995; Lam 1998; Luo and Turnbull 1999). Tiwari, Kulasekera, and Park (2006) proposed a test to check equality of cause-specific hazards across all failure types and groups. For regression analysis of competing risks failure time data, Prentice et al. (1978), Lagakos (1978), Holt (1978), Cox and Oakes (1984, chap. 9), Larson (1984), and Lunn and McNeil (1995) studied proportional cause-specific hazards models. Fine and Gray (1999) introduced a proportional subdistribution hazards model for the cumulative incidence function. Fine (1999, 2001), Klein and Andersen (2005), and Gerds, Scheike, and Andersen (2012) used transformation models to directly model the cumulative incidence function. Klein (2006) discussed additive models for both the cause-specific hazard and the cumulative incidence function. Comprehensive surveys of statistical methods for competing risks survival data and further references can be found in Beyersmann et al. (2007), Latouche et al. (2007), and Haller, Schmidt, and Ulm (2012).

In this article, we focus on the problem of assessing the effects of a variable (treatment or covariate) on the time to a particular type of failure. For convenience, we assume hereafter that there are only two types of failure, where Type 1 represents the failure type of interest and Type 2 includes all other competing risks. As discussed earlier, there are mainly two approaches to this problem based on either the cause-specific hazard function or the cumulative incidence function. The cause-specific hazard function for Type 1 failure is defined as

λ_{1} (t) = \lim_{d t ↓ 0} P (t \leq T < t + d t, D = 1 | T \geq t) / d t, t > 0,

the instantaneous risk for Type 1 failure at time t given that the subject is at risk just prior to t, where T is the continuous failure time with multiple failure types and D is the failure type. For example, Prentice et al. (1978) showed that the standard Cox (1972, 1975) regression method can be used to study the effects of a variable on the cause-specific hazard λ₁(t) by treating other types of failures as independent right censoring events. The cumulative incidence function is defined as F₁(t) = P(T ≤ t, D = 1), t > 0, the cumulative incidence rate of Type 1 failure by time t, which can be uniquely characterized by the following subdistribution hazard:

{\tilde{λ}}_{1} (t) = \lim_{d t ↓ 0} P (t \leq T < t + d t, D = 1 | T \geq t \cup (T < t \cap D \neq 1)) / d t = - d \log \{1 - F_{1} (t)\} / d t .

In particular, Gray (1988) developed a class of nonparametric tests to compare the cumulative incidence function of a given type of failure between different groups and Fine and Gray (1999) introduced a proportional subdistribution hazards model for the regression problem.

Despite the extensive literature on this topic, there is still confusion among practitioners as to which method should be used when studying the effects of a variable on Type 1 failure. We argue that joint inference for both λ₁(t) and F₁(t) should be made. First, these two quantities describe different characteristics of Type 1 failure: λ₁(t) represents the instantaneous Type 1 failure rate at time t given survival to t, whereas F₁(t) summarizes the prevalence or cumulative incidence of Type 1 failure over the time interval [0, t]. Second, λ₁(t) and F₁(t) do not uniquely determine each other except when there is only one type of failure (J = 1, where J denotes the number of failure types). It can be shown that $F_{1} (t) = \int_{0}^{t} S (u) λ_{1} (u) d u$ , where S(u) = P(T > u) is the all-cause survival function. Thus F₁(t) depends not only on λ₁(t), but also on the other cause-specific hazards through the all-cause survival function S(t). Finally, the effects of a variable on λ₁(t) can be different from its effects on F₁(t) (Gray 1988; Fine and Gray 1999), and one often does not know which effects are to be expected. To the best of our knowledge, no formal joint inference procedure for these quantities is available in the literature. Although the Bonferroni method provides a straightforward solution, it can be severely underpowered, as demonstrated later in Sections 4 and 5.
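As a worked illustration of the second point, the sketch below evaluates the identity $F_{1} (t) = \int_{0}^{t} S (u) λ_{1} (u) d u$ in the special case of constant cause-specific hazards. The constant-hazard setting and the numerical values are assumptions made here purely for illustration; they are not taken from the article. In that case S(u) = exp{−(λ₁ + λ₂)u}, so F₁(t) = λ₁/(λ₁ + λ₂) × [1 − exp{−(λ₁ + λ₂)t}], and two populations with the same λ₁ but different λ₂ have different cumulative incidence functions.

```python
import math


def cif_closed_form(lam1, lam2, t):
    """F_1(t) when both cause-specific hazards are constant (illustrative assumption)."""
    total = lam1 + lam2
    return lam1 / total * (1.0 - math.exp(-total * t))


def cif_numeric(lam1, lam2, t, steps=20_000):
    """Midpoint-rule approximation of F_1(t) = int_0^t S(u) * lam1 du."""
    h = t / steps
    acc = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        survival = math.exp(-(lam1 + lam2) * u)  # all-cause survival S(u)
        acc += survival * lam1 * h
    return acc


# Same Type 1 hazard, different competing (Type 2) hazard: hypothetical values.
f_low_competition = cif_closed_form(0.5, 0.1, 2.0)
f_high_competition = cif_closed_form(0.5, 1.0, 2.0)
print(f_low_competition, f_high_competition)
```

The cumulative incidence at t = 2 is smaller under the larger competing hazard even though λ₁ is identical, which is why a covariate can leave λ₁(t) unchanged and still alter F₁(t).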

The primary purpose of this article is to develop joint inference procedures to assess the effects of a variable on λ₁(t) and F₁(t) simultaneously. We allow independent right censoring in addition to competing risks. We first consider the two-sample comparison problem with respect to both λ₁(t) and F₁(t). By establishing the asymptotic joint distribution of the weighted log-rank test statistic for λ₁(t) and the Gray (1988) test statistic for F₁(t), we derive two-sample joint tests for λ₁(t) and F₁(t). We then extend our method to a regression setting based on Cox-type models for λ₁(t) and ${\tilde{λ}}_{1} (t)$ . We also discuss joint inference for other related quantities.

In Section 2, we first review the weighted log-rank test for group comparison of λ₁(t) and the Gray (1988) test for F₁(t). Then, we develop joint test procedures for group comparisons of both λ₁(t) and F₁(t). We also discuss joint tests for other equivalent pairs, including λ₁(t) with the all-cause hazard, and λ₁(t) with the cause-specific hazard for other failure types. Section 3 develops joint regression analysis methods for λ₁(t) and ${\tilde{λ}}_{1} (t)$ under Cox-type regression models. Section 4 presents simulation results to evaluate the proposed methods and compare them with the Bonferroni method. In Section 5, we illustrate our methods using a Hodgkin disease dataset and a lymphoma dataset. Section 6 gives some further remarks. The proofs of the theorems and additional simulation results are provided in the Appendix in the supplementary material.

2. Two-Sample Joint Tests for Competing Risks Data

Suppose that there are two independent groups of subjects. Let T_ik, D_ik, and C_ik denote the continuous failure time, the type of failure, and the censoring time, respectively, for subject i in group k, i = 1, …, n_k, k = 1, 2. Assume that the triplets (T_ik, D_ik, C_ik) for different subjects within each group are independent and identically distributed and that the censoring time C_ik is independent of the failure time T_ik. The two groups are allowed to have different censoring distributions. For group k (k = 1, 2), one observes right-censored competing risks failure time data {(X_ik, δ_ik), i = 1, …, n_k}, where X_ik = min(T_ik, C_ik) and δ_ik = D_ik I(T_ik ≤ C_ik), so that δ_ik = 0 indicates a censored observation. Let S_k(t) = P(T_ik > t) and $S_{k}^{c} (t) = P (C_{i k} > t)$ . For group k (k = 1, 2), let λ_1k(t), F_1k(t), and ${\tilde{λ}}_{1 k} (t)$ denote the cause-specific hazard function, the cumulative incidence function, and the subdistribution hazard function, respectively, for Type 1 failure. We develop nonparametric tests for the following null hypothesis,

H_{0} : λ_{11} (t) = λ_{12} (t) and F_{11} (t) = F_{12} (t) for all 0 < t < τ,

(1)

where τ is some prespecified fixed time.

2.1. Preliminaries

We first review the two-sample weighted log-rank test for the cause-specific hazard and the Gray (1988) two-sample test for the cumulative incidence for Type 1 failure. These tests will be used as building blocks to develop joint tests for the hypothesis (1).

2.1.1. Two-Sample Tests for Cause-Specific Hazard

It is now well known that the standard (weighted) log-rank test (Peto and Peto 1972; Andersen et al. 1982) for right censored failure time data can be applied to test

H_{0} : λ_{11} (t) = λ_{12} (t) for all 0 < t < τ,

(2)

by treating all other competing risks as independent right censoring (Tsiatis 1975; Prentice et al. 1978; Lindkvist and Belyaev 1998). Specifically, let $N_{j k} (t) = \sum_{i = 1}^{n_{k}} I (X_{i k} \leq t, δ_{i k} = j)$ be the counting process for the number of observed type j failures in group k by time t, and $Y_{k} (t) = \sum_{i = 1}^{n_{k}} I (X_{i k} \geq t)$ be the at-risk process giving the number of subjects in group k who are at risk just prior to time t, k = 1, 2. Let $N_{j \cdot} (t) = \sum_{k = 1}^{2} N_{j k} (t)$ and $Y_{\cdot} (t) = \sum_{k = 1}^{2} Y_{k} (t)$ . The weighted log-rank test statistic for (2) is defined as

U_{1 k} = \int_{0}^{τ} W_{1} (t) Y_{k} (t) \left\{\frac{d N_{1 k} (t)}{Y_{k} (t)} - \frac{d N_{1 \cdot} (t)}{Y_{\cdot} (t)}\right\},

(3)

where W₁(t) is a predictable weight function that converges in probability to some deterministic function w₁(t) as n → ∞, and τ is the largest time at which all of the groups have at least one subject at risk. It can be shown that under the null hypothesis (2), $n^{- 1 / 2} U_{11} / \hat{σ}$ has a standard normal limiting distribution where

{\hat{σ}}^{2} = n^{- 1} \int_{0}^{τ} W_{1}^{2} (t) \frac{Y_{1} (t) Y_{2} (t)}{Y_{\cdot} (t)} \frac{d N_{1 \cdot} (t)}{Y_{\cdot} (t)} .

(4)

This leads to an asymptotic χ² test or a Z test for (2).
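A minimal sketch of the Z test based on (3) and (4) is given below, assuming the unit weight W₁(t) = 1 and a flat-list data layout. The function name, the argument layout, and the toy data are hypothetical choices made for illustration; failures from the other cause enter only through the at-risk counts, that is, they are treated as censored.

```python
import math


def cause_specific_logrank(times, causes, groups, cause=1):
    """Z statistic n^{-1/2} U_11 / sigma_hat from (3) and (4) with W_1(t) = 1.

    times  : observed times X_i
    causes : observed failure type delta_i (0 = censored, 1 or 2 = failure type)
    groups : group label of each subject (1 or 2)
    """
    event_times = sorted({t for t, c in zip(times, causes) if c == cause})
    u = 0.0    # U_11 in (3)
    var = 0.0  # n * sigma_hat^2 in (4); the factor n cancels in the ratio
    for t in event_times:
        y1 = sum(1 for x, g in zip(times, groups) if x >= t and g == 1)
        y2 = sum(1 for x, g in zip(times, groups) if x >= t and g == 2)
        if y1 == 0 or y2 == 0:
            break  # past tau: one group no longer has anyone at risk
        d1 = sum(1 for x, c, g in zip(times, causes, groups)
                 if x == t and c == cause and g == 1)
        d = sum(1 for x, c in zip(times, causes) if x == t and c == cause)
        y = y1 + y2
        u += d1 - y1 * d / y               # dN_11 - Y_1 dN_1. / Y.
        var += (y1 * y2 / y) * (d / y)     # (Y_1 Y_2 / Y.) dN_1. / Y.
    if var == 0.0:
        raise ValueError("no Type 1 failures while both groups are at risk")
    return u / math.sqrt(var)


# Hypothetical toy data: 8 subjects, 4 per group.
toy_times = [1, 2, 3, 4, 5, 6, 7, 8]
toy_causes = [1, 2, 1, 0, 1, 1, 2, 1]
toy_groups = [1, 1, 1, 1, 2, 2, 2, 2]
z_stat = cause_specific_logrank(toy_times, toy_causes, toy_groups)
print(z_stat)
```

Swapping the two group labels changes only the sign of the statistic, which is a quick sanity check on any implementation of (3).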

2.1.2. Two-Sample Tests for Cumulative Incidence Function

Gray (1988) developed a class of K-sample nonparametric tests to compare the cumulative incidence between different groups. Consider the following null hypothesis,

H_{0} : F_{11} (t) = F_{12} (t) for all 0 < t < τ .

(5)

The Gray (1988) nonparametric test statistic is defined as

{\tilde{U}}_{1 k} = \int_{0}^{τ_{k}} \tilde{W} (t) R_{k} (t) \left\{\frac{d N_{1 k} (t)}{R_{k} (t)} - \frac{d N_{1 \cdot} (t)}{R_{\cdot} (t)}\right\},

(6)

where $\tilde{W} (t)$ is a predictable weight function that converges in probability to some deterministic function $\tilde{w} (t)$ as n → ∞, $R_{k} (t) = I (τ_{k} \geq t) Y_{k} (t) {\hat{G}}_{1 k} (t -) / {\hat{S}}_{k} (t -)$ can be considered as an adjusted risk set size for group k at time t, ${\hat{G}}_{j k} (t -)$ is the left-hand limit of the Kaplan–Meier (1958) estimate of G_jk(t) = 1 − F_jk(t), ${\hat{S}}_{k} (t -)$ is the left-hand limit of the Kaplan–Meier estimate of S_k(t), τ_k is some fixed time point satisfying $S_{k} (τ_{k}) S_{k}^{c} (τ_{k}) > 0$ , and R_⋅(t) represents the same quantity as R_k(t) using the pooled sample. Gray (1988) showed that under (5), $n^{- 1 / 2} {\tilde{U}}_{11} / \hat{\tilde{σ}}$ has a standard normal limiting distribution, where

{\hat{\tilde{σ}}}^{2} = \sum_{k = 1}^{2} n^{- 1} \left\{\int_{0}^{τ_{1}} {\hat{a}}_{k}^{2} (t) {\hat{h}}_{k}^{- 1} (t) {\hat{h}}_{\cdot}^{- 1} (t) d N_{1 \cdot} (t) + \int_{0}^{τ_{1}} {\hat{b}}_{2 k}^{2} (t) {\hat{h}}_{k}^{- 2} (t) d N_{2 k} (t)\right\},

(7)

with

{\hat{a}}_{k} (t) = {\hat{d}}_{1 k} (t) + {\hat{b}}_{1 k} (t),

{\hat{b}}_{j k} (t) = [I (j = 1) - {\hat{G}}_{1 \cdot} (t) / {\hat{S}}_{k} (t)] [{\hat{c}}_{k} (τ_{1}) - {\hat{c}}_{k} (t)],

{\hat{c}}_{k} (t) = \int_{0}^{t} {\hat{d}}_{1 k} (u) {\hat{G}}_{1 \cdot} {(u -)}^{- 1} {\hat{h}}_{\cdot}^{- 1} (u) d N_{1 \cdot} (u),

{\hat{d}}_{j k} (t) = n^{- 1} I (j = 1) \tilde{W} (t) R_{1} (t) \times [I (k = 1) - {\hat{h}}_{k} (t) / {\hat{h}}_{\cdot} (t)] / {\hat{G}}_{1 \cdot} (t -),

{\hat{h}}_{k} (t) = I (t \leq τ_{k}) n^{- 1} Y_{k} (t) / {\hat{S}}_{k} (t -),

{\hat{h}}_{\cdot} (t) = I (t \leq \max (τ_{1}, τ_{2})) n^{- 1} Y_{\cdot} (t) / {\hat{S}}_{\cdot} (t -),

{\hat{G}}_{1 \cdot} (t) = 1 - {\hat{F}}_{1 \cdot} (t) = 1 - n^{- 1} \int_{0}^{t} {\hat{h}}_{\cdot}^{- 1} (u) d N_{1 \cdot} (u) .

(8)

This gives an asymptotic χ² test for (5) based on $n^{- 1} {\tilde{U}}_{11}^{2} / {\hat{\tilde{σ}}}^{2}$ or a Z test based on $n^{- 1 / 2} {\tilde{U}}_{11} / \hat{\tilde{σ}}$ .
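To make the building blocks of (6) and (8) concrete, the following sketch computes, for a single group, the cumulative incidence estimate and the adjusted risk set size R(t) = Y(t) Ĝ₁(t−)/Ŝ(t−). It is a simplified illustration rather than a full implementation of the Gray (1988) test: the indicator I(τ_k ≥ t) is omitted, the variance (7) is not computed, and the function name and toy data are hypothetical. The increment used for the cumulative incidence, Ŝ(t−) dN₁(t)/Y(t), agrees with the last display of (8), because n⁻¹ ĥ⁻¹(t) = Ŝ(t−)/Y(t).

```python
def cif_and_adjusted_risk(times, causes, cause=1):
    """Rows (t, F1_hat(t), R(t)) at each distinct observed time of one group.

    times  : observed times X_i
    causes : observed failure type delta_i (0 = censored, 1 or 2 = failure type)
    """
    surv_left = 1.0  # S_hat(t-): Kaplan-Meier estimate of all-cause survival
    f1_left = 0.0    # F1_hat(t-): cumulative incidence just before t
    rows = []
    for t in sorted(set(times)):
        y = sum(1 for x in times if x >= t)  # Y(t), at least 1 by construction
        d_all = sum(1 for x, c in zip(times, causes) if x == t and c != 0)
        d1 = sum(1 for x, c in zip(times, causes) if x == t and c == cause)
        adjusted_risk = y * (1.0 - f1_left) / surv_left  # R(t)
        f1 = f1_left + surv_left * d1 / y                # F1_hat(t)
        rows.append((t, f1, adjusted_risk))
        surv_left *= 1.0 - d_all / y                     # update to S_hat(t)
        f1_left = f1
    return rows


# Hypothetical uncensored toy sample: failure types alternate 1, 2, 1, 2.
rows = cif_and_adjusted_risk([1, 2, 3, 4], [1, 2, 1, 2])
for row in rows:
    print(row)
```

Without censoring, R(t) reduces to the number of subjects who have not yet experienced a Type 1 failure before t, so subjects who failed from Type 2 remain in the adjusted risk set. This is the sense in which R_k(t) is a risk set for the subdistribution hazard.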

Examples of the weight functions in the above test statistics have been discussed by a number of authors (Gehan 1965; Breslow 1970; Peto and Peto 1972; Kalbfleisch 1980; Gray 1988). A nice survey of various weight functions and their applications can be found in Klein and Moeschberger (2003, chap. 7.2).

2.2. Joint Two-Sample Tests for Cause-Specific Hazard and Cumulative Incidence Function

To test the joint null hypothesis (1), we first establish the joint limiting distribution of U₁₁ and ${\tilde{U}}_{11}$ below.

Theorem 1.

Let U₁₁ and ${\tilde{U}}_{11}$ be defined by (3) and (6). Under the null hypothesis (1), $n^{- 1 / 2} (U_{11}, {\tilde{U}}_{11})$ is asymptotically bivariate normal with mean 0 and variance-covariance matrix $Σ^{(1)} = (σ_{i j}^{(1)})$ as n → ∞, where Σ⁽¹⁾ is defined in (A.1) and (A.4) of Appendix A.1. Furthermore, $σ_{11}^{(1)}$ and $σ_{22}^{(1)}$ are consistently estimated by (4) and (7), respectively, and the covariance $σ_{12}^{(1)}$ is consistently estimated by

{\hat{σ}}_{12}^{(1)} = n^{- 1} {\int_{0}^{τ} W_{1} (t) \frac{Y_{2} (t)}{Y_{\cdot} (t)} {\hat{V}}_{11} (t) + {\hat{c}}_{1} (τ) \int_{0}^{τ} W_{1} (t) \frac{Y_{2} (t)}{Y_{\cdot} (t)} {\hat{E}}_{11} (t) {\hat{h}}_{1}^{- 1} (t)} Y_{1} (t) d {\hat{Λ}}_{11} (t) + n^{- 1} {\int_{0}^{τ} W_{1} (t) \frac{Y_{1} (t)}{Y_{\cdot} (t)} {\hat{V}}_{12} (t) + {\hat{c}}_{2} (τ) \int_{0}^{τ} W_{1} (t) \frac{Y_{1} (t)}{Y_{\cdot} (t)} {\hat{E}}_{12} (t) {\hat{h}}_{2}^{- 1} (t)} Y_{2} (t) d {\hat{Λ}}_{12} (t),

(9)

where ${\hat{Λ}}_{1 k} (τ) = \int_{0}^{τ} Y_{k}^{- 1} (t) d N_{1 k} (t)$ , ${\hat{V}}_{j k} (t) = [{\hat{d}}_{j k} (t) - {\hat{E}}_{j k} (t) {\hat{c}}_{k} (t)] {\hat{h}}_{k}^{- 1} (t)$ , ${\hat{E}}_{j k} (t) = I (j = 1) - {\hat{G}}_{1 k} (t -) / {\hat{S}}_{k} (t -)$ , and other quantities are defined in (8).

2.2.1. Chi-Square Joint Test for (1)

Define

X^{2} = n^{- 1} (U_{11}, {\tilde{U}}_{11}) ({\hat{Σ}}^{(1)})^{- 1} \begin{pmatrix} U_{11} \\ {\tilde{U}}_{11} \end{pmatrix} .

It follows from Theorem 1 that under (1), X² asymptotically follows a chi-square distribution with 2 degrees of freedom. This leads to the following chi-square test for (1):

Reject (1) at level α if X^{2} > χ_{2}^{2} (α),

where $χ_{2}^{2} (α)$ is the upper α quantile, that is, the 100(1 − α)th percentile, of the $χ_{2}^{2}$ distribution.
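The chi-square joint test can be sketched as follows, assuming the two statistics and the three entries of the estimated 2 × 2 covariance matrix have already been computed from (4), (7), and (9). The function name and the numbers in the usage line are hypothetical. The p-value uses the fact that a chi-square variable with 2 degrees of freedom has survival function exp(−x/2).

```python
import math


def joint_chi_square(u, u_tilde, s11, s22, s12, n):
    """Return (X^2, p-value) for the 2-df chi-square joint test.

    u, u_tilde    : the statistics U_11 and U~_11
    s11, s22, s12 : entries of the estimated covariance matrix of
                    n^{-1/2} (U_11, U~_11)
    n             : total sample size
    """
    det = s11 * s22 - s12 * s12
    if det <= 0.0:
        raise ValueError("estimated covariance matrix is not positive definite")
    # Quadratic form (u, u~) Sigma^{-1} (u, u~)^T via the explicit 2 x 2 inverse.
    quad = (s22 * u * u - 2.0 * s12 * u * u_tilde + s11 * u_tilde * u_tilde) / det
    x2 = quad / n
    p_value = math.exp(-x2 / 2.0)  # P(chi-square with 2 df > x2)
    return x2, p_value


# Hypothetical inputs chosen so that the arithmetic is easy to check by hand.
x2, p_value = joint_chi_square(u=1.0, u_tilde=2.0, s11=1.0, s22=1.0, s12=0.0, n=1)
print(x2, p_value)
```

When the estimated covariance is zero, X² is simply the sum of the two squared Z statistics; a nonzero covariance changes the quadratic form and is what distinguishes the joint test from combining two separate tests.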

Rejection of (1) by the above chi-square test implies that there is a difference in either the cause-specific hazard or the cumulative incidence between the two groups. However, it does not indicate which individual quantity differs. The following maximum test provides an alternative joint test that allows one to draw a conclusion on each individual quantity. It also allows one-sided tests.

2.2.2. Maximum Joint Test for (1)

Define

T^{*} = \max (| Z_{11} |, | {\tilde{Z}}_{11} |),

where $Z_{11} = n^{- 1 / 2} U_{11} / \sqrt{{\hat{σ}}_{11}^{(1)}}$ and ${\tilde{Z}}_{11} = n^{- 1 / 2} {\tilde{U}}_{11} / \sqrt{{\hat{σ}}_{22}^{(1)}}$ . We reject (1) if the observed T* is large. It follows from Theorem 1 that for large samples, the distribution of (Z₁₁, ${\tilde{Z}}_{11}$ ) can be approximated by the bivariate normal distribution $N ({(0, 0)}^{T}, (1, 1, \hat{ρ}))$ with zero means, unit variances, and correlation $\hat{ρ} = \frac{{\hat{σ}}_{12}^{(1)}}{\sqrt{{\hat{σ}}_{11}^{(1)}} \sqrt{{\hat{σ}}_{22}^{(1)}}}$ . Thus, we can approximate the distribution of T* using Monte Carlo simulation. Specifically, we generate N pairs of random variables from the bivariate normal distribution $N ({(0, 0)}^{T}, (1, 1, \hat{ρ}))$ . For the lth generated pair, compute the maximum absolute value, and denote it by $T_{l}^{*}$ . Let T_α be the 100(1 − α)th sample percentile of $T_{1}^{*}, \dots, T_{N}^{*}$ . Reject the null hypothesis (1) at level α if T* > T_α.
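The Monte Carlo step described above can be sketched as follows. The function name, the default number of draws, and the fixed seed are illustrative assumptions; the correlated pair is generated from two independent standard normal draws, which is one standard way to sample a bivariate normal with unit variances.

```python
import math
import random


def max_test_critical_value(rho, alpha=0.05, n_draws=100_000, seed=12345):
    """Monte Carlo approximation of T_alpha for T* = max(|Z_11|, |Z~_11|).

    rho is the estimated correlation between the two Z statistics and must
    lie in [-1, 1].
    """
    rng = random.Random(seed)
    scale = math.sqrt(1.0 - rho * rho)
    draws = []
    for _ in range(n_draws):
        z1 = rng.gauss(0.0, 1.0)
        # z2 has unit variance and correlation rho with z1.
        z2 = rho * z1 + scale * rng.gauss(0.0, 1.0)
        draws.append(max(abs(z1), abs(z2)))
    draws.sort()
    # Order statistic corresponding to the 100(1 - alpha)th sample percentile.
    index = min(n_draws - 1, math.ceil((1.0 - alpha) * n_draws) - 1)
    return draws[index]


crit_independent = max_test_critical_value(rho=0.0)
crit_identical = max_test_critical_value(rho=1.0)
print(crit_independent, crit_identical)
```

Two limiting cases help to check the output. With ρ̂ = 1 the two statistics coincide and the critical value should be close to the usual two-sided normal value of about 1.96; with ρ̂ = 0 it should be close to 2.24, only slightly below the Bonferroni value. The gain over Bonferroni therefore grows with the magnitude of the estimated correlation.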

Remark 1.

It is straightforward to modify the maximum joint test procedure to test one-sided alternative(s) based on either $T^{*} = \max (Z_{11}, {\tilde{Z}}_{11})$ , $T^{*} = \max (| Z_{11} |, {\tilde{Z}}_{11})$ , or $T^{*} = \max (Z_{11}, | {\tilde{Z}}_{11} |)$ as deemed appropriate.

Remark 2.

(K-Sample Joint Tests) The above two-sample joint tests can be easily extended to the K-sample problem (K ≥ 2) for the following null hypothesis

H_{0} : λ_{11} (t) = \dots = λ_{1 K} (t) and F_{11} (t) = \dots = F_{1 K} (t) for all 0 < t < τ,

(10)

where τ is some prespecified fixed time. Similar to Theorem 1, it can be shown that under the null hypothesis (10), $V_{n} = n^{- 1 / 2} (U_{11}, \dots, U_{1, K - 1}, {\tilde{U}}_{11}, \dots, {\tilde{U}}_{1, K - 1})$ has an asymptotic multivariate normal distribution with mean 0 and variance-covariance matrix Σ*, where Σ* is defined as the limit of the variance-covariance matrix of V_n and can be consistently estimated as follows. From Kulathinal and Gasbarra (2002), we have $\hat{cov} (n^{- 1 / 2} U_{1 k}, n^{- 1 / 2} U_{1 k^{'}}) = - n^{- 1} \int_{0}^{τ} W_{1}^{2} (t) \frac{Y_{k} (t) Y_{k^{'}} (t)}{Y_{\cdot} (t)} d {\hat{Λ}}_{1} (t)$ , where k, k′ = 1, …, K. The estimate $\hat{cov} (n^{- 1 / 2} {\tilde{U}}_{1 k}, n^{- 1 / 2} {\tilde{U}}_{1 k^{'}})$ is given by Equation (2.10) on page 1146 of Gray (1988). Similar to the proof of Theorem 1,

\hat{cov} (n^{- 1 / 2} U_{1 k}, n^{- 1 / 2} {\tilde{U}}_{1 k^{'}}) = n^{- 1} \int_{0}^{τ} (W_{1} (t) {\hat{V}}_{1 k^{'} k} (t) + {\hat{c}}_{k^{'} k} (τ) \int_{0}^{τ} W_{1} (t) {\hat{E}}_{1 k} (t) {\hat{h}}_{k}^{- 1} (t)) \times Y_{k} (t) d {\hat{Λ}}_{1 k} (t) + n^{- 1} \sum_{l = 1}^{K} (\int_{0}^{τ} W_{1} (t) \frac{Y_{k} (t)}{Y_{\cdot} (t)} {\hat{V}}_{1 k^{'} l} (t) + {\hat{c}}_{k^{'} l} (τ) \int_{0}^{τ} W_{1} (t) \frac{Y_{k} (t)}{Y_{\cdot} (t)} {\hat{E}}_{1 l} (t) {\hat{h}}_{l}^{- 1} (t)) Y_{l} (t) d {\hat{Λ}}_{1 l} (t),

where ${\hat{Λ}}_{1 k} (τ) = \int_{0}^{τ} Y_{k}^{- 1} (t) d N_{1 k} (t)$ , ${\hat{V}}_{j k l} (t) = [{\hat{D}}_{j k l} (t) - {\hat{E}}_{j l} (t) {\hat{c}}_{k l} (t)] {\hat{h}}_{l}^{- 1} (t)$ , ${\hat{D}}_{j k l} (t) = n^{- 1} I (j = 1) \tilde{W} (t) R_{k} (t) [I (k = l) - {\hat{h}}_{l} (t) / {\hat{h}}_{\cdot} (t)] / {\hat{G}}_{1 \cdot} (t -)$ , ${\hat{c}}_{k l} (t) = n^{- 1} \int_{0}^{t} {\hat{D}}_{1 k l} (u) {\hat{G}}_{1 \cdot} {(u -)}^{- 1} {\hat{h}}_{\cdot}^{- 1} (u) d N_{1 \cdot} (u)$ , ${\hat{E}}_{j k} (t) = I (j = 1) - {\hat{G}}_{1 k} (t -) / {\hat{S}}_{k} (t -)$ , and all other quantities are defined in (8). These results allow one to derive a chi-square test and a maximum test similar to those for the two-sample case.

2.3. Joint Two-Sample Tests for Other Quantities

Joint tests can also be derived for other related quantities. For group k, let λ_2k(t) and λ_·k(t) denote the other (Type 2) cause-specific hazard function and the all-cause hazard function, respectively.

2.3.1. Two-Sample Joint Tests for Cause-Specific Hazard and All-Cause Hazard

Consider the following null hypotheses

H_{0} : λ_{11} (t) = λ_{12} (t) and λ_{\cdot 1} (t) = λ_{\cdot 2} (t) for all 0 < t < τ .

(11)

Let

U_{\cdot k} = \int_{0}^{τ} W_{\cdot} (t) Y_{k} (t) \left\{\frac{d N_{\cdot k} (t)}{Y_{k} (t)} - \frac{d N_{\cdot \cdot} (t)}{Y_{\cdot} (t)}\right\},

(12)

be the weighted log-rank test statistic for H₀ : λ_·1(t) = λ_·2(t) for all t > 0, where $N_{\cdot k} (t) = \sum_{j = 1}^{2} N_{j k} (t)$ , $N_{\cdot \cdot} (t) = \sum_{k = 1}^{2} \sum_{j = 1}^{2} N_{j k} (t)$ , and W_·(t) is a predictable weight function that converges in probability to some deterministic function w_·(t) as n → ∞. Let U₁₁ and U_·1 be defined by (3) and (12). Then, under the null hypothesis (11), $n^{- 1 / 2} (U_{11}, U_{\cdot 1})$ has an asymptotic bivariate normal distribution with mean 0 and variance-covariance matrix $Σ^{(2)} = (σ_{i j}^{(2)})$ . Furthermore, Σ⁽²⁾ is consistently estimated by ${\hat{Σ}}^{(2)} = ({\hat{σ}}_{i j}^{(2)})$ , where ${\hat{σ}}_{11}^{(2)} = n^{- 1} \int_{0}^{τ} W_{1}^{2} (t) \frac{Y_{1} (t) Y_{2} (t)}{Y_{1} (t) + Y_{2} (t)} \frac{d N_{11} (t)}{Y_{1} (t)}$ , ${\hat{σ}}_{22}^{(2)} = n^{- 1} \int_{0}^{τ} W_{\cdot}^{2} (t) \frac{Y_{1} (t) Y_{2} (t)}{Y_{1} (t) + Y_{2} (t)} \frac{d N_{\cdot 1} (t)}{Y_{1} (t)}$ , and ${\hat{σ}}_{12}^{(2)} = n^{- 1} \int_{0}^{τ} W_{1} (t) W_{\cdot} (t) \frac{Y_{1} (t) Y_{2} (t)}{Y_{1} (t) + Y_{2} (t)} \frac{d N_{11} (t)}{Y_{1} (t)}$ . These results allow one to construct a chi-square joint test and a maximum joint test for (11) similar to those for (1) in the previous section.

2.3.2. Two-Sample Joint Tests for Both Cause-Specific Hazards

Consider

H_{0} : λ_{11} (t) = λ_{12} (t) and λ_{21} (t) = λ_{22} (t) for all 0 < t < τ .

(13)

Let

U_{2 k} = \int_{0}^{τ} W_{2} (t) Y_{k} (t) \left\{\frac{d N_{2 k} (t)}{Y_{k} (t)} - \frac{d N_{2 \cdot} (t)}{Y_{\cdot} (t)}\right\},

(14)

be the weighted log-rank test statistic for H₀ : λ₂₁(t) = λ₂₂(t) for all 0 < t < τ, where W₂(t) is a predictable weight function that converges in probability to some deterministic function w₂(t) as n → ∞. It is well known that U_1k and U_2k are asymptotically independent (Prentice et al. 1978). Hence, one can construct a chi-square joint test and a maximum joint test for (13) based on the joint distribution of the two test statistics. Joint tests for (13) were also studied previously by Lindkvist and Belyaev (1998) and Kulathinal and Gasbarra (2002), among others. In particular, the K-sample chi-square test of Kulathinal and Gasbarra (2002, p. 150) for the (λ_1k, λ_2k) pair with a special weight function $K_{k i j}^{n} (t) = I (i = j) W_{1} (t)$ reduces to that based on U_1k and U_2k. We also note that the ideas of Kulathinal and Gasbarra (2002) could be extended to derive a test for the (λ_1k, λ_·k) pair, although such a test was not explicitly developed in their article.
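Because the two standardized statistics are asymptotically independent, no covariance estimate or simulation is needed for (13). The sketch below, with a hypothetical function name and hypothetical inputs, computes the chi-square statistic as the sum of the squared Z statistics and obtains the critical value of the maximum test in closed form from P(max(|Z₁|, |Z₂|) ≤ c) = {2Φ(c) − 1}². The Bonferroni critical value is shown only for comparison.

```python
import math
from statistics import NormalDist


def independent_pair_tests(z1, z2, alpha=0.05):
    """Joint tests for (13) from two asymptotically independent Z statistics."""
    std_normal = NormalDist()
    x2 = z1 * z1 + z2 * z2
    chi_square_p = math.exp(-x2 / 2.0)  # chi-square survival function, 2 df
    # Solve {2 * Phi(c) - 1}^2 = 1 - alpha for c.
    max_crit = std_normal.inv_cdf((1.0 + math.sqrt(1.0 - alpha)) / 2.0)
    # Bonferroni: each two-sided test at level alpha / 2.
    bonferroni_crit = std_normal.inv_cdf(1.0 - alpha / 4.0)
    return {
        "chi_square": x2,
        "chi_square_p": chi_square_p,
        "max_stat": max(abs(z1), abs(z2)),
        "max_crit": max_crit,
        "bonferroni_crit": bonferroni_crit,
    }


# Hypothetical standardized statistics for the two cause-specific hazards.
result = independent_pair_tests(z1=1.5, z2=-2.0)
print(result)
```

Under independence the maximum test and the Bonferroni method have nearly identical critical values, so the maximum test offers little gain for this pair; the gain is larger for pairs of statistics that are strongly correlated.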

Remark 3.

It can be shown that for group k, the three pairs of functions (λ_1k(·), F_1k(·)), (λ_1k(·), λ_·k(·)), and (λ_1k(·), λ_2k(·)) uniquely determine each other and that each pair uniquely determines the joint distribution of (X_ik, δ_ik). This implies that the three null hypotheses (1), (11), and (13) are equivalent. On the other hand, their alternative hypotheses are different because the three pairs of functions characterize different features of competing risks data. Furthermore, a significant effect of a variable on one pair does not necessarily imply a significant effect on another pair, as illustrated later in Section 5.1. A practical question is which pair(s) should be used, especially when planning a study. The answer depends on the specific research questions of the study. The cause-specific hazard and cumulative incidence pair, (λ_1k(·), F_1k(·)), would be useful when studying the effects of a variable on a given type (Type 1) of failure, since the two functions directly characterize two distinct and easily interpretable features of Type 1 failure. The cause-specific hazard and all-cause hazard pair, (λ_1k(·), λ_·k(·)), would be useful when the all-cause hazard describes a meaningful clinical outcome such as “overall survival” (death due to any disease) in a randomized clinical trial of a new treatment versus a standard treatment for a specific disease in which the disease-specific survival and overall survival are co-primary endpoints. Note that the all-cause hazard may not always describe a meaningful clinical outcome, especially when the two types of failures are negatively correlated, as exemplified by the kidney transplantation program discussed at the beginning of Section 1. Finally, joint inference for both cause-specific hazards, (λ_1k(·), λ_2k(·)), would be useful when both types of failures are of interest to the study.

3. Joint Regression Analysis for Competing Risks Data

3.1. Joint Regression Analysis of Cause-Specific Hazard and Cumulative Incidence

We now consider joint inference for the cause-specific hazard and the cumulative incidence function in a regression setting. Assume that one observes n independent and identically distributed triples (X_i, δ_i, Z_i), where for subject i (i = 1, …, n), X_i = min{T_i, C_i}, δ_i = D_i I(T_i ≤ C_i), T_i is the failure time of interest, C_i is a right censoring time, D_i is a discrete random variable taking values in {1, 2} with D_i = j indicating that a type j failure is observed, and Z_i is a vector of fixed or time-varying covariates that are observed on [0, X_i]. Assume C_i is independent of T_i, D_i, and Z_i, and pr(C_i ≥ t) = G^c(t).

Let λ₁(t|z) and ${\tilde{λ}}_{1} (t | z)$ be the conditional cause-specific hazard function and the conditional subdistribution hazard function for Type 1 failure for an individual with covariate z. Assume the proportional cause-specific hazards model (Prentice et al. 1978)

λ_{1} (t | Z) = λ_{10} (t) \exp (β_{1}^{T} Z^{(1)} (t)),

(15)

and the proportional subdistribution hazards model (Fine and Gray 1999)

{\tilde{λ}}_{1} (t | Z) = {\tilde{λ}}_{10} (t) \exp (γ_{1}^{T} Z^{(2)} (t)),

(16)

where λ₁₀(t) and ${\tilde{λ}}_{10} (t)$ are unknown baseline cause-specific hazard and baseline subdistribution hazard for Type 1 failure, respectively, and Z⁽¹⁾(t) and Z⁽²⁾(t) are functions of the original covariates Z and t that allow time × covariates interactions. Prentice et al. (1978) showed that inference for β₁ under the proportional cause-specific hazards model (15) can be made using the standard Cox (1972, 1975) partial likelihood method by regarding other types of failure as independent censoring. The proportional subdistribution hazards model (16) was introduced by Fine and Gray (1999) who developed large sample inference for γ₁.

Below we develop joint inference for β₁ and γ₁. Specifically, we consider the following joint null hypothesis

H_{0} : A_{1} β_{1} = d_{1} and A_{2} γ_{1} = d_{2},

(17)

where A₁ and A₂ are constant matrices, and d₁ and d₂ are constant column vectors.

Following Prentice et al. (1978) and Fine and Gray (1999), let

U_{1} (β_{1}) = \sum_{i = 1}^{n} \int_{0}^{\infty} {Z_{i}^{(1)} (t) - {\bar{Z}}^{(1)} (β_{1}, t)} d N_{i 1} (t),

(18)

and

{\tilde{U}}_{1} (γ_{1}) = \sum_{i = 1}^{n} \int_{0}^{\infty} {Z_{i}^{(2)} (t) - {\bar{Z}}^{(2)} (γ_{1}, t)} ω_{i} (t) d {\tilde{N}}_{i 1} (t),

(19)

be the score functions for β₁ and γ₁ under models (15) and (16), respectively, where

{\bar{Z}}^{(1)} (β_{1}, t) = \frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(1)} (t) \exp (β_{1}^{T} Z_{l}^{(1)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp (β_{1}^{T} Z_{l}^{(1)} (t))}, Y_{i} (t) = I {X_{i} \geq t},

and

N_{i 1} (t) = I (X_{i} \leq t, D_{i} = 1), {\bar{Z}}^{(2)} (γ_{1}, t) = \frac{\sum_{l = 1}^{n} ω_{l} (t) {\tilde{Y}}_{l} (t) Z_{l}^{(2)} (t) \exp (γ_{1}^{T} Z_{l}^{(2)} (t))}{\sum_{l = 1}^{n} ω_{l} (t) {\tilde{Y}}_{l} (t) \exp (γ_{1}^{T} Z_{l}^{(2)} (t))},

${\tilde{N}}_{i 1} (t) = I (T_{i} \leq t, D_{i} = 1)$ , ${\tilde{Y}}_{i} (t) = 1 - {\tilde{N}}_{i 1} (t -)$ , $ω_{i} (t) = I (C_{i} \geq T_{i} \land t) {\hat{G}}^{c} (t) / {\hat{G}}^{c} (X_{i} \land t)$ , and ${\hat{G}}^{c}$ is the Kaplan and Meier (1958) estimate of the survival function G^c of the censoring variable C. Note that ${\tilde{N}}_{i 1} (t)$ is different from N_i1(t) and may not be observed if the subject is censored, but $ω_{i} (t) {\tilde{N}}_{i 1} (t)$ can always be computed. Let ${\hat{β}}_{1}$ and ${\hat{γ}}_{1}$ be the solutions of the score equations U₁(β₁) = 0 and ${\tilde{U}}_{1} (γ_{1}) = 0$ , respectively.
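The weight ω_i(t) can be computed from the observed data alone, which the following sketch illustrates. The function names and the toy data are hypothetical, and the sketch evaluates the Kaplan–Meier estimate of the censoring distribution as a right-continuous step function, which is a simplifying assumption about the handling of ties. The three cases in `ipcw_weight` follow from the definition: a subject still under observation at t has weight 1, a subject censored before t has weight 0, and a subject with an observed failure before t has weight Ĝ^c(t)/Ĝ^c(X_i).

```python
def censoring_km(times, causes):
    """Return a function t -> G_hat(t), the Kaplan-Meier estimate for censoring.

    times  : observed times X_i
    causes : observed failure type delta_i (0 = censored, 1 or 2 = failure type)
    """
    steps = []
    surv = 1.0
    for u in sorted({x for x, c in zip(times, causes) if c == 0}):
        at_risk = sum(1 for x in times if x >= u)
        censored = sum(1 for x, c in zip(times, causes) if x == u and c == 0)
        surv *= 1.0 - censored / at_risk
        steps.append((u, surv))

    def g_hat(t):
        value = 1.0
        for u, s in steps:
            if u > t:
                break
            value = s
        return value

    return g_hat


def ipcw_weight(i, t, times, causes, g_hat):
    """omega_i(t) = I(C_i >= min(T_i, t)) * G_hat(t) / G_hat(min(X_i, t))."""
    x, c = times[i], causes[i]
    if x >= t:
        return 1.0  # still under observation at t
    if c == 0:
        return 0.0  # censored before t, so the indicator is zero
    return g_hat(t) / g_hat(x)  # failure observed before t


# Hypothetical toy data with two censored subjects.
toy_times = [1, 2, 3, 4, 5]
toy_causes = [2, 0, 1, 0, 1]
g_hat = censoring_km(toy_times, toy_causes)
weights = [ipcw_weight(i, 4.5, toy_times, toy_causes, g_hat) for i in range(5)]
print(weights)
```

In the score function (19) these weights matter for subjects with an earlier Type 2 failure: such subjects stay in the subdistribution risk set, but with a weight that shrinks as the estimated probability of remaining uncensored decreases.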

Theorem 2.

Under similar regularity conditions to Andersen et al. (1982) and Fine and Gray (1999), we have

n^{1 / 2} \begin{pmatrix} {\hat{β}}_{1} - β_{1} \\ {\hat{γ}}_{1} - γ_{1} \end{pmatrix} \overset{d}{\to} N (0, Σ^{(1)}), as n \to \infty,

where Σ⁽¹⁾ is defined by (A.11) in Appendix A.1. Furthermore, Σ⁽¹⁾ can be consistently estimated by

{\hat{Σ}}^{(1)} = \begin{pmatrix} ({\hat{Ω}}_{(p p)}^{(1)})^{- 1} & ({\hat{Ω}}_{(p p)}^{(1)})^{- 1} {\hat{Ω}}_{(p q)}^{(1)} ({\hat{Ω}}_{(q q)}^{(1)})^{- 1} \\ ({\hat{Ω}}_{(q q)}^{(1)})^{- 1} {\hat{Ω}}_{(q p)}^{(1)} ({\hat{Ω}}_{(p p)}^{(1)})^{- 1} & ({\hat{Ω}}_{(q q)}^{(1)})^{- 1} {\hat{Ω}}_{(q q)}^{* (1)} ({\hat{Ω}}_{(q q)}^{(1)})^{- 1} \end{pmatrix},

(20)

where

{\hat{Ω}}_{(p p)}^{(1)} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{\infty} \left[\frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(1)} {(t)}^{\otimes 2} \exp ({\hat{β}}_{1}^{T} Z_{l}^{(1)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp ({\hat{β}}_{1}^{T} Z_{l}^{(1)} (t))} - {\bar{Z}}^{(1)} {({\hat{β}}_{1}, t)}^{\otimes 2}\right] d N_{i 1} (t),

{\hat{Ω}}_{(q q)}^{(1)} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{\infty} \left\{\frac{\sum_{l = 1}^{n} ω_{l} (t) {\tilde{Y}}_{l} (t) Z_{l}^{(2)} {(t)}^{\otimes 2} \exp ({\hat{γ}}_{1}^{T} Z_{l}^{(2)} (t))}{\sum_{l = 1}^{n} ω_{l} (t) {\tilde{Y}}_{l} (t) \exp ({\hat{γ}}_{1}^{T} Z_{l}^{(2)} (t))} - {\bar{Z}}^{(2)} {({\hat{γ}}_{1}, t)}^{\otimes 2}\right\} d {\tilde{N}}_{i 1} (t),

{\hat{Ω}}_{(p q)}^{(1)} = \frac{1}{n} \sum_{i = 1}^{n} \left\{\int_{0}^{\infty} (Z_{i}^{(1)} (t) - {\bar{Z}}^{(1)} ({\hat{β}}_{1}, t)) \times (d N_{i 1} (t) - Y_{i} (t) \exp ({\hat{β}}_{1}^{T} Z_{i}^{(1)} (t)) d {\hat{Λ}}_{10} (t)) * {\hat{η}}_{i}\right\} + \frac{1}{n} \sum_{i = 1}^{n} \left\{\int_{0}^{\infty} (Z_{i}^{(1)} (t) - {\bar{Z}}^{(1)} ({\hat{β}}_{1}, t)) \times (d N_{i 1} (t) - Y_{i} (t) \exp ({\hat{β}}_{1}^{T} Z_{i}^{(1)} (t)) d {\hat{Λ}}_{10} (t)) * {\hat{ϕ}}_{i}\right\},

{\hat{Ω}}_{(q q)}^{* (1)} = \frac{1}{n} \sum_{i = 1}^{n} {({\hat{η}}_{i} + {\hat{ϕ}}_{i})}^{\otimes 2},

(21)

with

{\hat{η}}_{i} = \int_{0}^{\infty} \{Z_{i}^{(2)} (t) - {\bar{Z}}^{(2)} ({\hat{γ}}_{1}, t)\} ω_{i} (t) d {\hat{\tilde{M}}}_{i 1} (t),

{\hat{\tilde{M}}}_{i 1} (t) = {\tilde{N}}_{i 1} (t) - \int_{0}^{t} {\tilde{Y}}_{i} (u) \exp ({\hat{γ}}_{1}^{T} Z_{i}^{(2)} (u)) d {\hat{\tilde{Λ}}}_{10} (u),

{\hat{\tilde{Λ}}}_{10} (t) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \left\{\sum_{l = 1}^{n} {\tilde{Y}}_{l} (u) \exp ({\hat{γ}}_{1}^{T} Z_{l}^{(2)} (u))\right\}^{- 1} \times ω_{i} (u) d {\tilde{N}}_{i 1} (u),

{\hat{ϕ}}_{i} = \int_{0}^{\infty} \frac{\hat{q} (t)}{\hat{π} (t)} d {\hat{M}}_{i}^{c} (t),

{\hat{M}}_{i}^{c} (t) = I (X_{i} \leq t, δ_{i} = 0) - \int_{0}^{t} I (X_{i} \geq u) d {\hat{Λ}}^{c} (u),

{\hat{Λ}}^{c} (t) = \int_{0}^{t} \frac{\sum_{i = 1}^{n} d \{I (X_{i} \leq u, δ_{i} = 0)\}}{\sum_{i = 1}^{n} I (X_{i} \geq u)},

\hat{q} (t) = - n^{- 1} \sum_{i = 1}^{n} \int_{0}^{\infty} \{Z_{i}^{(2)} (s) - {\bar{Z}}^{(2)} ({\hat{γ}}_{1}, s)\} \times I (s \geq t > X_{i}) ω_{i} (s) d {\hat{\tilde{M}}}_{i 1} (s),

\hat{π} (t) = n^{- 1} \sum_{i = 1}^{n} I (X_{i} \geq t) .

Corollary 1.

Let $ξ_{n} = n^{1 / 2} (A_{1} {\hat{β}}_{1} - d_{1})$ and $η_{n} = n^{1 / 2} (A_{2} {\hat{γ}}_{1} - d_{2})$ . Then, under the null hypothesis (17), we have

(\begin{matrix} ξ_{n} \\ η_{n} \end{matrix}) \overset{d}{\to} N (0, V), as n \to \infty,

where

V = (\begin{matrix} A_{1} & 0 \\ 0 & A_{2} \end{matrix}) Σ^{(1)} (\begin{matrix} A_{1}^{T} & 0 \\ 0 & A_{2}^{T} \end{matrix}) .

(22)

Define the following Wald-type test statistic

X_{W}^{2} = (ξ_{n}^{T}, η_{n}^{T}) {\hat{V}}^{- 1} (\begin{matrix} ξ_{n} \\ η_{n} \end{matrix}),

where $\hat{V}$ is a consistent estimate of V obtained by replacing Σ⁽¹⁾ with ${\hat{Σ}}^{(1)}$ in (22). It follows immediately from Corollary 1 that under (17), $X_{W}^{2}$ has an asymptotic chi-squared distribution with p_d₁ + p_d₂ degrees of freedom, where p_d₁ and p_d₂ are the dimensions of d₁ and d₂, respectively. This leads to the following chi-square joint test for (17):

Reject (17) at level α if X_{W}^{2} > χ_{p_{d_{1}} + p_{d_{2}}}^{2} (α),

where $χ_{p_{d_{1}} + p_{d_{2}}}^{2} (α)$ is the upper α percentile (that is, the 1 − α quantile) of the standard $χ_{p_{d_{1}} + p_{d_{2}}}^{2}$ distribution.
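As a rough illustration of the chi-square joint test, the following sketch evaluates $X_{W}^{2}$ in the simplest case where both contrasts are scalar (p_d₁ = p_d₂ = 1), so that $\hat{V}$ is a 2 × 2 matrix and the reference distribution is chi-squared with 2 degrees of freedom, whose upper tail probability is exp(−x/2). The function name `wald_joint_test_2df` and all numerical inputs are hypothetical and are not taken from the article.

```python
import math

def wald_joint_test_2df(xi, eta, v11, v12, v22, alpha=0.05):
    """Wald-type joint test for two scalar contrasts (illustrative sketch).

    xi, eta       : scaled contrasts n^{1/2}(A1*beta1_hat - d1) and
                    n^{1/2}(A2*gamma1_hat - d2)
    v11, v12, v22 : entries of the estimated 2x2 covariance matrix V_hat
    """
    det = v11 * v22 - v12 * v12
    if det <= 0:
        raise ValueError("V_hat must be positive definite")
    # Quadratic form (xi, eta) V_hat^{-1} (xi, eta)^T via the closed-form 2x2 inverse.
    stat = (v22 * xi * xi - 2.0 * v12 * xi * eta + v11 * eta * eta) / det
    # For 2 degrees of freedom, P(chi2_2 > x) = exp(-x / 2).
    p_value = math.exp(-stat / 2.0)
    # Upper-alpha critical value of chi2_2: solve exp(-c / 2) = alpha.
    critical = -2.0 * math.log(alpha)
    return stat, p_value, stat > critical

# Hypothetical inputs, chosen only to exercise the function.
stat, p_value, reject = wald_joint_test_2df(xi=2.0, eta=1.0, v11=1.0, v12=0.5, v22=1.0)
print(stat, p_value, reject)
```

For contrasts of higher dimension, the same quadratic form would be computed with a general matrix inverse and compared with the chi-squared distribution with p_d₁ + p_d₂ degrees of freedom.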

3.2. Joint Regression Analysis of Other Quantities

Besides analyzing λ₁(t|Z) and ${\tilde{λ}}_{1} (t | Z)$ jointly, it is sometimes also useful to consider other related quantities as discussed in Section 2.3 (Remark 3).

3.2.1. Joint Regression Analysis of Cause-Specific Hazard and All-Cause Hazard

Assume that the proportional cause-specific hazards model (15) holds. In addition, assume the proportional all-cause hazards model:

λ (t | Z) = λ_{0} (t) \exp (β_{\cdot}^{T} Z^{(3)} (t)),

(23)

where λ(t|Z) denotes the conditional all-cause hazard function given Z, λ₀(t) is an unknown baseline hazard, and Z⁽³⁾(t) are functions of the original covariates Z and t that allow time × covariate interactions. Below we derive joint inference for β₁ and β_⋅.

Let

U_{\cdot} (β_{\cdot}) = \sum_{i = 1}^{n} \int_{0}^{\infty} {Z_{i}^{(3)} (t) - {\bar{Z}}^{(3)} (β_{\cdot}, t)} d N_{i} (t),

(24)

be the score function for β_⋅ under model (23), where

{\bar{Z}}^{(3)} (β_{\cdot}, t) = \frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(3)} (t) \exp (β_{\cdot}^{T} Z_{l}^{(3)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp (β_{\cdot}^{T} Z_{l}^{(3)} (t))}

and N_i(t) = I(X_i ≤ t, δ_i = 1). Let ${\hat{β}}_{\cdot}$ be the solution of the score equation U_·(β_·) = 0.

Theorem 3.

Under some regularity conditions, as n → ∞,

n^{1 / 2} (\begin{matrix} {\hat{β}}_{1} - β_{1} \\ {\hat{β}}_{\cdot} - β_{\cdot} \end{matrix}) \overset{d}{\to} N (0, Σ^{(2)}),

where Σ⁽²⁾ is defined by (A.13) in Appendix A.1. Furthermore, Σ⁽²⁾ can be consistently estimated by

{\hat{Σ}}^{(2)} = (\begin{matrix} {\hat{Ω}}_{(p p)}^{(2) - 1} & {\hat{Ω}}_{(p p)}^{(2) - 1} {\hat{Ω}}_{(p q)}^{(2)} {\hat{Ω}}_{(q q)}^{(2) - 1} \\ {\hat{Ω}}_{(q q)}^{(2) - 1} {\hat{Ω}}_{(q p)}^{(2)} {\hat{Ω}}_{(p p)}^{(2) - 1} & {\hat{Ω}}_{(q q)}^{(2) - 1} \end{matrix}),

(25)

where

{\hat{Ω}}_{(p p)}^{(2)} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{\infty} [\frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(1)} {(t)}^{\otimes 2} \exp ({\hat{β}}_{1}^{T} Z_{l}^{(1)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp ({\hat{β}}_{1}^{T} Z_{l}^{(1)} (t))} - {\bar{Z}}^{(1)} {({\hat{β}}_{1}, t)}^{\otimes 2}] d N_{i 1} (t),

{\hat{Ω}}_{(p q)}^{(2)} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{\infty} (Z_{i}^{(3)} (t) - {\bar{Z}}^{(3)} ({\hat{β}}_{\cdot}, t)) \times (Z_{i}^{(1)} (t) - {\bar{Z}}^{(1)} ({\hat{β}}_{1}, t)) Y_{i} (t) \times \exp ({\hat{β}}_{1}^{T} Z_{i}^{(1)} (t)) d {\hat{Λ}}_{10} (t),

{\hat{Ω}}_{(q q)}^{(2)} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{\infty} [\frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(3)} {(t)}^{\otimes 2} \exp ({\hat{β}}_{\cdot}^{T} Z_{l}^{(3)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp ({\hat{β}}_{\cdot}^{T} Z_{l}^{(3)} (t))} - {\bar{Z}}^{(3)} {({\hat{β}}_{\cdot}, t)}^{\otimes 2}] d N_{i} (t),

where ${\hat{Λ}}_{10} (t) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} {\{\sum_{l = 1}^{n} Y_{l} (u) \exp ({\hat{β}}_{1}^{T} Z_{l}^{(1)} (u))\}}^{- 1} d N_{i 1} (u)$ is an estimator of the baseline cumulative cause-specific hazard for Type 1 failure.

Theorem 3 enables one to draw joint inference for β₁ and β_⋅ along the lines of the previous section.

3.2.2. Joint Regression Analysis of Both Cause-Specific Hazards

Assume the proportional cause-specific hazards model (15) for Type 1 failure. In addition, assume the following proportional cause-specific hazards model for Type 2 failure:

λ_{2} (t | Z) = λ_{20} (t) \exp (β_{2}^{T} Z^{(4)} (t)),

(26)

where λ₂₀(t) is an unknown baseline cause-specific hazard, and Z⁽⁴⁾(t) are functions of the original covariates Z and t that allow time × covariates interactions.

Let

U_{2} (β_{2}) = \sum_{i = 1}^{n} \int_{0}^{\infty} {Z_{i}^{(4)} (t) - {\bar{Z}}^{(4)} (β_{2}, t)} d N_{i 2} (t),

(27)

be the score function for β₂ under model (26), where

{\bar{Z}}^{(4)} (β_{2}, t) = \frac{\sum_{l = 1}^{n} Y_{l} (t) Z_{l}^{(4)} (t) \exp (β_{2}^{T} Z_{l}^{(4)} (t))}{\sum_{l = 1}^{n} Y_{l} (t) \exp (β_{2}^{T} Z_{l}^{(4)} (t))} .

Let ${\hat{β}}_{2}$ be the solution of the score equation U₂(β₂) = 0. It can be shown that U₁ and U₂ are asymptotically independent since N_i1(t) and N_i2(t) do not jump at the same time. Therefore, one can draw joint inference for β₁ and β₂ along the lines of the previous sections.
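To make the consequence of asymptotic independence concrete: the limiting covariance of the two estimators is block diagonal, so a Wald-type statistic for a joint linear hypothesis splits into the sum of the two individual Wald statistics. The display below is a sketch under the assumption that the hypothesis has the form A₁β₁ = d₁ and A₂β₂ = d₂, by analogy with Section 3.1; the symbols ζ_n, V̂₁, and V̂₂ are our notation, not the article's.

```latex
\xi_n = n^{1/2} (A_1 \hat{\beta}_1 - d_1), \qquad
\zeta_n = n^{1/2} (A_2 \hat{\beta}_2 - d_2),
\qquad
\hat{V} = \begin{pmatrix} \hat{V}_1 & 0 \\ 0 & \hat{V}_2 \end{pmatrix},
\\[4pt]
X_W^2 = (\xi_n^T, \zeta_n^T) \, \hat{V}^{-1} \begin{pmatrix} \xi_n \\ \zeta_n \end{pmatrix}
      = \xi_n^T \hat{V}_1^{-1} \xi_n + \zeta_n^T \hat{V}_2^{-1} \zeta_n .
```

Under the joint null hypothesis, the two summands are asymptotically independent chi-squared variables, so their sum is asymptotically chi-squared with p_d₁ + p_d₂ degrees of freedom.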

Remark 4.

In addition to being easy to interpret, the PH models for the cause-specific hazard and the all-cause hazard only require that the censoring time be conditionally independent of the survival time given the observed covariates, which is weaker than the censoring completely at random assumption needed by the proportional subdistribution hazards model.

Remark 5 (Model Checking).

Model diagnostic techniques for the standard Cox (1972) proportional hazards model can be readily applied to assess model assumptions of the individual models (15), (23), and (26) (Schoenfeld 1980, 1982; Lagakos 1981; Andersen 1982; Nagelkerke, Oosting, and Hart 1984; Moreau, O’Quigley, and Mesbah 1985; Arjas 1988; Beyersmann et al. 2007; Latouche et al. 2007; Grambauer, Schumacher, and Beyersmann 2010; Andersen et al. 2012; Haller, Schmidt, and Ulm 2012). Graphical methods for these models can also be adapted for the proportional subdistribution hazards model (16). Formal goodness-of-fit tests for (16) have been developed by Scheike and Zhang (2008). In addition to assessing goodness of fit of an individual model, it is also important to check whether two individual models hold simultaneously. For example, it is well recognized that the proportional hazards assumption for a time-independent covariate does not hold simultaneously for the cause-specific hazard and the subdistribution hazard, and thus it is important for models (15) and (16) to allow time × covariate interactions. To check whether (15) and (16) hold simultaneously, one needs to verify that for any z, $Λ_{2} (t | z) \equiv {\tilde{Λ}}_{1} (t | z) - Λ_{1} (t | z) + \log λ_{1} (t | z) - \log {\tilde{λ}}_{1} (t | z)$ is nondecreasing and satisfies Λ₂(0|z) = 0. In other words, the Λ₂(t|z) defined above must be a proper conditional cumulative cause-specific hazard function for Type 2 failure. We provide an example of the joint model of (15) and (16) in Section 4 (model (28)).
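The expression for Λ₂(t|z) follows from the standard definitions of the two hazards. A short derivation is sketched below, where S(t|z) is the overall survival function, F₁(t|z) is the cumulative incidence function of Type 1 failure, and f₁ = ∂F₁/∂t; these three symbols are our notation for this sketch.

```latex
\begin{aligned}
\lambda_1(t \mid z) &= \frac{f_1(t \mid z)}{S(t \mid z)}, &
S(t \mid z) &= \exp\{-\Lambda_1(t \mid z) - \Lambda_2(t \mid z)\}, \\
\tilde{\lambda}_1(t \mid z) &= \frac{f_1(t \mid z)}{1 - F_1(t \mid z)}, &
1 - F_1(t \mid z) &= \exp\{-\tilde{\Lambda}_1(t \mid z)\}, \\
\frac{\lambda_1(t \mid z)}{\tilde{\lambda}_1(t \mid z)}
  &= \frac{1 - F_1(t \mid z)}{S(t \mid z)}
   = \exp\{\Lambda_1(t \mid z) + \Lambda_2(t \mid z) - \tilde{\Lambda}_1(t \mid z)\}, \\
\Lambda_2(t \mid z)
  &= \tilde{\Lambda}_1(t \mid z) - \Lambda_1(t \mid z)
   + \log \lambda_1(t \mid z) - \log \tilde{\lambda}_1(t \mid z).
\end{aligned}
```

Taking logarithms in the third line and solving for Λ₂ gives the last line, so specifying λ₁ and the subdistribution hazard together determines Λ₂, which is why it has to be checked for monotonicity and for Λ₂(0|z) = 0.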

4. Simulations

We present some simulation results to illustrate the advantage of the proposed joint tests over the Bonferroni method. The weight function is set to be a constant 1 in all simulations.

The first simulation considers a two-group comparison of Type 1 failure with respect to both the cause-specific hazard (CSH) and the cumulative incidence function (CIF). We assign an equal number of patients to the two groups. Competing risks data are generated using the cause-specific hazard-driven method of Beyersmann et al. (2009), which requires only specification of the cause-specific hazard for each type of failure.

Figure 1 depicts simulated rejection power of the two-sided chi-square joint test, maximum joint test, and Bonferroni joint test for hypothesis (1) for various sample sizes per group under four scenarios. Figure 1(a) corresponds to a null case under H₀. Figure 1(b) corresponds to a scenario where there is a small group difference in CSH and a large group difference in CIF, whereas Figure 1(c) corresponds to an opposite situation. Figure 1(d) corresponds to a case where the group effects on CSH and CIF are similar. Specifically, in the first two scenarios, we assume constant cause-specific hazard for both causes, with λ₁₁ = λ₁₂ = 0.04, λ₂₁ = λ₂₂ = 0.01 for Figure 1(a) and λ₁₁ = λ₁₂ = 0.1, λ₂₁ = 0.04, λ₂₂ = 0.01 for Figure 1(b), where λ_jk denotes the cause-specific hazard for type j failure in group k. In the last two scenarios, we assume λ₁(t|Z) = λ₁₀(t) exp(γZ * I(t < 1) + βZ * I(t ≥ 1)) and ${\tilde{λ}}_{1} (t | Z) = {\tilde{λ}}_{10} (t) \exp (γ Z)$ , with β = 0.4, γ = 0.01 for Figure 1(c) and β = 0.5, γ = 0.5 for Figure 1(d), where λ₁₀(t) = 0.05 * I(0 ≤ t < 1) + 0.1 * I(t ≥ 1), ${\tilde{λ}}_{10} (t) = \frac{0.05 e^{- t}}{1 - 0.05 (1 - e^{- t})}$ , and Z is a binary group variable. The censoring rate is set to be 0.1 with an independent exponential censoring time in each scenario. The nominal significance level is 0.05. A graphical illustration of the CIF by groups under all four scenarios is presented in Appendix A.3 (Figure A.5).
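For constant cause-specific hazards, the cause-specific hazard-driven data generation can be sketched as follows: draw the failure time from the all-cause hazard λ₁ + λ₂, then assign the failure type to Type 1 with probability λ₁/(λ₁ + λ₂). The sketch uses the Figure 1(a) values λ₁ = 0.04 and λ₂ = 0.01 for one group. The function name `simulate_competing_risks` is hypothetical, and reading the stated censoring rate of 0.1 as the rate parameter of the exponential censoring time is an assumption on our part.

```python
import random

def simulate_competing_risks(n, lam1, lam2, cens_rate, seed=0):
    """Simulate (time, status) pairs under constant cause-specific hazards.

    status: 0 = censored, 1 = Type 1 failure, 2 = Type 2 failure.
    """
    rng = random.Random(seed)
    total = lam1 + lam2
    data = []
    for _ in range(n):
        # Step 1: failure time from the all-cause hazard lam1 + lam2.
        t = rng.expovariate(total)
        # Step 2: failure type is 1 with probability lam1 / (lam1 + lam2).
        cause = 1 if rng.random() < lam1 / total else 2
        # Step 3: independent exponential censoring time
        # (treating cens_rate as the exponential rate is an assumption).
        c = rng.expovariate(cens_rate)
        if c < t:
            data.append((c, 0))
        else:
            data.append((t, cause))
    return data

sample = simulate_competing_risks(n=20000, lam1=0.04, lam2=0.01, cens_rate=0.1)
events = [status for _, status in sample if status != 0]
share_type1 = sum(1 for status in events if status == 1) / len(events)
print(round(share_type1, 3))
```

Among the uncensored observations, the share of Type 1 failures should be close to 0.04/0.05 = 0.8. For time-varying hazards such as those in Figures 1(c) and 1(d), the first step would require inverting the all-cause cumulative hazard instead of a single exponential draw.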

It is seen from Figure 1(a) that the Type I error rates for all three tests are well controlled around the 0.05 nominal level. In all three alternative cases ((b)–(d)), either the chi-square joint test, or the maximum joint test, or both are more powerful than the Bonferroni method. In the cases where the group effects on CSH and CIF are quite different (Figure 1(b) and 1(c)), the chi-square joint test is observed to be most powerful with substantially improved power. When the effect sizes for CSH and CIF are similar (Figure 1(d)), the maximum joint test outperforms the others. The improved power of the proposed joint tests has important implications for the design of clinical trials in the presence of competing risks. For example, to achieve 80% power under the second scenario (Figure 1(b)), it would require n = 80 patients for the chi-square joint test, about 200 patients for the maximum joint test, and more than 200 patients for the Bonferroni joint test.

We also conducted power comparisons for one-sided joint tests under the same four scenarios as in Figure 1. The results are presented in Figure A.1 in Appendix A.2. The results are consistent with the two-sided case except that the maximum joint test has much more pronounced improvement over the chi-square joint test in the last scenario. We note that the chi-square joint test is constructed for a two-sided hypothesis, and thus can be underpowered when used as a one-sided test as shown in Figure A.1(d).

The second simulation study considers a joint regression model of CSH and CIF with respect to Type 1 failure. It is well known that the proportional hazards assumption for a time-independent covariate usually does not hold simultaneously for the CSH and the CIF hazard (or subdistribution hazard), so it is imperative to include time by covariate interactions in the joint model. As an illustration, we consider the following joint model:

λ_{1} (t | Z) = λ_{10} (t) \exp (γ^{T} Z * I (t < τ_{0}) + β^{T} Z * I (t \geq τ_{0})), {\tilde{λ}}_{1} (t | Z) = {\tilde{λ}}_{10} (t) \exp ({\tilde{γ}}^{T} Z * I (t < τ_{0}) + {\tilde{β}}^{T} Z * I (t \geq τ_{0})),

(28)

where λ₁₀(t) = aI(0 ≤ t < τ₀) + bI(t ≥ τ₀), ${\tilde{λ}}_{10} (t) = \frac{c e^{- t}}{1 - c (1 - e^{- t})}$ , Z = (Z₁, Z₂) with Z₁, Z₂ being binary variables, γ = (γ₁, γ₂), β = (β₁, β₂), and τ₀ is some prespecified constant. Note that under model (28), the conditional cumulative cause-specific hazard function for cause 2 given Z = z is $Λ_{2} (t | z) = {\tilde{Λ}}_{1} (t | z) - Λ_{1} (t | z) + \log λ_{1} (t | z) - \log {\tilde{λ}}_{1} (t | z)$ . For Λ₂(t|z) to be a proper conditional cumulative cause-specific hazard function, it must satisfy

Λ_{2} (0 | z) = 0 and λ_{2} (t | z) = \frac{\partial Λ_{2} (t | z)}{\partial t} \geq 0 for all t \geq 0,

which imply some constraints on the parameters in model (28). For simplicity, we further assume $\tilde{γ} = \tilde{β}$ for our simulation. In this case, it can be shown that Λ₂(t|z) is a proper cumulative cause-specific hazard function if the following constraints hold: (i) a = c ≤ b, (ii) $e^{γ^{T} z} < \frac{1 - a}{c}$ , (iii) $e^{β^{T} z} < 1 / a (1 - e^{- τ_{0}})$ , and (iv) $\tilde{γ} = γ$ . We then generated competing risks data from λ₁(t|z) and λ₂(t|z) using the method of Beyersmann et al. (2009).
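The properness conditions can also be checked numerically. The sketch below evaluates Λ₂(t|z) on a grid under model (28) with $\tilde{γ} = \tilde{β} = γ$, using the fact that the baseline subdistribution hazard integrates in closed form, ${\tilde{Λ}}_{10} (t) = - \log \{1 - c (1 - e^{- t})\}$. The function names, the grid, and the tolerance are our own choices; the parameter values are those of scenario (b) of the second simulation study, with c = a.

```python
import math

def cumulative_hazard_type2(t, z, gamma, beta, a, b, c, tau0):
    """Lambda_2(t | z) implied by model (28) when gamma_tilde = beta_tilde = gamma."""
    g = math.exp(sum(gk * zk for gk, zk in zip(gamma, z)))  # exp(gamma^T z)
    h = math.exp(sum(bk * zk for bk, zk in zip(beta, z)))   # exp(beta^T z)
    # Cause-specific hazard and its integral (piecewise-constant baseline).
    if t < tau0:
        lam1, cum1 = a * g, a * g * t
    else:
        lam1, cum1 = b * h, a * g * tau0 + b * h * (t - tau0)
    # Subdistribution hazard and its integral (closed form for this baseline).
    denom = 1.0 - c * (1.0 - math.exp(-t))
    lam1_sub = g * c * math.exp(-t) / denom
    cum1_sub = -g * math.log(denom)
    return cum1_sub - cum1 + math.log(lam1) - math.log(lam1_sub)

def is_proper(z, gamma, beta, a, b, c, tau0, t_max=10.0, steps=2000, tol=1e-10):
    """Grid check that Lambda_2(0 | z) = 0 and Lambda_2(. | z) is nondecreasing."""
    grid = [t_max * k / steps for k in range(steps + 1)]
    values = [cumulative_hazard_type2(t, z, gamma, beta, a, b, c, tau0) for t in grid]
    starts_at_zero = abs(values[0]) < tol
    nondecreasing = all(v1 - v0 >= -tol for v0, v1 in zip(values, values[1:]))
    return starts_at_zero and nondecreasing

# Scenario (b): beta = (-0.1, -0.2), gamma = (-0.4, -0.1), a = c = 0.05, b = 0.1, tau0 = 1.
gamma, beta = (-0.4, -0.1), (-0.1, -0.2)
for z in [(0, 0), (1, 0), (0, 1), (1, 1)]:
    print(z, is_proper(z, gamma, beta, a=0.05, b=0.1, c=0.05, tau0=1.0))
```

A grid check of this kind can only flag violations at the grid points; it complements, rather than replaces, the analytic constraints (i)–(iv).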

Figure 2 displays the simulated power curves of the three two-sided joint tests described in Section 3.1 for the following local hypothesis regarding the effects of Z₁ on the CSH and the CIF hazard after time τ₀:

H_{0} : β_{1} = 0 and γ_{1} = 0.

(29)

We consider four scenarios: (a) the null case (β₁ = 0, γ₁ = 0); (b) smaller Z₁ effect on CSH and larger Z₁ effect on CIF (β₁ = −0.1,γ₁ = −0.4); (c) larger Z₁ effect on CSH and smaller Z₁ effect on CIF (β₁ = −0.6,γ₁ = −0.2); and (d) similar Z₁ effects on CSH and CIF (β₁ = −0.5,γ₁ = −0.5). In all four scenarios, we set a = 0.05, b = 0.1, β₂ = −0.2, γ₂ = −0.1, $γ = \tilde{γ}$ , and τ₀ = 1.

Figure 2 leads to conclusions similar to those observed for the two-group case in the first simulation study. In the supplementary material, we also present some simulations for the CSH and all-cause hazard (ACH) pair, which lead to similar conclusions.

Finally, we conducted a small-scale simulation to compare the power of the three joint tests for (1), (11), and (13). When there is little group difference in a particular quantity, a test for a pair involving that quantity was observed to have lower power than those for other pairs. This is not surprising because a joint test for a specific pair is constructed to detect a group difference in the direction of that pair. The details are omitted.

5. Real Data Example

We illustrate our methods on two real datasets. In the first example, we consider joint inference for time to second malignancy in Hodgkin disease patients. In the second example, we perform joint analysis of the cause-specific hazard (CSH) for time to progression (TTP) and the all-cause hazard (ACH) for time to progression or death (progression-free survival or PFS) for follicular-type lymphoma patients.

5.1. Hodgkin Disease

The Hodgkin disease data were described in Pintilie (2006). They consist of 865 patients who were diagnosed with Hodgkin disease and received radiotherapy at Princess Margaret Hospital between 1968 and 1986. Here we are interested in studying time to second malignancy after receiving radiotherapy, which is an important variable for evaluating the side effects of radiotherapy. Death without second malignancy is a competing risk. Among the 865 patients, 93 developed a second malignancy, 386 died without a second malignancy, and 386 were right censored, having experienced neither event by the end of the study. For illustration purposes, we investigate whether the risks of developing a second malignancy were the same among older (≥30) and younger (<30) patients.

Figures 3(a) and 3(b) depict the cumulative cause-specific hazard functions and the cumulative incidence functions, respectively, for time to second malignancy for the older (≥30) and younger (<30) groups. There appears to be a higher cause-specific hazard for the older patients since the slope of their cumulative cause-specific hazard is noticeably bigger (Figure 3(a)). However, the cumulative incidence functions for the two age groups are barely distinguishable (Figure 3(b)). The two-sample log-rank test for the cause-specific hazard for time to second malignancy yields a p-value = 0.037. The Gray (1988) two-sample test for the cumulative incidence for time to second malignancy gives a p-value = 0.770. At the 5% overall significance level, neither of the individual tests is statistically significant at the Bonferroni-adjusted level 0.05/2 = 0.025.
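The Bonferroni comparison can be restated as an adjusted p-value: comparing each individual p-value with 0.05/2 is equivalent to comparing twice the smaller p-value with 0.05. A minimal sketch of this arithmetic, using the two individual p-values quoted above (the function name `bonferroni_joint_p` is ours):

```python
def bonferroni_joint_p(p_values):
    """Bonferroni-adjusted p-value for a joint null: min(1, m * smallest p)."""
    m = len(p_values)
    return min(1.0, m * min(p_values))

p_csh, p_cif = 0.037, 0.770  # individual p-values reported in the text
p_joint = bonferroni_joint_p([p_csh, p_cif])
print(round(p_joint, 3))  # 0.074
print(p_joint < 0.05)     # False
```

The adjusted value 0.074 exceeds 0.05, so the Bonferroni method does not reject the joint null hypothesis for these data.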

We performed the chi-square joint test and the maximum joint test for the null hypothesis that there is no difference in the cause-specific hazard (CSH) and the cumulative incidence (CIF) for time to second malignancy between older and younger patients. The p-values are presented in the first part of Table 1, along with the results of the individual tests and the Bonferroni method. In contrast to the Bonferroni method, the two-sample chi-square joint test for the cause-specific hazard and the cumulative incidence yields a p-value of 0.020, which is significant at the 5% significance level. The maximum joint test is also significant at level 0.05 (p-value = 0.05). As illustrations, we also performed joint tests for (CSH, ACH) and for CSH with the other cause-specific hazard (OCH) (parts 2 and 3 of Table 1), which show that in addition to an elevated cause-specific hazard for time to second malignancy, the older patients also had a higher risk of dying from other life-threatening diseases without developing second malignancy. This explains why their observed cumulative incidence for time to second malignancy was not significantly different from that of the younger patients.

Table 1.

Separate and joint test results for Hodgkin disease example for three pairs of quantities.

	Separate test		Joint test
Test	CSH	CIF	Bonferroni	χ²	Max
p-value	0.037	0.770	0.074	0.020	0.050
Test	CSH	ACH	Bonferroni	χ²	Max
p-value	0.037	5.2E–8	1.0E–7	3.4E–7	3.0E–8
Test	CSH	OCH	Bonferroni	χ²	Max
p-value	0.037	4.7E–7	9.4E–7	3.5E–7	8.0E–7


NOTE: χ² and Max are abbreviations for the chi-square joint test and the maximum joint test described in Section 2.2.

5.2. Follicular Cell Lymphoma Study

The follicular cell lymphoma study (Pintilie 2006; Scheike and Zhang 2011) consists of 541 early stage (I or II) follicular type lymphoma patients who were enrolled between 1967 and 1996 and treated with either radiation alone (RT) or with radiation and chemotherapy (CMT). There were 272 events due to disease (relapse or no treatment response), 76 competing risk events (death without relapse), and 193 censored individuals who did not experience either of the two events by the end of the follow-up. As in Scheike and Zhang (2011), we test if the CMT group has a longer time to relapse or no treatment response than the RT group. Although one could study different pairs of quantities, we consider joint inference of the cause-specific hazard and the all-cause hazard based on models (15) and (23) because they correspond to two commonly used clinical endpoints, namely, time to progression (TTP) and progression-free survival (PFS), in oncology trials. Here TTP, defined as time to relapse or no treatment response, is an endpoint for the antitumor activity of a treatment, and PFS, defined as time to progression or death before progression, is an endpoint for the overall effects on a patient. In addition to a binary treatment variable (1 for RT and 0 for CMT), we adjust for patient’s baseline age, stage, and hemoglobin level (hgb) by including them as covariates in our models. The Cox–Snell residual plots for the proportional all-cause hazards model (Figure A.6(a)) and the proportional cause-specific hazards model (Figure A.6(b)), which are presented in Appendix A.3, indicate reasonable overall fit of both models. We conducted the chi-square joint test and the maximum joint test for the treatment variable and summarized the results along with the Bonferroni adjustment method and the individual tests in Table 2. The maximum joint test (p-value = 0.047) is significant, whereas the chi-square joint test (p-value = 0.182) and the Bonferroni method (p-value = 0.07) are not significant at the 5% significance level. The one-sided individual test statistics for CSH and ACH are 1.81 and 1.78, respectively, both exceeding 1.77, the cutoff value of the maximum test. Therefore, we conclude that at the 5% overall significance level, the CMT group has a lower risk of TTP (cause-specific hazard) and a lower risk of PFS (ACH) as compared to the RT group, adjusting for patient’s baseline age, stage, and hemoglobin level (hgb). Finally, the chi-square joint test has a relatively large p-value because it is actually a two-sided test that is not powered for a one-sided hypothesis, especially when the effect sizes for CSH and ACH are similar, which is consistent with our simulation results (Figure A.3(d)).

Table 2.

Separate and joint test results for follicular cell lymphoma study.

	Separate test		Joint test
Test	CSH	ACH	Bonferroni	χ²	Max
p-value	0.035	0.037	0.070	0.182	0.047


NOTE: χ² and Max are abbreviations for the chi-square joint test and the maximum joint test.

6. Discussion

We emphasize the importance of joint inference for the cause-specific hazard and the cumulative incidence because one quantity alone does not fully characterize the time to a particular type of failure in the presence of competing risks. As illustrated in our simulations and real data examples, the proposed chi-square joint test and maximum joint test can be much more powerful than the Bonferroni method. The increased power implies substantial saving in the number of patients required in a clinical trial. In a sequel, we will develop power analysis methods to determine the required sample size to test a group difference based on the developed joint tests. We also note that the chi-square joint test tends to be more powerful than the maximum joint test when the effects on the two quantities are very different and that the maximum joint test dominates the chi-square joint test when the effects on the two quantities are similar. In practice, we recommend that both joint tests be performed together with the separate tests for the individual quantities as illustrated in our real data examples. The joint regression methods in Section 3 can be extended beyond Cox’s model. For example, accelerated failure time models can be used to model the cause-specific hazard. Scheike and Zhang (2008) considered other regression models for the subdistribution hazard. Joint inference procedures for these models can be developed similarly. Finally, joint modeling of the cause-specific hazard and the cumulative incidence is nontrivial since the proportional cause-specific hazards model and the proportional subdistribution hazards model are unlikely to hold simultaneously, especially for a time-independent covariate. However, this issue can be resolved by including time by covariate interactions in the regression models. In particular, we presented a joint model with piecewise proportional cause-specific hazards and piecewise proportional subdistribution hazards and discussed how to check if the two models hold simultaneously in Section 4.

Supplementary Material

NIHMS986715-supplement-1.pdf (611.9 KB, pdf)

Acknowledgments

The authors thank the co-editor, the associate editor, and the two anonymous referees for their valuable comments that helped improve this article significantly.

Funding

Gang Li’s work was partially supported by NIH grant 5P30CA-16042 and NIH grant 8UL1TR000124.

Footnotes

Supplementary Materials

Appendix: Proofs for the theorems and additional simulation results.

Supplementary materials for this article are available online. Please go to www.tandfonline.com/r/JASA.

References

Aly E, Kochar S, and McKeague I (1994), “Some Tests for Comparing Cumulative Incidence Functions and Cause-Specific Hazard Rates,” Journal of the American Statistical Association, 89, 994–999. [1289]
Andersen P, Borgan Ø, Gill R, and Keiding N (1982), “Linear Nonparametric Tests for Comparison of Counting Processes, With Applications to Censored Survival Data, Correspondent Paper,” International Statistical Review/Revue Internationale de Statistique, 50, 219–244. [1290,1293]
Andersen P, Geskus R, de Witte T, and Putter H (2012), “Competing Risks in Epidemiology: Possibilities and Pitfalls,” International Journal of Epidemiology, 41, 861–870. [1295]
Andersen PK (1982), “Testing Goodness of Fit of Cox’s Regression and Life Model,” Biometrics, 38, 67–77. [1295]
Arjas E (1988), “A Graphical Method for Assessing Goodness of Fit in Cox’s Proportional Hazards Model,” Journal of the American Statistical Association, 83, 204–212. [1295]
Bajorunaite R, and Klein J (2007), “Two-Sample Tests of the Equality of Two Cumulative Incidence Functions,” Computational Statistics & Data Analysis, 51, 4269–4281. [1289]
Beyersmann J, Dettenkofer M, Bertz H, and Schumacher M (2007), “A Competing Risks Analysis of Bloodstream Infection After Stem-Cell Transplantation Using Subdistribution Hazards and Cause-Specific Hazards,” Statistics in Medicine, 26, 5360–5369. [1289,1295]
Beyersmann J, Latouche A, Buchholz A, and Schumacher M (2009), “Simulating Competing Risks Data in Survival Analysis,” Statistics in Medicine, 28, 956–971. [1295,1297]
Breslow N (1970), “A Generalized Kruskal-Wallis Test for Comparing k Samples Subject to Unequal Patterns of Censorship,” Biometrika, 57, 579–594. [1291]
Cox D (1972), “Regression Models and Life-Tables,” Journal of the Royal Statistical Society, Series B, 34, 187–220. [1289,1293,1295]
Cox D (1975), “Partial Likelihood,” Biometrika, 62, 269–276. [1289,1293]
Cox D, and Oakes D (1984), Analysis of Survival Data (Vol. 21), Boca Raton, FL: Chapman & Hall/CRC. [1289]
Fine J (1999), “Analysing Competing Risks Data With Transformation Models,” Journal of the Royal Statistical Society, Series B, 61, 817–830. [1289]
Fine J (2001), “Regression Modeling of Competing Crude Failure Probabilities,” Biostatistics, 2, 85–97. [1289]
Fine JP, and Gray RJ (1999), “A Proportional Hazards Model for the Subdistribution of a Competing Risk,” Journal of the American Statistical Association, 94, 496–509. [1289,1290,1293]
Gehan EA (1965), “A Generalized Wilcoxon Test for Comparing Arbitrarily Singly-Censored Samples,” Biometrika, 52, 203–223. [1291]
Gerds T, Scheike T, and Andersen P (2012), “Absolute Risk Regression for Competing Risks: Interpretation, Link Functions, and Prediction,” Statistics in Medicine, 31, 3921–3930. [1289]
Gichangi A, and Vach W (2005), “The Analysis of Competing Risks Data: A Guided Tour,” Statistics in Medicine. [1289]
Grambauer N, Schumacher M, and Beyersmann J (2010), “Proportional Subdistribution Hazards Modeling Offers a Summary Analysis, Even if Misspecified,” Statistics in Medicine, 29, 875–884. [1295]
Gray R (1988), “A Class of K-Sample Tests for Comparing the Cumulative Incidence of a Competing Risk,” The Annals of Statistics, 16, 1141–1154. [1289,1290,1291,1292,1298]
Haller B, Schmidt G, and Ulm K (2012), “Applying Competing Risks Regression Models: An Overview,” Lifetime Data Analysis, 19, 1–26. [1289,1295]
Holt J (1978), “Competing Risk Analyses With Special Reference to Matched Pair Experiments,” Biometrika, 65, 159–165. [1289]
Kalbfleisch J, and Prentice RL (1980), The Statistical Analysis of Failure Time Data (Vol. 360), New York: Wiley. [1291]
Kaplan E, and Meier P (1958), “Nonparametric Estimation From Incomplete Observations,” Journal of the American Statistical Association, 53, 457–481. [1291,1293]
Klein J (2006), “Modelling Competing Risks in Cancer Studies,” Statistics in Medicine, 25, 1015–1034. [1289]
Klein J, and Andersen P (2005), “Regression Modeling of Competing Risks Data Based on Pseudovalues of the Cumulative Incidence Function,” Biometrics, 61, 223–229. [1289]
Klein JP, and Moeschberger ML (2003), Survival Analysis: Techniques for Censored and Truncated Data, New York: Springer Science & Business Media. [1291]
Kulathinal S, and Gasbarra D (2002), “Testing Equality of Cause-Specific Hazard Rates Corresponding to m Competing Risks Among k Groups,” Lifetime Data Analysis, 8, 147–161. [1289,1292]
Lagakos S (1978), “A Covariate Model for Partially Censored Data Subject to Competing Causes of Failure,” Applied Statistics, 27, 235–241. [1289]
Lagakos S (1981), “The Graphical Evaluation of Explanatory Variables in Proportional Hazard Regression Models,” Biometrika, 68, 93–98. [1295]
Lam K (1998), “A Class of Tests for the Equality of k Cause-Specific Hazard Rates in a Competing Risks Model,” Biometrika, 85, 179–188. [1289]
Larson M (1984), “Covariate Analysis of Competing-Risks Data With Log-Linear Models,” Biometrics, 40, 459–469. [1289]
Latouche A, Boisson V, Chevret S, and Porcher R (2007), “Misspecified Regression Model for the Subdistribution Hazard of a Competing Risk,” Statistics in Medicine, 26, 965–974. [1289,1295]
Lindkvist H, and Belyaev Y (1998), “A Class of Non-Parametric Tests in the Competing Risks Model for Comparing Two Samples,” Scandinavian Journal of Statistics, 25, 143–150. [1289,1290,1292]
Lunn M, and McNeil D (1995), “Applying Cox Regression to Competing Risks,” Biometrics, 51, 524–532. [1289]
Luo X, and Turnbull B (1999), “Comparing Two Treatments With Multiple Competing Risks Endpoints,” Statistica Sinica, 9, 985–998. [1289]
Moreau T, O’Quigley J, and Mesbah M (1985), “A Global Goodness-of-Fit Statistic for the Proportional Hazards Model,” Applied Statistics, 34, 212–218. [1295]
Nagelkerke N, Oosting J, and Hart A (1984), “A Simple Test for Goodness of Fit of Cox’s Proportional Hazards Model,” Biometrics, 40, 483–486. [1295]
Pepe M, and Mori M (1993), “Kaplan–Meier, Marginal or Conditional Probability Curves in Summarizing Competing Risks Failure Time Data?” Statistics in Medicine, 12, 737–751. [1289]
Peto R, and Peto J (1972), “Asymptotically Efficient Rank Invariant Test Procedures,” Journal of the Royal Statistical Society, Series A, 135, 185–207. [1290,1291]
Pintilie M (2006), Competing Risks: A Practical Perspective, New York: Wiley. [1289,1298,1299]
Prentice R, Kalbfleisch J, Peterson A Jr, Flournoy N, Farewell V, and Breslow N (1978), “The Analysis of Failure Times in the Presence of Competing Risks,” Biometrics, 34, 541–554. [1289,1290,1292,1293]
Putter H, Fiocco M, and Geskus R (2007), “Tutorial in Biostatistics: Competing Risks and Multi-State Models,” Statistics in Medicine, 26, 2389–2430. [1289]
Sancho A, Ávila A, Gavela E, Beltrán S, Fernández-Nájera J, Molina P, Crespo J, and Pallardó L (2007), “Effect of Overweight on Kidney Transplantation Outcome,” in Transplantation Proceedings (Vol. 39), Orlando, FL: Grune & Stratton, pp. 2202–2204. [1289]
Scheike T, and Zhang M (2008), “Flexible Competing Risks Regression Modeling and Goodness-of-Fit,” Lifetime Data Analysis, 14, 464–483. [1295,1299]
Scheike T, and Zhang M (2011), “Analyzing Competing Risk Data Using the R timereg Package,” Journal of Statistical Software, 38. [1299]
Schoenfeld D (1980), “Chi-Squared Goodness-of-Fit Tests for the Proportional Hazards Regression Model,” Biometrika, 67, 145–153. [1295]
Schoenfeld D (1982), “Partial Residuals for the Proportional Hazards Regression Model,” Biometrika, 69, 239–241. [1295]
Sun Y, and Tiwari R (1995), “Comparing Cause-Specific Hazard Rates of a Competing Risks Model With Censored Data,” Lecture Notes-Monograph Series, 27, 255–270. [1289]
Tiwari R, Kulasekera K, and Park C (2006), “Nonparametric Tests for Cause Specific Hazard Rates With Censored Data for Competing Risks Among Several Groups,” Journal of Statistical Planning and Inference, 136, 1718–1745. [1289]
Tsiatis A (1975), “A Nonidentifiability Aspect of the Problem of Competing Risks,” Proceedings of the National Academy of Sciences, 72, 20–22. [1290]

Associated Data

Supplementary Materials

NIHMS986715-supplement-1.pdf (611.9 KB, PDF)
