Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2017 May 5;19(1):14–26. doi: 10.1093/biostatistics/kxx016

Concordance measure and discriminatory accuracy in transformation cure models

Yilong Zhang 1, Yongzhao Shao 2,
PMCID: PMC6075574  PMID: 28481968

SUMMARY

Many populations of early-stage cancer patients have non-negligible latent cure fractions that can be modeled using transformation cure models. However, there is a lack of statistical metrics to evaluate prognostic utility of biomarkers in this context due to the challenges associated with unknown cure status and heavy censorship. In this article, we develop general concordance measures as evaluation metrics for the discriminatory accuracy of transformation cure models including the so-called promotion time cure models and mixture cure models. We introduce explicit formulas for the consistent estimates of the concordance measures, and show that their asymptotically normal distributions do not depend on the unknown censoring distribution. The estimates work for both parametric and semiparametric transformation models as well as transformation cure models. Numerical feasibility of the estimates and their robustness to the censoring distributions are illustrated via simulation studies and demonstrated using a melanoma data set.

Keywords: c-index, Concordance probability, Cure fraction, Predictive accuracy, Prognostics for censored survival, Mixture cure model

1. Introduction

Recent advancements in cancer screening have led to the identification of a large number of early-stage cancers including many curable ones. For example, a high percentage of resected early-stage melanoma patients are cured by surgery although no available method can reliably distinguish them from the uncured patients. Currently, the only way to accurately identify the uncured melanoma patients after surgery is to await cancer recurrence, by then, the best chance to treat cancer may have been lost. Thus, effective adjuvant therapies are needed to prevent or delay cancer progression and recurrence of uncured patients. Accordingly, there is an urgent and unmet need to develop statistical metrics to evaluate the efficacy of candidate adjuvant therapies (e.g. immunotherapies) and prognostic utility of biomarkers for survival of uncured patients without knowing who are surely cured.

The transformation mixture cure models jointly model the latent cure status using logistic regression and model the survival of uncured patients using transformation models (Lu and Ying, 2004). Important special cases include the proportional hazard mixture cure models (Farewell, 1982; Kuk and Chen, 1992; Sy and Taylor, 2000) and the proportional odds (PO) mixture cure models. Another type of transformation cure models that can account for the cure fraction are often referred as promotion time cure models (Zeng and others, 2006), originally proposed by Yakovlev and Tsodikov (1996) and Chen and others (1999) based on some biological mechanistic considerations.

These two-type of transformation cure models are better suited for prognostic analysis of many early-stage cancers with cure fractions than the conventional survival models [e.g. Cox proportional hazards model (PHM)] that ignore the cured subjects (Othus and others, 2012; Asano and others, 2014). In particular, these models can potentially allow the possibility of making unbiased assessments of prognostic utilities of biomarkers (e.g. primary tumor thickness and ulceration status of melanoma) for predicting survival of uncured patients without knowing exactly who are cured. Due to the challenges associated with unknown cure status, there has been a lack of statistical metrics for evaluating the prognostic accuracy of transformation cure models (Asano and others, 2014). In particular, measuring the prognostic utility of biomarkers for the survival of the uncured subjects has been an important open problem, even for parametric cure models (Farewell, 1982).

Concordance probability has been used for measuring prognostic accuracy in survival setting. However, the conventional concordance probability estimator as proposed in Harrell and others (1982, 1984) and Gönen and Heller (2005) are not directly applicable for measuring prognostic accuracy in the context of unknown cure status of patients. Specifically, let Inline graphic, Inline graphic denote the observed time, censoring indicator, and risk score. Herrell’s c-index can be written as

ijI(T~i<T~j,Ri>Rj,δi=1)ijI(T~i<T~j,δi=1). (1.1)

As pointed out by Pencina and D’Agostino (2004) and Uno and others (2011), Harrell’s c-index depends on censoring distribution and is not consistent to the concordance probability Inline graphic where Inline graphic and Inline graphic are two randomly selected pair of bivariate observations of risk score and failure time. Under the Cox proportional hazard (PH) models, Gönen and Heller (2005) propose to estimate the concordance probability Inline graphic by

2n(n1)ijI(Rj>Ri)1+exp(RjRi), (1.2)

where the risk scores Inline graphic is the linear combination of covariates for the Inline graphicth subject in Cox PHM. Gonen and Heller’s estimator has several desirable properties compared with Harrell’s c-index: (i) Gonen and Heller’s estimator does not depend on the censoring distribution. (ii) Gonen and Heller’s estimator is consistent and asymptotic normal. However, Gonen and Heller’s estimator cannot be applied to proportional odds model (POM), Accelerated Failure Time (AFT) model and other non-Cox PH transformation models. Furthermore, when the population is a mixture of cured and uncured patients, both Harrell’s c-index and Gonen and Heller’s estimator cannot be directly used. To fill in these knowledge gaps, we extend the concordance probability measures of Gönen and Heller (2005) for Cox PHM to general transformation models. Moreover, the general concordance index is also applicable for measuring accuracy in predicting the survival of the uncured subjects in transformation cure models that include both the promotion time cure models (Zeng and others, 2006) and transformation mixture cure models (Lu and Ying, 2004).

The rest of the article is organized as follows. In section 2, We propose the general concordance-based prognostic measures, the k-index, for standard transformation models as well as for the two types of transformation cure models. We also provide explicit formulas for the estimators of the newly defined measures and establish their consistency and asymptotic normality. Section 3 reports simulation studies that illustrate the numerical feasibility of the estimates and their robustness to the unknown censoring distributions, and also provides real data illustrations of the proposed method. Section 4 contains discussion and final remarks. Technical proofs are contained in the Appendices. In the supplementary material available at Biostatistics online, the proofs of the proposed theorems are provided.

2. Methods

2.1. The transformation models and cure fraction

Let Inline graphic denote the response with Inline graphic being the failure time and Inline graphic the censoring time. The event indicator is Inline graphic. Let Inline graphic and Inline graphic denote two monotone increasing functions. The generalized transformation model can be written as

H(T)=φ(Xβ)+ϵ, (2.1)

where Inline graphic is a regression parameter vector associated with covariate vector Inline graphic, and Inline graphic is random error such that for some function Inline graphic and some survival function Inline graphic we have

pr(ϵ>ψ(X,t)|X)=Sγ(ψ(X,t)). (2.2)

A special case of the generalized transformation models defined via (2.1) and (2.4) assumes that Inline graphic is independent of Inline graphic with Inline graphic in (2.4), and Inline graphic in (2.3). That is

H(T)=Xβ+ϵ, (2.3)

where Inline graphic Indeed, model (2.3) is the conventional transformation model that contains many widely used survival models (Kalbfleisch and Prentice, 2011). For example, if Inline graphic has the extreme value distribution with Inline graphic, then the model (2.3) is the Cox PHM. Similarly, when Inline graphic has the logistic distribution with Inline graphic, then the model (2.3) is the POM. In particular, consistent and asymptotically normal estimates for the conventional transformation model (2.3) can be obtained based on martingale estimating equations (Chen and others, 2002) or maximum likelihood estimates (Zeng and Lin, 2007).

The general promotion time cure model as considered in Zeng and others (2006) is also a special case of the generalized transformation model, when Inline graphic for some unknown CDF Inline graphic in (2.1), and Inline graphic in (2.4). Then the conditional survival function of Inline graphic is given as

S(tX)=pr(T>tX)=Sγ{φ(Xβ)+logF(t)}. (2.4)

Note that Inline graphic is an improper survival function since Inline graphic. In particular, Inline graphic is the proportion of latent cured individuals given covariate Inline graphic. Zeng and others (2006) assumed both Inline graphic and Inline graphic are known functions to ensure identifiability of the promotion time cure model defined in (2.4), and established asymptotic efficiency of semiparametric maximum likelihood estimates. So, we also assume Inline graphic and Inline graphic are both known functions.

In addition to the promotion time cure model defined in (2.4), another type of transformation cure models is the transformation mixture cure models (Lu and Ying, 2004). The transformation mixture cure models assume the failure time Inline graphic of the uncured subjects following the conventional transformation models in (2.3) and model the binary uncured status Inline graphic using a generalized linear model. Specifically, given covariate Inline graphic for the binary uncured status Inline graphic (Inline graphic denotes a patient being uncured), then

H(T)=Xβ+ϵ  and  πb(Z)=pr(Y=1|Z)=g(bTZ), (2.5)

where the common choice of the link function Inline graphic include logit, log–log and probit link functions. The covariate Inline graphic contains intercept and may share common components with covariates Inline graphic. General consistent and asymptotically normal estimates of Inline graphic and Inline graphic in (2.5) have been obtained by Lu and Ying (2004).

2.2. The concordance measures

2.2.1 The concordance index for transformation models without cured subjects

We first consider concordance measures for the conventional transformation model in (2.3) in the absence of cured subjects and extend it later to models in the presence of cured subjects. For a randomly selected pair of subjects Inline graphic and Inline graphic, a Kendall’s type concordance index Inline graphic for model (2.3) is as follows:

τK=pr(T1>T2|βX2>βX1). (2.6)

When Inline graphic, larger Inline graphic can almost surely predict a shorter survival Inline graphic. Therefore, Inline graphic is a natural measure of discriminatory accuracy for shorter versus longer survival based on Inline graphic in generalized transformation models. Under commonly existing symmetry between the two subjects in the pair, i.e. Inline graphic and Inline graphic we have

τK=2pr(T1>T2,βX2>βX1)=pr(βX2>βX1|T1>T2).

In fact, the well-known Harrell’s c-index is often referred as an estimator of Inline graphic Unfortunately, Harrell’s c-statistics in (1.1) is not a generally consistent estimate of Inline graphic and the magnitude of the asymptotic bias depends on the unknown censoring distribution (Uno and others, 2011). Fortunately, we can derive asymptotically unbiased estimate for Inline graphic under the generalized transformation models in (2.1). When Inline graphic is strictly increasing, we have

τK=2pr(ϵ1ϵ2>φ(X1β)φ(X2β),β(X2X1)>0).

Let Inline graphic, the above Inline graphic is equivalent to

τK=2E{G(φ(X1β),φ(X2β))I(βX2>βX1)}. (2.7)

Given a random sample of Inline graphic of size Inline graphic, by the properties of U-statistics, we have Inline graphic, where

Kn(β)=(n2)1ijG(φ(Xiβ),φ(Xjβ))I{β(XiXj)<0}.

Let Inline graphic be a consistent estimator of Inline graphic, which generally exists under random right censoring and some regularity conditions, e.g. Zeng and Lin (2007). Then, in contrast to the inconsistency of Harrell’s C-statistics, we have a generally consistent estimate Inline graphic of Inline graphic. From Corollary 1.1 in supplementary material available at Biostatistics online, Inline graphic has an asymptotically normal distribution that does not depend on the unknown censoring distribution. In short, the Kendall’s type concordance index Inline graphic for the transformation model in (2.3) can generally be consistently estimated using the following U-statistic

Kn(β^)=2n(n1)ijG(φ(β^Xi),φ(β^Xj))I(β^Xiβ^Xj<0), (2.8)

where Inline graphic is a consistent estimator of Inline graphic. As a special case, under the Cox PHM, Inline graphic has the extreme value distribution. Then the kernel function Inline graphic relates to the survival function of the standard logistic distribution:

G(x1,x2)={1+exp(x1x2)}1. (2.9)

It is easy to see that plugging the Inline graphic in (2.9) into (2.8) would yield exactly the same estimate in (1.2) for the concordance measure proposed by Gönen and Heller (2005) for the Cox PHM. Thus the general estimate in (2.8) naturally extends the concordance index proposed by Gönen and Heller (2005) for the Cox PHM to generalized transformation models in (2.1). As another example, when Inline graphic follows the POM, Inline graphic provides a consistent estimate for the concordance probability Inline graphic with the following kernel function Inline graphic:

G(x1,x2)=1ex1x2+(x1x2)ex1x2(1ex1x2)2, (2.10)

provided Inline graphic is a consistent estimate of Inline graphic. Note that Gönen and Heller (2005) established asymptotic normality of Inline graphic in (1.2) for the Cox PHM. In general, when Inline graphic is asymptotically normal, for the conventional transformation models in (2.3), the general consistency and asymptotic normality of the concordance estimate in (2.8) follows directly as an easy corollary of Theorem 2 for transformation mixture cure models proved in Appendix A of supplementary material available at Biostatistics online.

When we have two competing cure models, it is of interest to compare the prognostic accuracy of these models based on their k-indices Inline graphic and Inline graphic. Let Inline graphic. Then a consistent estimator for Inline graphic is Inline graphic where Inline graphic and Inline graphic are the corresponding k-indices in (2.8). Note that the difference of two Harrell’s c-statistics generally depends on the unknown censoring distribution while the difference of two k-indices, Inline graphic, does not depend on the unknown censoring distribution and is consistent to Inline graphic. We can use similar arguments as in the proof of Theorem 2 to show that Inline graphic can be approximated by a mean zero normal distribution. The estimated variance Inline graphic of Inline graphic can be obtained via bootstrap. Similar to the procedure proposed in Uno and others (2011), we suggest to report a two-sided 95% confidence interval (CI) of Inline graphic that is Inline graphic.

2.2.2 The k-index for promotion time cure model

Many populations of early stage cancers have non-ignorable latent fractions of cured patients that can be modeled using transformation cure models. Let Inline graphic indicate the binary uncured status such that Inline graphic (or Inline graphic) when the subject is uncured (or cured), respectively. In the special case, when all subjects are uncured, one can simply use (2.8) for an unbiased assessment of the discriminatory accuracy for predicting survival of the uncured subjects. In populations with cured patients, we define the k-index for uncured patients as follows:

K=pr(T1>T2βX2>βX1,Y1=Y2=1), (2.11)

where Inline graphic and Inline graphic are a randomly selected pair of subjects. For the promotion time cure model, we have Inline graphic iff Inline graphic Then it is straightforward to show that

K(β)=pr(T1>T2βX2>βX1,T1<,T2<)=E{I(βX2>βX1)G(φ(βX1),φ(βX2))}E{I(βX2>βX1){1Sγ[φ(βX1)]}{1Sγ[φ(βX2)]}},

where Inline graphic. Then, we can estimate the k-index in (2.11) by a ratio of U-statistics

Kn(β^)=ijG(φ(β^Xi),φ(β^Xj))I{φ(β^Xi)φ(β^Xj)<0}ijI{φ(β^Xi)φ(β^Xj)<0}{1Sγ[φ(β^Xi)]{1Sγ[φ(β^Xj)]}}. (2.12)

When all patients are uncured, the concordance probability in (2.12) is clearly equivalent to the estimate in (2.8). As shown below, Inline graphic is consistent to Inline graphic when Inline graphic is a consistent estimate of Inline graphic. Moreover, Inline graphic is asymptotically normal under the following assumptions (A1)–(A3).

  • (A1) Inline graphic is differentiable. Inline graphic has a continuous component and Inline graphic is bounded in probability.

  • (A2) Let Inline graphic denote the observed data that are independent copies of Inline graphic. There exists a square-integrable random vector Inline graphic such that Inline graphic and Inline graphic is nonsingular, and
    n1/2(β^β)=n1/2i=1nU(Oi)+op(1). (2.13)
  • (A3) Asymptotic linearity of Inline graphic: In a neighborhood of the true parameter Inline graphic, we have
    Kn(β^)=Kn(β)+D(β)T(β^β)+op(β^β), (2.14)
    where Inline graphic.
Theorem 1

Denote Inline graphic. Under the above assumptions (A1)–(A3) we have, in distribution,

n1/2{Kn(β^)K}/σKN(0,1).

Condition (A1) is assumed in Zeng and others (2006) in order to derive the consistency and asymptotic normality of Inline graphic. Condition (A2) assumes that Inline graphic is a regular asymptotically linear (RAL) estimator of Inline graphic (Tsiatis, 2007) that is commonly hold for asymptotically normal estimator (Lu and Ying, 2004; Lu, 2008). Condition (A3) is a technical condition that assume the local linear expansion holds for Inline graphic. A similar condition is used in Gönen and Heller (2005). The proof of Theorem 1 and the plug-in estimator of Inline graphic are provided in supplementary material available at Biostatistics online. To compare the prognostic accuracy for two sets of covariates based on the difference of their k-indices Inline graphic, we can construct a 95% CI for the difference of two k-indices as discussed in the last section.

2.2.3 The k-index for transformation mixture cure models

In transformation mixture cure models defined in (2.5), the k-index in (2.11) can be written as

K(θ)=E{I(βX2>βX1)G(βX1,βX2)πb(Z1)πb(Z2)}E{I(βX2>βX1)πb(Z1)πb(Z2)}, (2.15)

where Inline graphic and Inline graphic, Inline graphic. Similarly Inline graphic can be estimated by

Kn(θ^)=ijG{β^Xi,β^Xj}I{β^Xi<β^Xj}πb^(Zi)πb^(Zj)ijI{β^Xi<β^Xj}πb^(Zi)πb^(Zj), (2.16)

where Inline graphic. The estimator Inline graphic is also consistent and asymptotically normal under three regularity conditions as described below.

  • (B1) Let Inline graphic denote the observed data that are independent copies of Inline graphic. There exists a square-integrable random vector Inline graphic such that Inline graphic and Inline graphic is nonsingular, and
    n1/2(θ^θ)=n1/2i=1nU(Oi)+op(1). (2.17)
  • (B2) A positive proportion of the sample is uncured, i.e. Inline graphic as Inline graphic, where Inline graphic is a constant.

  • (B3) Asymptotic linearity of Inline graphic: In a neighborhood of true parameter Inline graphic we have
    Kn(θ^)=Kn(θ)+D(θ)T(θ^θ)+op(θ^θ),
    where Inline graphic.
Theorem 2

Denote Inline graphic. Under conditions (B1)–(B3), in distribution,

n1/2{Kn(θ^)K}/σKN(0,1).

Condition (B2) is just to ensure the existence of events when sample size is large. That is clearly a minimal assumption. These conditions (B1) and (B3) are similar to those in Theorem 1. In particular, for parametric models, these regularity conditions can be verified straightforwardly. Thus, our proposed estimates work for many parametric as well as semiparametric models. The proof of Theorem 2 and the evaluation of plug-in estimator of Inline graphic are provided in supplementary material available at Biostatistics online. To compare the prognostic accuracy for two sets of covariates based on the difference of their k-indices Inline graphic, we can construct a 95% CI for the difference of two k-indices as in the Section 2.2.1.

3. Numerical studies

We conducted extensive simulations to evaluate the performance of the newly proposed k-index in both the promotion time cure models and the mixture cure models.

3.1. A simulation study for promotion time cure model

We consider promotion time cure model defined in formula (2.4) with conditional survival function Inline graphic. Following Zeng and others (2006), let Inline graphic, Inline graphic and Inline graphic follow the Box-Cox type transformations:

Sγ(t)=(1+γexp(t))1/γI(γ>0)+exp(exp(t))I(γ=0).

In this family, we considered the situation Inline graphic for PHM and Inline graphic for POM. We assumed that the covariate Inline graphic from a uniformly distributed random variable in Inline graphic and the covariate Inline graphic from a Bernoulli distribution with “success” probability Inline graphic. Let Inline graphic where Inline graphic, Inline graphic and Inline graphic. The censoring time was generated from an exponential distribution with selected rates such that the censoring proportions are between Inline graphic and Inline graphic. We tried other censoring distributions but the results are similar, thus not presented. We also tried different sample sizes for our simulations, since the results and patterns are similar, we only present the case with the sample size Inline graphic. For each simulated dataset, we estimated the parameter Inline graphic by fitting a parametric promotion time cure model. Then we calculated the k-index in the formula (2.12). We used Inline graphic independent replications of the same procedure and compared the estimated Inline graphic-index with the true value Inline graphic that was calculated based on Inline graphic replicates. Table 1 summarizes the simulation results.

Table 1.

Simulation results for parametric promotion time cure models with Inline graphic

Model Censored Proportion (%) Cured Proportion (%) Inline graphic Inline graphic SE of Inline graphic Simulation SE of Inline graphic
PHM 50 30 0.645 0.641 0.014 0.014
PHM 60 30 0.645 0.639 0.015 0.015
PHM 70 30 0.645 0.640 0.016 0.016
POM 40 20 0.699 0.696 0.012 0.012
POM 50 20 0.699 0.695 0.012 0.012
POM 60 20 0.699 0.695 0.013 0.013

As can be seen from Table 1, the estimator Inline graphic is close to the true value and the bootstrap standard deviation of Inline graphic is close to the empirical standard deviation in all scenarios.

3.2. A simulation study for transformation mixture cure models

For uncured subjects, we generated survival time data from the following transformation model,

log(T)=X1X2+ϵ,

where Inline graphic was sampled from a normal distribution with standard deviation Inline graphic, and Inline graphic was a Bernoulli random variable with “success” probability Inline graphic. We considered two scenarios for the distribution of Inline graphic that included extreme value distribution for PHM, and standard logistic distribution for POM. Censoring times were generated from a uniform distribution, Inline graphic, where Inline graphic is selected to control the percentage of censored observations in uncured subjects. We also tried exponential distribution and other censoring distributions but the results are similar, thus not presented. For the cure status, we chose the following logistic model with the same covariates Inline graphic and Inline graphic:

πb(X)=exp(b0+X1X2)1+exp(b0+X1X2). (3.1)

We selected Inline graphic to control the percentage of cured observations in each replication. Uncured status Inline graphic were then generated from a Bernoulli distribution with “success” probability Inline graphic in (3.1). We selected Inline graphic and Inline graphic such that the cured and overall censored proportion were [(0.5,0.6),(0.3,0.6),(0.5,0.8)]. We tried different sample sizes for our simulations, since the results and patterns are similar, we only present the case with the sample size Inline graphic. For each simulated dataset, we estimated parameters by fitting a parametric mixture cure model. Then we calculated the k-index in the formula (2.16). We repeated the procedure Inline graphic times and compared the estimated Inline graphic-index with the true value Inline graphic that was calculated based on Inline graphic replicates. Table 2 summarized the simulation results.

Table 2.

Simulation results for parametric transformation mixture cure models with Inline graphic

Model Censored Proportion (%) Cured Proportion (%) Inline graphic Inline graphic SE of Inline graphic Simulation SE of Inline graphic
PHM 60 50 0.798 0.798 0.017 0.017
PHM 60 30 0.798 0.807 0.015 0.017
PHM 80 50 0.798 0.800 0.022 0.022
POM 60 50 0.739 0.739 0.021 0.022
POM 60 30 0.739 0.750 0.021 0.022
POM 80 50 0.739 0.741 0.029 0.027

As shown in Table 2, Inline graphic is stable and close to the nominal truth. The standard error of Inline graphic is similar to the empirical variability of Inline graphic obtained by simulation. We also fitted semiparametric Cox PH cure model (Peng and Dear, 2000; Sy and Taylor, 2000). Results were similar to the parametric Cox PHM. The simulation results demonstrate that Inline graphic is not sensitive to the censoring distribution. This is similar to those findings reported in Gönen and Heller (2005) where k-index is insensitive to censoring distribution while the c-index of Harrell is sensitive. We show different types of censoring distributions for transformation mixture cure models and promotion time cure models to illustrate that our proposed method does not depend on the type of censoring distributions.

3.3. An example: a well-known Denmark melanoma dataset

In this section, we analyzed the dataset on malignant melanoma collected by K. T. Drzewiecki. In this study, investigator collected 205 stage I melanoma patients at the Department of Plastic Surgery, University Hospital of Odense, Denmark from 1962 to 1977. The data were summarized in Appendix 1 of Andersen and others (1993) and public available in the Inline graphic package Inline graphic (Dalgaard, 2008). The median follow-up time is 5.5 years, and the overall censoring proportion is 72%. Gender, tumor thickness and ulceration status are provided for all patients. We took log transformation for tumor thickness. The Kaplan–Meier survival curves in Figure 1 shows a plateau near the end of the study that suggests the possibility of a cured sub-population. To model the cure fraction, we fit Cox PH mixture cure model and report the parameter estimates in Table 3. The Cox PH mixture cure model indicates that patients with ulceration have lower chance to be cured after surgery (P-value = 0.040); patients with thicker tumor tend to have a shorter lifetime among uncured patients (P-value = 0.008).

Fig. 1.

Fig. 1.

Kaplan–Meier curves for the melanoma data stratified by above or below the median of tumor thickness (left) and Yes or No for ulceration status (right).

Table 3.

Parameter Estimates (standard errors) and P-values fitted by semiparametric Cox PH cure model for the melanoma data

  Cox PH model Logit cure model
  Log HR P-value Log OR P-value
Intercept N/A   Inline graphic3.00(1.46) 0.039
Gender 0.69(0.46) 0.132 0.22(0.45) 0.632
log(tumor thick) 0.79(0.30) 0.008 0.35(0.28) 0.209
Ulceration 0.27(0.49) 0.581 1.07(0.52) 0.040

The k-index based on Cox PH mixture cure model with covariates gender, log of tumor thickness, and ulceration is 0.72, and the 95% CI is (0.63, 0.81). From the k-index and 95% CI, we can see that the linear combination of gender, the logarithm of tumor thickness and ulceration have statistically significant predictive utility for the survival of uncured patients. However, the predictive accuracy of such prognostic models might be further improved by inlcuding other clinico-pathologic characteristics (Cymerman and others, 2016), treatment profiles(Sun and others, 2016), and biomarkers such as levels of circulating microRNA (Friedman and others, 2012) and ctDNA (Chang and others, 2016). Nevertheless, this example illustrates that we can assess the prognostic utility of log-thickness and ulceration without the need to know who is surely cured.

4. Discussion

Even though mixture cure models and other cure models have been introduced for decades in statistics literature, they have not been widely used in clinical practice (Othus and others, 2012). In particular, there has been a lack of statistical metrics to evaluate treatment effects of adjuvant therapies and prognostic utility of biomarkers in this context due to the challenges associated with unknown cured status. Although more than 90% of stage I melanoma patients are cured by surgery, there is no timely and reliable way to tell who is surely cured. It seems unrealistic to think there is ever a magic method capable of predicting with certainty that a patient has been cured soon after surgery. When the cure fraction is high, statistical metrics and analysis based on overall survival of the patient population and ignoring existence of cured patients are inadequate to characterize cancer prognosis.

There is a recent surge in developing immunotherapies or cancer vaccines that can strengthen patients’ immune system to prevent or delay cancer progression and recurrence (Hodi and others, 2010; Couzin-Frankel, 2013; Wolchok and others, 2013; Robert and others, 2015). Many of the candidate agents for cancer vaccine are known to be non-toxic or minimal toxic thus without the need of the typical stage I trials for assessment of their toxicity profile (Simon and others, 2001; Mukhi and Shao, 2009; Schlom, 2012). Also, there have been sustained interest in the medical community to use some of these non-toxic cancer vaccine agents to develop effective adjuvant therapies for early stage cancers with high percentage of curable subjects. Thus, one potential application of the newly proposed evaluation metric is to directly evaluate long-term treatment effects of adjuvant therapies (e.g. immunotherapies) for uncured patients with or without the presence of cured ones. Further applications of the proposed evaluation metrics for discriminative accuracy can be found in various applications of the mixture cure model as discussed in Maller and Zhou (1996) and Liu and others (2006).

In this article, we develop general concordance measures as evaluation metrics for the discriminatory accuracy of the conventional transformation models as well as transformation cure models extending the pioneering work of Gönen and Heller (2005) originally designed for the Cox PH models. We introduce explicit formulas for the consistent estimates of the concordance measures, and show that their asymptotic distributions do not depend on the unknown censoring distribution. The proposed concordance measures and estimates work for both parametric and semiparametric transformation models as well as transformation cure models. Numerical feasibility of the estimates and their robustness to the censoring distributions are illustrated via simulation studies. However, the proposed measures and estimates work well only when the assumed statistical models are correctly specified. Thus, goodness of fit or model diagnostic procedures should be identified and applied properly to ensure the transformation models and mixture cure models are suitable for the data (Wileyto and others, 2013; Peng and Taylor, 2016). Additionally, it is of interest to compare predictive accuracy of two competing predictive models by a formal hypothesis testing procedure. However, there is a well-known difficulty associated with testing correlated predictive models even for binary outcomes (Vickers and others, 2011; Demler and others, 2012; Pepe and others, 2013, 2014). Thus the testing problem is worth further systematic investigation in the future. Finally, we have written an R package to calculate k-index, with bootstrap CIs. The GitHub repo of the R package is available at https://github.com/elong0527/evacure with commit number 629d91087b950cc788326d8d1eb159a55d8fc4e0; last accessed date 21 April 2017.

Supplementary material

Supplementary material is available at http://biostatistics.oxfordjournals.org.

Supplementary Material

Supplementary Data

Acknowledgments

Conflict of Interest: None declared.

Funding

National Institutes of Health grant P30 CA16087-23 and P30 AG0851.

References

  1. Andersen P. K., Borgan r., Gill R. D. & Keiding N. (1993). Statistical Models Based on Counting Processes. New York, NY: Springer. [Google Scholar]
  2. Asano J., Hirakawa A. and Hamada C. (2014). Assessing the prediction accuracy of cure in the cox proportional hazards cure model: an application to breast cancer data. Pharmaceutical Statistics 13, 357–363. [DOI] [PubMed] [Google Scholar]
  3. Chang G., Tadepalli J., Shao Y., Zhang Y., Weiss S., Robinson E., Spittle C., Furtado M., Shelton DN., Karlin-Neumann G. & Pavlick A. (2016). Sensitivity of plasma braf mutant and nras mutant cell-free dna assays to detect metastatic melanoma in patients with low recist scores and non-recist disease progression. Molecular Oncology 10, 157–165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen M.-H., Ibrahim J. G. & Sinha D. (1999). A new Bayesian model for survival data with a surviving fraction. Journal of the American Statistical Association 94, 909–919. [Google Scholar]
  5. Chen K., Jin Z. & Ying Z. (2002). Semiparametric analysis of transformation models with censored data. Biometrika 89, 659–668. [Google Scholar]
  6. Couzin-Frankel J. (2013). Cancer immunotherapy. Science 342, 1432–1433. [DOI] [PubMed] [Google Scholar]
  7. Cymerman R., Shao Y., Wang K., Zhang Y., Murzaku E., Penn L., Osman I. & Polsky D. (2016). De novo vs nevus-associated melanomas: Differences in associations with prognostic indicators and survival. Journal of the National Cancer Institute 108, pii: djw121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dalgaard P. (2008). Introductory Statistics with R. New York, NY: Springer-Verlag. [Google Scholar]
  9. Demler O. V., Pencina M. J. & D’Agostino R. B. (2012). Misuse of delong test to compare aucs for nested models. Statistics in Medicine 31, 2577–2587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Farewell V. T. (1982). The use of mixture models for the analysis of survival data with long-term survivors. Biometrics 38, 1041–1046. [PubMed] [Google Scholar]
  11. Friedman E., Shang S., de Miera E., Fog J., Teilum M., Ma M., Berman R., Pavlick A., Hernando E., Baker A., Shao Y. And others (2012). Serum microRNAs as biomarkers for recurrence in melanoma. Journal of Translational Medicine 10, 155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Gönen M. & Heller G. (2005). Concordance probability and discriminatory power in proportional hazards regression. Biometrika 92, 965–970. [Google Scholar]
  13. Harrell F. E., Califf R. M., Pryor D. B., Lee K. L. & Rosati R. A. (1982). Evaluating the yield of medical tests. Journal of the American Medical Association 247, 2543–2546. [PubMed] [Google Scholar]
  14. Harrell F. E., Lee K. L., Califf R. M., Pryor D. B. & Rosati R. A. (1984). Regression modelling strategies for improved prognostic prediction. Statistics in Medicine 3, 143–152. [DOI] [PubMed] [Google Scholar]
  15. Hodi F. S., O’Day S. J., McDermott D. F., Weber R. W., Sosman J. A., Haanen J. B., Gonzalez R., Robert C., Schadendorf D., Hassel J. C. et al. (2010). Improved survival with ipilimumab in patients with metastatic melanoma. New England Journal of Medicine 363, 711–723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Kalbfleisch J. D. & Prentice R. L. (2011). The Statistical Analysis of Failure Time Data, Volume 360 New York, NY: John Wiley & Sons. [Google Scholar]
  17. Kuk A. Y. & Chen C.-H. (1992). A mixture model combining logistic regression with proportional hazards regression. Biometrika 79, 531–541. [Google Scholar]
  18. Liu M., Lu W. & Shao Y. (2006). Interval mapping of quantitative trait loci for time to event data with the proportional hazards mixture cure. Biometrics 62, 1053–1061. [DOI] [PubMed] [Google Scholar]
  19. Lu W. (2008). Maximum likelihood estimation in the proportional hazards cure model. Annals of the Institute of Statistical Mathematics 60, 545–574. [Google Scholar]
  20. Lu W. & Ying Z. (2004). On semiparametric transformation cure models. Biometrika 91, 331–343. [Google Scholar]
  21. Maller R. A. & Zhou X. (1996). Survival Analysis with Long-Term Survivors. New York: Wiley. [Google Scholar]
  22. Mukhi V. & Shao Y. (2009). Optimal two-stage designs to evaluate a series of new agents or treatments. Statistics in Biopharmaceutical Research 1, 377–387. [Google Scholar]
  23. Othus M., Barlogie B., LeBlanc M. L. & Crowley J. J. (2012). Cure models as a useful statistical tool for analyzing survival. Clinical Cancer Research 18, 3731–3736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Pencina M. J. & D’Agostino R. B. (2004). Overall c as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Statistics in Medicine 23, 2109–2123. [DOI] [PubMed] [Google Scholar]
  25. Peng Y. & Dear K. B. (2000). A nonparametric mixture model for cure rate estimation. Biometrics 56, 237–243. [DOI] [PubMed] [Google Scholar]
  26. Peng Y. & Taylor J. M. (2016). Residual-based model diagnosis methods for mixture cure models. Biometrics. [DOI] [PubMed] [Google Scholar]
  27. Pepe M. S., Janes H. & Li C. I. (2014). Net risk reclassification p values: valid or misleading? Journal of the National Cancer Institute 106, dju041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Pepe M. S., Kerr K. F., Longton G. & Wang Z. (2013). Testing for improvement in prediction model performance. Statistics in Medicine 32, 1467–1482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Robert C., Long G. V., Brady B., Dutriaux C., Maio M., Mortier L., Hassel J. C., Rutkowski P., McNeil C., Kalinka-Warzocha E. et al. (2015). Nivolumab in previously untreated melanoma without braf mutation. New England Journal of Medicine 372, 320–330. [DOI] [PubMed] [Google Scholar]
  30. Schlom J. (2012). Therapeutic cancer vaccines: current status and moving forward. Journal of the National Cancer Institute 104, 599–613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Simon R. M., Steinberg S. M., Hamilton M., Hildesheim A., Khleif S., Kwak L. W., Mackall C. L., Schlom J., Topalian S. L. & Berzofsky J. A. (2001). Clinical trial designs for the early clinical development of therapeutic cancer vaccines. Journal of Clinical Oncology 19, 1848–1854. [DOI] [PubMed] [Google Scholar]
  32. Sun X., Bao J. & Shao Y. (2016). Mathematical modeling of therapy-induced cancer drug resistance: Connecting cancer mechanisms to population survival rates. Scientific Reports 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Sy J. P. & Taylor J. M. (2000). Estimation in a cox proportional hazards cure model. Biometrics 56, 227–236. [DOI] [PubMed] [Google Scholar]
  34. Tsiatis A. (2007). Semiparametric Theory and Missing Data. New York, NY: Springer Science & Business Media. [Google Scholar]
  35. Uno H., Cai T., Pencina M. J., D’Agostino R. B. & Wei L. (2011). On the c-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in Medicine 30, 1105–1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Vickers A. J., Cronin A. M. & Begg C. B. (2011). One statistical test is sufficient for assessing new predictive markers. BMC Medical Research Methodology 11, 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Wileyto E. P., Li Y., Chen J. & Heitjan D. F. (2013). Assessing the fit of parametric cure models. Biostatistics 14(2), 340–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Wolchok J. D., Kluger H., Callahan M. K., Postow M. A., Rizvi N. A., Lesokhin A. M., Segal N. H., Ariyan C. E., Gordon R.-A., Reed K. et al. (2013). Nivolumab plus ipilimumab in advanced melanoma. New England Journal of Medicine 369, 122–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Yakovlev A. Y. & Tsodikov A. D. (1996). Stochastic Models of Tumor Latency and Their Biostatistical Applications, Volume 1 OECD Publishing. [Google Scholar]
  40. Zeng D. & Lin D. (2007). Maximum likelihood estimation in semiparametric regression models with censored data. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 69, 507–564. [Google Scholar]
  41. Zeng D., Yin G. & Ibrahim J. G. (2006). Semiparametric transformation models for survival data with a cure fraction. Journal of the American Statistical Association 101, 670–684. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES