Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jul 30.
Published in final edited form as: Stat Med. 2015 Apr 10;34(17):2497–2513. doi: 10.1002/sim.6501

A Weibull multi-state model for the dependence of progression-free survival and overall survival

Yimei Li a, Qiang Zhang b,*
PMCID: PMC4490073  NIHMSID: NIHMS681480  PMID: 25865438

Abstract

In oncology clinical trials, overall survival (OS), time to progression (TTP) and progression free survival (PFS) are three commonly used endpoints. Empirical correlations among them have been published for different cancers, but statistical models describing the dependence structures are limited. Recently Fleischer et al. proposed a statistical model that is mathematically tractable and shows some flexibility to describe the dependencies in a realistic way, based on the assumption of exponential distributions. This paper aims to extend their model to the more flexible Weibull distribution. We derived theoretical correlations among different survival outcomes, as well as the distribution of overall survival induced by the model. Model parameters were estimated by the maximum likelihood method and the goodness of fit was assessed by plotting estimated vs. observed survival curves for OS. We applied the method to three cancer clinical trials. In the non-small-cell lung cancer trial, both the exponential and the Weibull models provided an adequate fit to the data and the estimated correlations were very similar under both models. In the prostate cancer trial and the laryngeal cancer trial, the Weibull model exhibited advantages over the exponential model and yielded larger estimated correlations. Simulations suggested that the proposed Weibull model is robust for data generated from a range of distributions.

Keywords: overall survival, progression-free survival, correlation, oncology, Weibull distribution

1. Introduction

In the development of anti-cancer therapies the most commonly used endpoints are: overall survival (OS), time to progression (TTP) and progression free survival (PFS). OS is defined as the time from randomization until death from any cause or censoring at the last follow up. TTP is defined as the time from randomization until tumor progression, considering death as censoring. PFS considered both progression and death as events, and is defined as time from randomization to tumor progression or death whichever occurs first. Generally OS is considered as the most reliable clinical endpoint and is preferred by regulatory agencies, because prolonging patients’ lives is the ultimate goal [1]. However, determination of OS may require a prolonged follow-up period and be influenced by subsequent therapies after patients go off a given trial [2]. On the other hand, for the endpoints such as PFS and TTP the events can be observed sooner, and for certain settings these are more direct measures of clinical benefit [3, 4]. PFS and TTP have become more acceptable endpoints, especially in the earlier phases of drug development such as phase II trials [5, 6]. Among these two potential surrogate endpoints for OS, PFS is often preferred over TTP.

To validate PFS or TTP as acceptable surrogates for OS, establishing their correlations is an important aspect. There have been publications of the empirical correlations between these endpoints in different cancers [7, 8, 9, 10], but statistical models describing the dependence structures are very limited. Recently Fleischer et al. proposed a statistical model that is mathematically tractable and shows some flexibility to describe the dependencies in a realistic way [11]. Their model assumed exponential distributions for TTP, survival time before progression, and survival time post progression. They derived the correlations among the survival endpoints (TTP, PFS, and OS) and also the distribution of the OS induced by the model. However, in our attempt to apply their methods to the data from cancer clinical trials, we found cases in which there are discrepancies between the observed OS and their model-based estimated OS. Therefore, we aim to extend their method by generalizing the exponential distribution to the Weibull distribution, which should provide a more flexible model framework and a better fit to real data.

The rest of the article is organized as follows: In §2, we present the general model and the correlations between different endpoints induced by the model. In §3, we describe the method for estimating parameters and assessing overall model fit. In §4, we apply the method to three cancer clinical trials and compare the results with those from the exponential model. In §5 we present simulation results, and we offer concluding comments in §6.

2. The Statistical Model

Following the framework of Fleischer et al. [11], we use OSorig to represent the hypothetical original OS, which is independent of TTP. Then PFS is given by the minimum of TTP and OSorig, reflecting the general construction mechanism of the PFS. If death occurs before progression (i.e. PFS ≠ TTP), OS would be the same as PFS. Otherwise if the progression occurred first, we introduced OS’ representing the time to death after progression, and naturally we would expect that the hazard rate for OS’ would be different from the hazard rate for OSorig, due to the occurrence of progression. In this case OS would be the sum of time to progression (TTP) and the time to death post progression (OS’). Therefore, the general model is:

TTP~f1=Weibull(α,λ1),OSorig~f2=Weibull(α,λ2),PFS=min(TTP,OSorig)
OS~f3=Weibull(α,λ3),OSTTPOSorig
OS={PFS,ifPFSTTPTTP+OS,ifPFS=TTP;
whereλ1>0,λ2>0,λ3>0andα>0. (1)

We assume a Weibull distribution for TTP, OSorig and OS’, extending the exponential distribution assumption in the earlier work. We allow various rate parameters λ1, λ2 and λ3 but assume a same shape parameter α, for the purpose of mathematical convenience (will be shown in the subsequent proofs). In practice often times the shape of the hazard function for progression and death are similar, e.g. if the hazard of progression increases over time, then the hazard of death increases over time as well; therefore the assumption of the same shape parameter α seems plausible. Note that in our parametrization, a variable T follows Weibull(α, λ) means f (t) = λαtα−1exp(−λtα).

The model could also be represented as a multi-state model as illustrated in Figure 1. At randomization (or the study entry in a single-arm trial) the state is 0, progression of disease is state 1 and death is state 2. A patient could reach the absorbing state 2 (death) directly or through the transient state 1 (progression). The transition intensities (or hazards) from the state i to the state j at time t are denoted as πij(t). The exponential model [11] assumed that the transition intensities are constant over time, i.e., π01(t) = λ1, π02(t) = λ2, and π12(t) = λ3. Our extended Weibull model allows the transition intensities to change over time, i.e., π01(t) = λ1αtα−1, π02(t) = λ2αtα−1, and π12(t) = λ3αtα−1.

Figure 1.

Figure 1

The multi-state Weibull model.

Then similar to Fleischer et al. [11], we can derive the five theorems:

Theorem 1

Under the general model (1), the distribution of PFS is:

PFS~weibull(α,λ1+λ2);

The probability that progression occurs first is:

p0=Prob(PFS=TTP)=Prob(TTP<OS)=λ1λ1+λ2.

Theorem 2

Under the general model (1), the correlation between PFS and OS is:

Corr(PFS,OS)=Γ(1+2α)Γ2(1+1α)Γ(1+2α)Γ2(1+1α)+λ12[Γ(1+2α)Γ2(1+1α)]+λ1λ2Γ(1+2α)(λ1+λ2)(22/α)λ32/α.

Theorem 3

Under the general model (1), the correlation between TTP and OS is:

Corr(TTP,OS)=Cov(TTP,OS)Γ(1+2α)Γ2(1+1α)λ12α·(Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α+λ12[Γ(1+2α)Γ2(1+1α)]+λ1λ2Γ(1+2α)(λ1+λ2)2λ32α);
Cov(TTP,OS)=λ1λ1+λ2[Γ(1+2α)(λ1+λ2)2α+Γ2(1+1α)(λ1+λ2)1αλ31α]+λ2αλ11α0xαexp(λ2xα)UIG(1+1α,λ1xα)dxΓ2(1+1α)λ11α[1(λ1+λ2)1α+λ1(λ1+λ2)λ31α],

where UIG is the Upper Incomplete Gamma function UIG (a,t)=tua1eudu.

Theorem 4

Under the general model (1), the conditional correlation between TTP and OS is:

Corr(TTP,OS|TTP<OS)=λ31αλ32α+(λ1+λ2)2α.

Theorem 5

Under the general model (1), the survival function for OS at any time point × is:

SOS(x)=exp[(λ1+λ2)xα]+λ1α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy

Proofs of the above theorems are provided in the Appendix. Figure 2 illustrates the correlation between PFS and OS based on Theorem 2. In this example, the parameters λ1 and λ2 are fixed as λ1 = 2 and λ2 = 1. We show three scenarios of α and various scales of λ3. Obviously for a fixed value of α, the correlation becomes larger when λ3 increases, and the value of α influences how the correlation changes with λ3.

Figure 2.

Figure 2

Correlation of PFS and OS for λ1 = 2, λ2 = 1, variable λ3 and α.

To calculate the correlation between TTP and OS based on Theorem 3, we need numerical integration to obtain the integral in the formula of Cov(TTP,OS). For any values of (λ1, λ2, λ3, α), the three correlations could be calculated using Theorems 2–4. Figure 3 illustrates the correlations for λ1 = 2, λ2 = 1, λ3 = 2 and variable α. We note that the correlation Corr(PFS,OS) and the conditional correlation Corr(TTP,OS|TTP<OS) are always greater than zero, but the correlation Corr(TTP,OS) could be less than zero. When λ1, λ2, and λ3 are fixed, the shape parameter α has a relatively greater influence on Corr(PFS,OS) and Corr(TTP,OS|TTP<OS), and a lesser influence on Corr(TTP,OS).

Figure 3.

Figure 3

Correlations of Corr(PFS,OS), Corr(TTP,OS) and Corr(TTP,OS|TTP<OS), for λ1 = 2, λ2 = 1, λ3 = 2 and variable α.

Moreover using Theorem 5, we can obtain the overall survival probability at any time. We again use numerical integration to calculate the integral in the formula. We also note that when α = 1, all of the above theorems (1–5) lead to the same results as shown in Fleischer et al. [11].

3. Model Estimation

We denote θ = (α, λ1, λ2, λ3) as the vector of parameters to be estimated, and θ̂ = (α̂, λ̂1, λ̂2, λ̂3) as the corresponding maximum likelihood estimates (MLEs). Typically the observed data contain four types of patients: 1. patients who progress and then censor without death; 2. patients who progress and then die; 3. patients who die before progression; 4. patients who censor without progression or death. We used δi as the indicator for patient type i (i = 1, 2, …, n); δi = k means patient i is of type k. For a patient of type 1 or 2, we observe two times: ti1 representing the time to progression and ti2 representing the time after progression to death or censoring. For a patient of type 3 or 4, we only observe one time: ti1 representing the time to death or censoring and ti2 is missing. We can then calculate the likelihood Li(k)(θ) (k = 1, 2, 3, 4), for each type of patient:

  • If δi = 1 then
    Li(1)(θ)=P(TTP=ti1)P(OSorig>ti1)P(OS>ti2)=f1(ti1)S2(ti1)S3(ti2)
  • If δi = 2 then
    Li(2)(θ)=P(TTP=ti1)P(OSorig>ti1)P(OS=ti2)=f1(ti1)S2(ti1)f3(ti2)
  • If δi = 3 then
    Li(3)(θ)=P(TTP>ti1)P(OSorig=ti1)=S1(ti1)f2(ti1)
  • If δi = 4 then
    Li(4)(θ)=P(TTP>ti1)P(OSorig>ti1)=S1(ti1)S2(ti1)

Recall that f1(·) and S1(·) are the density and survival functions for TTP which follow Weibull(α, λ1), f2(·) and S2(·) are the density and survival functions for OSorig which follow Weibull(α, λ2), and f3(·) and S3(·) are the density and survival functions for OS’ which follow Weibull(α, λ3). Summing the log likelihood across all subjects gives the overall log likelihood:

logL(θ)=i=1nk=14logLi(k)(θ)I(δi=k)

Applying an optimization algorithm to the likelihood we can obtain the MLEs. In our analyses, we use the optim() function in R with the BFGS (Broyden Fletcher Goldfarb Shanno) optimization method [12]. We reparameterize all parameters (log transformation) to avoid boundary constraints in the optimization procedures. Simulations suggest that the MLE estimates behave well, with mean estimates very close to the true values.

After obtaining the MLEs, we can substitute the values of (α̂, λ̂1, λ̂2, λ̂3) into Theorems 2–4 to calculate the estimated correlations. Similarly, we can estimate the survival function for OS at any time point × according to Theorem 5. Plotting this OS against time × we have the estimated OS curve based on the model, which can be compared to the Kaplan-Meier (KM) curve of the OS from the observed data. A good agreement between two curves suggests a good overall fit.

Because our model assumes a common α for TTP, OSorig, and OS’, it would be useful to formally test this assumption. To do so we could fit another Weibull model with three different shape parameters, by simply changing the S and f in the formulas of Li(k). Then we can compare the two models using the likelihood ratio (LR) test. A significant p value would suggest that the assumption of a common α is violated.

All analyses are conducted in R. Sample R codes are provided in the supplement and full codes are available from the authors.

4. Application

We applied our methods to three examples of cancer clinical trials from the Radiation Therapy Oncology Group (RTOG). In all three examples, PFS was the primary endpoint and OS was one of the secondary endpoints. The first study was a phase III trial to compare prophylactic cranial irradiation versus observation in patients with locally advanced non-small-cell lung cancer (RTOG 0214) [13, 14]. The failure event of PFS was defined as the earliest event of death due to any cause, local progression, regional metastasis, distant metastasis, or second primary tumor. The second study was a phase III trial to compare whole-pelvic versus prostate-only radiotherapy and neoadjuvant versus adjuvant combined androgen suppression (RTOG 9413) [15]. The failure event of PFS was defined as the first occurrence of local progression, regional nodal failure, distant failure, biochemical (PSA) failure or death due to any cause. The third study was a randomized trial to compare three treatments for patients with locally advanced cancer of the larynx (RTOG 9111) [16]. The failure event of PFS was defined as laryngectomy or death due to any cause. In RTOG 0214 and RTOG 9111 neither OS or PFS was significantly different between treatment arms; therefore, all treatment arms were pooled in our analyses. In RTOG 9413, although OS was similar across treatment arms, PFS was significantly higher in the patients treated with whole-pelvic radiotherapy (WPRT) than those treated with prostate-only radiotherapy (PORT), so the two treatment groups were analyzed separately.

Table 1 presents the estimated parameters from the exponential model and the Weibull model, Table 2 presents the estimated correlations from the two models by substituting MLEs into Theorems 2–4, and Figure 4 shows the estimated KM curve for OS together with the estimated curves for OS based on Theorem 5. In the first trial RTOG 0214, the estimated log(α) in the Weibull model is very close to 0, so that the Weibull model essentially reduces to the exponential model. Therefore the estimated log(λ)s under the two models are almost identical (Table 1), the estimated correlations under two models are almost the same as well (Table 2), and the estimated OS survival curves from both models are almost overlapping with the observed KM curve (Figure 4). Interestingly, comparing our Weibull model with the same shape parameter to the Weibull model with different shape parameters, the LR test gives a p value of 0.0183 suggesting a violation of the common α assumption. In the model with different shape parameters, the estimated αs are 0.93, 1.35 and 0.88. Therefore, when forcing a common α as in our proposed Weibull model, the estimated α is close to 1 reflecting an average of the three αs. This trial example suggests that our Weibull model could fit the data well even if the assumption of common shape parameter is questionable, and the exponential model and Weibull model fit the data similarly well when the shape parameter α is close to 1.

Table 1.

Parameter estimates from exponential and Weibull models.

exponential Weibull


log(λ1) log(λ2) log(λ3) log(α) log(λ1) log(λ2) log(λ3)
RTOG 0214 −0.846 −2.418 0.037 −0.057 −0.817 −2.382 0.043
RTOG 9413 WPRT −2.066 −3.481 −2.527 0.219 −2.361 −3.778 −2.710
RTOG 9413 PORT −2.011 −3.586 −2.535 0.138 −2.187 −3.763 −2.649
RTOG 9111 −1.710 −2.768 −1.086 −0.260 −1.463 −2.524 −0.907

Table 2.

Estimated correlations from exponential and Weibull models.

exponential Weibull


PFS&OS TTP&OS Conditional
TTP&OS
PFS&OS TTP&OS Conditional
TTP&OS
RTOG 0214 0.897 0.679 0.895 0.901 0.682 0.897
RTOG 9413 WPRT 0.460 0.228 0.453 0.527 0.269 0.536
RTOG 9413 PORT 0.446 0.240 0.440 0.491 0.269 0.494
RTOG 9111 0.820 0.496 0.812 0.835 0.511 0.814

Figure 4.

Figure 4

Observed KM curve and model-estimated curves for OS based on Theorem 5.

In the second trial RTOG 9413, the LR test for the common α gives a p value of 0.6065 for both WPRT and PORT treatment arms, supporting the use of the proposed model. Also for both treatment arms, the estimated log(α) in the Weibull model is greater than 0 so the estimated log(λ)s under the two models are somewhat different and the correlations estimated from the Weibull model are greater than those from the exponential model. Figure 4 suggests that for both treatment arms the Weibull model fit better to the observed data than the exponential model. Comparing the parameter estimates for the two treatment arms, we observe a greater difference in log(λ1) than in log(λ2) or log(λ3). This suggests that the treatment has some influence on the hazard of TTP but not on the hazard of death – either before or after progression. This is consistent with the original study findings that PFS is significantly different between treatment arms while OS is not.

In the third trial RTOG 9111, the p value for the the LR test of the common α is 0.3679. The estimated log(α) in the Weibull model is less than 0, leading to different estimates of log(λ)s and correlations under the two models. In Figure 4, although both estimated curves show some departure from the observed KM curve, the estimated curve from the Weibull model is closer to the KM curve suggesting a better fit with the Weibull model.

5. Simulation

We have conducted simulations to evaluate the fit of the two models (exponential model and Weibull model), using data sets generated from various distributions. In all simulations, we considered a moderate sample size of 500 subjects. For each subject, we generated TTP, OSorig, OS’ from the assumed distributions and applied a fixed censoring time. One data set was generated for each scenario.

In the first scenario, the event times were generated from exponential distributions. More specifically, TTP was generated from exponential(λ1 = 1), OSorig was generated from exponential(λ2 = 0.5), and OS’ was generated from exponential(λ3 = 1). This led to a mean TTP of 1 year, a mean OSorig of 2 years, a mean OS’ of 1 year, and a mean OS of 0.99 years. We then applied a fixed censoring time at 1.5 years. In the data set we generated, 119 subjects progressed and were then censored without death, 190 subjects progressed and then died, 137 subjects died before progression, and 54 patients were censored without progression or death. We fitted the exponential model and the Weibull model separately to the data set and estimated MLEs. We estimated model-based correlations according to Theorems 2–4 and summarized these in Table 3. We also estimated model-based survival functions for OS according to Theorem 5, and Figure 5 (Figure 5A for this scenario) shows the observed KM curve for OS and the estimated curves from the two models. As expected, because the data were generated from exponential distributions, correlations under the two models were similar and both models fit the observed data perfectly.

Table 3.

Estimated correlations from the exponential and Weibull models, in simulated data sets from various distributions.

exponential Weibull


Data distribution PFS&OS TTP&OS Conditional
TTP&OS
PFS&OS TTP&OS Conditional
TTP&OS
Exponential 0.574 0.214 0.555 0.573 0.215 0.553
Weibull w. same shape 0.560 0.159 0.534 0486 0.022 0.648
Weibull w. diff shape 0.504 0.175 0.488 0.474 0.046 0.593
Log-logistic 0.000 −0.062 0.000 0.204 −0.137 0.041
Log-normal 0.479 0.112 0.457 0.504 0.094 0.515

Figure 5.

Figure 5

Fit of two models to data sets simulated from various distributions.

In the second scenario, the event times were generated from Weibull distributions with the same shape parameter. More specifically, TTP was generated from Weibull(α = 3, λ1 = 2), OSorig was generated from Weibull(α = 3, λ2 = 1), and OS’ was generated from Weibull(α = 3, λ3 = 2). This led to a mean TTP of 0.71 years, a mean OSorig of 0.89 years, and a mean OS’ of 0.71 years, with a fixed censoring time at 1.5 years. The final OS had a mean of 1.03 years. Figure 5B suggests that the Weibull model fit the data perfectly but the exponential model fit poorly, because the data violated the exponential distribution assumption. Table 3 suggests that the exponential model overestimated correlations between PFS and OS and between TTP and OS, but underestimated the conditional correlation between TTP and OS.

Because our Weibull model assumed the same shape parameter in distributions of the three endpoints, in the third scenario we generated the event times from Weibull distributions with different shape parameters. More specifically, TTP was generated from Weibull(α = 2, λ1 = 2), OSorig was generated from Weibull(α = 3, λ2 = 1), and OS’ was generated from Weibull(α = 4, λ3 = 2). So TTP had a mean of of 0.63 years, OSorig had a mean of 0.89 years, OS’ had a mean of 0.76 years, and finally OS had a mean of 1.05 years. The LR test of the common α assumption gave a p value of < 0.0001, as expected. However, even though the data violated the assumption of the same shape parameter, our Weibull model could still fit satisfactorily. As shown in Figure 5C, there was an adequate fit with the Weibull model but a lack of fit with the exponential model. Table 3 shows that correlations under the two models differed by −0.13 to 0.11. The correlations under the Weibull model should be more accurate given that the Weibull model fit the data better.

Moreover because the Weibull distribution assumed a monotone hazard function, we generated event times from log-logistic distributions to allow hazards with non-monotone shapes. With our notation, time T follows log-logistic (α, λ) means that the hazard function is (αtα−1λ)/(1 + λtα). In this example, TTP was generated from log-logistic(α = 3, λ1 = 2), OSorig was generated from log-logistic(α = 3, λ2 = 1), and OS’ was generated from log-logistic(α = 3, λ3 = 2). So the hazard functions of all three event times first increased and then decreased. The data then had a mean TTP of 0.96 years, a mean OSorig of 1.21 years, a mean OS’ of 0.96 years, and a mean OS of 1.00 years. Figure 5D shows that the curve based on the Weibull model was much closer to the KM curve, compared to the curve based on the exponential model. This indicated that the Weibull model provided better fit than the exponential model even if the underlying distribution was not Weibull. Table 3 shows that correlations under the two models differed by −0.08 to 0.20.

Lastly, we generated event times from log-normal distributions. TTP was generated from log-normal(μ = −0.5, σ = 1), OSorig was generated from log-normal(μ = 0.1, σ = 1), and OS’ was generated from log-normal(μ = −0.2, σ = 1). The final OS had a mean of 1.01 years. Figure 5E shows that the Weibull model fit the data somewhat better than the exponential model. Table 3 shows that correlations under the two models differed by −0.02 to 0.06.

We also used one simulation example to explore the implications of our model on power calculations. We assumed that the data consisted of two groups with a fixed follow up of 1.5 years. We assumed the data of the treatment group were from our model with TTP~Weibull(α = 3, λ1 = 2), OSorig ~Weibull(α = 3, λ2 = 1), and OS’~Weibull(α = 3, λ3 = 2), and the data of the control group were from our model with TTP~Weibull(α = 3, λ1 = 2.7), OSorig ~Weibull(α = 3, λ2 = 1.7), and OS’~Weibull(α = 3, λ3 = 2.7). This meant that the one-year survival for OS was 45% in the control group and 57% in the treatment group. Using the standard method assuming OS is exponentially distributed and neglecting any correlation between OS and PFS, the one-year survival for OS translated to a hazard rate of 0.56 and 0.80 for the two groups, respectively. Thus, we would need a total of 396 subjects (i.e., 256 events in 1.5 year follow up period) to detect such a difference with 80% power. In contrast, by generating 4000 data sets with a sample size of 396 subjects (198 per group) from the assumed Weibull model and conducting log-rank test on each data set, 94.8% of them gave a significant p value. Therefore, the power is much higher using our proposed model compared to using the standard exponential model.

6. Discussion

We have proposed a Weibull statistical model to describe the dependence between PFS and OS, which is a direct extension of the earlier work of Fleischer et al [11]. Under the model framework we derived analytical correlations among TTP, PFS and OS, and presented five theorems that are similar in structure to Fleischer et al. When generalizing from exponential to Weibull distributions, the results of the theorems become complex and not straightforward. Nevertheless, the extended model is more robust and realistic for data we often observe in various studies, as shown in our clinical trial examples.

As illustrated by Figure 1, our proposed model actually falls under the umbrella of general illness-death models. The transition intensities under this three-state model are simply the hazard rates for TTP, OSorig and OS’. So the distribution function of OS in Theorem 5 can also be derived using the techniques for multi-state models and calculating the transition probabilities from one state to another. In addition to the usual focus of multi-state models on estimating transition probabilities, our paper had another goal of estimating correlations among different survival endpoints.

The correlations among TTP, PFS and OS can be estimated using the presented Theorems 2–4. In the three clinical trials, we observed small differences in the the estimated correlations under the Weibull model and the exponential model, presumably because the estimated α was close to 1, so that the advantage of the Weibull over the exponential model was modest. In the simulation examples, the differences of the estimated correlations under the two models were moderate, in the range of −0.13 to 0.20. Theoretically, as illustrated in Figure 2 and 3, the correlations from the two models could be substantially different under some parameter values. In applications of the methods to real data, one cannot predict the magnitude of the difference before fitting the Weibull model and estimating the shape parameter α. Since the Weibull model in general is more flexible and fits data better than the exponential model, the correlations under the Weibull model should be more accurate. Naively applying an exponential model could potentially over- or under- estimate the true correlations.

In our Weibull model, we assume that the three event times share the same shape parameter α, which makes the analytical derivations of the correlations feasible. As shown in our appendix, the derivations are already complex under the current assumption, and would become untraceable if we generalized to Weibull distributions with different shape parameters. In analysis of real data, one can always fit the Weibull model with different shape parameters and conduct the LR test for common α, as we demonstrate in §4. A non-significant LR test would justify the use of our proposed Weibull model with a common α. Even in the cases that the common α assumption is violated, our Weibull model could still fit data adequately, as shown in RTOG 0214 and in the simulation example. Nevertheless, one could choose the model with three shape parameters if the common α assumption is a concern, but should keep in mind that under this model the correlations among the survival endpoints and the distribution of OS cannot be derived analytically.

Although the Weibull model provides more flexibility than the exponential model, it is possible that the Weibull model still misspecifies the underlying distribution, especially if the data are actually from a distribution with a non-monotone hazard function. However in our two simulation examples with data from log-logistic distributions and log-normal distributions, our Weibull model seems to provide an adequate fit.

The majority of the paper focuses on one-group scenarios but we explore a two-group scenario through one real trial example and one simulation example. In RTOG 9413, we fit the proposed model to the two treatment groups separately and the group-specific estimates reveal how the treatment influences the hazards of progression and death. Moreover under our proposed model although the distributions for TTP, OSorig and OS’ are Weibull which implies proportional hazard between groups, the induced OS will not have proportional hazards. So our model would be useful when the data exhibit non-proportionality features. Lastly by considering the dependence structure between PFS and OS, our Weibull model would be more powerful than the standard method assuming a simple exponential distribution for OS. This is demonstrated by the last simulation example and a thorough investigation of this issue will be our future research.

Acknowledgement

This project is funded, in part, under a grant with the Pennsylvania Department of Health. The Department specifically declaims responsibility for any analyses, interpretations or conclusions. Also, the National Institutes of Health supported the research of Yimei Li under USPHS grant P30-CA016520.

Appendix

We first note that for a Weibull distribution with parameters (λ, α),

E(T)=λ1αΓ(1+1α),E(T2)=λ2αΓ(1+2α)
Var(T)=λ2αΓ(1+2α)λ2α[Γ(1+1α)]2

Proof of Theorem 1

P(PFSt)=P(min(TTP,OSorig)t)=P(TTPt)P(OSorigt)=exp[(λ1+λ2)tα]

That is, PFS follows Weibull (α, λ1 + λ2).

p0=Prob(PFS=TTP)=P(TTP<OSorig)=y<xλ1αyα1exp(λ1yα)λ˙αxα1exp(λ2xα)dxdy=0λ1αyα1exp(λ1yα)[yλ2αxα1exp(λ2xα)dx]dy=0λ1αyα1exp(λ1yα)exp(λ2yα)dy=λ1λ1+λ20(λ1+λ2)αyα1exp[(λ1+λ2)yα]dy=λ1λ1+λ2.

Proof of Theorem 2

E(PFS·OS)=E(PFS·OS|PFS<OS)Prob(PFS<OS)+E(PFS·OS|PFS=OS)Prob(PFS=OS)=E(PFS·(PFS+OS))Prob(PFS=TTP)+E(PFS·PFS)Prob(PFSTTP)=E(PFS2+PFS·OS)p0+E(PFS2)(1p0)=E(PFS2)+E(PFS)·E(OS)p0=(λ1+λ2)2αΓ(1+2α)+(λ1+λ2)1αΓ(1+1α)·λ31αΓ(1+1α)·λ1λ1+λ2=1(λ1+λ2)2αΓ(1+2α)+λ1(λ1+λ2)(1+1α)λ31αΓ2(1+1α) (2)
E(PFS)=1(λ1+λ2)1αΓ(1+1α),(use Theorem1) (3)
E(OS)=E(OS|PFS<OS)Prob(PFS<OS)+E(OS|PFS=OS)Prob(PFS=OS)=E(PFS+OS)Prob(PFS=TTP)+E(PFS)Prob(PFSTTP)=E(PFS+OS)p0+E(PFS)(1p0)=E(PFS)+E(OS)p0=(λ1+λ2)1αΓ(1+1α)+λ31αΓ(1+1α)·λ1λ1+λ2=Γ(1+1α)[1(λ1+λ2)1α+λ1(λ1+λ2)λ31α] (4)
Cov(PFS,OS)=E(PFS·OS)E(PFS)E(OS),(use(2),(3)and(4))=1(λ1+λ2)2αΓ(1+2α)+λ1(λ1+λ2)(1+1α)λ31αΓ2(1+1α)1(λ1+λ2)1αΓ(1+1α)·Γ(1+1α)[1(λ1+λ2)1α+λ1(λ1+λ2)λ31α]=Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α (5)
Var(PFS)=1(λ1+λ2)2α[Γ(1+2α)Γ2(1+1α)],(use Theorem1) (6)
E(OS2)=E(OS2|PFS<OS)Prob(PFS<OS)+E(OS2|PFS=OS)Prob(PFS=OS)=E((PFS+OS)2)Prob(PFS=TTP)+E(PFS2)Prob(PFSTTP)=E(PFS2+OS2+2PFS·OS)p0+E(PFS2)(1p0)=E(PFS2)+E(OS2)p0+2E(PFS)E(OS)p0,(use Theorem1and Weibull distribution properties)=(λ1+λ2)2αΓ(1+2α)+λ32αΓ(1+2α)·λ1λ1+λ2+2(λ1+λ2)1αΓ(1+1α)·λ31αΓ(1+1α)·λ1λ1+λ2=Γ(1+2α)[1(λ1+λ2)2α+λ1(λ1+λ2)λ32α]+2Γ2(1+1α)λ1(λ1+λ2)(1+1α)λ31α (7)
Var(OS)=E(OS2)E2(OS),(use(7)and(4))=Γ(1+2α)[1(λ1+λ2)2α+λ1(λ1+λ2)λ32α]+2Γ2(1+1α)λ1(λ1+λ2)(1+1α)λ31α(Γ(1+1α)[1(λ1+λ2)2α+λ1(λ1+λ2)λ31α])2=Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α+λ12[Γ(1+2α)Γ2(1+1α)]+λ1λ2Γ(1+2α)(λ1+λ2)2λ32α (8)

Equations 5, 6 and 8 give

Corr(PFS,OS)=Γ(1+2α)Γ2(1+1α)Γ(1+2α)Γ2(1+1α)+λ12[Γ(1+2α)Γ2(1+1α)]+λ1λ2Γ(1+2α)(λ1+λ2)(22/α)λ32/α. (9)

Proof of Theorem 3

Var(TTP)=Γ(1+2α)Γ2(1+1α)λ12α,(use Weibull properties); (10)
E(TTP·OS,PFS=OS)=E(TTP·PFS,PFS<TTP)=E(TTP·OSorig,OSorig<TTP)=x<yxyλ2αxα1exp(λ2xα)λ1αyα1exp(λ1yα)dxdy=0xλ2αxα1exp(λ2xα)dxxyλ1αyα1exp(λ1yα)dy=0λ2αxαexp(λ2xα)dx1λ11αλ1xαt1αexp(t)dt=λ2αλ11α0xαexp(λ2xα)UIG(1+1α,λ1xα)dx (11)
E(TTP·OS)=E(TTP·OS|PFS<OS)Prob(PFS<OS)+E(TTP·OS,PFS=OS)=p0E(PFS(PFS+OS))+E(TTP·OS,PFS=OS)=p0[E(PFS2)+E(PFS)E(OS)]+E(TTP·OS,PFS=OS),(use Theorem1,Weibull properties and(11))=λ1λ1+λ2·[(λ1+λ2)2αΓ(1+2α)+(λ1+λ2)1αΓ(1+1α)·λ31αΓ(1+1α)]+λ2αλ11α0xαexp(λ2xα)UIG(1+1α,λ1xα)dx=λ1λ1+λ2[Γ(1+2α)(λ1+λ2)2α+Γ2(1+1α)(λ1+λ2)1αλ31α]+λ2αλ11α0xαexp(λ2xα)UIG(1+1α,λ1xα)dx (12)
Cov(TTP,OS)=E(TTP·OS)E(TTP)E(OS),(use Weibull property and(4))=E(TTP·OS)1λ11αΓ(1+1α)·Γ(1+1α)[1(λ1+λ2)2α+λ1(λ1+λ2)λ31α]=E(TTP·OS)Γ2(1+1α)λ11α[1(λ1+λ2)1α+λ1(λ1+λ2)λ31α]=λ1λ1+λ2[Γ(1+2α)(λ1+λ2)2α+Γ2(1+1α)(λ1+λ2)1αλ31α]+λ2αλ11α0xαexp(λ2xα)UIG(1+1α,λ1xα)dxΓ2(1+1α)λ11α[1(λ1+λ2)1α+λ1(λ1+λ2)λ31α]; (13)
Corr(TTP,OS)=Cov(TTP,OS)Var(TTP)Var(OS),(use equations(8)and(10)=Cov(TTP,OS)Γ(1+2α)Γ2(1+1α)λ12α·(Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α+λ12[Γ(1+2α)Γ2(1+1α)]+λ1λ2Γ(1+2α)(λ1+λ2)2λ32α).

Proof of Theorem 4

E(TTP·OS|TTP<OS)=E(TTP·OS|PFS=TTP)=E(PFS·(PFS+OS))=E(PFS2)+E(PFS)E(OS),(use Weibull properties)=(λ1+λ2)2αΓ(1+2α)+(λ1+λ2)1αΓ(1+1α)·λ31αΓ(1+1α)=Γ(1+2α)(λ1+λ2)2α+Γ2(1+1α)(λ1+λ2)1αλ31α; (14)
E(TTP|TTP<OS)=E(TTP|PFS=TTP)=E(PFS)=Γ(1+1α)(λ1+λ2)1α; (15)
E(OS|TTP<OS)=E(OS|PFS=TTP)=E(PFS+OS)=E(PFS)+E(OS),(use Weibull properties)=(λ1+λ2)1αΓ(1+1α)+λ31αΓ(1+1α)=Γ(1+1α)[1(λ1+λ2)1α+1λ31α]; (16)
Cov(TTP,OS|TTP<OS)=Cov(TTP,OS|PFS=TTP)=E(TTP·OS|PFS=TTP)E(TTP|PFS=TTP)E(OS|PFS=TTP),(use(14),(15),(16))=Γ(1+2α)(λ1+λ2)2α+Γ2(1+1α)(λ1+λ2)1αλ31αΓ(1+1α)(λ1+λ2)1α·Γ(1+1α)[1(λ1+λ2)1α+1λ31α]=Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α; (17)
Var(TTP|TTP<OS)=Var(TTP|PFS=TTP)=Var(PFS),(use Weibull properties)=Γ(1+2α)Γ2(1+1α)(λ1+λ2)2α; (18)
Var(OS|TTP<OS)=Var(OS|PFS=TTP)=E(OS2|PFS=TTP)E2(OS|PFS=TTP)=E(PFS+OS)2E2(PFS+OS)=E(PFS2)+2E(PFS)E(OS)+E(OS2)[E(PFS)+E(OS)]2,(use Weibull properties)=(λ1+λ2)2αΓ(1+2α)+2(λ1+λ2)1αΓ(1+1α)·λ31αΓ(1+1α)+λ32αΓ(1+2α)[(λ1+λ2)1αΓ(1+1α)+λ31αΓ(1+1α)]2=[Γ(1+2α)Γ2(1+1α)][1(λ1+λ2)2α+1λ32α]; (19)
Corr(TTP,OS|TTP<OS)=Cov(TTP,OS|TTP<OS)Var(TTP|TTP<OS)Var(OS|TTP<OS),(use(17),(18),(19))=Γ(1+2α)Γ2(1+1α)(λ1+λ2)2αΓ(1+2α)Γ2(1+1α)(λ1+λ2)2α·[Γ(1+2α)Γ2(1+1α)][1(λ1+λ2)2α+1λ32α]=λ31αλ32α+(λ1+λ2)2α. (20)

Proof of Theorem 5

Given OS≠PFS, then OS=PFS+OS’, so that

P(OS<x|OSPFS)=P(PFS+OS<x)=y+z<x(λ1+λ2)αyα1exp[(λ1+λ2)yα]λ3αzα1exp(λ3zα)dydz=0x(λ1+λ2)αyα1exp[(λ1+λ2)yα]0xyλ3αzα1exp(λ3zα)dzdy=0x(λ1+λ2)αyα1exp[(λ1+λ2)yα][1exp(λ3(xy)α)]dy=0x(λ1+λ2)αyα1exp[(λ1+λ2)yα]dy0x(λ1+λ2)αyα1exp[(λ1+λ2)yαλ3(xy)α]dy=1exp[(λ1+λ2)xα](λ1+λ2)α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy; (21)

Given OS=PFS, then OS follows Weibull(α, λ1 + λ2), so that

P(OS<x|OS=PFS)=1exp[(λ1+λ2)xα]; (22)
P(OS<x)=P(OS<x|OS=PFS)P(OS=PFS)+P(OS<x|OSPFS)P(OSPFS)=P(OS<x|OS=PFS)(1p0)+P(OS<x|OSPFS)p0,(use(22)and(21))=(1exp[(λ1+λ2)xα])(1p0)+(1exp[(λ1+λ2)xα](λ1+λ2)α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy)p0=1exp[(λ1+λ2)xα]p0(λ1+λ2)α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy,(use Theorem1)=1exp[(λ1+λ2)xα]λ1α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy; (23)

Therefore, the survival function for OS is:

SOS(x)=exp[(λ1+λ2)xα]+λ1α0xyα1exp[(λ1+λ2)yαλ3(xy)α]dy.

References

  • 1.U.S. Department of Health and Huamn Services, Food and Drug Administration, Center for Drug Evaluation and Research (CDER) Guidance for Industry: Clinical Trial Endpoints for the Approval of Cancer Drugs and Biologics. 2007 [Google Scholar]
  • 2.Verma S, Mcleod D, Batist G, Robidoux A, Martins IR, Mackey JR. In the end what matters most? A review of clinical endpoints in advanced breast cancer. The Oncologist. 2011;16:25–35. doi: 10.1634/theoncologist.2010-0278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Punt CJA, Buyse M, Kohne CH, Hohenberger P, Labianca R, Schmoll HJ, Pahlman L, Sobrero A, Douilard JY. Endpoints in adjuvant treatmetn trials: a systematic review of the literature in colon cancer and proposed definitions for future trials. J Natl Cancer Inst. 2007;99:998–1003. doi: 10.1093/jnci/djm024. [DOI] [PubMed] [Google Scholar]
  • 4.Piedbois P, Buyse M. Endpoints and surrogate endpoints in colorectal cancer: a review of recent development. Current Opinion in Oncology. 2008;20:466–471. doi: 10.1097/CCO.0b013e32830218fe. [DOI] [PubMed] [Google Scholar]
  • 5.Eskens FA, Verweij J. Clinical studies in the development of new anticancer agents exhibiting growth inhibition in models: facing the challenge of a proper study design. Critical Reviews in Oncology/Haematology. 2000;34:83–88. doi: 10.1016/s1040-8428(00)00055-x. [DOI] [PubMed] [Google Scholar]
  • 6.Stone A, Wheeler C, Barge A. Improving the design of phase II trials of cytostatic anticancer agents. Contemporary Clinical Trials. 2007;28:138–145. doi: 10.1016/j.cct.2006.05.009. [DOI] [PubMed] [Google Scholar]
  • 7.Buyse M, Burzykowski T, Carroll K, Michiels S, Sargent DJ, Miller LL, Elfring GL, Pignon JP, Piedbois P. Progression-free survival is a surrogate for survival in advanced colorectal cancer. Journal of Clinical Oncology. 2007;25:5218–5224. doi: 10.1200/JCO.2007.11.8836. [DOI] [PubMed] [Google Scholar]
  • 8.Tang AP, Bentzen SM, Chen EX, Siu LL. Surrogate end points for median overall survival in metastatic colorectal cancer: literature-based analysis from 39 randomized controlled trials of first-line chemotherapy. Contemporary Clinical Trials. 2007;28:138–145. doi: 10.1200/JCO.2006.08.1935. [DOI] [PubMed] [Google Scholar]
  • 9.Miksad RA, Zietemann V, Gothe R, Schwarzer R, Conrads-Frank A, Schnell-Inderst P, Stollenwerk B, Siebert U. Progression-free survival as a surrogate endpoint in advanced breast cancer. Int J Technol Asess Health Care. 2008;24:371–384. doi: 10.1017/S0266462308080495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Sherrill B, Amonkar M, Wu Y, Hirst C, Stein S, Walker M, Cuzick J. Relationship between effects on time-to-disease progression and overall sruvival in studies of metastatic breast cancer. Br J Cancer. 2008;99:1572–1578. doi: 10.1038/sj.bjc.6604759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Fleischer F, Gaschler-Markefski B, Bluhmki E. A statistical model for the dependence between progression-free survival and overall survival. Statistics in Medicine. 2009;28:2669–2686. doi: 10.1002/sim.3637. [DOI] [PubMed] [Google Scholar]
  • 12.Shanno DF. Conditioning of quasi-Newton methods for function minimization. Mathematics of Computation. 1970;24:647657. [Google Scholar]
  • 13.Gore EM, Bae K, Wong SJ, Sun A, Bonner JA, Schild SE, Gaspar LE, Bogart JA, Werner-Wasik M, Choy H. A phase III comparison of prophylactic cranial irradiation versus observation in patients with locally advanced non-small cell lung cancer: primary analysis of Radiation Therapy Oncology Group 0214. J Clin Oncol. 2011;29(3):272–278. doi: 10.1200/JCO.2010.29.1609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gore EM, Paulus R, Wong SJ, Sun A, Videtic G, Dutta S, Suntharalingam M, Chen Y, Gaspar LE, Choy H. A phase III comparison of prophylactic cranial irradiation versus observation in patients with locally advanced non-small cell lung cancer: An Updated Analysis of RTOG 0214. Oral Scientific Sessions. 2012;84(3S):S103. [Google Scholar]
  • 15.Roach M, DeSilvio M, Lawton C, Uhl V, Machtay M, Seider MJ, Rotman M, Jones C, Asbell SO, Valicenti RK, Han S, Thomas CR, Jr, Shipley WS. Phase III trial comparing whole-pelvic versus prostate-only radiotherapy and neoadjuvant versus adjuvant combined androgen suppression: Radiation Therapy Oncology Group 9413. J Clin Oncol. 2003;21(10):1904–1911. doi: 10.1200/JCO.2003.05.004. [DOI] [PubMed] [Google Scholar]
  • 16.Forastiere AA, Goepfert H, Maor M, Pajak TF, Weber R, Morrison W, Glisson B, Trotti A, Ridge JA, Chao C, Peters G, Lee DJ, Leaf A, Ensley J, Copper J. Concurrent chemotherapy and radiotherapy for organ preservation in advanced laryngeal cancer. N Engl J Med. 2003;349(22):2091–2098. doi: 10.1056/NEJMoa031317. [DOI] [PubMed] [Google Scholar]
  • 17.Anderson P. Statistical models based on counting process. New York: Springer-Verlag; 1993. [Google Scholar]

RESOURCES