Evaluation of Treatment Effect with Paired Failure Times in a Single-Arm Phase II Trial in Oncology

Matthieu Texier; Federico Rotolo; Michel Ducreux; Olivier Bouché; Jean-Pierre Pignon; Stefan Michiels

doi:10.1155/2018/1672176

. 2018 Jan 11;2018:1672176. doi: 10.1155/2018/1672176

Evaluation of Treatment Effect with Paired Failure Times in a Single-Arm Phase II Trial in Oncology

Matthieu Texier ¹, Federico Rotolo ^1,², Michel Ducreux ³, Olivier Bouché ⁴, Jean-Pierre Pignon ^1,², Stefan Michiels ^1,^2,^✉

PMCID: PMC5820554 PMID: 29568321

Abstract

In early phase clinical trials of cytotoxic drugs in oncology, the efficacy is typically evaluated based on the tumor shrinkage. However, this criterion is not always appropriate for more recent cytostatic agents, and alternative endpoints have been proposed. The growth modulation index (GMI), defined as the ratio between the times to progression in two successive treatment lines, has been proposed for a single-arm phase II trials. The treatment effect is evaluated by estimating the rate of patients having a GMI superior to a given threshold. To estimate this rate, we investigated a parametric method based on the distribution of the times to progression and a nonparametric one based on a midrank estimator. Through simulations, we studied their operating characteristics and the impact of different design parameters (censoring, dependence, and distribution) on them. In these simulations, the nonparametric estimator slightly underestimated the rate and had slightly overconservative confidence intervals in some cases. Conversely, the parametric estimator overestimated the rate and had anticonservative confidence intervals in some cases. The nonparametric method appeared to be more robust to censoring than the parametric one. In conclusion, we recommend the nonparametric method, but the parametric method can be used as a supplementary tool.

1. Introduction

In oncology, if a new treatment is found to be acceptably safe in a phase I clinical trial, it can be tested in a phase II trial to look for evidence of efficacy. The type of response or benefit to evaluate depends on the goals of the treatment; in advanced cancer trials, the most used endpoints are related to the change of the size of the lesion or its disappearance. Historically, the tumor shrinkage was the primary endpoint in phase II trials for cytotoxic cancer drugs. Since the 90s, cytostatic drugs, which are supposed to modulate the tumor growth without causing immediate shrinkage, are being developed. Thus, Von Hoff [1] and Mick et al. [2] advocated for rather evaluating the time to progression (TTP) as the primary endpoint in a one-stage design. Since patients being offered phase II studies of new agents have typically failed a previous regimen, then all first progressions are observed and TTP before experimental treatment, say TTP₁, is known for all the patients enrolled. Conversely, the TTP after the experimental agent, TTP₂, may or may not be censored at the time of the analysis. As the TTP is highly variable across patients and the degree of correlation between the paired failure times is a key feature, Von Hoff [1] proposed to evaluate the growth modulation index (GMI = TTP₂/TTP₁) instead, so that each patient serves as his/her own historical control. Von Hoff [1] assumed a null ratio value of 1 and that the GMI needs to be greater than 1.33 for a new regimen to be considered effective in delaying progression. Mick et al. [2] argued that because patients enter a new treatment line after a new progression, the prognosis is expected to be poorer than at the previous treatment line. Thus, because of the natural history of the disease, one expects that in general TTP₂ is shorter than TTP₁, which would indicate a null ratio value smaller than 1 and that a GMI superior to 1 is enough for considering a new regimen as effective.

Some authors have started employing the GMI as primary endpoint. At the time of writing (April 2017), there were a total of ten oncology trials registered in the European Union Clinical Trial Register and eleven oncology trials registered on the https://www.clinicaltrials.gov website as using GMI. For example, Von Hoff et al. [3] used the GMI to measure the activity of a targeted therapy selected by molecular profiling in patients having failed all effective treatments. Eighteen out of 66 patients (27%) had a progression-free survival (PFS) ratio superior to 1.33 (95% confidence interval: [17%; 38%]). Several others published trials [4–6] used a GMI-based approach to assess the activity of second-line treatments, but the estimation did not account for patients with censored times to progression. Only a recent secondary analysis of the SHIVA trial estimated the PFS ratio by Kaplan-Meier curves [7].

Before the GMI can be regularly used as primary endpoint in phase II studies, we need appropriate statistical methods and detailed knowledge of its statistical characteristics. In the present paper, we present methods to estimate the proportion of patients having a GMI greater than a given threshold by handling censored observations, we explore their operating characteristics via simulations and we show an application on a real data set. Such a motivating study in advanced colorectal cancer is presented in Section 2. Section 3 summarizes the statistical methodology to estimate the probability that the GMI is higher than a given threshold. Section 4 presents a simulation study to investigate parameters which could influence the performance of the estimators. Finally, in Section 5, we apply the presented methods to real data. Section 6 discusses the findings.

2. Motivating Example

The FFCD 2000-05 trial [8, 9] was a randomized trial conducted by the French Federation of Digestive Oncology, which included 410 patients with advanced colorectal cancer. It was a phase III trial comparing a sequential (S) arm to a combination (C) arm. Patients in arm S were treated with 5-fluorouracil and leucovorin (LV5FU2) in first line, then with FOLFOX (LV5FU2 + oxaliplatin) in second line, and then with FOLFIRI (LV5FU2 + irinotecan) in third line. Patients in arm C were treated directly with FOLFOX in first line and then with FOLFIRI in second line. The times to progression in the first, second, and third treatment lines were recorded for patients who entered each line of treatment, respectively. Such a design provided us with four separate scenarios in which the effect of the treatment between each couple of lines can be estimated (Figure 1). We considered line 2 versus line 1 in arm C (FOLFOX versus FOLFIRI) as representative of a phase II framework. Then, we compared results to those obtained considering line 3 versus line 2 in arm S (FOLFOX versus FOLFIRI, again), which contrasts the same drugs, despite the fact that patients had been treated previously by LV5FU2 alone.

Single-arm scenarios based on the FFCD 2000-05 trial.

3. Methods

3.1. Dependence between TTP₁ and TTP₂

The time to progression (TTP) is likely to be linked to general characteristics of each patient, whatever the treatment line. Because TTP₁ and TTP₂ share these common factors, Von Hoff [1] expected that the growth modulation index (GMI) is a less heterogeneous endpoint, as some of the variability of TTP₂ may be captured through TTP₁. Therefore, the correlation between successive times to progression could play a key role in determining the performance of the GMI as clinical endpoint. Mick et al. [2] showed, through simulations, that reasonable power for a trial was only attainable for moderate to strong correlation between consecutive times to progression.

As the dependence between TTP₁ and TTP₂ is due to some underlying factors shared by the two time-to-event variables, it can be modeled in a very natural way via shared frailty models [10]. The shared frailty model is an extension of the proportional hazards model in which an unobservable random quantity, called the frailty term, acts multiplicatively on the baseline hazard functions of the time variables. This term accounts for intrapatient correlation. The frailty model is defined in terms of the conditional hazard:

\begin{matrix} h_{j i} (t ∣ u_{i}) = h_{j 0} (t) u_{i} \exp (x_{j i}^{T} β_{j}), \end{matrix}

(1)

for patient i ∈ {1,…, n} at treatment line j ∈ {1,2}, and where h_j0(t) is the treatment line-specific baseline hazard function, u_i the frailty term for the patient i, x_ji the vector of his/her covariates in the jth treatment line, and β_j the vector of regression coefficients. In a gamma frailty model, the frailty term is a random variable with probability density function:

\begin{matrix} f (u) = \frac{θ^{- 1 / θ} u^{1 / θ - 1} \exp (- u / θ)}{Γ (1 / θ)}, \end{matrix}

(2)

where Γ(·) is the gamma function. This distribution corresponds to a gamma distribution with mean and variance equal to 1 and θ. Shared frailty models allow estimating the intrapatient dependence via Kendall's τ, which is a rank correlation measure of the concordance between time pairs. In the case of a gamma frailty model, Kendall's τ is equal to θ/(θ + 2) and can thus be estimated by plugging in the estimate of θ. Different distributions can be assumed for the baseline hazard [11]; we chose a Weibull distribution because it was the one which fitted the best our advanced colorectal data. We fitted and compared the parametric frailty models using the parfm package in R [11].

3.2. Growth Modulation Index TTP₂/TTP₁

If we consider a study in which patients enter after having a first progression, the time to progression at prior therapy (TTP₁) is always observed by design. After a first progression, the experimental treatment is administered. Contrary to TTP₁, the time to progression with the new therapy (TTP₂) can be right-censored. In that case, also the growth modulation index GMI = TTP₂/TTP₁ [1] is right-censored. As this ratio is a nonnegative and possibly right-censored random variable, it can be treated as a time-to-event variable [12]. Therefore, the statistic of interest,

\begin{matrix} S_{G M I} (δ) = P [\frac{{T T P}_{2}}{{T T P}_{1}} > δ], δ \geq 0, \end{matrix}

(3)

can be handled as the survival probability of a time-to-event random variable at a given time point δ. For a given threshold δ, we define a patient as “responder” if his/her GMI is greater than δ and “nonresponder” otherwise. Since, in advanced cancer patients, successive TTPs tend to be shorter and shorter [13], GMI ≥ 1 should be considered as a sign of drug activity, which is less conservative than the threshold δ = 1.33 proposed by Von Hoff [1]. In what follows, we describe two methods, a parametric and a nonparametric one, to estimate S_GMI(δ) for any choice of δ.

3.2.1. Nonparametric Method

The nonparametric approach, inspired by the Wilcoxon rank sum test, consists in using the ranks of each pair (TTP₁, TTP₂) to estimate S_GMI(δ). Due to censoring, the ranks of some observations are unknown but can be estimated by midranks. Midranks are computed according to the procedure proposed by Hudgens and Satten [14] which can be summarized as follows.

For each patient i = 1,…, n, the pair of times (TTP_1i; TTP_2i) is observed. Each time TTP_ji (j = 1,2) is decomposed into an interval, denoted [L_ji; R_ji]. The left bound is always fixed to L_ji = TTP_ji. If TTP_ji is observed (which is always the case for j = 1) then R_ji = TTP_ji. If TTP_ji is right-censored (which is only possible for j = 2), then R_2i = ∞. The midranks are computed using the minimum and the maximum ranks of the interval bounds associated with each TTP_ji as follows. Given TTP_ji, the minimum rank is the rank of L_ji among the 2n pooled R_j's:

\begin{matrix} {m i n}_{j i} : R_{j (1)} \leq R_{j (2)} \leq \dots \leq R_{j ({m i n}_{i} - 1)} \leq L_{j i} \leq R_{j ({m i n}_{i})} \\ \leq \dots \leq R_{j (2 n)} . \end{matrix}

(4)

The maximum rank is the rank of R_ji among the 2n pooled L_ji's:

\begin{matrix} {m a x}_{j i} : L_{j (1)} \leq L_{j (2)} \leq \dots \leq L_{j ({m a x}_{i})} \leq R_{j i} \leq L_{j ({m a x}_{i} + 1)} \\ \leq \dots \leq L_{j (2 n)} . \end{matrix}

(5)

Now, the midrank M_ji is the midpoint of the minimum and the maximum rank:

\begin{matrix} M_{j i} = \frac{{m i n}_{j i} + {m a x}_{j i}}{2} . \end{matrix}

(6)

To estimate S_GMI(δ), we replace TTP_1i with TTP′_1i = δTTP_1i and compute the midranks M′_1i of TTP′_1i and M_2i of TTP_2i to obtain the n pairs of midranks (M′_1i; M_2i). Finally, the estimate of the probability of interest is as follows:

\begin{matrix} {\hat{S}}_{G M I} (δ) = \frac{1}{n} \sum_{i = 1}^{n} I (M_{2 i} \geq {M^{'}}_{1 i}), \end{matrix}

(7)

with I(·) being the indicator function which takes value 1 if its argument is true and 0 otherwise.

3.2.2. Parametric Method

In this approach, a parametric probability distribution is assumed for the GMI, so that the probability of interest can be easily derived as a function of the estimated distribution parameters. Let us assume that, conditionally on a frailty term u_i, TTP₁ and TTP₂ have Weibull marginal distributions W(a; b₁u_i) and W(a; b₂u_i) with a common shape parameter a:

\begin{matrix} f_{j} (x; a, b_{j} ∣ u_{i}) \\ = a {(u_{i} b_{j})}^{- a} x^{a - 1} \exp \{{- {[\frac{x}{(u_{i} b_{j})}]}^{}}^{a}\} . \end{matrix}

(8)

Then, Owen [15] showed that the ratio TTP₂/TTP₁ follows a log-logistic distribution,

\begin{matrix} f (δ; a, κ) = a κ^{a} δ^{a - 1} {(1 + {(δ κ)}^{a})}^{- 2}, δ \geq 0, \end{matrix}

(9)

with κ = b₁/b₂, which does no longer depend on the shared frailty u_i.

By using this distribution, we can obtain maximum likelihood estimates of the distribution parameters and directly derive the probability of interest from the survival function:

\begin{matrix} S (δ; \hat{α}, \hat{κ}) = {(1 + {(δ \hat{κ})}^{\hat{a}})}^{- 1} . \end{matrix}

(10)

Parameter estimates were computed using the survreg function in the R package survival.

R code of the two methods is available for download on https://github.com/Oncostat/TTPratio.

4. Simulation Study

4.1. Simulation Design

We designed a simulation study to evaluate the influence of the design parameters on the two estimators of S_GMI(δ). We varied (i) the dependence between the two successive times to progression via Kendall's τ, (ii) the shape a of the distribution of TTP_j, (iii) the relative effect e of the second-line treatment as compared to the first-line treatment, and (iv) the censoring rate r for TTP₂.

4.1.1. Data Generation

First, for given values of the parameters of interest, we generated a frailty term u_i for each patient using random values from a gamma distribution with density given in Section 3. Due to the link between τ and θ, for a given τ, we could fix θ = 2τ/(1 − τ). Three values of τ were used in our simulation: 0.1, 0.2, and 0.3.

Then, we generated times to first and second progressions from Weibull distribution with density:

\begin{matrix} f_{j} (x; a, b_{j} ∣ u_{i}) \\ = a {(u_{i} b_{j})}^{- a} x^{a - 1} \exp \{{- {[\frac{x}{(u_{i} b_{j})}]}^{}}^{a}\}, j = 1,2 . \end{matrix}

(11)

For the shape parameter a, common to the two distributions, we considered three values: 0.5, 1, and 2. A shape of a = 0.5 represents a metastatic disease with a median of TTP₁ greater than 15 months, whereas a shape of a = 2 corresponds to a more aggressive disease (median of TTP₁ close to 6 months).

The scale parameter was different for the two distributions: b₂ = b₁∗e, where e is the median of TTP₂/TTP₁. We considered three values for e: 0.77, representing inactivity of the second-line treatment; 1, representing an equivalence of the two treatments; and 1.33, representing efficacy according to the definition of Von Hoff [1].

Independent and noninformative censoring was introduced by taking the minimum between TTP₂ and a random uniform variable. Desired censoring rates (10% and 40%) were obtained by controlling the support of the uniform distribution.

We performed 10,000 simulations for each of the 54 scenarios defined by a, e, τ, and a censoring rate. The statistical properties of the parametric and nonparametric estimators were evaluated in terms of the mean bias, the average standard error, and the empirical standard error, the latter being defined as the standard deviation of the 10,000 estimates.

4.2. Results

The results of the simulations are summarized in Figure 2 (see Supplementary Tables A1–A6 for detailed results). The nonparametric method underestimated the probability of interest in 51/54 scenarios, but the mean bias was low in general, ranging across scenarios from −0.062 to 0.001 (median: −0.006). On the contrary, the parametric method always overestimated the probability of interest, but the mean bias was low as well, ranging across scenarios from 0.009 to 0.082 (median: 0.028). With a censoring rate of 10% and considering all scenarios, the nonparametric estimator was slightly less biased than the parametric estimator (median absolute bias: 0.003 versus 0.014): the absolute bias of the nonparametric estimator was at most 0.011 and the bias of the parametric estimator was at most 0.018. The bias of the parametric estimator increased with increasing censoring rate; across all scenarios with censoring rate of 40%, its median absolute bias was 0.069. The nonparametric estimator was more robust to censoring with a median absolute bias of 0.018 for 40% of censoring.

Probability ${\hat{S}}_{G M I} (δ = 1)$ of GMI being greater than 1 estimated in the simulation study via the parametric (black) and nonparametric (red) methods. Normally approximate 95% confidence intervals using the empirical standard error.

Both estimators were robust to changes in dependence, shape parameter a, and treatment effect e. Considering all scenarios, the average (over the 10,000 replicates) of the estimated standard error (ASE) via the nonparametric method was greater than or equal to the empirical standard error (ESE). This suggests that the nonparametric confidence intervals are more conservative than their nominal level. For the parametric estimator, on the contrary, when we considered second-line treatment inactivity (median GMI = 0.77) and 40% of censoring, the ASE was smaller than the ESE. This means that parametric confidence intervals are too liberal under the null hypothesis.

5. Application to the FFCD 2000-05 Trial

In this section, we illustrate the presented methodology to the data of the FFCD2000-05 trial (see Section 2 and Figure 1). As discussed previously, we will consider situations 1 and 4 only, in which the same couple of treatments are contrasted. The ratio TTP₂/TTP₁ could be evaluated on 129 patients in situation 1. The ratio TTP₃/TTP₂ could be evaluated on 92 patients in situation 4. A total of 15 patients (12%) had their TTP₂ censored in situation 1 and 13 patients (14%) had their TTP₃ censored in situation 4.

5.1. Dependence between TTP1 and TTP2

As discussed in Section 3, we estimated Kendall's τ by modeling the risks of progression via shared frailty models. Weibull distributions were assumed for the baseline hazard functions. The use of a gamma distribution for the frailty term was justified by a preliminary study comparing the Akaike Information Criterion (AIC) of the model with gamma and inverse Gaussian frailty distributions. The positive stable frailty distribution was considered too, but it was also discarded due to the lack of numerical convergence. In all four situations, the model with gamma distribution had the smallest AIC.

In situation 1, the estimated Kendall's τ was 0.195, a relatively low correlation. In situation 4, that is FOLFOX versus FOLFIRI again, but after a first line with LV5FU2, the estimated Kendall's τ was slightly higher: 0.225. Even weaker dependence was estimated for situation 3 (τ = 0.152) and situation 2 (τ = 0.142). Overall, these values fell in between the first and second values of τ considered in our simulations: 0.1 and 0.2.

5.2. Estimation of ${\hat{S}}_{G M I} (δ)$

To apply the parametric estimation method for S_GMI(1) described in Section 3, we assumed Weibull distributions of times to progression with common shape parameter. This assumption was needed in order to assume a log-logistic distribution for their GMI. Thus, we fitted the Kaplan-Meier estimates of the GMI and compared them to the maximum likelihood log-logistic survival curves to informally check the appropriateness of the parametric assumption. Figure 3 shows, for situations 1 and 4, the Kaplan-Meier estimates of the GMI with the estimated log-logistic survival curves. This distribution seems to fit quite well to the data.

Survival function estimate of the growth modulation index (situation 1 in (a); situation 4 in (b)) via the Kaplan-Meier method and via a log-logistic distribution. The gray area is the 95% confidence band for the Kaplan-Meier estimate.

In situation 1, the estimated probability that the GMI ≥1 was ${\hat{S}}_{G M I} (1)$ = 0.21 with the parametric estimator (95% Confidence interval: [0.14; 0.29]) and ${\hat{S}}_{G M I} (1)$ = 0.24 with the nonparametric estimator (95% CI: [0.17; 0.31]). In situation 4, comparing the same two treatments after an LV5FU2 line, the estimated probability was 0.24 (95% CI: [0.15; 0.33]) with the parametric estimator and 0.27 (95% CI: [0.18; 0.36]) with the nonparametric estimator. These results suggest that the sequence “FOLFOX in first line/FOLFIRI in second line” leads to a shortened time to progression: FOLFIRI's activity in second line seems inferior to FOLFOX's activity in first line.

Table 1 shows the different estimations for the other situations, too. The activity of FOLFOX in second line seems to be comparable to the activity of LV5FU2 in first line for patients in the arm S.

Table 1.

Estimation of S_GMI(δ = 1) = P(GMI > 1) for the four situations in the FFCD 2000-05 trial.

	Treatment		N	Events	Estimator
	Line 1	Line 2	N	Events	Parametric	Nonparametric
Arm C
Situation 1	FOLFOX	FOLFIRI	129	114	0.21 [0.14; 0.29]	0.24 [0.17; 0.31]
Situation 3	FOLFIRI	Investigator	74	59	0.52 [0.41; 0.63]	0.54 [0.43; 0.65]
Arm S
Situation 2	LV5FU2	FOLFOX	152	122	0.54 [0.46; 0.62]	0.48 [0.40; 0.56]
Situation 4	FOLFOX	FOLFIRI	92	79	0.24 [0.15; 0.33]	0.27 [0.18; 0.36]

Open in a new tab

6. Discussion

The growth modulation index (GMI) is more and more used to evaluate the treatment effect in single-arm phase II trials. An increasing number of clinical trials employ the GMI and the European Medicine Agency (EMA), in its “Guideline on Evaluation of Anticancer Medicinal Products in Man,” admits its utilization for a comparison between two successive therapies [16]. By choosing an adequate threshold δ (0.77, 1, or 1.33), the estimated probability of interest S_GMI(δ) is a practical measure of the proportion of patients for whom two successive lines of treatment are ineffective, equivalent, or effective.

In this article, we evaluated two ways to estimate S_GMI(δ) and we investigated how the design parameters had an impact on these estimators. The censoring rate had an impact on the parametric and nonparametric estimators, respectively. In our simulations, the nonparametric method was more robust to high censoring rates, but the average bias was small in any case. Thus, the use of this method in phase II studies could represent substantial time savings for the analysis when the disease in question progresses slowly over time. Von Hoff [1] showed the key role of dependence between the paired times to progression, but in our study this parameter did not have a noticeable impact neither on the bias nor on the empirical standard error. The few published clinical trials that used the GMI as a criterion of activity reported a rather low correlation of the paired time to progression. However, in some of them, such a low correlation may be due to the heterogeneity of the first-line treatment (different nature of chemotherapy) or to the localization of the tumor. In Penel et al. [17], for instance, the analysis did not account for the heterogeneity of the subtypes of sarcoma. Further studies are needed to detect the influence of cancer localization on the different design parameters. To date, it is not well known in which cancer types the intrapatient correlation is the strongest.

There are practical limitations to the use of GMI in a phase II study. The collection of PFS or TTP measurements for each patient has to be very precise and homogeneous between patients and, if the case, between centers. The frequency of the follow-up evaluations affects the estimation of TTP and PFS [18]. This issue should be considered carefully in the design and the conduct of a trial employing this endpoint.

In clinical practice, patients can interrupt the first-line treatment for many reasons such like toxicity occurrence. In that case, they can enter the second line without a progression, causing TTP₁ being censored. For these patients, TTP₂/TTP₁ is left censored (the GMI is unknown but an upper bound is known) and inferential methods can be adapted to that situation. Nevertheless, if both TTP₁ and TTP₂ are censored, neither an upper nor a lower bound is known and the observation is noninformative. However, one could argue that phase II studies using GMI as the primary endpoint should enroll only patients who have failed previous treatment and thus exclude cases where TTP₁ is censored. A third approach would be to consider also treatment interruptions due to toxicity as events in a treatment-failure perspective. Eventually, the most appropriate approach will depend on clinical considerations about whether the new treatment is intended for patients recurring only, or for any interruptions of the previous treatment, whatever the cause.

In our simulations, nonparametric and parametric methods, when biased, had biases in opposite directions. We recommend using the nonparametric method to estimate the proportion of patients having a GMI superior to a threshold because it is more conservative. Nevertheless, the parametric method can more easily deal with interval censoring, which is an inherent issue with progression-free survival data [19]. Consequently, the parametric method can be used as a supplementary tool.

Acknowledgments

The authors thank the FFCD-2000-05 trial investigators for their participation.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Supplementary Materials

Online-only Supplementary Materials detail the numerical results for each scenario of the simulation study. Mean bias, average standard error, and empirical standard error of nonparametric and parametric estimation are presented in Tables A1–A6. Table A1: bias, average standard error, and empirical standard error of the nonparametric estimator of the probability in S_GMI(δ = 1) with equivalent treatments (median (GMI) = 1). Table A2: bias, average standard error, and empirical standard error of the parametric estimator of the probability S_GMI(δ = 1) with equivalent treatments (median (GMI) = 1). Table A3: bias, average standard error, and empirical standard error of the nonparametric estimator of the probability S_GMI(δ = 1) with an inactive second-line treatment (median (GMI) = 0.77). Table A4: bias, average standard error, and empirical standard error of the parametric estimator of the probability S_GMI(δ = 1) with an inactive second-line treatment (median (GMI) = 0.77). Table A5: bias, average standard error, and empirical standard error of the nonparametric estimator of the probability S_GMI(δ = 1) with an active second-line treatment (median (GMI) = 1.33). Table A6: bias, average standard error, and empirical standard error of the parametric estimator of the probability S_GMI(δ = 1) with an active second-line treatment (median (GMI) = 1.33).

Click here for additional data file.^{(26.1KB, docx)}

References

1.Von Hoff D. D. There are no bad anticancer agents, only bad clinical trial designs- twenty-first Richard and Hinda Rosenthal foundation award lecture. Clinical Cancer Research. 1998;4(5):1079–1086. [PubMed] [Google Scholar]
2.Mick R., Crowley J. J., Carroll R. J. Phase II clinical trial design for noncytotoxic anticancer agents for which time to disease progression is the primary endpoint. Controlled Clinical Trials. 2000;21(4):343–359. doi: 10.1016/s0197-2456(00)00058-1. [DOI] [PubMed] [Google Scholar]
3.Von Hoff D. D., Stephenson J. J., Jr., Rosen P., et al. Pilot study using molecular profiling of patients' tumors to find potential targets and select treatments for their refractory cancers. Journal of Clinical Oncology. 2010;28(33):4877–4883. doi: 10.1200/jco.2009.26.5983. [DOI] [PubMed] [Google Scholar]
4.Massard C., Michiels S., Ferté C., et al. High-Throughput Genomics and Clinical Outcome in Hard-to-Treat Advanced Cancers: Results of the MOSCATO 01 Trial. Cancer Discovery. 2017;7(6):586–595. doi: 10.1158/2159-8290.CD-16-1396. [DOI] [PubMed] [Google Scholar]
5.Schwaederle M., Parker B. A., Schwab R. B., et al. Precision oncology: The UC San Diego moores cancer center predict experience. Molecular Cancer Therapeutics. 2016;15(4):743–752. doi: 10.1158/1535-7163.MCT-15-0795. [DOI] [PubMed] [Google Scholar]
6.Bonetti A., Zaninelli M., Leone R., et al. Use of the ratio of time to progression following first- and second-line therapy to document the activity of the combination of oxaliplatin with 5-fluorouracil in the treatment of colorectal carcinoma. Annals of Oncology. 2001;12(2):187–191. doi: 10.1023/a:1008354909478. [DOI] [PubMed] [Google Scholar]
7.Belin L., Kamal M., Mauborgne C., et al. Randomized phase II trial comparing molecularly targeted therapy based on tumor molecular profiling versus conventional therapy in patients with refractory cancer: Cross-over analysis from the SHIVA trial. Annals of Oncology. 2017;28:592–596. doi: 10.1093/annonc/mdw666. [DOI] [PubMed] [Google Scholar]
8.Ducreux M., Malka D., Mendiboure J., et al. Sequential versus combination chemotherapy for the treatment of advanced colorectal cancer (FFCD 2000-05): An open-label, randomised, phase 3 trial. The Lancet Oncology. 2011;12(11):1032–1044. doi: 10.1016/S1470-2045(11)70199-1. [DOI] [PubMed] [Google Scholar]
9.Pénichoux J., Michiels S., Bouché O., et al. Taking into account successive treatment lines in the analysis of a colorectal cancer randomised trial. European Journal of Cancer. 2013;49(8):1882–1888. doi: 10.1016/j.ejca.2013.02.006. [DOI] [PubMed] [Google Scholar]
10.Duchateau L., Janssen P. The frailty model. New York, NY, USA: Springer; 2008. [Google Scholar]
11.Munda M., Rotolo F., Legrand C. parfm: parametric frailty models in R. Journal of Statistical Software. 2012;51(11) doi: 10.18637/jss.v051.i11. [DOI] [Google Scholar]
12.Kovalchik S., Mietlowski W. Statistical methods for a phase II oncology trial with a growth modulation index (GMI) endpoint. Contemporary Clinical Trials. 2011;32(1):99–107. doi: 10.1016/j.cct.2010.09.010. [DOI] [PubMed] [Google Scholar]
13.Dufresne A., Pivot X., Tournigand C., et al. Impact of chemotherapy beyond the first line in patients with metastatic breast cancer. Breast Cancer Research and Treatment. 2008;107(2):275–279. doi: 10.1007/s10549-007-9550-7. [DOI] [PubMed] [Google Scholar]
14.Hudgens M. G., Satten G. A. Midrank unification of rank tests for exact, tied, and censored data. Journal of Nonparametric Statistics. 2002;14(5):569–581. doi: 10.1080/10485250213905. [DOI] [Google Scholar]
15.Owen W. J. A power analysis of tests for paired lifetime data. Lifetime Data Analysis. 2005;11(2):233–243. doi: 10.1007/s10985-004-0385-9. [DOI] [PubMed] [Google Scholar]
16.Agency EM. Guideline on the evaluation of anticancer medicinal products in man. European Medicines Agency. 33: 44; 2012. [Google Scholar]
17.Penel N., Demetri G. D., Blay J. Y., et al. Growth modulation index as metric of clinical benefit assessment among advanced soft tissue sarcoma patients receiving trabectedin as a salvage therapy. Annals of Oncology. 2013;24(2):537–542. doi: 10.1093/annonc/mds470.mds470 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Bhattacharya S., Fyfe G., Gray R. J., Sargent D. J. Role of sensitivity analyses in assessing progression-free survival in late-stage oncology trials. Journal of Clinical Oncology. 2009;27(35):5958–5964. doi: 10.1200/JCO.2009.22.4329. [DOI] [PubMed] [Google Scholar]
19.Panageas K. S., Ben-Porat L., Dickler M. N., Chapman P. B., Schrag D. When you look matters: The effect of assessment schedule on progression-free survival. Journal of the National Cancer Institute. 2007;99(6):428–432. doi: 10.1093/jnci/djk091. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Click here for additional data file.^{(26.1KB, docx)}

[B1] 1.Von Hoff D. D. There are no bad anticancer agents, only bad clinical trial designs- twenty-first Richard and Hinda Rosenthal foundation award lecture. Clinical Cancer Research. 1998;4(5):1079–1086. [PubMed] [Google Scholar]

[B2] 2.Mick R., Crowley J. J., Carroll R. J. Phase II clinical trial design for noncytotoxic anticancer agents for which time to disease progression is the primary endpoint. Controlled Clinical Trials. 2000;21(4):343–359. doi: 10.1016/s0197-2456(00)00058-1. [DOI] [PubMed] [Google Scholar]

[B3] 3.Von Hoff D. D., Stephenson J. J., Jr., Rosen P., et al. Pilot study using molecular profiling of patients' tumors to find potential targets and select treatments for their refractory cancers. Journal of Clinical Oncology. 2010;28(33):4877–4883. doi: 10.1200/jco.2009.26.5983. [DOI] [PubMed] [Google Scholar]

[B4] 4.Massard C., Michiels S., Ferté C., et al. High-Throughput Genomics and Clinical Outcome in Hard-to-Treat Advanced Cancers: Results of the MOSCATO 01 Trial. Cancer Discovery. 2017;7(6):586–595. doi: 10.1158/2159-8290.CD-16-1396. [DOI] [PubMed] [Google Scholar]

[B5] 5.Schwaederle M., Parker B. A., Schwab R. B., et al. Precision oncology: The UC San Diego moores cancer center predict experience. Molecular Cancer Therapeutics. 2016;15(4):743–752. doi: 10.1158/1535-7163.MCT-15-0795. [DOI] [PubMed] [Google Scholar]

[B6] 6.Bonetti A., Zaninelli M., Leone R., et al. Use of the ratio of time to progression following first- and second-line therapy to document the activity of the combination of oxaliplatin with 5-fluorouracil in the treatment of colorectal carcinoma. Annals of Oncology. 2001;12(2):187–191. doi: 10.1023/a:1008354909478. [DOI] [PubMed] [Google Scholar]

[B7] 7.Belin L., Kamal M., Mauborgne C., et al. Randomized phase II trial comparing molecularly targeted therapy based on tumor molecular profiling versus conventional therapy in patients with refractory cancer: Cross-over analysis from the SHIVA trial. Annals of Oncology. 2017;28:592–596. doi: 10.1093/annonc/mdw666. [DOI] [PubMed] [Google Scholar]

[B8] 8.Ducreux M., Malka D., Mendiboure J., et al. Sequential versus combination chemotherapy for the treatment of advanced colorectal cancer (FFCD 2000-05): An open-label, randomised, phase 3 trial. The Lancet Oncology. 2011;12(11):1032–1044. doi: 10.1016/S1470-2045(11)70199-1. [DOI] [PubMed] [Google Scholar]

[B9] 9.Pénichoux J., Michiels S., Bouché O., et al. Taking into account successive treatment lines in the analysis of a colorectal cancer randomised trial. European Journal of Cancer. 2013;49(8):1882–1888. doi: 10.1016/j.ejca.2013.02.006. [DOI] [PubMed] [Google Scholar]

[B10] 10.Duchateau L., Janssen P. The frailty model. New York, NY, USA: Springer; 2008. [Google Scholar]

[B11] 11.Munda M., Rotolo F., Legrand C. parfm: parametric frailty models in R. Journal of Statistical Software. 2012;51(11) doi: 10.18637/jss.v051.i11. [DOI] [Google Scholar]

[B12] 12.Kovalchik S., Mietlowski W. Statistical methods for a phase II oncology trial with a growth modulation index (GMI) endpoint. Contemporary Clinical Trials. 2011;32(1):99–107. doi: 10.1016/j.cct.2010.09.010. [DOI] [PubMed] [Google Scholar]

[B13] 13.Dufresne A., Pivot X., Tournigand C., et al. Impact of chemotherapy beyond the first line in patients with metastatic breast cancer. Breast Cancer Research and Treatment. 2008;107(2):275–279. doi: 10.1007/s10549-007-9550-7. [DOI] [PubMed] [Google Scholar]

[B14] 14.Hudgens M. G., Satten G. A. Midrank unification of rank tests for exact, tied, and censored data. Journal of Nonparametric Statistics. 2002;14(5):569–581. doi: 10.1080/10485250213905. [DOI] [Google Scholar]

[B15] 15.Owen W. J. A power analysis of tests for paired lifetime data. Lifetime Data Analysis. 2005;11(2):233–243. doi: 10.1007/s10985-004-0385-9. [DOI] [PubMed] [Google Scholar]

[B16] 16.Agency EM. Guideline on the evaluation of anticancer medicinal products in man. European Medicines Agency. 33: 44; 2012. [Google Scholar]

[B17] 17.Penel N., Demetri G. D., Blay J. Y., et al. Growth modulation index as metric of clinical benefit assessment among advanced soft tissue sarcoma patients receiving trabectedin as a salvage therapy. Annals of Oncology. 2013;24(2):537–542. doi: 10.1093/annonc/mds470.mds470 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18.Bhattacharya S., Fyfe G., Gray R. J., Sargent D. J. Role of sensitivity analyses in assessing progression-free survival in late-stage oncology trials. Journal of Clinical Oncology. 2009;27(35):5958–5964. doi: 10.1200/JCO.2009.22.4329. [DOI] [PubMed] [Google Scholar]

[B19] 19.Panageas K. S., Ben-Porat L., Dickler M. N., Chapman P. B., Schrag D. When you look matters: The effect of assessment schedule on progression-free survival. Journal of the National Cancer Institute. 2007;99(6):428–432. doi: 10.1093/jnci/djk091. [DOI] [PubMed] [Google Scholar]

PERMALINK

Evaluation of Treatment Effect with Paired Failure Times in a Single-Arm Phase II Trial in Oncology

Matthieu Texier

Federico Rotolo

Michel Ducreux

Olivier Bouché

Jean-Pierre Pignon

Stefan Michiels

Abstract

1. Introduction

2. Motivating Example

Figure 1.

3. Methods

3.1. Dependence between TTP₁ and TTP₂

3.2. Growth Modulation Index TTP₂/TTP₁

3.2.1. Nonparametric Method

3.2.2. Parametric Method

4. Simulation Study

4.1. Simulation Design

4.1.1. Data Generation

4.2. Results

Figure 2.

5. Application to the FFCD 2000-05 Trial

5.1. Dependence between TTP1 and TTP2

5.2. Estimation of ${\hat{S}}_{G M I} (δ)$

Figure 3.

Table 1.

6. Discussion

Acknowledgments

Conflicts of Interest

Supplementary Materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Evaluation of Treatment Effect with Paired Failure Times in a Single-Arm Phase II Trial in Oncology

Matthieu Texier

Federico Rotolo

Michel Ducreux

Olivier Bouché

Jean-Pierre Pignon

Stefan Michiels

Abstract

1. Introduction

2. Motivating Example

Figure 1.

3. Methods

3.1. Dependence between TTP1 and TTP2

3.2. Growth Modulation Index TTP2/TTP1

3.2.1. Nonparametric Method

3.2.2. Parametric Method

4. Simulation Study

4.1. Simulation Design

4.1.1. Data Generation

4.2. Results

Figure 2.

5. Application to the FFCD 2000-05 Trial

5.1. Dependence between TTP1 and TTP2

5.2. Estimation of S^GMI(δ)

Figure 3.

Table 1.

6. Discussion

Acknowledgments

Conflicts of Interest

Supplementary Materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3.1. Dependence between TTP₁ and TTP₂

3.2. Growth Modulation Index TTP₂/TTP₁

5.2. Estimation of ${\hat{S}}_{G M I} (δ)$