Summary
We present a two-stage phase I/II design of a combination of two drugs in cancer clinical trials. The goal is to estimate safe dose combination regions with a desired level of efficacy. In stage I, conditional escalation with overdose control is used to allocate dose combinations to successive cohorts of patients and the maximum tolerated dose curve is estimated as a function of Bayes estimates of the model parameters. In stage II, we propose a Bayesian adaptive design for conducting the phase II trial to determine dose combination regions along the MTD curve with a desired level of efficacy. The methodology is evaluated by extensive simulations and application to a real trial.
Keywords: Cancer Phase I/II trials, Maximum tolerated dose, Escalation with overdose control, Drug combination, Bayesian adaptive design, Cubic Splines
1. Introduction
The primary objective of cancer phase I/II clinical trials is to determine a tolerable dose level that maximizes treatment efficacy. For treatments where efficacy is ascertained in a relatively short period of time such as one or two cycles of therapy, sequential designs for updating the joint probability of toxicity and efficacy and estimating the optimal dose have been studied extensively in the literature for single agent trials, see e.g. Murtaugh and Fisher (1990); Thall and Russell (1998); Braun (2002); Ivanova (2003); Thall and Cook (2004); Chen et al. (2015); Sato et al. (2016). These methods are designed to identify a safe dose that maximizes the probability of treatment response or a pre-defined utility function. More recently, these designs have been extended to dose combination trials where the dose levels of two drugs are allowed to vary during the trial and explored for toxicity and efficacy (Yuan and Yin (2011); Wages and Conaway (2014); Cai et al. (2014), Riviere et al. (2015)) or to determine the optimal treatment regimen Wages et al. (2016). These methods accommodate cytotoxic and biologic drugs and are appropriate for short term efficacy endpoints to allow sequential updating of the probabilities of toxicity and efficacy.
For treatments where response evaluation takes few cycles of therapy, it is standard practice to perform a two-stage design where a maximum tolerable dose (MTD) of a new drug or combinations of drugs is first determined, then this recommended phase II dose is studied in stage II and evaluated for treatment efficacy, possibly using a different population of cancer patients from stage I, see Rogatko et al. (2008); Le Tourneau et al. (2009); Chen et al. (2012) for a review of such a paradigm. The two-stage design is routinely used in early phase cancer trials investigating a single agent or combinations of two or more drugs except that in stage II, a single dose level is investigated. For drug combination phase I trials where the dose levels of two or more drugs are allowed to vary during the trial, more than one MTD can be recommended at the conclusion of the trial, e.g. Thall et al. (2003); Wang and Ivanova (2005); Braun and Wang (2010); Tighiouart et al. (2014); Mander and Sweeting (2015); Wages (2016); Tighiouart et al. (2017). Choosing a single MTD combination for efficacy study may result in a failed phase II trial since other MTDs may present higher treatment efficacy. Hence, adaptive or parallel phase II trials may be more suitable for searching an optimal dose combination that is well tolerable with desired level of efficacy as mentioned by Thall et al. (2003).
In this article, we present a two-stage design for drug combination trials with continuous dose levels when treatment efficacy is evaluated after three or more cycles of therapy. Examples of early phase trials that use continuous dose levels can be found in Cheng et al. (2004); Borghaei et al. (2009). In the first stage, the method of Tighiouart et al. (2017) is used to estimated the MTD curve. In stage II, a Bayesian adaptive design is proposed to allocate patients to drug combinations along the MTD curve obtained from stage I with the goal of determining a dose with maximum probability of treatment efficacy. To accommodate various functional forms of the efficacy curve, a flexible family of cubic splines is used to model the dose-response relationship. Adaptive randomization is used sequentially after the efficacy status of each cohort of patients is resolved and the efficacy curve is updated in order to minimize the number of patients treated at sub-therapeutic dose combinations. At the end of the trial, a dose combination with maximum probability of response a posteriori is selected for further phase III studies. The method is illustrated by an application to a phase I/II trial of the combination of cisplatin and cabazitaxel (ciscab) in patients with advanced prostate cancer. A previous phase I trial with dose expansion cohort explored three dose combinations of these agents and declared the combination cabazitaxel/cisplatin of 15/75 mg/m2 to be the MTD although no DLT was observed at this dose in the phase I part of the study. Based on safety and efficacy data from the phase I stud and expansion cohort, we hypothesize that a series of tolerable and active dose combinations exist.
The manuscript is organized as follows. In Section 2, we review the stage I design described in Tighiouart et al. (2017) for estimating the MTD curve. Modeling the efficacy curve along the MTD curve and the adaptive design in stage II of the trial are described in Section 3 along with operating characteristics to evaluate the performance of the phase II trial. In Section 4, we apply this methodology to the ciscab trial described above with the goal of finding a tolerable dose combination with maximum benefit rate. Some concluding remarks and a discussion are found in Section 5.
2. Stage I
2.1. Model
Consider the dose-toxicity model of the form
| (1) |
where T is the indicator of DLT, T = 1 if a patient given the dose combination (x, y) exhibits DLT within one cycle of therapy, and T = 0 otherwise, x ∈ [Xmin,Xmax] is the dose level of agent A1, y ∈ [Ymin, Ymax] is the dose level of agent A2, and F is a known cumulative distribution function. Here, Xmin,Xmax and Ymin, Ymax are the lower and upper bounds of the continuous dose levels of agents A1 and A2, respectively. Suppose that the doses of agents A1 and A2 are continuous and standardized to be in the interval [0, 1] using the transformations h1(x) = (x − Xmin)/(Xmax − Xmin), h2(y) = (y − Ymin)/(Ymax − Ymin) and the interaction parameter η3 > 0.
We will assume that that the probability of DLT increases with the dose of any one of the agents when the other one is held constant. A necessary and sufficient condition for this property to hold is to assume η1, η2 > 0. The MTD is defined as any dose combination (x*, y*) such that
| (2) |
The target probability of DLT θ is set relatively high when the DLT is a reversible or non-fatal condition, and low when it is life threatening. We reparameterize model (1) in terms of parameters clinicians can easily interpret. One way is to use ρ10, the probability of DLT when the levels of drugs A1 and A2 are 1 and 0, respectively, ρ01, the probability of DLT when the levels of drugs A1 and A2 are 0 and 1, respectively, and ρ00, the probability of DLT when the levels of drugs A1 and A2 are both 0. It can then be shown that the MTD is
| (3) |
Following the work of Tighiouart et al. (2017), we assume that ρ01, ρ10, η3 are independent a priori with ρ01 ~ beta(a1, b1), ρ10 ~ beta(a2, b2), and conditional on (ρ01, ρ10), ρ00/min(ρ01, ρ10) ~ beta(a3, b3). The prior distribution on the interaction parameter η3 is a gamma with mean a/b and variance a/b2. If Dk = {(xi, yi, Ti)} is the data after enrolling k patients to the trial, the posterior distribution of the model parameters is
where
| (4) |
2.2. Trial Design
Dose escalation/de-escalation proceeds by treating cohorts of two patients simultaneously. It is based on the escalation with overdose control (EWOC) principle where at each stage of the trial, the posterior probability of overdosing a future patient is bounded by a feasibility bound α, see e.g. Babb et al. (1998); Tighiouart et al. (2005); Tighiouart and Rogatko (2010); Tighiouart and Rogatko (2012). For a given cohort, one subject receives a new dose of agent A1 for a given dose of agent A2 that was previously assigned and the other patient receives a new dose of agent A2 for a given dose of agent A1 that was previously assigned. The algorithm is described in Tighiouart et al. (2017) and is reviewed here for convenience.
The first two patients receive the same dose combination (x1, y1) = (x2, y2) = (0, 0) and let D2 = {(x1, y1, T1), (x2, y2, T2)}.
In the second cohort, patients 3 and 4 receive doses (x3, y3) and (x4, y4), respectively, where y3 = y1, x4 = x2, x3 is the α-th percentile of π(ΓA1|A2=y1|D2), and y4 is the α-th percentile of π(ΓA2|A1=x2|D2). Here, π(ΓA1|A2=y1|D2) is the posterior distribution of the MTD of drug A1 given that the level of drug A2 is y1, given the data D2.
-
In the i-th cohort of two patients, if i is even, then patient (2i − 1) receives dose (x2i−1, y2i−3), patient 2i receives dose (x2i−2, y2i), where . Similar dose allocations can be derived when i is odd. Here, denotes the inverse cdf of the posterior distribution π(ΓA1|A2=y|D).
Repeat step (iii) until N patients are enrolled to the trial subject subject to the following stopping rule.
Stopping Rule
We stop enrollment to the trial if P(P(DLT|(x, y) = (0, 0)) > θ + ξ1|data) > ξ2, i.e. if the posterior probability that the probability of DLT at the minimum available dose combination in the trial exceeds the target probability of DLT is high. The design parameters ξ1 and ξ2 are chosen to achieve desirable model operating characteristics.
At the end of the trial, we estimate the MTD curve using (3) as
| (5) |
where ρ̂00, ρ̂01, ρ̂10, η̂ are the posterior medians given the data DN.
Uncertainty about the estimated MTD curve is evaluated by the pointwise average bias and percent selection described in Tighiouart et al. (2017) when assessing the operating characteristics of the design. Specifically, for i = 1, …, m, let Ci be the estimated MTD curve and Ctrue be the true MTD curve, where m is the number of simulated trials. For every point (x, y) ∈ Ctrue, let
| (6) |
where y′ is such that (x, y′) ∈ Ci. This is the minimum relative distance of the point (x, y) on the true MTD curve to the estimated MTD curve Ci. Let
| (7) |
Equation (7) can be interpreted as the pointwise average bias in estimating the MTD.
Let Δ(x, y) be the Euclidean distance between the minimum dose combination (0, 0) and the point (x, y) on the true MTD curve and 0 < p < 1. Let
| (8) |
This is the pointwise percent of trials for which the minimum distance of the point (x, y) on the true MTD curve to the estimated MTD curve Ci is no more than (100×p)% of the true MTD. This statistic is equivalent to drawing a circle with center (x, y) on the true MTD curve and radius pΔ(x, y) and calculating the percent of trials with MTD curve estimate Ci falling inside the circle. This will give us the percent of trials with MTD recommendation within (100×p)% of the true MTD for a given tolerance p. This is interpreted as the pointwise percent selection for a given tolerance p. In this manuscript, we present the operating characteristics of stage I in the context of the ciscab trial in Section 4.
3. Stage II
Let Cest be the estimated MTD curve obtained from stage I and suppose it is defined for x ∈ [X1,X2] and y ∈ [Y1, Y2]. Here, [X1,X2] ⊂ [Xmin,Xmax] and [Y1, Y2] ⊂ [Ymin, Ymax]. Let E be the indicator of treatment response such as tumor shrinkage, E = 1 if we have a positive response after a pre-defined number of treatment cycles, and E = 0 otherwise. Let p0 be the probability of efficacy of the standard of care treatment. We propose to carry out a phase II study to identify dose combinations (x, y) ∈ Cest such that P(E = 1|(x, y)) > p0.
3.1. Model
For every dose combination (x, y) ∈ Cest, let x be the unique vertical projection of (x, y) on the interval [X1,X2]. Next, denote by z ∈ [0, 1] the standardized dose of x ∈ [X1,X2] using the transformation z = h3(x) = (x − X1)/(X2 − X1). In the sequel, we will refer to z as dose combination since there is a one-to-one transformation mapping z ∈ [0, 1] to (x, y) ∈ Cest, x ∈ [X1,X2], y ∈ [Y1, Y2]. This process is further illustrated in the supplement using the CisCab trial example described in Section 4. We model the probability of treatment response given dose combination z in Cest as
| (9) |
where F is a known link function, f(z;ψ) is an unknown function and ψ is an unknown parameter. A flexible way to model the probability of efficacy along the MTD curve is the cubic spline function
| (10) |
where ψ = (β, κ), β = (β0, …, βk), κ = (κ3, …, κk) with κ3 = 0. Let Dm = {(zi,Ei), i = 1, …, m} be the data after enrolling m patients in the trial, where Ei is the response of the i-th patient treated with dose combination zi and let π(ψ) be a prior density on the parameter ψ. The posterior distribution is
| (11) |
Let pz be the probability of treatment efficacy at dose combination z and denote by p0 the probability of efficacy of a poor treatment or treatment not worthy of further investigation. We propose an adaptive design to conduct a phase II trial in order to test the hypothesis
H0: pz ≤ p0 for all z versus H1: pz > p0 for some z.
3.2. Trial Design
Randomly assign n1 patients to dose combinations z1, …, zn1 equally spaced along the MTD curve Cest so that each combination is assigned to one and only one patient.
Obtain a Bayes estimate ψ̂ of ψ given the data Dn1 using (11).
Generate n2 dose combinations from the standardized density F(f(z; ψ̂)) and assign them to the next cohort of n2 patients.
Repeat steps (ii) and (iii) until a total of n patients have been enrolled to the trial subject to pre-specified stopping rules.
This algorithm can be viewed as an extension of a Bayesian adaptive design to select a superior arm among a finite number of arms Berry et al. (2011) to selecting a superior arm from an infinite number of arms.
Decision Rule
At the end of the trial, we accept the alternative hypothesis if
| (12) |
where δu is a design parameter.
Stopping Rules
For ethical considerations and to avoid exposing patients to sub-therapeutic doses, we stop the trial for futility after j patients are evaluable for efficacy if there is strong evidence that none of the dose combinations are promising, i.e. Maxz[P(F(f(z;ψ)) > p0|Dj)] < δ0 where δ0 is a small pre-specified threshold. In cases where the investigator is interested in stopping the trial early for superiority, the trial can be terminated after j patients are evaluable for efficacy if Maxz[P(F(f(z;ψ)) > p0|Dj)] > δ1 where δ1 ≥ δu is a pre-specified threshold and the corresponding dose combination z* = argmaxu{P(F(f(u;ψ)) > p0|Dj)} is selected for future randomized phase II or III studies.
4. Application to the CisCab Trial
The methodology described in Sections 2 and 3 was used to design a phase I/II trial of the combination cisplatin and cabazitaxel in patients with prostate cancer with visceral metastasis. A recently published phase I trial of this combination by Lockhart et al. (2014) identified the MTD of cabazitaxel/cisplatin as 15/75 mg/m2. This trial used a “3+3” design exploring 3 pre-specified dose levels 15/75, 20/75, 25/75. In part 1 of the trial, 9 patients we evaluated for safety and no DLT was observed at 15/75 mg/m2. In part 2 of the study, 15 patients were treated at 15/75 mg/m2 and 2 DLTs were observed. Based on these results and other preliminary efficacy data, we hypothesize that there exists a series of active dose combinations which are tolerable and active in prostate cancer. Cabazitaxel dose levels will be selected in the interval [10, 25] and cisplatin dose levels were selected in the interval [50, 100] administered intravenously. For stage I, we plan to enroll N = 30 patients and estimate the MTD curve and in stage II, n = 30 patients will be enrolled to identify dose combinations along the MTD curve with maximum clinical benefit rate. Clinical benefit is defined as either a complete response, partial response, or stable disease within three cycles of treatment. The probability of a poor clinical benefit is p0 = 0.15 and we expect that a tolerable dose combination achieves a clinical benefit rate of p = 0.4.
4.1. Stage I
For stage I, trial design proceeds as described in Section 2.2 with a target probability of DLT θ = 0.33 and a logistic link function for F(·) in (1). DLT is resolved within one cycle (3 weeks) of treatment. The starting dose combination for the first cohort of two patients is 15 mg/m2 cabazitaxel and 75 mg/m2 cisplatin. The prior distributions were calibrated so that the prior mean probability of DLT at the dose combination 15/75 mg/m2 equals the target probability of DLT. Specifically, informative priors were used for the model parameters ρ01, ρ10 ~ beta(1.4, 5.6), and conditional on ρ01, ρ10, ρ00/min(ρ01, ρ10) ~ beta(0.8, 7.2) and a vague prior for η3 was used as in Tighiouart et al. (2017) so that E(P(DLT|(15; 75))) ≈ 0.33 a priori. We derived the operating characteristics of stage I by simulating 2000 trial replicates as in Tighiouart et al. (2017) under four scenarios for the true MTD curve. Due to space consideration, we only present the estimated MTD curves from two scenarios A and B that are expected by the principal investigator of the trial. Figure 1 shows the true and estimated MTD curve obtained obtained using (3) with the parameters ρ00, ρ01, ρ10, η3 replaced by their posterior median averaged across all 2000 simulated trials. Scenario A shown on the left panel of Figure 1 is a case where the true MTD curve passes through a point very close to the dose combination (15, 75) identified as the MTD from the previous trial. Scenario B shown on the right panel is a case where the MTD curve is way above this dose combination. In each case, the estimated MTD curves are very close to the true MTD curves. This is also evidenced by the pointwise bias and percent selection (graphs included in the supplement). The trial was also safe since the percent of trials with DLT rate above θ + 0.1 were 3.5% for the scenario on the left and 5.0% for the scenario on the right.
Fig. 1.

True and estimated MTD curve under two different scenarios for the MTD curve. The grey diamonds represent the last dose combination from each simulated trial along with a 90% confidence region.
4.2. Stage II
4.2.1. Simulation Set-up and Scenarios
For stage II, we present simulations based on six scenarios that include three situations favoring the alternative hypothesis and three instances supporting the null hypothesis. A logistic link function F(u) = (1+exp(u))−1 is used in (9) and f(z;ψ) is modeled as a cubic spline function with two knots in (0,1). This is a very flexible class of efficacy curves and accommodates cases of constant probability of efficacy along the MTD curve, high probability of efficacy around the middle of the MTD curve and high probability of efficacy at one or both edges of the MTD curve. Vague priors are placed on the model parameters by assuming that β ~ 𝒩(0, σ2I6) with σ2 = 104 and (κ4, κ5) ~ Unif{(u, v): 0 ≤ u < v ≤ 1}. It can be shown that the induced prior mean and variance of the probability of treatment response are Eprior(F(f(z;ψ))) ≈ 0.5 and Varprior(F(f(z;ψ))) ≈ 0.25 for all dose combinations z ∈ [0, 1]. The trial sample size is n = 30, the initial number of patients enrolled to the trial was set to n1 = 10, and n2 = 5 was used in the adaptive randomization phase of the design. The design parameter for the decision rule in (12) was taken as δu = 0.8. In each scenario, we simulated M = 2000 trial replicates. The true probability of response curves under scenarios (a,b,c) are shown in blue in Figure 2. The black horizontal lines correspond to the probability of a poor treatment response p0 = 0.15 and the green horizontal lines represent the target probability of response p = 0.4. Scenario (a) is a case where the probability of efficacy is maximized near the middle of the estimated MTD curve with dose combinations in the interval (0.03, 0.76) having probability of efficacy greater than p0 = 0.15. The target probability of response is achieved at a single dose combination z = 0.42. Scenario (b) is a case where higher doses of cisplatin and lower doses of cabazitaxel achieve higher efficacy. Specifically, standardized dose combinations in the interval (0.00, 0.49) have probability of efficacy greater or equal to p0 = 0.15. Scenario (c) is an unusual situation where the probability of efficacy is maximized at the edges of the MTD curve. In this case, dose combinations in the interval (0.00, 0.41) ∪ (0.90, 1.00) have probability of efficacy greater or equal to p0 = 0.15. Corresponding to these scenarios are situations (d–f) favoring the null hypothesis shown in Figure 2 (d,e,f). The true probability of response curves shown in blue have been shifted downward so that the probability of response equals to p0 at one dose combination only for scenarios (d), (e), and (f).
Fig. 2.
True and estimated efficacy curve under six scenarios favoring the null and alternative hypotheses.
4.2.2. Operating Characteristics
For each scenario favoring the alternative hypothesis, we estimate the Bayesian power as
| (13) |
where P(F(f(z;ψi)) > p0|Dn,i) is estimated using an MCMC sample of ψi,
| (14) |
where ψi,j, j = 1, …, L is an MCMC sample from the i-th trial. For scenarios favoring the null hypothesis, (13) is the estimated Bayesian type I error probability. The optimal or target dose from the i-th trial is
| (15) |
We also report the estimated efficacy curve by replacing ψ in (9) by the average posterior medians across all simulated trials
| (16) |
where ψ̄ = (β̄, κ̄), , l = 0, …, 5, , l = 4,5 and β̂i,l, κ̂i,k are the posterior medians from the i-th trial. Finally, we also report the mean posterior probability of declaring the treatment as efficacious for all dose combination z as
| (17) |
4.2.3. Results
The estimated efficacy curves shown in black dashed-lines in Figure 2 computed using equation 16 are fairly close to the true probability of efficacy curve in all scenarios except for scenario (c) near the lower edge of the MTD curve. The mean posterior probability of efficacy curve shown in red dashed-line computed using equation 17 is 80% or more at dose combinations where the true probability of efficacy is maximized for scenarios (a,b) adn close to 80% for scenario (c). Similar conclusions can be drawn for scenarios favoring the null hypothesis where the maximum of the mean posterior probability of efficacy is less than 50%. Figure 3 is the estimated density of the target dose z* defined in equation 15 under scenarios favoring the alternative hypothesis (ac) and the shaded region corresponds to dose combinations with corresponding true probability of efficacy greater than p0 = 0.15. The mode of these densities are close to the target doses. Moreover, the estimated probabilities of selecting a dose with true probability of efficacy greater than p0 = 0.15 varies between 0.90 and 0.96 across the three scenarios. The Bayesian power for scenarios (a–c) and type I error probability for scenarios (d–f) estimated using equation 13 using a threshold δu = 0.8 are reported in Table 1. Power varies between 0.81 and 0.92 and the type I error probability varies between 0.10 and 0.19. The coverage probability in the last column of Table 1 is the estimated probabilities of selecting a dose with true probability of efficacy greater than p0 = 0.15.
Fig. 3.
Estimated density of the target dose combination under three scenarios favoring the alternative hypothesis.
Table 1.
Bayesian power, type I error and coverage probabilities.
| Scenarios | Power | Scenarios | Prob(Type I error) | Coverage Prob. |
|---|---|---|---|---|
| (a) | 0.896 | (d) | 0.100 | 0.964 |
| (b) | 0.921 | (e) | 0.190 | 0.897 |
| (c) | 0.810 | (f) | 0.143 | 0.937 |
We carried out a sensitivity analysis with respect to the initial number of patients enrolled to the trial n1 and the cohort size used in the adaptive randomization phase of the design n2. Table 2 gives the power, type I error and coverage probabilities as n1 varies between 5 and 20 and n2 varies between 3 and 7. For scenario (a), power is about 2% higher when using n2 = 3, 7 relative to the other three cases but the type I error (scenario (d)) increases significantly. Under scenario (b), there is a 3% drop in power when n1 = 20 relative to the case n1 = 10, n2 = 5. The type I error under scenario (e) is fairly constant. For scenario (c), an even higher drop in power of 6% is observed when n1 = 20 relative to the case n1 = 10, n2 = 5 with no significant changes in the type I error probabilities under scenario (f). In all cases, the coverage probabilities are not affected by n1 and n2. These findings show that the effects of n1 and n2 on the power depend on the scenarios under study, at least for a final sample size n = 30. The drop in power of 6% under scenario (c) suggests that in practice, the initial number of patients n1 to be randomized to dose combinations equally spaced along the estimated MTD curve be selected between 1/4 and 1/3 of the total sample size in stage II. For binary efficacy outcomes, it is well known that outcome adaptive randomization is not suitable for fast accruing trials or for trials with a longer time τ to resolve efficacy outcome, e.g., response at 6 months or one year disease free survival. Otherwise, we recommend that the number of patients n2 to use for the adaptive phase of the trial to be around the number of patients expected to be accrued within the interval [0, τ]. For this trial, the PI expects to enroll about 2 patients per month. Since the length of a cycle is 3 weeks and clinical benefit is resolved withing 3 cycles of therapy, n2 should be selected between 4 and 5 patients. It is important to note that these guidelines do depend on p0, the effect size, and total trial sample size and the investigators have to evaluate the corresponding operating characteristics taking into account the accrual rate and length of the trial.
Table 2.
Sensitivity analysis for n1 and n2.
| Scenario | n1 | n2 | Power | Scenario | Prob(Type I error) | Coverage Prob. |
|---|---|---|---|---|---|---|
| (a) | 10 | 5 | 0.896 | (d) | 0.100 | 0.964 |
| 9 | 3 | 0.916 | 0.170 | 0.946 | ||
| 9 | 7 | 0.915 | 0.159 | 0.944 | ||
| 5 | 5 | 0.898 | 0.091 | 0.954 | ||
| 20 | 5 | 0.906 | 0.119 | 0.979 | ||
|
| ||||||
| (b) | 10 | 5 | 0.921 | (e) | 0.190 | 0.897 |
| 9 | 3 | 0.931 | 0.183 | 0.899 | ||
| 9 | 7 | 0.930 | 0.183 | 0.881 | ||
| 5 | 5 | 0.94 | 0.183 | 0.892 | ||
| 20 | 5 | 0.887 | 0.189 | 0.904 | ||
|
| ||||||
| (c) | 10 | 5 | 0.810 | (f) | 0.143 | 0.937 |
| 9 | 3 | 0.810 | 0.150 | 0.934 | ||
| 9 | 7 | 0.825 | 0.136 | 0.929 | ||
| 5 | 5 | 0.861 | 0.135 | 0.959 | ||
| 20 | 5 | 0.750 | 0.144 | 0.916 | ||
Finally, we assessed the uncertainty about the estimated MTD curve in stage II in terms of safety of the phase II trial as determined by first cycle DLT under scenario A shown in Figure 1 (A). To estimate how likely a dose used in the phase II trial is toxic taking into account the pointwise percent selection given in (8), for each dose xi,j ∈ (0, 1) from the i-th simulated trial allocated to the j-th patient using the algorithm in Section 3.2, i = 1, …, M, j = 1, …, n, let (xi,j, yi,j) be the dose combination on the true MTD curve shown in Figure 1 (A). Next, generate a dose combination ( ) uniformly inside the circle with center (xi,j, yi,j) and radius pΔ(xi,j, yi,j), where the tolerance probability p and distance Δ(x, y) are defined in Section 2.2. A first cycle DTL Ti,j is then generated from model (1) with (η0, η1, η2, η3) replaced by the true values defining scenario A and . Safety of the phase II trial is summarized by reporting the average DLT rate and the percent of trials with an excessive DLT rate, i.e., a DLT rate exceeding θ + 0.1. These statistics are reported in Table 3 under the six scenarios and various values of n1 and n2 when the tolerance probability is p = 0.05. As expected, the mean first cycle DLT rate is essentially equal to the target probability of DLT θ = 0.33. The percent of trials with excessive DLT rate varies between 16% and 19% when n1 = 10, n2 = 5. These rates are not constant due to the dependence of the radius of the circle on the location of the dose combination on the MTD curve. We conclude that the phase II trial is safe with respect to first cycle DLT rate. Since treatment in the ciscab consists of three cycles of therapy, a separate stopping rule for safety was included in the clinical protocol. Specifically, let θ* be the true probability that a DLT occurs within three cycles of therapy. The trial will be stopped for safety if P(θ* > 0.43|data) > 0.9. A Bayesian continuous monitoring was used by evaluating this rule after every 6-th patient is evaluable for DLT with a uniform prior on θ*. Other simulations testing the same hypotheses showed similar results. We conclude that the design has good operating characteristic in identifying tolerable dose combinations with maximum benefit rate. The trial design has recently been approved by the scientific review committee of Cedars-Sinai Medical Center and is scheduled to open enrollment. Operating characteristics of the phase II trial design under a smaller effect size and different values for the probability of a poor treatment response p0 can be found in the supplement.
Table 3.
Safety assessment on the uncertainty of the MTD curve.
| Scenario | n1 | n2 | Ave. DLT rate | % Trials: DLT rate > θ + 0.1 |
|---|---|---|---|---|
| (a) | 10 | 5 | 34.0 | 19.0 |
| 9 | 3 | 34.0 | 19.0 | |
| 9 | 7 | 33.0 | 19.0 | |
| 5 | 5 | 34.0 | 18.0 | |
| 20 | 5 | 34.0 | 19.0 | |
|
| ||||
| (b) | 10 | 5 | 33.0 | 16.0 |
| 9 | 3 | 33.0 | 17.0 | |
| 9 | 7 | 33.0 | 16.0 | |
| 5 | 5 | 33.0 | 17.0 | |
| 20 | 5 | 33.0 | 18.0 | |
|
| ||||
| (c) | 10 | 5 | 34.0 | 18.0 |
| 9 | 3 | 33.0 | 17.0 | |
| 9 | 7 | 34.0 | 18.0 | |
| 5 | 5 | 34.0 | 18.0 | |
| 20 | 5 | 34.0 | 18.0 | |
|
| ||||
| (d) | 10 | 5 | 33.0 | 17.0 |
| 9 | 3 | 33.0 | 16.0 | |
| 9 | 7 | 33.0 | 18.0 | |
| 5 | 5 | 33.0 | 18.0 | |
| 20 | 5 | 33.0 | 18.0 | |
|
| ||||
| (e) | 10 | 5 | 34.0 | 19.0 |
| 9 | 3 | 33.0 | 19.0 | |
| 9 | 7 | 33.0 | 17.0 | |
| 5 | 5 | 33.0 | 16.0 | |
| 20 | 5 | 34.0 | 17.0 | |
|
| ||||
| (f) | 10 | 5 | 34.0 | 17.0 |
| 9 | 3 | 34.0 | 16.0 | |
| 9 | 7 | 33.0 | 17.0 | |
| 5 | 5 | 33.0 | 19.0 | |
| 20 | 5 | 34.0 | 20.0 | |
5. Discussion
We described a two-stage Bayesian adaptive design for cancer phase I clinical trials using two drugs with continuous dose levels. The goal is to (1) estimate the MTD curve in the two-dimensional Cartesian plane and (2) search for dose combination regions along the MTD curve that yield a desired probability of treatment response. Design of the phase I trial and estimation of the MTD curve in stage I was carried out using EWOC (Tighiouart et al. (2014, 2017)). In the context of the ciscab trial, we showed that good operating characteristics are obtained using informative prior distributions and sample size of 30 patients. In stage II, we modeled treatment efficacy as a binary indicator of treatment response using a cubic spline form of the dose combination-treatment response relationship. This is a very flexible form for the efficacy curve since it accommodates cases of constant probability of efficacy along the MTD curve, high probability of efficacy around the middle of the MTD curve, high probability of efficacy at the edges of the MTD curve. In this stage, a Bayesian adaptive design is proposed to conduct a phase II trial with the goal of identifying dose combination regions that yield a desired probability of treatment response. Initially, a number of patients are treated with dose combinations equally spaced along the estimated MTD curve from stage I and after resolving their treatment response status, the estimated probability of efficacy curve is updated. A small number of patients are then allocated to dose combinations generated from this updated efficacy curve. The trial continues until we reach the final sample size. This design can be viewed as an extension of the Bayesian adaptive design comparing a finite number of arms (Berry et al. (2011)) to comparing an infinite number of arms. In particular, if the dose levels of the two agents are discrete, then methods such as the ones described in Thall et al. (2003); Wang and Ivanova (2005); Wages (2016) can be used to identify a set of MTDs in stage I and the trial in stage II can be done using adaptive randomization to select the most efficacious dose. Unlike phase I/II designs that use toxicity and efficacy data simultaneously and require a short period of time to resolve efficacy status, the use of a two-stage design is sometimes necessary in practice if it takes few cycles of therapy to resolve treatment efficacy or if the populations of patients in phase I and II are different. In fact, for the ciscab trial described in Section 4, efficacy is resolved after three cycles (9 weeks) of treatment and patients in stage I must have metastatic, castration-resistant prostate cancer whereas patients in stage II must have visceral metastasis.
We studied the properties of this design under various scenarios for the true probability of efficacy as a function of dose combinations and we found that the method yields high power and small type I error probability using a sample size of 40 patients with a target effect size of 0.2. For the ciscab trial, a sample size of 30 patients with effect size of 0.25 yield reasonable operating characteristics. Based on these scenarios and proposed models, we conclude that this two-stage design is feasible with sample size ranging between 60 and 80 patients. The uncertainty of the estimated MTD curve in stage I is not taken into account in stage II of the design in the sense that the MTD curve is not updated as a result of observing DLTs in Stage 2. This is a limitation of this approach since patients in stage II may come from a different population and may have different treatment susceptibility relative to patients in stage I. This problem is also inherent to single agent two-stage designs where the MTD from the phase I trial is used in phase II studies and safety is monitored continuously during this phase. An alternative design would account for first, second, and third cycle DLT in addition to efficacy outcome at each cycle. In addition, the nature of DLT (reversible vs. non-reversible) should be taken into account since patients with a reversible DLT are usually treated for that side effect and kept in the trial with dose reduction in subsequent cycles. For single agent trial and a discrete set of doses, Lee et al. (2015) introduced such designs that account for cycles one and two toxicities and efficacy. The authors noted the increase in model complexity if the number of cycles is increased to three and this would ultimately require a more parsimonious model. For drug combinations with continuous dose levels and three cycles of therapy, another layer of model complexity will be introduced and such designs are beyond the scope of this manuscript and are subjects of future research. For the ciscab trial, a separate stopping rule using Bayesian continuous monitoring for excessive toxicity is included in the clinical protocol. We further emphasize the fact that phase I/II designs such as the ones described in Wages and Conaway (2014); Cai et al. (2014) or Wages et al. (2016) require a short period of time to resolve efficacy endpoint and may not be appropriate in the setting of the ciscab trial. Finally, we note that our proposed method is applicable to any combination of cytotoxic and biologic agents since the models used in the phase I part of the study still assume monotonicity of the probability of DLT of either agent for fixed level of the other agent. We plan to extend this work to time to event endpoints such as progression free survival and to accommodate patients baseline characteristics such as biomarker expression that is known to be a target of one or both drugs.
Supplementary Material
Acknowledgments
We thank the reviewers and associate editor for their thorough review and valuable comments which helped improve the quality of the manuscript. This work is supported in part by the National Institute of Health Grant Number R01 CA188480-01A1, the National Center for Research Resources, Grant UL1RR033176, and is now at the National Center for Advancing Translational Sciences, Grant UL1TR000124, and P01 CA098912.
References
- Babb J, Rogatko A, Zacks S. Cancer phase I clinical trials: efficient dose escalation with overdose control. Statistics in Medicine. 1998;17:1103–1120. doi: 10.1002/(sici)1097-0258(19980530)17:10<1103::aid-sim793>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
- Berry S, Carlin B, Lee J, Muller P. Bayesian Adaptive Methods for Clinical Trials. Boca Raton, FL: Chapman & Hall; 2011. [Google Scholar]
- Borghaei H, Alpaugh K, Hedlund G, Forsberg G, Langer C, Rogatko A, Hawkins R, Dueland S, Lassen U, Cohen R. Phase I dose escalation, pharmacokinetic and pharmacodynamic study of naptumomab estafenatox alone in patients with advanced cancer and with docetaxel in patients with advanced non-small-cell lung cancer. Journal of Clinical Oncology. 2009;27:4116–4123. doi: 10.1200/JCO.2008.20.2515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Braun T. The bivariate continual reassessment method: extending the CRM to phase I trials of two competing outcomes. Controlled Clinical Trials. 2002;23:240–256. doi: 10.1016/s0197-2456(01)00205-7. [DOI] [PubMed] [Google Scholar]
- Braun TM, Wang SF. A hierarchical Bayesian design for phase I trials of novel combinations of cancer therapeutic agents. Biometrics. 2010;66:805–812. doi: 10.1111/j.1541-0420.2009.01363.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai C, Yuan Y, Ji Y. A Bayesian dose finding design for oncology clinical trials of combinational biological agents. Applied Statistics. 2014;63:159–173. doi: 10.1111/rssc.12039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen Z, Yuan Y, Li Z, Kutner M, Owonikoko T, Curran W, Khuri F, Kowalski J. Dose escalation with over-dose and under-dose controls in phase I/II clinical trials. Contemporary Clinical Trials. 2015;43:133–141. doi: 10.1016/j.cct.2015.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen Z, Zhao Y, Cui Y, Kowalski J. Methodology and application of adaptive and sequential approaches in contemporary clinical trials. Journal of Probability and Statistics. 2012;2012:20. [Google Scholar]
- Cheng JD, Babb JS, Langer C, Aamdal S, Robert F, Engelhardt LR, Fernberg O, Schiller J, Forsberg G, Alpaugh RK, Weiner LM, Rogatko A. Individualized patient dosing in phase I clinical trials: The role of EWOC in PNU-214936. J Clin Oncol. 2004;22:602–609. doi: 10.1200/JCO.2004.12.034. [DOI] [PubMed] [Google Scholar]
- Ivanova A. A new dose-finding design for bivariate outcomes. Biometrics. 2003;59:1001–1007. doi: 10.1111/j.0006-341x.2003.00115.x. [DOI] [PubMed] [Google Scholar]
- Le Tourneau C, Lee JJ, Siu LL. Dose escalation methods in phase I cancer clinical trials. Journal of the National Cancer Institute. 2009;101:708–720. doi: 10.1093/jnci/djp079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J, Thall P, Ji Y, Müller P. Bayesian dose-finding in two treatment cycles based on the joint utility of efficacy and toxicity. Journal of the American Statistical Association. 2015;110:711–722. doi: 10.1080/01621459.2014.926815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lockhart A, Sundaram S, Sarantopoulos J, Mita M, Wang-Gillam A, Moseley J, Barber S, Lane A, Wack C, Kassalow L, JFD, Mita A. Phase I dose-escalation study of cabazitaxel administered in combination with cisplatin in patients with advanced solid tumors. Investigational New Drugs. 2014;32:1236–1245. doi: 10.1007/s10637-014-0145-y. [DOI] [PubMed] [Google Scholar]
- Mander A, Sweeting M. A product of independent beta probabilities dose escalation design for dual-agent phase I trials. Statistics in Medicine. 2015;34:1261–1276. doi: 10.1002/sim.6434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murtaugh P, Fisher L. Bivariate binary models of efficacy and toxicity in dose-ranging trials. Communications in Statistics Theory and Methods. 1990;19:2003–2020. [Google Scholar]
- Riviere M, Yuan Y, Dubois F, Zohar S. A Bayesian dose-finding design for clinical trials combining a cytotoxic agent with a molecularly targeted agent. Journal of the Royal Statistical Society: Series C. 2015;64:215–229. [Google Scholar]
- Rogatko A, Gosh P, Vidakovic B, Tighiouart M. Patient-specific dose adjustment in the cancer clinical trial setting. Pharmaceutical Medicine. 2008;22:345–350. [Google Scholar]
- Sato H, Hirakawa A, Hamada C. An adaptive dose-finding method using a change-point model for molecularly targeted agents in phase I trials. Statistics in Medicine. 2016;35:4093–4109. doi: 10.1002/sim.6981. [DOI] [PubMed] [Google Scholar]
- Thall P, Cook J. Dose-finding based on efficacytoxicity trade-offs. Biometrics. 2004;60:684–693. doi: 10.1111/j.0006-341X.2004.00218.x. [DOI] [PubMed] [Google Scholar]
- Thall P, Russell K. A strategy for dose-finding and safety monitoring based on efficacy and adverse outcomes in phase I/II clinical trials. Biometrics. 1998;54:251–264. [PubMed] [Google Scholar]
- Thall PF, Millikan RE, Mueller P, Lee SJ. Dose-finding with two agents in phase I oncology trials. Biometrics. 2003;59:487–496. doi: 10.1111/1541-0420.00058. [DOI] [PubMed] [Google Scholar]
- Tighiouart M, Li Q, Rogatko A. A Bayesian adaptive design for estimating the maximum tolerated dose curve using drug combinations in cancer phase I clinical trials. Statistics in Medicine. 2017;36:280–290. doi: 10.1002/sim.6961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tighiouart M, Piantadosi S, Rogatko A. Dose finding with drug combinations in cancer phase I clinical trials using conditional escalation with overdose control. Statistics in Medicine. 2014;33:3815–3829. doi: 10.1002/sim.6201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tighiouart M, Rogatko A. Dose finding with escalation with overdose control (EWOC) in cancer clinical trials. Statistical Science. 2010;25:217–226. [Google Scholar]
- Tighiouart M, Rogatko A. Number of patients per cohort and sample size considerations using dose escalation with overdose control. Journal of Probability and Statistics. 2012;2012:16. [Google Scholar]
- Tighiouart M, Rogatko A, Babb JS. Flexible Bayesian methods for cancer phase I clinical trials. Dose escalation with overdose control. Statistics in Medicine. 2005;24:2183–2196. doi: 10.1002/sim.2106. [DOI] [PubMed] [Google Scholar]
- Wages N. Identifying a maximum tolerated contour in two-dimensional dose finding. Statistics in Medicine. 2016;36:242–253. doi: 10.1002/sim.6918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wages N, Conaway M. Phase I/II adaptive design for drug combination oncology trials. Statistics in Medicine. 2014;33:1990–2003. doi: 10.1002/sim.6097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wages N, Slingluff C, Petroni G. A Phase I/II adaptive design to determine the optimal treatment regimen from a set of combination immunotherapies in high-risk melanoma. Contemporary Clinical Trials. 2016;41:172–179. doi: 10.1016/j.cct.2015.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang K, Ivanova A. Two-dimensional dose finding in discrete dose space. Biometrics. 2005;61:217–222. doi: 10.1111/j.0006-341X.2005.030540.x. [DOI] [PubMed] [Google Scholar]
- Yuan Y, Yin G. Bayesian phase I/II adaptively randomized oncology trials with combined drugs. Annals of Applied Statistics. 2011;5:924–942. doi: 10.1214/10-AOAS433. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


