Abstract
Traditional dose-finding designs do not require assignment of patients to a control group. Motivated by SHRINC (Safety of Pioglitazone for hematoma resolution in intracerebral hemorrhage), we developed a placebo-controlled dose-finding study to identify the maximum tolerated dose for pioglitazone in stroke patients with spontaneous intracerebral hemorrhage. We designed an extension of the continuous reassessment method that allowed to incorporate information from the control group (i.e., the standard of care), and utilized it to determine the maximum tolerated dose in the SHRINC trial. We evaluated the operating characteristics of our design by conducting extensive simulation studies. Our findings from the simulation studies demonstrate that our proposed design is robust and performs well. By estimating the toxicity rate in the control group, we were able to obtain more accurate information about the natural history of the disease and identify appropriate dose for the next phase of this study. The proposed design provides a tool to incorporate the information from the control group into the dose-finding framework for trials with similar objectives.
Keywords: Continuous reassessment method, Dose finding, Placebo-controlled study, Stroke research
1. Introduction
Clinical trials are considered as the most reliable method for evaluation of safety and efficacy of new drugs and other clinical interventions. Phase I dose-finding trials are designed to identify the maximum tolerated dose (MTD) of a new drug, defined as the highest dose within a tolerable dose-limiting toxicity (DLT). After the MTD is determined, the drug will be carried forward for subsequent assessments through phase II and III trials. Inaccurate evaluation of the MTD can lead to waste of resources if an inappropriate dose is moved to subsequent phases. Therefore, it is important to design efficient dose-finding trials to determine the most appropriate dose before the drug is tested in future phases of drug development.
Broadly, dose-finding trial designs are classified into two types, algorithm-based designs, and model-based designs. The algorithm-based designs, also known as up-and-down designs, are used often in practice due to their simplicity in implementation. The most popular algorithm-based design is the “3 + 3” design [1]. Although it is popular in practice, the reported shortcomings of this design include unreliable estimation of the MTD [2], a significant large proportion of patients treated at subtherapeutic dose levels [3], and the restricted choice of the target DLT rate [4]. Several investigators attempted to develop improved up-and-down designs to identify the MTD, including the accelerated titration design [5], the biased coin design [6] and its extension with isotonic regression [7], the k-in-a-row design [8], the up-and-down design based on isotonic regression [9], the modified toxicity probability interval design [10], and the Bayesian optimal interval design [11]. Comprehensive reviews of up-and-down designs are provided by Ivanova [12] and Liu et al. [4]. For model-based designs, the most popular one is the continual reassessment method (CRM) [13]. In contrast to 3 + 3 design, CRM design provides a more accurate estimation of toxicity probability of the MTD and a more flexible setup of target DLT rate. Due to the popularity of CRM, a variety of extensions have been proposed to improve its practical implementation and operating characteristics [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24]. Other model-based designs include designs based on Bayesian decision-theoretic approach [25], the escalation with overdose control [26], Bayesian nonparametric approach [27], and stochastic approximation [28]. Comprehensive reviews for dose-finding designs can be found in Rosenberger and Haines [29], Chevret [30], Ting [31], and Le Tourneau et al. [3].
In stroke trials, DLT usually is defined as a clinical event with substantial morbidity and mortality such as a cerebral hemorrhage [32]. Therefore, the acceptable target rate of patients undergoing such events is quite small. With the development of aforementioned dose-finding designs, CRM is the most promising detection method to address this challenge. There has been some effective utilization of dose-finding designs in stroke trials, especially the application of CRM [33], [34], [35]. In addition to the utilization of CRM design, other dose-finding designs were also applied to stroke trials. For example, Krams et al. [36] used a Bayesian adaptive dose finding design based on normal dynamic linear model in an acute ischemic stroke study. Whelan et al. [37] described the utilization of the Bayesian phase I/II design proposed by Thall and Cook [2] to find an optimal dose for treatment of ischemic stroke in children.
All the aforementioned dose-finding designs do not require assignment of patients to a control group. However, for an acute intracerebral hemorrhage (ICH) trial, SHRINC (Safety of Pioglitazone for hematoma resolution in intracerebral hemorrhage), the investigators were interested in finding a dose with the target toxicity rate dependent upon the rate in the concurrent control group. That means, the target toxicity rate in the SHRINC study is unknown before the initiation of the trial and needs to be determined based on the toxicity data collected from the concurrent control group. Motivated by SHRINC, we developed a placebo-controlled dose-finding study based on CRM to identify the MTD. The inclusion of a control group allowed us to study the natural history of the disease and co-morbidities. Therefore, our proposed design can lead to more meaningful MTD identification than traditional designs. The purpose of this paper is to describe the unique features of our proposed design, and share our experience on its application to the SHRINC study. Using a stroke study as a motivating example, our proposed design provides a tool to incorporate the information from the control group into the dose-finding framework for trials in other diseases with similar objectives. We evaluate the operating characteristics of the proposed design by conducting extensive simulation studies.
2. Methods
2.1. Rational of the inclusion of a control group
SHRINC study was designed to assess the safety of PIO in spontaneous ICH compared with the standard of care. It is a prospective, randomized, blinded, placebo-controlled, and dose-escalation safety trial in which patients are randomly allocated to control or treatment groups. The primary objective of the SHRINC trial is to determine the MTD of PIO, a dose with DLT rate closest to a target rate. More detailed information including inclusion/exclusion criteria, informed consent, safety outcomes, and clinical and radiographic outcomes can be found in Gonzales et al. [38]. The SHRINC study is registered at http://www.clinicaltrials.gov/(registration number NCT00827892).
The inclusion of a control group is not a characteristic of the traditional dose finding trials. In SHRINC, we included a control group for the following reasons: first, many expected adverse events (AEs) and serious AEs (SAEs) in the PIO group were also expected in the ICH population as part of the natural history of the disease and co-morbidities. These AEs and SAEs were not captured in our prospective stroke registry [39] and are difficult to collect accurately in a retrospective manner. Therefore, we planned to collect this information prospectively and compared the rates of AE/SAEs between the PIO-treated and control groups to obtain a more accurate measure of safety with the use of PIO in our patient population. Second, in our preclinical work, more rapid hematoma resolution was correlated with improved neurologic recovery. This finding has not been demonstrated in the clinical setting. In addition, the control group data related to the rate of hematoma resolution would help to determine an optimal duration of PIO. Third, retrospective studies usually overestimate the expected benefit of treatment. Therefore, the inclusion of a concurrent control group would improve our knowledge of toxicity and efficacy profile of the study population and guide us to choose a meaningful target DLT for MTD identification. Thus, instead of specifying a fixed target DLT rate before the initiation of the trial, we consider the target rate dependent up on the rate in the concurrent control group. As a result, many existing methods, such as the CRM, cannot be directly applied. Toward this goal, we propose a modified CRM dose-finding design with a control group for our study.
2.2. Dose-finding method
In this section, we propose an extension of the CRM that allows incorporating information from the control group for dose finding. Suppose K dose levels, denoted as d1, d2, …, dK, have been chosen for the investigation with the true toxicity probability Pk for dose dk, k = 1, …, K. Let Y be a binary variable to denote whether a patient has experienced the prespecified DLT event, with 1 denoting an event and 0 otherwise, and let x denote the dose level for this patient. Usually, a one-parameter model ψ(x,α) is proposed to model the relationship between dose level x and its toxicity probability with the unknown parameter α. There are three popular dose-toxicity models of ψ(x,α) proposed for CRM: logistic model, power model, and hyperbolic tangent model. In our proposed design, we utilize the following one-parameter logistic model,
| (1) |
where c is a constant and recommended to be 3 [40]. As part of the design, we also need a “skeleton” for the CRM, which is the investigator's prior estimates of DLT at each dose level, denoted as for dose level dk. By plugging the toxicity probability in equation (1) with each value of the “skeleton”, we obtain the standardized dose , where is the mean or median of the prior estimate of the parameter and .
During the course of the trial, if n patients have been enrolled into the study and assigned to different dose levels, we denote the standardized dose for ith patient, i = 1, …, n as xi with his/her observed outcome yi, where . Denoting the observed data {xi, yi, i = 1, …, n} as D, the likelihood function of observed data D can be written as . Denoting the prior density of α as π(α), the posterior density of α can be written as . The prior distribution for α is set to unit exponential distribution. The other choice can be uniform distribution with 0 < α < 3, and the log-normal distribution. The Gibbs sampler [41] can be used to obtain the posterior samples for α. After that, the posterior mean for Pi at dose di, i = 1, …, K, can be calculated as
In the traditional CRM, if the target DLT rate is set at a fixed value of θ, then the MTD d∗ is taken to be one of the specified dose in the set of d1, d2, …, dk which satisfies the following criteria Without losing the generality of assumption, we consider a design to identify the MTD with the DLT rate closest to a target rate, defined as a rate higher than the rate in the control group by a constant magnitude of δ. We denote the toxicity rate in the control group as P0. To reduce the variability of the toxicity rate in the control group among different cohorts, we estimate P0 during the course of the trial using all enrolled controls so far. Suppose we observe m0 toxicities among n0 patients in the control group and assume P0 follows a Beta distribution Beta(a, b) with two shape parameters a and b. Under the beta-binomial model, the posterior mean of P0 is estimated as . Then the modified MTD d∗ in our proposed design is identified as the dose which satisfies the following modified criteria
| (2) |
2.3. Dose escalation/de-escalation rule
We assume patients are treated in cohorts at each dose level and the concurrent control group. To ensure patient safety, we do not allow skipping more than one dose level for dose escalation, but for de-escalation, we allow more than one skip in dose levels. Our proposed dose-finding algorithm is described as follows:
-
1.
Start the trial at the lowest dose level and randomize a cohort of patients to the control and treatment groups based on a 1:1 allocation ratio.
-
2.After obtaining the toxicity outcome of the enrolled patients, update the dose-toxicity relationship for the treatment group using model (1), estimate the toxicity rate of the control group using all enrolled cohorts in the control group and identify the MTD based on the criteria (2).
-
2.1.If the estimated MTD is lower than the current dose level, de-escalate the dose to the estimated MTD.
-
2.2.If the estimated MTD is higher than the current dose level, escalate the dose to the estimated MTD if no more than one dose level was skipped; Otherwise, escalate the dose to the next higher dose level.
-
2.3.If the estimated MTD is the current dose level, assign the next cohort to the same dose level.
-
2.1.
-
3.
Repeat the above steps until reaching the sample size N and select as the MTD the dose satisfying the criteria (2).
3. Simulation studies
Simulation studies have been conducted to evaluate the performance of our proposed design. To mimic our SHRINC study, we consider a sample size of 84 and 11 dose levels for investigation. The investigator's prior estimates of DLT at each dose level are 10%, 12%, 15%, 18%, 21%, 25%, 26%, 27%, 28%, 29%, 30%. We set the target DLT rate the same as the rate in the control group, i.e., δ = 0. In addition, we consider the cases with δ = 0.10. The logistic model (1) is utilized to estimate the dose-toxicity relationship with unit exponential prior for α. To estimate the toxicity rate in the control group, i.e., P0, we assume it follows a weakly informative prior Beta(0.1, 0.6) with mean at 0.14.
We consider ten scenarios to evaluate the performance under various dose-toxicity profiles and different values of δ. We display the true toxicity rates and the value of δ under each scenario in Table 1. In scenario 1, we consider the toxicity rate of the control group P0 as 0.10 and the toxicity rates of 11 PIO dose levels ranging from 0.01 to 0.46. Since the MTD is defined as a dose with a toxicity rate the same as P0, i.e., δ = 0, the target MTD in scenario 1 is the third dose level, i.e., 0.4 mg/kg/day. We consider P0 as 0.20 in scenario 2 and 0.30 in scenario 3, respectively. In scenarios 4 to 6, we consider δ = 0.10. In scenarios 7 to 10, we consider the scenarios with the target toxicity rate of MTD exactly equal to 15% and 25% under different choices of P0 and δ. The target MTDs are presented in boldface in Table 1 under each dose-toxicity profile.
Table 1.
True DLT rates at eleven PIO dose levels as well as the toxicity rate in the control group under six scenarios (the target MTD are in boldface).
| Scenario | δ | Toxicity rate in control | True DLT rate |
||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PIO dose level (mg/kg/day) | |||||||||||||
| 0.1 | 0.2 | 0.4 | 0.6 | 0.8 | 1.0 | 1.2 | 1.4 | 1.6 | 1.8 | 2.0 | |||
| 1 | 0 | 0.10 | 0.01 | 0.04 | 0.09 | 0.15 | 0.20 | 0.28 | 0.33 | 0.37 | 0.39 | 0.43 | 0.46 |
| 2 | 0 | 0.20 | 0.06 | 0.10 | 0.14 | 0.20 | 0.28 | 0.36 | 0.42 | 0.46 | 0.50 | 0.53 | 0.58 |
| 3 | 0 | 0.30 | 0.14 | 0.21 | 0.28 | 0.35 | 0.42 | 0.48 | 0.52 | 0.59 | 0.62 | 0.65 | 0.68 |
| 4 | 0.10 | 0.10 | 0.01 | 0.04 | 0.09 | 0.15 | 0.20 | 0.28 | 0.33 | 0.37 | 0.39 | 0.43 | 0.46 |
| 5 | 0.10 | 0.20 | 0.06 | 0.10 | 0.14 | 0.20 | 0.28 | 0.36 | 0.42 | 0.46 | 0.50 | 0.53 | 0.58 |
| 6 | 0.10 | 0.30 | 0.14 | 0.21 | 0.28 | 0.35 | 0.42 | 0.48 | 0.52 | 0.59 | 0.62 | 0.65 | 0.68 |
| 7 | 0 | 0.15 | 0.01 | 0.04 | 0.09 | 0.15 | 0.22 | 0.29 | 0.35 | 0.39 | 0.43 | 0.46 | 0.49 |
| 8 | 0 | 0.25 | 0.03 | 0.08 | 0.13 | 0.18 | 0.25 | 0.31 | 0.36 | 0.42 | 0.46 | 0.50 | 0.53 |
| 9 | 0.10 | 0.05 | 0.01 | 0.04 | 0.09 | 0.15 | 0.20 | 0.28 | 0.33 | 0.37 | 0.39 | 0.43 | 0.46 |
| 10 | 0.10 | 0.15 | 0.06 | 0.10 | 0.14 | 0.19 | 0.25 | 0.33 | 0.38 | 0.44 | 0.50 | 0.53 | 0.58 |
We evaluate and compare the performance of our proposed study design with the traditional CRM and 3 + 3 designs. For the traditional CRM design, the MTD is usually defined as a dose with a prespecified target toxicity rate. However, in the SHRINC study, the target toxicity rate of the MTD depends on P0. Without knowing P0, we encounter a challenge to select an appropriate target rate when the traditional CRM design is applied. Therefore, we consider two choices of the target toxicity rate for the traditional CRM design: 1) set the target rate at 15%, denoted as CRM15%; 2) set the target toxicity rate at 25%, denoted as CRM25%. Second, our proposed design includes a control group and would assign half of patients (i.e., 42 patients) to the control group, but the traditional CRM design does not require the control group. For a fair comparison, we consider two different sample sizes, i.e., 42 and 84, for the traditional CRM design.
Simulation results are displayed in Table 2, which compares the correct selection percentage of the MTD and the percentage of patients assigned to over-toxic doses, i.e., dose levels above the MTD, among the proposed, the traditional CRM, and 3 + 3 designs. In scenario 1, our proposed design successfully selects the MTD (i.e, the 3rd dose) with the percentage of 24.6%, which is higher than that using the traditional CRM and the 3 + 3 designs. Although the selection percentage of the MTD using CRM15% is close to that of our proposed design, it assigns almost half the number of patients to over-toxic doses. CRM25% has the worst performance and incorrectly considers a higher dose as MTD most of the time. In scenario 2, our proposed design has similar performance as CRM15% with sample size of 42, but performs worse than CRM25% and the 3 + 3 designs. This is reasonable because the target toxicity rates of CRM25% and the 3 + 3 design are both close to the true toxicity rate of the MTD. We observe similar findings in scenario 3, in which the proposed design outperforms CRM15% and 3 + 3 designs, but not CRM25%. In scenarios 4 to 6, we consider δ = 0.10, i.e., the MTD is defined as a dose with a toxicity rate higher than P0 by a constant magnitude of 0.10. In scenario 4, our proposed design has a higher percentage of selecting MTD than that using CRM15% and the 3 + 3 designs. CRM25% with sample size of 84 selects the MTD with a higher percentage than our proposed design, but it assigns more than half number of patients to over-toxic doses. In scenario 5, our proposed design outperforms CRM15% and the 3 + 3 designs, but worse than CRM25% design of which the chosen target rate is close to the true toxicity rate of the MTD. In scenario 6, only our proposed design is able to identify the MTD and selects it with the highest percentage among all designs. In scenarios 7 and 9, the toxicity rate of the MTD is 15%, which exactly equals to the prespecified target rate in CRM15% design. Therefore, the CRM15% design performs best among all designs. Our proposed design has a lower correct selection percentage than the CRM15% design, but performs better than the CRM25% design. Similarly, in scenarios 8 and 10, with the toxicity rate of the MTD at 25%, the CRM25% design has the best performance, and the CRM15% design has the worst performance. These results show that arbitrarily choosing a target rate for the CRM design may not be appropriate especially when the selected rate deviates from the real value a lot, resulting in an incorrect identification of MTD. Overall, based on the findings from these scenarios, if the target rate of the MTD depends on an unknown P0, we need to estimate it and incorporate it into the dosing finding procedure. Otherwise, we may have high chance to miss the MTD. The 3 + 3 design works well if the toxicity rate of the MTD is close to 20–25%, e.g., scenarios 2 and 4. Similarly, the traditional CRM design also has a good performance if the prespecified rate is close to the toxicity rate of the MTD. In summary, our proposed design is robust and works well in most of the scenarios when the toxicity rate in the control group needs to be estimated and incorporated into the MTD finding procedure. To evaluate the robustness of the proposed design, we conduct a sensitivity analysis to evaluate its performance with a different choice of the prior specification for the parameter α. We consider a log-normal distribution with mean of 0 and variance of 1 for α and display the simulation results under the ten scenarios in Table 3. We compare these results to that in Table 2, in which the proposed design specifies a unit exponential prior for the parameter α. The percentages of the correct identification of MTD are similar for each scenario with these two priors. These results demonstrate that the performance of our design is robust on the specification of the prior distribution for the parameter α as long as it covers a reasonable range of the parameter estimated. In addition, we conduct a sensitivity analysis with a different choice of the allocation ratios between the treatment and the control groups. Different from the allocation ratio 1:1 used in Table 2, we consider the allocation ratios of 3:2 and 3:1, and summarize these results in Table 3. The selection percentage of the correct MTD identification with 3:2 ratio is slightly lower than the results with a ratio at 1:1 (see Table 2), but it utilizes a smaller sample size. However, the ratio 3:1 may not be a good choice in which the correct selection percentage is much lower than that with a ratio at 1:1. Therefore, we suggest to evaluate the performance of the proposed design with different choices of allocation ratio to select a one which provides a desirable performance with a reasonable sample size.
Table 2.
Percentage of the correct identification of MTD (Average percentage of patients assigned to the doses above the MTD) based on the proposed, traditional CRM, and 3 + 3 designs under different scenarios based on 1000 simulations.
| Scenario | Proposedc Design | CRM15%a |
CRM25%b |
3 + 3 Design | ||
|---|---|---|---|---|---|---|
| N = 42d | N = 84e | N = 42d | N = 84e | |||
| 1 | 24.6 (18.3) | 21.6 (54.6) | 23.4 (62.8) | 2.5 (75.7) | 0.5 (87.3) | 17.6 (49.3) |
| 2 | 22.9 (17.6) | 22.9 (14.7) | 26.1 (10.9) | 26.3 (44.9) | 33.4 (51.4) | 24.4 (17.9) |
| 3 | 22.0 (21.4) | 6.4 (5.2) | 2.8 (3.0) | 33.3 (24.8) | 43.6 (21.7) | 17.7 (10.6) |
| 4 | 29.8 (17.2) | 24.3 (17.1) | 26.8 (11.9) | 26.6 (45.5) | 38.3 (51.6) | 24.5 (16.5) |
| 5 | 28.9 (20.2) | 8.4 (5.3) | 4.1 (3.1) | 37.7 (24.6) | 46.5 (21.1) | 16.3 (7.1) |
| 6 | 23.7 (17.6) | 0.2 (0.5) | 0 (0.4) | 5.6 (4.0) | 1.3 (2.3) | 2.1 (0.7) |
| 7 | 22.6 (17.6) | 34.4 (32.0) | 48.3 (30.1) | 11.2 (61.6) | 8.8 (73.9) | 25.7 (29.9) |
| 8 | 26.7 (16.8) | 14.1 (9.3) | 9.2 (5.3) | 33.8 (33.4) | 47.3 (33.3) | 18.3 (10.3) |
| 9 | 26.0 (18.4) | 34.3 (33.9) | 46.8 (33.0) | 7.8 (63.0) | 5.4 (76.5) | 23.8 (31.0) |
| 10 | 27.7 (14.5) | 10.6 (7.2) | 6.5 (4.2) | 35.0 (28.8) | 48.4 (27.8) | 17.6 (8.7) |
Traditional CRM design with the target toxicity limit at 15%.
Traditional CRM design with the target toxicity limit at 25%.
We considered a sample size of 84 patients and randomized half patients to control group in the proposed design.
We considered a sample size of 42 patients for the traditional CRM design.
We considered a sample size of 84 patients for the traditional CRM design.
Table 3.
Sensitivity analysis with a lognormal prior distribution for alpha and different allocation ratios between treatment and control groups based on the proposed design under ten scenarios.
| Setting | Scenario |
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
| Lognormal prior for α | 23.4 (15.6) | 22.0 (15.8) | 23.5 (18.6) | 26.3 (15.2) | 29.3 (17.9) | 21.9 (15.3) | 22.5 (17.2) | 25.6 (16.3) | 26.2 (17.7) | 28.7 (13.8) |
| Ratio 3:2 | 21.8 (15.6) | 19.8 (16.0) | 21.7 (19.3) | 29.3 (15.3) | 25.3 (18.8) | 23.0 (16.1) | 19.6 (17.4) | 23.2 (17.1) | 27.7 (18.5) | 26.4 (18.2) |
| Ratio 3:1 | 13.5 (13.8) | 15.9 (14.8) | 16.1 (18.5) | 20.4 (14.9) | 20.9 (17.1) | 17.1 (15.6) | 16.8 (15.3) | 16.7 (15.6) | 25.1 (17.9) | 20.3 (21.4) |
Data are represented as the percentage of the correct identification of MTD (average percentage of patients assigned to the doses above the MTD).
4. Application of the proposed design to the SHRINC study
We implemented the proposed placebo-controlled CRM design for the MTD determination in the SHRINC study. There were 11 dose levels proposed in the study, which are 0.1, 0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0 mg/kg/day. A total of 84 patients were recruited and randomized to two study arms. Patients were enrolled in a cohort of 6 with 3 patients randomized to the treatment group and 3 patients randomized to the control group. Patients assigned to the treatment group were administered the escalated dose for 3 days followed by a daily maintenance dose of 30 mg. DLT was defined as in-hospital death (discharge or day 14 whichever occurs first) and the target rate was defined as a rate the same as the toxicity rate in the control group. That means, if the toxicity rate in control group is 15%, then the target rate for the MTD identification is set at 15%, i.e., δ = 0 in criteria (2).
Our trial successfully enrolled 84 patients with 42 assigned to the placebo group and 42 assigned to the PIO group. Table 4 shows the number of patients and the observed in-hospital deaths at each dose level. At the end of the trial, we observed two in-hospital deaths, with one in the control group and the other in the PIO group. The course of patient flow including the dose escalation and de-escalation is shown in Fig. 1. Three patients out of the first cohort were randomized to the starting dose, 0.1 mg/kg/day. Subsequent cohorts were randomized to doses after obtaining in-hospital data of all previous cohorts. The doses for cohorts 2 to 7 were escalated as follows: 0.4, 0.6, 0.8, 1.0, 1.2, and 1.4 mg/kg/day. On the completion of cohort 7, we observed one death in the control group. The trial was continued with dose escalation for cohorts 8, 9, 10, and 11. However, we observed one death at cohort 11 in the PIO group. Subsequently, the dose levels for the next two cohorts were de-escalated to 1.7 and 1.6 mg/kg/day for patient safety. The dose level was then re-escalated to 1.7 mg/kg/day for the last cohort. After reaching the planned sample size, the study was completed. According to the proposed dose-finding algorithm, we updated the dose-mortality curve (Fig. 2) and estimated the in-hospital mortality rate for each dose level. The posterior mean of the mortality rate at the highest dose level 2.0 mg/kg/day is 4.1% and the posterior mean of the mortality rate in the control group is 2.6%. The dose level 1.0 mg/kg/day is the one that satisfies the proposed MTD criteria defined in (2), which is the dose with the mortality rate closest to that in the control group.
Table 4.
Number of patients and the observed deaths at different PIO dose levels.
| Pioglitazone (PIO) | ||
|---|---|---|
| PIO daily dose for three days (mg/kg/day) | # of treated patients | # of deaths at discharge or day 14 |
| 0.1 | 4 | 0 |
| 0.2 | 0 | 0 |
| 0.4 | 2 | 0 |
| 0.6 | 3 | 0 |
| 0.8 | 3 | 0 |
| 1.0 | 4 | 0 |
| 1.2 | 3 | 0 |
| 1.4 | 3 | 0 |
| 1.5 | 3 | 0 |
| 1.6 | 3 | 0 |
| 1.7 | 6 | 0 |
| 1.8 | 6 | 1 |
| 1.9 | 3 | 0 |
| 2.0 | 0 | 0 |
Fig. 1.
Does escalation/de-escalation throughout the course of the trial for the PIO and placebo groups. Patients are presented in cohort order from left to right. Open circles indicate patients in the PIO group and solid grey circles indicate patients in the placebo group. Patients who experienced DLT, i.e., mortality within two weeks or before discharge, are denoted as crosses.
Fig. 2.
Model-based probabilities of the mortality. Crosses represent a priori assumptions about the defined mortality from the investigator, i.e., “Skeleton”. Filled circles and bars represent the posterior means as well as the 95% credible intervals of the mortality rates at each dose level based on the data collected at the end of the study. The horizontal line indicates the estimated mortality rate in the concurrent control group.
5. Discussion
In this manuscript, we proposed a placebo-controlled dose-finding design with an application to a stroke study and shared our experiences at various faces of its designing and implementations. The motivating study was not only a dose-finding study for PIO treating ICH patients, but also studying the natural history of the disease and identifying whether the speed of hematoma/edema resolution in ICH represents a biological marker actively correlated with radiographic and clinical effect of PIO. Considering the overall objectives of the study as well as the MTD identification, there was a need for having a control group and leveraging this additional information available to us by incorporating it into dose-finding procedure. The proposed design provides a tool to incorporate the control data into the dose-finding framework for studies with similar objectives, and can be applied to other diseases other than stroke. Through simulation studies, we evaluated the performance of our proposed design by comparing it with the traditional CRM and 3 + 3 designs. Our simulation results demonstrate that our proposed design is robust and works well when the toxicities of the treatment can overlap with the natural history of the studied disease. Therefore, we suggest including a concurrent control group for more meaningful MTD identification under this situation. The data from the control group contain important information to study the natural history of the disease, such as the incidence of AE/SAEs and the radiographic information. In addition, the two-week mortality rate in our concurrent control group is different from the one we estimated from the historical data using our registry data, which also demonstrates the importance of the inclusion of concurrent control group. These differences between the concurrent and historical controls could be due to the possible guidelines or standard of care changes.
A number of lessons we have learned from our SHRINC study. First, we have enrolled the planned 84 patients and observed two DLTs. With 11 dose levels proposed in our study, nearly 1 or 2 cohorts were assigned to each dose. After the interim analysis, due to safety reasons, three more doses were inserted into the prespecified 11 dose levels. Although this provides an estimation of dose-toxicity relationship under different dose levels, it limits the number of cohorts we could assign to the target MTD or its neighborhood. Therefore, for designing a similar dose-finding study in future, we suggest investigating fewer dose levels or starting the trial from the dose in the middle of dose range. For appropriate dose insertion, simulation studies are recommended to evaluate the performance before implementing it. Second, the DLT in our SHRINC study is defined as a death before discharge or at two weeks which occurs first. Therefore, we need to wait for several days or even two weeks to observe outcomes from all previous cohorts. During the course of our trial, if an eligible patient arrived before the outcomes were observed for the current cohort, we allowed enrollment of this patient into the study and randomized this patient to either the current dose or control group, e.g., 4 patients assigned to 0.1 mg/kg/day. After this patient was enrolled, we did not randomize any new patients until observing outcome for all the patients enrolled to that point. However, it is possible that we still miss eligible patients. To avoid this, we can apply the methods described in Cai et al. [42] to impute the missing outcomes before decision making. Third, we realized the choice of in-hospital mortality as primary endpoint may not be optimal for this study after we finished the trial. The observed mortality rate is lower than our expected rate, which is not as informative as we expected for our dose escalation and dose finding procedure, but fortunately it demonstrates our proposed treatment is safe to patients. Our results show that the prespecified skeleton in our SHRINC study may deviate from the truth. To overcome the arbitrariness and further enhance the robustness of the design, one possible approach is to specify a different set of skeletons and adopt Bayesian model averaging method proposed in Yin and Yuan (2009) [20]. Further exploration of the optimal dose identification for the subsequent phase based on other information such as serious adverse events as well as considering both toxicity and efficacy data may be beneficial [43]. This may require a specific definition of the AEs and the determination whether the AE is part of the natural history of the disease or due to the treatment.
Declaration of conflicting interests
The authors declare that there is no conflict of interest.
Acknowledgements
This research is funded by the National Institute of Neurological Disorders and Stroke (NINDS) through a SPOTRIAS program grant P50NS044227 awarded to the University of Texas Health Science Center at Houston (UTHealth). We also acknowledge the support provided by the Biostatistics/Epidemiology/Research Design (BERD) component of the Center for Clinical and Translational Sciences (CCTS) for this project. CCTS is mainly funded by the NIH Centers for Translational Science Award (NIH CTSA) grant UL1 RR024148 (KL2 RR0224149 for the K12 program), awarded to University of Texas Health Science Center at Houston in 2006 by the National Center for Research Resources (NCRR) and its renewal (UL1 TR000371) by the National Center for Advancing Translational Sciences (NCATS). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NINDS, NCRR, or the NCATS.
References
- 1.Storer B.E. Design and analysis of phase I clinical trials. Biometrics. 1989 Sep;45(3):925–937. [PubMed] [Google Scholar]
- 2.Thall P.F., Lee S.J. Practical model-based dose-finding in phase I clinical trials: methods based on toxicity. Int. J. Gynecol. Cancer. 2003 May;13(3):251–261. doi: 10.1046/j.1525-1438.2003.13202.x. [DOI] [PubMed] [Google Scholar]
- 3.Le Tourneau C., Lee J.J., Siu L.L. Dose escalation methods in phase I cancer clinical trials. J. Natl. Cancer Inst. 2009 May 20;101(10):708–720. doi: 10.1093/jnci/djp079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Liu S., Cai C., Ning J. Up-and-down designs for phase I clinical trials. Contemp. Clin. Trials. 2013 Sep;36(1):218–227. doi: 10.1016/j.cct.2013.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Simon R., Freidlin B., Rubinstein L., Arbuck S.G., Collins J., Christian M.C. Accelerated titration designs for phase I clinical trials in oncology. J. Natl. Cancer Inst. 1997 Aug 6;89(15):1138–1147. doi: 10.1093/jnci/89.15.1138. [DOI] [PubMed] [Google Scholar]
- 6.Durham S.D., Flournoy N. Springer; 1994. Random Walks for Quantile Estimation. Statistical Decision Theory and Related Topics V; pp. 467–476. [Google Scholar]
- 7.Stylianou M., Flournoy N. Dose finding using the biased coin up-and-down design and isotonic regression. Biometrics. 2002 Mar;58(1):171–177. doi: 10.1111/j.0006-341x.2002.00171.x. [DOI] [PubMed] [Google Scholar]
- 8.Gezmu M. American University; Washington, DC: 1995. The Geometric Up and Down Design for Allocating Dosage Levels. [Google Scholar]
- 9.Leung D.H., Wang Y. Isotonic designs for phase I trials. Control Clin. Trials. 2001 Apr;22(2):126–138. doi: 10.1016/s0197-2456(00)00132-x. [DOI] [PubMed] [Google Scholar]
- 10.Ji Y., Liu P., Li Y., Bekele B.N. A modified toxicity probability interval method for dose-finding trials. Clin. Trials. 2010 Dec;7(6):653–663. doi: 10.1177/1740774510382799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Liu S., Yuan Y. Bayesian optimal interval designs for phase I clinical trials. J. R. Stat. Soc. Ser. C Appl. Stat. 2015;64(3):507–523. [Google Scholar]
- 12.Ivanova A. John Wiley & Sons, Inc; 2007. Dose Escalation and Up-and-Down Designs. Wiley Encyclopedia of Clinical Trials. [Google Scholar]
- 13.O'Quigley J., Pepe M., Fisher L. Continual reassessment method: a practical design for phase 1 clinical trials in cancer. Biometrics. 1990 Mar;46(1):33–48. [PubMed] [Google Scholar]
- 14.Goodman S.N., Zahurak M.L., Piantadosi S. Some practical improvements in the continual reassessment method for phase I studies. Stat. Med. 1995 Jun 15;14(11):1149–1161. doi: 10.1002/sim.4780141102. [DOI] [PubMed] [Google Scholar]
- 15.Moller S. An extension of the continual reassessment methods using a preliminary up-and-down design in a dose finding study in cancer patients, in order to investigate a greater range of doses. Stat. Med. 1995 May 15;14(9–10):911–922. doi: 10.1002/sim.4780140909. [DOI] [PubMed] [Google Scholar]
- 16.Piantadosi S., Fisher J.D., Grossman S. Practical implementation of a modified continual reassessment method for dose-finding trials. Cancer Chemother. Pharmacol. 1998;41(6):429–436. doi: 10.1007/s002800050763. [DOI] [PubMed] [Google Scholar]
- 17.Heyd J.M., Carlin B.P. Adaptive design improvements in the continual reassessment method for phase I studies. Stat. Med. 1999 Jun 15;18(11):1307–1321. doi: 10.1002/(sici)1097-0258(19990615)18:11<1307::aid-sim128>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]
- 18.Braun T.M. The bivariate continual reassessment method. extending the CRM to phase I trials of two competing outcomes. Control Clin. Trials. 2002 Jun;23(3):240–256. doi: 10.1016/s0197-2456(01)00205-7. [DOI] [PubMed] [Google Scholar]
- 19.Leung D.H., Wang Y.G. An extension of the continual reassessment method using decision theory. Stat. Med. 2002 Jan 15;21(1):51–63. doi: 10.1002/sim.970. [DOI] [PubMed] [Google Scholar]
- 20.Yin G., Yuan Y. Bayesian model averaging continual reassessment method in phase I clinical trials. J. Am. Stat. Assoc. 2009;104(487):954–968. [Google Scholar]
- 21.Daimon T., Zohar S., O'Quigley J. Posterior maximization and averaging for Bayesian working model choice in the continual reassessment method. Stat. Med. 2011 Jun 15;30(13):1563–1573. doi: 10.1002/sim.4054. [DOI] [PubMed] [Google Scholar]
- 22.Yuan Y., Yin G. Robust EM continual reassessment method in oncology dose finding. J. Am. Stat. Assoc. 2011 Sep 1;106(495):818–831. doi: 10.1198/jasa.2011.ap09476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Liu S., Yin G., Yuan Y. Bayesian data augmentation dose finding with continual reassessment method and delayed toxicity. Ann. Appl. Stat. 2013 Dec 1;7(4):1837–2457. doi: 10.1214/13-AOAS661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pan H., Yuan Y. A default method to specify skeletons for Bayesian model averaging continual reassessment method for phase I clinical trials. Stat. Med. 2017;36(2):266–279. doi: 10.1002/sim.6941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Whitehead J., Brunier H. Bayesian decision procedures for dose determining experiments. Stat. Med. 1995 May 15;14(9–10):885–893. doi: 10.1002/sim.4780140904. [DOI] [PubMed] [Google Scholar]
- 26.Babb J., Rogatko A., Zacks S. Cancer phase I clinical trials: efficient dose escalation with overdose control. Stat. Med. 1998 May 30;17(10):1103–1120. doi: 10.1002/(sici)1097-0258(19980530)17:10<1103::aid-sim793>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
- 27.Gasparini M., Eisele J. A curve-free method for phase I clinical trials. Biometrics. 2000 Jun;56(2):609–615. doi: 10.1111/j.0006-341x.2000.00609.x. [DOI] [PubMed] [Google Scholar]
- 28.Cheung Y.K., Elkind M.S. Stochastic approximation with virtual observations for dose-finding on discrete levels. Biometrika. 2010 Mar;97(1):109–121. doi: 10.1093/biomet/asp065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rosenberger W.F., Haines L.M. Competing designs for phase I clinical trials: a review. Stat. Med. 2002 Sep 30;21(18):2757–2770. doi: 10.1002/sim.1229. [DOI] [PubMed] [Google Scholar]
- 30.Chevret S. 24 ed. John Wiley & Sons, Ltd; Hoboken, NJ, USA: 2006. Statistical Methods for Dose-finding Experiments. [Google Scholar]
- 31.Ting N. Springer Science & Business Media; New York, NY, USA: 2006. Dose Finding in Drug Development. [Google Scholar]
- 32.Fisher M., Cheung K., Howard G., Warach S. New pathways for evaluating potential acute stroke therapies. Int. J. Stroke. 2006 May;1(2):52–58. doi: 10.1111/j.1747-4949.2006.00025.x. [DOI] [PubMed] [Google Scholar]
- 33.Elkind M.S., Sacco R.L., MacArthur R.B., Fink D.J., Peerschke E., Andrews H. The Neuroprotection with Statin Therapy for Acute Recovery Trial (NeuSTART): an adaptive design phase I dose-escalation study of high-dose lovastatin in acute ischemic stroke. Int. J. Stroke. 2008 Aug;3(3):210–218. doi: 10.1111/j.1747-4949.2008.00200.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Fagan S.C., Waller J.L., Nichols F.T., Edwards D.J., Pettigrew L.C., Clark W.M. Minocycline to improve neurologic outcome in stroke (MINOS): a dose-finding study. Stroke. 2010 Oct;41(10):2283–2287. doi: 10.1161/STROKEAHA.110.582601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Selim M., Yeatts S., Goldstein J.N., Gomes J., Greenberg S., Morgenstern L.B. Safety and tolerability of deferoxamine mesylate in patients with acute intracerebral hemorrhage. Stroke. 2011 Nov;42(11):3067–3074. doi: 10.1161/STROKEAHA.111.617589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Krams M., Lees K.R., Hacke W., Grieve A.P., Orgogozo J.M., Ford G.A. Acute Stroke Therapy by Inhibition of Neutrophils (ASTIN): an adaptive dose-response study of UK-279,276 in acute ischemic stroke. Stroke. 2003 Nov;34(11):2543–2548. doi: 10.1161/01.STR.0000092527.33910.89. [DOI] [PubMed] [Google Scholar]
- 37.Whelan H.T., Cook J.D., Amlie-Lefond C.M., Hovinga C.A., Chan A.K., Ichord R.N. Practical model-based dose finding in early-phase clinical trials: optimizing tissue plasminogen activator dose for treatment of ischemic stroke in children. Stroke. 2008 Sep;39(9):2627–2636. doi: 10.1161/STROKEAHA.107.510164. [DOI] [PubMed] [Google Scholar]
- 38.Gonzales N.R., Shah J., Sangha N., Sosa L., Martinez R., Shen L. Design of a prospective, dose-escalation study evaluating the Safety of Pioglitazone for Hematoma Resolution in Intracerebral Hemorrhage (SHRINC) Int. J. Stroke. 2013 Jul;8(5):388–396. doi: 10.1111/j.1747-4949.2011.00761.x. [DOI] [PubMed] [Google Scholar]
- 39.Rahbar M.H., Gonzales N.R., Ardjomand-Hessabi M., Tahanan A., Sline M.R., Peng H. The University of Texas Houston Stroke Registry (UTHSR): implementation of enhanced data quality assurance procedures improves data quality. BMC Neurol. 2013;13:61. doi: 10.1186/1471-2377-13-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.O'Quigley J., Chevret S. Methods for dose finding studies in cancer clinical trials: a review and results of a Monte Carlo study. Stat. Med. 1991 Nov;10(11):1647–1664. doi: 10.1002/sim.4780101104. [DOI] [PubMed] [Google Scholar]
- 41.Casella G., George E.I. Explaining the Gibbs sampler. Am. Stat. 1992;46(3):167–174. [Google Scholar]
- 42.Cai C., Liu S., Yuan Y.A. Bayesian design for phase II clinical trials with delayed responses based on multiple imputation. Stat. Med. 2014 Oct 15;33(23):4017–4028. doi: 10.1002/sim.6200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hossain M.M., Chen Z. Comments on ‘Application of an adaptive design to a randomized phase II selection trial in gastric cancer: a report of the study design’ by Satoshi Morita and Junichi Sakamoto. Pharm. Stat. Pharm. Stat. 2012;11(3):267–268. doi: 10.1002/pst.539. [DOI] [PubMed] [Google Scholar]


