Abstract
PURPOSE
A phase II/III trial is a type of phase III trial that has embedded in it an intermediate phase II go/no-go decision as to whether to continue the accrual to the phase III sample size. We examine the design characteristics and experience of a well-defined set of National Cancer Institute phase II/III trials, with special emphasis on designed accrual suspensions while awaiting the data to become mature enough for the phase II analysis. This experience is used to highlight the potential of using a calendar backstop to avoid an inordinately long accrual suspension.
METHODS
We identified all phase II/III trials conducted by NRG Oncology or its precursor National Cancer Institute Cooperative Groups (Radiation Therapy Oncology Group, Gynecologic Oncology Group, and National Surgical Adjuvant Breast and Bowel Project). The design characteristics were recorded, and, for completed trials, the trial results in terms of sample sizes and timing of analyses were tabulated.
RESULTS
Twenty-two trials were identified, 14 of which had a time-to-event end point for their phase II component. Thirteen of these 14 trials had designed accrual suspensions. Seven of the eight completed trials had designed accrual suspensions, all of which went on longer than their projected suspension times (3-20 months longer than planned). The trade-offs for using a backstop are discussed using one of these trials as an example.
CONCLUSION
Phase II/III trials with an accrual suspension and a predefined backstop for the phase II analysis can be a useful tool for minimizing patient exposure to ineffective experimental treatments while still obtaining the trial results in a timely fashion.
INTRODUCTION
Seamless phase II/III randomized clinical trials are phase III trials that have an interim go/no-go decision rule, typically using an intermediate end point for the initial (phase II) component, to decide if the experimental treatment has sufficient activity (compared with the control treatment) to continue the trial to the phase III component.1 A phase II/III trial substitutes for a phase II trial followed by a separate phase III trial. Its main advantage over sequential development is the use of the phase II patient data in the phase III analysis, shortening the time to obtain the phase III results and the total number of trial patients treated. The main disadvantage of a phase II/III trial is the commitment to a phase III trial of a specific experimental therapy against a specific control therapy in a specific population (if the phase II results are positive) earlier than if a stand-alone phase II trial had been performed. This could be problematic in a clinical setting with rapid advancements in drug and biomarker development.2
CONTEXT
Key Objective
How frequently does accrual suspension occur in phase II/III oncology trials, and can a calendar backstop help reduce delays in the transition from phase II to phase III?
Knowledge Generated
In an analysis of 22 phase II/III trials, 13 trials included designed accrual suspensions and most experienced delays beyond their projected timelines, ranging from 3 to 20 months. The study suggests that a calendar backstop may help manage these delays, allowing trials to progress more efficiently and reducing patient exposure to potentially ineffective treatments. This approach could enhance trial timelines while maintaining the integrity of phase II evaluations.
A key design element of a phase II/III trial is whether to have an accrual suspension while waiting for the phase II data to mature3 and its structure if one is used. Phase II/III designs are typically used when sufficient evidence for experimental treatment activity is lacking to directly proceed to a large phase III trial. Therefore, an accrual suspension can be important as otherwise the trial might accrue to almost its full phase III sample size before a potentially negative phase II result is obtained (thus exposing many more patients to a potentially toxic inactive treatment).4 For example, NRG-BN0075 suspended accrual after 159 patients had been randomly assigned to the phase II part of the trial. The (no-go) phase II decision was made when there were 100 progression-free survival (PFS) events, which occurred 11 months later after the suspension began. If an accrual suspension had not been used in this trial, we estimate that approximately an additional 73 patients would have been accrued to this negative trial (on the basis of its accrual rate). The disadvantage to having an accrual suspension is that it delays getting the phase III results (if the phase II results are positive). It also disrupts the trial process, especially for a multi-institutional trial and if the suspension is long.
There have been studies of the theoretical operating characteristics of phase II/III trials,6,7 but this report examines the NRG experience with phase II/III trials, with special focus on the use of accrual suspensions.
METHODS
We identified by searching the National Cancer Institute (NCI) Clinical Trials Support Unit database for all phase II/III trials conducted by NRG Oncology or its precursor NCI Cooperative Groups: Radiation Therapy Oncology Group (RTOG), Gynecologic Oncology Group (GOG), and National Surgical Adjuvant Breast and Bowel Project (NSABP). For trials with a time-to-event phase II end point, the trial design characteristics for the phase II and phase III components were recorded including the end points, the required sample sizes and numbers of events, the type 1 errors and powers, and the targeted hazard ratios (HRs).
For completed trials with published results, we compared the projected and actual timing of the phase II analysis. The timings were compared in terms of both calendar time and the number of patients accrued. For trials with a designed phase II accrual suspension, we compared the projected and actual phase II accrual suspension calendar times.
RESULTS
Twenty-two trials were identified. Eight of these trials had non–time-to-event phase II end points and are not considered further here as accrual suspensions would not be relevant, leaving 14 trials (Table 1). The phase II end points were PFS (12 trials), disease-free survival (one trial), and freedom from progression (one trial). The phase III end points were overall survival (OS; nine trials), PFS (one trial), PFS/OS (two trials), metastasis-free survival (one trial), and coprimary end points of PFS and a quality-of-life end point (one trial). The P value cutoffs for the go/no-go decisions for the phase II analyses were P ≤ .10 (five trials), P ≤ .15 (five trials), and P ≤ .20 (five trials).
TABLE 1.
Trial ID (NCT No.) | Phase II | Phase III | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
End Point | Target HR/Effect Size | Type 1 Error (1-sided), % | Required No. of Events | Power | End Point | Target HR/Effect Size | Type 1 Error (1-sided), % | Sample Size (including phase II) | Required No. of Eventsa | Power, %b | |
GOG-02818 (NCT02101788) | PFS | 0.66 | 20 | 106 | 90 | PFS | 0.66 | 2.5 | 250 | 213 | 80 |
GOG-0286B9 (NCT02065687) | PFS | 0.66 | 20 | 60 | 90 | OS | 0.66 | 5.0 | 540 | 300 | 97 |
NRG-BN0075 (NCT04396860) | PFS | 0.58 | 15 | 100 | 95 | OS | 0.72 | 2.4 | 485 | 363 | 85 |
NRG-BR00210 (NCT02364557) | PFS | 0.55 | 15 | 69 | 93 | OS | 0.67 | 2.5 | 360 | 231 | 85 |
NRG-GY005c,11 (NCT02502266) | PFS | 0.625 | 20 | 93 | 90 | PFS OSd |
0.625 0.625 |
0.83 0.83d |
340 | 323f 223 |
96 90 |
NRG-GY009c,12 (NCT02839707) | PFS | 0.625 | 10 | 110 | 88 | OSe PFSe |
0.625 0.55 |
1.24 0.01 |
320 | 236 280f |
91 90 |
NRG-HN00413 (NCT03258554) | PFS | 0.65 | 20 | 69 | 80 | OS | 0.73 | 2.5 | 444 | 364 | 80 |
NRG-LU00214 (NCT03137771) | PFS | 0.60 | 15 | 138 | 95 | OS | 0.68 | 2.5 | 378 | 278 | 85 |
NOTE. The additional six ongoing phase II/III trials were NRG-GU002 (NCT03070886), NRG-GY026 (NCT05256225), NRG-HN005 (NCT03952585), NRG-LU007 (NCT04402788), RTOG-1008 (NCT01220583), and RTOG-1216 (NCT01810913).
Abbreviations: GOG, Gynecologic Oncology Group; HR, hazard ratio; NSABP, National Surgical Adjuvant Breast and Bowel Project; OS, overall survival; PFS, progression-free survival; RTOG, Radiation Therapy Oncology Group.
If a trial specified the required number of events from only the control arm, the number in this table is given by (1+ HR) times the required number of control arm patients, where HR is the target HR.
This is the nominal power for the phase III trial not accounting for the phase II analysis.
This trial had more than one experimental arm. The numbers in this table refer to each experimental arm versus control arm comparison.
The OS end point was tested only if the PFS end point for that comparison was statistically significant.
Coprimary end points—both tested.
The PFS phase III analysis was specified to occur when the OS phase III analysis was performed. This is an estimate of the number of PFS events expected to have occurred at that time.
Among the 14 trials, 13 had designed accrual suspensions, with seven having ≥6-month expected suspensions. Of the eight completed trials with published results (Table 2), seven had designed potential accrual suspensions, with the actual suspensions lasting from 3 to 20 months longer than planned (median 15 months).
TABLE 2.
Trial | Timing of Phase II Analysis From Trial Activation (months/patients accrued) | Length of Suspension (from the last phase II patient accrued) | Phase II Result (proceed to phase III or stop) | Phase III Result (positive/negative) | ||||
---|---|---|---|---|---|---|---|---|
Projected | Actual | Projected | Actual | |||||
Months | No. | Months | No. | Months | Months | |||
GOG-0281 | 32 | 176 | 38 | 213 | NA | NA | Go | Positive |
GOG-0286B | 17 | 240 | 32 | 244 | 0a | 16 | Go | Negative |
NRG-BN007 | 23 | 150 | 31 | 159 | 7 | 11 | Stop | NA |
NRG-BR002 | 42 | 128 | 89 | 129 | 12 | 32 | Stop | NA |
NRG-GY005b | 22 | 104 | 34 | 106 | 3 | 18 | Go | Negative |
NRG-GY009b | 33 | 160 | 26 | 160 | 12 | 15 | Go | Negative |
NRG-HN004 | 24 | 234 | 41 | 186c | 0 | 13 | Stop | NA |
NRG-LU002 | 45 | 216 | 77 | 218 | 7 | 22 | Stop | NA |
Abbreviations: GOG, Gynecologic Oncology Group; NA, not applicable; NSABP, National Surgical Adjuvant Breast and Bowel Project; RTOG, Radiation Therapy Oncology Group.
The design was to stop accrual at the 240 phase II sample size if the required number of phase II events was not seen by that time, but it was projected that the required number of events would be seen at that time.
Sample size for each experimental arm versus control arm comparison.
After planned interim phase II futility analysis, the trial was closed to accrual.
DISCUSSION
When the experimental treatment works much better than expected or when the event rate has been overestimated in both treatment arms (the more likely scenario), an accrual suspension time can be much longer than projected. To lessen this problem, a backstop strategy15 is recommended: the phase II analysis is performed at the prespecified backstop time if the required number of phase II events has not yet occurred (the backstop length would depend on clinical and feasibility parameters15). The downside of using a backstop strategy is that with fewer events (if the backstop analysis is triggered), there is less information about the treatment effect to inform the go/no-go decision rule. Reduced information results in either a decrease in power or an increase in the type I error (or possibly both) of the phase II go/no-go decision rule. To balance these two competing considerations, we suggest using a go/no-go decision rule on the basis of the prespecified cutoff on the hazard ratio scale, rather than a rule on the basis of hypothesis testing with the prespecified type 1 error. The hazard ratio scale cutoff approach results in less power loss relative to the hypothesis testing approach with some inflation of the type I error.
For example, the design of NRG-BR00210 had phase II testing at a significance level of 0.15 at 69 PFS events (corresponding to requiring the observed HR to be <0.78) for the trial to continue onto phase III. The trial design had an accrual suspension that was projected to be 12 months, but instead lasted for 32 months awaiting the 69 events. If the trial design had included a backstop of 18 months and the backstop analysis was used (with, say, 38 events), we would have recommended that the HR observed at that time be compared with the designed cutoff HR of 0.78 for the go/no-go decision rather than testing at the 0.15 significance level. Using the 0.78 HR cutoff rather than the 0.15-level testing for an 18-month backstop analysis at 38 PFS events would have resulted in a readout 14 months earlier, at the cost of a power loss from 93% to 86% and an increased type 1 error from 0.15 to 0.22 (for the phase II portion of the trial). In comparison, had the same 18-month backstop analysis been conducted at the same type 1 error of 0.15, the power would have been further reduced to 79% (instead of 86%). If the trial had included a later backstop at 24 months instead of 18 months, and the backstop analysis was applied using the cutoff HR of 0.78 (with, say, 45 events), then the trial would have resulted in a power of 88% (instead of 83% on the basis of 0.15-level testing) and a type 1 error for the phase II portion of the trial of 0.20 (instead of 0.15).
In NRG-LU002,14 the phase II testing was designed to detect a PFS HR of 0.60, with a significance level of 0.15 and a 95% power. The go/no-go decision was contingent on whether the observed HR was 0.83 or lower when at least 138 PFS events were observed. Patient accrual, however, was suspended for 22 months—much longer than the projected 7 months—to reach the 138 PFS events. If the trial design had included a 12-month backstop and, say, 115 PFS events had been observed, a go/no-go decision (on the basis of the HR cutoff of 0.83) could have been made 10 months earlier, with a 93% power (compared with the planned 95%) and a significance level of 0.17 (instead of the planned 0.15).
Phase II/III trials with an accrual suspension with a predefined backstop can be a useful tool for minimizing patient exposure to ineffective experimental treatments while still obtaining the trial results in a timely fashion.
Chen Hu
Consulting or Advisory Role: Johnson & Johnson Enterprise Innovation Inc
No other potential conflicts of interest were reported.
SUPPORT
Supported in part by the National Cancer Institute, US National Institutes of Health award U10 CA180822.
AUTHOR CONTRIBUTIONS
Conception and design: All authors
Financial support: Chen Hu,
Administrative support: Chen Hu,
Provision of study materials or patients:
Collection and assembly of data: All authors
Data analysis and interpretation: All authors
Manuscript writing: All authors
Final approval of manuscript: All authors
Accountable for all aspects of the work: All authors
AUTHORS’ DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST
The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated unless otherwise noted. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO’s conflict of interest policy, please refer to https://ascopubs.org/authors.
Open Payments is a public database containing information reported by companies about payments made to US-licensed physicians (Open Payments).
Chen Hu
Consulting or Advisory Role: Johnson & Johnson Enterprise Innovation Inc
No other potential conflicts of interest were reported.
REFERENCES
- 1.Bretz F, Schmidli H, König F, et al. : Confirmatory seamless phase II/III clinical trials with hypotheses selection at interim: General concepts. Biom J 48:623-634, 2006 [DOI] [PubMed] [Google Scholar]
- 2.Freidlin B, McShane LM, Polley MC, et al. : Randomized phase II trial designs with biomarkers. J Clin Oncol 30:3304-3309, 2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Korn EL, Freidlin B, Abrams JS, et al. : Design issues in randomized phase II/III trials. J Clin Oncol 30:667-671, 2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Freidlin B, Korn EL, Abrams JS: Bias, operational bias, and generalizability in phase II/III trials. J Clin Oncol 36:1902, 2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lassman A, Polly M, Iwamoto F, et al. : CTIM-18. NRG oncology study BN007: Randomized phase ii/iii trial of ipilimiumab (ipi) plus nivolumab (nivo) vs. temozolomide (tmz) in mgmt-unmethylated (UMGMT) newly diagnosed glioblastoma (NGBM). Neuro Oncol 25:v65, 2023. (suppl 5) [Google Scholar]
- 6.Hunsberger S, Zhao Y, Simon R: A comparison of phase II study strategies. Clin Cancer Res 15:5950-5955, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Stallard N, Todd S: Seamless phase II/III designs. Stat Methods Med Res 20:623-634, 2011 [DOI] [PubMed] [Google Scholar]
- 8.Gershenson DM, Miller A, Brady WE, et al. : Trametinib versus standard of care in patients with recurrent low-grade serous ovarian cancer (GOG 281/LOGS): An international, randomised, open-label, multicentre, phase 2/3 trial. Lancet 399:541-553, 2022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bae-Jump VL, Sill M, Gehrig PA, et al. : A randomized phase II/III study of paclitaxel/carboplatin/metformin versus paclitaxel/carboplatin/placebo as initial therapy for measurable stage III or IVA, stage IVB, or recurrent endometrial cancer: An NRG oncology/GOG study. Gynecol Oncol 159:7, 2020 [Google Scholar]
- 10.Chmura SJ, Winter KA, Woodward WA, et al. : NRG-BR002: A phase IIR/III trial of standard of care systemic therapy with or without stereotactic body radiotherapy (SBRT) and/or surgical resection (SR) for newly oligometastatic breast cancer (NCT02364557). J Clin Oncol 40, 2022. (suppl 16; abstr 1007) [Google Scholar]
- 11.Lee JM, Brady M, Miller A, et al. : A phase II/III study of cediranib and olaparib combination compared to cediranib or olaparib alone or standard of care chemotherapy, in platinum-resistant ovarian cancer (NRG-GY005). J Clin Oncol 10.1200/JCO.24.00683 [epub ahead of print on October 3, 2024] [DOI] [PMC free article] [PubMed]
- 12.O’Cearbhaill R, Sill M, Duong H, et al. : PO005LBA/#1579 A randomized, phase II/III study of pegylated-liposomal-doxorubicin and atezolizumab (IND #134427) versus pegylated-liposomal-doxorubicin, bevacizumab and atezolizumab versus pegylated-liposomal-doxorubicin and bevacizumab in platinum-resistant ovarian cancer (NRG-GY009). Int J Gynecol Cancer 33:A4, 2023. (suppl 4) [Google Scholar]
- 13.Mell LK, Torres-Saavedra P, Wong S, et al. : Radiotherapy with durvalumab vs. cetuximab in patients with locoregionally advanced head and neck cancer and a contraindication to cisplatin: Phase II results of NRG-HN004. Int J Radiat Oncol Biol Phys 114:1058, 2022 [Google Scholar]
- 14.Iyengar P, Hu C, Daniel RG, et al. : NRG-LU002: Randomized phase II/III trial of maintenance systemic therapy versus local consolidative therapy (LCT) plus maintenance systemic therapy for limited metastatic non-small cell lung cancer (NSCLC). J Clin Oncol 42, 2024. (suppl 16; abstr 8506) [Google Scholar]
- 15.Othus M, Freidlin B, Korn EL: Avoiding delays in reporting time-to-event randomized trials: Calendar backstops and other approaches. J Clin Oncol 42:3753-3760, 2024 [DOI] [PMC free article] [PubMed] [Google Scholar]