Single-Arm Phase II Survival Trial Design Under the Proportional Hazards Model

Jianrong Wu

doi:10.1080/19466315.2016.1174147

. Author manuscript; available in PMC: 2017 Sep 28.

Published in final edited form as: Stat Biopharm Res. 2017 Mar 2;9(1):25–34. doi: 10.1080/19466315.2016.1174147

Single-Arm Phase II Survival Trial Design Under the Proportional Hazards Model

Jianrong Wu ¹

PMCID: PMC5619878 NIHMSID: NIHMS899011 PMID: 28966721

Abstract

For designing single-arm phase II trials with time-to-event endpoints, a sample size formula is derived for the modified one-sample log-rank test under the proportional hazards model. The derived formula enables new methods for designing trials that allow a flexible choice of the underlying survival distribution. Simulation results showed that the proposed formula provides an accurate estimation of sample size. The sample size calculation has been implemented in an R function for the purpose of trial design.

Keywords: contiguous alternative, one-sample log-rank test, proportional hazards model, time-to-event, single-arm phase II trial, sample size

1 Introduction

A time-to-event endpoint, such as event-free survival or overall survival, is often the primary endpoint for cancer clinical trials. In pediatric oncology, single-arm phase II trials with time-to-event endpoints are often conducted with limited numbers of patients. Various statistical methods have been proposed for designing randomized phase III trials with time-to-event endpoints (e.g., by George and Desu, 1977; Lachin, 1981; Rubenstein et al., 1981; Schoenfeld, 1983; Lakatos, 1988; Barthel et al. 2006; and many others). However, the literature on designing single-arm phase II trials with time-to-event endpoints is relatively scarce. The current practice for designing such trials is limited to using a parametric maximum likelihood test under the exponential model or a naive approach based on dichotomizing the event time at a landmark time point (Owzar and Jung, 2008). Trial design under the exponential model may not be reliable and the naive approach is inefficient.

Recently, Kwak and Jung (2014) proposed a two-stage phase II survival trial design using the one-sample log-rank test (OSLRT) (Breslow, 1975; Woolson, 1981; and Finkelstein et al., 2003). However, simulation results showed that Kwak and Jung’s design is conservative and underpowered. To correct the power and conservativeness of the OSLRT, Wu (2015) proposed a modified one-sample log-rank test (MOSLRT) for single-arm phase II survival trial designs. The MOSLRT preserves the type I error well and provides adequate power for trial design for a class of common parametric survival distributions. However, all the parametric survival distributions make strong assumptions about the shape of the hazard functions and are difficult to validate for historical data. In this paper, formulae for the number of events and sample size are derived under the proportional hazards model. The derived formula for the number of events is an analog version of the Schoenfeld formula for a two-arm randomized phase III trial using the two-sample log-rank test (Schoenfeld, 1983). Trial design based on the proposed sample size formula offers great flexibility in choosing of the underlying survival distribution, which could be a parametric survival distribution, a non-parametric Kaplan-Meier curve, or a spline version of the survival distribution (Kooperberg and Stone, 1992; Bantis et al., 2012; Anderson et al., 2013).

The rest of the paper is organized as follows. The MOSLRT is introduced in section 2. The formulae for the number of events and sample size are derived in section 3. Parameter setting for trial design is discussed in section 4. Simulations are conducted to study the performance of the proposed methods in section 5. An example is given in section 6 to illustrate the single-arm phase II survival trial designs. Concluding remarks are made in section 7.

2 Test Statistics

Let S₀(t) denote the survival function under the null hypothesis that is chosen for a single-arm phase II trial design. Let S(t) denote the survival function of the experimental treatment. Consider the following proportional hazards model:

S (t) = {[S_{0} (t)]}^{δ},

(1)

where δ(> 0) is the hazard ratio. The hypothesis of improvement in survival with the experimental treatment is

H_{0} : δ \geq 1 vs . H_{1} : δ < 1.

(2)

Testing this hypothesis is equivalent to testing the difference between the survival distributions with the experimental treatment and under the null hypothesis. Thus, the OSLRT can be used. However, the OSLRT is conservative, as shown by Kwak and Jung (2014); Sun et al. (2010); and Wu (2015). Recently, Wu (2015) proposed a MOSLRT that preserves the type I error well and provides adequate power for study design. To introduce the MOSLRT, assume that during the accrual phase of the trial, n subjects are enrolled in the study. Let T_i and C_i denote the event time and censoring time, respectively, of the i^th subject. We assume that the event time T_i and censoring time C_i are independent and that {T_i, C_i, i = 1, …, n} are independent and identically distributed. Then, the observed event time and event indicator are X_i = T_i ∧C_i and Δ_i = I(T_i ≤ C_i), respectively, for the i^th subject. On the basis of the observed data {X_i, Δ_i, i = 1,…, n}, we define $O = \sum_{i = 1}^{n} Δ_{i}$ as the observed number of events and $E = \sum_{i = 1}^{n} Λ_{0} (X_{i})$ as the expected number of events (asymptotically), where Λ₀(t) = − log S₀(t) is the cumulative hazard function under the null hypothesis. Then, the MOSLRT is defined by

L = \frac{O - E}{\sqrt{(O + E) / 2}} .

To study the asymptotic distribution, we formulate it using counting-process notation. Specifically, let N_i(t) = Δ_iI{X_i ≤ t} and Y_i(t) = I{X_i ≥ t} be the failure and at-risk processes, respectively, then

O = \sum_{i = 1}^{n} \int_{0}^{\infty} d N_{i} (t), E = \sum_{i = 1}^{n} \int_{0}^{\infty} Y_{i} (t) d Λ_{0} (t) .

Thus, the counting-process formulation of the MOSLRT is given by $L = W / \hat{σ}$ , where

W = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{\infty} {d N_{i} (t) - Y_{i} (t) d Λ_{0} (t)},

and

{\hat{σ}}^{2} = n^{- 1} \sum_{i = 1}^{n} \int_{0}^{\infty} {d N_{i} (t) + Y_{i} (t) d Λ_{0} (t)} / 2.

Under the null hypothesis H₀, by the strong law of large numbers, $n^{- 1} O \to E_{H_{0}} (Δ)$ and $n^{- 1} \sum_{i = 1}^{n} \int_{0}^{\infty} Y_{i} (t) d Λ_{0} (t) \to \int_{0}^{\infty} G (t) S_{0} (t) d Λ_{0} (t) = E_{H_{0}} {Λ_{0} (X)}$ , where G(t) is the survival distribution of the censoring time C. Then, ${\hat{σ}}^{2} \to \int_{0}^{\infty} G (t) S_{0} (t) d Λ_{0} (t) = {Var}_{H_{0}} (W)$ , and $E_{H_{0}} (W) = 0$ (Wu, 2015). Therefore, by the counting process central limit theorem (Fleming and Harrington, 1991), L is asymptotically standard normal distributed. Hence, we reject the null hypothesis H₀ with one-sided type I error α if $L = W / \hat{σ} < - z_{1 - α}$ , where z₁₋_α is the 100(1 − α) percentile of the standard normal distribution.

3 Sample Size Formulae

Traditionally, sample size is often derived under the fixed alternative. However, when the asymptotic distribution of the test statistics is difficult to derive under the fixed alternative, contiguous alternatives (Lin et al., 1999) can also be considered by assuming that the alternative value of the testing parameter decreases to the null value at the rate of n^−1/2, where n is the sample size. For example, under the proportional hazard model, the null hypothesis of interest is H₀ : γ = 0 and the fixed alternative hypothesis of interest is H₁ : γ = γ₁, where γ = − log(δ) is the negative hazard ratio and γ₁ > 0. The contiguous alternatives of interest are $H_{1 n} : γ = γ_{1 n} = b / \sqrt{n}$ , which converges to the null hypothesis H₀ : γ = 0 as sample size n goes to infinity. Here, we will first derive a formula for the number of events under the contiguous alternatives. The proportional hazard model (1) is equivalent to λ(t) = e⁻^γλ₀(t), where λ₀(t) and λ(t) are the hazard functions under the null hypothesis and the experimental treatment, and γ = − log(δ) > 0. To derive the formula, we consider a sequence of contiguous alternatives $H_{1 n} : γ = γ_{1 n} = b / \sqrt{n}$ , where b < ∞. Under the H₁_n, as shown in Appendix 1, $L = W / \hat{σ}$ is approximately normal distributed with mean

μ = - b p_{0}^{1 / 2} = n^{1 / 2} \log (δ) p_{0}^{1 / 2},

and unit variance, where $p_{0} = E_{H_{0}} (Δ)$ is the probability of failure under the null hypothesis, which can be shown as

p_{0} = \int_{0}^{\infty} G (t) S_{0} (t) d Λ_{0} (t) .

Therefore, the study power 1 − β under the contiguous alternatives H₁_n satisfies the following:

1 - β = P (L < - z_{1 - α} | H_{1 n}) = P (L - μ < - μ - z_{1 - α} | H_{1 n}) ≃ Φ (- n^{1 / 2} \log (δ) p_{0}^{1 / 2} - z_{1 - α}) .

Thus, d = np₀, the expected number of events under the null hypothesis, satisfies the equation

z_{1 - β} = - d^{1 / 2} \log (δ) - z_{1 - α} .

Solving for d, we obtain

d = \frac{{(z_{1 - α} + z_{1 - β})}^{2}}{{[\log (δ)]}^{2}},

(3)

which gives the expected number of events under the null hypothesis. To calculate the sample size of the trial, let p₁ be the probability of failure under the alternative, which is given by

p_{1} = \int_{0}^{\infty} G (t) S_{1} (t) d Λ_{1} (t),

where S₁(t) = [S₀(t)]^δ and Λ₁(t) = δΛ₀(t). Then, the required sample size for the trial is given by d₁/p₁, where d₁ is the number of events under the alternative. However, we don’t know for the d₁, and we have only derived number of events d under the null hypothesis. The d/p₀ is the sample size under the null which underestimates the sample size required under the alternative, where p₀ is the probability of failure under the null. As d/p₁ > d₁/p₁, thus, d/p₁ overestimates the required sample size. Let P be the average probabilities of failure under the null and alternative, that is

P = (p_{0} + p_{1}) / 2,

then, a reasonable estimate of the required sample size is n = d/P which can be calculated by

n = \frac{{(z_{1 - α} + z_{1 - β})}^{2}}{P {[\log (δ)]}^{2}} .

(4)

For the purpose of comparison, the sample size formula for the MOSLRT under the fixed alternative H₁ (Wu, 2015) is also given as follows:

n = \frac{{(\bar{σ} z_{1 - α} + σ z_{1 - β})}^{2}}{ω^{2}},

(5)

where ω = v₁ − v₀, ${\bar{σ}}^{2} = (v_{1} + v_{0}) / 2$ , and $σ^{2} = v_{1} - v_{1}^{2} + 2 v_{00} - v_{0}^{2} - 2 v_{01} + 2 v_{0} v_{1}$ , with v₀, v₁, v₀₀, and v₀₁ being given by the following equations:

v_{0} = \int_{0}^{\infty} G (t) S_{1} (t) d Λ_{0} (t),

v_{1} = \int_{0}^{\infty} G (t) S_{1} (t) d Λ_{1} (t),

v_{00} = \int_{0}^{\infty} G (t) S_{1} (t) Λ_{0} (t) d Λ_{0} (t),

v_{01} = \int_{0}^{\infty} G (t) S_{1} (t) Λ_{0} (t) d Λ_{1} (t) .

4 Parameter Setting for Trial Design

For trial design using sample size formula (4), we first consider one of the following common parametric survival distributions: Weibull, gamma, Gompertz, log-normal, or log-logistic. The design parameters of the underlying survival distribution S(t) under the null hypothesis can be set as follows. Let S(x) be the survival probability of S(t) at a landmark time point x, S₀(x) be the level of S(x) at which investigators are no longer interested in the experimental treatment, and S₁(x)(> S₀(x)) be the level of S(x) at which investigators consider the experimental treatment is promising. Then the hypothesis of (2) is equivalent to the following hypothesis:

H_{0} : S (x) \leq S_{0} (x) v s . H_{1} : S (x) > S_{0} (x),

and the trial is powered at the alternative S(x) = S₁(x). Here, the shape parameter of the underlying survival distribution is assumed to be known from historical data. Thus, the scale parameter (Table 1) for each distribution can be determined by the value of S₀(x), which is given as follows:

Weibull $S_{0} (t) = e^{- λ_{0} t^{κ}}$ , with λ₀ = − log S₀(x)/x^κ,
Log-normal $S_{0} (t) = 1 - Φ (\frac{\log t - μ_{0}}{σ})$ , with μ₀ = log(x) − σΦ⁻¹(1 − S₀(x)),
Gompertz $S_{0} (t) = e^{- \frac{θ_{0}}{γ} (e^{γ t} - 1)}$ , with θ₀ = −γ log S₀(x)/(e^γx − 1),
Gamma S₀(t) = 1 − I_k(λ₀t), with $λ_{0} = I_{k}^{- 1} (1 - S_{0} (x)) / x$ ,
Log-logistic S₀(t) = 1/(1 + λ₀t^p), with λ₀ = (1/S₀(x) − 1)/x^p.

The hazard ratio can be calculated by

δ = \frac{\log S_{1} (x)}{\log S_{0} (x)},

and the survival distribution under the alternative is given by S₁(t) = [S₀(t)]^δ. To calculate the probabilities p₀ and p₁ for formula (4), we assume that subjects are recruited with a uniform distribution over the accrual period t_a and followed for a period of t_f and that no subject is lost to follow-up. Thus, the censoring distribution is a uniform distribution over [t_f, t_a + t_f]. That is, the censoring survival distribution G(t) = 1 if t ≤ t_f; = (t_a + t_f − t)/t_a if t_f ≤ t ≤ t_a + t_f; = 0 otherwise. Hence, the probabilities of failure p₀ and p₁ can be calculated by the following integration:

p_{i} = 1 - \frac{1}{t_{a}} \int_{t_{f}}^{t_{a} + t_{f}} S_{i} (t) d t, i = 0, 1

where S₁(t) = [S₀(t)]^δ. If S₀(t) is a spline version of the survival distribution, then p_i can also be calculated by numerical integration. If S₀(t) is a Kaplan-Meier curve, then p_i can be calculated numerically using Simpson’s rule as follows:

p_{i} = 1 - \frac{1}{6} {S_{i} (t_{f}) + 4 S_{i} (0.5 t_{a} + t_{f}) + S_{i} (t_{a} + t_{f})}, i = 0, 1.

The proposed sample size formula can also incorporate lost to follow-up in the sample size calculation. For example, let C₁ be the loss to follow-up time and C₂ be the administrative censoring time, then the overall censoring time is C = C₁ ∧ C₂, where C₁ and C₂ are independent. Thus, the overall censoring distribution is G(t) = P(C > t) = P(C₁ > t)P(C₂ > t) = G₁(t)G₂(t). It is often assumed that the loss to follow-up distribution is an exponential G₁(t) = e⁻^ηt, and administrative censoring distribution G₂(t) is uniform. Therefore the sample size formula (4) can be calculated by numerical integrations. For non-uniform accrual, once the accrual distribution is specified, the sample size can be calculated as well.

Table 1.

Various parametric distributions used for single-arm phase II trial designs.

Surv. function

Density

Parameter

Cumu. hazard

Hazard

Dist.

S(t)

f(t)

Scale

Shape

Λ(t)

λ(t)

WB(λ, κ)

e^{- λ t^{κ}}

κ λ t^{κ - 1} e^{- λ t^{κ - 1}}

λt^κ

κλt^κ−1

GM(λ, k)

1 − I_k (λt)

\frac{λ t^{k - 1} e^{- λ t}}{Γ (k)}

− log S(t)

\frac{f (t)}{S (t)}

LN(μ, σ²)

1 - Φ (\frac{\log t - μ}{σ})

\frac{1}{\sqrt{2 π} σ t} e^{- \frac{(\log t - μ)}{2 σ^{2}}}

− log S(t)

\frac{f (t)}{S (t)}

LG(λ, p)

\frac{1}{1 + λ t^{p}}

\frac{p λ t^{p - 1}}{{(1 + λ t^{p})}^{2}}

log(1 + λt^p)

\frac{p λ t^{p - 1}}{1 + λ t^{p}}

GZ(θ, γ)

e^{- \frac{θ}{γ} (e^{γ t} - 1)}

θ e^{γ t} e^{- \frac{θ}{γ} (e^{γ t} - 1)}

\frac{θ}{γ} (e^{γ t} - 1)

θe^γt

Open in a new tab

Footnote: abbreviation Dist.: distribution; Surv.: survival; Cumu.: cumulative

5 Simulation studies

We first investigated whether formula (4) would give an accurate sample size estimation. We calculated sample sizes under various hazard ratios δ = 1.2⁻¹–2.0⁻¹, with powers of 80%, 85%, and 90% and a type I error of 5%. The accuracy was assessed by simulations performed under the Weibull distribution. The Weibull shape parameter κ was set to 0.5, 1 and 2 to reflect a decreasing, constant and increasing hazard function, and the median survival time under the null was set to m₀ = 1. We assumed that subjects were recruited with a uniform distribution over the accrual period t_a = 3 (years) and followed for t_f = 1 (year) and that no subject was lost to follow-up; that is, only administrative censoring was considered in the trial. Under these assumptions, the number of events and sample sizes were calculated, and empirical powers and type I errors were estimated based on 100,000 simulation runs (Table 2). All simulated empirical powers and type I errors were close to the nominal levels. Additional sample size calculations were conducted under the Weibull model for various combinations of accrual period t_a, follow-up time t_f and landmark time point x for survival probability S₀(x) under null which varies from 0.2 to 0.7 and a 10% increasing survival probability S₁(x) under alternative to mimic a variety of real trial design. Detail for the set up of the design parameters were given in Table 2. Simulations were conducted to estimate the empirical type I error and power for the corresponding sample size based on 100,000 runs. The empirical type I errors and powers were close to the nominal levels for all scenarios. Thus, the formula (4) did provide an accurate estimation of the sample size for trial design.

Table 2.

Number of events (d) and sample sizes (n) were calculated from formulae (3) and (4) for various of hazard ratios (δ) under the Weibull distribution, with nominal type I error of 0.05 and power of 80%, 85%, and 90%. The empirical type I errors $(\hat{α})$ and powers $(1 - \hat{β})$ were estimated based on 100,000 simulation runs.

Power

Design

κ = 0.5

κ = 1

κ = 2

δ⁻¹

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

90%

1.2

258

415

0.051

0.904

338

0.051

0.901

285

0.049

0.902

1.3

125

205

0.051

0.903

166

0.052

0.902

139

0.049

0.901

1.4

128

0.052

0.904

103

0.051

0.904

0.049

0.900

1.5

0.053

0.907

0.051

0.904

0.050

0.901

1.6

0.052

0.905

0.052

0.902

0.049

0.900

1.7

0.052

0.904

0.052

0.906

0.050

0.903

1.8

0.055

0.908

0.052

0.908

0.049

0.905

1.9

0.054

0.902

0.052

0.904

0.049

0.900

2.0

0.054

0.903

0.053

0.903

0.050

0.903

85%

1.2

217

349

0.051

0.854

284

0.051

0.853

240

0.049

0.852

1.3

105

172

0.051

0.854

140

0.051

0.856

116

0.049

0.852

1.4

107

0.054

0.856

0.051

0.855

0.050

0.853

1.5

0.053

0.855

0.052

0.856

0.049

0.853

1.6

0.054

0.858

0.051

0.860

0.050

0.853

1.7

0.053

0.861

0.052

0.856

0.049

0.854

1.8

0.053

0.861

0.052

0.861

0.049

0.855

1.9

0.054

0.858

0.052

0.865

0.050

0.851

2.0

0.055

0.859

0.053

0.859

0.049

0.848

80%

1.2

186

300

0.052

0.806

244

0.051

0.805

206

0.050

0.806

1.3

148

0.052

0.805

120

0.051

0.807

100

0.048

0.805

1.4

0.051

0.806

0.052

0.807

0.049

0.803

1.5

0.052

0.808

0.052

0.811

0.050

0.812

1.6

0.055

0.809

0.052

0.810

0.048

0.808

1.7

0.053

0.809

0.052

0.810

0.049

0.808

1.8

0.054

0.818

0.053

0.818

0.049

0.815

1.9

0.055

0.816

0.053

0.816

0.048

0.801

2.0

0.056

0.813

0.053

0.814

0.048

0.809

Open in a new tab

Next, we conducted simulations to compare the sample size formulae (4) and (5). In simulations, the survival distributions were taken as Weibull, gamma, Gompertz, log-normal and log-logistic (Table 1). The parameters of the survival distribution under the null were set as follows: the shape parameter of each distribution was set to 0.5, 1, and 2; the survival probabilities at a landmark time point x = 2 under the null were set to S₀(x) = 0.2 – 0.7 and under the alternative were set to S₁(x) = 0.35 – 0.8, with same accrual and censoring distributions as before. Given a nominal type I error of 5% and power of 80%, the required sample sizes based on formulae (4) and (5) were calculated for each design scenario. For each calculated sample size, 100,000 random samples were generated from the corresponding distribution to estimate the empirical type I error and power (Tables 3 and 4). The simulation results showed that the empirical powers were close to the nominal level of 80% for all scenarios. Thus, sample size formulae (4) and (5) both gave an accurate estimation of sample size. The results also showed that the sample sizes calculated by formula (4) under the contiguous alternatives (Table 3) were almost identical to that calculated by formula (5) under the fixed alternative (Table 4). Furthermore, the MOSLRT controlled the type I error well when the survival probability under the null was low (S₀(x) < 0.5) and was slightly more liberal when the survival probability under the null (S₀(x) ≥ 0.5) was high.

Table 3.

Sample sizes (n) were calculated from formula (4) for various of accrual period (t_a), follow-up time (t_f), landmark time point (x), and survival probabilities under null and alternative for the Weibull distribution with nominal type I error of 0.05 and power of 80%. The empirical type I errors $(\hat{α})$ and powers $(1 - \hat{β})$ were estimated based on 100,000 simulation runs.

Design

κ = 0.5

κ = 1

κ = 2

(t_a, t_f, x)

S₀(x), S₁(x)

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

(1,1,1)

0.2, 0.3

.050

.806

.050

.808

.050

.808

0.3, 0.4

115

.052

.806

106

.052

.808

.050

.806

0.4, 0.5

128

.052

.808

115

.052

.805

.050

.806

0.5, 0.6

129

.053

.807

113

.053

.805

.052

.808

0.6, 0.7

118

.054

.807

102

.053

.806

.052

.806

0.7, 0.8

.055

.803

.055

.808

.054

.810

(1,2,1)

0.2, 0.3

.050

.807

.049

.804

.048

.801

0.3, 0.4

103

.051

.809

.050

.807

.047

.803

0.4, 0.5

111

.051

.806

.049

.808

.049

.800

0.5, 0.6

109

.053

.806

.051

.807

.049

.807

0.6, 0.7

.053

.806

.051

.808

.050

.808

0.7, 0.8

.056

.808

.054

.808

.050

.819

(2,2,2)

0.2, 0.3

.051

.807

.050

.810

.050

.803

0.3, 0.4

115

.050

.805

106

.051

.808

.050

.805

0.4, 0.5

128

.052

.807

115

.052

.808

.050

.806

0.5, 0.6

129

.053

.809

113

.053

.807

.051

.808

0.6, 0.7

118

.054

.806

102

.054

.806

.053

.808

0.7, 0.8

.054

.808

.055

.808

.054

.809

(3,2,1)

0.2, 0.3

.048

.808

.048

.807

.049

.803

0.3, 0.4

.050

.806

.048

.806

.049

.805

0.4, 0.5

104

.051

.811

.051

.806

.048

.806

0.5, 0.6

100

.052

.807

.051

.809

.048

.802

0.6, 0.7

.053

.807

.052

.807

.048

.803

0.7, 0.8

.054

.813

.053

.814

.048

.812

(3,2,2)

0.2, 0.3

.051

.805

.050

.807

.049

.805

0.3, 0.4

112

.052

.809

101

.051

.808

.050

.807

0.4, 0.5

123

.052

.808

108

.051

.806

.050

.806

0.5, 0.6

123

.052

.807

105

.052

.809

.050

.806

0.6, 0.7

112

.054

.809

.054

.807

.050

.808

0.7, 0.8

.054

.809

.055

.809

.053

.810

(3,3,2)

0.2, 0.3

.050

.807

.051

.808

.048

.804

0.3, 0.4

105

.051

.804

.049

.808

.049

.804

0.4, 0.5

115

.052

.809

.051

.808

.049

.807

0.5, 0.6

113

.052

.807

.052

.807

.050

.807

0.6, 0.7

101

.054

.806

.054

.809

.050

.808

0.7, 0.8

.055

.808

.054

.809

.051

.810

Open in a new tab

Table 4.

Sample sizes (n) were calculated from formula (4) under the contiguous alternative for the Weibull, gamma, log-logistic, log-normal, and Gompertz distributions with nominal type I error of 0.05 and power of 80%. The corresponding empirical type I errors $(\hat{α})$ and powers $(1 - \hat{β})$ were estimated based on 100,000 simulation runs.

Distribution

Design

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

WB(λ, κ)

S₀(2) vs S₁(2)

κ = 0.5

κ = 1

κ = 2

0.2 vs 0.35

.052

.810

.051

.808

.050

.806

0.2 vs 0.4

.052

.815

.052

.812

.050

.815

0.3 vs 0.45

.053

.809

.053

.808

.051

.805

0.5 vs 0.65

.054

.806

.054

.812

.052

.812

0.6 vs 0.75

.055

.811

.056

.808

.053

.810

0.7 vs 0.8

104

.055

.805

.054

.807

.055

.807

GM(γ, k)

k = 0.5

k = 1

k = 2

0.2 vs 0.35

.052

.813

.050

.811

.050

.810

0.2 vs 0.4

.053

.819

.052

.811

.050

.813

0.3 vs 0.45

.052

.806

.052

.808

.050

.809

0.5 vs 0.65

.055

.806

.055

.813

.054

.807

0.6 vs 0.75

.056

.809

.056

.808

.054

.812

0.7 vs 0.8

103

.056

.807

.056

.806

.055

.810

LG(λ, p)

p = 0.5

p = 1

p = 2

0.2 vs 0.35

.050

.812

.051

.807

.051

.811

0.2 vs 0.4

.052

.809

.052

.815

.051

.818

0.3 vs 0.45

.052

.807

.053

.806

.052

.811

0.5 vs 0.65

.054

.809

.056

.808

.053

.809

0.6 vs 0.75

.056

.810

.056

.807

.055

.811

0.7 vs 0.8

106

.054

.809

.055

.806

.055

.809

LN(μ, σ)

σ = 2

σ = 1

σ = 0.5

0.2 vs 0.35

.051

.808

.052

.811

.050

.803

0.2 vs 0.4

.052

.815

.051

.818

.050

.807

0.3 vs 0.45

.053

.811

.052

.809

.052

.809

0.5 vs 0.65

.054

.807

.054

.811

.051

.808

0.6 vs 0.75

.056

.807

.057

.809

.054

.812

0.7 vs 0.8

102

.057

.807

.054

.809

.055

.807

GZ(θ, γ)

γ = 0.5

γ = 1

γ = 2

0.2 vs 0.35

.050

.807

.048

.809

.049

.808

0.2 vs 0.4

.050

.808

.050

.807

.049

.800

0.3 vs 0.45

.050

.806

.051

.805

.049

.806

0.5 vs 0.65

.054

.807

.051

.810

.048

.804

0.6 vs 0.75

.054

.812

.053

.809

.049

.809

0.7 vs 0.8

.054

.809

.055

.808

.051

.800

Open in a new tab

Footnote: abbreviation WB: Weibull; GM: gamma; LG: log-logistic; LN: log-normal; GZ: Gompetz

6 Example

Between January 1974 and May 1984, the Mayo Clinic conducted a double-blind randomized trial on treating primary biliary cirrhosis of the liver (PBC), comparing the drug D-penicillamine (DPCA) with a placebo (Fleming and Harrington, 1991). PBC is a rare but fatal chronic liver disease of unknown cause, with a prevalence of approximately 50 cases per million in the population. The primary pathologic event appears to be the destruction of the interlobular bile ducts, which may be mediated by immunologic mechanisms. Of 158 patients treated with DPCA, 65 died. The median survival time was 9 years. Suppose an experimental treatment is now available and investigators wish to design a new trial using the Mayo Clinic patients treated with DPCA as the historical data with which to formulate the hypothesis. The survival distribution of the DPCA data is estimated by a Kaplan-Meier curve, a spline version of the survival distribution, which is fitted by using the R function oldlogspline, and the Weibull distribution, which is fitted by using the R function survreg with the estimated shape parameter κ = 1.22 (Figure 1). Both the spline and Weibull distributions are fitted well and are close to the Kaplan-Meier curve. The 5-year survival probability estimate from the Kaplan-Meier curve is 71%. Thus, for the trial design, S₀(5) = 71% is the 5-year survival probability at which investigators are no longer interested in the experimental treatment, and S₁(5) = 82% is the 5-year survival probability at which investigators consider the experimental treatment to be promising. Then, the hazard ratio is δ = log(0.82) / log(0.71) = 0.58. To calculate the sample size, we assume a uniform accrual with an accrual period t_a = 8 years and a follow-up period t_f = 3 years, with no patient being lost to follow-up. Thus, given a type I error of α = 5% and power of 1−β = 80%, the required sample sizes calculated using the R function SIZE (Appendix 2) are 63, 63, and 63 under the Weibull, spline and Kaplan-Meier curve, respectively. With a power of 90%, the required sample sizes are 88, 87 and 88 under the Weibull, spline and Kaplan-Meier curve, respectively. Sample size calculations under the spline distribution and Kaplan-Meier curve make no assumption regarding the underlying survival distribution. Thus, this approach takes advantage of possible misspecification by using a parametric survival distribution for the trial design.

Step functions are the Kaplan-Meier survival curve and its 95% confidence boundaries. Solid and dark solid curves are the fitted Weibull and spline survival distributions, respectively.

7 Conclusion

In this paper, formulae for the number of events and sample size for a single-arm phase II survival trial are derived for the MOSLRT under the proportional hazards model. The new sample size formula is simple and easy to compute. The simulation results show that the proposed formula provides an accurate estimation of sample size for trial design. The sample size calculation using the new formula is extended to a class of flexible survival distributions, including a Kaplan-Meier curve or a spline version of the survival distribution, and has been implemented in the R function SIZE for trial design.

Table 5.

Sample sizes (n) were calculated from formula (5) under the fixed alternative for the Weibull, gamma, log-logistic, log-normal, and Gompertz distributions with nominal type I error of 0.05 and power of 80%. The empirical type I errors $(\hat{α})$ and powers $(1 - \hat{β})$ were estimated based on 100,000 simulation runs.

Distribution

Design

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

\hat{α}

1 - \hat{β}

WB(λ, κ)

S₀(2) vs S₁(2)

κ = 0.5

κ = 1

κ = 2

0.2 vs 0.35

.051

.801

.051

.811

.049

.812

0.2 vs 0.4

.051

.802

.051

.811

.052

.818

0.3 vs 0.45

.053

.803

.051

.803

.050

.807

0.5 vs 0.65

.054

.795

.052

.799

.053

.805

0.6 vs 0.75

.056

.796

.056

.798

.055

.804

0.7 vs 0.8

100

.055

.793

.055

.794

.055

.798

GM(γ, k)

k = 0.5

k = 1

k = 2

0.2 vs 0.35

.051

.806

.051

.810

.051

.812

0.2 vs 0.4

.051

.808

.051

.812

.051

.813

0.3 vs 0.45

.053

.803

.053

.804

.051

.802

0.5 vs 0.65

.054

.795

.054

.798

.054

.800

0.6 vs 0.75

.056

.796

.055

.797

.055

.796

0.7 vs 0.8

.055

.792

.056

.790

.054

.795

LG(λ, p)

p = 0.5

p = 1

p = 2

0.2 vs 0.35

.053

.804

.052

.807

.051

.803

0.2 vs 0.4

.053

.808

.053

.813

.052

.805

0.3 vs 0.45

.053

.804

.053

.801

.052

.804

0.5 vs 0.65

.056

.799

.054

.795

.054

.804

0.6 vs 0.75

.056

.798

.056

.796

.055

.797

0.7 vs 0.8

101

.055

.791

.055

.795

.054

.797

LN(μ, σ)

σ = 2

σ = 1

σ = 0.5

0.2 vs 0.35

.052

.805

.051

.808

.050

.805

0.2 vs 0.4

.053

.813

.051

.807

.051

.809

0.3 vs 0.45

.052

.801

.054

.806

.051

.807

0.5 vs 0.65

.055

.798

.053

.800

.052

.803

0.6 vs 0.75

.056

.794

.056

.796

.052

.803

0.7 vs 0.8

.057

.793

.055

.798

.054

.803

GZ(θ, γ)

γ = 0.5

γ = 1

γ = 2

0.2 vs 0.35

,049

.810

.049

.809

.050

.816

0.2 vs 0.4

.049

.818

.050

.821

.049

.824

0.3 vs 0.45

.052

.804

.050

.810

.050

.811

0.5 vs 0.65

.052

.808

.052

.811

.049

.820

0.6 vs 0.75

.055

.805

.053

.811

.051

.819

0.7 vs 0.8

.054

.797

.053

.802

.050

.814

Open in a new tab

Footnote: abbreviation WB: Weibull; GM: gamma; LG: log-logistic; LN: log-normal; GZ: Gompetz

Acknowledgments

The author acknowledges two anonymous reviewers and an editor for their valuable comments that improved an earlier version of the paper. The work was supported in part by the National Cancer Institute support grant P30CA021765 and ALSAC.

Appendix 1: Derivation of the asymptotic distribution for the MOSLRT

The methods used to derive the asymptotic distribution of the MOSLRT under the contiguous alternative are similar to the derivation of the two-sample log-rank test (Fleming and Harrington, 1991).

Consider a sequence of contiguous alternatives H₁_n : λ₁_n(t) = e⁻^γ¹ⁿλ₀(t), where γ₁_n is a sequence of positive constants satisfying n^1/2γ₁_n = b < ∞. Then, the weighted one-sample log-rank score W is given by

W = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{\infty} w_{n} (t) {d N_{i} (t) - Y_{i} (t) d Λ_{0} (t)},

where w_n(t) is a weight function convergence to w(t) as n → ∞. If we further define a sequence of martingale by

M_{i}^{(n)} (t) = N_{i} (t) - \int_{0}^{t} Y_{i} (u) e^{- γ_{1 n}} d Λ_{0} (u)

and let

W_{M} = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{\infty} w_{n} (t) d M_{i}^{(n)} (t),

W_{D} = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{\infty} (e^{- γ_{1 n}} - 1) w_{n} (t) Y_{i} (t) d Λ_{0} (t),

then we have

W = W_{M} + W_{D} .

It is easy to show that

W_{M} = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{\infty} w (t) d M_{i}^{(n)} (t) + o_{p} (1)

and

W_{D} = - n^{1 / 2} γ_{1 n} \int_{0}^{\infty} w (t) {n^{- 1} \sum_{i = 1}^{n} Y_{i} (t)} d Λ_{0} (t) + o_{p} (1) .

n^{1 / 2} γ_{1 n} = b and n^{- 1} \sum_{i = 1}^{n} Y_{i} (t) \to π (t),

where π(t) = P(X ≥ t), we have where

W_{D} \to - b V,

where

V = \int_{0}^{\infty} w^{2} (t) π (t) d Λ_{0} (t) .

Therefore,

W = n^{- 1 / 2} \sum_{1}^{n} \int_{0}^{\infty} w (t) d M_{i}^{(n)} (t) - b V + o_{p} (1) .

By the martingale central limit theorem (Fleming and Harrington, 1991), W is approximate normal with mean −bV and variance V. When w(t) = 1, the variance V reduces to

p_{0} = \int_{0}^{\infty} π (t) d Λ_{0} (t) = \int_{0}^{\infty} G (t) S_{0} (t) d Λ_{0} (t) .

Hence,

W \to N (- b p_{0}, p_{0}) .

By the dominated convergence theorem, ${\hat{σ}}^{2} \to p_{0} = \int_{0}^{\infty} G (t) S_{0} (t) d Λ_{0} (t)$ . Finally, by Slutsky’s theorem, it follows that

L = W / \hat{σ} \to N (- b p_{0}^{1 / 2}, 1) .

Appendix 2: R code for the sample size calculation under the Weibull distribution, spline distribution, and Kaplan-Meier curve

library (survival)

library (polspline)

time=c

(1.10,

12.33,

2.77,

5.27,

6.58,

9.82,

0.36,

11.59,

1.84,

11.18,

10.78,

0.61,

6.29,

12.24,

3.70,

12.48,

6.18,

7.12,

6.54,

2.74,

3.93,

3.73,

8.99,

12.22,

6.09,

11.96,

10.94,

11.48,

11.07,

3.21,

9.47,

5.01,

3.26,

0.19,

4.63,

10.16,

6.96,

9.79,

11.10,

4.54,

0.54,

4.77,

7.37,

1.06,

10.72,

2.05,

10.55,

10.47,

8.49,

8.45,

8.83,

7.08,

5.77,

6.44,

2.68,

9.14,

2.97,

6.27,

1.41,

5.57,

9.03,

2.66,

8.41,

2.26,

2.84,

8.87,

8.07,

8.63,

8.49,

8.19,

3.55,

8.38,

8.36,

8.21,

2.09,

7.86,

3.16,

7.84,

0.38,

6.78,

5.63,

2.95,

4.61,

7.38,

7.05,

7.28,

7.24,

4.09,

7.07,

7.00,

6.92,

4.32,

6.39,

6.86,

6.69,

6.71,

6.38,

6.47,

6.48,

4.36,

6.22,

5.70,

6.18,

5.95,

2.48,

5.97,

5.94,

5.95,

3.38,

0.92,

5.33,

5.54,

2.74,

0.95,

5.35,

5.29,

5.23,

5.16,

1.90,

5.02,

4.96,

4.63,

3.93,

2.01,

4.88,

3.99,

4.85,

4.84,

2.02,

4.66,

4.42,

4.66,

4.42,

0.49,

3.26,

4.30,

4.18,

3.96,

3.70,

4.06,

3.87,

0.11,

3.84,

3.86,

3.38,

2.19,

3.73,

2.47,

3.57,

2.40,

1.46,

3.54,

3.48,

3.37,

3.16,

2.57,

2.30)

Open in a new tab

status=c

(1,

0,

1,

1

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

1

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

0

0,

1,

0,

1,

0,

1,

0,

1,

0,

1,

0,

0

1,

0,

1,

0,

1,

0,

1,

0,

1

0,

0)

Open in a new tab

`dat=data.frame(time=time, status=status)`
`SIZE=function(delta, ta, tf, alpha, beta, data)`
`{`
`tau=tf+ta`
`z0=qnorm(1-alpha)`
`z1=qnorm(1-beta)`
`######### fit KM curve #############`
`surv=Surv(time, status)`
`fitKM<- survfit(surv ~ 1, data = dat)`
`p0<-c(1, summary(fitKM)$surv)`	`# KM survival probability ###`
`t0<-c(0, summary(fitKM)$time)`	`# ordered failure times ###`
`outKM<-data.frame(t0=t0,p0=p0)`
`KM<-function(t){`
`t0=outKM$t0; p0=outKM$p0; k=length(t0)`
`if (t>=t0[k] \|\| t<0) {ans<-0}`
`for (i in 1:(k-1)){`
`if (t>=t0[i] & t<t0[i+1]) {S0=p0[i]}}`
`return(S0)}`
`######### fit Weibull curve ###########`
`fitWB=survreg(formula=surv~1, dist=“weibull”)`
`scale=as.numeric(exp(fitWB$coeff))`
`shape=1/fitWB$scale`
`WB=function(t){`
`kappa=shape; lambda0=1/scaleˆkappa`
`S0 = exp(-lambda0*tˆkappa); return(S0)}`
`######## fit spline curve ##########`
`fitSP=oldlogspline(time[status == 1], time[status == 0], lbound = 0)`
`SP=function(t) {S0=1-poldlogspline(t, fitSP); return(S0)}`
`####### sample size calculation #####`
`S0=function(t){WB(t)}`
`S1=function(t){WB(t)ˆdelta}`
`p0=1-integrate(S0, tf, tau)$value/ta`
`p1=1-integrate(S1, tf, tau)$value/ta`
`PWB=(p0+p1)/2`
`S0=function(t){SP(t)}`
`S1=function(t){SP(t)ˆdelta}`
`p0=1-integrate(S0, tf, tau)$value/ta`
`p1=1-integrate(S1, tf, tau)$value/ta`
`PSP=(p0+p1)/2`
`S0=function(t){KM(t)}`
`S1=function(t){KM(t)ˆdelta}`
`p0=1-(S0(tf)+4S0(0.5ta+tf)+S0(ta+tf))/6`
`p1=1-(S1(tf)+4S1(0.5ta+tf)+S1(ta+tf))/6`
`PKM=(p0+p1)/2`
`d0=(z0+z1)ˆ2/log(delta)ˆ2`		`# number of events` formula (3)
`nWB=ceiling(d0/PWB)`		`# sample size` formula (4) `under Weibull model`
`nSP=ceiling(d0/PSP)`		`# sample size` formula (4) `under spine curve`
`nKM=ceiling(d0/PKM)`		`# sample size` formula (4) `under KM curve`
`d=ceiling(d0)`
`ans=list(c(d=d, nWB=nWB, nSP=nSP, nKM=nKM))`
`return(ans)`
`}`
`#### 80% power ####`
`SIZE(delta=0.58, ta=8, tf=3, alpha=0.05, beta=0.2, data=dat)`
`d nWB nSP nKM`
`21 63 63 63`
`#### 90% power ####`
`SIZE(delta=0.58, ta=8, tf=3, alpha=0.05, beta=0.1, data=dat)`
`d nWB nSP nKM`
`29 88 87 88`

Open in a new tab

References

Anderson TML, Dickman PW, Eloranta S, Lambe M, Lambert PC. Estimating the loss in expectation of life due to cancer using flexible parametric survival models. Statistics in Medicine. 2013;32:5286–5300. doi: 10.1002/sim.5943. [DOI] [PubMed] [Google Scholar]
Bantis LE, Tsimikas JV, Georgiou SD. Survival estimation through the cumulative hazard function with monotone natural cubic splines. Lifetime Data Analysis. 2012;18:364–396. doi: 10.1007/s10985-012-9218-4. [DOI] [PubMed] [Google Scholar]
Barthel FMS, Babiker A, Royston P, Parmar MKB. Evaluation of sample size and power for multi-arm survival trials allowing for nonproportional hazards, loss to follow-up and crossover. Statistics in Medicine. 2006;25:2521–2542. doi: 10.1002/sim.2517. [DOI] [PubMed] [Google Scholar]
Breslow NE. Analysis of survival data under the proportional hazards model. International Statistical Review. 1975;43:44–58. [Google Scholar]
Finkelstein DM, Muzikansky A, Schoenfeld DA. Comparing survival of a sample to that of a standard population. Journal of National Cancer Institute. 2003;95:1434–1439. doi: 10.1093/jnci/djg052. [DOI] [PubMed] [Google Scholar]
Fleming TR, Harrington DP. Counting processes and survival analysis. New York: John Wiley and Sons; 1991. [Google Scholar]
George SL, Desu MM. Planning the size and duration of a clinical trial studying the time to some critical event. Journal of Chronic Diseases. 1977;27:15–24. doi: 10.1016/0021-9681(74)90004-6. (1977) [DOI] [PubMed] [Google Scholar]
Kooperberg C, Stone CJ. Logspline density estimation for censored data. Journal of Computational and Graphical Statistics. 1992;1:301–328. [Google Scholar]
Kwak M, Jung SH. Phase II clinical trials with time-to-event endpoints: Optimal two-stage designs with one-sample log-rank test. Statistics in Medicine. 2014;33:2004–2016. doi: 10.1002/sim.6073. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lin DY, Yao Q, Ying ZL. A general theory on stochastic curtailment for censored survival data. Journal of the American Statistical Association. 1999;94:510–521. [Google Scholar]
Lachin JM. Introduction to sample size determination and power analysis for clinical trials. Controlled Clinical Trials. 1981;2:93–114. doi: 10.1016/0197-2456(81)90001-5. [DOI] [PubMed] [Google Scholar]
Lakatos E. Sample size based on the log-rank statistic in complex clinical trials. Biometrics. 1988;44:229–241. [PubMed] [Google Scholar]
Owzar K, Jung SH. Designing phase II trials in cancer with time-to-event endpoints (with discussion) Clinical Trials. 2008;5:209–221. doi: 10.1177/1740774508091748. [DOI] [PubMed] [Google Scholar]
Rubenstein LV, Gail MH, Santner TJ. Planning the duration of a comparative clinical trial with loss to follow-up and a period of continued observation. Journal of Chronic Diseases. 1981;34:469–479. doi: 10.1016/0021-9681(81)90007-2. [DOI] [PubMed] [Google Scholar]
Schoenfeld DA. Sample-size formula for the proportional-hazards regression model. Biometrics. 1983;39:499–503. [PubMed] [Google Scholar]
Sun XQ, Peng P, Tu DS. Phase II cancer clinical trial with a one-sample log-rank test and its corrections based on the Edgeworth expansion. Contemporary Clinical Trials. 2011;32:108–113. doi: 10.1016/j.cct.2010.09.009. [DOI] [PubMed] [Google Scholar]
Woolson RF. Rank-tests and a one-sample log-rank test for comparing observed survival-data to a standard population. Biometrics. 1981;37:687–696. [Google Scholar]
Wu J. Single-arm phase II cancer survival trial designs. Journal of Biopharmaceutical Statistics. 2015 doi: 10.1080/10543406.2015.1052494. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] Anderson TML, Dickman PW, Eloranta S, Lambe M, Lambert PC. Estimating the loss in expectation of life due to cancer using flexible parametric survival models. Statistics in Medicine. 2013;32:5286–5300. doi: 10.1002/sim.5943. [DOI] [PubMed] [Google Scholar]

[R2] Bantis LE, Tsimikas JV, Georgiou SD. Survival estimation through the cumulative hazard function with monotone natural cubic splines. Lifetime Data Analysis. 2012;18:364–396. doi: 10.1007/s10985-012-9218-4. [DOI] [PubMed] [Google Scholar]

[R3] Barthel FMS, Babiker A, Royston P, Parmar MKB. Evaluation of sample size and power for multi-arm survival trials allowing for nonproportional hazards, loss to follow-up and crossover. Statistics in Medicine. 2006;25:2521–2542. doi: 10.1002/sim.2517. [DOI] [PubMed] [Google Scholar]

[R4] Breslow NE. Analysis of survival data under the proportional hazards model. International Statistical Review. 1975;43:44–58. [Google Scholar]

[R5] Finkelstein DM, Muzikansky A, Schoenfeld DA. Comparing survival of a sample to that of a standard population. Journal of National Cancer Institute. 2003;95:1434–1439. doi: 10.1093/jnci/djg052. [DOI] [PubMed] [Google Scholar]

[R6] Fleming TR, Harrington DP. Counting processes and survival analysis. New York: John Wiley and Sons; 1991. [Google Scholar]

[R7] George SL, Desu MM. Planning the size and duration of a clinical trial studying the time to some critical event. Journal of Chronic Diseases. 1977;27:15–24. doi: 10.1016/0021-9681(74)90004-6. (1977) [DOI] [PubMed] [Google Scholar]

[R8] Kooperberg C, Stone CJ. Logspline density estimation for censored data. Journal of Computational and Graphical Statistics. 1992;1:301–328. [Google Scholar]

[R9] Kwak M, Jung SH. Phase II clinical trials with time-to-event endpoints: Optimal two-stage designs with one-sample log-rank test. Statistics in Medicine. 2014;33:2004–2016. doi: 10.1002/sim.6073. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] Lin DY, Yao Q, Ying ZL. A general theory on stochastic curtailment for censored survival data. Journal of the American Statistical Association. 1999;94:510–521. [Google Scholar]

[R11] Lachin JM. Introduction to sample size determination and power analysis for clinical trials. Controlled Clinical Trials. 1981;2:93–114. doi: 10.1016/0197-2456(81)90001-5. [DOI] [PubMed] [Google Scholar]

[R12] Lakatos E. Sample size based on the log-rank statistic in complex clinical trials. Biometrics. 1988;44:229–241. [PubMed] [Google Scholar]

[R13] Owzar K, Jung SH. Designing phase II trials in cancer with time-to-event endpoints (with discussion) Clinical Trials. 2008;5:209–221. doi: 10.1177/1740774508091748. [DOI] [PubMed] [Google Scholar]

[R14] Rubenstein LV, Gail MH, Santner TJ. Planning the duration of a comparative clinical trial with loss to follow-up and a period of continued observation. Journal of Chronic Diseases. 1981;34:469–479. doi: 10.1016/0021-9681(81)90007-2. [DOI] [PubMed] [Google Scholar]

[R15] Schoenfeld DA. Sample-size formula for the proportional-hazards regression model. Biometrics. 1983;39:499–503. [PubMed] [Google Scholar]

[R16] Sun XQ, Peng P, Tu DS. Phase II cancer clinical trial with a one-sample log-rank test and its corrections based on the Edgeworth expansion. Contemporary Clinical Trials. 2011;32:108–113. doi: 10.1016/j.cct.2010.09.009. [DOI] [PubMed] [Google Scholar]

[R17] Woolson RF. Rank-tests and a one-sample log-rank test for comparing observed survival-data to a standard population. Biometrics. 1981;37:687–696. [Google Scholar]

[R18] Wu J. Single-arm phase II cancer survival trial designs. Journal of Biopharmaceutical Statistics. 2015 doi: 10.1080/10543406.2015.1052494. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Single-Arm Phase II Survival Trial Design Under the Proportional Hazards Model

Jianrong Wu

Abstract

1 Introduction

2 Test Statistics

3 Sample Size Formulae

4 Parameter Setting for Trial Design

Table 1.

5 Simulation studies

Table 2.

Table 3.

Table 4.

6 Example

Figure 1.

7 Conclusion

Table 5.

Acknowledgments

Appendix 1: Derivation of the asymptotic distribution for the MOSLRT

Appendix 2: R code for the sample size calculation under the Weibull distribution, spline distribution, and Kaplan-Meier curve

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Single-Arm Phase II Survival Trial Design Under the Proportional Hazards Model

Jianrong Wu

Abstract

1 Introduction

2 Test Statistics

3 Sample Size Formulae

4 Parameter Setting for Trial Design

Table 1.

5 Simulation studies

Table 2.

Table 3.

Table 4.

6 Example

Figure 1.

7 Conclusion

Table 5.

Acknowledgments

Appendix 1: Derivation of the asymptotic distribution for the MOSLRT

Appendix 2: R code for the sample size calculation under the Weibull distribution, spline distribution, and Kaplan-Meier curve

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases