Author manuscript; available in PMC: 2021 Apr 1.
Published in final edited form as: Lifetime Data Anal. 2019 Apr 12;26(2):266–291. doi: 10.1007/s10985-019-09470-4

Group-sequential logrank methods for trial designs using bivariate non-competing event-time outcomes

Tomoyuki Sugimoto 1, Toshimitsu Hamasaki 2, Scott R Evans 3, Susan Halabi 4
PMCID: PMC7517875  NIHMSID: NIHMS1629114  PMID: 30980317

Abstract

We discuss the multivariate (2L-variate) correlation structure and the asymptotic distribution of the group-sequential weighted logrank statistics formulated when monitoring two correlated event-time outcomes in clinical trials. The asymptotic distribution and the variance-covariance matrix of the 2L-variate weighted logrank statistic are derived in forms applicable to various group-sequential trial designs. These results are used to construct a group-sequential testing procedure based on calendar times or information fractions. We apply the theoretical results to a group-sequential method for monitoring a clinical trial with early stopping for efficacy when the trial is designed to evaluate the joint effect on two correlated event-time outcomes. We illustrate the method with an application to a clinical trial and describe how to calculate the required sample sizes and numbers of events.

Keywords: Bivariate dependence, Error-spending method, Independent censoring, Logrank statistic, Non-fatal events, Normal approximation

1. Introduction

Event-time outcomes are commonly used for evaluating the effect of a test intervention compared with a control. In some disease areas, e.g. HIV, oncology or cardiovascular disease, several event-time outcomes are used as the primary endpoints to more completely characterize the effect of an intervention on participants. Clinical trials with more than one primary endpoint can be designed to evaluate effects for all of the outcomes (i.e. co-primary endpoints) or to evaluate effects for at least one outcome (i.e. multiple primary endpoints). However, clinical trials with multiple event-time outcomes can be expensive and resource intensive as they often require large numbers of participants, collection of massive amounts of data, and long duration of follow-up. The use of group-sequential designs has the potential to improve efficiency, i.e. offering potentially fewer required trial participants, shortening the duration of clinical trials, and thus reducing the costs. Several authors have discussed group-sequential designs for multiple continuous or binary endpoints (e.g., Tang et al. 1989; Cook and Farewell 1994; Jennison and Turnbull 2000; Kosorok et al. 2004; Hung et al. 2007; Glimm et al. 2009; Tamhane et al. 2010, 2012; Asakura et al. 2014). Group-sequential theory and methods for single event-time outcomes have been studied (e.g., Tsiatis 1982; Slud and Wei 1982; Gordon and Lachin 1990; Gu and Lai 1991; Tsiatis et al. 1995; Lin et al. 1996; Lai and Shih 2004; Gombay 2008; Wu and Xiong 2017), and extended for multiple event-time outcomes (e.g., Wei and Lachin 1984; Pocock et al. 1987; Wei et al. 1990; Lin 1991; Cook and Farewell 1994) and for paired event-time data (e.g., Murray 2000; Andrei and Murray 2005; Jung 2008). Despite the extensive literature in group-sequential methods, there is a lack in the theory regarding the asymptotic structure of the weighted logrank statistics when group-sequentially comparing multiple event-time outcomes. 
The absence of this theory slows the implementation and application of group-sequential methodologies and creates challenges in calculating the power and the required sample size for multiple event-time outcomes.

We discuss a fundamental theory and methodology for group-sequential designs based on the weighted logrank statistic when monitoring several correlated event-time outcomes in clinical trials. We focus on bivariate event-time data rather than general multivariate event times, and consider a scenario where both events are non-fatal, as an extension of an existing method (Sugimoto et al. 2013). When deriving the asymptotic distribution of the group-sequential logrank statistics, the two martingale components associated with the event-time outcomes are correlated on different time axes, so it is difficult to directly apply standard martingale theory for survival analysis, such as Rebolledo's central limit theorem. We overcome this challenge by combining a martingale approach with Itô's formula, and provide an asymptotic formula for the group-sequential bivariate logrank statistic. We then apply the asymptotic result to group-sequential designs to evaluate a joint effect on both outcomes. We illustrate the design methodology with a clinical trial example.

This paper is organized as follows: in Sect. 2 we describe how the group-sequential weighted logrank statistic is applied to bivariate event-time data in a clinical trial. In Sect. 3, we discuss the asymptotic distribution with an explicit variance-covariance form for the bivariate version of group-sequential weighted logrank statistic, fundamental for determining the information fraction for each outcome and evaluating the probability of rejecting the null hypotheses. In Sect. 4, we apply the asymptotic result to a group-sequential clinical trial evaluating the joint effect on the co-primary endpoints. We outline how both or one of the outcomes are monitored and evaluated. In Sect. 5, we summarize the findings and discuss their implications.

2. Group-sequential bivariate event-time data and the logrank statistic

Consider designing a randomized group-sequential clinical trial comparing two interventions evaluating bivariate event-time outcomes. Suppose that up to the planned maximum number of participants nL will be recruited during an entry period and followed to observe the bivariate survival outcomes. Further, suppose interim analyses are planned with the pre-specified maximum number of analyses L. Let nℓ and τℓ be the cumulative total number of participants and the analysis time at the ℓth interim analysis, respectively, with n1 ≤ ⋯ ≤ nL and τ1 < ⋯ < τL, and let [0, τA] be the period during which trial recruitment is performed or planned in advance. The group index of intervention is denoted by j = 2 if the ith participant belongs to the test group and j = 1 otherwise. Let n1ℓ and n2ℓ denote the numbers of participants assigned to the control and test interventions at the ℓth analysis, respectively (nℓ = n1ℓ + n2ℓ), where the fractions nj1/n1, …, njL/nL may often be assumed to be approximately equal within each intervention. For i = 1, …, nL and k = 1, 2, let Oi be the ith participant's entry time into the trial, let Tik be the ith participant's underlying continuous event time for the kth outcome, and let Ci be the ith participant's underlying censoring time common to the two outcomes, where Oi is the time origin of Tik and Ci and is usually generated from the uniform distribution on the entry period [0, τA]. The bivariate time (Ti1, Ti2) follows the joint survival distribution denoted by

$S_j(t, s) = P(t < T_{i1},\, s < T_{i2} \mid g_i = j),$

gi is the ith group index of intervention, and all of the Ci's follow the identical survival distribution C(t) = P(t < Ci) independently of (Ti1, Ti2). Thus, the ith right-censoring time at the ℓth analysis is Ci(ℓ), where

$C_i^{(\ell)} = \min\bigl(C_i,\ \max(\tau_\ell - O_i,\, 0)\bigr).$

We will assume no dropouts, so that Ci = τL − Oi is observed in a well-controlled trial. Suppose that Ti1 and Ti2 are non-competing event times, that is, neither event time is censored by the occurrence of the other event, which is typical in the case of non-fatal events (Sugimoto et al. 2013). For simplicity of notation, we write O1 ≤ ⋯ ≤ OnL, although we assume that Oi and Oi′ for i ≠ i′ are mutually independent. Hence, we have a series of cumulative data sets denoted by {(Ti1(ℓ), Ti2(ℓ), Δi1(ℓ), Δi2(ℓ), gi)}, i = 1, …, nℓ, ℓ = 1, …, L, where Tik(ℓ) = min(Tik, Ci(ℓ)) and Δik(ℓ) = 1{Tik < Ci(ℓ)} are the ith observable time and censoring indicator for the kth outcome at the ℓth analysis, respectively, and 1{·} is the indicator function. The information in (Tik(ℓ), Δik(ℓ)) is also represented by the counting process Nik(ℓ)(t) = 1{Tik(ℓ) ≤ t, Δik(ℓ) = 1} and the at-risk process Yik(ℓ)(t) = 1{Tik(ℓ) ≥ t}. Denote their sums over group j for the kth outcome by

$\bar N_{jk}^{(\ell)}(t) = \sum_{i=1}^{n_\ell} 1\{g_i = j\}\, N_{ik}^{(\ell)}(t), \qquad \bar Y_{jk}^{(\ell)}(t) = \sum_{i=1}^{n_\ell} 1\{g_i = j\}\, Y_{ik}^{(\ell)}(t),$

$\bar N_k^{(\ell)}(t) = \bar N_{1k}^{(\ell)}(t) + \bar N_{2k}^{(\ell)}(t)$ and $\bar Y_k^{(\ell)}(t) = \bar Y_{1k}^{(\ell)}(t) + \bar Y_{2k}^{(\ell)}(t)$.

Also, let λjk(t) and Λjk(t) be the marginal hazard function and its cumulative function for the kth event time Tik in the group j, respectively. Denote the marginal hazard ratio for the kth outcome between the two groups by ψk(t) = λ2k(t)/λ1k(t) and let ψ(t) = (ψ1(t), ψ2(t))T.
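As a small numerical illustration of the counting-process notation above, the pooled processes can be computed directly from observable times and censoring indicators. This is a sketch with hypothetical data (the array values below are made up for illustration):

```python
import numpy as np

# Hypothetical observable data for one outcome k at one analysis:
# times T~_ik (event or censoring, whichever came first) and indicators
# Delta_ik (1 = event observed before censoring).
times = np.array([2.0, 5.0, 3.0, 7.0])
deltas = np.array([1, 0, 1, 1])

def counting_process(t, times, deltas):
    """Pooled counting process N(t): number of events observed by time t."""
    return int(np.sum((times <= t) & (deltas == 1)))

def at_risk(t, times):
    """Pooled at-risk process Y(t): participants with observable time >= t."""
    return int(np.sum(times >= t))
```

N(t) is a step function jumping at observed event times, while Y(t) is non-increasing as participants fail or are censored.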

We are interested in testing sequentially either the hypothesis H0cp = H01 ∪ H02 (for a joint effect) or H0mp = H01 ∩ H02 (for at least one effect) using the weighted logrank statistics, where H0k is the single null hypothesis for the kth outcome, "ψk(t) = 1 for all t". For the bivariate event-time outcome with L maximum analyses, we have a set of 2L group-sequential weighted logrank statistics,

$\hat Z = (\hat Z_1(\tau_1), \ldots, \hat Z_1(\tau_L),\ \hat Z_2(\tau_1), \ldots, \hat Z_2(\tau_L))^{\mathsf T}$

composed of

$\hat Z_k(\tau_\ell) = \frac{\sqrt{n_\ell}\; U_k^{(\ell)}(\tau_\ell)}{\sqrt{\hat V_{kk}^{0(\ell)}(\tau_\ell)}}, \qquad k = 1, 2,\ \ell = 1, \ldots, L,$

where √nℓ Uk(ℓ)(t) is the weighted logrank process associated with the analysis time τℓ,

$U_k^{(\ell)}(t) = \int_0^t \hat H_k^{(\ell)}(s)\,\bigl\{d\hat\Lambda_{1k}^{(\ell)}(s) - d\hat\Lambda_{2k}^{(\ell)}(s)\bigr\},$

V̂kk0(ℓ)(t) is the conditional variance of √nℓ Uk(ℓ)(t) under the null hypothesis H0k,

$\hat V_{kk}^{0(\ell)}(t) = \int_0^t \hat H_k^{(\ell)}(s)^2 \left\{1 - \frac{d\bar N_k^{(\ell)}(s) - 1}{\bar Y_k^{(\ell)}(s) - 1}\right\} \left\{ \frac{d\hat\Lambda_{1k}^{(\ell)}(s)}{n_\ell^{-1}\,\bar Y_{2k}^{(\ell)}(s)} + \frac{d\hat\Lambda_{2k}^{(\ell)}(s)}{n_\ell^{-1}\,\bar Y_{1k}^{(\ell)}(s)} \right\}.$

Also, $\hat\Lambda_{jk}^{(\ell)}(t) = \int_0^t d\bar N_{jk}^{(\ell)}(s)/\bar Y_{jk}^{(\ell)}(s)$ is the Nelson-Aalen estimator at the ℓth analysis for the kth outcome in group j, and Ĥk(ℓ)(s) is the following function involving the weight Ŵk(ℓ) of the class K (Fleming and Harrington 1991):

$\hat H_k^{(\ell)}(s) = n_\ell^{-1}\, \hat W_k^{(\ell)}(s)\, \frac{\bar Y_{1k}^{(\ell)}(s)\, \bar Y_{2k}^{(\ell)}(s)}{\bar Y_k^{(\ell)}(s)},$

Ŵk(ℓ)(s) = f(Ŝk(ℓ)(s−)) or Ŵk(ℓ)(s) = f(nℓ−1Ȳk(ℓ)(s)), f(·) is a nonnegative bounded continuous function with bounded variation on [0, 1], and Ŝk(ℓ)(s) is the Kaplan-Meier estimator for the kth outcome in the pooled sample at the ℓth analysis time τℓ. A well-known fact is that the logrank and Prentice-Wilcoxon statistics use Ŵk(ℓ)(s) = 1 and Ŵk(ℓ)(s) = Ŝk(ℓ)(s−), respectively, where s− denotes a time just prior to s. The weight Ŵk(ℓ) should be selected to detect a clinically significant difference effectively. If there is no prior assumption on a specific clinically significant difference, the logrank statistic may be adopted, which can be interpreted as detecting a difference in the mean hazard rate. Also, one can build an optimal weight for testing into the design if pilot data or a registry database are available.
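A single-outcome, single-analysis version of Ẑk(τℓ) can be sketched as follows. This is a minimal illustrative implementation (not the authors' code) using the standard tie-corrected variance; the weight function receives the pooled Kaplan-Meier estimate just prior to each event time, so the default gives the logrank statistic and weight(s) = s the Prentice-Wilcoxon statistic:

```python
import numpy as np

def weighted_logrank(time, delta, group, weight=lambda s: 1.0):
    """Two-sample weighted logrank Z-statistic; group coded 1 (control) / 2 (test).
    `weight` is applied to the pooled Kaplan-Meier estimate just prior to each
    event time."""
    time, delta, group = map(np.asarray, (time, delta, group))
    U, V, km = 0.0, 0.0, 1.0
    for t in np.unique(time[delta == 1]):
        y1 = np.sum((time >= t) & (group == 1))   # at risk, control
        y2 = np.sum((time >= t) & (group == 2))   # at risk, test
        y = y1 + y2
        d1 = np.sum((time == t) & (delta == 1) & (group == 1))
        d = np.sum((time == t) & (delta == 1))    # pooled events at t
        w = weight(km)                            # KM just prior to t
        U += w * (d1 - d * y1 / y)                # observed minus expected
        if y > 1:
            V += w ** 2 * d * (y1 / y) * (y2 / y) * (y - d) / (y - 1)
        km *= 1.0 - d / y                         # update pooled KM
    return U / np.sqrt(V)
```

A positive value indicates a lower hazard in the test group, matching the one-sided rejection regions used later in the paper.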

3. Asymptotic structure of the group-sequential bivariate logrank statistic

Asymptotic results regarding the univariate statistic Ẑk(τℓ) and its group-sequential version (Ẑk(τ1), …, Ẑk(τL))T have been well developed (e.g., Andersen et al. 1993, X.2). For example, Lin (1991) shows that Ẑ converges to a multivariate normal distribution with zero means and discusses the estimated variance-covariance matrix, although an explicit form for the asymptotic covariance of Ẑ is not provided. Andrei and Murray (2005) provide a more detailed expression for the asymptotic covariance among weighted logrank statistics, but it is in the context of paired event-time data on the same time axes. To the best of our knowledge, a computable explicit form for the asymptotic variance-covariance of Ẑ is not available in the literature. Extending the result for Ẑ when L = 1, i.e. for the fixed-sample design (Sugimoto et al. 2013), we provide the asymptotic distribution of Ẑ with an explicit variance-covariance structure for the group-sequential design (Theorem 1).

We next provide details for expressing the asymptotic distribution of Ẑ. The limit forms of Ĥk(ℓ)(t) and nℓ−1Ȳjk(ℓ)(s) differ among the analysis time points, as the censoring distributions vary with each analysis time τℓ. Let Hk(ℓ)(t), yjk(ℓ)(t) and yk(ℓ)(t) denote the limit forms of Ĥk(ℓ)(t), njℓ−1Ȳjk(ℓ)(t) and nℓ−1Ȳk(ℓ)(t), respectively. Denote by âjℓ = njℓ/nℓ the sample rate of participants assigned to group j at the ℓth analysis, and by γ̂ℓ = nℓ/nL the sample size ratio between the ℓth and final analyses. Let →P denote convergence in probability. We assume the following regularity conditions.

  • Condition 1. For each j, ℓ, 0 < ajℓ < 1 is satisfied, where ajℓ is a constant such that âjℓ →P ajℓ as nℓ → ∞.

  • Condition 2. For each ℓ, 0 < γℓ ≤ 1 is satisfied with γ1 ≤ ⋯ ≤ γL, where γℓ is a constant such that γ̂ℓ →P γℓ as nL → ∞.

  • Condition 3. For each j, k, ℓ, yjk(ℓ)(t) > 0 on [0, τℓ) is satisfied with τℓ = sup{t : yjk(ℓ)(t) > 0}, where yjk(ℓ) is a deterministic function such that, as njℓ → ∞,

$\sup_{t \in [0, \tau_\ell]} \bigl| n_{j\ell}^{-1}\, \bar Y_{jk}^{(\ell)}(t) - y_{jk}^{(\ell)}(t) \bigr| \to_P 0.$

Under our setting, the convergences in Conditions 1–2 and Condition 3 are derived from the law of large numbers and the Glivenko-Cantelli theorem, respectively. Hence, we have γℓ = E(γ̂ℓ), ajℓ = E(âjℓ) and yk(ℓ)(t) = a1ℓ y1k(ℓ)(t) + a2ℓ y2k(ℓ)(t). Note that ajℓ may change with the analysis time τℓ, but each ajℓ should be fixed at the design stage to control the Type I error rate. The type of convergence in Condition 1 is usually replaced with a non-probabilistic version based on an allocation procedure. Condition 3 provides lim_{t→τℓ+0} yjk(ℓ)(t) = 0, which means that all individuals still at risk are censored at the ℓth analysis time τℓ.

Let Cℓ(t) be the survival function of the censoring times Ci(ℓ) when the analysis time is τℓ. Under the independent censoring assumption, we can easily show that

$y_{jk}^{(\ell)}(t) = C_\ell(t)\, S_{jk}(t) \quad \text{and} \quad y_k^{(\ell)}(t) = C_\ell(t)\, S_k^{(\ell)}(t), \quad (1)$

where Sjk(t) = P(t < Tik ∣ gi = j) is the marginal survival function of Tik in group j, and Sk(ℓ)(t) = a1ℓ S1k(t) + a2ℓ S2k(t). Hence, given the condition that the bivariate event-time outcomes are non-fatal, for t ≤ τℓ, we have

$H_k^{(\ell)}(t) = W_k^{(\ell)}(t)\, C_\ell(t)\, \frac{a_{1\ell}\, S_{1k}(t)\; a_{2\ell}\, S_{2k}(t)}{S_k^{(\ell)}(t)}, \quad (2)$

where Wk(ℓ)(t) is either f(Sk(ℓ)(t)) or f(yk(ℓ)(t)), corresponding to the selection of Ŵk(ℓ) in the class K, so that Hk(ℓ)(t) is a deterministic continuous function of bounded variation. In particular, when considering a typical group-sequential trial, we will assume that participants are recruited uniformly on [0, τA], followed up with no dropouts, and then analyzed at the times t = τ1, …, τL. Then we can specify the censoring survival distribution as

$C_\ell(t) = \begin{cases} 1, & 0 \le t \le \tau_\ell - \min(\tau_\ell, \tau_A), \\ (\tau_\ell - t)/\min(\tau_\ell, \tau_A), & \tau_\ell - \min(\tau_\ell, \tau_A) < t \le \tau_\ell, \\ 0, & \tau_\ell < t, \end{cases} \quad (3)$

(recall τA is the length of the entry period planned in advance). Hence, we have

$\gamma_\ell = \frac{\min(\tau_\ell, \tau_A)}{\tau_A} \quad (4)$

under the censoring assumption (3), because γℓ is the expected fraction of participants recruited by the analysis time τℓ.
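The censoring survival function (3) and the sample size fraction (4) are simple piecewise expressions under the uniform-entry, no-dropout assumption; a minimal sketch:

```python
def censoring_survival(t, tau_ell, tau_A):
    """C_ell(t) of Eq. (3): uniform entry on [0, tau_A], no dropouts,
    administrative censoring at the analysis time tau_ell.
    The tau_A = 0 branch degenerates to a step function at tau_ell."""
    m = min(tau_ell, tau_A)
    if t > tau_ell:
        return 0.0
    if t <= tau_ell - m:
        return 1.0
    return (tau_ell - t) / m

def gamma_ell(tau_ell, tau_A):
    """Eq. (4): expected fraction of the final sample recruited by tau_ell
    (assumes a positive entry period tau_A)."""
    return min(tau_ell, tau_A) / tau_A
```

For example, with an entry period of τA = 96 and an interim analysis at τℓ = 48, half of the planned maximum sample is expected to have been recruited.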

Suppose that Z* = (Z1*(τ1), …, Z1*(τL), Z2*(τ1), …, Z2*(τL))T follows the 2L-variate normal distribution N(Dnμ, Σ) with mean vector

$D_n \mu = D_n \begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} = (\sqrt{n_1}\,\mu_{11}, \ldots, \sqrt{n_L}\,\mu_{1L},\ \sqrt{n_1}\,\mu_{21}, \ldots, \sqrt{n_L}\,\mu_{2L})^{\mathsf T}$

and variance-covariance matrix

$\Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix},$

where Dn = diag(√n1, √n2, …, √nL, √n1, √n2, …, √nL), μk = (μk1, …, μkL)T and Σkk′ = (σkk′ℓℓ′)ℓ,ℓ′=1,…,L. That is, for k, k′ = 1, 2 and ℓ, ℓ′ = 1, …, L, the means and covariances of Zk*(τℓ) and Zk′*(τℓ′) are written as

$E(Z_k^*(\tau_\ell)) = \sqrt{n_\ell}\,\mu_{k\ell} = \frac{\sqrt{n_\ell}\; m_k^{(\ell)}(\tau_\ell)}{\sqrt{V_{kk}^{0(\ell)}(\tau_\ell)}}, \qquad \mathrm{Cov}(Z_k^*(\tau_\ell), Z_{k'}^*(\tau_{\ell'})) = \sigma_{kk'\ell\ell'} = \frac{V_{kk'}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})}{\sqrt{V_{kk}^{0(\ell)}(\tau_\ell)\; V_{k'k'}^{0(\ell')}(\tau_{\ell'})}},$

where the elements mk(ℓ), Vkk0(ℓ) and Vkk′ are defined by

$m_k^{(\ell)}(t) = \int_0^t H_k^{(\ell)}(x)\,\{d\Lambda_{1k}(x) - d\Lambda_{2k}(x)\},$

$V_{kk}^{0(\ell)}(t) = \int_0^t H_k^{(\ell)}(x)^2 \left\{ \frac{d\Lambda_{1k}(x)}{a_{2\ell}\, y_{2k}^{(\ell)}(x)} + \frac{d\Lambda_{2k}(x)}{a_{1\ell}\, y_{1k}^{(\ell)}(x)} \right\},$

$V_{kk}(t, s \mid \tau_\ell, \tau_{\ell'}) = \sqrt{\frac{\gamma_{\ell \wedge \ell'}}{\gamma_{\ell \vee \ell'}}} \int_0^{t \wedge s} H_k^{(\ell)}(x)\, H_k^{(\ell')}(x) \left\{ \frac{d\Lambda_{1k}(x)}{a_1\, y_{1k}^{(\ell \vee \ell')}(x)} + \frac{d\Lambda_{2k}(x)}{a_2\, y_{2k}^{(\ell \vee \ell')}(x)} \right\},$

$V_{12}(t, s \mid \tau_\ell, \tau_{\ell'}) = \sqrt{\frac{\gamma_{\ell \wedge \ell'}}{\gamma_{\ell \vee \ell'}}} \int_0^t \!\! \int_0^s H_1^{(\ell)}(x)\, H_2^{(\ell')}(y)\, C_{\ell \wedge \ell'}(x \vee y) \left\{ \frac{A_1(dx, dy)}{a_1\, y_{11}^{(\ell)}(x)\, y_{12}^{(\ell')}(y)} + \frac{A_2(dx, dy)}{a_2\, y_{21}^{(\ell)}(x)\, y_{22}^{(\ell')}(y)} \right\},$

$A_j(dx, dy) = S_j(dx, dy) + S_j(x, dy)\, d\Lambda_{j1}(x) + S_j(dx, y)\, d\Lambda_{j2}(y) + S_j(x, y)\, d\Lambda_{j1}(x)\, d\Lambda_{j2}(y),$

$S_j(dx, dy) = S_j(x, y) - S_j(x{-}, y) - S_j(x, y{-}) + S_j(x{-}, y{-}),$

Sj(dx, y) = Sj(x, y) − Sj(x−, y), Sj(x, dy) = Sj(x, y) − Sj(x, y−), x ∨ y = max(x, y) and x ∧ y = min(x, y). The forms provided in (4), (1) and (2) are applied to these elements mk(ℓ), Vkk0(ℓ) and Vkk′. Under Conditions 1 and 3, it is well known that the univariate weighted logrank statistic can be normally approximated (e.g., Fleming and Harrington 1991, Theorem 7.2.1). We have the following asymptotic result for the group-sequential weighted logrank statistic Ẑ with two correlated outcomes.

Theorem 1 Suppose that Conditions 1–3 are satisfied (ajℓ ∈ (0, 1), γℓ ∈ (0, 1], τℓ = sup{t : y1k(ℓ)(t) y2k(ℓ)(t) > 0}), and that Sj(t, s), j = 1, 2 are continuous on (0, τL] × (0, τL]. Suppose that f(·) is a nonnegative bounded continuous function with bounded variation on [0, 1]. For sufficiently large nℓ's (n1 ≤ ⋯ ≤ nL), the distribution of the 2L-variate weighted logrank statistic Ẑ can then be approximated by N(Dnμ, Σ). That is, as nL ≥ ⋯ ≥ n1 → ∞, Ẑ − Dnμ̂ converges in distribution to Z* − Dnμ distributed as N(0, Σ), where μ̂ converges in probability to μ, μ̂ = (μ̂1T, μ̂2T)T, μ̂k = (μ̂k1, …, μ̂kL)T, μ̂kℓ = m̂k(ℓ)(τℓ)/√(V̂kk0(ℓ)(τℓ)),

$\hat m_k^{(\ell)}(t) = \int_0^t \hat H_k^{(\ell)}(x)\,\{d\Lambda_{1k}(x) - d\Lambda_{2k}(x)\},$

and 0 is the 2L-dimensional zero vector.

The proof is provided in Appendix A. In simulation studies evaluating the finite-sample behavior relevant to Theorem 1, we found that the asymptotic distribution works well in most practical situations, provided the event rate and sample size are not too small.

Several authors (e.g., Wei and Lachin 1984; Lin 1991) have indicated that the proof can be completed by the multivariate central limit theorem and the Cramér-Wald device, leading to asymptotic normality, but the asymptotic form of the variance-covariance was not clearly defined. The asymptotic form of variance-covariance as described in Theorem 1 has not been provided in the context of comparing independent groups with respect to several possibly correlated co-primary endpoints. In fact, when two martingale components with event-time outcomes are correlated on the different time axes as in this context, it is difficult to directly apply standard martingale theory, such as Rebolledo’s central limit theorem, for survival analysis (Fleming and Harrington 1991) considering how the covariance of martingale components converges. As a reference to overcome the problem, we provide our solution based on a martingale approach through the proof of Theorem 1 in Appendix A.

Based on the result of Theorem 1 that the distribution of the weighted logrank statistics Ẑ can be approximated by N(Dnμ, Σ), we can consider a group-sequential design and the asymptotic power for the testing procedure. In our setting, the distribution parameters of the mean vector μ and the diagonal blocks Σkk of Σ are determined by the marginal survival distributions Sjk(t), the censoring survival distributions Cℓ(t), and the sample rates ajℓ (k = 1, 2, j = 1, 2, ℓ = 1, …, L). In fact, the proportions γ1, …, γL of sample sizes are determined by τ1, …, τL and τA under the censoring assumption (3). On the other hand, determining the off-diagonal block Σ12 (and Σ21) of Σ requires an assumption on the joint survival distributions Sj(t, s), j = 1, 2. At the design stage of a trial, one convenient setting is to model Sj(t, s) by

$S_j(t, s) = C(S_{j1}(t), S_{j2}(s); \theta), \quad (5)$

where C(·, ·; θ) is a copula function (such as the Clayton, Gumbel and Frank models), and the association parameter θ characterizes the level of dependence between Sj1(t) and Sj2(s) and is a one-to-one function of a dependence measure (Hsu and Prentice 1996)

$\rho_j = \mathrm{Corr}[\Lambda_{j1}(T_{i1}), \Lambda_{j2}(T_{i2})] = \int_0^\infty \!\! \int_0^\infty S_j(t, s)\, d\Lambda_{j1}(t)\, d\Lambda_{j2}(s) - 1.$

The mean vector μ and the diagonal blocks Σkk depend on the assumptions about the censoring distribution and the hazard ratios ψ1(t) and ψ2(t). The weighted logrank statistic is nonparametric, so it is reasonable to assume the exponential distribution for the marginals Sj1(t) and Sj2(t) in one group. Given hazard ratios independent of time, such as under the proportional hazards hypothesis {ψk(t) ≡ ψk for t ∈ (0, τL], k = 1, 2} (μ = 0 if ψ1 = ψ2 = 1), the marginals Sj1(t) and Sj2(t) of one group determine those of the other group. Hence, a typical design calculation may be based on four exponential marginals Sjk(t), j = 1, 2, k = 1, 2, together with the setting of the analysis times (τ1, …, τL), the entry period (τA), the dependence measures (ρ1, ρ2) and the selection of a copula function. Numerically, the calculations involved in μ, Σ and ρj can be performed with sufficient precision using numerical integration methods, such as the trapezoidal rule or Simpson's rule (e.g., Sugimoto et al. 2013).
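As an example of such a numerical-integration step, the dependence measure ρj can be approximated with the trapezoidal rule for exponential margins, where dΛjk(t) = λjk dt. The rates, the Clayton parameter θ, and the grid/truncation settings below are illustrative choices, not values from any specific design:

```python
import numpy as np

def rho_dependence(S, lam1, lam2, upper=400.0, n=2001):
    """Approximate rho = lam1*lam2 * double-integral of S(t, s) dt ds - 1
    for exponential margins, via the composite trapezoidal rule on a
    truncated [0, upper] x [0, upper] grid."""
    t = np.linspace(0.0, upper, n)
    s = np.linspace(0.0, upper, n)
    T, Sg = np.meshgrid(t, s, indexing="ij")
    vals = S(T, Sg)
    h_s, h_t = s[1] - s[0], t[1] - t[0]
    inner = (vals[:, :-1] + vals[:, 1:]).sum(axis=1) * h_s / 2.0  # over s
    outer = (inner[:-1] + inner[1:]).sum() * h_t / 2.0            # then over t
    return lam1 * lam2 * outer - 1.0

lam1, lam2, theta = 0.03, 0.05, 2.0  # hypothetical rates and Clayton parameter

def clayton(t, s):
    # Clayton joint survival with exponential margins (late time-dependency)
    return (np.exp(theta * lam1 * t) + np.exp(theta * lam2 * s) - 1.0) ** (-1.0 / theta)

def independent(t, s):
    # product (independence) case: rho should be approximately 0
    return np.exp(-lam1 * t - lam2 * s)
```

The same routine applies to any copula-based joint survival function of form (5); only the callable S changes.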

In group-sequential designs, the concept of information fraction is important in determining the critical boundaries to preserve the overall Type I error rate. This concept can be generalized to the bivariate event-time setting by analogy with a single event-time outcome. The information at τℓ for each outcome can be characterized using an asymptotic form of the Fisher information, i.e., Ikℓ = nℓ Vkk0(ℓ)(τℓ), k = 1, 2, which corresponds to the information under the null hypothesis for the log of the hazard ratios (e.g., Jennison and Turnbull 2000; Yin 2012). As information accumulates up to τL, the standardized internal time RIkℓ for each outcome is defined as the fraction of the maximum information IkL, i.e., RIkℓ = Ikℓ/IkL. Theorem 1 provides that the components of Σ, under the null hypothesis ψ1 = ψ2 = 1, are obtained as

$V_{kk}^{0(\ell)}(t) = a_{1\ell}\, a_{2\ell} \int_0^t W_k^{(\ell)}(x)^2\, C_\ell(x)\, S_k(x)\, d\Lambda_k(x),$

$V_{kk'}(t, s \mid \tau_\ell, \tau_{\ell'}) = a_1 a_2 \sqrt{\frac{\gamma_{\ell \wedge \ell'}}{\gamma_{\ell \vee \ell'}}} \times \begin{cases} \displaystyle \int_0^{t \wedge s} W_k^{(\ell)}(x)\, W_k^{(\ell')}(x)\, C_{\ell \wedge \ell'}(x)\, S_k(x)\, d\Lambda_k(x), & \text{if } k = k', \\[2ex] \displaystyle \int_0^t \!\! \int_0^s W_k^{(\ell)}(x)\, W_{k'}^{(\ell')}(y)\, C_{\ell \wedge \ell'}(x \vee y)\, A(dx, dy), & \text{if } k \ne k', \end{cases}$

where we have Λk(x) = Λ1k(x) = Λ2k(x), Sk(x) = S1k(x) = S2k(x) and A(x, y) = A1(x, y) = A2(x, y) because ψ1 = ψ2 = 1. The result given for the single endpoint (e.g., Andersen et al. 1993, X.2) is that the asymptotic correlation between group-sequential weighted logrank statistics

$\mathrm{Corr}[Z_k^*(\tau_\ell), Z_k^*(\tau_{\ell'})] = \frac{V_{kk}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})}{\sqrt{V_{kk}(\tau_\ell, \tau_\ell \mid \tau_\ell, \tau_\ell)\; V_{kk}(\tau_{\ell'}, \tau_{\ell'} \mid \tau_{\ell'}, \tau_{\ell'})}},$

reduces to √(RIkℓ/RIkℓ′) for ℓ ≤ ℓ′ when the null hypothesis is true (ψ1 = ψ2 = 1) and Wk(ℓ)(s) is independent of ℓ, such as Wk(ℓ)(s) = 1. Theorem 1 describes that the correlation of (Ẑk(τℓ), Ẑk′(τℓ′)), including between different endpoints, for k, k′ = 1, 2 and 1 ≤ ℓ, ℓ′ ≤ L can be approximated by

$\mathrm{Corr}[Z_k^*(\tau_\ell), Z_{k'}^*(\tau_{\ell'})] = \frac{V_{kk'}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})}{\sqrt{V_{kk}(\tau_\ell, \tau_\ell \mid \tau_\ell, \tau_\ell)\; V_{k'k'}(\tau_{\ell'}, \tau_{\ell'} \mid \tau_{\ell'}, \tau_{\ell'})}},$

which is {1(k = k′) + 1(k ≠ k′) ρZ(τℓ, τℓ′)} √(RIkℓ/RIkℓ′) for ℓ ≤ ℓ′ if the null hypothesis is true and Wk(ℓ)(s) is independent of ℓ, where

$\rho_Z(\tau_\ell, \tau_{\ell'}) = \frac{V_{12}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})}{\sqrt{V_{11}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})\; V_{22}(\tau_\ell, \tau_{\ell'} \mid \tau_\ell, \tau_{\ell'})}}.$

4. Application to group-sequential design

We provide an application of the group-sequential design based on the results discussed in Sect. 3. As a motivating example, consider a major HIV treatment trial within the AIDS Clinical Trials Group, "A Phase III Randomized Comparative Study of Three Non-Nucleoside Reverse Transcriptase Inhibitor (NNRTI)-Sparing Antiretroviral Regimens for Treatment-Naïve HIV-1-Infected Volunteers (The ARDENT Study: Atazanavir, Raltegravir, or Darunavir with Emtricitabine/Tenofovir for Naïve Treatment)" (Lennox et al. 2014). The planned total sample size of 1800 (equally sized groups) was calculated for the pairwise comparison of the three regimens with respect to the two co-primary endpoints, "virologic failure" and "regimen failure due to tolerability", without taking into account the potential correlation, with 3% inflation as an adjustment for interim monitoring, under a study duration of 96 weeks after enrollment of the last subject, where the two failures are non-fatal. The study had (i) a power of 0.90 to establish non-inferiority in the risk reduction of virologic failure with a non-inferiority margin of 10% at α = 0.0125 for a one-sided test, assuming a virologic failure rate of 25% at 96 weeks, and (ii) a power of 0.85 to detect a 10% difference in regimen failure at α = 0.025 for a two-sided test, assuming a regimen failure rate of 45% at 96 weeks.

For illustrative purposes, suppose that the objective of the ARDENT trial was to test for the superiority of one intervention over the other on both co-primary endpoints (OC1: virologic failure, OC2: regimen failure). The allocation ratios are assumed to be constant across analyses (aj1 = ⋯ = ajL) and are not changed arbitrarily during the trial, as arbitrary changes may affect the Type I error rate and power. The significance level of 2.5% (α = 0.025) is allocated to each endpoint using a one-sided logrank test in a group-sequential setting, where the group sizes at each analysis are equal (a1ℓ = 0.5), and the survival rate at 96 weeks is assumed to be 75% and 85% for OC1, and 55% and 65% for OC2, in the control and test intervention groups, respectively (S11(96) = 0.75, S21(96) = 0.85; S12(96) = 0.55, S22(96) = 0.65). Two analyses are planned: the first at τ1 = 48 + τA and the final at τ2 = 96 + τA (L = 2). Letting ψ* = (ψ1*, ψ2*) be the hazard ratios of interest as a true value of ψ = (ψ1, ψ2), typical exponential assumptions lead to ψ* ≐ (0.565, 0.721) based on S2k(t) = S1k(t)^{ψk*} for the ARDENT study.

A superiority clinical trial with two event-time outcomes (OC1 and OC2) as "co-primary" endpoints is often designed to evaluate whether the test intervention is superior to the control on both outcomes. For two co-primary endpoints, the testing procedure is to test the union H0cp = H01 ∪ H02 of the two individual nulls against the alternative H1cp = H11 ∩ H12. For simplicity, suppose that the proportional hazards hypothesis holds, ψ1(t) ≡ ψ1 and ψ2(t) ≡ ψ2, and that the single null hypothesis H0k : ψk = 1 is tested versus H1k : ψk < 1 at significance level α for each k. When evaluating a joint effect on both endpoints within the context of group-sequential designs, one decision-making framework associated with hypothesis testing is to reject H0cp if statistical significance of the test intervention relative to the control is achieved for both endpoints, not necessarily simultaneously, at any analysis up to the final analysis (Asakura et al. 2014; Hamasaki et al. 2015). The power corresponding to this decision-making framework at ψ = ψ* is

$1 - \beta = P\Bigl(\Bigl\{\textstyle\bigcup_{\ell=1}^{L}\{Z_1(\tau_\ell) > c_{1\ell}(\alpha)\}\Bigr\} \cap \Bigl\{\textstyle\bigcup_{\ell=1}^{L}\{Z_2(\tau_\ell) > c_{2\ell}(\alpha)\}\Bigr\}\Bigr)$
$= 1 - P\Bigl(\textstyle\bigcap_{\ell=1}^{L}\{Z_1(\tau_\ell) \le c_{1\ell}(\alpha)\};\ \psi_1 = \psi_1^*\Bigr) - P\Bigl(\textstyle\bigcap_{\ell=1}^{L}\{Z_2(\tau_\ell) \le c_{2\ell}(\alpha)\};\ \psi_2 = \psi_2^*\Bigr) + P\Bigl(\textstyle\bigcap_{\ell=1}^{L}\{Z_1(\tau_\ell) \le c_{1\ell}(\alpha),\ Z_2(\tau_\ell) \le c_{2\ell}(\alpha)\};\ \psi = \psi^*\Bigr), \quad (6)$

where ckℓ(α) is the critical boundary at the ℓth analysis for the kth outcome, specified and determined in advance using any group-sequential method, as if each endpoint were a single primary endpoint, ignoring the other endpoint, analogously to the single-endpoint case. Note that only the marginal results of Theorem 1 are required for the standardized internal times RIkℓ, where RIkℓ does not depend on the correlation between OC1 and OC2 in the situation where both outcomes are non-fatal. Once RIkℓ, k = 1, 2 are determined, the critical boundaries can be calculated using group-sequential methods to control the overall Type I error rate in each marginal. Using the result of Theorem 1 that the distribution of Ẑ can be approximated by that of Z* under a large sample size, the power (6) can be approximately calculated as

$1 - \beta = 1 - \int_{-\infty}^{c_{11}^*} \!\!\cdots\!\! \int_{-\infty}^{c_{1L}^*} f_L(z_{11}, \ldots, z_{1L}; R_{11})\, dz_{11} \cdots dz_{1L} - \int_{-\infty}^{c_{21}^*} \!\!\cdots\!\! \int_{-\infty}^{c_{2L}^*} f_L(z_{21}, \ldots, z_{2L}; R_{22})\, dz_{21} \cdots dz_{2L} + \int_{-\infty}^{c_{11}^*} \!\!\cdots\!\! \int_{-\infty}^{c_{2L}^*} f_{2L}(z_{11}, \ldots, z_{1L}, z_{21}, \ldots, z_{2L}; R)\, dz_{11} \cdots dz_{1L}\, dz_{21} \cdots dz_{2L}, \quad (7)$

where fm(·; A) is the m-variate normal density function with zero mean vector and variance-covariance matrix A, and R is the correlation matrix given by

$R = \begin{pmatrix} R_{11} & R_{12} \\ R_{21} & R_{22} \end{pmatrix} = S^{-1/2}\, \Sigma\, S^{-1/2},$

S = diag(σ1111, σ1122, …, σ11LL, σ2211, σ2222, …, σ22LL), and the integration limits ckℓ* are

$c_{k\ell}^* = \frac{1}{\sqrt{\sigma_{kk\ell\ell}}}\,\bigl\{c_{k\ell}(\alpha) - \sqrt{\gamma_\ell\, n_L}\; \mu_{k\ell}\bigr\}, \qquad k = 1, 2;\ \ell = 1, \ldots, L,$

and recall nℓ = γℓ nL.
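For L = 2, the three multivariate normal integrals in (7) can be evaluated with standard routines. The sketch below (not the authors' code) uses scipy and, for illustration, the within-endpoint correlations and OF-type boundaries reported in Table 1 together with a zero between-endpoint block (the ρ = 0 case):

```python
import numpy as np
from scipy.stats import multivariate_normal as mvn

def joint_power(c1, c2, mu1, mu2, R):
    """Evaluate the L = 2 version of (7):
    1 - P(no rejection for OC1) - P(no rejection for OC2) + P(no rejection for either),
    where 'no rejection' means Z stays at or below its boundary at both analyses.
    Ordering of R: (Z1(tau1), Z1(tau2), Z2(tau1), Z2(tau2))."""
    up1 = np.asarray(c1, dtype=float) - np.asarray(mu1, dtype=float)
    up2 = np.asarray(c2, dtype=float) - np.asarray(mu2, dtype=float)
    p1 = mvn.cdf(up1, mean=np.zeros(2), cov=R[:2, :2])
    p2 = mvn.cdf(up2, mean=np.zeros(2), cov=R[2:, 2:])
    p12 = mvn.cdf(np.concatenate([up1, up2]), mean=np.zeros(4), cov=R)
    return 1.0 - p1 - p2 + p12

# Within-endpoint correlations and OF-type boundaries as reported for the
# ARDENT illustration; the between-endpoint block is set to zero (rho = 0).
R = np.eye(4)
R[0, 1] = R[1, 0] = 0.7260
R[2, 3] = R[3, 2] = 0.7507
of1 = [2.8616, 1.9718]  # boundaries for OC1
of2 = [2.7576, 1.9761]  # boundaries for OC2

# With zero drift (the null), joint_power returns the probability of
# declaring joint significance, roughly alpha**2 in this independent case.
null_prob = joint_power(of1, of2, [0.0, 0.0], [0.0, 0.0], R)
```

Under the alternative, the drift vectors mu1 and mu2 would be set to √(γℓ nL) μkℓ from the display above.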

Returning to the ARDENT study, let τA = 0, similarly to the manner assumed by Lennox et al. (2014). Although τA is not zero in fact, this selection of τA provides a conservative result and is reasonable in practice because of the difficulty of estimating the feasible entry period. The two fixed analysis times are (τ1, τ2) = (48, 96), and the censoring distribution (3) under τA = 0 simplifies to

$C_1(t) = \begin{cases} 1, & 0 \le t \le \tau_1 = 48, \\ 0, & \tau_1 < t, \end{cases} \qquad C_2(t) = \begin{cases} 1, & 0 \le t \le \tau_2 = 96, \\ 0, & \tau_2 < t. \end{cases} \quad (8)$

We select the weight function Wk(ℓ)(s) = 1, corresponding to the logrank statistic. Under these configurations with the exponential marginal assumption, we calculate RIkℓ, whose values at 48 weeks are 0.5314 for OC1 and 0.5669 for OC2, and then determine ckℓ(α) by the O'Brien-Fleming-type function (O'Brien and Fleming 1979) using the Lan-DeMets error-spending method (Lan and DeMets 1983), as shown in Table 1, which also includes the Pocock-type boundary (Pocock 1977). The power (7) is then calculated, given the settings of the joint survival functions Sj(t, s) and the correlations ρj between OC1 and OC2. We use the copula model (5) to identify the joint survival distribution Sj(t, s). In particular, we utilize the Clayton copula (late time-dependency) (Clayton 1976) and the Gumbel copula (early time-dependency) (Hougaard 1986); that is, we set, under the Clayton copula,

$S_j(t, s) = \bigl(e^{\theta_j \lambda_{j1} t} + e^{\theta_j \lambda_{j2} s} - 1\bigr)^{-1/\theta_j}$

and, under the Gumbel copula,

$S_j(t, s) = \exp\Bigl(-\bigl\{(\lambda_{j1} t)^{1/\theta_j} + (\lambda_{j2} s)^{1/\theta_j}\bigr\}^{\theta_j}\Bigr),$

the marginal hazard rates are given by λ1k = − log S1k(96)/96 and λ2k = λ1k ψk*, and the association parameter θj is determined by the value of ρj (see Sugimoto et al. (2013) for more details). For simplicity, we set the correlations as ρ1 = ρ2 ≡ ρ and consider ρ = 0, 0.1, …, 0.9 and 0.95. Based on (7), the total maximum sample size (MSS) required for the final analysis is the smallest integer nL for which (7) is not less than the desired power at the prespecified ψ = ψ*. For example, using the method with the above parameter configuration and setting, for ρ = 0, R11, R22, and R12 are approximately calculated as

$R_{11} = \begin{pmatrix} 1 & 0.7260 \\ 0.7260 & 1 \end{pmatrix}, \quad R_{22} = \begin{pmatrix} 1 & 0.7507 \\ 0.7507 & 1 \end{pmatrix} \quad \text{and} \quad R_{12} = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix},$

respectively, and for ρ = 0.8

$R_{11} = \begin{pmatrix} 1 & 0.7260 \\ 0.7260 & 1 \end{pmatrix}, \quad R_{22} = \begin{pmatrix} 1 & 0.7507 \\ 0.7507 & 1 \end{pmatrix} \quad \text{and} \quad R_{12} = \begin{pmatrix} 0.2159 & 0.1569 \\ 0.1622 & 0.3341 \end{pmatrix},$

respectively. Once the MSS is computed, the maximum event number (MEN) dkL is calculated using dkL = nL PkL(event), where Pkℓ(event) is the probability that the event of the kth outcome occurs in the time interval (0, τℓ] and can be calculated, for example, based on Collett (2003) or Sugimoto et al. (2017, Appendix B). Also, the average event number (AEN) d̄k is calculated using hypothetical reference values, similarly to Asakura et al. (2014), by

$\bar d_k = \sum_{\ell=1}^{L-1} d_{k\ell}\, P(\mathrm{stop}_\ell) + d_{kL}\Bigl(1 - \sum_{\ell=1}^{L-1} P(\mathrm{stop}_\ell)\Bigr),$

where dkℓ = nℓ Pkℓ(event), and P(stopℓ) is the stopping probability, defined as the probability of crossing the critical boundaries at the ℓth interim analysis under the true values ψ* of the intervention effects. The AEN provides information on the number of events anticipated in a group-sequential design in order to reach a decision point.
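The AEN formula above is simply a weighted average of the planned event numbers; a minimal sketch with hypothetical inputs:

```python
def average_event_number(d, p_stop):
    """AEN d-bar_k: d = [d_k1, ..., d_kL] planned event numbers at each analysis,
    p_stop = [P(stop_1), ..., P(stop_{L-1})] early-stopping probabilities at the
    interim analyses; the remaining probability mass is assigned to the final
    analysis."""
    interim = sum(dk * p for dk, p in zip(d[:-1], p_stop))
    return interim + d[-1] * (1.0 - sum(p_stop))
```

For example, with L = 2, 100 events planned at the interim, 200 at the final analysis, and a 40% chance of early stopping, the AEN is 0.4 × 100 + 0.6 × 200 = 160 events.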

Table 1.

Calculated information fractions and the corresponding O’Brien-Fleming-type (OF) and Pocock-type (PC) critical boundaries.

Analysis #   Calendar Time   OC1 Information   OC1 OF-type   OC1 PC-type   OC2 Information   OC2 OF-type   OC2 PC-type
             (weeks)         Fraction          Bound         Bound         Fraction          Bound         Bound
1            48              0.5314            2.8616        2.1390        0.5669            2.7576        2.1200
2            96              1.0000            1.9718        2.2110        1.0000            1.9761        2.2215
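The OF-type bounds in Table 1 can be reproduced approximately from the Lan-DeMets error-spending construction. The sketch below assumes the O'Brien-Fleming-type spending function and, as a simplification, the null correlation √RI between the two analyses:

```python
import numpy as np
from scipy.stats import norm, multivariate_normal as mvn
from scipy.optimize import brentq

def of_spending(t, alpha):
    """O'Brien-Fleming-type spending function of Lan-DeMets:
    alpha*(t) = 2(1 - Phi(z_{1-alpha/2} / sqrt(t)))."""
    return 2.0 * (1.0 - norm.cdf(norm.ppf(1.0 - alpha / 2.0) / np.sqrt(t)))

def two_stage_boundaries(t1, alpha, corr):
    """One-sided boundaries (c1, c2) for L = 2 analyses at information
    fractions (t1, 1): spend of_spending(t1, alpha) at stage 1 and the
    remainder at stage 2, given Corr(Z(tau_1), Z(tau_2)) = corr."""
    a1 = of_spending(t1, alpha)
    c1 = norm.ppf(1.0 - a1)
    cov = np.array([[1.0, corr], [corr, 1.0]])
    # choose c2 so that the cumulative one-sided error equals alpha
    g = lambda c2: 1.0 - mvn.cdf(np.array([c1, c2]), mean=np.zeros(2), cov=cov) - alpha
    return c1, brentq(g, 0.5, 5.0)

# OC1 setting of Table 1: information fraction 0.5314 at 48 weeks
c1, c2 = two_stage_boundaries(0.5314, 0.025, np.sqrt(0.5314))
# c1 and c2 land near the reported OF-type bounds 2.8616 and 1.9718
```

The same call with t1 = 0.5669 gives the OC2 bounds; a Pocock-type spending function would replace of_spending.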

Table 2 summarizes the MSS, MEN, AEN, and empirical power for the late time-dependent association. The empirical power under the calculated MSS achieves the targeted power. First, the group-sequential design provides a considerably smaller AEN than the fixed-sample design in every case, which is preferable in terms of cost savings. As expected, the MSS decreases with higher positive correlation, but the reduction is small. Power and sample size are less impacted by the correlation than by the hazard ratios. The MSS is largely determined by the hazard ratio closer to 1, and it does not vary with the correlation when one hazard ratio is considerably smaller (or larger) than the other. Similarly, there is little difference in the MEN between the group-sequential and fixed-sample designs. Based on these results, for the ARDENT study, the MSS is nearly determined by OC2. We describe only the results assuming the late time-dependent association. Similar patterns are observed in the case of an early time-dependent association (Gumbel copula); the design planning results under the Gumbel copula are provided in Table B.1 of Appendix B. We also provide considerations and results on Type I error rate control in Appendix B (Tables B.2 and B.3).

Table 2.

Sample sizes, numbers of events, and empirical powers in a group-sequential trial with two co-primary outcomes (Clayton copula).

Corr ρj   FSS*   MSS   MEN (OC1)   MEN (OC2)   AEN (OC1)   AEN (OC2)   Both EP   At least one EP   Single EP (OC1)   Single EP (OC2)
0.0       830    835   168         335         141         293         80.6      99.3              95.3              84.6
0.1       829    833   167         334         140         292         80.5      99.2              95.2              84.5
0.2       827    832   167         333         140         291         80.4      99.2              95.2              84.4
0.3       826    831   167         333         140         291         80.7      99.1              95.3              84.5
0.4       824    829   166         332         139         291         80.5      99.0              95.2              84.3
0.5       822    827   166         331         139         290         80.6      99.0              95.1              84.3
0.6       820    825   166         331         139         290         80.6      99.0              95.1              84.2
0.7       816    821   165         329         138         288         80.5      98.5              95.0              83.9
0.8       811    816   164         327         137         287         80.5      98.2              95.0              83.7
0.9       801    806   162         323         136         284         80.3      97.5              94.6              83.2
0.95      792    797   160         319         134         280         80.4      96.8              94.4              82.8
*

FSS:Sample sizes required for fixed-sample design.

The trial is designed to evaluate whether the intervention is superior to the control with respect to both virologic failure (OC1) and regimen failure (OC2), with 80% power at the 2.5% significance level of a one-sided logrank test, where two analyses are planned at fixed calendar times of 48 and 96 weeks. For both outcomes, the critical boundaries are determined using the Lan-DeMets error-spending method with the O'Brien-Fleming-type function. The bivariate exponential distribution is modeled using the Clayton copula. Empirical power is calculated using 100,000 repetitions. The marginal powers for OC1 and OC2 are calculated under the calculated maximum sample size.
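The simulation setup above requires bivariate exponential event times linked by a Clayton copula. The following sketch is our own illustration of the standard conditional-inversion sampler (the hazards `lam1`, `lam2` and the Clayton parameter `theta` are placeholder arguments, not the paper's configuration); for the Clayton family, Kendall's tau equals θ/(θ + 2), so θ can be chosen to match a target rank correlation.

```python
import numpy as np

def clayton_bivariate_exponential(n, lam1, lam2, theta, rng=None):
    """Draw n pairs of exponential event times whose dependence is a
    Clayton copula with parameter theta > 0 (late time-dependent association)."""
    rng = np.random.default_rng(rng)
    u1 = rng.uniform(size=n)
    v = rng.uniform(size=n)
    # Conditional inversion: sample u2 given u1 from the Clayton copula.
    u2 = ((v ** (-theta / (1.0 + theta)) - 1.0) * u1 ** (-theta) + 1.0) ** (-1.0 / theta)
    # Invert the exponential survival functions S(t) = exp(-lam * t).
    t1 = -np.log(u1) / lam1
    t2 = -np.log(u2) / lam2
    return t1, t2
```

Since Kendall's tau is invariant under the monotone marginal transformations, the event-time pairs inherit the copula's rank correlation; for example, θ = 2 implies a Kendall's tau of 0.5 (note that the paper's ρj may be measured on a different correlation scale).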

In this illustration, the interim analysis was planned at prespecified calendar times because participants are recruited in calendar time. Alternatively, one may design a survival trial based on information fractions, as the interim summary statistics depend on the amount of information available; for example, the first analysis may be planned when 50% of the maximum number of events for one endpoint has been observed. The proposed method can be applied to information-fraction-based designs as well. Table B.4 in Appendix B summarizes the statistics required for such designs, including the corresponding calendar time, variance, and information fraction for one endpoint relative to the information fraction for the other endpoint.

5. Discussion

A single primary endpoint may or may not provide a comprehensive picture of the important effects of the intervention. For this reason, many investigators prefer to design clinical trials with more than one primary endpoint (Dmitrienko et al. 2009). Multiple primary endpoints offer an attractive design feature as they capture a more complete characterization of the effect of an intervention on short- and long-term outcomes. For example, the Ambassador trial (NCT03244384) was designed to test the effect of pembrolizumab on overall survival and disease-free survival in patients with bladder cancer. In addition, it is common in oncology trials to use two primary endpoints to study the effect of treatment in different patient populations. For example, SWOG S0819 (Herbst et al. 2018) was designed to test the effect of cetuximab plus chemotherapy on overall survival in all patients with lung cancer and to study the impact of the combination therapy on progression-free survival in patients who were EGFR positive. However, for both multiple primary and co-primary endpoints, it is non-trivial to control the Type I and Type II error rates when the endpoints are correlated. Evaluating the impact of the correlations among the endpoints is therefore important in the design and analysis of clinical trials with multiple endpoints. Although methodologies to address continuous or binary endpoints in fixed-sample designs are well developed, methodologies for event-time endpoints are limited (Halabi 2012; Rauch et al. 2016), especially in the group-sequential setting.

In this paper, we discuss basic theory and methods for group-sequential designs in clinical trials with two non-fatal event-time outcomes. We present the asymptotic form and a computing method for the variance-covariance function of the two sets of group-sequential weighted logrank statistics, which is fundamental for determining the information fraction for each outcome and for evaluating the probability of rejecting the null hypotheses. Several authors have developed methods for group-sequential designs; however, in the context of comparing co-primary or multiple endpoints between groups, the form of the asymptotic variance-covariance matrix had not been derived from the data correlation structure between the two event times. Descriptions based on the multivariate central limit theorem and the Cramer-Wold device did not clearly provide the asymptotic form of the variance-covariance matrix or its connection with a martingale approach, which causes challenges when calculating the power and required sample size for a trial design. Although a covariance form similar to Theorem 1 has been reported in Murray (2000) and Andrei and Murray (2005), their contexts differ from ours, being paired logrank statistics on the same time axis. When two martingale components with event-time outcomes are correlated on different time axes, it is difficult to apply the standard martingale theory of survival analysis directly. We overcome these difficulties by deriving a two-dimensional Volterra integral equation using the discrete Ito formula (Jacod and Shiryaev 2003) within a martingale approach, as provided in Appendix A in the proof of Theorem 1. The simulation results indicate that the asymptotic distribution of Theorem 1 works well in most practical situations as long as the event rate and sample size are not too small.

We apply the asymptotic result to group-sequential methodology for monitoring both or one of the event-time outcomes when the trial is designed to evaluate a joint effect on both outcomes. The developed methods have several advantages. First, they provide an approach for determining the information and information fraction for two event-time outcomes. Second, they offer an opportunity to evaluate the relationship between two event-time endpoints and how it impacts the decision to reject the null hypothesis, in terms of the Type I error rate, power, sample size, and number of events. Finally, they provide insights on how to optimally choose a strategy for monitoring two event-time endpoints. We outline how to calculate the rejection probability, sample size, and number of events, and illustrate the methods using a clinical trial example in HIV. Under the calculated total maximum sample size for a joint effect on the two outcomes, the monitoring method achieves the targeted power and adequately controls the Type I error rate; the empirical Type I error rate was evaluated using Monte-Carlo simulation, and the methods also behaved well in the other practical situations examined. The objectives of the methods are to incorporate the correlation between the two event-time outcomes into the power, Type I error evaluation, and sample size calculation, and to investigate how these behave as the correlation varies. The strength and shape of the association may be estimated from external or internal pilot data, but are usually unknown.

We discuss the situation where both event-time outcomes are non-fatal. Sugimoto et al. (2017) discussed the fixed-sample design when one event is fatal, and when both are fatal. An extension of their work to a group-sequential setting will require an extensive study to modify the variance-covariance structure of the group-sequential logrank statistics in order to handle dependent censoring. Research on group-sequential designs under such situations is an important area for future studies.

Acknowledgements

We thank one reviewer and the Associate Editor for their comments. Research reported in this publication was supported by JSPS KAKENHI grant numbers JP17K00054 and JP17K00069, the Project Promoting Clinical Trials for Development of New Drugs (18lk0201061h0002/18lk0201061h0202) from the Japan Agency for Medical Research and Development (AMED) and the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number UM1AI068634. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Appendix

A. Proof of Theorem 1

Let $M_{ik}^{(\ell)}(t)=N_{ik}^{(\ell)}(t)-\int_0^t Y_{ik}^{(\ell)}(x)\,d\Lambda_{g_i k}(x)$ and let $\{\mathcal{F}_{k,t}^{(\ell)}:t\ge 0\}$ be the standard filtration generated by the history through time $t$ for the $k$th outcome and the $\ell$th analysis ($\mathcal{F}_{k,t}^{(\ell)}$ is the smallest $\sigma$-algebra generated by $\{N_{ik}^{(\ell)}(x),\,N_{ik}^{C(\ell)}(x):0\le x\le t,\ i=1,\dots,n_\ell\}$, where $N_{ik}^{C(\ell)}(t)=1\{T_{ik}^{(\ell)}\le t,\,\Delta_{ik}^{(\ell)}=0\}$ is the censoring counting process). As is well known, $M_{ik}^{(\ell)}(t)$ has the $\mathcal{F}_{k,t}^{(\ell)}$-martingale property. We discuss the asymptotic behavior using the decomposition of the weighted logrank process $U_k^{(\ell)}(t)=\hat m_k^{(\ell)}(t)+n_\ell^{-1/2}M_k^{(\ell)}(t)$ from the definition of $U_k^{(\ell)}$, where

$$M_k^{(\ell)}(t)=\int_0^t \hat H_k^{(\ell)}(x)\,n_\ell^{1/2}\sum_{i=1}^{n_L}d\tilde M_{ik}^{(\ell)}(x)=\int_0^t \hat H_k^{(\ell)}(x)\,n_\ell^{1/2}\,d\tilde M_k^{(\ell)}(x),$$
$$d\tilde M_k^{(\ell)}(x)=\frac{d\bar M_{2k}^{(\ell)}(x)}{\bar Y_{2k}^{(\ell)}(x)}-\frac{d\bar M_{1k}^{(\ell)}(x)}{\bar Y_{1k}^{(\ell)}(x)},\qquad d\bar M_{jk}^{(\ell)}(x)=\sum_{i=1}^{n_\ell}1\{g_i=j\}\,dM_{ik}^{(\ell)}(x),$$
$$d\tilde M_{ik}^{(\ell)}(x)=1\{i\le n_\ell\}\left\{\frac{1\{g_i=2\}}{\bar Y_{2k}^{(\ell)}(x)}-\frac{1\{g_i=1\}}{\bar Y_{1k}^{(\ell)}(x)}\right\}dM_{ik}^{(\ell)}(x),$$

and $M_k^{(\ell)}(t)$ is an $\mathcal{F}_{k,t}^{(\ell)}$-martingale because $\hat H_k^{(\ell)}(t)$ is $\mathcal{F}_{k,t}^{(\ell)}$-predictable.

Let $\hat Z=(\hat Z_1(\tau_1),\dots,\hat Z_1(\tau_L),\hat Z_2(\tau_1),\dots,\hat Z_2(\tau_L))^{\mathsf T}$ and let $\hat Z_k^{*}(\tau_\ell)$ be $\hat Z_k(\tau_\ell)$ whose denominator is replaced by its limit version,

$$\hat Z_k^{*}(\tau_\ell)=\frac{n_\ell^{1/2}\,U_k^{(\ell)}(\tau_\ell)}{\sqrt{V_{kk}^{0(\ell)}(\tau_\ell)}}=n_\ell^{1/2}\hat\mu_{k\ell}+\xi_{k\ell}M_k^{(\ell)}(\tau_\ell),$$

where we write $\xi_{k\ell}=1/\sqrt{V_{kk}^{0(\ell)}(\tau_\ell)}$ for simplicity. The distribution of $\hat Z-D_n\hat\mu$ is asymptotically equivalent to that of

$$\big(\xi_{11}M_1^{(1)}(\tau_1),\dots,\xi_{1L}M_1^{(L)}(\tau_L),\,\xi_{21}M_2^{(1)}(\tau_1),\dots,\xi_{2L}M_2^{(L)}(\tau_L)\big)^{\mathsf T}$$

because the dominated convergence theorem applies under the uniform convergence $\hat V_{kk}^{0(\ell)}(\tau_\ell)\xrightarrow{P}V_{kk}^{0(\ell)}(\tau_\ell)$ over $\ell=1,\dots,L$ as $n_L\ge\cdots\ge n_1\to\infty$. It therefore suffices to study the covariances of the $M_k^{(\ell)}$'s to characterize the distribution of $\hat Z-D_n\hat\mu$.

In the proof hereafter, it is sufficient to consider the case $L=2$. As a function related to the characteristic function of $M_k^{(\ell)}(t)$, define

$$G_k^{(\ell)}(t)=\exp\!\left(\mathrm{i}\,z_{k\ell}M_k^{(\ell)}(t)+\frac{z_{k\ell}^2}{2}\big\langle M_k^{(\ell)},M_k^{(\ell)}\big\rangle(t)\right)$$

for a real non-zero $z_{k\ell}$ and $\mathrm{i}=\sqrt{-1}$, where $\langle m_1,m_2\rangle$ denotes the predictable covariation process of two martingales $m_1$ and $m_2$. In this case we have

$$\big\langle M_k^{(\ell)},M_k^{(\ell)}\big\rangle(t)=n_\ell\int_0^t \hat H_k^{(\ell)}(x)^2\left\{\frac{d\Lambda_{1k}(x)}{\bar Y_{1k}^{(\ell)}(x)}+\frac{d\Lambda_{2k}(x)}{\bar Y_{2k}^{(\ell)}(x)}\right\},$$

following the standard martingale theory of survival analysis (see Fleming and Harrington (1991)). The consistency of $\hat S_{jk}^{(\ell)}$, the Glivenko-Cantelli theorem, and Conditions 1 and 3 imply $\sup_{0\le x\le\tau_\ell}|\hat H_k^{(\ell)}(x)-H_k^{(\ell)}(x)|\xrightarrow{P}0$ and

$$\sup_{0\le x\le\tau_\ell}\left|\frac{\hat H_k^{(\ell)}(x)}{n_\ell^{-1}\bar Y_{jk}^{(\ell)}(x)}-h_{jk}^{(\ell)}(x)\right|\xrightarrow{P}0\quad\text{as }n_\ell\to\infty,\qquad(9)$$

where

$$h_{jk}^{(\ell)}(x)=\frac{H_k^{(\ell)}(x)}{a_j\,y_{jk}^{(\ell)}(x)}=W_k^{(\ell)}(x)\,\frac{a_{j'}S_{j'k}(x)}{S_k^{(\ell)}(x)},\qquad j'=3-j,$$

and note that $0\le H_k^{(\ell)}(x)<\infty$ for $x\in[0,\tau_\ell]$, $H_k^{(\ell)}(x)=0$ for $\tau_\ell<x$, and $0\le h_{jk}^{(\ell)}(x)<\infty$ for all $x$. The univariate asymptotic result provides $E(e^{\mathrm{i}z_{k\ell}M_k^{(\ell)}(t)})\to\exp\!\big(-\frac{z_{k\ell}^2}{2}V_{kk}(t\wedge\tau_\ell,\,t\wedge\tau_\ell)\big)$ as $n_\ell\to\infty$, which corresponds to the convergences

$$E\big(G_k^{(\ell)}(t)\big)\to1\quad\text{and}\quad\big\langle M_k^{(\ell)},M_k^{(\ell)}\big\rangle(t)\xrightarrow{P}V_{kk}(t\wedge\tau_\ell,\,t\wedge\tau_\ell)$$

(Nishiyama 2011). For different outcomes $k\ne k'$, it is difficult to show joint normality with correlation between $M_k^{(\ell)}$ and $M_{k'}^{(\ell')}$ using the standard martingale theory of counting processes (Fleming and Harrington 1991; Andersen et al. 1993). However, we overcome this challenge by applying Ito's formula. The discrete Ito formula (Jacod and Shiryaev 2003; Huang and Strawderman 2006) provides the decomposition of $G_k^{(\ell)}(t)$,

$$G_k^{(\ell)}(t)-1=\sum_{j=1,2}\int_0^t G_k^{(\ell)}(x)\,\tilde H_{jk}^{a(\ell)}(x)\,d\bar M_{jk}^{(\ell)}(x)+\sum_{j=1,2}\int_0^t G_k^{(\ell)}(x)\,\tilde H_{jk}^{(\ell)}(x)\,\bar Y_{jk}^{(\ell)}(x)\,d\Lambda_{jk}(x),\qquad(10)$$

where, with $\mathrm{i}_1=-\mathrm{i}$ and $\mathrm{i}_2=\mathrm{i}$,

$$\tilde H_{jk}^{a(\ell)}(x)=\exp\!\left(\mathrm{i}_j z_{k\ell}\frac{\sqrt{n_\ell}\,\hat H_k^{(\ell)}(x)}{\bar Y_{jk}^{(\ell)}(x)}\right)-1,$$
$$\tilde H_{jk}^{(\ell)}(x)=\exp\!\left(\mathrm{i}_j z_{k\ell}\frac{\sqrt{n_\ell}\,\hat H_k^{(\ell)}(x)}{\bar Y_{jk}^{(\ell)}(x)}\right)-1-\mathrm{i}_j z_{k\ell}\frac{\sqrt{n_\ell}\,\hat H_k^{(\ell)}(x)}{\bar Y_{jk}^{(\ell)}(x)}+\frac{z_{k\ell}^2}{2}\left(\frac{\sqrt{n_\ell}\,\hat H_k^{(\ell)}(x)}{\bar Y_{jk}^{(\ell)}(x)}\right)^2.$$

The expectation of the right-hand side of (10) converges to zero as $n_\ell\to\infty$, because

$$E\!\left(\int_0^t G_k^{(\ell)}(x)\tilde H_{jk}^{a(\ell)}(x)\,d\bar M_{jk}^{(\ell)}(x)\right)=0\quad\text{and}\quad E\!\left(\int_0^t G_k^{(\ell)}(x)\tilde H_{jk}^{(\ell)}(x)\bar Y_{jk}^{(\ell)}(x)\,d\Lambda_{jk}(x)\right)\to0\qquad(11)$$

by the martingale property of $\bar M_{jk}^{(\ell)}$ and the Lindeberg condition, respectively. In fact, using the integrable martingale property of $G_k^{(\ell)}(x)$ and the well-known inequality

$$\left|e^{\mathrm{i}c}-1-\mathrm{i}c+\tfrac{1}{2}c^2\right|\le 1\{|c|\le\varepsilon\}\,|c|^3+1\{|c|>\varepsilon\}\,c^2$$

for any real $c$, the latter result of (11) is obtained as

$$\left|E\!\left(\int_0^t G_k^{(\ell)}(x)\tilde H_{jk}^{(\ell)}(x)\bar Y_{jk}^{(\ell)}(x)\,d\Lambda_{jk}(x)\right)\right|\le\exp\!\left(\frac{z_{k\ell}^2}{2}\big\langle M_k^{(\ell)},M_k^{(\ell)}\big\rangle(t)\right)\times\left\{E\!\left(\int_0^t |c_{jk\ell}(x)|^3\,1\{|c_{jk\ell}(x)|\le\varepsilon\}\,\bar Y_{jk}^{(\ell)}(x)\,d\Lambda_{jk}(x)\right)+E\!\left(\int_0^t c_{jk\ell}(x)^2\,1\{|c_{jk\ell}(x)|>\varepsilon\}\,\bar Y_{jk}^{(\ell)}(x)\,d\Lambda_{jk}(x)\right)\right\}\to0$$

as $n_\ell\to\infty$, where $\varepsilon$ is an arbitrary positive number, $c_{jk\ell}(x)=z_{k\ell}\sqrt{n_\ell}\,\hat H_k^{(\ell)}(x)/\bar Y_{jk}^{(\ell)}(x)$, and $\sqrt{n_\ell}\,c_{jk\ell}(x)\xrightarrow{P}z_{k\ell}h_{jk}^{(\ell)}(x)$ uniformly on $(0,\tau_\ell]$ from (9). Hence, we have

$$E\big((G_1^{(\ell)}(t)-1)(G_2^{(\ell')}(s)-1)\big)-\big\{E\big(G_1^{(\ell)}(t)\,G_2^{(\ell')}(s)\big)-1\big\}\to0\qquad(12)$$

as $n_\ell,n_{\ell'}\to\infty$ by the univariate results $E(G_k^{(\ell)}(t))\to1$, while using the formula (10) we can also find

$$E\big((G_1^{(\ell)}(t)-1)(G_2^{(\ell')}(s)-1)\big)-\sum_{j=1,2}\int_0^t\!\!\int_0^s E\big(G_1^{(\ell)}(x)\,G_2^{(\ell')}(y)\,\tilde H_{j1}^{a(\ell)}(x)\,\tilde H_{j2}^{a(\ell')}(y)\,d\bar M_{j1}^{(\ell)}(x)\,d\bar M_{j2}^{(\ell')}(y)\big)\to0\qquad(13)$$

as $n_\ell,n_{\ell'}\to\infty$. Similarly to showing the latter result of (11), up to asymptotic equality we can replace the terms $e^{\mathrm{i}_j(c_{j1\ell}(x)+c_{j2\ell'}(y))}$ and $e^{\mathrm{i}_j c_{jk\ell}(\cdot)}$ included in (13) by $1+\mathrm{i}_j\{c_{j1\ell}(x)+c_{j2\ell'}(y)\}-\frac12\{c_{j1\ell}(x)+c_{j2\ell'}(y)\}^2$ and $1+\mathrm{i}_j c_{jk\ell}(\cdot)-\frac12 c_{jk\ell}(\cdot)^2$, respectively. In fact, we can show that

$$\tilde H_{j1}^{a(\ell)}(x)\,\tilde H_{j2}^{a(\ell')}(y)=e^{\mathrm{i}_j(c_{j1\ell}(x)+c_{j2\ell'}(y))}-e^{\mathrm{i}_j c_{j1\ell}(x)}-e^{\mathrm{i}_j c_{j2\ell'}(y)}+1=-c_{j1\ell}(x)\,c_{j2\ell'}(y)+o_P\!\left(\frac{1}{\sqrt{n_\ell n_{\ell'}}}\right)$$

from the convergence result for $\sqrt{n_\ell}\,c_{jk\ell}(x)$. Hence, we have

$$\sqrt{n_\ell n_{\ell'}}\,\tilde H_{j1}^{a(\ell)}(x)\,\tilde H_{j2}^{a(\ell')}(y)\xrightarrow{P}-z_{1\ell}\,z_{2\ell'}\,h_{j1}^{(\ell)}(x)\,h_{j2}^{(\ell')}(y)\qquad(14)$$

as $n_\ell,n_{\ell'}\to\infty$, so that we can apply this result to (13). Also, similarly to Prentice and Cai (1992) and Sugimoto et al. (2013, 2017), we can show

$$\frac{1}{\hat a_j n}E\big(d\bar M_{j1}^{(\ell)}(x)\,d\bar M_{j2}^{(\ell')}(y)\big)=E\big(dM_{i1}^{(\ell)}(x)\,dM_{i2}^{(\ell')}(y)\,\big|\,g_i=j\big)=C(x\wedge y)\,A_j(dx,dy).$$

For simplicity, let $\phi(t,s)=E(G_1^{(\ell)}(t)\,G_2^{(\ell')}(s))$. From (12), (13), (14), $\hat\gamma\xrightarrow{P}\gamma$, $\hat a_j\xrightarrow{P}a_j$ (Conditions 1-2) and the dominated convergence theorem, we have the integral equation for $\phi(t,s)$ as $n_\ell,n_{\ell'}\to\infty$,

$$\phi(t,s)-1=-z_{1\ell}\,z_{2\ell'}\,\gamma_\ell\gamma_{\ell'}\int_0^t\!\!\int_0^s\phi(x,y)\sum_{j=1}^{2}a_j\,h_{j1}^{(\ell)}(x)\,h_{j2}^{(\ell')}(y)\,C(x\wedge y)\,A_j(dx,dy).\qquad(15)$$

Similarly to the bivariate survival function (Dabrowska 1988), the two-dimensional Volterra integral equation

$$\phi(t,s)=1+\int_0^t\!\!\int_0^s\phi(x,y)\,b_{12}(dx,dy)\quad\text{with}\quad\phi(t,0)=\phi(0,s)=1$$

is solved as $\phi(t,s)=\exp\!\left[\int_0^t\!\!\int_0^s\{b_{12}(dx,dy)-b_1(dx,y)\,b_2(x,dy)\}\right]$, where

$$b_1(dx,y)=\frac{\phi(dx,y)}{\phi(x,y)}\quad\text{and}\quad b_2(x,dy)=\frac{\phi(x,dy)}{\phi(x,y)}.$$

However, note that it is difficult to obtain $b_k(x,y)$, $k=1,2$, by directly differentiating (15), because (15) includes the expectation of the non-differentiable $M_{i1}^{(\ell)}(x)$ and $M_{i2}^{(\ell')}(y)$. Alternatively, we can use the formula (10) again for this purpose, so that, by a discussion similar to that leading to (15), as $n_\ell,n_{\ell'}\to\infty$ we have

$$\phi(dx,y)=E\big(dG_1^{(\ell)}(x)\,dG_2^{(\ell')}(y)\big)+E\big(dG_1^{(\ell)}(x)\,G_2^{(\ell')}(y)\big)\simeq\phi(x,y)\,E\Big(\sum_{j=1,2}\tilde H_{j1}^{a(\ell)}(x)\,d\bar M_{j1}^{(\ell)}(x)\Big)=0.$$

This yields $\iint b_1(dx,y)\,b_2(x,dy)=0$. Hence, the solution of (15) is

$$\phi(t,s)=\exp\big(-z_{1\ell}\,z_{2\ell'}\,V_{12}(t\wedge\tau_\ell,\,s\wedge\tau_{\ell'})\big).$$

Therefore, if $E(d\bar M_{j1}^{(\ell)}(x)\,d\bar M_{j2}^{(\ell')}(y))\ne0$, the correlation between the two martingales is at work, so that $E(G_1^{(\ell)}(t)\,G_2^{(\ell')}(s))$ does not converge to 1, but we conclude

$$E\big(G_1^{(\ell)}(t)\,G_2^{(\ell')}(s)\big)\,\phi(t,s)^{-1}\to1\quad\text{as }n_L\ge\cdots\ge n_1\to\infty.$$

In summary, since $e^{\mathrm{i}z_{k\ell}M_k^{(\ell)}(t)}=G_k^{(\ell)}(t)\exp\!\big(-\frac{z_{k\ell}^2}{2}\langle M_k^{(\ell)},M_k^{(\ell)}\rangle(t)\big)$ and the predictable covariations converge in probability to deterministic limits, these results provide that the characteristic function of the marginal martingale vector $(M_k^{(\ell)}(t),M_{k'}^{(\ell')}(s))^{\mathsf T}$ converges to that of a bivariate normal distribution:

$$E\big(e^{\mathrm{i}z_{k\ell}M_k^{(\ell)}(t)+\mathrm{i}z_{k'\ell'}M_{k'}^{(\ell')}(s)}\big)\to\exp\!\Big(-\tfrac12 z_{k\ell}^2\,V_{kk}(t\wedge\tau_\ell,\,t\wedge\tau_\ell)-z_{k\ell}z_{k'\ell'}\,V_{kk'}(t\wedge\tau_\ell,\,s\wedge\tau_{\ell'})-\tfrac12 z_{k'\ell'}^2\,V_{k'k'}(s\wedge\tau_{\ell'},\,s\wedge\tau_{\ell'})\Big),$$

which equals $\exp\!\big(-2z_{k\ell}^2\,V_{kk}(t\wedge\tau_\ell,\,s\wedge\tau_\ell)\big)$ if $k=k'$ and $\ell=\ell'$, and $\exp\!\big(-\tfrac12\{z_{k\ell}V_{kk}(t\wedge\tau_\ell,\,s\wedge\tau_{\ell'})^{1/2}+z_{k\ell'}V_{kk}(t\wedge\tau_\ell,\,s\wedge\tau_{\ell'})^{1/2}\}^2\big)$ if $k=k'$ and $\ell\ne\ell'$, and takes the general form above otherwise.

A replication of the similar discussion provides that $(M_1^{(1)}(t),M_1^{(2)}(t),M_2^{(1)}(t),M_2^{(2)}(t))$ converges in distribution to a multivariate normal distribution with zero mean vector and covariance matrix whose lower triangle is

$$\begin{pmatrix}V_{11}(t\wedge\tau_1,s\wedge\tau_1)&&&\\ V_{11}(t\wedge\tau_2,s\wedge\tau_1)&V_{11}(t\wedge\tau_2,s\wedge\tau_2)&&\\ V_{21}(t\wedge\tau_1,s\wedge\tau_1)&V_{21}(t\wedge\tau_1,s\wedge\tau_2)&V_{22}(t\wedge\tau_1,s\wedge\tau_1)&\\ V_{21}(t\wedge\tau_2,s\wedge\tau_1)&V_{12}(t\wedge\tau_2,s\wedge\tau_2)&V_{22}(t\wedge\tau_2,s\wedge\tau_1)&V_{22}(t\wedge\tau_2,s\wedge\tau_2)\end{pmatrix}.$$

These results lead immediately to the convergence of $\hat Z-D_n\hat\mu$ in distribution to $Z^{*}-D_n\mu$, as summarized in Theorem 1. □

B. Some additional results

Table 2 of Sect. 4 displays the results obtained under the assumption of a late time-dependent association (Clayton copula) for the joint survival distribution of the two event-time outcomes. Readers may be interested in how the results change if other types of dependency between the two outcomes are assumed. In Table B.1, we provide design-stage results calculated under the same assumptions as Table 2, except that the joint survival distribution is replaced by an early time-dependent association (Gumbel copula). The pattern of the MSS, MEN, and AEN results under the Gumbel copula is quite similar to Table 2, but, as the correlation increases, their reduction rates relative to the values at zero correlation are slightly larger than those under the Clayton copula.

Table B.1.

Sample sizes, number of events, and empirical powers in a group-sequential trial with two co-primary outcomes under an early time-dependent association (Gumbel copula).

Corr.   FSS*   Group-sequential design                   Empirical power (%)
 ρj            MSS   MEN (OC1, OC2)   AEN (OC1, OC2)     Both EP   At least one EP   OC1    OC2
0.0     830    835     168    335       141    293        80.6         99.3          95.3   84.5
0.1     824    829     166    332       139    290        80.6         99.0          95.2   84.4
0.2     818    823     165    330       138    289        80.4         98.6          95.2   83.9
0.3     812    817     164    327       137    286        80.5         98.1          94.9   83.7
0.4     805    810     163    325       136    285        80.5         97.7          94.8   83.4
0.5     799    804     161    322       134    282        80.6         97.3          94.6   83.3
0.6     792    797     160    319       133    280        80.3         96.7          94.2   82.7
0.7     786    791     159    317       132    279        80.6         96.3          94.2   82.7
0.8     780    785     158    315       132    277        80.3         96.0          94.1   82.2
0.9     776    781     157    313       131    276        80.7         95.8          94.2   82.3
0.95    775    780     157    313       131    276        80.4         95.6          94.0   82.0

* FSS: sample size required for the fixed-sample design. The "OC1" and "OC2" columns under "Empirical power" are the single-endpoint powers.

This table is created under the same settings and descriptions as those of Table 2, except for the association between the two outcomes OC1 and OC2. The joint survival distribution is modeled using the Gumbel copula, which provides an early time-dependent association.

As one referee indicated, an important concern is whether the Type I error rates are controlled, since the proposed design method is based on asymptotic results. To address this, we evaluate the behavior of the actual Type I error rates under the sample sizes calculated by the proposed methods. Using the ARDENT study, we consider three settings of (ψ1, ψ2) = (1.0, 1.0), (0.565, 1.0), and (1.0, 0.721) (both null hypotheses and the two marginal null hypotheses) under the same configurations as Sect. 4, and we examine the behavior via Monte-Carlo simulation with 1,000,000 runs. In the simulation, a trial ended at the planned follow-up duration. When the observed numbers of events were larger than the planned ones, the critical value at the final analysis was recalculated based on

$$1-P\big(Z_{k1}<c_{k1},\dots,Z_{kL}<\tilde c_{kL}\mid H_{0k}\big)=\alpha_k,$$

where $\tilde c_{kL}$ is the critical value at the final analysis, recalculated so that the above equation is satisfied in order to control the Type I error rate adequately when the planned numbers differ from the observed ones.
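For L = 2, this recalculation reduces to solving a single equation for the final critical value. The sketch below is our own (assuming scipy is available); it fixes the first-look boundary `c1`, uses the observed information fraction of the first analysis, and solves 1 − P(Z1 < c1, Z2 < c2) = α for c2. It assumes the error remaining after the first look is positive; otherwise no solution exists in the bracket.

```python
import numpy as np
from scipy.optimize import brentq
from scipy.stats import multivariate_normal

def recalc_final_bound(c1, t1_obs, alpha=0.025):
    """Recalculate the final critical value so that the overall one-sided
    Type I error equals alpha, given the first-look boundary c1 and the
    observed information fraction t1_obs of the first analysis."""
    rho = np.sqrt(t1_obs)  # correlation between the two sequential statistics
    cov = [[1.0, rho], [rho, 1.0]]
    def excess_error(c2):
        joint = multivariate_normal.cdf([c1, c2], mean=[0.0, 0.0], cov=cov)
        return (1.0 - joint) - alpha
    return brentq(excess_error, 0.0, 8.0)
```

With the planned values c1 = 2.8616 and t1_obs = 0.5314, this recovers the planned final boundary; with a different observed information fraction, the final boundary shifts accordingly.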

Tables B.2 and B.3 show the actual Type I error rates corresponding to the null-hypothesis situations of Tables 2 and B.1 under the Clayton and Gumbel copulas, respectively. The columns "Both" and "ALO" give the probabilities of rejecting the two null hypotheses of OC1 and OC2 jointly (Both) and at least one of them (ALO), while "OC1" and "OC2" give the probabilities of rejecting each single hypothesis. The results for "Both" are well controlled at the nominal error rate of 2.5% in the three cases. Those for "ALO" are less than 2 × 2.5% when both null hypotheses hold, reflecting the multiplicity of testing twice. The results for "OC1" and "OC2" are also well controlled at the nominal Type I error rate in the three cases. Therefore, our method controls the nominal Type I error rate well under the calculated sample size.

Table B.2.

Simulation assessment: Probability of rejecting null hypothesis under Clayton copula.

Corr.        (ψ1, ψ2) = (0.565, 1.0)       (ψ1, ψ2) = (1.0, 0.721)      (ψ1, ψ2) = (1.0, 1.0)
 ρj    MSS   Both   ALO   OC1   OC2        Both   ALO   OC1   OC2       Both   ALO   OC1   OC2
0.0    835   2.39   95.4  95.3  2.50       2.10   85.0  2.49  84.6      0.06   4.91  2.48  2.49
0.1    833   2.39   95.4  95.3  2.50       2.15   84.8  2.50  84.5      0.08   4.94  2.50  2.52
0.2    832   2.42   95.3  95.3  2.51       2.20   84.7  2.51  84.4      0.08   4.91  2.49  2.51
0.3    831   2.44   95.3  95.2  2.51       2.24   84.6  2.50  84.4      0.10   4.92  2.51  2.52
0.4    829   2.44   95.2  95.2  2.50       2.27   84.6  2.51  84.3      0.12   4.90  2.50  2.52
0.5    827   2.43   95.2  95.1  2.48       2.30   84.3  2.50  84.2      0.15   4.87  2.53  2.49
0.6    825   2.47   95.1  95.1  2.51       2.33   84.3  2.49  84.2      0.18   4.80  2.46  2.51
0.7    821   2.48   95.0  95.0  2.51       2.38   84.0  2.49  83.9      0.23   4.80  2.51  2.52
0.8    816   2.52   94.9  94.9  2.53       2.45   83.8  2.52  83.7      0.30   4.69  2.49  2.51
0.9    806   2.51   94.7  94.4  2.52       2.48   83.3  2.51  83.3      0.45   4.50  2.48  2.47
0.95   797   2.51   94.5  94.5  2.51       2.46   82.8  2.47  82.8      0.62   4.39  2.51  2.50

Table B.3.

Simulation assessment: Probability of rejecting null hypothesis under Gumbel copula.

Corr.        (ψ1, ψ2) = (0.565, 1.0)       (ψ1, ψ2) = (1.0, 0.721)      (ψ1, ψ2) = (1.0, 1.0)
 ρj    MSS   Both   ALO   OC1   OC2        Both   ALO   OC1   OC2       Both   ALO   OC1   OC2
0.0    835   2.40   95.4  95.3  2.51       2.11   84.9  2.51  84.5      0.06   4.94  2.49  2.51
0.1    829   2.44   95.3  95.2  2.50       2.26   84.5  2.50  84.2      0.12   4.89  2.51  2.50
0.2    823   2.46   95.0  95.0  2.49       2.36   84.1  2.50  84.0      0.18   4.84  2.51  2.51
0.3    817   2.47   94.9  94.9  2.49       2.42   83.8  2.50  83.8      0.27   4.75  2.50  2.52
0.4    810   2.51   94.8  94.8  2.52       2.47   83.6  2.51  83.5      0.35   4.63  2.48  2.50
0.5    804   2.52   94.6  94.6  2.52       2.50   83.2  2.52  83.2      0.47   4.55  2.53  2.50
0.6    797   2.51   94.5  94.5  2.51       2.49   82.8  2.49  82.8      0.57   4.42  2.50  2.49
0.7    791   2.48   94.3  94.3  2.48       2.52   82.6  2.52  82.6      0.69   4.27  2.49  2.47
0.8    785   2.52   94.1  94.1  2.52       2.49   82.3  2.50  82.3      0.82   4.17  2.50  2.49
0.9    781   2.51   94.0  94.0  2.51       2.50   82.1  2.50  82.1      0.95   4.06  2.49  2.52
0.95   780   2.52   94.0  94.0  2.52       2.51   82.1  2.51  82.1      0.99   4.05  2.51  2.54

Table 1 of Sect. 4 displays the planning information for a group-sequential design at the fixed analysis time points (48 and 96 weeks) considered in the ARDENT trial. Other group-sequential designs based on selected information fractions can also be constructed. Table B.4 displays the planning information for a group-sequential design with information fractions of 0.5 and 1.0.

Table B.4.

Variance, calendar time and information fraction corresponding to the other endpoint’s information fraction

Endpoint fixing the analysis times                          1st analysis   Final analysis
Virologic failure (OC1): information fraction                   0.5             1.0
  Corresponding calendar time (week)                            45.5            96.0
  OC1: $V_{11}^{0(\ell)}(\tau_\ell)$                            0.0252          0.0499
  OC2 (regimen failure): $V_{22}^{0(\ell)}(\tau_\ell)$          0.0539          0.0998
    Corresponding information fraction                          0.5400          1.0
Regimen failure (OC2): information fraction                     0.5             1.0
  Corresponding calendar time (week)                            42.0            96.0
  OC1 (virologic failure): $V_{11}^{0(\ell)}(\tau_\ell)$        0.0233          0.0499
    Corresponding information fraction                          0.4675          1.0
  OC2: $V_{22}^{0(\ell)}(\tau_\ell)$                            0.0502          0.0998
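Consistent with the usual information-fraction definition for logrank statistics, each "corresponding information fraction" in Table B.4 equals the ratio of the variance at that analysis to the variance at the final analysis. The check below is our own sanity check against the rounded table values (tolerance 0.01 to absorb the rounding of the tabulated variances):

```python
# (interim variance, final variance, tabulated information fraction) from Table B.4
rows = [
    (0.0252, 0.0499, 0.5),     # OC1 at its own 50% look (week 45.5)
    (0.0539, 0.0998, 0.5400),  # OC2 at OC1's 50% look (week 45.5)
    (0.0233, 0.0499, 0.4675),  # OC1 at OC2's 50% look (week 42.0)
    (0.0502, 0.0998, 0.5),     # OC2 at its own 50% look (week 42.0)
]
for v_interim, v_final, tabulated in rows:
    assert abs(v_interim / v_final - tabulated) < 0.01
print("all tabulated information fractions match the variance ratios")
```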


Contributor Information

Tomoyuki Sugimoto, Department of Mathematics and Computer Science, Graduate School of Science and Technology, Kagoshima University, 1-21-35 Korimoto, Kagoshima 890-8580, Japan.

Toshimitsu Hamasaki, Department of Data Science, National Cerebral and Cardiovascular Center, 5-7-1 Fujishiro-dai, Suita, Osaka 565-8565, Japan.

Scott R. Evans, Epidemiology and Biostatistics and the Center for Biostatistics, George Washington University, 6110 Executive Boulevard, Suite 750, Rockville, MD 20852-3943, USA.

Susan Halabi, Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27705, USA.

References

  1. Andersen PK, Borgan Ø, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer-Verlag, New York. [Google Scholar]
  2. Andrei A-C, Murray S (2005) Simultaneous group sequential analysis of rank-based and weighted Kaplan-Meier tests for paired censored survival data. Biometrics 61:715–720. [DOI] [PubMed] [Google Scholar]
  3. Asakura K, Hamasaki T, Sugimoto T, Hayashi K, Evans SR, Sozu T (2014) Sample size determination in group-sequential clinical trials with two co-primary endpoints. Stat Med 33:2897–2913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Clayton DG (1978) A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease. Biometrika 65:141–151. [Google Scholar]
  5. Collett D (2003) Modelling survival data in medical research, 2nd edn. Chapman & Hall/CRC, Boca Raton. [Google Scholar]
  6. Cook RJ, Farewell VT (1994) Guidelines for monitoring efficacy and toxicity responses in clinical trials. Biometrics 50:1146–1152. [PubMed] [Google Scholar]
  7. Dabrowska DM (1988) Kaplan-Meier estimate on the plane. Ann Stat 16:1475–1489. [Google Scholar]
  8. Dmitrienko A, Tamhane AC, Bretz F (2009) Multiple Testing Problems in Pharmaceutical Statistics. Chapman & Hall/CRC, Boca Raton. [Google Scholar]
  9. Fleming TR, Harrington DP (1991) Counting processes and survival analysis. John Wiley & Sons, New York. [Google Scholar]
  10. Glimm E, Maurer W, Bretz F (2009) Hierarchical testing of multiple endpoints in group-sequential trials. Stat Med 29:219–228. [DOI] [PubMed] [Google Scholar]
  11. Gombay E (2008) Weighted logrank statistics in sequential tests. Sequential Anal 27:97–104. [Google Scholar]
  12. Gordon LKK, Lachin JM (1990) Implementation of group sequential logrank tests in a maximum duration trial. Biometrics 46:759–770. [PubMed] [Google Scholar]
  13. Gu MG, Lai TL (1991) Weak convergence of time-sequential censored rank statistics with applications to sequential testing in clinical trials. Ann Stat 19:1403–1433. [Google Scholar]
  14. Halabi S (2012) Adjustment on the type I error rate for a clinical trial monitoring for both intermediate and primary endpoints. J Biom Biostat 7:15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Herbst RS, Redman MW, Kim ES, Semrad TJ, Bazhenova L, Masters G, Oettel K, Guaglianone P, Reynolds C, Karnad A, Arnold SM, Varella-Garcia M, Moon J, Mack PC, Blanke CD, Hirsch FR, Kelly K, Gandara DR (2018) Cetuximab plus carboplatin and paclitaxel with or without bevacizumab versus carboplatin and paclitaxel with or without bevacizumab in advanced NSCLC (SWOG S0819): a randomised, phase 3 study. Lancet Oncol 19: 101–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hamasaki T, Asakura K, Evans SR, Sugimoto T, Sozu T (2015) Group-sequential strategies in clinical trials with multiple co-primary endpoints. Stat Biopharm Res 7:36–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hougaard P (1986) A class of multivariate failure time distributions. Biometrika 73:671–678. [Google Scholar]
  18. Hsu L, Prentice RL (1996) On assessing the strength of dependency between failure time variables. Biometrika 83:491–506. [Google Scholar]
  19. Huang X, Strawderman RL (2006) A note on the Breslow survival estimator. J Nonparametr Stat 18:45–56. [Google Scholar]
  20. Hung HMJ, Wang SJ, O’Neill RT (2007) Statistical considerations for testing multiple endpoints in group sequential or adaptive clinical trials. J Biopharm Stat 17:1201–1210. [DOI] [PubMed] [Google Scholar]
  21. Jacod J, Shiryaev AN (2003) Limit theorems for stochastic processes, 2nd edn. Springer-Verlag, Berlin-Heidelberg. [Google Scholar]
  22. Jennison C, Turnbull BW (2000) Group sequential methods with applications to clinical trials. Chapman & Hall/CRC, Boca Raton. [Google Scholar]
  23. Jung S-H (2008) Sample size calculation for the weighted rank statistics with paired survival data. Stat Med 27:3350–3365. [DOI] [PubMed] [Google Scholar]
  24. Kosorok MR, Shi Y, DeMets DL (2004) Design and analysis of group-sequential clinical trials with multiple primary endpoints. Biometrics 60:134–145. [DOI] [PubMed] [Google Scholar]
  25. Lai TL, Shih M-C (2004) Power, sample size and adaptation considerations in the design of group sequential clinical trials. Biometrika 91:507–528. [Google Scholar]
  26. Lan KKG, DeMets DL (1983) Discrete sequential boundaries for clinical trials. Biometrika 70:659–663. [Google Scholar]
  27. Lennox JL, Landovitz RJ, Ribaudo HJ, Ofotokun I, Na LH, Godfrey C, Kuritzkes DR, Sagar M, Brown TT, Cohn SE, McComsey GA, Aweeka F, Fichtenbaum CJ, Presti RM, Koletar SL, Haas DW, Patterson KB, Benson CA, Baugh BP, Leavitt RY, Rooney JF, Seekins D, Currier JS (2014) A phase III comparative study of the efficacy and tolerability of three non-nucleoside reverse transcriptase inhibitor-sparing antiretroviral regimens for Treatment-naïve HIV-1-infected volunteers: A randomized, controlled trial. Annals of Internal Medicine 161:461–471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Lin DY, Shen L, Ying Z, Breslow NE (1996) Group sequential designs for monitoring survival probabilities. Biometrics 52:1033–1041. [PubMed] [Google Scholar]
  29. Lin DY (1991) Nonparametric sequential testing in clinical trials with incomplete multivariate observations. Biometrika 78:123–131. [Google Scholar]
  30. Murray S (2000) Nonparametric rank-based methods for group sequential monitoring of paired censored survival data. Biometrics 56:984–990. [DOI] [PubMed] [Google Scholar]
  31. Nishiyama Y (2011) Statistical analysis by the theory of martingales. Kindaikagakusha, Tokyo. (in Japanese) [Google Scholar]
  32. O’Brien PC, Fleming TR (1979) A multiple testing procedure for clinical trials. Biometrics 35:549–556. [PubMed] [Google Scholar]
  33. Pocock SJ, Geller NL, Tsiatis AA (1987) The analysis of multiple endpoints in clinical trials. Biometrics 43:487–498. [PubMed] [Google Scholar]
  34. Pocock SJ (1977) Group sequential methods in the design and analysis of clinical trials. Biometrika 64:191–199. [Google Scholar]
  35. Prentice RL, Cai J (1992) Covariance and survivor function estimation using censored multivariate failure time data. Biometrika 79:495–512. [Google Scholar]
  36. Rauch G, Schüler S, Wirths M, Stefan E, Kieser M (2016) Adaptive designs for two candidate primary time-to-event endpoints. Stat Biopharm Res 8:207–216. [Google Scholar]
  37. Slud EV, Wei LJ (1982) Two-sample repeated significance tests based on the modified Wilcoxon statistic. J Am Stat Assoc 77: 862–868. [Google Scholar]
  38. Sugimoto T, Hamasaki T, Sozu T, Evans SR (2017) Sizing clinical trials when comparing bivariate time- to-event outcomes. Stat Med 36:1363–1382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Sugimoto T, Sozu T, Hamasaki T, Evans SR (2013) A logrank test-based method for sizing clinical trials with two co-primary time-to-event endpoints. Biostatistics 14:409–421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Tamhane AC, Mehta CR, Liu L (2010) Testing a primary and secondary endpoint in a group sequential design. Biometrics 66:1174–1184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Tamhane AC, Wu Y, Mehta C (2012) Adaptive extensions of a two-stage group sequential procedure for testing primary and secondary endpoints (I): unknown correlation between the endpoints. Stat Med 31:2027–2040. [DOI] [PubMed] [Google Scholar]
  42. Tang DI, Gnecco C, Geller NL (1989) Design of group sequential clinical trials with multiple endpoints. J Am Stat Assoc 84:776–779. [Google Scholar]
  43. Tsiatis AA, Boucher H, Kim K (1995) Sequential methods for parametric survival models. Biometrika 82:165–173. [Google Scholar]
  44. Tsiatis AA (1982) Group sequential methods for survival analysis with staggered entry. In: Crowley J, Johnson RA (eds) Survival analysis. IMS Lecture Notes, Hayward, CA, pp 257–268. [Google Scholar]
  45. Wei LJ, Su JQ, Lachin JM (1990) Interim analyses with repeated measurements in a sequential clinical trial. Biometrika 77:359–364. [Google Scholar]
  46. Wei LJ, Lachin JM (1984) Two-sample asymptotically distribution-free tests for incomplete multivariate observations. J Am Stat Assoc 79:653–661. [Google Scholar]
  47. Wu J, Xiong X (2017) Group-sequential survival trial design and monitoring using the log-rank test. Stat Biopharm Res 9:35–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Yin G (2012) Clinical trial design: Bayesian and frequentist adaptive methods. John Wiley & Sons, New York. [Google Scholar]
