Cumulative Hazard Ratio Estimation for Treatment Regimes in Sequentially Randomized Clinical Trials

Xinyu Tang; Abdus S Wahed

doi:10.1007/s12561-013-9089-6

Author manuscript; available in PMC: 2015 Jun 15.

Published in final edited form as: Stat Biosci. 2013 May 17;7(1):1–18. doi: 10.1007/s12561-013-9089-6


PMCID: PMC4467029 NIHMSID: NIHMS594885 PMID: 26085847

Abstract

The proportional hazards model is widely used in survival analysis to allow adjustment for baseline covariates. The proportional hazard assumption may not be valid for treatment regimes that depend on intermediate responses to prior treatments received, and it is not clear how such a model can be adapted to clinical trials employing more than one randomization. Besides, since treatment is modified post-baseline, the hazards are unlikely to be proportional across treatment regimes. Although Lokhnygina and Helterbrand (Biometrics 63: 422–428, 2007) introduced the Cox regression method for two-stage randomization designs, their method can only be applied to test the equality of two treatment regimes that share the same maintenance therapy. Moreover, their method does not allow auxiliary variables to be included in the model nor does it account for treatment effects that are not constant over time. In this article, we propose a model that assumes proportionality across covariates within each treatment regime but not across treatment regimes. Comparisons among treatment regimes are performed by testing the log ratio of the estimated cumulative hazards. The ratio of the cumulative hazard across treatment regimes is estimated using a weighted Breslow-type statistic. A simulation study was conducted to evaluate the performance of the estimators and proposed tests.

Keywords: Cumulative treatment effect, Non-proportional hazards, Sequentially randomized clinical trial, Stratified proportional hazards model

1 Introduction

It is common practice in treatments of chronic diseases for patients to start with an initial therapy based on specific diagnoses. Patients stay on this initial therapy until a predetermined milestone, commonly referred to as response, is achieved, or physicians determine that the treatment is not providing the intended benefit, commonly referred to as non-response. In either case, further treatment could be suggested, which could be a rescue therapy if patients show primary resistance to the initial therapy, or a maintenance therapy if a response is observed. Thus, treatments applied at later stages depend on the responses to the initial therapies, and a comparison across treatments at various stages would be misleading. Instead, dynamic treatment sequences are usually formed based on stage-specific treatments and responses to these treatments during the course of therapy. In order to study the effect of a treatment sequence on the survival outcome, sequentially randomized designs are used in clinical trials. In a sequentially randomized clinical trial, eligible patients are first randomized to receive one of the initial therapies. Patients reaching the second stage will continue to participate in a second randomization to either rescue or maintenance therapies. The procedure continues. This design results in different treatment regimes consisting of an initial therapy, the intermediate response and a second-stage therapy. The analysis and comparisons of such treatment regimes can be done based on classic survival methods, weighted using inverse probabilities of treatment allocations.

The idea of inverse-probability weighting (IPW) to estimate treatment regime effect in two-stage randomization designs was first introduced by Lunceford, Davidian and Tsiatis [2]. Since then the strategy of IPW has been incorporated into many other classical statistical methods to make them applicable to two-stage randomization designs. For example, Guo and Tsiatis [3] extended the Aalen–Nelson estimator to a Weighted Risk Set Estimator (WRSE) using IPW. Hernan, Lanoy, Costagliola and Robins [4] applied IPW in comparing treatment strategies from observational studies using a Cox proportional hazards model under artificial censoring. Lokhnygina and Helterbrand [1] incorporated IPW into the Cox regression method for two-stage randomization designs. Feng and Wahed [5] presented a modified weighted log-rank test for comparing different treatment strategies using IPW. Miyahara and Wahed [6] introduced IPW to Kaplan–Meier estimators with both fixed and time-dependent weights. More recently, Goldberg and Kosorok [7] used IPW methods in Q-learning to estimate treatment effect for censored data from multistage designs. However, except for the work of Hernan et al. [4], all the aforementioned models failed to adjust for baseline covariates.

Proportional hazards models are widely used in analyzing data from clinical trials with time-to-event endpoints. The advantage of the proportional hazards model over nonparametric methods (e.g. a log-rank test) is that auxiliary covariates can be included in the model to explain more of the variability in the outcome. Even in randomized clinical trials, baseline characteristics are often imbalanced between treatment groups and hence adjustment becomes a consideration. This imbalance is even more problematic with two stages of randomization, since patients proceeding to the second stage are randomized differently based on their intermediate responses. Lokhnygina and Helterbrand [1] proposed a Cox regression method to test the equality of two treatment regimes that share the same maintenance therapy in a two-stage randomization setting. However, their method neither allows auxiliary variables to be included in the model nor accounts for treatment effects that are not constant over time, an issue in many medical studies. Therefore, comparisons among treatment regimes in the presence of non-proportional hazards are of importance to health research communities. Wei and Schaubel [8] proposed cumulative treatment effect estimation based on treatment-specific cumulative baseline hazards using a stratified proportional hazards model. Their model assumed proportional hazards with respect to baseline covariates within each treatment group and non-proportional hazards across treatment groups. In this article, we take a similar approach to estimate the cumulative treatment effect for treatment regimes from sequentially randomized clinical trials. Comparisons among treatment regimes are performed by testing the ratio of the estimated cumulative hazards. A simulation study is conducted to evaluate the performance of the estimators and proposed tests. The estimators and proposed tests are also applied to data from a neuroblastoma study to compare treatment regimes with respect to overall survival.

2 Design Setting and Statistical Model

2.1 Notation

We consider a two-stage randomization design, in which n eligible patients are first randomized to one of the J initial therapies (A₁, … , A_J) and patients achieving a clinical response are then randomized to one of the K second-stage therapies (B₁, … , B_K). By design, non-responders in the first stage are not eligible to receive any treatment in the second stage, similarly to the neuroblastoma study (Sect. 5) that motivated this work. Thus, there are a total of J × K treatment regimes based on this design, namely, A₁B₁, … , A_JB_K. The treatment regime A_jB_k , j = 1, … , J ; k = 1, … , K, is defined as “treat with A_j as initial therapy, then B_k as second-stage therapy if responds to A_j” [2]. This treatment regime consists of an initial treatment (A_j), the intermediate response status, and a second-stage treatment (B_k) in the event the patient responds to the initial treatment. Thus, estimation of the effect of treatment regime A_jB_k will not only include patients who were randomized to receive initial therapy A_j, responded, and then randomized to B_k as second-stage therapy, but also patients who were randomized to A_j and did not respond.

Let X_ji be the indicator for initial therapy A_j such that X_ji = 1 if the ith patient receives initial treatment $A_{j} (\sum_{j = 1}^{J} X_{ji} = 1)$ . The response status is denoted by R_i, with R_i = 1 for responders and R_i = 0 for non-responders. For responders (R_i = 1), let Z_ki be the indicator for second-stage therapy B_k, i.e., Z_ki = 1 if the ith patient responds and receives second-stage treatment $B_{k} (R_{i} \sum_{k = 1}^{K} Z_{ki} = R_{i})$ . Let T_i and C_i be the survival and censoring times from the time of first randomization for the ith patient, respectively. We assume that the survival time is independently right censored. Then the observed time and event indicator would be defined as U_i = min(T_i, C_i) and Δ_i = I (T_i ≤ C_i). If we define V_i as the vector of baseline covariates, the observed data for the ith individual can be described as the set of random vectors {X_ji, R_i, R_iZ_ki, U_i, Δ_i, V_i, j = 1, … , J ; k = 1, … , K}, i = 1, 2, … , n. Note that in this consideration we assume that R_i is always observed, which may not be true for patients who are censored or died prior to response evaluation. Customarily such patients are treated as non-responders and R_i is set to zero for them [2]. Whenever there is no ambiguity, we will drop the subscript i to represent a generic observation from the population.

2.2 Model

Our proposed model assumes proportional hazards within each treatment regime with respect to the baseline covariates V_i. However, the hazard functions for J × K treatment regimes are left unspecified, and could be proportional or non-proportional across regimes. In other words, a stratified proportional hazards model is used to account for the non-proportionality across treatment regimes. More specifically, let us denote the hazard and the cumulative hazard functions for treatment regime A_jB_k as λ_jk (t) and $Λ_{jk} (t) = \int_{0}^{t} λ_{jk} (s) ds$ , respectively. Based on stratified proportional hazards with treatment regimes as strata, the hazard function for treatment regime A_jB_k, j = 1, … , J and k = 1, … , K, could be written as

λ_{jk} (t) = λ_{jk 0} (t) \exp {β^{T} V}, j = 1, \dots, J and k = 1, \dots, K,

(1)

where λ_jk0(t) is the baseline hazard function for treatment regime A_jB_k, and β is a vector of coefficients corresponding to baseline covariates V. Note that in model (1), the parameter vector β is assumed to be constant across regimes, which implies that no interaction between treatment regimes and baseline covariates is assumed. For a general treatment regime, please see Sect. 2.1. Moreover, no functional form is specified for the regime-specific baseline hazard functions. For example, under a two-stage randomization design with J = K = 2, the hazard functions for treatment regimes A₁B₁, A₁B₂, A₂B₁ and A₂B₂ can be written as

λ_{11} (t) = λ_{110} (t) \exp {β^{T} V},

and similarly, λ₁₂(t) = λ₁₂₀(t) exp{β^T V}, λ₂₁(t) = λ₂₁₀(t) exp{β^T V}, and λ₂₂(t) = λ₂₂₀(t) exp{β^T V}, respectively. Thus, irrespective of the treatment regimes, the effect of baseline covariate V on the hazard can be quantified by the log hazard ratio parameter β. The forms of the baseline hazard functions λ₁₁₀(t), λ₁₂₀(t), λ₂₁₀(t), and λ₂₂₀(t) are left unspecified, and could be non-proportional. However, we assume proportionality within each regime with respect to the baseline covariates, and the effects of the baseline covariates do not vary across regimes.

3 Inference

Our objective is to draw inference about treatment regimes A_jB_k, j = 1, … , J ; k = 1, … , K. Note that the focus of the inference in model (1) is not the parameter vector β, but rather the comparison of hazards across treatment regimes. Based on the analytical framework of IPW (see [9] for details), we define the weight function for treatment regime A_jB_k, j = 1, … , J and k = 1, … , K, as

W_{jki} = X_{ji} {(1 - R_{i}) + R_{i} Z_{ki} ∕ π_{jk}} ∕ π_{j},

where π_j = P(X_ji = 1), and π_jk = P(Z_ki = 1∣X_ji = 1, R_i = 1). Thus, both responders (R_i = 1) to A_j and non-responders (R_i = 0) to A_j are weighted with the inverse of the probability of randomization when evaluating the effect of the treatment regime A_jB_k. The probabilities of being assigned to B_k, k = 1, … , K, could be different for initial treatments A_j, j = 1, … , J. For example, in some two-stage randomization studies, particular second-stage choices might be more toxic than others for patients who received a given initial treatment, and hence these patients are randomized to such choices with lower probability. Based on the counting process notation described in Fleming and Harrington [10], the event and risk indicators for the ith patient are defined as N_i (t) = Δ_iI (U_i ≤ t), and Y_i (t) = I (U_i ≥ t), respectively. We define the weighted event and risk indicators for treatment regime A_jB_k as N_jki(t) = W_jkiΔ_iI (U_i ≤ t) and Y_jki(t) = W_jkiI (U_i ≥ t).
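The weight function can be illustrated with a minimal Python sketch. The function name `regime_weight`, the four example patients, and the randomization probabilities of 0.5 are hypothetical choices made for illustration; only the formula for W_jki comes from the text.

```python
def regime_weight(x_j, r, z_k, pi_j, pi_jk):
    """W_jki = X_ji * {(1 - R_i) + R_i * Z_ki / pi_jk} / pi_j."""
    return x_j * ((1 - r) + r * z_k / pi_jk) / pi_j

# Hypothetical patients, each coded as (X_ji, R_i, Z_ki) for regime A_jB_k.
patients = [
    (1, 0, 0),  # received A_j, did not respond: consistent with A_jB_k
    (1, 1, 1),  # received A_j, responded, received B_k: consistent
    (1, 1, 0),  # received A_j, responded, received another B: not consistent
    (0, 1, 1),  # received a different initial therapy: not consistent
]
weights = [regime_weight(x, r, z, pi_j=0.5, pi_jk=0.5) for x, r, z in patients]
print(weights)  # [2.0, 4.0, 0.0, 0.0]
```

Non-responders to A_j carry weight 1/π_j, responders who received B_k carry weight 1/(π_j π_jk), and everyone else drops out of the analysis of regime A_jB_k with weight zero.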

The partial likelihood estimate of β can be obtained by solving the pseudo-score equation [1]:

U (β) = \sum_{i = 1}^{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} {V_{i} - {\bar{V}}_{jk} (t, β)} {dN}_{jki} (t) = 0,

where

{\bar{V}}_{jk} (t, β) = \frac{\sum_{p = 1}^{n} V_{p} Y_{jkp} (t) \exp {β^{T} V_{p}}}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp {β^{T} V_{p}}} .
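As a rough illustration, the pseudo-score can be evaluated and solved numerically for a single scalar covariate. The sketch below is not the authors' implementation: the bisection solver, the function names, and the toy data are assumptions made here, and the bracket [-10, 10] is assumed to contain the root. The time integral reduces to a sum over observed events because dN_jki(t) jumps by W_jki at U_i when Δ_i = 1.

```python
import math

def pseudo_score(beta, times, events, covariate, regime_weights):
    """U(beta) for a scalar covariate, summed over regimes (strata).

    regime_weights maps each regime label to its list of weights W_jki.
    """
    score = 0.0
    for w in regime_weights.values():
        for u_i, d_i, v_i, w_i in zip(times, events, covariate, w):
            if d_i == 0 or w_i == 0:
                continue  # no weighted event contribution
            num = den = 0.0
            for u_p, v_p, w_p in zip(times, covariate, w):
                if u_p >= u_i:  # patient p is at risk at time U_i
                    e = w_p * math.exp(beta * v_p)
                    num += v_p * e
                    den += e
            score += w_i * (v_i - num / den)  # V_i minus weighted mean of V
    return score

def solve_beta(times, events, covariate, regime_weights, lo=-10.0, hi=10.0):
    """Bisection on U(beta); assumes U(lo) > 0 > U(hi)."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if pseudo_score(mid, times, events, covariate, regime_weights) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical toy data: six patients, one binary covariate, two regimes.
times = [1, 2, 3, 4, 5, 6]
events = [1, 1, 1, 1, 1, 1]
covariate = [1, 0, 1, 0, 0, 1]
regime_weights = {
    "A1B1": [2, 4, 0, 2, 4, 0],
    "A1B2": [2, 0, 4, 2, 0, 4],
}
beta_hat = solve_beta(times, events, covariate, regime_weights)
print(beta_hat, pseudo_score(beta_hat, times, events, covariate, regime_weights))
```

Bisection is used only because U(β) is non-increasing in a scalar β; with a covariate vector, a Newton-type iteration would be the more usual choice.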

Note that in defining U(·), we have utilized the weighted event and risk processes. The estimated vector of coefficients is denoted by $\hat{β}$ . Then the Breslow estimator [12] of the cumulative baseline hazard for treatment regime A_jB_k can be obtained as

{\hat{Λ}}_{jk 0} (t, \hat{β}) = \sum_{i = 1}^{n} \int_{0}^{t} \frac{{dN}_{jki} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp {{\hat{β}}^{T} V_{p}}} .
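A minimal sketch of the weighted Breslow estimator for one regime follows, assuming the regime-specific weights W_jki and an estimate of β are already available. The function name and the toy data are hypothetical.

```python
import math

def weighted_breslow(t, times, events, weights, covariates, beta):
    """Weighted Breslow estimate of the cumulative baseline hazard at time t
    for one regime, given that regime's weights W_jki and a fixed beta."""
    risk_score = [math.exp(sum(b * v for b, v in zip(beta, row)))
                  for row in covariates]
    total = 0.0
    for u_i, d_i, w_i in zip(times, events, weights):
        if d_i == 1 and u_i <= t and w_i > 0:
            # Weighted risk set at U_i: sum_p Y_jkp(U_i) * exp(beta^T V_p)
            denom = sum(w_p * rs_p
                        for u_p, w_p, rs_p in zip(times, weights, risk_score)
                        if u_p >= u_i)
            total += w_i / denom
    return total

# Hypothetical toy data: with beta = 0 and unit weights the estimator
# reduces to the Nelson-Aalen estimator, 1/3 + 1/1 at t = 3.
times = [1, 2, 3]
events = [1, 0, 1]
covariates = [[0], [1], [0]]
unweighted = weighted_breslow(3, times, events, [1, 1, 1], covariates, [0.0])
weighted = weighted_breslow(3, times, events, [2, 4, 2], covariates, [0.0])
print(unweighted, weighted)
```

Calling the function once per regime with that regime's weights, and dividing the two results at a common t, gives a plug-in estimate of the ratio of cumulative baseline hazards.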

A comparison of different treatment regimes can then be carried out in terms of the ratio of the cumulative baseline hazards. The ratio of the cumulative baseline hazards for comparing treatment regimes A_jB_k and A_j’B_k’ is defined as

θ_{{jkj}^{'} k^{'}} (t) = \frac{Λ_{jk 0} (t)}{Λ_{j^{'} k^{'} 0} (t)} .

(2)

This ratio of the cumulative baseline hazards equals the ratio of the cumulative hazards given the same values for covariates, because

θ_{{jkj}^{'} k^{'}} (t) = \frac{Λ_{jk 0} (t)}{Λ_{j^{'} k^{'} 0} (t)} = \frac{Λ_{jk 0} (t) \exp {β^{T} V_{i}}}{Λ_{j^{'} k^{'} 0} (t) \exp {β^{T} V_{i}}} = \frac{Λ_{jk} (t)}{Λ_{j^{'} k^{'}} (t)} .

The ratio of the cumulative baseline hazards can be estimated by substituting the estimated cumulative baseline hazards into (2). Let us denote the corresponding estimator by ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ . For unweighted group comparisons, Wei and Schaubel [8] showed that such a ratio converges asymptotically to a Gaussian process. A similar argument outlined in Appendix A can be used to show that ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ asymptotically follows a Gaussian process with mean $θ_{{jkj}^{'} k^{'}} (t)$ and variance function $σ_{{jkj}^{'} k^{'}}^{2} (t)$ , where

\begin{matrix} σ_{{jkj}^{'} k^{'}}^{2} (t) & = E {ξ_{{jkj}^{'} k^{'} i}^{2} (t, β)}, \\ ξ_{{jkj}^{'} k^{'} i} (t, β) & = \frac{Φ_{jki} (t, β)}{Λ_{j^{'} k^{'} 0} (t)} - \frac{Λ_{jk 0} (t) Φ_{j^{'} k^{'} i} (t, β)}{Λ_{j^{'} k^{'} 0}^{2} (t)}, \\ Φ_{jki} (t, β) & = h_{jk}^{T} (t, β) Ω^{- 1} (β) Ψ_{i} (β) + \int_{0}^{t} y_{jk}^{(0)} {(s, β)}^{- 1} {dM}_{jki} (s, β), \\ h_{jk} (t, β) & = - \int_{0}^{t} {\bar{v}}_{jk} (s, β) d Λ_{jk 0} (s), \\ Ω (β) & = \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} τ_{jk} (t, β) y_{jk}^{(0)} (t, β) d Λ_{jk 0} (t), \\ Ψ_{i} (β) & = \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} {V_{i} - {\bar{v}}_{jk} (t, β)} {dM}_{jki} (t, β), \\ {dM}_{jki} (t, β) & = {dN}_{jki} (t) - Y_{jki} (t) \exp {β^{T} V_{i}} d Λ_{jk 0} (t) . \end{matrix}

(3)

Here, $y_{jk}^{(0)} (t, β), {\bar{v}}_{jk} (t, β)$ and τ_jk(t,β) are the limiting values of $Y_{jk}^{(0)} (t, β) = \frac{1}{n} \sum_{p = 1}^{n} Y_{jkp} (t) \exp {β^{T} V_{p}}, {\bar{V}}_{jk} (t, β)$ , and $\frac{\sum_{p = 1}^{n} V_{p} V_{p}^{T} Y_{jkp} (t) \exp {β^{T} V_{p}}}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp {β^{T} V_{p}}} - {\bar{V}}_{jk} (t, β) {\bar{V}}_{jk}^{T} (t, β)$ , respectively. The variance function can be estimated by ${\hat{σ}}_{{jkj}^{'} k^{'}}^{2} (t) = \frac{1}{n} \sum_{i = 1}^{n} {\hat{ξ}}_{{jkj}^{'} k^{'} i}^{2} (t, \hat{β})$ . The detailed steps for computing ${\hat{ξ}}_{{jkj}^{'} k^{'} i} (t, \hat{β})$ are described in Appendix B. One might expect that the asymptotic distribution of $\ln {{\hat{θ}}_{{jkj}^{'} k^{'}} (t)}$ would be closer to a Gaussian process than that of ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ . Based on the delta method, the variance of $\ln {{\hat{θ}}_{{jkj}^{'} k^{'}} (t)}$ can be estimated by ${\hat{σ}}_{{jkj}^{'} k^{'}}^{'} {(t)}^{2} = \frac{{\hat{σ}}_{{jkj}^{'} k^{'}}^{2} (t)}{{\hat{θ}}_{{jkj}^{'} k^{'}}^{2} (t)}$ , since the derivative of ln θ with respect to θ is 1/θ. We will use simulation to investigate the properties of ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ and $\ln {{\hat{θ}}_{{jkj}^{'} k^{'}} (t)}$ in a sequentially randomized setting in Sect. 4.
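The delta-method step can be checked numerically. Because d ln θ / dθ = 1/θ, the standard error of the log ratio is approximately the standard error of the ratio divided by the ratio itself. The sketch below uses hypothetical function names, and the back-transformed interval is a common construction assumed here rather than one described in the text. The inputs 0.51 and 0.084 are the EST and ASE reported for θ₁₂₁₁(t₀) under scenario II with n = 400 in Table 2.

```python
import math

def log_ratio_se(theta_hat, se_theta):
    """Delta method: SE of ln(theta_hat) is approximately SE(theta_hat) / theta_hat."""
    return se_theta / theta_hat

def log_scale_ci(theta_hat, se_theta, z=1.96):
    """Interval built on the log scale and transformed back to the ratio scale."""
    se_log = log_ratio_se(theta_hat, se_theta)
    centre = math.log(theta_hat)
    return math.exp(centre - z * se_log), math.exp(centre + z * se_log)

se_log = log_ratio_se(0.51, 0.084)
lower, upper = log_scale_ci(0.51, 0.084)
print(round(se_log, 3))  # 0.165
print(lower, upper)
```

The value 0.165 is close to the ASE of 0.167 reported for ln{θ₁₂₁₁(t₀)} in the same row of Table 2; exact agreement is not expected, since the tabulated values are rounded summaries over replications.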

Based on the log ratio estimator of the cumulative baseline hazards, a Wald-type test can be used for comparing different treatment regimes. For example, for comparing treatment regimes A_jB_k and A_j’B_k’ at a specific time point t₀, the null hypothesis can be described as H_jkj’k’ : ln{θ_jkj’k’ (t₀)} = 0, and the test statistic can be written as

D_{{jkj}^{'} k^{'}} = \frac{\ln {{\hat{θ}}_{{jkj}^{'} k^{'}} (t_{0})}}{{\hat{σ}}_{{jkj}^{'} k^{'}}^{'} (t_{0})} .

This test statistic is then compared to a standard normal distribution. Because the cumulative hazard can be viewed as an “accumulation” of the hazard over time, the choice of t₀ will depend on the disease-specific survival patterns as well as the treatments. If there are more than two treatment regimes involved, multiple treatment regimes can be compared by performing an overall test of difference among all regimes using the Wald Chi-square test. More specifically, in our case, let θ(t₀) = [θ_jk11(t₀), j = 1, … , J ; k = 1, … , K; jk ≠ 11]^T be the (J K – 1)-dimensional vector of cumulative hazard ratios at t₀, and

Σ (t_{0}) = (\begin{matrix} σ_{1111}^{2} (t_{0}) & σ_{1111 \cdot 1211} (t_{0}) & \dots & σ_{1111 \cdot JK 11} (t_{0}) \\ σ_{1111 \cdot 1211} (t_{0}) & σ_{1211}^{2} (t_{0}) & \dots & σ_{1211 \cdot JK 11} (t_{0}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ σ_{1111 \cdot JK 11} (t_{0}) & σ_{1211 \cdot JK 11} (t_{0}) & \dots & σ_{JK 11}^{2} (t_{0}) \end{matrix})

be the corresponding variance-covariance matrix. The diagonal elements $σ_{jk 11}^{2} (t_{0})$ are the variances of ${\hat{θ}}_{jk 11} (t_{0})$ , j = 1, …, J; k = 1, … , K. The off-diagonal elements are the covariances between ${\hat{θ}}_{jk 11} (t_{0})$ and ${\hat{θ}}_{j^{'} k^{'} 11} (t_{0})$ , j = 1, … , J ; k = 1, … , K; jk ≠ j’k’. Then for testing H₀ : Λ₁₁(t₀) = ⋯ = Λ_JK(t₀) or equivalently H₀ : ln{θ(t₀)} = 0, the test statistic can be written as

χ_{JK - 1}^{2} = {[\ln {\hat{θ} (t_{0})}]}^{T} {\hat{Σ}}^{'^{- 1}} (t_{0}) [\ln {\hat{θ} (t_{0})}],

where

{\hat{Σ}}^{'} (t_{0}) = {(\frac{\partial \ln {\hat{θ} (t_{0})}}{\partial \hat{θ} (t_{0})})}^{T} \hat{Σ} (t_{0}) (\frac{\partial \ln {\hat{θ} (t_{0})}}{\partial \hat{θ} (t_{0})}) .

This test statistic is then compared to a Chi-square distribution with (J K – 1) degrees of freedom.
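Both tests can be sketched in a few lines of Python using only the standard library. The sketch assumes that the supplied standard errors and covariance matrix already describe the sampling variability of the estimated ratios themselves; the function names, the small Gaussian-elimination solver, and the numerical inputs are hypothetical. Because the log transform acts elementwise, its Jacobian is diag(1/θ̂), so each entry (r, c) of the log-scale covariance is the original entry divided by θ̂_r θ̂_c.

```python
import math

def wald_z(theta_hat, se_log):
    """Pairwise Wald statistic: ln(theta_hat) divided by its standard error."""
    return math.log(theta_hat) / se_log

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b_i] for row, b_i in zip(a, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][k] * x[k] for k in range(r + 1, n))) / m[r][r]
    return x

def wald_chi_square(theta_hat, sigma_hat):
    """Quadratic form ln(theta)^T Sigma'^{-1} ln(theta), where Sigma' is the
    delta-method covariance of the log ratios."""
    size = len(theta_hat)
    log_theta = [math.log(t) for t in theta_hat]
    sigma_log = [[sigma_hat[r][c] / (theta_hat[r] * theta_hat[c])
                  for c in range(size)] for r in range(size)]
    x = solve(sigma_log, log_theta)
    return sum(l * x_i for l, x_i in zip(log_theta, x))

# Hypothetical ratios and covariance matrix for J = K = 2 (three ratios).
theta_hat = [1.2, 0.8, 1.5]
sigma_hat = [[0.010, 0.002, 0.001],
             [0.002, 0.040, 0.003],
             [0.001, 0.003, 0.090]]
chi2 = wald_chi_square(theta_hat, sigma_hat)
print(chi2)  # compare with a chi-square distribution on 3 degrees of freedom
```

When the covariance matrix is diagonal, the chi-square statistic reduces to the sum of the squared pairwise Wald statistics, which gives a simple sanity check.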

4 Simulation Study

For simplicity, a simulation study was carried out under a two-stage randomization design with only two treatment options for each stage (J = K = 2). Under this design, the performance of the ratio estimator, log ratio estimator and proposed tests was assessed under the following scenarios:

– Scenario I: The survival distributions for treatment regimes A₁B₁ and A₁B₂ are the same (note that the treatment regimes A₁B₁ and A₁B₂ share the same induction therapy A₁);
– Scenario II: The survival distributions for treatment regimes A₁B₁ and A₂B₁ are the same (note that the treatment regimes A₁B₁ and A₂B₁ share the same second-stage therapy B₁);
– Scenario III: The survival distributions for treatment regimes A₁B₁ and A₂B₂ are the same (note that the treatment regimes A₁B₁ and A₂B₂ have different induction and second-stage therapies);
– Scenario IV: No pattern is specified in the population.

An indicator (X_1i) for initial treatment A₁ was generated following the Bernoulli(0.5) distribution. An indicator for response (R_i) was generated from the Bernoulli distribution with 60 % response rate. An indicator (Z_1i) for second-stage treatment B₁ was drawn from the Bernoulli(0.5) distribution for patients with R_i = 1. Based on the stratified proportional hazards model (1), the death time T_jki for the ith patient in the jkth regime was calculated based on the Weibull distribution using

T_{jki} = {- \log (ϕ_{i}) ∕ [α_{jk} \exp {β^{T} V_{i}}]}^{1 ∕ γ_{jk}},

for j = 1, 2, and k = 0, 1, 2, where ϕ_i was generated from the Uniform(0,1) distribution. V_i = [V_1i, V_2i]^T and both V_1i and V_2i were generated from the Bernoulli(0.5) distribution. The parameter vector β was set to [0.5, 0.5]^T. A variety of values were used for α_jk and γ_jk to assess the properties of the ratio estimator, log ratio estimator and proposed tests under different scenarios (Table 1). Null hypotheses H₁₂₁₁ : ln{θ₁₂₁₁(t)} = 0, H₂₁₁₁ : ln{θ₂₁₁₁(t)} = 0, and H₂₂₁₁ : ln{θ₂₂₁₁(t)} = 0 were true in the population under scenarios I, II and III, respectively. For scenario IV, parameter values were chosen with no pattern. The true cumulative hazard ratios for comparing A₁B₂, A₂B₁, and A₂B₂ to A₁B₁ under different scenarios are plotted in Fig. 1. Censoring time was generated according to the Uniform(τ/2, τ) distribution, where the value of τ was chosen to be 5, resulting in censoring percentages ranging from 10 to 30 %. The sample size was varied from 400 to 1200, and 1000 replications were performed under each scenario. Time t₀ was chosen to be the 75th percentile of the observed time under each scenario. For each sample, the ratios and the log ratios of the cumulative baseline hazards for comparing A₁B₂, A₂B₁, and A₂B₂ to A₁B₁ were calculated, and the Wald-type tests were performed based on the log ratio estimates.
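The data-generating steps can be sketched as follows, using the scenario I values of α_jk and γ_jk from Table 1. This is one reading of the description rather than the authors' code: the interpretation of k = 0 as the index for non-responders, the dictionary layout, the function name, and the seed are assumptions made for illustration.

```python
import math
import random

# Scenario I parameters from Table 1, keyed by (j, k);
# k = 0 is assumed here to index non-responders.
GAMMA = {(1, 0): 1.4, (1, 1): 1.4, (1, 2): 1.4,
         (2, 0): 1.2, (2, 1): 1.2, (2, 2): 1.2}
ALPHA = {(1, 0): 0.7, (1, 1): 0.4, (1, 2): 0.4,
         (2, 0): 0.3, (2, 1): 0.2, (2, 2): 0.25}
BETA = (0.5, 0.5)
TAU = 5.0

def simulate_patient(rng):
    """Generate one patient's observed data under the two-stage design."""
    j = 1 if rng.random() < 0.5 else 2             # X_1i ~ Bernoulli(0.5)
    r = 1 if rng.random() < 0.6 else 0             # R_i ~ Bernoulli(0.6)
    k = 0 if r == 0 else (1 if rng.random() < 0.5 else 2)  # Z_1i if R_i = 1
    v = (int(rng.random() < 0.5), int(rng.random() < 0.5))
    phi = 1.0 - rng.random()                       # in (0, 1], avoids log(0)
    rate = ALPHA[(j, k)] * math.exp(BETA[0] * v[0] + BETA[1] * v[1])
    t = (-math.log(phi) / rate) ** (1.0 / GAMMA[(j, k)])  # Weibull death time
    c = rng.uniform(TAU / 2, TAU)                  # censoring time
    return {"j": j, "r": r, "k": k, "v": v,
            "u": min(t, c), "delta": int(t <= c)}

rng = random.Random(2013)  # arbitrary seed
sample = [simulate_patient(rng) for _ in range(400)]
censored = 1 - sum(p["delta"] for p in sample) / len(sample)
print(f"censoring proportion: {censored:.2f}")
```

Swapping in another row of Table 1 for `GAMMA` and `ALPHA` gives the remaining scenarios.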

Table 1.

Different values for α_jk and γ_jk, j = 1, 2, and k = 0, 1, 2, under different scenarios (for scenario details see Sect. 4)

Scenario	γ ₁₀	γ ₁₁	γ ₁₂	γ ₂₀	γ ₂₁	γ ₂₂	α ₁₀	α ₁₁	α ₁₂	α ₂₀	α ₂₁	α ₂₂	Censoring (%)
I	1.4	1.4	1.4	1.2	1.2	1.2	0.7	0.4	0.4	0.3	0.2	0.25	10
II	1.0	1.5	1.2	1.0	1.5	1.3	0.5	0.2	0.1	0.5	0.2	0.05	23
III	1.2	1.0	1.2	1.2	1.5	1.0	0.1	0.4	0.7	0.1	0.04	0.4	30
IV	1.0	1.5	1.2	1.5	1.0	1.4	0.7	0.4	0.35	0.3	0.2	0.25	10


Fig. 1 — The true cumulative hazard ratios for comparing A₁B₂, A₂B₁, and A₂B₂ to A₁B₁ under different scenarios. Solid line: A₁B₂ vs. A₁B₁; dashed line: A₂B₁ vs. A₁B₁; dotted line: A₂B₂ vs. A₁B₁

Table 2 shows the ratio and log ratio estimates (EST), absolute bias (BIAS), asymptotic standard error (ASE), empirical standard deviation (ESD) and 95 % coverage probability (CP) under each scenario. For example, under scenario I and at a sample size of 800, the estimate of θ₁₂₁₁(t₀) was 1.01 with an absolute bias of 0.01. The asymptotic standard error was 0.119, close to the empirical standard deviation of 0.121. The corresponding coverage probability was 93 % based on the estimate and asymptotic standard error. Similarly, the estimate of ln{θ₁₂₁₁(t₀)} was 0.00 with an absolute bias of 0.01. The asymptotic standard error (0.118) and empirical standard deviation (0.120) were close. The coverage probability was 94 %, better than that for ${\hat{θ}}_{1211} (t_{0})$ . In most cases, the estimates were approximately unbiased, with absolute biases ranging from 0.00 to 0.02. The asymptotic standard errors were close to the empirical standard deviations, suggesting that the estimated standard errors were consistent. Most coverage probabilities were close to the nominal level of 95 %. As sample size increased, the estimated standard error decreased and the coverage probability increased. Some of the log ratio estimates were less biased than the corresponding ratio estimates, and the overall coverage probabilities were closer to 95 % based on the log ratio estimates than those based on the ratio estimates.
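The summary measures can be computed from per-replication output along the following lines. The exact definitions are not spelled out in the text, so this sketch assumes that EST is the mean estimate across replications, BIAS is the absolute difference between that mean and the true value, ASE is the mean of the estimated standard errors, ESD is the sample standard deviation of the estimates, and CP is the proportion of Wald intervals covering the truth. The function name and the toy inputs are hypothetical.

```python
import statistics

def summarize(estimates, std_errors, truth, z=1.96):
    """Summarize replicated estimates in the style of Table 2
    (under the assumed definitions stated above)."""
    est = statistics.fmean(estimates)
    covered = [abs(e - truth) <= z * s for e, s in zip(estimates, std_errors)]
    return {
        "EST": est,
        "BIAS": abs(est - truth),
        "ASE": statistics.fmean(std_errors),
        "ESD": statistics.stdev(estimates),
        "CP": sum(covered) / len(covered),
    }

# Hypothetical toy input: four replications with a true value of 1.0.
out = summarize([0.9, 1.0, 1.1, 1.4], [0.1, 0.1, 0.1, 0.1], truth=1.0)
print(out)
```

In this toy input the first three intervals cover the true value and the fourth does not, so CP is 0.75.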

Table 2.

Simulation results under each scenario (for scenario details see Sect. 4). EST: ratio or log ratio estimate; BIAS: absolute bias; ASE: asymptotic standard error; ESD: empirical standard deviation; and CP: 95 % coverage probability

Scenario	n	θ₁₂₁₁(t₀)					ln{θ₁₂₁₁(t₀)}
		EST	BIAS	ASE	ESD	CP	EST	BIAS	ASE	ESD	CP
I	400	1.01	0.00	0.166	0.175	0.93	−0.01	0.02	0.165	0.171	0.94
	800	1.01	0.01	0.119	0.121	0.93	0.00	0.01	0.118	0.120	0.94
	1200	1.00	0.01	0.097	0.102	0.93	0.00	0.01	0.097	0.101	0.94
II	400	0.51	0.01	0.084	0.083	0.95	−0.69	0.00	0.167	0.166	0.95
	800	0.51	0.01	0.060	0.061	0.96	−0.69	0.01	0.119	0.121	0.95
	1200	0.51	0.00	0.049	0.051	0.95	−0.69	0.00	0.098	0.100	0.95
III	400	1.14	0.01	0.113	0.105	0.97	0.13	0.00	0.099	0.092	0.97
	800	1.14	0.01	0.082	0.073	0.97	0.13	0.00	0.072	0.064	0.97
	1200	1.14	0.00	0.067	0.058	0.98	0.13	0.00	0.059	0.05	0.98
IV	400	0.79	0.01	0.121	0.126	0.93	−0.24	0.02	0.153	0.157	0.95
	800	0.79	0.01	0.087	0.089	0.92	−0.24	0.02	0.109	0.113	0.93
	1200	0.79	0.01	0.071	0.072	0.94	−0.24	0.02	0.089	0.090	0.94

Scenario	n	θ₂₁₁₁(t₀)					ln{θ₂₁₁₁(t₀)}
		EST	BIAS	ASE	ESD	CP	EST	BIAS	ASE	ESD	CP

I	400	0.43	0.01	0.075	0.077	0.94	−0.86	0.01	0.174	0.179	0.94
	800	0.43	0.00	0.053	0.052	0.96	−0.86	0.00	0.123	0.122	0.94
	1200	0.43	0.00	0.043	0.044	0.95	−0.86	0.01	0.101	0.103	0.95
II	400	1.02	0.02	0.181	0.197	0.93	0.00	0.00	0.178	0.190	0.94
	800	1.00	0.01	0.128	0.131	0.95	0.00	0.00	0.127	0.129	0.95
	1200	1.01	0.01	0.105	0.108	0.94	0.00	0.00	0.105	0.107	0.95
III	400	0.51	0.02	0.098	0.103	0.94	−0.70	0.01	0.195	0.202	0.94
	800	0.50	0.01	0.070	0.072	0.95	−0.70	0.02	0.140	0.143	0.95
	1200	0.50	0.01	0.057	0.057	0.96	−0.70	0.01	0.115	0.113	0.95
IV	400	0.44	0.01	0.076	0.075	0.95	−0.82	0.02	0.170	0.168	0.95
	800	0.44	0.01	0.053	0.054	0.96	−0.83	0.01	0.121	0.123	0.94
	1200	0.44	0.01	0.044	0.045	0.95	−0.83	0.01	0.099	0.101	0.95

Scenario	n	θ₂₂₁₁(t₀)					ln{θ₂₂₁₁(t₀)}
		EST	BIAS	ASE	ESD	CP	EST	BIAS	ASE	ESD	CP

I	400	0.49	0.00	0.084	0.088	0.93	−0.72	0.01	0.171	0.182	0.93
	800	0.49	0.00	0.059	0.058	0.95	−0.72	0.01	0.122	0.119	0.96
	1200	0.49	0.00	0.049	0.049	0.93	−0.72	0.01	0.100	0.100	0.95
II	400	0.38	0.00	0.069	0.069	0.94	−0.99	0.01	0.182	0.181	0.94
	800	0.38	0.00	0.049	0.048	0.95	−0.99	0.01	0.130	0.129	0.95
	1200	0.37	0.00	0.040	0.039	0.94	−0.99	0.01	0.106	0.104	0.95
III	400	1.01	0.01	0.175	0.180	0.94	0.00	0.01	0.172	0.177	0.95
	800	1.01	0.00	0.125	0.129	0.94	0.00	0.01	0.124	0.127	0.95
	1200	1.00	0.00	0.102	0.100	0.95	0.00	0.00	0.102	0.099	0.95
IV	400	0.61	0.01	0.101	0.104	0.93	−0.51	0.00	0.165	0.172	0.95
	800	0.60	0.00	0.071	0.071	0.95	−0.51	0.01	0.118	0.117	0.95
	1200	0.60	0.00	0.058	0.058	0.94	−0.51	0.01	0.096	0.097	0.96


Table 3 presents the rejection rates for testing null hypotheses H₁₂₁₁ : ln{θ₁₂₁₁(t)} = 0, H₂₁₁₁ : ln{θ₂₁₁₁(t)} = 0, and H₂₂₁₁ : ln{θ₂₂₁₁(t)} = 0, separately under each scenario. Because H₁₂₁₁, H₂₁₁₁, and H₂₂₁₁ were true under scenarios I, II and III, respectively, the corresponding rejection rates estimate the type I error. These rates were close to the nominal level of 0.05, suggesting that the tests were approximately unbiased. The rejection rates tended to be slightly larger at the smallest sample size of 400 and approached 0.05 as the sample size increased. The remaining entries, for which the null hypothesis was false, reflect the power of the tests.

Table 3.

Rejection rates for testing null hypotheses H₁₂₁₁ : ln{θ₁₂₁₁(t)} = 0, H₂₁₁₁ : ln{θ₂₁₁₁(t)} = 0, and H₂₂₁₁ : ln{θ₂₂₁₁(t)} = 0 under different scenarios. Scenarios I, II and III respectively represent null hypotheses H₁₂₁₁, H₂₁₁₁ and H₂₂₁₁, while for scenario IV, all three null hypotheses are false

Scenario	n	H ₁₂₁₁	H ₂₁₁₁	H ₂₂₁₁
I	400	0.059	0.999	0.991
	800	0.053	1.000	1.000
	1200	0.055	1.000	1.000
II	400	0.992	0.064	1.000
	800	1.000	0.048	1.000
	1200	1.000	0.050	1.000
III	400	0.233	0.937	0.050
	800	0.444	1.000	0.050
	1200	0.603	1.000	0.046
IV	400	0.351	0.992	0.864
	800	0.587	1.000	0.994
	1200	0.753	1.000	0.999


5 Analysis of Neuroblastoma Data

From 1991 to 1996, the Children’s Cancer Group conducted a randomized clinical trial to compare a combination of myeloablative chemotherapy, total-body irradiation and transplantation of autologous bone marrow purged of cancer cells (ABMT) with standard chemotherapy in treating children with high risk neuroblastoma [13]. Since relapse is common after completion of induction therapies among children with high risk neuroblastoma, patients without progressive disease (PD) or histologically confirmed disease (HCD) after induction therapies were then randomly assigned to receive either 13-cis-retinoic acid (cis-RA) or no further therapy. Therefore, the high risk neuroblastoma clinical trial followed a two-stage randomization design. By the end of the study, a total of 379 children participated in the first randomization, with 190 children assigned to ABMT, and 189 children assigned to chemotherapy. After the completion of the first-stage therapy, 203 children had no PD or HCD, and thus participated in the second randomization. During the second randomization, 102 children were assigned to cis-RA, and 101 children were assigned to no further therapy. Matthay et al. [13] reported 55 more children who participated in the second randomization, resulting in 130 children assigned to cis-RA and 128 children assigned to no further therapy. These children were not included in our analysis because they did not participate in the first randomization. This two-stage randomization design resulted in four treatment regimes: (i) treat with ABMT, followed by cis-RA if no PD or HCD (AC); (ii) treat with ABMT, followed by no further therapy if no PD or HCD (AN); (iii) treat with chemotherapy, followed by cis-RA if no PD or HCD (CC); (iv) treat with chemotherapy, followed by no further therapy if no PD or HCD (CN).

A stratified proportional hazards model based on (1) was applied to the neuroblastoma data, using treatment regimes as strata. The only baseline characteristics found imbalanced among different treatment groups was “Evan’s stage” in Matthay et al. [13], and thus was included in the model as one of the covariates. Additionally, “age” was also included in the model as a continuous covariate. The time to second randomization did not differ significantly across the first stage treatments (p = 0.26). The resulting log ratio estimates of the cumulative baseline hazards and their corresponding 95 % confidence intervals within the time interval of [0, 2000] days are shown in Fig. 2. From the plot of the log ratio estimate of the cumulative baseline hazards comparing treatment regime AN to AC, we observed a notable difference around 450 days. However, the horizontal “zero” line was within the confidence interval most of the time, suggesting that there was no significant difference between treatment regimes AN and AC. In the plot for comparing treatment regime CC to AC the confidence band moved further away from “zero” as time increased. Thus, we would suspect that children following the treatment regime AC had better overall survival compared to those following the regime CC. Besides, the confidence band also became narrower as time increased since there was higher variability with fewer events at the beginning of the study. Similar results were observed from the plot for comparing treatment regime CN to AC. The hazard ratios (95 % confidence interval) for comparing AN, CC, and CN to AC at year 3 were estimated to be 1.10 (0.78, 1.43), 1.13 (0.74, 1.52), and 1.02 (0.67, 1.38), respectively. The hazard ratios (95 % confidence interval) for comparing AN, CC and CN to AC at year 5 were estimated to be 1.22 (0.86, 1.57), 1.38 (0.92, 1.85), and 1.43 (0.95, 1.91), respectively. 
We also used the Wald chi-square test to evaluate the overall difference among the four treatment regimes at year 5. It resulted in a p-value of 0.17, showing that there was no overall significant difference in the cumulative hazard among the four treatment regimes at 5 years. Therefore, we would suspect that children with high-risk neuroblastoma would have a similar overall 5-year survival irrespective of the induction therapy they received. Subsequent treatment with cis-RA after the completion of induction therapies did not significantly improve the overall 5-year survival either.
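The overall comparison can be sketched as a Wald quadratic form. The example below assumes that the test contrasts three log ratios (AN, CC and CN versus AC) and therefore has 3 degrees of freedom; the vector `theta` and covariance matrix `cov` are made-up numbers, not estimates from the neuroblastoma data, and the helper names are hypothetical.

```python
import math


def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def chi2_sf_3df(x):
    """Upper tail probability of a chi-square variable with 3 df."""
    return (math.erfc(math.sqrt(x / 2.0))
            + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0))


def wald_test(theta, cov):
    """Wald statistic theta' cov^{-1} theta and its 3-df p-value."""
    x = solve(cov, theta)
    stat = sum(t * xi for t, xi in zip(theta, x))
    return stat, chi2_sf_3df(stat)


# Hypothetical log ratios and covariance matrix (illustration only).
theta = [0.20, 0.30, 0.35]
cov = [[0.04, 0.01, 0.01],
       [0.01, 0.05, 0.01],
       [0.01, 0.01, 0.06]]
stat, p_value = wald_test(theta, cov)
print(stat, p_value)
```

The closed-form tail probability is specific to 3 degrees of freedom; a different number of contrasts would need a general chi-square routine.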

Fig. 2 — Estimated log ratios of the cumulative baseline hazards and the corresponding 95 % pointwise confidence intervals from the neuroblastoma study. AC: “treat with ABMT, and then followed by cis-RA if no PD or HCD,” AN: “treat with ABMT, and then followed by no further therapy if no PD or HCD,” CC: “treat with chemotherapy, and then followed by cis-RA if no PD or HCD,” CN: “treat with chemotherapy, and then followed by no further therapy if no PD or HCD”

6 Discussion

In this article, we proposed a stratified proportional hazards model to estimate the cumulative treatment effect for treatment regimes from sequentially randomized clinical trials. This approach is similar to that advocated by Wei and Schaubel [8] for a single-stage randomization. Comparisons among treatment regimes were performed by testing the log ratio of the estimated cumulative hazards. Simulation results showed that the ratio estimator was approximately unbiased, and the coverage probabilities were close to the nominal level. However, the log ratio estimator performed better, with smaller absolute biases and better coverage probabilities. Wald-type tests based on the proposed model also maintained adequate type I error rates under moderate sample sizes. In this paper we assumed that the randomization probabilities are known (by design). Estimation of these probabilities from the observed data might result in more efficient estimators.

Wald-type tests were used to assess the survival difference between treatment regimes at a specific time point. Although such point-wise comparisons are of interest in many disease-specific areas, comparisons of treatment regimes based on overall hazard curves may also be of importance, alongside the construction of simultaneous confidence bands. This issue is beyond the scope of this manuscript and is being considered for a separate publication.

Acknowledgements

We would like to thank COG Neuroblastoma Disease Committee for kind permission to use the neuroblastoma data set, especially Dr. Wendy London for her help during the application process. We thank Dr. Susan Ellenberg from University of Pennsylvania for her insightful comments. We also thank the referees of this article for their helpful comments. Dr. Wahed’s research was in part supported by a National Institute of Mental Health Grant P30 MH090333.

Appendix A. Outline of the asymptotic normality of ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$

The following are the equivalent assumptions in sequentially randomized clinical trials to those outlined in Sect. 3 of Wei and Schaubel [8].

(a) (X_ji, R_i, R_iZ_ki, U_i, Δ_i, V_i) are i.i.d. random vectors for i = 1, … , n.
(b) Elements of V_i have bounded total variation for i = 1, … , n.
(c) The cumulative hazard is finite over a pre-specified interval [0, L] such that P (U_i > L) > 0.
(d)
$y_{jk}^{(1)} (t, β) = \frac{\partial}{\partial β} y_{jk}^{(0)} (t, β) \quad \text{and} \quad y_{jk}^{(2)} (t, β) = \frac{\partial^{2}}{\partial β \, \partial β^{T}} y_{jk}^{(0)} (t, β),$
where
$y_{jk}^{(d)} (t, β) = \lim_{n \to \infty} Y_{jk}^{(d)} (t, β) \quad \text{for } d = 0, 1, 2,$
and
$Y_{jk}^{(d)} (t, β) = n^{- 1} \sum_{i = 1}^{n} Y_{jki} (t) V_{i}^{\otimes d} \exp \{β^{T} V_{i}\} \quad \text{for } d = 0, 1, 2,$
where $V_{i}^{\otimes 0} = 1$, $V_{i}^{\otimes 1} = V_{i}$, $V_{i}^{\otimes 2} = V_{i} V_{i}^{T}$,
with $y_{jk}^{(1)} (t, β)$ and $y_{jk}^{(2)} (t, β)$ bounded away from 0 for t ∈ [0, L] and β in an open set.
(e) Positive-definiteness of the matrix
$Ω (β) = \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} τ_{jk} (t, β) y_{jk}^{(0)} (t, β) \, d Λ_{jk 0} (t, β),$
where
$τ_{jk} (t, β) = y_{jk}^{(2)} (t, β) ∕ y_{jk}^{(0)} (t, β) - {\bar{v}}_{jk} {(t, β)}^{\otimes 2}.$
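The empirical quantities appearing in these conditions are simple weighted averages over the risk set. The sketch below evaluates Y^(0), Y^(1) and Y^(2) for one regime at a single time point. It assumes a scalar covariate and takes Y_jki(t) to be a regime-specific weight times the at-risk indicator 1{U_i ≥ t}; that form, the function name `y_moments`, and the toy data are assumptions for illustration only.

```python
import math


def y_moments(t, beta, times, weights, covs):
    """Return (Y0, Y1, Y2) at time t for a scalar covariate."""
    n = len(times)
    y0 = y1 = y2 = 0.0
    for u, w, v in zip(times, weights, covs):
        at_risk = w if u >= t else 0.0      # assumed form of Y_jki(t)
        r = at_risk * math.exp(beta * v)
        y0 += r
        y1 += v * r
        y2 += v * v * r
    return y0 / n, y1 / n, y2 / n


# Toy data: observed times, regime weights, scalar covariate.
times = [1.0, 2.0, 3.0, 4.0]
weights = [2.0, 2.0, 0.0, 2.0]
covs = [0.0, 1.0, 1.0, 2.0]

y0, y1, y2 = y_moments(2.0, 0.0, times, weights, covs)
v_bar = y1 / y0                 # weighted covariate mean over the risk set
tau = y2 / y0 - v_bar ** 2      # weighted covariate variance over the risk set
print(y0, y1, y2, v_bar, tau)
```

For a vector covariate the same construction applies with the outer product in place of the square.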

Consistency and asymptotic normality of ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$

Consistency and asymptotic normality of ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ can be established in a similar manner to the proofs of Theorems 1 and 2 given in the Web Appendix of Wei and Schaubel [8], as long as the following results hold:

(1) $\hat{β} \overset{a.s.}{\to} β_{0}$.
(2) $\sqrt{n} (\hat{β} - β_{0})$ is asymptotically normal.
(3) ${\hat{Λ}}_{jk} (t, \hat{β})$ is a uniformly consistent estimator of Λ_jk(t, β).

1. Consistency of $\hat{β}$

Recall that $\hat{β}$ is a solution to the equation

U_{n} (β) = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\bar{V}}_{jk} (t, β)\} \, {dN}_{jki} (t) = 0 .

First note that the processes Y_jk(t) and N_jk(t) are both càdlàg processes and hence they are Donsker. Since the classes {β ∈ B} and {V} are trivially Donsker, the functions Y_jk(t) exp{β^T V}, VY_jk(t) exp{β^T V}, and VV^T Y_jk(t) exp{β^T V} are all Donsker for t ∈ [0, L], β ∈ B. The derivative of U_n(β) with respect to β is Ω_n(β), where

\begin{aligned} Ω_{n} (β) = & \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \left[ \frac{\sum_{p = 1}^{n} V_{p} V_{p}^{T} Y_{jkp} (t) \exp (β^{T} V_{p})}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp (β^{T} V_{p})} \right. \\ & \left. - \left\{ \frac{\sum_{p = 1}^{n} V_{p} Y_{jkp} (t) \exp (β^{T} V_{p})}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp (β^{T} V_{p})} \right\}^{\otimes 2} \right] \frac{1}{n} \sum_{i = 1}^{n} {dN}_{jki} (t), \end{aligned}

where ⊗2 denotes the outer product. All functions in the above expressions are Glivenko–Cantelli and the limiting value $y_{jk}^{(0)} (t, β)$ of $\frac{1}{n} \sum_{p = 1}^{n} Y_{jkp} (t) \exp (β^{T} V_{p})$ is bounded away from zero. Therefore,

\sup_{β \in B} \left| Ω_{n} (β) - Ω (β) \right| \overset{a.s.}{\to} 0,

where Ω(β) is defined in (3). Since Ω(β) is positive semidefinite, U_n(β) is almost surely convex for large n. Therefore, $\hat{β} \overset{a.s.}{\to} β_{0}$.
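The estimating equation can be solved numerically by Newton-Raphson, using Ω_n(β) as the information term. The following is a minimal sketch under simplifying assumptions: a scalar covariate, no tied event times, counting processes that jump by the regime weight at an observed event, and at-risk processes equal to the weight times 1{U_p ≥ t}. The function names and the toy data are hypothetical and do not come from the paper.

```python
import math


def score_and_information(beta, times, events, covs, stratum_weights):
    """Return (U_n(beta), Omega_n(beta)) for a scalar covariate."""
    n = len(times)
    score = info = 0.0
    for w in stratum_weights:            # one weight vector per regime (j, k)
        for i in range(n):
            if not events[i] or w[i] == 0.0:
                continue                 # no jump of N_jki
            s0 = s1 = s2 = 0.0
            for p in range(n):
                if times[p] >= times[i]:             # subject p still at risk
                    r = w[p] * math.exp(beta * covs[p])
                    s0 += r
                    s1 += covs[p] * r
                    s2 += covs[p] ** 2 * r
            v_bar = s1 / s0
            score += w[i] * (covs[i] - v_bar)
            info += w[i] * (s2 / s0 - v_bar ** 2)
    return score / n, info / n


def fit_beta(times, events, covs, stratum_weights,
             beta=0.0, tol=1e-10, max_iter=50):
    """Newton-Raphson iterations beta <- beta + U_n / Omega_n."""
    for _ in range(max_iter):
        u, omega = score_and_information(beta, times, events, covs,
                                         stratum_weights)
        step = u / omega
        beta += step
        if abs(step) < tol:
            break
    return beta


# Toy data with two regimes; each subject is consistent with one of them.
times = [1.0, 2.0, 3.0, 4.0, 1.5, 2.5, 3.5, 4.5]
events = [1, 1, 1, 1, 1, 1, 1, 1]
covs = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0]
w_a = [2.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0]
w_b = [0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0]

beta_hat = fit_beta(times, events, covs, [w_a, w_b])
print(beta_hat)
```

Each regime keeps its own risk set, so the covariate effect is shared across regimes while the baseline hazards are left unrestricted, which is the stratification described in the text.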

2. Asymptotic normality of $\sqrt{n} (\hat{β} - β_{0})$

We write Ψ_n(β) as

Ψ_{n} (β) = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\bar{v}}_{jk} (t, β)\} \, {dM}_{jki} (t) = 0,

where

M_{jki} (t, β) = N_{jki} (t) - \int_{0}^{t} Y_{jki} (s) \exp (β^{T} V_{i}) \, d Λ_{jk 0} (s) .

Let

Ψ (β) = E \left[ \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V - {\bar{v}}_{jk} (t, β)\} \, {dM}_{jk} (t) \right],

where

{\bar{v}}_{jk} (t, β) = \frac{E [{VY}_{jk} (t) \exp (β^{T} V)]}{E [Y_{jk} (t) \exp (β^{T} V)]} .

Then the arguments on pages 56 and 57 in Kosorok (2008, Chap. 4, [11]) can be applied to show that $\sqrt{n} (\hat{β} - β_{0})$ weakly converges to a mean zero normal random vector with covariance matrix Ω⁻¹(β₀).

3. Almost sure convergence of ${\hat{Λ}}_{jk} (t, \hat{β})$

Note that

{\hat{Λ}}_{jk} (t, \hat{β}) = \int_{0}^{t} \frac{P_{n} {dN}_{jk} (s)}{P_{n} Y_{jk} (s) \exp ({\hat{β}}^{T} V)},

where P_n is the empirical measure, namely, $P_{n} f = \frac{1}{n} \sum_{i = 1}^{n} f (x_{i})$ for real-valued functions $f : X \to R$ , $X$ being the sample space. This is basically the same estimator defined in the first display on page 57 of Kosorok (2008, Chap. 4, [11]). Hence, a straightforward application of the argument therein leads to

\sup_{t \in [0, L]} ∣ {\hat{Λ}}_{jk} (t, \hat{β}) - Λ_{jk 0} (t) ∣ \overset{a . s .}{\to} 0 .

With the above results (1–3), it is now straightforward to apply the arguments in the Web Appendix A of Wei and Schaubel [8] to establish that ${\hat{θ}}_{{jkj}^{'} k^{'}} (t)$ is uniformly consistent and asymptotically normal.
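The Breslow-type estimator in result 3 and the resulting comparison between two regimes can be sketched as follows. As before, this is an illustration under assumptions rather than the authors' implementation: a scalar covariate, no ties, jumps of size equal to the regime weight, and at-risk processes equal to the weight times 1{U_p ≥ s}. The value of `beta_hat` is supplied by hand here, and all names and data are hypothetical.

```python
import math


def cumulative_hazard(t, beta, times, events, covs, w):
    """Weighted Breslow-type estimate of the cumulative baseline hazard at t."""
    total = 0.0
    for u_i, d_i, w_i in zip(times, events, w):
        if d_i and w_i > 0.0 and u_i <= t:
            # Weighted risk-set sum at the event time u_i.
            denom = sum(w_p * math.exp(beta * v_p)
                        for u_p, v_p, w_p in zip(times, covs, w)
                        if u_p >= u_i)
            total += w_i / denom
    return total


def log_ratio(t, beta, times, events, covs, w_num, w_den):
    """Log ratio of two regimes' estimated cumulative baseline hazards."""
    num = cumulative_hazard(t, beta, times, events, covs, w_num)
    den = cumulative_hazard(t, beta, times, events, covs, w_den)
    return math.log(num / den)


# Toy data with two regimes (weights are zero for inconsistent subjects).
times = [1.0, 2.0, 3.0, 4.0, 1.5, 2.5, 3.5, 4.5]
events = [1, 1, 1, 1, 1, 1, 1, 1]
covs = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0]
w_a = [2.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0]
w_b = [0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0]
beta_hat = 0.2     # placeholder value, not an estimate from real data

print(log_ratio(3.0, beta_hat, times, events, covs, w_b, w_a))
```

With unit weights and β = 0 the function reduces to the Nelson-Aalen estimator, which is a convenient check on the implementation.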

Appendix B. Estimation of ${\hat{ξ}}_{{jkj}^{'} k^{'} i} (t, \hat{β})$

The detailed steps for computing ${\hat{ξ}}_{{jkj}^{'} k^{'} i} (t, \hat{β})$ are as follows:

Step 1: Calculate ${\hat{\bar{V}}}_{jk} (s, \hat{β}) = \frac{\sum_{p = 1}^{n} V_{p} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}}$ .

Step 2: Calculate

\begin{aligned} {\hat{h}}_{jk} (t, \hat{β}) & = - \int_{0}^{t} {\hat{\bar{V}}}_{jk} (s, \hat{β}) \, d {\hat{Λ}}_{jk 0} (s, \hat{β}) \\ & = - \int_{0}^{t} {\hat{\bar{V}}}_{jk} (s, \hat{β}) \sum_{i = 1}^{n} \frac{{dN}_{jki} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \\ & = - \sum_{i = 1}^{n} \int_{0}^{t} \frac{{\hat{\bar{V}}}_{jk} (s, \hat{β}) \, {dN}_{jki} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} . \end{aligned}

Step 3: Define ${\hat{τ}}_{jk} (t, \hat{β}) = \frac{\sum_{p = 1}^{n} V_{p} V_{p}^{T} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\}}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\}} - {\hat{\bar{V}}}_{jk} (t, \hat{β}) {\hat{\bar{V}}}_{jk}^{T} (t, \hat{β})$ .

Step 4: Calculate

\begin{aligned} \hat{Ω} (\hat{β}) = & \frac{1}{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} {\hat{τ}}_{jk} (t, \hat{β}) \sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\} \, d {\hat{Λ}}_{jk 0} (t, \hat{β}) \\ = & \frac{1}{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} {\hat{τ}}_{jk} (t, \hat{β}) \sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\} \\ & \times \sum_{i = 1}^{n} \frac{{dN}_{jki} (t)}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\}} \\ = & \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} {\hat{τ}}_{jk} (t, \hat{β}) \, {dN}_{jki} (t) . \end{aligned}

Step 5: Calculate

\begin{aligned} {\hat{Ψ}}_{i} (\hat{β}) = & \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} \, d {\hat{M}}_{jki} (t, \hat{β}) \\ = & \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} \left[ {dN}_{jki} (t) - Y_{jki} (t) \exp \{{\hat{β}}^{T} V_{i}\} \, d {\hat{Λ}}_{jk 0} (t, \hat{β}) \right] \\ = & \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} \, {dN}_{jki} (t) \\ & - \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} Y_{jki} (t) \exp \{{\hat{β}}^{T} V_{i}\} \\ & \times \sum_{l = 1}^{n} \frac{{dN}_{jkl} (t)}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\}} \\ = & \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} \, {dN}_{jki} (t) \\ & - \sum_{l = 1}^{n} \sum_{j = 1}^{J} \sum_{k = 1}^{K} \int_{0}^{L} \frac{\{V_{i} - {\hat{\bar{V}}}_{jk} (t, \hat{β})\} Y_{jki} (t) \exp \{{\hat{β}}^{T} V_{i}\}}{\sum_{p = 1}^{n} Y_{jkp} (t) \exp \{{\hat{β}}^{T} V_{p}\}} \, {dN}_{jkl} (t) . \end{aligned}

Step 6: Define

\begin{aligned} {\hat{Φ}}_{jki}^{L} (t, \hat{β}) = & \int_{0}^{t} \frac{n}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \, d {\hat{M}}_{jki} (s, \hat{β}) \\ = & \int_{0}^{t} \frac{n}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \\ & \times \left[ {dN}_{jki} (s) - Y_{jki} (s) \exp \{{\hat{β}}^{T} V_{i}\} \, d {\hat{Λ}}_{jk 0} (s, \hat{β}) \right] \\ = & \int_{0}^{t} \frac{n \, {dN}_{jki} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \\ & - \int_{0}^{t} \frac{n Y_{jki} (s) \exp \{{\hat{β}}^{T} V_{i}\}}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \sum_{l = 1}^{n} \frac{{dN}_{jkl} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \\ = & \int_{0}^{t} \frac{n \, {dN}_{jki} (s)}{\sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\}} \\ & - \sum_{l = 1}^{n} \int_{0}^{t} \frac{n Y_{jki} (s) \exp \{{\hat{β}}^{T} V_{i}\} \, {dN}_{jkl} (s)}{{\left[ \sum_{p = 1}^{n} Y_{jkp} (s) \exp \{{\hat{β}}^{T} V_{p}\} \right]}^{2}} . \end{aligned}
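The last expression in Step 6 is a difference of two finite sums over observed event times, which the sketch below evaluates for every subject in one regime. It assumes a scalar covariate, no ties, jumps of N_jki equal to the regime weight at an observed event, and Y_jki(s) equal to the weight times 1{U_i ≥ s}; the function `phi_L` and the toy data are hypothetical.

```python
import math


def phi_L(t, beta, times, events, covs, w):
    """Return the Step 6 term for each subject i at time t."""
    n = len(times)

    def s0(s):
        # Weighted risk-set sum: sum_p Y_p(s) exp(beta * V_p).
        return sum(w_p * math.exp(beta * v_p)
                   for u_p, v_p, w_p in zip(times, covs, w) if u_p >= s)

    # Event times up to t with the matching Breslow increments dLambda.
    jumps = [(u, w_l / s0(u))
             for u, d, w_l in zip(times, events, w)
             if d and w_l > 0.0 and u <= t]

    out = []
    for u_i, d_i, v_i, w_i in zip(times, events, covs, w):
        has_jump = d_i and w_i > 0.0 and u_i <= t
        first = n * w_i / s0(u_i) if has_jump else 0.0
        # Subject i contributes while still at risk, i.e. for u_l <= u_i.
        second = n * sum(w_i * math.exp(beta * v_i) * d_lam / s0(u_l)
                         for u_l, d_lam in jumps if u_i >= u_l)
        out.append(first - second)
    return out


# Toy data: unit weights, third subject censored.
times = [1.0, 2.0, 3.0, 4.0]
events = [1, 1, 0, 1]
covs = [0.0, 1.0, 0.0, 1.0]
w = [1.0, 1.0, 1.0, 1.0]

values = phi_L(4.0, 0.0, times, events, covs, w)
print(values, sum(values))
```

Summing the returned values over subjects gives zero up to rounding, because the two terms cancel once the weighted risk-set sum is recovered in the numerator; this is a useful check when coding the step.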

Step 7: Calculate ${\hat{Φ}}_{jki} (t, \hat{β}) = {\hat{h}}_{jk}^{T} (t, \hat{β}) {\hat{Ω}}^{- 1} (\hat{β}) {\hat{Ψ}}_{i} (\hat{β}) + {\hat{Φ}}_{jki}^{L} (t, \hat{β})$ .

Step 8: Calculate ${\hat{ξ}}_{{jkj}^{'} k^{'} i} (t, \hat{β}) = \frac{{\hat{Φ}}_{jki} (t, \hat{β})}{{\hat{Λ}}_{j^{'} k^{'} 0} (t, \hat{β})} - \frac{{\hat{Λ}}_{jk 0} (t, \hat{β}) {\hat{Φ}}_{j^{'} k^{'} i} (t, \hat{β})}{{\hat{Λ}}_{j^{'} k^{'} 0}^{2} (t, \hat{β})}$ .
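The form of Step 8 is what a first-order (delta method) expansion of a ratio produces. The short derivation below is a sketch under the assumption that each ${\hat{Λ}}_{jk 0} (t, \hat{β}) - Λ_{jk 0} (t)$ is approximated by the average $n^{-1} \sum_{i} Φ_{jki} (t)$; the shorthand $a$, $b$ is introduced only for this illustration and the normalization is an assumption, not a statement taken from the paper.

```latex
% Shorthand (illustration only): a = \Lambda_{jk0}(t), b = \Lambda_{j'k'0}(t).
\begin{aligned}
\frac{\hat{a}}{\hat{b}} - \frac{a}{b}
  &\approx \frac{1}{b}\,(\hat{a} - a) - \frac{a}{b^{2}}\,(\hat{b} - b) \\
  &\approx \frac{1}{n} \sum_{i=1}^{n}
     \left\{ \frac{\Phi_{jki}(t)}{b} - \frac{a\,\Phi_{j'k'i}(t)}{b^{2}} \right\}
   = \frac{1}{n} \sum_{i=1}^{n} \xi_{jkj'k'i}(t), \\
\log\frac{\hat{a}}{\hat{b}} - \log\frac{a}{b}
  &\approx \frac{1}{n} \sum_{i=1}^{n}
     \left\{ \frac{\Phi_{jki}(t)}{a} - \frac{\Phi_{j'k'i}(t)}{b} \right\}
   = \frac{b}{a} \cdot \frac{1}{n} \sum_{i=1}^{n} \xi_{jkj'k'i}(t).
\end{aligned}
```

Replacing the unknown quantities by their estimates gives the expression in Step 8, and dividing by the estimated ratio gives the corresponding terms on the log scale.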

Contributor Information

Xinyu Tang, Biostatistics Program, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR 72202, USA.

Abdus S. Wahed, Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, USA.

References

1. Lokhnygina Y, Helterbrand JD. Cox regression methods for two-stage randomization designs. Biometrics. 2007;63:422–428. doi: 10.1111/j.1541-0420.2007.00707.x.
2. Lunceford JK, Davidian M, Tsiatis AA. Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials. Biometrics. 2002;58:48–57. doi: 10.1111/j.0006-341x.2002.00048.x.
3. Guo X, Tsiatis AA. A weighted risk estimator for survival distributions in two-stage randomization designs with censored survival data. Int J Biostat. 2005;1:1–15.
4. Hernan MA, Lanoy E, Costagliola D, Robins JM. Comparison of dynamic treatment regimes via inverse probability weighting. Basic Clin Pharmacol Toxicol. 2006;98:237–242. doi: 10.1111/j.1742-7843.2006.pto_329.x.
5. Feng W, Wahed AS. Supremum weighted log-rank test and sample size for comparing two-stage adaptive treatment strategies. Biometrika. 2008;95:695–707.
6. Miyahara S, Wahed AS. Weighted Kaplan–Meier estimators for two-stage treatment regimes. Stat Med. 2010;29:2581–2591. doi: 10.1002/sim.4020.
7. Goldberg Y, Kosorok MR. Q-learning and censored data. Ann Stat. 2012. doi: 10.1214/12-AOS968.
8. Wei G, Schaubel DE. Estimating cumulative treatment effects in the presence of nonproportional hazards. Biometrics. 2008;64:724–732. doi: 10.1111/j.1541-0420.2007.00947.x.
9. Wahed AS, Tsiatis AA. Optimal estimator for the survival distribution and related quantities for treatment policies in two-stage randomization designs in clinical trials. Biometrics. 2004;60:124–133. doi: 10.1111/j.0006-341X.2004.00160.x.
10. Fleming TR, Harrington DP. Counting processes and survival analysis. Wiley; New York: 1991.
11. Kosorok MR. Introduction to Empirical Processes and Semiparametric Inference. Springer; New York: 2008.
12. Breslow NE. Contribution to the discussion of the paper by D.R. Cox. J R Stat Soc, Ser B. 1972;34:187–220.
13. Matthay KK, Reynolds P, Seeger RC, Shimada H, Adkins ES, Haas-Kogan A, et al. Long-term results for children with high-risk neuroblastoma treated on a randomized trial of myeloablative therapy followed by 13-cis-retinoic acid: A Children’s Oncology Group study. J Clin Oncol. 2009;27:1007–1013. doi: 10.1200/JCO.2007.13.8925.
