Biometrika. 2012 Mar 20;99(2):393–404. doi: 10.1093/biomet/ass004

Nonparametric inference for assessing treatment efficacy in randomized clinical trials with a time-to-event outcome and all-or-none compliance

Robert M Elashoff 1, Gang Li 2, Ying Zhou 3
PMCID: PMC3635705  PMID: 23843664

Abstract

To evaluate the biological efficacy of a treatment in a randomized clinical trial, one needs to compare patients in the treatment arm who actually received treatment with the subgroup of patients in the control arm who would have received treatment had they been randomized into the treatment arm. In practice, subgroup membership in the control arm is usually unobservable. This paper develops a nonparametric inference procedure to compare subgroup probabilities with right-censored time-to-event data and unobservable subgroup membership in the control arm. We also present a procedure to estimate the onset and duration of treatment effect. The performance of our method is evaluated by simulation. An illustration is given using a randomized clinical trial for melanoma.

Keywords: Biological efficacy, Censoring, Counting process, Martingale, Noncompliance, Survival probability

1. Introduction

In randomized clinical trials, subjects often fail to comply with their assigned treatment regimen. The intent-to-treat analysis is a standard method for primary analysis of randomized trials that compares the treatment and control based on the initial randomization regardless of whether the subjects actually received their assigned treatment or not. However, in the presence of non-compliance, the intent-to-treat analysis estimates the programmatic effectiveness, not the biological efficacy of the treatment (Sommer & Zeger, 1991; Hollis & Campbell, 1999; Lachin, 2000).

Consider a randomized clinical trial with all-or-none compliance in the treatment arm, but no noncompliance in the control arm, where all-or-none compliance means that subjects either fully comply or fully do not comply with their assigned treatment regimen. To evaluate the treatment efficacy, one needs to compare the compliers in the treatment arm with the latent subgroup of patients in the control arm who would have received the treatment if they were offered the treatment. However, the membership of the latent compliance subgroup in the control arm is unobservable. For example, in the Eastern Cooperative Oncology Group trial E9288 (Kemeny et al., 2002), patients with liver metastases of colorectal cancer were randomized to receive surgical resection alone or surgical resection followed by chemotherapy. All patients received surgical resection, but only a portion of those randomized to the chemotherapy arm actually received the postoperative chemotherapy for reasons related to survival. To assess the biological efficacy of chemotherapy, one should compare patients in the chemotherapy arm who have actually received chemotherapy with those in the no-chemotherapy arm who would have received chemotherapy had they been assigned to receive postoperative chemotherapy. However, the latent chemotherapy-compliance status is not observable in the no-chemotherapy arm. Another example is Zelen’s (1979, 1990) single-consent design, in which subjects randomized to the control arm must receive the standard care, and those to the treatment arm can choose either the experimental treatment or the standard care.

This problem also arises from subgroup analysis when the subgroup status is identified by a diagnostic test that is performed in the treatment arm but not in the control arm. One example is the Multicenter Selective Lymphadenectomy Trial I (Morton et al., 2006). In this trial, newly diagnosed melanoma patients were randomized to a sentinel-node biopsy arm or a nodal observation arm. All patients underwent wide excision of the primary melanoma. In the biopsy arm, sentinel-node biopsy was performed on patients to identify the presence of sentinel-node metastases. An immediate complete lymphadenectomy was then performed for patients whose sentinel-node biopsies were positive for metastases. For all other patients in the trial, delayed complete lymphadenectomy was performed only when nodal recurrences became clinically detectable. An interesting question is whether the trial provides sufficient evidence that immediate complete lymphadenectomy improves survival relative to delayed complete lymphadenectomy in patients with sentinel-node metastases. The design of this trial does not facilitate a direct comparison since the sentinel-node status is unknown in the nodal observation arm.

There is a large literature on treatment noncompliance. For instance, Sommer & Zeger (1991) studied the problem with a dichotomous outcome and all-or-none compliance. Their approach has been extended to handle ordered categorical compliance (Goetghebeur & Molenberghs, 1996), all-or-none compliance with contamination (Cuzick et al., 1997) and clustered binary outcomes (Albert, 2002). Several methods are available for a time-to-event outcome with right-censored data. Robins & Tsiatis (1991) proposed a structural failure time model based on potential outcomes. Loeys & Goetghebeur (2003), Loeys et al. (2005) and Cuzick et al. (2007) proposed methods based on proportional hazards models. Frangakis & Rubin (1999) studied a nonparametric procedure to evaluate treatment efficacy in a randomized trial with missing outcomes and all-or-none noncompliance and discussed an extension to right-censored time-to-event data. Loeys & Goetghebeur (2003) proposed a nonparametric plug-in estimator of the survival function for compliers in the control arm, but its large-sample properties have not been studied.

The purpose of this paper is to develop a nonparametric inference method to assess treatment efficacy in randomized clinical trials with a right-censored time-to-event outcome and all-or-none compliance. We derive large sample properties of the estimated subgroup survival probabilities, which lead to an asymptotic inference procedure to compare a survival probability for compliers in the treatment arm with that for the latent compliers in the control arm. We also provide a procedure to determine the onset and duration of the treatment effect.

2. Methods

2.1. Notation and assumptions

Suppose that n2 subjects are randomized to the treatment arm and n1 subjects to the control arm. Let n = n1 + n2. In the treatment arm, one observes n2 independent and identically distributed triplets (X2i, δ2i, g2i) (i = 1, …, n2), where for the ith subject the observation time X2i = min(T2i, C2i) is the minimum of a nonnegative continuous survival time T2i and a censoring time C2i, δ2i = I(T2i ⩽ C2i) is a failure indicator, and g2i is a binary compliance indicator, 1 for compliance. In the control arm, one observes n1 independent and identically distributed pairs (X1i, δ1i) (i = 1, …, n1), where for subject i, X1i = min(T1i, C1i), δ1i = I(T1i ⩽ C1i) and the binary compliance indicator g1i is not observed. Table 1 defines the subgroup survival functions by study arms and compliance status.

Table 1.

Notation of survival functions by study arms and compliance status

                       Treatment                          Control
Compliers (gi = 1)     S2(t) = pr(T2i > t | g2i = 1)      S1(t) = pr(T1i > t | g1i = 1)
Noncompliers (gi = 0)  G2(t) = pr(T2i > t | g2i = 0)      G1(t) = pr(T1i > t | g1i = 0)
Overall                                                   H1(t) = pr(T1i > t)

Let Λ1(t), Λ2S(t) and Λ2G(t) denote the cumulative hazard functions of H1(t), S2(t) and G2(t), respectively. In addition, let D1(t) = pr(C1i > t), D2S(t) = pr(C2i > t | g2i = 1) and D2G(t) = pr(C2i > t | g2i = 0).

The following assumptions are made throughout the paper.

Assumption 1. The compliance proportion is independent of randomization, i.e., pr(g1i =1) = pr(g2i = 1) ≡ p.

Assumption 2. The survival function of noncompliers is independent of randomization, i.e., G1(t) = G2(t), for all t.

Assumption 3. In the control arm, the censoring time is independent of the survival time. In the treatment arm, the censoring time is independent of the survival time conditional on the compliance status.

2.2. The estimator

The overall survival function for the control arm is H1(t) = pr(g1i = 1)S1(t) + pr(g1i = 0)G1(t). This, together with Assumptions 1 and 2, implies that

S1(t) = {H1(t) − (1 − p)G2(t)}/p,

which leads to the plug-in estimator (Loeys & Goetghebeur, 2003)

Ŝ1(t) = {Ĥ1(t) − (1 − p̂)Ĝ2(t)}/p̂. (1)

Here Ĥ1(t) = ∏_{X1i:n1 ⩽ t} {1 − 1/(n1 − i + 1)}^{δ1i:n1} is the Kaplan & Meier (1958) estimator of H1(t), where 0 ⩽ X11:n1 ⩽ ⋯ ⩽ X1n1:n1 are the ordered observation times in the control arm and δ1i:n1 is the failure indicator for X1i:n1; Ĝ2(t) is the Kaplan–Meier estimator of G2(t) based on the noncompliers in the treatment arm; and p̂ = Σ_{i=1}^{n2} g2i / n2.

Let Ŝ2(t) be the Kaplan–Meier estimator of S2(t) based on the compliers (g2i = 1) in the treatment arm. We estimate S2(t) – S1(t), a measure of treatment efficacy, by Ŝ2(t) – Ŝ1(t).
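For concreteness, the plug-in estimator (1) can be computed directly from the observed data: a Kaplan–Meier estimate Ĥ1(t) from the control arm, a Kaplan–Meier estimate Ĝ2(t) from the treatment-arm noncompliers, and the observed compliance proportion p̂. The following Python sketch is our own illustration, not code from the paper; the function names and array conventions are assumptions.

```python
import numpy as np

def kaplan_meier(times, events, t):
    """Kaplan-Meier estimate of pr(T > t) from right-censored data."""
    order = np.argsort(times, kind="stable")
    times, events = times[order], events[order]
    n = len(times)
    surv = 1.0
    for i in range(n):
        if times[i] > t:
            break
        if events[i] == 1:
            # risk set at the i-th ordered observation has n - i subjects
            surv *= 1.0 - 1.0 / (n - i)
    return surv

def plug_in_S1(t, x1, d1, x2, d2, g2):
    """Plug-in estimator (1): S1_hat(t) = {H1_hat(t) - (1 - p_hat) G2_hat(t)} / p_hat."""
    p_hat = g2.mean()                                   # observed compliance proportion
    H1_hat = kaplan_meier(x1, d1, t)                    # overall control-arm survival
    G2_hat = kaplan_meier(x2[g2 == 0], d2[g2 == 0], t)  # noncompliers, treatment arm
    return (H1_hat - (1.0 - p_hat) * G2_hat) / p_hat
```

Ŝ2(t) is then obtained by applying `kaplan_meier` to the treatment-arm compliers, `x2[g2 == 1]` and `d2[g2 == 1]`.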

Remark 1. Frangakis & Rubin (1999, § 5) derived a different estimator of S1(t) that is consistent for S1(t) under independent censoring conditional on compliance status in both arms. Because compliance status is unobserved in the control arm, the conditional independent censoring assumption in the control arm is untestable. Furthermore, the plug-in estimator defined in (1) may not be consistent under the assumptions of Frangakis & Rubin (1999) since the conditional independent censoring assumption in the control arm renders the Kaplan–Meier estimator Ĥ1(t) generally inconsistent for H1(t). We make different assumptions in Assumption 3 that the censoring time is independent of the survival time conditional on the compliance status in the treatment arm, but unconditionally independent in the control arm. Thus, under Assumption 3, Ĥ1(t) is consistent for H1(t) and Ĝ2(t) is consistent for G2(t). Consequently, the plug-in estimator Ŝ1(t) is consistent for S1(t).

2.3. Asymptotic properties

The counting process and martingale theory are used to study the asymptotic properties of Ŝ1(t) and Ŝ2(t) – Ŝ1(t). For the control arm, define the at-risk processes

Y1i(t) = I(X1i ⩾ t), Y1(t) = Σ_{i=1}^{n1} Y1i(t),

and counting processes

N1i(t) = I(X1i ⩽ t, δ1i = 1), N1(t) = Σ_{i=1}^{n1} N1i(t).

Then M1i(t) = N1i(t) − ∫_0^t Y1i(u) dΛ1(u) (i = 1, …, n1) are orthogonal locally integrable martingales with respect to the filtration 𝒡1(t) = σ[{N1i(u), Y1i(u)}, i = 1, …, n1 : 0 ⩽ u ⩽ t], the failure and censoring histories up to time t.

Similarly, let Y2i(t) and N2i(t) denote the corresponding at-risk process and counting process for subject i in the treatment arm. For the compliance subgroup in the treatment arm, define Y2S(t) = Σ_{i=1}^{n2} g2i Y2i(t), N2S(t) = Σ_{i=1}^{n2} g2i N2i(t) and M2S(t) = Σ_{i=1}^{n2} g2i M2iS(t), where M2iS(t) = N2i(t) − ∫_0^t Y2i(u) dΛ2S(u). For the noncompliance subgroup in the treatment arm, define Y2G(t) = Σ_{i=1}^{n2} (1 − g2i)Y2i(t), N2G(t) = Σ_{i=1}^{n2} (1 − g2i)N2i(t) and M2G(t) = Σ_{i=1}^{n2} (1 − g2i)M2iG(t), where M2iG(t) = N2i(t) − ∫_0^t Y2i(u) dΛ2G(u).

Let

Λ̂1(t) = ∫_0^t dN1(u)/Y1(u), Λ̂2S(t) = ∫_0^t dN2S(u)/Y2S(u), Λ̂2G(t) = ∫_0^t dN2G(u)/Y2G(u)

be the Nelson–Aalen estimators of Λ1(t), Λ2S(t) and Λ2G(t).

The following lemma is needed to derive the joint distribution of Ĥ1(t), Ŝ2(t), Ĝ2(t) and p̂.

Lemma 1. Assume that Λ1(τ) < ∞, Λ2S(τ) < ∞, Λ2G(τ) < ∞, D1(τ) > 0, D2S(τ) > 0 and D2G(τ) > 0, where τ is usually the time when data collection ends. Assume n1/n → ρ1 and n2/n → ρ2 for 0 < ρ1, ρ2 < 1 as n → ∞. Then, for any t in (0, τ],

\[
n^{1/2}\begin{pmatrix}\hat H_1(t)-H_1(t)\\ \hat S_2(t)-S_2(t)\\ \hat G_2(t)-G_2(t)\\ \hat p-p\end{pmatrix}\to N\{0,\Sigma(t)\}
\]

in distribution as n → ∞, where the variance–covariance matrix is

\[
\Sigma(t)=\begin{pmatrix}
H_1^2(t)\rho_1^{-1}\sigma_{11}(t) & 0 & 0 & 0\\
0 & S_2^2(t)\rho_2^{-1}\sigma_{22}(t) & 0 & -S_2(t)\rho_2^{-1}\sigma_{24}(t)\\
0 & 0 & G_2^2(t)\rho_2^{-1}\sigma_{33}(t) & -G_2(t)\rho_2^{-1}\sigma_{34}(t)\\
0 & -S_2(t)\rho_2^{-1}\sigma_{24}(t) & -G_2(t)\rho_2^{-1}\sigma_{34}(t) & \rho_2^{-1}p(1-p)
\end{pmatrix},
\]

with

\[
\begin{aligned}
\sigma_{11}(t)&=E\biggl[\biggl\{\int_0^t\frac{dM_{1i}(u)}{H_1(u)D_1(u)}\biggr\}^2\biggr],\qquad
\sigma_{22}(t)=E\biggl[\biggl\{\int_0^t\frac{g_{2i}\,dM_{2i}^S(u)}{pS_2(u)D_2^S(u)}\biggr\}^2\biggr],\\
\sigma_{33}(t)&=E\biggl[\biggl\{\int_0^t\frac{(1-g_{2i})\,dM_{2i}^G(u)}{(1-p)G_2(u)D_2^G(u)}\biggr\}^2\biggr],\\
\sigma_{24}(t)&=E\biggl\{(g_{2i}-p)\int_0^t\frac{g_{2i}\,dM_{2i}^S(u)}{pS_2(u)D_2^S(u)}\biggr\},\qquad
\sigma_{34}(t)=E\biggl\{(g_{2i}-p)\int_0^t\frac{(1-g_{2i})\,dM_{2i}^G(u)}{(1-p)G_2(u)D_2^G(u)}\biggr\}.
\end{aligned}
\]

Furthermore, σ11(t), σ22(t), σ33(t), σ24(t) and σ34(t) can be consistently estimated by

\[
\begin{aligned}
\hat\sigma_{11}(t)&=n_1\sum_{i=1}^{n_1}\biggl\{\int_0^t\frac{d\hat M_{1i}(u)}{Y_1(u)}\biggr\}^2,\qquad
\hat\sigma_{22}(t)=n_2\sum_{i=1}^{n_2}\biggl\{\int_0^t\frac{g_{2i}\,d\hat M_{2i}^S(u)}{Y_2^S(u)}\biggr\}^2,\\
\hat\sigma_{33}(t)&=n_2\sum_{i=1}^{n_2}\biggl\{\int_0^t\frac{(1-g_{2i})\,d\hat M_{2i}^G(u)}{Y_2^G(u)}\biggr\}^2,\\
\hat\sigma_{24}(t)&=\sum_{i=1}^{n_2}(g_{2i}-\hat p)\int_0^t\frac{g_{2i}\,d\hat M_{2i}^S(u)}{Y_2^S(u)},\qquad
\hat\sigma_{34}(t)=\sum_{i=1}^{n_2}(g_{2i}-\hat p)\int_0^t\frac{(1-g_{2i})\,d\hat M_{2i}^G(u)}{Y_2^G(u)},
\end{aligned}
\]

where M̂1i(t) = N1i(t) − ∫_0^t Y1i(u) dΛ̂1(u), M̂2iS(t) = N2i(t) − ∫_0^t Y2i(u) dΛ̂2S(u) and M̂2iG(t) = N2i(t) − ∫_0^t Y2i(u) dΛ̂2G(u).
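To make the estimator σ̂11(t) concrete: for subject i, the integral ∫_0^t dM̂1i(u)/Y1(u) has a jump δ1i/Y1(X1i) if the subject fails by time t, minus the accumulated compensator increments dN1(u)/Y1(u)² over event times u ⩽ min(t, X1i). The following Python sketch is our own unoptimized illustration of this computation, not code from the paper.

```python
import numpy as np

def sigma11_hat(x1, d1, t):
    """Evaluate sigma11_hat(t) = n1 * sum_i { int_0^t dM_hat_1i(u) / Y_1(u) }^2
    from control-arm observation times x1 and failure indicators d1."""
    n1 = len(x1)
    Y1 = lambda u: np.sum(x1 >= u)                      # at-risk count at time u
    event_times = np.unique(x1[(d1 == 1) & (x1 <= t)])  # distinct event times up to t
    integrals = np.zeros(n1)
    for i in range(n1):
        # jump of N_1i: contributes 1 / Y_1(X_1i) if subject i fails by t
        if d1[i] == 1 and x1[i] <= t:
            integrals[i] += 1.0 / Y1(x1[i])
        # compensator: subtract dN_1(u) / Y_1(u)^2 over event times u <= min(t, X_1i)
        for u in event_times[event_times <= x1[i]]:
            dN = np.sum((x1 == u) & (d1 == 1))
            integrals[i] -= dN / Y1(u) ** 2
    return n1 * np.sum(integrals ** 2)
```

The O(n²) double loop is kept for transparency; a vectorized version would precompute the cumulative sums of dN1(u)/Y1(u)² once.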

The joint limiting distribution of Ŝ1(t) and Ŝ2(t) is stated below.

Theorem 1. Assume that the assumptions of Lemma 1 hold. At a given time-point t in (0, τ],

\[
n^{1/2}\begin{pmatrix}\hat S_1(t)-S_1(t)\\ \hat S_2(t)-S_2(t)\end{pmatrix}\to
N\biggl\{\begin{pmatrix}0\\0\end{pmatrix},\begin{pmatrix}\nu_{11}(t)&\nu_{12}(t)\\ \nu_{12}(t)&\nu_{22}(t)\end{pmatrix}\biggr\}
\]

in distribution, as n → ∞, n1/n → ρ1 and n2/n → ρ2, where

\[
\begin{aligned}
\nu_{11}(t)&=\frac{H_1^2(t)}{p^2\rho_1}\sigma_{11}(t)+\frac{(1-p)^2G_2^2(t)}{p^2\rho_2}\sigma_{33}(t)+\frac{\{H_1(t)-G_2(t)\}^2}{p^4\rho_2}\,p(1-p)\\
&\quad-\frac{2(1-p)G_2(t)\{H_1(t)-G_2(t)\}}{p^3\rho_2}\sigma_{34}(t), \qquad (2)\\
\nu_{22}(t)&=\frac{S_2^2(t)}{\rho_2}\sigma_{22}(t), \qquad (3)\\
\nu_{12}(t)&=\frac{S_2(t)\{H_1(t)-G_2(t)\}}{p^2\rho_2}\sigma_{24}(t). \qquad (4)
\end{aligned}
\]

The proofs of Lemma 1 and Theorem 1 are provided in the Appendix.

2.4. Pointwise confidence intervals for survival probabilities

Let ν̂11(t), ν̂22(t) and ν̂12(t) be the consistent estimates of ν11(t), ν22(t) and ν12(t) obtained by replacing the theoretical quantities in (2)–(4) with their consistent estimators. It follows from Theorem 1 that the 100(1 – α)% confidence intervals for S1(t) and S2(t) at a given t ∈ [0, τ] are given by

Ŝ1(t) ± z_{1−α/2} {ν̂11(t)/n}^{1/2},    Ŝ2(t) ± z_{1−α/2} {ν̂22(t)/n}^{1/2},

respectively, where z1–α/2 is the upper α/2 quantile of the standard normal distribution.

Furthermore, a 100(1 – α)% confidence interval for S2(t) – S1(t) is given by

{Ŝ2(t) − Ŝ1(t)} ± z_{1−α/2} n^{−1/2} σ̂(t),

where σ̂²(t) = ν̂11(t) + ν̂22(t) − 2ν̂12(t).
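Given the estimated variance components, the interval for the difference is a routine normal-approximation computation. A small Python sketch, assuming ν̂11(t), ν̂22(t) and ν̂12(t) have already been computed (the function name and signature are ours):

```python
from statistics import NormalDist

def diff_ci(S2_hat, S1_hat, v11, v22, v12, n, alpha=0.05):
    """100(1 - alpha)% confidence interval for S2(t) - S1(t), given the
    estimated variance components nu11_hat, nu22_hat, nu12_hat of Theorem 1
    (assumed precomputed) and the total sample size n."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)   # z_{1 - alpha/2}
    sigma2 = v11 + v22 - 2.0 * v12                # sigma_hat^2(t)
    half = z * (sigma2 / n) ** 0.5
    diff = S2_hat - S1_hat
    return diff - half, diff + half
```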

2.5. Onset and duration of treatment effect

In practice, one is often interested in the estimation of the time interval in which S2(t) exceeds S1(t). Below we provide a procedure to determine the onset and duration of the time interval with a confidence level 1 – α. The procedure is based on ideas from Berger & Boos (1999).

Step 1. Construct a one-sided, α/2-level test of H0t : S2(t) − S1(t) = 0 versus Hat : S2(t) − S1(t) > 0 for a given t ∈ [0, τ]. Define Zt = {Ŝ2(t) − Ŝ1(t)}/{n^{−1/2} σ̂(t)}. We reject the null hypothesis H0t if Zt > z_{1−α/2}.

Step 2. Choose a starting value ts ∈ [0, τ]. If Zts fails to reject H0ts, no confidence statement is made. If Zts rejects H0ts, test sequentially downward and upward from ts. Let L be the last t ⩽ ts for which H0t is rejected when testing downward, and U the last t ⩾ ts for which H0t is rejected when testing upward. Then [L, U] is the largest interval containing ts such that Zt rejects H0t for all t ∈ [L, U].

Theorem 2. Let [L, U] be defined by the above algorithm. Then,

pr{S2(t) − S1(t) > 0 for all t ∈ [L, U]} ⩾ 1 − α

as n → ∞, n1/n → ρ1 and n2/n → ρ2 for some constants 0 < ρ1, ρ2 < 1.

As remarked by Berger & Boos (1999), the starting value ts should be chosen before the study. Any starting point satisfying LtsU will lead to the same interval [L, U]. The choice of ts requires some prior information about the time frame during which S2(t) is likely to be higher than S1(t). If prior knowledge is unavailable, one can repeat the procedure at k different starting points and use level α/(2k) for each one-sided test to achieve the overall confidence level 1 – α.
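On a finite grid of time-points, Steps 1 and 2 amount to a threshold scan outward from the pre-chosen start ts. The following Python sketch is our own illustration of the algorithm; `tgrid`, `diff` = Ŝ2 − Ŝ1 and `se` = n^{−1/2} σ̂ are assumed precomputed inputs.

```python
import numpy as np
from statistics import NormalDist

def onset_duration(tgrid, diff, se, t_s, alpha=0.05):
    """Largest grid interval [L, U] containing the pre-chosen start t_s on
    which Z_t = diff/se exceeds z_{1-alpha/2} at every grid point.
    Returns None (no confidence statement) if H0 is not rejected at t_s."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    reject = np.asarray(diff) / np.asarray(se) > z
    s = int(np.searchsorted(tgrid, t_s))   # grid index of the starting point
    if not reject[s]:
        return None
    lo = s
    while lo > 0 and reject[lo - 1]:       # test sequentially downward
        lo -= 1
    hi = s
    while hi < len(tgrid) - 1 and reject[hi + 1]:   # and upward
        hi += 1
    return float(tgrid[lo]), float(tgrid[hi])
```

Repeating the scan at k prespecified starting points with level α/(2k) per test gives the Bonferroni adjustment described above.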

3. Simulation studies

We carried out some simulations to investigate the finite sample performance of our method for different combinations of sample size, censoring rate and compliance proportion.

In the first simulation, the survival times were generated from Weibull distributions S1(t) = exp(−t0.8), S2(t) = exp(−t0.75/2) and G1(t) = G2(t) = exp(−t0.7/4). The censoring times were generated from an exponential distribution with hazard rate λ chosen to give a prespecified overall censoring rate. For each simulation, 1000 Monte Carlo samples were generated. Table 2 reports the empirical mean of the plug-in estimate Ŝ1(t), the empirical standard error of Ŝ1(t), the empirical mean of estimated standard error of Ŝ1(t) and the achieved coverage probability of the 95% confidence interval for S1(t). For the moderate sample size n1 = n2 = 100, the plug-in estimator Ŝ1(t) and its standard error have very small biases and the confidence interval has very small coverage probability error in almost all cases. The only exception is that when the censoring rate is 70%, the coverage rate, 90.9%, is notably lower than the nominal level 95% in the right tail, S(t) = 0.25. Hence, one should interpret the analysis results for the right tail with caution in the presence of heavy censoring. Furthermore, as the sample sizes increase to n1 = n2 = 500, the performance of our method is satisfactory in all cases considered.
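The first simulation design can be reproduced by inverse-transform sampling from the stated Weibull survival functions. A Python sketch generating one Monte Carlo sample follows; the censoring rate λ = 0.3 below is an assumed illustrative value, whereas the paper tunes λ to hit a prespecified overall censoring rate.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_trial(n1, n2, p=0.7, lam=0.3):
    """One Monte Carlo sample under the first simulation design:
    S1(t) = exp(-t^0.8), S2(t) = exp(-t^0.75 / 2), G(t) = exp(-t^0.7 / 4),
    with exponential censoring at an assumed rate lam."""
    def rweib(a, b, size):
        # inverse transform: if S(t) = exp(-t^a / b), then T = (-b log U)^(1/a)
        return (-b * np.log(rng.uniform(size=size))) ** (1.0 / a)
    g1 = rng.uniform(size=n1) < p   # latent compliance, unobserved in practice
    g2 = rng.uniform(size=n2) < p
    t1 = np.where(g1, rweib(0.8, 1.0, n1), rweib(0.7, 4.0, n1))
    t2 = np.where(g2, rweib(0.75, 2.0, n2), rweib(0.7, 4.0, n2))
    c1 = rng.exponential(1.0 / lam, size=n1)
    c2 = rng.exponential(1.0 / lam, size=n2)
    x1, d1 = np.minimum(t1, c1), (t1 <= c1).astype(int)
    x2, d2 = np.minimum(t2, c2), (t2 <= c2).astype(int)
    return x1, d1, x2, d2, g2.astype(int)
```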

Table 2.

Performance of Ŝ1(t), estimated standard error and confidence interval

                     n1 = n2 = 100              n1 = n2 = 500
p    CR   S1(t)    Ŝ1(t)  SE    SE^   ACP     Ŝ1(t)  SE    SE^   ACP
0.7  0.3  0.50     0.50   0.08  0.08  94.2    0.50   0.04  0.04  94.8
          0.25     0.25   0.09  0.09  94.7    0.25   0.04  0.04  94.9
     0.5  0.50     0.50   0.08  0.08  94.8    0.50   0.04  0.04  95.2
          0.25     0.25   0.10  0.10  94.6    0.25   0.04  0.04  95.6
     0.7  0.50     0.50   0.10  0.09  94.5    0.50   0.04  0.04  94.9
          0.25     0.24   0.15  0.13  90.9    0.25   0.06  0.06  95.1
0.5  0.3  0.50     0.50   0.12  0.11  94.5    0.50   0.05  0.05  96.0
          0.25     0.24   0.13  0.13  94.6    0.25   0.06  0.06  95.4
     0.5  0.50     0.50   0.12  0.12  94.1    0.50   0.05  0.05  95.9
          0.25     0.24   0.15  0.14  94.6    0.25   0.06  0.06  95.3
     0.7  0.50     0.50   0.14  0.13  94.3    0.50   0.06  0.06  95.4
          0.25     0.24   0.19  0.18  94.6    0.25   0.08  0.08  94.9
0.3  0.3  0.50     0.49   0.20  0.19  95.7    0.50   0.08  0.08  95.9
          0.25     0.24   0.23  0.23  96.1    0.25   0.10  0.10  95.8
     0.5  0.50     0.49   0.21  0.20  95.6    0.50   0.08  0.09  95.4
          0.25     0.23   0.25  0.24  95.3    0.25   0.10  0.11  95.4
     0.7  0.50     0.49   0.22  0.21  95.9    0.50   0.09  0.09  96.0
          0.25     0.23   0.30  0.29  95.7    0.25   0.13  0.13  96.2

Ŝ1(t), the empirical mean of estimated survival probability; SE, the empirical standard error of Ŝ1(t); SE^, the empirical mean of estimated standard error of Ŝ1(t); ACP, the achieved coverage probability of the 95% confidence interval for S1(t); p, the compliance proportion; CR, the censoring rate.

A similar simulation was conducted under conditions that emulate the Multicenter Selective Lymphadenectomy Trial I, in which the overall censoring rate is 87%, the observed proportion of node-positive patients is 15% and the sample sizes are n1 = 500 and n2 = 764. In this simulation, the survival times were generated from Weibull distributions S1(t) = exp(−t0.87/48.3), S2(t) = exp(−t0.87/89.8) and G1(t) = G2(t) = exp(−t0.79/117.2). The results are summarized in Table 3. It is seen that similar to the previous simulation, the plug-in estimate Ŝ1(t) and its standard error estimate show small bias and the confidence interval has small coverage probability error except in the right tail.

Table 3.

Simulation designed to emulate the Multicenter Selective Lymphadenectomy Trial I

n1    n2    p     CR    S1(t)   Ŝ1(t)  SE    SE^   ACP
500   764   0.15  0.87  0.50    0.51   0.25  0.25  95.30
                        0.25    0.22   0.63  0.55  91.70

p, the compliance proportion; CR, the censoring rate; Ŝ1(t), the empirical mean of estimated survival probability; SE, the empirical standard error of Ŝ1(t); SE^, the empirical mean of estimated standard error of Ŝ1(t); ACP, the achieved coverage probability of the 95% confidence interval for S1(t).

We conducted more simulations under different scenarios that include different survival distributions and a null case where S1 = S2. The performance of Ŝ2(t) – Ŝ1(t) was also investigated. The results were similar to those for Ŝ1(t) and thus omitted. We also carried out a small sensitivity analysis of Assumption 2 and did not observe serious estimation bias and coverage error under moderate censoring when Assumption 2 is slightly violated.

4. An example

We applied our method to analyse the Multicenter Selective Lymphadenectomy Trial I described in § 1. Between January 1994 and March 2002, 769 patients were randomized to the sentinel-node biopsy arm and 500 patients to the nodal observation arm. Our analysis excludes five patients in the biopsy arm whose sentinel-node status was unavailable. Immediate complete lymphadenectomy was performed on 122 of 764 patients in the biopsy arm whose biopsies were positive for sentinel-node metastases. For other patients, delayed complete lymphadenectomy was performed upon clinically observable nodal relapse.

To apply our method, it is important that Assumption 2 in § 2.1 is reasonable for this trial. Because removing the sentinel node among the node-negative patients is not expected to change their survival experience, it is reasonable to assume that G1(t) = G2(t), where G1(t) and G2(t) represent the survival probabilities of node-negative patients in the observation arm and the biopsy arm, respectively.

We applied our method to estimate the benefit of immediate versus delayed complete lymphadenectomy on patients with sentinel-node metastases with respect to melanoma-specific survival, i.e., survival until death due to melanoma. Let S1(t) and S2(t) be the survival probabilities for patients with sentinel-node metastases in the observation arm and the biopsy arm, respectively. Table 4 reports the estimated survival probabilities Ŝ1(t) and Ŝ2(t), the estimated survival difference Ŝ2(t) – Ŝ1(t) and their estimated standard errors at some given time-points. It is seen from Table 4 that the 95% confidence interval for S2(t) – S1(t) is (0.02, 0.42) at 2.5 years, which implies that the 2.5-year melanoma-specific survival probability for patients with sentinel-node metastases in the biopsy arm is significantly higher than that of the observation arm at 0.05 significance level. Furthermore, using the procedure in § 2.5, we found that immediate complete lymphadenectomy significantly improves melanoma-specific survival relative to delayed complete lymphadenectomy for patients with sentinel-node metastases over the time interval [2.05, 2.97] years at 95% confidence level.

Table 4.

Melanoma-specific survival probabilities, and standard errors in parentheses, for patients with sentinel-node metastases

                                                         t (years)
                                               1            2            2.5          3
Delayed complete lymphadenectomy, Ŝ1(t)        0.90 (0.04)  0.75 (0.08)  0.65 (0.09)  0.64 (0.10)
Immediate complete lymphadenectomy, Ŝ2(t)      0.97 (0.02)  0.88 (0.03)  0.86 (0.03)  0.83 (0.04)
Difference, Ŝ2(t) – Ŝ1(t)                      0.06 (0.04)  0.13 (0.08)  0.22 (0.10)  0.20 (0.11)

5. Discussion

Most existing methods for assessing treatment efficacy compare either mean survival times (Robins & Tsiatis, 1991) or hazard rates (Loeys & Goetghebeur, 2003; Loeys et al., 2005; Cuzick et al., 2007) between study arms using parametric or semiparametric models. The difference in subgroup survival probabilities provides a useful alternative measure of treatment efficacy. Our method is fully nonparametric. If the proportional hazards assumption between S1(t) and S2(t) holds, our nonparametric method may not be as efficient as a proportional hazards model-based method. On the other hand, our method is more robust when the hazards of S1(t) and S2(t) are not proportional.

As illustrated in our simulations, caution is needed when drawing inference in the right tail of a survival distribution for moderate sample size, especially under heavy censoring and extremely unbalanced compliance proportions.

The focus of this paper is on comparison of subgroup survival probabilities at a fixed time-point based on the plug-in estimator Ŝ1(t). As pointed out by Loeys & Goetghebeur (2003), Ŝ1(t) is not a proper estimate of the entire survival curve because it is not monotonically non-increasing. One possible solution is to apply isotonic regression to the plug-in estimator to obtain a proper survival function. Properties of the resulting estimator, however, are difficult to study. An alternative approach is to consider nonparametric maximum likelihood estimation of S1.

Acknowledgments

The authors are grateful to the editor, an associate editor and two referees for their insightful and constructive comments. We also thank Dr Donald L. Morton for providing the melanoma data used in the example. Gang Li’s research was supported by a National Institutes of Health grant.

Appendix. Proofs

To prove the main results, we need the following lemma.

Lemma A1. Under the assumptions of Lemma 1, we have sup_{t∈[0,τ]} |Y2S(t)/n2 − y2S(t)| → 0, sup_{t∈[0,τ]} |Y2G(t)/n2 − y2G(t)| → 0 and sup_{t∈[0,τ]} |Y1(t)/n1 − y1(t)| → 0, in probability, where y2S(t) = pS2(t)D2S(t), y2G(t) = (1 − p)G2(t)D2G(t) and y1(t) = H1(t−)D1(t−).

Proof. For each i = 1, …, n2, Y2i(t) = I(X2i ⩾ t) is a left-continuous nonincreasing random function. We assume that S2(t), D2S(t) and D2G(t) are continuous functions. At a given time-point t ∈ [0, τ],

E{g2iY2i(t)}=pr(g2i=1)pr{Y2i(t)=1|g2i=1}=pS2(t)D2S(t).

By the weak law of large numbers, at each given time-point t,

\[
\biggl|\frac{1}{n_2}\sum_{i=1}^{n_2}g_{2i}Y_{2i}(t)-pS_2(t)D_2^S(t)\biggr|\to 0
\]

in probability, as n2 → ∞. By the dominated convergence theorem,

\[
E\{g_{2i}Y_{2i}(t+)\}=E\Bigl\{\lim_{k\to\infty}g_{2i}Y_{2i}(t+1/k)\Bigr\}=\lim_{k\to\infty}E\{g_{2i}Y_{2i}(t+1/k)\}=pS_2(t)D_2^S(t).
\]

Thus, as n2 → ∞, |{Σ_{i=1}^{n2} g2i Y2i(t+)}/n2 − pS2(t)D2S(t)| → 0 in probability. Using arguments similar to those in the proof of Chung (2001, Theorem 5.5.1), it can be shown that sup_{t∈[0,τ]} |Y2S(t)/n2 − y2S(t)| → 0 in probability, as n2 → ∞, where y2S(t) = pS2(t)D2S(t).

The other results can be proved along the same lines.

Proof of Lemma 1. It is well known that Λ̂1(t), Λ̂2S(t) and Λ̂2G(t) are consistent estimators of Λ1(t), Λ2S(t) and Λ2G(t), respectively (Andersen et al., 1993), and that p̂ is a consistent estimator of p. It is also clear that Λ̂1(t) is independent of Λ̂2S(t), Λ̂2G(t) and p̂. Furthermore, Λ̂2S(t) and Λ̂2G(t) are independent. We have

\[
n_1^{1/2}\{\hat\Lambda_1(t)-\Lambda_1(t)\}=n_1^{-1/2}\int_0^t\frac{dM_1(u)}{y_1(u)}+n_1^{-1/2}\int_0^t\biggl\{\frac{1}{Y_1(u)/n_1}-\frac{1}{y_1(u)}\biggr\}dM_1(u). \qquad (A1)
\]

Assuming Λ1(τ) < ∞, n1^{−1/2}M1 converges weakly to a zero-mean Gaussian process by the martingale central limit theorem (Andersen et al., 1993). It follows from Lin & Ying (2001, Lemma A.1) and our Lemma A1 that n1^{−1/2} ∫_0^t [1/{Y1(u)/n1} − 1/y1(u)] dM1(u) → 0 in probability uniformly in t ∈ [0, τ]. This, together with (A1), implies that

\[
n^{1/2}\{\hat\Lambda_1(t)-\Lambda_1(t)\}=n_1^{-1/2}\rho_1^{-1/2}\sum_{i=1}^{n_1}\int_0^t\frac{dM_{1i}(u)}{y_1(u)}+o_p(1) \qquad (A2)
\]

uniformly in t ∈ [0, τ]. Similarly, we can show that as n → ∞ and n2/nρ2,

\[
n^{1/2}\{\hat\Lambda_2^S(t)-\Lambda_2^S(t)\}=n_2^{-1/2}\rho_2^{-1/2}\sum_{i=1}^{n_2}\int_0^t\frac{g_{2i}\,dM_{2i}^S(u)}{y_2^S(u)}+o_p(1) \qquad (A3)
\]

and

\[
n^{1/2}\{\hat\Lambda_2^G(t)-\Lambda_2^G(t)\}=n_2^{-1/2}\rho_2^{-1/2}\sum_{i=1}^{n_2}\int_0^t\frac{(1-g_{2i})\,dM_{2i}^G(u)}{y_2^G(u)}+o_p(1) \qquad (A4)
\]

uniformly in t ∈ [0, τ].

Let 𝒟[0, τ]³ be the metric space consisting of {f1(t), f2(t), f3(t)}, where fk : [0, τ] → R for k = 1, 2, 3 are right-continuous functions with left limits. The metric of 𝒟[0, τ]³ is defined as d(f, g) = max_{1⩽k⩽3} sup_{t∈[0,τ]} |fk(t) − gk(t)| for f, g ∈ 𝒟[0, τ]³. It is easy to see from equations (A2)–(A4) that the stochastic process n^{1/2}[{Λ̂1(t) − Λ1(t)}, {Λ̂2S(t) − Λ2S(t)}, {Λ̂2G(t) − Λ2G(t)}, (p̂ − p)] in 𝒟[0, τ]³ × R is asymptotically equivalent to a sum of independent and identically distributed random vectors. By the multivariate central limit theorem, its finite-dimensional distributions converge to zero-mean multivariate normal distributions. Moreover, because the elements n^{1/2}{Λ̂1(t) − Λ1(t)}, n^{1/2}{Λ̂2S(t) − Λ2S(t)} and n^{1/2}{Λ̂2G(t) − Λ2G(t)} are square-integrable martingales with respect to their marginal filtrations, their tightness follows from the proof of Pollard (1984, Theorem VIII.13). Hence, n^{1/2}[{Λ̂1(t) − Λ1(t)}, {Λ̂2S(t) − Λ2S(t)}, {Λ̂2G(t) − Λ2G(t)}, (p̂ − p)] converges weakly to a zero-mean Gaussian process {𝒲1(t), 𝒲2(t), 𝒲3(t), 𝒲4} in 𝒟[0, τ]³ × R with the variance–covariance functions between 𝒲1(t1), 𝒲2(t2), 𝒲3(t3) and 𝒲4 given by

\[
\begin{pmatrix}
\rho_1^{-1}\sigma_{11}(t_1) & 0 & 0 & 0\\
0 & \rho_2^{-1}\sigma_{22}(t_2) & 0 & \rho_2^{-1}\sigma_{24}(t_2)\\
0 & 0 & \rho_2^{-1}\sigma_{33}(t_3) & \rho_2^{-1}\sigma_{34}(t_3)\\
0 & \rho_2^{-1}\sigma_{24}(t_2) & \rho_2^{-1}\sigma_{34}(t_3) & \rho_2^{-1}p(1-p)
\end{pmatrix}.
\]

Recall that Ĥ1(t) = ∏_{u⩽t}{1 − dΛ̂1(u)}, Ŝ2(t) = ∏_{u⩽t}{1 − dΛ̂2S(u)} and Ĝ2(t) = ∏_{u⩽t}{1 − dΛ̂2G(u)}. It follows from the functional delta method (Andersen et al., 1993) that the joint stochastic process n^{1/2}[{Ĥ1(t) − H1(t)}, {Ŝ2(t) − S2(t)}, {Ĝ2(t) − G2(t)}, (p̂ − p)] converges weakly to a zero-mean Gaussian process {𝒲1*(t), 𝒲2*(t), 𝒲3*(t), 𝒲4*} with the variance–covariance function between 𝒲1*(t1), 𝒲2*(t2), 𝒲3*(t3) and 𝒲4* given by

\[
\begin{pmatrix}
H_1^2(t_1)\rho_1^{-1}\sigma_{11}(t_1) & 0 & 0 & 0\\
0 & S_2^2(t_2)\rho_2^{-1}\sigma_{22}(t_2) & 0 & -S_2(t_2)\rho_2^{-1}\sigma_{24}(t_2)\\
0 & 0 & G_2^2(t_3)\rho_2^{-1}\sigma_{33}(t_3) & -G_2(t_3)\rho_2^{-1}\sigma_{34}(t_3)\\
0 & -S_2(t_2)\rho_2^{-1}\sigma_{24}(t_2) & -G_2(t_3)\rho_2^{-1}\sigma_{34}(t_3) & \rho_2^{-1}p(1-p)
\end{pmatrix}.
\]

Therefore, for any t ∈ [0, τ], n^{1/2}[{Ĥ1(t) − H1(t)}, {Ŝ2(t) − S2(t)}, {Ĝ2(t) − G2(t)}, (p̂ − p)] converges to a multivariate normal distribution with mean zero and variance–covariance matrix Σ(t).

To prove the consistency of σ̂11(t), we have

\[
\begin{aligned}
\hat\sigma_{11}(t)&=\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl[\int_0^t\biggl\{\frac{1}{Y_1(u)/n_1}-\frac{1}{y_1(u)}\biggr\}d\hat M_{1i}(u)\biggr]^2\\
&\quad+\frac{2}{n_1}\sum_{i=1}^{n_1}\biggl[\int_0^t\biggl\{\frac{1}{Y_1(u)/n_1}-\frac{1}{y_1(u)}\biggr\}d\hat M_{1i}(u)\int_0^t\frac{d\hat M_{1i}(u)}{y_1(u)}\biggr]
+\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl\{\int_0^t\frac{d\hat M_{1i}(u)}{y_1(u)}\biggr\}^2.
\end{aligned}
\]

By the uniform consistency of Λ̂1(t) and Y1(t)/n1 and the fact that |Y1i (t)| ⩽ 1, we have

\[
\begin{aligned}
&\sup_{t\in[0,\tau]}\biggl|\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl[\int_0^t\biggl\{\frac{1}{Y_1(u)/n_1}-\frac{1}{y_1(u)}\biggr\}d\hat M_{1i}(u)\biggr]^2\biggr|\to 0,\\
&\sup_{t\in[0,\tau]}\biggl|\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl[\int_0^t\biggl\{\frac{1}{Y_1(u)/n_1}-\frac{1}{y_1(u)}\biggr\}d\hat M_{1i}(u)\int_0^t\frac{d\hat M_{1i}(u)}{y_1(u)}\biggr]\biggr|\to 0,\\
&\sup_{t\in[0,\tau]}\biggl|\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl\{\int_0^t\frac{d\hat M_{1i}(u)}{y_1(u)}\biggr\}^2-\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl\{\int_0^t\frac{dM_{1i}(u)}{y_1(u)}\biggr\}^2\biggr|\to 0,
\end{aligned}
\]

in probability. Thus, σ̂11(t) = n1^{−1} Σ_{i=1}^{n1} {∫_0^t dM1i(u)/y1(u)}² + op(1) uniformly in t ∈ [0, τ]. Furthermore, it follows from the uniform law of large numbers (Newey & McFadden, 1994, Lemma 2.4) that

\[
\sup_{t\in[0,\tau]}\biggl|\frac{1}{n_1}\sum_{i=1}^{n_1}\biggl\{\int_0^t\frac{dM_{1i}(u)}{y_1(u)}\biggr\}^2-E\biggl[\biggl\{\int_0^t\frac{dM_{1i}(u)}{y_1(u)}\biggr\}^2\biggr]\biggr|\to 0
\]

in probability. Therefore, σ̂11(t) → E[{∫_0^t dM1i(u)/y1(u)}²] = σ11(t) in probability uniformly in t ∈ [0, τ].

The uniform consistency of σ̂22(t), σ̂33(t), σ̂24(t) and σ̂34(t) can be proved similarly. The consistency of p̂(1 − p̂) follows directly from the weak law of large numbers.

Proof of Theorem 1. It follows from Lemma 1 and the delta method that at a given time-point t ∈ [0, τ ], n1/2[{Ŝ1(t) – S1(t)}, {Ŝ2(t) – S2(t)}] is asymptotically normally distributed with mean zero and variance-covariance matrix f Σ(t) f′, where

\[
f=\begin{pmatrix}1/p & 0 & -(1-p)/p & -\{H_1(t)-G_2(t)\}/p^2\\ 0 & 1 & 0 & 0\end{pmatrix}.
\]

Applying Slutsky’s theorem, we have ν̂11(t) → ν11(t), ν̂22(t) → ν22(t) and ν̂12(t) → ν12(t) in probability as n → ∞, n1/nρ1 and n2/nρ2.

Proof of Theorem 2. The theorem can be proved using essentially the same arguments given in the appendix of Berger & Boos (1999). Thus, we omit the details.

References

  1. Albert JM. Estimating efficacy in clinical trials with clustered binary responses. Statist Med. 2002;21:649–61. doi: 10.1002/sim.1059. [DOI] [PubMed] [Google Scholar]
  2. Andersen PK, Borgan Ø, Gill RD, Keiding N. Statistical Models Based on Counting Processes. New York: Springer; 1993. [Google Scholar]
  3. Berger RL, Boos DD. Confidence limits for the onset and duration of treatment effect. Biomet J. 1999;41:517–31. [Google Scholar]
  4. Chung KL. A Course in Probability Theory. 3rd ed. New York: Academic Press; 2001. [Google Scholar]
  5. Cuzick J, Edwards R, Segnan N. Adjusting for non-compliance and contamination in randomized clinical trials. Statist Med. 1997;16:1017–29. doi: 10.1002/(sici)1097-0258(19970515)16:9<1017::aid-sim508>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
  6. Cuzick J, Sasieni P, Myles J, Tyrer J. Estimating the effect of treatment in a proportional hazards model in the presence of non-compliance and contamination. J. R. Statist. Soc. B. 2007;69:565–88. [Google Scholar]
  7. Frangakis CE, Rubin DB. Addressing complications of intention-to-treat analysis in the combined presence of all-or-none treatment-non-compliance and subsequent missing outcomes. Biometrika. 1999;86:365–79. [Google Scholar]
  8. Goetghebeur E, Molenberghs G. Causal inference in a placebo-controlled clinical trial with binary outcome and ordered compliance. J Am Statist Assoc. 1996;91:928–34. [Google Scholar]
  9. Hollis S, Campbell F. What is meant by intention to treat analysis? Survey of published randomised controlled trials. Br Med J. 1999;319:670–4. doi: 10.1136/bmj.319.7211.670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. J Am Statist Assoc. 1958;53:457–81. [Google Scholar]
  11. Kemeny MM, Adak S, Gray B, Macdonald JS, Smith T, Lipsitz S, Sigurdson ER, O’Dwyer PJ, Benson AB. Combined-modality treatment for resectable metastatic colorectal carcinoma to the liver: surgical resection of hepatic metastases in combination with continuous infusion of chemotherapy—an intergroup study. J Clin Oncol. 2002;20:1499–505. doi: 10.1200/JCO.2002.20.6.1499. [DOI] [PubMed] [Google Scholar]
  12. Lachin JM. Statistical considerations in the intent-to-treat principle. Contr. Clin. Trials. 2000;21:167–89. doi: 10.1016/s0197-2456(00)00046-5. [DOI] [PubMed] [Google Scholar]
  13. Lin DY, Ying Z. Semiparametric and nonparametric regression analysis of longitudinal data. J Am Statist Assoc. 2001;96:103–13. [Google Scholar]
  14. Loeys T, Goetghebeur E. A causal proportional hazards estimator for the effect of treatment actually received in a randomized trial with all-or-nothing compliance. Biometrics. 2003;59:100–5. doi: 10.1111/1541-0420.00012. [DOI] [PubMed] [Google Scholar]
  15. Loeys T, Goetghebeur E, Vandebosch A. Causal proportional hazards models and time-constant exposure in randomized clinical trials. Lifetime Data Anal. 2005;11:435–49. doi: 10.1007/s10985-005-5233-z. [DOI] [PubMed] [Google Scholar]
  16. Morton DL, Thompson JF, Cochran AJ, Mozzillo N, Elashoff R, Essner R, Nieweg OE, Roses DF, Hoekstra HJ, Karakousis CP, et al. Sentinel-node biopsy or nodal observation in melanoma. New Engl J Med. 2006;355:1307–17. doi: 10.1056/NEJMoa060992. [DOI] [PubMed] [Google Scholar]
  17. Newey WK, McFadden D. Handbook of Econometrics. Vol. 4. Amsterdam: Elsevier Science; 1994. Large sample estimation and hypothesis testing; pp. 2111–245. Ch. 36. [Google Scholar]
  18. Pollard D. Weak Convergence of Stochastic Processes. New York: Springer; 1984. [Google Scholar]
  19. Robins JM, Tsiatis AA. Correcting for non-compliance in randomized trials using rank preserving structural failure time models. Commun. Statist. A. 1991;20:2609–31. [Google Scholar]
  20. Sommer A, Zeger SL. On estimating efficacy from clinical trials. Statist Med. 1991;10:45–53. doi: 10.1002/sim.4780100110. [DOI] [PubMed] [Google Scholar]
  21. Zelen M. A new design for randomized clinical trials. New Engl J Med. 1979;300:1242–5. doi: 10.1056/NEJM197905313002203. [DOI] [PubMed] [Google Scholar]
  22. Zelen M. Randomized consent designs for clinical trials: an update. Statist Med. 1990;9:645–56. doi: 10.1002/sim.4780090611. [DOI] [PubMed] [Google Scholar]
