PROPORTIONAL HAZARDS MODELS WITH CONTINUOUS MARKS

Yanqing Sun; Peter B Gilbert; Ian W McKeague

doi:10.1214/07-AOS554

. Author manuscript; available in PMC: 2009 Oct 15.

Published in final edited form as: Ann Stat. 2009 Feb 1;37(1):394–426. doi: 10.1214/07-AOS554

PROPORTIONAL HAZARDS MODELS WITH CONTINUOUS MARKS

Yanqing Sun ¹, Peter B Gilbert ¹, Ian W McKeague ¹

PMCID: PMC2762218 NIHMSID: NIHMS93879 PMID: 19838313

Abstract

For time-to-event data with finitely many competing risks, the proportional hazards model has been a popular tool for relating the cause-specific outcomes to covariates [Prentice et al. Biometrics 34 (1978) 541–554]. This article studies an extension of this approach to allow a continuum of competing risks, in which the cause of failure is replaced by a continuous mark only observed at the failure time. We develop inference for the proportional hazards model in which the regression parameters depend nonparametrically on the mark and the baseline hazard depends nonparametrically on both time and mark. This work is motivated by the need to assess HIV vaccine efficacy, while taking into account the genetic divergence of infecting HIV viruses in trial participants from the HIV strain that is contained in the vaccine, and adjusting for covariate effects. Mark-specific vaccine efficacy is expressed in terms of one of the regression functions in the mark-specific proportional hazards model. The new approach is evaluated in simulations and applied to the first HIV vaccine efficacy trial.

Key words and phrases: Competing risks, distribution-free confidence bands and tests, failure time data, genetic data, HIV vaccine trial, pointwise and simultaneous confidence bands, semiparametric model, survival analysis

1. Introduction

It has been 30 years since Prentice et al. [15] introduced a Cox regression framework for the analysis of failure time data in the presence of finitely many competing risks. Yet many important applications of competing risks methodology involve continuous causes-of-failure (marks). In HIV vaccine trials, for example, genetic divergence of infecting HIV viruses from the HIV strain represented in the vaccine needs to be taken into account to properly assess vaccine efficacy, but the mark variable is essentially continuous because of the large number of mutations involved. Other examples of continuous mark variables include lifetime medical cost or a quality of life score associated with survival time [14]. The grouping of continuous mark data into discrete marks is unsatisfactory because that amounts to a coarsening of the data and the results will depend on the way the groups are defined. To address this problem, we develop inference for a proportional hazards model in which both the regression parameters and the baseline hazard function depend nonparametrically on a continuous mark.

The paper is motivated by the need for new methods to analyze data from HIV vaccine efficacy trials. Approximately 15,000 new HIV infections occur each day [21], making development of a protective HIV vaccine a top priority for biomedical science. In efficacy trials thousands of HIV-negative volunteers are randomized to receive vaccine or placebo, and are monitored for HIV infection. Four efficacy trials are ongoing (http://www.iavi.org). A primary objective of each trial is to assess vaccine efficacy (VE) to prevent infection, where typically VE is defined as one minus the hazard ratio (vaccine/placebo) of HIV infection. One of the greatest barriers to achieving an efficacious vaccine is the extreme genetic heterogeneity of HIV [12, 7]. Although it may be possible to develop a vaccine that protects against HIV strains genetically similar to the HIV virus or viruses represented in the vaccine, it may be quite difficult to develop one to protect against HIV strains dissimilar from the vaccine material. This phenomenon is well known for flu vaccines—moderate genetic mismatch between an exposing flu virus and the virus represented in the vaccine causes vaccine failure, which has necessitated development of a new vaccine each year that is closely matched to the contemporary circulating flu strains. The genetic divergence (or distance) between two aligned HIV sequences can be measured as the weighted percent mismatch of amino acids, and since this distance may be unique for all infected subjects, it is natural to consider it as a continuous mark variable. The formidable problem of HIV genetic diversity implies that an important objective of an efficacy trial is assessment of if and how VE depends on the genetic divergence.

This problem can be addressed in terms of the conditional mark-specific hazard function, defined as

\begin{matrix} λ (t, υ | z) = lim_{h_{1,} h_{2} \to 0} & P {T \in [t, t + h_{1}), \\ V \in [υ, υ + h_{2}) | T \geq t, Z (t) = z} / h_{1} h_{2}, \end{matrix}

(1)

where T is the failure (infection diagnosis) time, V is a continuous mark variable and Z(t) is a (possibly time-dependent) p-dimensional covariate. Huang and Louis [8] developed the nonparametric maximum likelihood estimator of the joint distribution of T and V in terms of the unconditional mark-specific hazard function. Gilbert, McKeague and Sun [6] defined mark-specific vaccine efficacy as VE(t, υ) = 1 − λ(t, υ|1)/λ(t, υ|0), with z being the indicator of membership in the vaccine group; they developed several nonparametric and semiparametric tests concerning VE(t, υ).

In this article, we develop the mark-specific proportional hazards (PH) model

λ (t, υ | z (t)) = λ_{0} (t, υ) exp {β {(υ)}^{T} z (t)},

(2)

where the baseline hazard function λ₀(·, υ) and the p-dimensional regression parameter β(υ) are unknown continuous functions of υ. As far as we know, this model has never been studied in the literature, even though it is closely related to the discrete cause-of-failure models discussed by Prentice et al. [15]. The approach in the continuous case departs from the discrete case in that it is necessary to “borrow strength” from data in a neighborhood of υ, with the data closest to υ contributing the most.

For the HIV vaccine trial application, we partition the covariate as z(t) = (z₁, z₂(t))^T, where z₁ is the treatment (vaccine) group indicator and z₂(t) is a vector of possibly time-dependent covariates. Then the vaccine efficacy defined above takes the simpler form VE(υ) = 1 − exp(β₁(υ)), without any dependence on t. By assuming proportional hazards, model (2) can provide more powerful tests of mark-specific vaccine efficacy than the nonparametric procedures of Gilbert, McKeague and Sun [6], and the model allows adjustment for covariate effects. Furthermore, ignoring the mark variable and studying vaccine efficacy using the standard Cox model, as is widely practiced in vaccine trials for many infectious diseases, can give misleading results. In fact, even in the case of model (2) with z as the treatment indicator, the ordinary (marginal) Cox model will be misspecified unless the baseline λ₀(t, υ) factors into separate functions of t and υ.

Indeed, consider the model λ(t, υ|z = 0) = γ₀/2 + γ₁tυ and λ(t, υ|z = 1) = γ₀υ + γ₁tυ², for t ≥ 0, 0 ≤ υ ≤ 1, z ∈ {0, 1}. The corresponding marginal hazard functions are λ(t|z = 0) = γ₀/2 + γ₁t/2 and λ(t|z = 1) = γ₀/2 + γ₁t/3, for t ≥ 0. It is clear that λ(t|z) is not a proportional hazards model unless γ₀ or γ₁ is zero. If γ₁ = 0, the resulting marginal hazards become proportional for z = 0 and z = 1. However, in this example, the marginal vaccine efficacy VE = 1 − λ(t|z = 1)/λ(t|z = 0) = 0 while the mark-specific vaccine efficacy is VE(υ) = 1 − 2υ. The ordinary Cox model averages the mark-specific vaccine efficacy over its range, and important vaccine effects may be missed. This issue will be further illustrated in our simulation study. In general, use of the ordinary Cox model for studying hazard ratios can be misleading if an important mark variable is ignored. The mark-specific PH model offers a way to correct for that deficiency.

We also consider a cumulative vaccine efficacy estimand defined as $CV (υ) = \int_{a}^{υ} VE (u) d u$ where a > 0. We develop distribution-free uniform confidence bands for CV(υ), which are useful for inferential purposes. In addition we derive test statistics for evaluating mark-specific vaccine efficacy based on the estimator of CV(υ).

The paper is organized as follows. Section 2 develops a local partial likelihood procedure for estimating β(υ), leading to the construction of pointwise confidence intervals and formal tests for various hypotheses of interest concerning vaccine efficacy. A simulation study evaluating the performance of the proposed tests and the pointwise and simultaneous confidence intervals for VE(υ) and CV(υ) is presented in Section 3. The proposed methods are applied to analyze the data from the first HIV vaccine efficacy trial in Section 4. We discuss some general aspects of mark-specific PH models in Section 5. Proofs of the main results are placed in the Appendix.

2. Mark-specific proportional hazards model

2.1. Local partial likelihood

We begin by stating some assumptions and notations that are used throughout the paper. The mark variable V is assumed to have a known and bounded support; rescaling V if necessary, this support is taken without loss of generality to be [0, 1]. The observations (X_i, δ_i, δ_i V_i, Z_i), i = 1,…,n, are assumed to be i.i.d. replicates of (X, δ, δV, Z), where X is the right-censored failure time corresponding to T, which satisfies the model (2), and δ is the indicator of non-censorship. The mark is assumed to be observed whenever the corresponding failure time is uncensored; when δ_i = 0, V_i is undefined and is not meaningful. The censoring time is assumed to be conditionally independent of (T, V) given Z.

We consider a localized version of the log partial likelihood function for β = β(υ) at a fixed υ:

\begin{matrix} l (υ, β) = \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [β^{T} Z_{i} (t) - log (\sum_{j = 1}^{n} Y_{j} (t) e^{β^{T} Z_{j} (t)})] \\ \times N_{i} (dt, du), \end{matrix}

(3)

where K_h(x) = K(x/h)/h, K(·) is a kernel function with support [−1, 1], τ is the end of the follow-up period and h = h_n is a bandwidth. Here Y_i(t) = I(X_i ≥ t) and N_i(t, υ) = I(X_i ≤ t, δ_i = 1, V_i ≤ υ) is the marked point counting process with a jump at an uncensored failure times X_i and the associated mark V_i. For background on marked point processes see Brémaud [2] and Martinussen and Scheike [11].

The log partial likelihood function (3) resembles that of Kalbfleisch and Prentice [9] in the case of discrete marks, except that it borrows strength from observations having marks in the neighborhood of υ. The kernel function is designed to give greater weight to observations with marks near υ than those further away. The local maximum partial likelihood estimator of β(υ) is a maximizer β̂(υ) of (3). A similar approach has been studied by Cai and Sun [3] for estimating time-dependent coefficients in Cox regression models.

Denote μ_j = ∫ u^j K (u) du, ν_j = ∫ u^j K²(u) du for j = 0, 1, 2. For β ∈ ℝ^p, t ≥ 0, let

S^{(j)} (t, β) = n^{- 1} \sum_{i = 1}^{n} Y_{i} (t) exp {β^{T} Z_{i} (t) {} Z}_{i} {(t)}^{\otimes j},

where for any z ∈ ℝ^p, we denote z^⊗0 = 1, z^⊗1 = z and z^⊗2 = zz^T. Define s^(j)(t, β) = ES^(j)(t, β) and

\begin{matrix} J_{n} (t, β) = \frac{S^{(2)} (t, β)}{S^{(0)} (t, β)} - {(\frac{S^{(1)} (t, β)}{S^{(0)} (t, β)})}^{\otimes 2}, \\ J (t, β) = \frac{s^{(2)} (t, β)}{s^{(0)} (t, β)} - {(\frac{s^{(1)} (t, β)}{s^{(0)} (t, β)})}^{\otimes 2} . \end{matrix}

Taking the derivative of l(υ, β) with respect to β gives the score function

\begin{matrix} U (υ, β) & = l_{β}^{'} (υ, β) \\ = \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{S^{(1)} (t, β)}{S^{(0)} (t, β)}] N_{i} (dt, du) . \end{matrix}

(4)

The maximum partial likelihood estimator is a solution to U(υ, β̂(υ)) = 0, and can be computed using a Newton–Raphson algorithm. The second derivative of l(υ, β) with respect to β yields

l_{β}^{″} (υ, β) = - \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) J_{n} (t, β) N_{i} (dt, du) .

Although inference on β is usually of primary interest, the baseline function λ₀(t, υ) can also be estimated, by smoothing the increments of the following estimator of the doubly cumulative baseline function $Λ_{0} (t, υ) = \int_{0}^{t} \int_{0}^{υ} λ_{0} (s, u) ds du$ :

{\hat{Λ}}_{0} (t, υ) = \int_{0}^{t} \int_{0}^{υ} \frac{N (ds, du)}{n S^{(0)} (s, \hat{β} (u))} .

(5)

2.2. Asymptotic results

We make use of the following regularity conditions; not all of these conditions are required for the proof of each theorem, nor are they the minimum required set of conditions.

CONDITION A

(A.1)
β(υ) has componentwise continuous second derivatives on [0, 1]. The second partial derivative of λ₀(t, υ) with respect to υ exists and is continuous on [0, τ] × [0, 1]. The covariate process Z(t) has paths that are left-continuous and of bounded variation, and satisfies the moment condition E[‖Z(t)‖⁴ exp(2M‖Z(t)‖)] < ∞, where M is a constant such that (υ, β(υ)) ∈ [0, 1] × (−M, M)^p for all υ and ‖A‖ = max_k,l |a_kl| for a matrix A = (a_kl).
(A.2)
For j = 0, 1, 2, each component of s^(j)(t, θ) is continuous on [0, τ] × [−M, M]^p, and sup_{t∈[0,τ],θ∈[−M, M]^p} ‖S^(j)(t, θ) − s^(j)(t, θ)‖ = O_p(n^−1/2).
(A.3)
s⁽⁰⁾(t, θ) > 0 on [0, τ] × [−M, M]^p and the matrix $\sum (υ) = \int_{0}^{τ} J (t, β (υ)) \times λ_{0} (t, υ) s^{(0)} (t, β (υ)) dt$ is positive definite.
(A.4)
E(N_i(dt, dυ)|ℱ_t−) = E(N_i(dt, dυ)|Y_i(t), Z_i(t)), where ℱ_t = σ{I (X_i ≤ s, δ_i = 1), I(X_i ≤ s, δ_i = 0), V_i I (X_i ≤ s, δ_i = 1), Z_i(s); 0 ≤ s ≤ t, i = 1,…,n} is the (right-continuous) filtration generated by {N_i(s, υ), Y_i(s), Z_i(s); 0 ≤ s ≤ t, 0 ≤ υ ≤ 1, i = 1,…,n}.
(A.5)
The kernel function K(·) is symmetric with support [−1, 1] and of bounded variation. The bandwidth satisfies nh² → ∞ and nh⁵ → 0 as n → ∞.

Note that the condition (A.2) holds under the condition (A.1) given some additional moment conditions on Z(t) − Z(s) and exp(b^T Z(t)) − exp(b^T Z(s)). If Z(t) = Z, not depending on t, then (A.2) holds by the Donsker theorem (Theorem 19.5 of van der Vaart [20]). The condition (A.4) assumes that the mark-specific instantaneous failure rate at time t given the observed information up to time t only depends on the failure status and the current covariate value. Under (A.4) and by the definition (1), E(N_i(dt, dυ)|ℱ_t−) = Y_i(t)λ(t, υ|Z_i(t)) dt dυ, and $M_{i} (t, υ) = \int_{0}^{t} \int_{0}^{υ} [N_{i} (ds, dx) - Y_{i} (s) λ (s, x | Z_{i} (s)) ds dx]$ is a martingale with respect to ℱ_t for each fixed υ ([11], page 31). Further, it follows by Aalan and Johansen [1] that M_i(·, υ₁) and M_i(·, υ₂) − M_i(·, υ₁) are orthogonal square integrable martingales with respect to ℱ_t for any 0 ≤ υ₁ ≤ υ₂ ≤ 1. To avoid the problems at the boundaries υ = 0, 1, we shall study the asymptotic properties of β̂(υ) for the interior values of υ ∈ [a, b] ⊂ (0, 1).

First we present the following result that is essential for proving the asymptotic normality of β̂(υ) and provides insight into the constructions of the confidence bands and test statistics that follow. Let

{\tilde{W}}_{A} (υ) = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{a}^{υ} \int_{0}^{τ} A (u) [Z_{i} (t) - \frac{s^{(1)} (t, β (u))}{s^{(0)} (t, β (u))}] M_{i} (dt, du),

(6)

where A(u) is a deterministic p × p matrix with bounded components and 0 ≤ a < b ≤ 1.

THEOREM 1

Assume that each component of the p × p matrix A(υ), υ ∈ [a, b], is continuous. Under conditions (A.1)–(A.4), W̃_A(υ) converges weakly to a p-dimensional mean-zero Gaussian martingale, W_A(υ), with continuous sample paths on υ ∈ [a, b]. The covariance matrix of W_A(υ) is given by $Cov (W_{A} (υ)) = \int_{a}^{υ} A (u) \sum (u) A (u) du$ .

Let

{\hat{Σ}}_{\hat{A}} (υ) = n^{- 1} \sum_{i = 1}^{n} \int_{a}^{υ} \int_{0}^{τ} \hat{A} (u) J_{n} (t, \hat{β} (u)) {\hat{A}}^{T} (u) N_{i} (dt, du),

(7)

where Â(υ) is a consistent estimator of A(υ) uniformly in υ ∈ [a, b] ⊂ [0, 1]. It can be shown that Σ̂_A(υ) is a consistent estimator of Cov(W_A(υ)).

The consistency and asymptotic normality of β̂(υ) are established in the next two theorems.

THEOREM 2

Under conditions (A.1)–(A.5), β̂(υ) converges to β(υ) uniformly in υ ∈ [a, b] ⊂ (0, 1).

THEOREM 3

Under conditions (A.1)–(A.5), ${(nh)}^{1 / 2} (\hat{β} (υ) - β (υ)) \overset{𝒟}{\to} N (0, ν_{0} Σ^{- 1} (υ))$ for υ ∈ [a, b].

The proof of Theorem 3 uses a Taylor expansion of the score function, leading to $\hat{β} (υ) - β (υ) = - {(l_{β}^{″} (υ, β^{*} (υ)))}^{- 1} U (β (υ))$ , where β^*(υ) is on the line segment between β̂(υ) and β(υ). The asymptotic variance of n^−1/2h^1/2U (β(υ)) is shown to be ν₀Σ(υ), which is the in probability limit of ${\tilde{Σ}}_{n} (β (υ)) = n^{- 1} h \times \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} {(K_{h} (u - υ))}^{2} J_{n} (t, β (υ)) N_{i} (dt, du)$ . It can also be shown that $\hat{Σ} (υ) \equiv - l_{β}^{″} (υ, \hat{β} (υ)) / n \overset{P}{\to} Σ (υ) as n \to \infty$ . Thus, the asymptotic variance of (nh)^1/2 × (β̂(υ) − β(υ)) can be estimated by ${\hat{Σ}}_{1} (υ) = {(l_{β}^{″} (υ, \hat{β} (υ)) / n)}^{- 1} {\tilde{Σ}}_{n} (\hat{β} (υ)) {(l_{β}^{″} (υ, \hat{β} (υ)) / n)}^{- 1}$ . An alternative estimator is ${\hat{Σ}}_{2} (υ) = - ν_{0} {(l_{β}^{″} (υ, \hat{β} (υ)) / n)}^{- 1}$ . It is easy to check that ν₀ = 3/5 for Epanechnikov’s kernel $K (x) = \frac{3}{4} (1 - x^{2}), - 1 < x < 1$ . Simulations indicate that the two estimators have similar finite sample performance.

Theorem 3 will lead to the construction of pointwise confidence intervals for VE(υ). Simultaneous inference over υ ∈ [a, b] will be possible in terms of the estimate $\hat{B} (υ) = \int_{a}^{υ} \hat{β} (u) du$ of the cumulative regression coefficient $B (υ) = \int_{a}^{υ} β (u) du$ . We have the following weak convergence result for B̂(υ).

THEOREM 4

Under conditions (A.1)–(A.5), n^1/2(B̂(υ) − B(υ)) converges weakly to a p-dimensional mean-zero Gaussian martingale W_Σ⁻¹(υ) with continuous sample paths on υ ∈ [a, b]. The covariance matrix of W_Σ⁻¹(υ) is $\int_{a}^{υ} Σ {(u)}^{- 1} du$ , which can be consistently estimated by Σ̂_Â(υ) defined by (7) with A(υ) = (Σ(υ))⁻¹ and Â(υ) = (Σ̂(υ))⁻¹.

2.3. Confidence bands for vaccine efficacy

Let $β (υ) = {(β_{1} (υ), β_{2}^{T} (υ))}^{T}$ . Then the vaccine efficacy can be expressed as VE(υ) = 1 − exp(β₁(υ)). The estimated vaccine efficacy is $\hat{VE} (υ) = 1 - exp ({\hat{β}}_{1} (υ))$ . By Theorem 3 and the delta method, ${(nh)}^{1 / 2} (\hat{VE} (υ) - VE (υ)) \overset{𝒟}{\to} N (0, ν_{0} σ_{1}^{2} (υ) exp (2 β_{1} (υ)))$ for υ ∈ [a, b], where $σ_{1}^{2} (υ)$ is the first element on the diagonal of Σ⁻¹(υ). Let ${\hat{σ}}_{β_{1}}^{2} (υ)$ be the first element on the diagonal of Σ̂₁(υ). By the discussions on the consistent estimators for the asymptotic variance following Theorem 3, ${\hat{σ}}_{β_{1}}^{2} (υ)$ is a consistent estimator for $ν_{0} σ_{1}^{2} (υ)$ . A pointwise 100(1 − α)% confidence band for VE(υ) is given by

\hat{VE} (υ) \pm {(nh)}^{- 1 / 2} z_{α / 2} {\hat{σ}}_{β 1} (υ) exp ({\hat{β}}_{1} (υ)), a \leq υ \leq b,

(8)

where z_α/2 is the upper α/2 quantile of the standard normal distribution.

To derive simultaneous confidence bands for the cumulative vaccine efficacy $CV (υ) = \int_{a}^{υ} VE (u) du$ , we consider the point estimator $\hat{CV} (υ) = \int_{a}^{υ} \hat{VE} (u) du$ . Then

\sqrt{n} (\hat{CV} (υ) - CV (υ)) = \sqrt{n} \int_{a}^{υ} (exp (β_{1} (υ)) - exp ({\hat{β}}_{1} (υ))) du .

Note that $\sqrt{n} (\hat{CV} (υ) - CV (υ)) \approx \sqrt{n} \int_{a}^{υ} exp (β_{1} (υ) (β_{1} (υ) - {\hat{β}}_{1} (υ)) du$ . From the proof of Theorem 4, it can be shown that $\sqrt{n} (\hat{CV} (υ) - CV (υ))$ converges weakly to a mean-zero Gaussian process, $e_{1}^{T} W_{A} (υ), a \leq υ \leq b$ , with continuous paths and independent increments, where A(υ) = exp(β₁(υ))Σ(υ)⁻¹ and e₁ is the first column of the p × p identity matrix. The variance of $e_{1}^{T} W_{A} (υ)$ equals $ρ^{2} (υ) = \int_{a}^{υ} σ_{1}^{2} (u) exp (2 β_{1} (u)) du$ by Theorem 1, which can be conveniently estimated by $\int_{a}^{υ} {\hat{σ}}_{1}^{2} (u) exp (2 {\hat{β}}_{1} (u)) du$ , where ${\hat{σ}}_{1}^{2} (υ)$ is the first element of the diagonal of Σ̂(υ)⁻¹. We suspect that this estimator may ignore the finite sample correlations of β₁(υ) − β̂₁(υ) at different values of υ, thus over- or underestimating the true variance. We propose to use ${\hat{ρ}}^{2} (υ) = e_{1}^{T} {\hat{Σ}}_{Â} (υ) e_{1}$ as the estimator of the asymptotic variance of $\sqrt{n} (\hat{CV} (υ) - CV (υ))$ , where Σ̂_Â(υ) is obtained from (7) with Â(υ) = exp(β̂₁(υ))Σ̂(υ)⁻¹, which is uniformly consistent by Theorem 1. Consequently, a pointwise 100(1 − α)% confidence band for CV(υ) is given by

\hat{CV} (υ) \pm n^{- 1 / 2} z_{α / 2} \hat{ρ} (υ), a \leq υ \leq b .

(9)

Let 𝒱 be a set of values of υ in [a, b]. We may take 𝒱 = [a, b] or 𝒱 = {υ_k, k = 1,…,K} with υ₁ < ··· < υ_K. Note that if U(υ) is a Gaussian martingale with variance ρ²(υ), for a ≤ υ ≤ b, then U(υ)ρ(b)[ρ²(b) + ρ²(υ)]⁻¹ has the same distribution as B⁰(ρ²(υ)/(ρ²(b) + ρ²(υ))), a ≤ υ ≤ b, where B⁰(·) is a Brownian bridge. By the weak convergence of $\sqrt{n} (\hat{CV} (υ) - CV (υ))$ , the uniform consistency of ρ̂²(υ) to ρ²(υ) and the continuous mapping theorem, we have

\begin{matrix} sup_{υ \in 𝒱} | \sqrt{n} (\hat{CV} (υ) - CV (υ)) \hat{ρ} (b) / ({\hat{ρ}}^{2} (b) + {\hat{ρ}}^{2} (υ)) | \\ \overset{𝒟}{\to} sup_{υ \in 𝒱} | B^{0} (ρ^{2} (υ) / (ρ^{2} (b) + ρ^{2} (υ))) | . \end{matrix}

Thus a simultaneous 100(1 − α)% confidence band for CV(υ), υ ∈ 𝒱, is given by

\hat{CV} (υ) \pm n^{- 1 / 2} u_{α} [{\hat{ρ}}^{2} (b) + {\hat{ρ}}^{2} (υ)] / \hat{ρ} (b),

(10)

where u_α is the upper α-quantile of the distribution of sup_υ∈𝒱|B⁰(ρ²(υ)/(ρ²(b) + ρ²(υ)))|. The u_α is the upper α-quantile of sup_0≤υ≤1/2|B⁰(υ)| if 𝒱 = [a, b], which has been tabulated by Schumacher [16] for some α values. In the simulation study presented in the next section, we estimate u_α by the upper α-quantile of the distribution of sup_υk∈𝒱|B⁰(ρ̂²(υ_k)/(ρ̂²(b) + ρ̂²(υ_k)))| in both cases when 𝒱 = [a, b] or 𝒱 = {υ_k, k = 1,…,K}, which can be obtained by simulating a Brownian bridge for given ρ̂²(υ).

Alternatively, other resampling techniques such as the Gaussian multiplier method of Lin, Wei and Ying [10] can be used to estimate the critical value u_α. This method can be briefly outlined as follows. Let ξ₁,…,ξ_n be i.i.d. standard normal random variables and

W_{Â}^{*} (υ) = n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} \int_{0}^{υ} \int_{0}^{τ} Â (u) [Z_{i} (t) - \frac{S^{(1)} (t, \hat{β} (u))}{S^{(0)} (t, \hat{β} (u))}] M_{i} (dt, du) .

(11)

Then the distribution $\sqrt{n} (\hat{CV} (υ) - CV (υ))$ can be approximated by the conditional distribution of $e_{1}^{T} W_{Â}^{*} (υ)$ given the observed data sequence, where Â = exp(β̂₁(υ)) × (Σ̂(υ))⁻¹. Consequently, the distribution of ${sup}_{υ \in 𝒱} | \sqrt{n} (\hat{CV} (υ) - CV (υ)) \hat{ρ} (b) {[{\hat{ρ}}^{2} (b) + {\hat{ρ}}^{2} (υ)]}^{- 1} |$ can be approximated by the conditional distribution of $U^{*} = {sup}_{υ \in 𝒱} | e_{1}^{T} W_{Â}^{*} (υ) \hat{ρ} (b) {[{\hat{ρ}}^{2} (b) + {\hat{ρ}}^{2} (υ)]}^{- 1} |$ given the observed data sequence. Let $u_{α}^{*}$ be the (1 − α)-quantile of the copies of U^* obtained by repeatedly generating sets of i.i.d. standard normal random variables. A simultaneous 100(1 − α)% confidence band for CV(υ), υ ∈ 𝒱, is given by

\hat{CV} (υ) \pm n^{- 1 / 2} u_{α}^{*} [{\hat{ρ}}^{2} (b) + {\hat{ρ}}^{2} (υ)] / \hat{ρ} (b) .

(12)

This resampling technique is also applicable to the hypothesis tests for vaccine efficacy developed in the next subsection.

2.4. Testing vaccine efficacy

We are interested in testing the following two sets of hypotheses. The first set of hypotheses is

\begin{matrix} H_{10} : VE (υ) = 0 for υ \in [a, b] \\ versus & H_{1 a} : VE (υ) \neq 0 for some υ (general alternative) \\ or & H_{1 m} : VE (υ) \geq 0 with strict inequality for at least some υ (monotone alternative) . \end{matrix}

The second set of hypotheses is

\begin{matrix} H_{20} : VE (υ) does not depend on υ \in [a, b] \\ versus & H_{2 a} : VE (υ) depends on υ (general alternative) \\ or & H_{2 m} : VE (υ) decreases as υ increases (monotone alternative) . \end{matrix}

Let β₁(υ) be the first component of β(υ). Then the null hypothesis H₁₀ is equivalent to β₁(υ) = 0 and the null hypothesis H₂₀ is equivalent to β₁(υ) does not depend on υ. The null hypothesis H₁₀ implies the vaccine affords no protection against any infecting strain of virus. The alternative H_1m indicates that the vaccine provides protection for at least some of the infecting strains, while H_1a states that the vaccine provides either protection or increased risk for some infecting strains. The null hypothesis H₂₀ implies there is no difference in vaccine protection for different infecting strains, measured by their distance υ to the strains contained in the vaccine. The ordered alternative H_2m states that vaccine efficacy decreases with υ and the alternative H_2a indicates that the vaccine efficacy changes with υ.

In this section, we develop some test procedures for detecting departures from H₁₀ in the direction of H_1m and H_1a and for detecting departures from H₂₀ in the direction of H_2m and H_2a. By Theorem 4 and the discussions in Section 2.3, the process $\sqrt{n} (\hat{CV} (υ) - CV (υ)), a \leq υ \leq b$ , converges weakly to a Gaussian martingale with predictable variation ρ²(υ). Let $ξ (υ) = \sqrt{n} (\hat{CV} (υ) - CV (υ)) / ρ (b)$ . It follows from Theorem 4 that $ξ (υ) \overset{𝒟}{\to} W (t (υ)), a \leq υ \leq b$ , where W(·) is a Wiener process and t(υ) = ρ²(υ)/ρ²(b).

To test H₁₀, let ${\hat{Z}}^{(1)} (υ) = \sqrt{n} \hat{CV} (υ) / \hat{ρ} (b) and \hat{t} (υ) = {\hat{ρ}}^{2} (υ) / {\hat{ρ}}^{2} (b)$ . Consider the following test statistics:

T_{a}^{(1)} = \int_{a}^{b} {({\hat{Z}}^{(1)} (υ))}^{2} d \hat{t} (υ), T_{m 1}^{(1)} = \int_{a}^{b} {\hat{Z}}^{(1)} (υ) d \hat{t} (υ) .

These test statistics have somewhat complicated null distributions (see below) so we consider the following simpler test statistic based on a finite grid, which leads to a standard normal null distribution:

T_{m 2}^{(1)} = {(K - 1)}^{- 1 / 2} \sum_{k = 2}^{K} ({\hat{Z}}^{(1)} (υ_{k}) - {\hat{Z}}^{(1)} (υ_{k - 1})) / {(\hat{t} (υ_{k}) - \hat{t} (υ_{k - 1}))}^{1 / 2},

where a ≤ υ₁ < … < υ_K ≤ b are the grid points in [a, b]. A similar test statistic with a standard normal null distribution is also proposed for H₂₀ later. Under H₁₀, $T_{a}^{(1)} \overset{𝒟}{\to} \int_{a}^{b} (W (t (υ)))^{2} d t (υ) \overset{𝒟}{=} \int_{0}^{1} {(W (t))}^{2} d t, T_{m 1}^{(1)} \overset{𝒟}{\to} \int_{a}^{b} W (t (υ)) d t (υ) \overset{𝒟}{=} \int_{0}^{1} W (t) d t and T_{m 2}^{(1)} \overset{𝒟}{\to} N (0, 1)$ . The distributions of $T_{a}^{(1)} and T_{m 1}^{(1)}$ under H₁₀ can also be approximated by those of $\int_{a}^{b} {(W (\hat{t} (υ)))}^{2} d \hat{t} (υ) and \int_{a}^{b} W (\hat{t} (υ)) d \hat{t} (υ)$ for given t̂(υ), respectively, which are used in the numerical studies for better finite sample approximations. We denote the upper α-quantiles of these two distributions by $c_{a}^{(1)} and c_{m 1}^{(1)}$ , respectively.

The test statistic $T_{a}^{(1)}$ captures general departures H_1a, while the test statistics $T_{m 1}^{(1)} and T_{m 2}^{(1)}$ are sensitive to the monotone departure H_1m. Both test statistics $T_{m 1}^{(1)} and T_{m 2}^{(1)}$ are likely to be positive when VE(υ) ≥ 0 for all υ with strict inequality for some υ. Hence the tests based on $T_{a}^{(1)}, T_{m 1}^{(1)} and T_{m 2}^{(1)}$ reject H₁₀ if $T_{a}^{(1)} > c_{a}^{(1)}, T_{m 1}^{(1)} > c_{m 1}^{(1)} and T_{m 2}^{(1)} > z_{α}$ , respectively.

To test H₂₀, let ${\hat{Z}}^{(2)} (υ) = \sqrt{n} (\frac{1}{υ - a} \hat{CV} (υ) - \frac{1}{b - a} \hat{CV} (b)) / \hat{ρ} (b)$ . Note that, under H₂₀, ${\hat{Z}}^{(2)} (υ) = \sqrt{n} [\frac{1}{υ - a} (\hat{CV} (υ) - CV (υ)) - \frac{1}{b - a} (\hat{CV} (b) - CV (b))] / \hat{ρ} (b)$ . By Theorem 4 and the continuous mapping theorem, under H₂₀, ${\hat{Z}}^{(2)} (υ) \overset{𝒟}{\to} \frac{1}{υ - a} W (t (υ)) - \frac{1}{b - a} W (1) \equiv Z^{(2)} (υ)$ for υ ∈ [a₁, b], where a < a₁ < b. We propose the following test statistics for evaluating H₂₀:

\begin{matrix} T_{a}^{(2)} = \int_{a_{1}}^{b} {({\hat{Z}}^{(2)} (υ))}^{2} d \hat{t} (υ), T_{m 1}^{(2)} = \int_{a_{1}}^{b} {\hat{Z}}^{(2)} (υ) d \hat{t} (υ), \\ T_{m 2}^{(2)} = {\hat{Π}}_{K}^{- 1} \sum_{k = 2}^{K} ({\hat{Z}}^{(2)} (υ_{k - 1}) - {\hat{Z}}^{(2)} (υ_{k})) / {\hat{π}}_{k}, \end{matrix}

where a₁ ≤ υ₁ < … < υ_K ≤ b are K grid points in [a₁, b], ${\hat{π}}_{k}^{2}$ is an estimate of the variance $π_{k}^{2} = Var (Z^{(2)} (υ_{k - 1}) - Z^{(2)} (υ_{k})) and {\hat{Π}}_{K}^{2}$ is an estimate of the variance $Π_{K}^{2} of \sum_{k = 2}^{K} (Z^{(2)} (υ_{k - 1}) - Z^{(2)} (υ_{k})) / π_{k}$ . By the covariance of the Wiener process, it is easy to show that

\begin{matrix} τ_{i, j} & = Cov (Z^{(2)} (υ_{i}), Z^{(2)} (υ_{j})) \\ = \frac{t (υ_{i})}{(υ_{i} - a) (υ_{j} - a)} - \frac{t (υ_{i})}{(υ_{i} - a) (b - a)} - \frac{t (υ_{j})}{(υ_{j} - a) (b - a)} + \frac{1}{{(b - a)}^{2}}, \end{matrix}

for υ_i ≤ υ_j. Thus, $π_{k}^{2} = τ_{k - 1, k - 1} - 2 τ_{k - 1, k} + τ_{k, k}$ . Let Γ = (τ_i,j)_K×K and

ξ^{T} = (π_{2}^{- 1}, π_{3}^{- 1} - π_{2}^{- 1}, \dots, π_{K}^{- 1} - π_{K - 1}^{- 1}, - π_{K}^{- 1}) .

It follows that Π_K = ξ^TΓξ. The estimates ${\hat{π}}_{k}^{2} and {\hat{Π}}_{K}^{2}$ are obtained by replacing t(υ) with t̂(υ).

By the weak convergence of Ẑ⁽²⁾(υ) to Z⁽²⁾(υ), and the convergence in probability of t̂(υ) to t(υ), a₁ ≤ υ ≤ b, we have $T_{m 2}^{(2)} \overset{𝒟}{\to} N (0, 1)$ under H₂₀. It also follows that $T_{a}^{(2)} \overset{𝒟}{\to} \int_{a_{1}}^{b} {(Z^{(2)} (υ))}^{2} d t (υ)$ , and $T_{m 1}^{(2)} \overset{𝒟}{\to} \int_{a_{1}}^{b} Z^{(2)} (υ) d t (υ)$ under H₂₀. The distributions of $T_{a}^{(2)} and T_{m 1}^{(2)}$ under H₂₀ can be approximated by those of $\int_{a_{1}}^{b} (W (\hat{t} (υ)) / (υ - a) - W (\hat{t} (b)) / (b - a))^{2} d \hat{t} (υ) and \int_{a_{1}}^{b} (W (\hat{t} (υ)) / (υ - a) - W (\hat{t} (b)) / (b - a)) d \hat{t} (υ)$ for given t̂(υ), respectively, which are used in the numerical studies for better finite sample approximations. We denote the upper α-quantiles of these two distributions by $c_{a}^{(2)} and c_{m 1}^{(2)}$ , respectively.

The test statistic $T_{a}^{(2)}$ captures general departures H_2a while the test statistics $T_{m 1}^{(2)} and T_{m 2}^{(2)}$ are sensitive to the monotone departure H_2m. Both $T_{m 1}^{(2)} and T_{m 2}^{(2)}$ are expected to be positive when VE(υ) decreases as υ increases, that is, when H_2m holds. Hence the tests $T_{a}^{(2)}, T_{m 1}^{(2)} and T_{m 2}^{(2)}$ reject H₂₀ if $T_{a}^{(2)} > c_{a}^{(2)}, T_{m 1}^{(2)} > c_{m 1}^{(2)} and T_{m 2}^{(2)} > z_{α}$ , respectively.

3. Simulation study

In this section, we conduct a simulation study to check the finite sample performance of the proposed estimation and hypothesis testing procedures using the simple mark-specific proportional hazards model:

λ (t, υ | z) = exp {γ υ + (α + β υ) z}, t \geq 0, 0 \leq υ \leq 1,

(13)

where α, β and γ are constants and the treatment indicator z takes value 0 or 1 with probability of 0.5 for each value. Under model (13), the mark-specific baseline function is λ₀(t, υ) = exp(γ υ) and VE(υ) = 1 − exp(α + β υ). The null hypothesis H₁₀ of no vaccine efficacy holds if both α = 0 and β = 0, and the null hypothesis H₂₀ that vaccine efficacy does not depend on the type of infecting strain is true if β = 0. Various choices of α and β specify different alternatives for H₁₀ and H₂₀.

We consider the following simulation models:

(α, β, γ) = (0, 0, 0.3), for the null hypothesis H_{10} of no vaccine efficacy;

(M1)

(α, β, γ) = (- 0.5, 0.5, 0.3), as the first alternative of H_{10};

(M2)

(α, β, γ) = (- 0.6, 0.6, 0.3), as the second alternative of H_{10};

(M3)

(α, β, γ) = (- 0.6, 0, 0.3), as the third alternative of H_{10};

(M4)

(α, β, γ) = (- 0.69, 0, 0.3), for the null hypothesis H_{20} that vaccine efficacy does not depend on the type of infecting strain;

(M5)

(α, β, γ) = (- 1.2, 1.2, 0.3), as the first alternative of H_{20};

(M6)

(α, β, γ) = (- 1.5, 1.5, 0.3), as the second alternative of H_{20};

(M7)

(α, β, γ) = (- 1.8, 1.8, 0.3), as the third alternative of H_{20} .

(M8)

The models (M2) to (M4) are considered as the alternatives for H_1m and H_1a. The departure from H₁₀: VE(υ) = 0 increases as the simulation model moves from (M2) to (M4). The models (M6) to (M8) are considered as the alternatives for H_2m and H_2a. The departure from H₂₀ increases as the simulation model moves from (M6) to (M8).

We generate the censoring times from an exponential distribution, independent of (T, V), with the censoring rates ranging from 20% to 30%. We set the interval of analyses for υ as [a, b] = [0.1, 0.9] and bandwidths are chosen as h = 0.05, 0.1, 0.15. The observed failure times with marks outside the interval [a, b] can also be used since the smoothing at υ takes the cases with marks in its h-neighborhood. The Epanechnikov kernel K(x) = 0.75(1 − x²)I{|x| ≤ 1} is used throughout. Sample sizes of n = 500 and 800 are studied.

For the tests $T_{m 2}^{(1)} and T_{m 2}^{(2)}$ , we take the grid of eight evenly spaced points in [a, b] from 0.196 to 0.868. Table 1 lists the empirical sizes and powers of the test statistics $T_{a}^{(1)}, T_{m 1}^{(1)} and T_{m 2}^{(1)}$ and Table 2 for the test statistics $T_{a}^{(2)}, T_{m 1}^{(2)} and T_{m 2}^{(2)}$ . The significance levels of these tests are given at α = 0.05. Both tables also list the coverage probabilities of the 95% simultaneous confidence intervals for CV(υ), for υ ∈ [a, b] and for υ in the grid. The critical values for the tests $T_{m 2}^{(1)} and T_{m 2}^{(2)}$ at α = 0.05 are z_α = 1.645. The critical values for the tests $T_{a}^{(1)} and T_{a}^{(2)}$ , $T_{m 1}^{(1)} and T_{m 1}^{(2)}$ are obtained by generating 10,000 Wiener processes W(·) with time parameter equal to t̂(υ) and calculating the corresponding functionals of W(t̂(υ)), as described in the previous section. Each entry in Tables 1 and 2 is based on 1000 repetitions.

TABLE 1.

Empirical sizes and powers of the tests $T_{a}^{(1)}, T_{m 1}^{(1)} and T_{m 2}^{(1)}$ at the nominal level 0.05, and coverage probabilities of the 95% simultaneous confidence intervals for CV(υ) with υ on the grid and on [a, b]

Size/Power

Coverage

Model

(α, β, γ)

T_{a}^{(1)}

T_{m 1}^{(1)}

T_{m 2}^{(1)}

Grid

[a, b]

(0, 0, 0.3)

500

0.05

2.9

3.1

7.8

97.5

98.1

0.1

4.9

5.9

8.3

96.6

97.4

0.15

5.1

6.9

7.3

96.2

96.8

800

0.05

5.3

2.8

6.9

95.9

96.8

0.1

5.7

4.7

6.8

95.5

97.0

0.15

5.8

5.2

6.3

95.6

96.5

(−0.5, 0.5, 0.3)

500

0.05

45.4

56.3

63.2

97.6

98.0

0.1

60.3

71.4

65.7

97.0

97.5

0.15

66.0

77.4

65.5

96.7

97.6

800

0.05

69.1

78.4

77.5

96.1

96.8

0.1

80.3

86.5

80.1

95.6

96.7

0.15

82.9

89.1

80.1

96.0

97.2

(−0.6, 0.6, 0.3)

500

0.05

59.7

70.0

76.5

97.5

98.0

0.1

75.4

83.9

78.8

96.9

97.8

0.15

80.9

87.2

78.5

96.9

97.9

800

0.05

83.7

90.4

87.6

96.2

96.9

0.1

90.8

94.4

89.6

96.0

96.8

0.15

93.0

96.0

89.6

96.2

97.2

(−0.6, 0, 0.3)

500

0.05

96.0

95.6

99.9

97.0

97.8

0.1

99.1

98.8

100

96.7

97.6

0.15

99.5

99.7

100

96.7

97.4

800

0.05

99.9

99.5

100

97.0

98.0

0.1

100

96.9

97.3

0.15

100

96.4

97.4

Open in a new tab

TABLE 2.

Empirical sizes and powers of the tests $T_{a}^{(2)}, T_{m 1}^{(2)} and T_{m 2}^{(2)}$ at the nominal level 0.05, and coverage probabilities of the 95% simultaneous confidence intervals for CV(υ) with υ on the grid and on [a, b]

Size/Power

Coverage

Model

(α, β, γ)

T_{a}^{(2)}

T_{m 1}^{(2)}

T_{m 2}^{(2)}

grid

[a, b]

(−0.69, 0, 0.3)

500

0.05

1.6

3.7

97.0

97.8

0.1

2.1

3.7

4.5

96.5

97.5

0.15

2.1

3.5

4.6

96.8

97.3

800

0.05

2.3

4.0

2.9

97.3

98.3

0.1

2.6

4.3

3.2

97.0

97.6

0.15

2.1

3.5

3.0

96.9

97.4

(−1.2, 1.2, 0.3)

500

0.05

47.2

67.6

47.7

97.9

98.5

0.1

60.2

76.7

62.3

97.1

97.6

0.15

63.2

80.3

73.3

97.5

97.8

800

0.05

69.2

85.1

69.2

96.5

97.2

0.1

80.4

92.0

80.4

96.6

97.6

0.15

84.2

94.1

88.4

96.9

97.8

(−1.5, 1.5, 0.3)

500

0.05

63.8

81.4

62.1

97.7

98.0

0.1

76.9

78.0

63.6

97.2

98.0

0.15

81.2

91.7

86.3

97.6

98.0

800

0.05

85.1

94.4

82.6

96.2

97.1

0.1

93.2

98.2

91.8

96.1

97.6

0.15

96.0

98.9

97.4

96.7

97.7

(−1.8, 1.8, 0.3)

500

0.05

77.6

89.1

73.6

97.8

98.5

0.1

87.1

95.6

85.7

97.3

98.4

0.15

91.5

96.9

92.8

97.7

98.7

800

0.05

93.5

98.2

91.4

96.4

97.4

0.1

98.2

99.5

97.0

96.3

97.5

0.15

99.3

99.9

99.2

96.5

97.9

Open in a new tab

Most tests have appropriate sizes close to 5%. The test $T_{a}^{(2)}$ seems to be conservative for the simulation models used in the study. The test $T_{m 1}^{(1)}$ has better power than the tests $T_{a}^{(1)} and T_{m 2}^{(1)}$ . The test $T_{m 1}^{(2)}$ has better power than the tests $T_{a}^{(2)} and T_{m 2}^{(2)}$ . Therefore the tests that incorporate $\hat{CV} (υ)$ over the entire range [a, b] present greater power than the simpler tests based on $\hat{CV} (υ)$ over the grid. We also observed that the powers of the tests seem to be influenced by the selection of bandwidth, with greater power for a larger bandwidth. Similar plots (not included here) to Figure 1 and Figure 2 but with larger bandwidth h = 0.2 show that the estimated standard errors of $\hat{CV} (υ)$ become smaller for larger h while the biases stay approximately the same, resulting in increased power for the larger bandwidth. We suspect that this phenomenon is associated with the sample size and the convergence rate of the normalized $\hat{CV} (υ)$ to a Wiener process. The dependence of the power on the bandwidth should become small as the sample size increases. Further study on the bandwidth selection is warranted.

FIG. 1 — Plots of estimates for β(υ), VE(υ) and CV(υ) under the models M1, M2, M5 and M6 for n = 500, h = 0.1. The solid dark lines are the true functions and the dashed lines are the averages of the estimates based on 1000 repetitions. The gray lines are the corresponding estimates for β(υ), VE(υ) and CV(υ) of 50 random samples.

FIG. 2 — Plots of the standard errors under the models M1, M2, M5 and M6, based on n = 500, h = 0.1. The solid lines are the averages of the estimates of the standard deviations of β̂(υ), $\hat{VE} (υ) and \hat{CV} (υ)$ , while the dashed lines are the sample standard deviations of β̂(υ), $\hat{VE} (υ) and \hat{CV} (υ)$ , based on 1000 repetitions. The gray lines are the corresponding estimates for the standard deviations of β̂(υ), $\hat{VE} (υ) and \hat{CV} (υ)$ of 50 random samples.

The coverage probabilities of the simultaneous confidence intervals for CV(υ) are closer to the 95% nominal level for υ on the grid than on [a, b]. This may be explained by the fact that the convergence for υ over the entire range [a, b] is slower than the convergence on the grid. The evaluations of the proposed estimators for β(υ), VE(υ) and CV(υ) and their respective estimators of the standard deviations under some of the simulation models are presented in Figure 1 and Figure 2. The plots of the pointwise coverage probabilities for VE(υ) and for CV(υ) are given in Figure 3. These plots are based on n = 500 and h = 0.1.

FIG. 3 — Plots of the pointwise coverage probabilities for VE(υ) (gray lines) and for CV(υ) (solid lines), based on n = 500, h = 0.1 and 1000 repetitions. The models on the left panel are M1, M2 and M3. The models on the right panel are M5, M6 and M7.

Now we demonstrate with a simulation example that the adoption of a standard method for testing the vaccine efficacy that ignores the mark is inefficient and can be misleading. We consider a special case of the model discussed in the Introduction, with λ(t, υ|z = 0) = 1 and λ(t, υ|z = 1) = 2υ, for t ≥ 0, 0 ≤ υ ≤ 1. The covariate z is again a treatment indicator taking values 0 and 1 with probability of 0.5 for each value. The marginal hazards model ignoring the mark is therefore λ(t|z = 0) = 1 and λ(t|z = 1) = 1, for t ≥ 0. The rest of the simulation setup such as the percentage of censorship, the kernel function and the bandwidth is the same as for the previous models. The model considered here represents both a proportional mark-specific hazards model for λ(t, υ|z) and a proportional hazards model for λ(t|z) = λ₀(t) exp(βz), with the mark-specific vaccine efficacy VE(υ) = 1 − 2υ and the marginal VE = 1 − exp(β) = 0. The standard Wald test, denoted by T_w, under the marginal Cox model is often used to test for the vaccine efficacy. As expected, the standard Wald test shows no power (Table 3). It is incapable of revealing any vaccine efficacy or that the vaccine efficacy depends on the mark, thus missing the important scientific finding that the vaccine protects against viruses with smaller mark values (V < 0.5) and increases risk of infection with viruses with larger mark values (V > 0.5). The example we constructed here shows the weakness of using the standard approach that ignores the mark and is what motivates the present research.

TABLE 3.

Comparison of the standard Wald test with the proposed tests $T_{a}^{(1)}, T_{m 1}^{(1)}, T_{m 2}^{(1)}, T_{a}^{(2)}, T_{m 1}^{(2)} and T_{m 2}^{(2)}$ at the nominal level 0.05

Power

T_w

T_{a}^{(1)}

T_{m 1}^{(1)}

T_{m 2}^{(1)}

T_{a}^{(2)}

T_{m 1}^{(2)}

T_{m 2}^{(2)}

500

0.05

5.9

14.9

24.2

16.6

98.0

99.4

97.3

0.1

–

23.9

35.7

16.0

99.6

100

99.8

0.15

–

27.9

39.1

15.7

99.9

100

99.9

800

0.05

6.1

32.4

39.6

15.0

100

99.6

0.1

–

43.1

51.5

13.8

100

0.15

–

46.0

53.3

13.9

100

Open in a new tab

4. Application

The first preventive HIV vaccine efficacy trial was carried out in North America and The Netherlands, and enrolled 5403 HIV-negative volunteers at risk for acquiring HIV infection [4]. Volunteers were randomized in a 2:1 ratio to receive a recombinant glycoprotein 120 vaccine (AIDSVAX) or placebo, and were monitored for HIV infection at semi-annual HIV testing visits for 36 months. The primary objective was to assess VE using the standard Cox model, and a secondary objective was to test H₁₀: VE(t, υ) = 0 and H₂₀: VE(t, υ) = VE(t) for three different mark variables V defined in terms of the percent mismatch of aligned amino acid sequences (for each infecting HIV sequence compared to the HIV sequence [named GNE8] contained in the AIDSVAX construct) in three subregions of HIV-gp120. For brevity, in this article we consider only one mark V, defined as the percent mismatch of amino acids in the whole gp120 region (581 amino acids long), where all possible mismatches of particular pairs of amino acids (e.g., A versus C) are weighted by the estimated probability of interchange [13]. The distance is based on the gp120 region because this region contains neutralizing epitopes that potentially can induce anti-HIV antibody responses that prevent HIV infection [22]; the vaccine was designed to protect by stimulating high titer antibodies that neutralize exposing HIVs. Of the 368 individuals infected during the trial, 32 had missing marks. Of the remaining 336 samples, all marks were unique (217 vaccine; 119 placebo).

The vaccine efficacy is estimated and tested by adjusting for two covariates: age (ranging 18–62 years with mean of 36.5) and behavioral risk score (taking values 0–7) as defined in [4]. It is relevant to adjust for these covariates because they predict infection rate and because trial participants with different values of these covariates may be exposed to HIV strains with different distributions of V. Both covariates are considered as continuous variables. The histograms of the rescaled mark values, ages in years and behavioral risk scores are plotted in Figure 4. We denote the treatment indicator by z₁ (z₁ = 1 for the vaccine and z₁ = 0 for the placebo), age by z₂ and behavioral risk score by z₃, and denote the corresponding coefficient functions by β₁(υ), β₂(υ) and β₃(υ). Fitting model (2) with h = 0.3, the plots of the estimates for β₁(υ), β₂(υ) and β₃(υ) and their pointwise confidence bands are given in Figure 5. The plots of $\hat{VE} (υ) and \hat{CV} (υ)$ with their corresponding pointwise confidence bands adjusting for the two covariates z₂ and z₃ are given in Figure 6.

FIG. 4 — Histograms for the observed mark values, ages in years and behavioral risk scores. The left panel is for the vaccine group and the right panel is for the placebo group.

FIG. 5 — Plots of the estimated regression coefficients β₁(υ), β₂(υ) and β₃(υ) and their 95% pointwise confidence bands for the vaccine trial data with h = 0.3.

FIG. 6 — Plots of the estimates of VE(υ) and CV(υ) and their confidence bands for the vaccine trial data with h = 0.3. The dashed lines are 95% pointwise confidence bands and the dotted lines are 95% simultaneous confidence bands.

Adjusting for age and behavioral risk score, the Wald test statistic for testing the marginal VE = 0 using the standard Cox model is −0.978, yielding a p-value of 0.328 for the two-sided alternative and 0.164 for the monotone alternative. Our test with the test statistic $T_{a}^{(1)}$ for H₁₀: VE(υ) = 0 for all υ versus the general alternative H_1a yields a p-value of 0.1532. The p-values for testing against the monotone alternative H_1m are 0.0916 for $T_{m 1}^{(1)}$ and 0.0228 for $T_{m 2}^{(1)}$ . These results give some, albeit weak, evidence of nonzero vaccine efficacy for at least one mark value; see Figure 6.

In addition, adjusting for age and behavioral risk score, we conducted the tests to evaluate whether the vaccine efficacy varies with the mark. The p-value for testing H₂₀ that VE(υ) does not depend on υ versus the general alternative H_2a is 0.2067 for the test statistic $T_{a}^{(2)}$ . The p-value for testing for the monotone alternative H_2m is 0.9363 for the test statistic $T_{m 1}^{(2)}$ and 0.9047 for the test statistic $T_{m 2}^{(2)}$ . These p-values are expected given the plots in Figure 6 where $\hat{VE} (υ)$ shows some tendency to increase with υ.

5. Discussion

This article developed inference techniques for the proportional hazards model with a continuous mark variable, including nonparametric methods for estimation and testing of mark-specific regression functions. These techniques can be used to estimate mark-specific vaccine efficacy (VE(υ)) and cumulative mark-specific vaccine efficacy (CV(υ)) with simultaneous confidence bands, and to test hypotheses for VE(υ), while adjusting for time-dependent covariate effects. The testing procedures based on the statistics $T_{m 1}^{(1)} and T_{m 2}^{(2)}$ showed greatest power in simulations and are recommended for testing VE(υ) = 0 for all υ and for testing VE(υ) independent of υ, respectively.

An alternative approach to the continuous mark-specific PH model would be a similar model that treats the mark variable as ordinal categorical. We focused on a continuous mark because (i) it most naturally suits the HIV vaccine application, as the choice of K bins for categorizing the marks would be arbitrary and (ii) testing β(υ) = β can often be done with greater power than testing equality of the cause-specific regression coefficients β₁ = … = β_K.

As is well known for a discrete mark-specific hazard function, the interpretation of the continuous mark-specific hazard function λ(t, υ) is restricted to actual study conditions, that is, it is the instantaneous rate of failure in the presence of all of the circulating competing risks (i.e., is a “crude” hazard in the terminology of Prentice et al. [15]). However, often the main scientific interest is in the “net” mark-specific hazard, the instantaneous rate of failure by mark υ in the absence of any other competing risks, but unfortunately this parameter is not identified except under untestable assumptions such as mutual independence of all of the notional (latent) mark-specific failure times [19]. This problem necessitates careful interpretation of inferences in the mark-specific PH model.

For the HIV vaccine trial example, the crude mark-specific hazard can be factored as

λ (t, υ | z) = λ_{E} (t, υ | z) \times λ_{PC} (t | υ, z)

(14)

where λ_E(t, υ|z) is the intensity of exposure to strain υ for participants with covariates z and λ_PC(t|υ, z) (the “per-contact” transmission hazard) is the same as λ(t, υ|z) except that it further conditions on the (unobserved) presence of exposure to a virus with genetic distance υ during [t, t + dt). Exposure can arise from unprotected sex or sharing a needle with an individual infected with strain υ. Therefore the identified parameter measures a mixture of vaccine/placebo-group differences in mark-specific exposure rates and in conditional mark-specific per-exposure transmission probabilities, whereas biological interest is in

{VE}^{PC} (t | υ, z_{2}) = 1 - \frac{λ_{PC} (t | υ, 1, z_{2})}{λ_{PC} (t | υ, 0, z_{2})}

as a measure of vaccine efficacy. However, as data are not available for estimating the relative intensity λ_E(t, υ|1, z₂)/λ_E(t, υ|0, z₂), our approach is to use

VE (t, υ | z_{2}) = 1 - \frac{λ (t, υ | 1, z_{2})}{λ (t, υ | 0, z_{2})}

as the target estimand, and assume identical exposure rates between the two groups, so this target has the same interpretation as VE^PC(t|υ, z₂). Reliance on this assumption demonstrates the value of including covariates z₂ that predict mark-specific exposure into the mark-specific PH model: the richer the covariate information the more likely VE(t, υ|z₂) reflects biological vaccine efficacy. Gilbert, McKeague and Sun [6] provided further discussion of the interpretation of mark-specific hazard ratios.

The usefulness of our approach relies on the validity of the mark-specific proportional hazards model. Lin, Wei and Ying [10] developed goodness-of-fit tests for the standard Cox model based on martingale residuals, and their tests can be extended to the present setting by using the mark-specific martingale residuals

{\hat{M}}_{i} (t, υ) = \int_{0}^{t} \int_{a}^{υ} [N_{i} (ds, du) - Y_{i} (s) exp ((\hat{β} (u))^{T} Z_{i}) {\hat{Λ}}_{0} (ds, du)],

(15)

for i = 1,…, n. These residuals may be interpreted as the difference at time t between the observed and the predicted number of events with mark less than υ for the ith subject, and are informative about model misspecification. It can be checked that $n^{- 1 / 2} \sum_{i = 1}^{n} {\hat{M}}_{i} (t, υ) = o_{p} (1)$ . This property is similar to that in the standard Cox model, where the sum of the martingale residuals is exactly zero. The difference here is caused by the kernel smoothing in a neighborhood of υ. Because β(υ) is treated nonparametrically, the checking of the model (2) needs further development and has additional issues related to the bandwidth. This would need a thorough treatment that is beyond the scope of the present paper.

Finally, we caution that the method proposed here requires large sample sizes to work well as demonstrated in the simulation study. This is the result of β(υ) being treated nonparametrically: the estimation of β(υ) utilizes only the observed failures with marks in a neighborhood of υ. Although this does not cause a problem in our application to the first preventive HIV vaccine trial (which has a sample size of 5403), one needs to be careful in applying the method to situations with small sample sizes.

Acknowledgments

The authors gratefully acknowledge David Jobes and VaxGen Inc. for providing the HIV sequence data. The authors also thank the Associate Editor and two referees for their valuable comments.

APPENDIX

The following lemma is an extension of Theorem 5.7 of Van der Vaart [20] and will be used to prove the uniform consistency of β̂(υ).

LEMMA A.1

Let Q_n(υ, θ) be random functions and let Q(υ, θ) be a fixed function of (υ, θ) ∈ [a, b] × Θ, Θ ⊂ ℝ^p. Let β(υ) be a fixed function of υ ∈ [a, b] taking values in Θ. Assume that ${sup}_{υ, θ} | Q_{n} (υ, θ) - Q (υ, θ) | \overset{P}{\to} 0$ and that for every ε > 0 there exists an η > 0 such that sup_{‖θ−β(υ)‖>ε} Q(υ, θ) < Q(υ, β(υ)) − η for υ ∈ [a, b]. Then for any sequence of estimators β̂(υ), with Q_n(υ, β̂(υ)) > Q_n(υ, β(υ)) − o_p(1) uniformly in υ ∈ [a, b], we have $\hat{β} (υ) \overset{P}{\to} β (υ)$ uniformly in υ ∈ [a, b].

PROOF

For every ε > 0, there exists an η > 0 such that

\begin{matrix} {sup_{υ} ‖ \hat{β} (υ) - β (υ) ‖ > ε} & \subset \underset{υ}{\cup} {‖ \hat{β} (υ) - β (υ) ‖ > ε} \\ \subset \underset{υ}{\cup} {Q (υ, \hat{β} (υ)) < Q (υ, β (υ)) - η} . \end{matrix}

Since $Q_{n} (υ, \hat{β} (υ)) > Q_{n} (υ, β (υ)) - o_{p} (1) \overset{P}{\to} Q (υ, β (υ))$ , uniformly in υ ∈ [a, b], we have Q_n(υ, β̂(υ)) > Q(υ, β(υ)) − o_p(1), uniformly in υ ∈ [a, b]. It follows that

\begin{matrix} \underset{υ}{\cup} {Q (υ, \hat{β} (υ)) < Q (υ, β (υ)) - η} \\ \subset \underset{υ}{\cup} {Q (υ, \hat{β} (υ)) < Q_{n} (υ, \hat{β} (υ)) - η + o_{p} (1)} \\ = {inf_{υ} (Q (υ, \hat{β} (υ)) - Q_{n} (υ, \hat{β} (υ))) < - η + o_{p} (1)} \\ = {sup_{υ} (Q_{n} (υ, \hat{β} (υ)) - Q (υ, \hat{β} (υ))) > η - o_{p} (1)} \\ \subset {sup_{υ} | Q_{n} (υ, \hat{β} (υ)) - Q (υ, \hat{β} (υ)) | > η - o_{p} (1)}, \end{matrix}

whose probability goes to 0 by the uniform convergence of Q_n(υ, θ) to Q(υ, θ). Hence P{sup_υ ‖β̂(υ) − β(υ)‖ > ε} → 0.

The following lemma is used to prove Theorem 3 and Theorem 4. Let $N = \sum_{i = 1}^{n} N_{i} and M = \sum_{i = 1}^{n} M_{i}$ .

LEMMA A.2

Under conditions (A.1)–(A.4), $n^{- 1} N (t, υ) \overset{P}{\to} E N_{i} (t, υ)$ , uniformly in (t, u) ∈ [0, τ] × [0, 1], and n^−1/2 M(t, υ) converges weakly to a mean-zero continuous Gaussian random field G(t, υ), (t, υ) ∈ [0, τ] × [0, 1], with independent increments and $Var (G (t, υ)) = \int_{0}^{t} \int_{0}^{υ} λ_{0} (s, u) s^{(0)} (s, β (u)) ds du$ .

PROOF

We treat ω_i = (X_i, δ_i, V_i), i = 1,…, n, as a random sample from a probability distribution P on a measurable space (𝒳, 𝒜), with 𝒳 = [0, ∞) × {0, 1} × [0, 1] and 𝒜 its Borel σ-field. Let ℱ be the class of all indicator functions f_{t, υ}: 𝒳 → R, where f_{t, υ}(ω_i) = I([0, t] × {1} × [0, υ])(ω_i) = I(X_i ≤ t, δ_i = 1, V_i ≤ υ), for 0 ≤ t ≤ τ, 0 ≤ υ ≤ 1. Then $n^{- 1} N (t, υ) = n^{- 1} \sum_{i = 1}^{n} f_{t, υ} (ω_{i})$ . Let ‖f_{t, υ}‖P,r = (P|f_{t, υ}|^r)^1/r be L_r(P)-norm of f_{t, υ}.

Let 0 = t₀ < t₁ < … < t_K = τ and 0 = υ₀ < υ₁ < … < υ_J = 1 be partitions of the intervals [0, τ] and [0, 1]. Define the bracketing functions l_kj = N_i(t_k−1, υ_j−1) and u_kj = N_i(t_k, υ_j), for k = 1,…, K, j = 1,…, J. Then for any f_{t, υ} ∈ ℱ, there is a bracket [l_kj, u_kj] such that f_{t, υ} ∈ [l_kj, u_kj]. And

\begin{matrix} ‖ u_{k j} - l_{k j} ‖_{P, 1} & \leq E (N_{i} (t_{k}, υ_{j}) - N_{i} (t_{k - 1}, υ_{j - 1})) \\ = \int_{0}^{t_{k}} \int_{0}^{υ_{j}} λ_{0} (s, x) s^{(0)} (s, β (x)) ds dx \\ - \int_{0}^{t_{k - 1}} \int_{0}^{υ_{j - 1}} λ_{0} (s, x) s^{(0)} (s, β (x)) ds dx \\ \leq \int_{t_{k - 1}}^{t_{k}} \int_{0}^{1} λ_{0} (s, x) s^{(0)} (s, β (x)) ds dx \\ + \int_{0}^{τ} \int_{υ_{j - 1}}^{υ_{j}} λ_{0} (s, x) s^{(0)} (s, β (x)) ds dx \\ \leq C_{1} (t_{k} - t_{k - 1}) + C_{2} (υ_{j} - υ_{j - 1}), \end{matrix}

where C₁ and C₂ are some positive constants. For any ε > 0, choose the grid points such that t_k − t_k−1 < ε and υ_j − υ_j−1 < ε. Then ‖u_kj − l_kj‖_P,1 ≤ [C₁ + C₂]ε. Hence, the bracketing number N_[·](ε, ℱ, L₁(P)) is of the polynomial order (1/ε)². By the Glivenko–Cantelli theorem (Theorem 19.4 of van der Vaart [20]), $n^{- 1} N (t, υ) \overset{P}{\to} E N_{i} (t, υ)$ , uniformly in (t, υ) ∈ [0, τ] × [0, 1].

Next, consider the processes {M_i(t, υ), 0 ≤ t ≤ τ, 0 ≤ υ ≤ 1}, i = 1,…, n, as a random sample from a probability distribution P on a measurable space (𝒳, 𝒜). Let ℱ be the class of coordinate projections f_{t, υ}: 𝒳 → R, where f_{t, υ}(M_i) = M_i(t, υ), for 0 ≤ t ≤ τ, 0 ≤ υ ≤ 1. The process {M_i(t, υ), 0 ≤ t ≤ τ, 0 ≤ υ ≤ 1} is determined by the {X_i, δ_i, δ_i V_i, Z_i}.

Again, let 0 = t₀ < t₁ < … < t_K = τ and 0 = υ₀ < υ₁ < … < υ_J = 1 be the partitions of the intervals [0, τ] and [0, 1]. Define the bracketing functions $l_{k j} = N_{i} (t_{k - 1}, υ_{j - 1}) - \int_{0}^{t_{k}} \int_{0}^{υ_{j}} Y_{i} (s) λ (s, x | Z_{i} (s)) ds dx and u_{k j} = N_{i} (t_{k}, υ_{j}) - \int_{0}^{t_{k - 1}} \int_{0}^{υ_{j - 1}} Y_{i} (s) λ (s, x | Z_{i} (s)) ds dx$ , for k = 1,…, K, j = 1,…, J. Then for any f_{t, υ} ∈ ℱ, there is a bracket [l_kj, u_kj] such that f_{t, υ} ∈ [l_kj, u_kj]. The bracket size is

\begin{matrix} ‖ u_{k j} - l_{k j} ‖_{P, 2} & \leq & ‖ N_{i} & (t_{k}, υ_{j}) - N_{i} (t_{k - 1}, υ_{j - 1}) ‖_{P, 2} \\ + ‖ & \int_{0}^{t_{k}} \int_{0}^{υ_{j}} Y_{i} (s) λ (s, x | Z_{i} (s)) ds dx \\ {- \int_{0}^{t_{k - 1}} \int_{0}^{υ_{j - 1}} Y_{i} (s) λ (s, x | Z_{i} (s)) ds dx ‖}_{_{P, 2}} \\ \leq & [C_{1} & (t_{k} - t_{k - 1}) + C_{2} (υ_{j} - υ_{j - 1})]^{1 / 2}, \end{matrix}

where C₁ and C₂ are some positive constants. For any ε > 0, choose the grid points such that t_k − t_k−1 < ε and υ_j − υ_j−1 < ε. Then ‖u_kj − l_kj‖_P,2 ≤ [C₁ + C₂]^1/2ε^1/2. Hence, the bracketing number N_[·](ε^1/2, ℱ, L₂(P)) is of the polynomial order (1/ε)². Thus, N_[·](ε, ℱ, L₂(P)) is of the polynomial order (1/ε)⁴. So the bracketing integral J_[·](1, ℱ, L₂(P)) < ∞. By the Donsker theorem (Theorem 19.5 of Van der Vaart [20]), $n^{- 1 / 2} M = {n^{- 1 / 2} \sum_{i = 1}^{n} M_{i} (t, υ), 0 \leq t \leq τ, 0 \leq υ \leq 1}$ converges weakly to a mean-zero Gaussian process G(t, υ), (t, υ) ∈ [0, τ] × [0, 1], which can be constructed to have continuous paths by Theorem 18.14 and Lemma 18.15 of van der Vaart [20].

Now we show that G(t, υ) has independent increments. Note that for t₁ ≤ t₂ and υ₁ ≤ υ₂, the covariance of G(t₁, υ₁) and G(t₂, υ₂) − G(t₁, υ₁) is E{M_i(t₁, υ₁) × (M_i(t₂, υ₂) − M_i(t₁, υ₁))}. By Aalan and Johansen [1], M_i(t, υ₁) and M_i(t, υ₂) − M_i(t, υ₁), 0 ≤ t ≤ τ, are orthogonal square integrable martingales for 0 ≤ υ₁ ≤ υ₂ ≤ 1. It follows that

\begin{matrix} E {M_{i} & (t_{1}, υ_{1}) (M_{i} (t_{2}, υ_{2}) - M_{i} (t_{1}, υ_{1}))} \\ = E {M_{i} (t_{1}, υ_{1}) (M_{i} (t_{2}, υ_{2}) - M_{i} (t_{2}, υ_{1}))} \\ + E {M_{i} (t_{1}, υ_{1}) (M_{i} (t_{2}, υ_{1}) - M_{i} (t_{1}, υ_{1}))} \\ = 0 . \end{matrix}

Hence G(t₁, υ₁) and G(t₂, υ₂) − G(t₁, υ₁) are independent.

PROOF OF THEOREM 1

It is easy to check that the conditions of Lemma 1 of Sun and Wu [18] are satisfied under Condition A. It follows that W̃_A(υ) converges weakly to a vector of continuous mean-zero Gaussian random processes, W_A(υ), υ ∈ [a, b]. Now we show that W_A(υ) has independent increments. Let $w_{i} (t, υ) = \int_{a}^{υ} \int_{0}^{t} A (u) [Z_{i} (t) - s^{(1)} (t, β (u)) / s^{(0)} (t, β (u))] M_{i} (dt, du)$ . Then ${\tilde{W}}_{A} (υ) = n^{- 1 / 2} \sum_{i = 1}^{n} w_{i} (τ, υ)$ . For a ≤ υ₁ ≤ υ₂ ≤ b, the covariance matrix of W_A(υ₁) and W_A(υ₂) − W_A(υ₁) is equal to E{w_i(τ, υ₁)(w_i(τ, υ₂) − w_i(τ, υ₁))^T}. Since M_i(t, υ₁) and M_i(t, υ₂) − M_i(t, υ₁), 0 ≤ t ≤ τ, are orthogonal square integrable martingales, it follows that w_i(t, υ₁) and w_i(t, υ₂) − w_i(t, υ₁), 0 ≤ t ≤ τ, are orthogonal square integrable martingales. Hence E{w_i(τ, υ₁)(w_i(τ, υ₂) − w_i(τ, υ₁))^T} = 0. So W_A(υ), υ ∈ [a, b], is a vector of mean-zero Gaussian random processes with independent increments.

Further, the covariance matrix of W_A(υ) is

\begin{matrix} E {w_{i} & (τ, υ) {(w_{i} (τ, υ))}^{T}} \\ = E {\int_{a}^{υ} \int_{0}^{τ} A (u) {[Z_{i} (t) - \frac{s^{(1)} (t, β (u))}{s^{(0)} (t, β (u))}]}^{\otimes 2} A (u) N_{i} (dt, du)} \\ = E {\int_{a}^{υ} \int_{0}^{τ} A (u) {[Z_{i} (t) - \frac{s^{(1)} (t, β (u))}{s^{(0)} (t, β (u))}]}^{\otimes 2} \\ \times A (u) y (t | Z_{i} (t)) λ (t, u | Z_{i} (t)) dt du} \\ = \int_{a}^{υ} A (u) E {{\int_{0}^{τ} [Z_{i} (t) - \frac{s^{(1)} (t, β (u))}{s^{(0)} (t, β (u))}]}^{\otimes 2} y (t | Z_{i} (t)) λ (t, u | Z_{i} (t)) dt} \\ \times A (u) du \\ = \int_{a}^{υ} A (u) \sum (u) A (u) du . \end{matrix}

This completes the proof of Theorem 1.

PROOF OF THEOREM 2

We shall prove Theorem 2 by verifying the conditions of Lemma A.1.

Let

\begin{matrix} η_{n} (u, θ) = n^{- 1} \sum_{i = 1}^{n} \int_{0}^{u} \int_{0}^{τ} [θ^{T} Z_{i} (t) - log (S^{(0)} (t, θ))] N_{i} (dt, du), \\ ξ_{n} (u, θ) = n^{- 1} \sum_{i = 1}^{n} \int_{0}^{u} \int_{0}^{τ} [θ^{T} Z_{i} (t) - log (s^{(0)} (t, θ))] N_{i} (dt, du), \\ Q_{n} (υ, θ) = n^{- 1} l (υ, θ) + n^{- 1} log n \int_{0}^{1} K_{h} (u - υ) N (τ, du) . \end{matrix}

Then by Condition A, η_n(υ, θ) = ξ_n(υ, θ) + O_p(n^−1/2) and

\begin{matrix} Q_{n} (υ, θ) & = \int_{0}^{1} K_{h} (u - υ) η_{n} (du, θ) \\ = \int_{0}^{1} K_{h} (u - υ) ξ_{n} (du, θ) + O_{p} (n^{- 1 / 2} h^{- 1}), \end{matrix}

uniformly in (υ, θ) ∈ [0, 1] × [−M, M], for M > 0. By application of the Glivenko–Cantelli and Donsker theorems, similarly to the proofs of Lemma A.2 and Theorem 1, ξ_n(υ, θ) = ξ(υ, θ) + O_p(n^−1/2), uniformly in (υ, θ) ∈ [0, 1] × [−M, M], with

ξ (υ, θ) = E [\int_{0}^{u} \int_{0}^{τ} [θ^{T} Z_{i} (t) - log (s^{(0)} (t, θ))] N_{i} (dt, du)] .

It follows that Q_n(υ, θ) = Q(υ, θ) + O_p(n^−1/2 h⁻¹), uniformly in (υ, θ) ∈ [a, b] × [−M, M], where

Q (υ, θ) = E [\int_{0}^{τ} [θ^{T} Z_{i} (t) - log (s^{(0)} (t, θ))] λ_{0} (t, υ) exp (β^{T} (υ) Z_{i} (t)) Y_{i} (t) dt] .

Now we show that β(υ) is the well-separated point of maximum of Q(υ, θ) for υ ∈ [0, 1]. Note that

\begin{matrix} \partial Q (υ, θ) / \partial θ & = E [\int_{0}^{τ} [Z_{i} (t) - \frac{s^{(1)} (t, θ)}{s^{(0)} (t, θ)}] λ_{0} (t, υ) exp (β^{T} (υ) Z_{i} (t)) Y_{i} (t) dt] \\ \partial^{2} Q (υ, θ) / \partial θ^{2} & = - E [\int_{0}^{τ} {\frac{s^{(2)} (t, θ)}{s^{(0)} (t, θ)} - {(\frac{s^{(1)} (t, θ)}{s^{(0)} (t, θ)})}^{\otimes 2}} \\ \times λ_{0} (t, υ) exp (β^{T} (υ) Z_{i} (t)) Y_{i} (t) dt] . \end{matrix}

We have ∂Q(υ, β(υ))/∂θ = 0, and for every ε > 0 there exists an η > 0 such that sup_{‖θ−β(υ)‖>ε} Q(υ, θ) < Q(υ, β(υ)) − η for υ ∈ [a, b], under condition (A.3), by Taylor expansion and continuity. Further, since $Q_{n} (υ, θ) \overset{P}{\to} Q (υ, θ), \partial Q_{n} (υ, θ) / \partial θ \overset{P}{\to} \partial Q (υ, θ) / \partial θ, and \partial^{2} Q_{n} (υ, θ) / \partial θ^{2} \overset{P}{\to} \partial^{2} Q (υ, θ) / \partial θ^{2}$ uniformly in (υ, θ) ∈ [a, b] × [−M, M], and −M̃ < β(υ) < M̃ for a ≤ υ ≤ b for some M̃ < M, we have for every α > 0 there exists an n₀ such that P(−M ≤ β̂(υ) ≤ M, a ≤ υ ≤ b) > 1 − α for n ≥ n₀.

Therefore, for every ε > 0,

\begin{matrix} P (sup_{a \leq υ \leq b} ‖ \hat{β} (υ) - β (υ) ‖ > ε) \\ \leq α + P (sup_{a \leq υ \leq b} ‖ \hat{β} (υ) - β (υ) ‖ > ε, - M \leq \hat{β} (υ) \leq M, a \leq υ \leq b) \\ \to α \end{matrix}

as n → ∞, by the previous checking of the conditions of Lemma A.1 together with Q_n(υ, β̂(υ)) ≥ Q_n(υ, β(υ)). Since α is arbitrary, we have P(sup_a≤υ≤b ‖β̂(υ) − β(υ)‖ > ε) → 0.

PROOF OF THEOREM 3

In the proof of this theorem, we set β = β(υ) for simplicity. Note that under Condition A, using a second-order Taylor expansion for λ(t, u|Z_i(t)) in the neighborhood of υ, we have

\begin{matrix} n^{- 1 / 2} | \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{S^{(1)} (t, β)}{S^{(0)} (t, β)}] Y_{i} (t) \\ \times [λ (t, υ | Z_{i} (t)) - λ (t, u | Z_{i} (t))] dt du | \\ = O_{p} (n^{1 / 2} h^{2}), \end{matrix}

uniformly in υ ∈ [0, 1]. It follows that

\begin{matrix} n^{- 1 / 2} U (υ, β) & = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{S^{(1)} (t, β)}{S^{(0)} (t, β)}] \\ \times [N_{i} (dt, du) - Y_{i} (t) λ (t, υ | Z_{i} (t)) dt du] \\ = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{S^{(1)} (t, β)}{S^{(0)} (t, β)}] M_{i} (dt, du) \\ + O_{p} (n^{1 / 2} h^{2}), \end{matrix}

uniformly in υ ∈ [0, 1].

Next, we show that for each υ, n^−1/2 h^1/2U(υ, β) converges weakly to a normal distribution. By Lemma A.2, n^−1/2 M(t, υ) converges weakly to a mean-zero Gaussian process. By Condition A, ‖S^(j)(t, β) − s^(j)(t, β)‖ = o_p(n^−1/2+δ), uniformly in t for j = 0, 1, for 0 < δ < 1/2. Note that n^−1/2+δ h^−1/2 = o(1) for δ = 1/4 as nh² → ∞. We have h^1/2 K_h(u − υ) ‖S^(j)(t, β) − s^(j)(t, β)‖ goes in probability to zero. Applying Lemma 2 of Gilbert, McKeague and Sun [6], we have

\begin{matrix} n^{- 1 / 2} h^{1 / 2} U (β (υ)) \\ = n^{- 1 / 2} h^{1 / 2} \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{s^{(1)} (t, β)}{s^{(0)} (t, β)}] \\ \times M_{i} (dt, du) + O_{p} (n^{1 / 2} h^{5 / 2}) + o_{p} (1) \\ = n^{- 1 / 2} h^{1 / 2} \sum_{i = 1}^{n} \int_{0}^{1} \int_{0}^{τ} K_{h} (u - υ) [Z_{i} (t) - \frac{s^{(1)} (t, β (u))}{s^{(0)} (t, β (u))}] \\ \times M_{i} (dt, du) + O_{p} (n^{1 / 2} h^{5 / 2}) + o_{p} (1) \\ = h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) {\tilde{W}}_{I} (du) + O_{p} (n^{1 / 2} h^{5 / 2}) + o_{p} (1), \end{matrix}

(16)

where W̃_I(υ) is defined in (6) with A = I and a = 0.

Since ${\tilde{W}}_{I} (υ) \overset{𝒟}{\to} W_{I} (υ)$ by Theorem 1, by the almost sure representation theorem ([17], page 47), there exist ${\tilde{W}}_{I}^{*} (υ) and W_{I}^{*} (υ)$ on some probability space that have the same distributions and sample paths as W̃_I(υ) and W_I(υ), respectively, such that ${\tilde{W}}_{I}^{*} (υ) \overset{a . s .}{\to} W_{I}^{*} (υ)$ uniformly in υ ∈ [0, 1]. Hence $\int_{0}^{1} K_{h} (u - υ) {\tilde{W}}_{I}^{*} (du) = \int_{0}^{1} K_{h} (u - υ) W_{I}^{*} (du) + O_{p} (n^{- 1 / 2} h^{- 1})$ by integration by parts since K(·) has bounded variation. It follows that

\begin{matrix} h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) {\tilde{W}}_{I} (du) & \overset{𝒟}{=} h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) {\tilde{W}}_{I}^{*} (du) \\ = h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) W_{I}^{*} (du) + O_{p} (n^{- 1 / 2} h^{- 1 / 2}) . \end{matrix}

Since $W_{I}^{*} (υ)$ is a Gaussian martingale with covariance matrix of $\int_{0}^{υ} \sum (u) du, and h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) W_{I}^{*} (du)$ is a mean-zero Gaussian random vector with covariance matrix equal to $h \int_{0}^{1} K_{h}^{2} (u - υ) \sum (u) du \to ν_{0} \sum (υ) as h \to 0$ . Hence, $h^{1 / 2} \int_{0}^{1} K_{h} (u - υ) {\tilde{W}}_{I} (du) \overset{𝒟}{\to} N (0, ν_{0} \sum (υ)) as h \to 0, nh \to \infty$ . By the Slut-sky theorem, n^−1/2h^1/2U(υ, β) converges weakly to N(0, ν₀∑(υ)) as nh² → ∞ and nh⁵ → 0.

Note that $U (\hat{β} (υ)) - U (β (υ)) = l_{β}^{″} (υ, β^{*} (υ)) (\hat{β} (υ) - β (υ))$ , where β^*(υ) is on the line segment between β̂(υ) and β(υ). By Condition A and the uniform consistency of β̂(υ) on υ ∈ [a, b] ⊂ (0, 1), we have $n^{- 1} l_{β}^{″} (υ, β^{*} (υ)) = - \sum (υ) + o_{p} (1)$ , uniformly in υ ∈ [a, b] for 0 < δ < 1/2. Hence,

\begin{matrix} n^{1 / 2} h^{1 / 2} (\hat{β} (υ) - β (υ)) & = - {(l_{β}^{″} (υ, β^{*} (υ)) / n)}^{- 1} n^{- 1 / 2} h^{1 / 2} U (β (υ)) \\ = {(\sum (υ))}^{- 1} n^{- 1 / 2} h^{1 / 2} U (β (υ)) + o_{p} (1), \end{matrix}

(17)

uniformly in υ ∈ [a, b]. It follows that ${(nh)}^{1 / 2} (\hat{β} (υ) - β (υ)) \overset{𝒟}{\to} N (0, ν_{0} {\sum (υ)}^{- 1})$ as nh² → ∞ and nh⁵ → 0.

PROOF OF THEOREM 4

From (16) and the first line of (17), we have, for υ ∈ [a, b],

\int_{a}^{υ} n^{1 / 2} (\hat{β} (u) - β (u)) du = - \int_{a}^{υ} {(\sum (u))}^{- 1} \int_{0}^{1} K_{h} (x - u) {\tilde{W}}_{I} (dx) du + o_{p} (1) .

Exchanging the order of integration and by the compact support of the kernel function K(·) on [−1, 1], we have

\begin{matrix} \int_{a}^{υ} n^{1 / 2} (\hat{β} (u) - β (u)) du \\ = - \int_{0}^{1} [\int_{a}^{υ} {(\sum (u))}^{- 1} K_{h} (x - u) du] {\tilde{W}}_{I} (dx) + o_{p} (1) \\ = - \int_{a + h}^{υ - h} [\int_{a}^{υ} {(\sum (u))}^{- 1} K_{h} (x - u) du] {\tilde{W}}_{I} (dx) \\ - \int_{a - h}^{a + h} [\int_{a}^{υ} {(\sum (u))}^{- 1} K_{h} (x - u) du] {\tilde{W}}_{I} (dx) \\ - \int_{υ - h}^{υ + h} [\int_{a}^{υ} {(\sum (u))}^{- 1} K_{h} (x - u) du] {\tilde{W}}_{I} (dx) + o_{p} (1) . \end{matrix}

(18)

By Theorem 1, the process W̃_I(x) converges weakly to a mean-zero Gaussian process with continuous paths. Under the assumption (A.4), $\int_{a}^{υ} {(\sum (u))}^{- 1} K_{h} (x - u) du$ has bounded variation and converges uniformly to ∑(x)⁻¹ for x ∈ (a + h, υ − h). By Lemma 2 of Gilbert, McKeague and Sun [6], the first term in (18) is $- \int_{a}^{υ} {(\sum (x))}^{- 1} {\tilde{W}}_{I} (dx) + o_{p} (1)$ . Similar arguments lead to the second and the third terms in (18) to be o_p(1). Hence

\begin{matrix} \int_{a}^{υ} n^{1 / 2} (\hat{β} (u) - β (u)) du & = - \int_{a}^{υ} {(\sum (x))}^{- 1} {\tilde{W}}_{I} (dx) + o_{p} (1) \\ = - {\tilde{W}}_{\sum - 1} (υ) + o_{p} (1), \end{matrix}

which converges weakly to a p-dimensional mean-zero Gaussian martingale, W_∑(υ)⁻¹(υ), with continuous paths. The covariance matrix of W_∑(υ)⁻¹(υ) equals to $Cov (W_{\sum^{- 1}} (υ)) = \int_{a}^{υ} \sum {(u)}^{- 1} \sum (u) \sum {(u)}^{- 1} du = \int_{a}^{υ} \sum {(u)}^{- 1} du$ .

Footnotes

Supported in part by NSF Grant DMS-06-4576, NIH Grant 2 RO1 AI054165-04 and funds provided by the University of North Carolina at Charlotte.

Supported in part by NIH Grant 2 RO1 AI054165-04.

Supported in part by NSF Grant DMS-0505201.

AMS 2000 subject classifications. Primary 62N01; secondary 62N02, 62N03, 62G20.

REFERENCES

1.Aalen OO, Johansen S. An empirical transition matrix for non-homogeneous Markov chains based on censored observations. Scand. J. Statist. 1978;5:141–150. MR0509450. [Google Scholar]
2.Brémaud P. Point Processes and Queues: Martingale Dynamics. New York: Springer; 1981. MR0636252. [Google Scholar]
3.Cai Z, Sun Y. Local linear estimation for time-dependent coefficients in Cox’s regression models. Scand. J. Statist. 2003;30:93–111. MR1963895. [Google Scholar]
4.Flynn NM, Forthal DN, Harro CD, Judson FN, Mayer KH, Para MF, Gilbert PB The RGP120 HIV Vaccine Study Group. Placebo-controlled phase 3 trial of recombinant glycoprotein 120 vaccine to prevent HIV-1 infection. J. Infectious Diseases. 2005;191:654–665. doi: 10.1086/428404. [DOI] [PubMed] [Google Scholar]
5.Gilbert PB, McKeague IW, Sun Y. Tests for comparing mark-specific hazards and cumulative incidence functions. Lifetime Data Anal. 2004;10:5–28. doi: 10.1023/b:lida.0000019253.69537.91. MR2058572. [DOI] [PubMed] [Google Scholar]
6.Gilbert PB, McKeague IW, Sun Y. The two-sample problem for failure rates depending on a continuous mark: An application to vaccine efficacy. Biostatistics. 2007 doi: 10.1093/biostatistics/kxm028. To appear. [DOI] [PubMed] [Google Scholar]
7.Graham BS. Clinical trials of HIV vaccines. Annual Review of Medicine. 2002;53:207–221. doi: 10.1146/annurev.med.53.082901.104035. [DOI] [PubMed] [Google Scholar]
8.Huang Y, Louis TA. Nonparametric estimation of the joint distribution of survival time and mark variables. Biometrika. 1998;85:785–798. MR1666750. [Google Scholar]
9.Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. New York: Wiley; 1980. MR0570114. [Google Scholar]
10.Lin DY, Wei LJ, Ying Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika. 1993;80:557–572. MR1248021. [Google Scholar]
11.Martinussen T, Scheike TH. Dynamic Regression Models for Survival Data. New York: Springer; 2006. MR2214443. [Google Scholar]
12.Nabel GJ. Challenges and opportunities for development of an AIDS vaccine. Nature. 2001;410:1002–1007. doi: 10.1038/35073500. [DOI] [PubMed] [Google Scholar]
13.Nickle DC, Heath L, Jensen MA, Gilbert PB, Kosakovsky Pond SLK, Mullins JI. Amino acid substitution matrices for HIV-1 subtype B. Technical report. Univ. Washington; 2005. [Google Scholar]
14.Olschewski M, Schumacher M. Statistical analysis of quality of life in cancer clinical trials. Statistics in Medicine. 1990;9:749–763. doi: 10.1002/sim.4780090705. [DOI] [PubMed] [Google Scholar]
15.Prentice RL, Kalbfleisch JD, Peterson AV, Fluornoy N, Farewell VT, Breslow NE. The analysis of failure time in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]
16.Schumacher M. Two-sample tests of Cramér–von Mises and Kolmogorov–Smirnov type for randomly censored data. Internat. Statist. Rev. 1984;52:263–281. MR0867175. [Google Scholar]
17.Shorack GR, Wellner JA. Empirical Processes with Applications to Statistics. New York: Wiley; 1986. MR0838963. [Google Scholar]
18.Sun Y, Wu H. Semiparametric time-varying coefficients regression model for longitudinal data. Scand. J. Statist. 2005;32:21–47. MR2136800. [Google Scholar]
19.Tsiatis AA. A nonidentifiability aspect of the problem of competing risks. Proc. Natl. Acad. Sci. USA. 1975;72:20–22. doi: 10.1073/pnas.72.1.20. MR0356425. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Van Der Vaart AW. Asymptotic Statistics. Cambridge Univ. Press; 1998. MR1652247. [Google Scholar]
21.UNAIDS. Joint United Nations Programme for HIV/AIDS. 2004 AIDS Epidemic Update. [PubMed]
22.Wyatt R, Kwong PD, Desjardins E, Sweet RW, Robinson J, Hendrickson WA, Sodroski JG. The antigenic structure of the HIV gp120 envelope glycoprotein. Nature. 1998;393:705–711. doi: 10.1038/31514. [DOI] [PubMed] [Google Scholar]

[R1] 1.Aalen OO, Johansen S. An empirical transition matrix for non-homogeneous Markov chains based on censored observations. Scand. J. Statist. 1978;5:141–150. MR0509450. [Google Scholar]

[R2] 2.Brémaud P. Point Processes and Queues: Martingale Dynamics. New York: Springer; 1981. MR0636252. [Google Scholar]

[R3] 3.Cai Z, Sun Y. Local linear estimation for time-dependent coefficients in Cox’s regression models. Scand. J. Statist. 2003;30:93–111. MR1963895. [Google Scholar]

[R4] 4.Flynn NM, Forthal DN, Harro CD, Judson FN, Mayer KH, Para MF, Gilbert PB The RGP120 HIV Vaccine Study Group. Placebo-controlled phase 3 trial of recombinant glycoprotein 120 vaccine to prevent HIV-1 infection. J. Infectious Diseases. 2005;191:654–665. doi: 10.1086/428404. [DOI] [PubMed] [Google Scholar]

[R5] 5.Gilbert PB, McKeague IW, Sun Y. Tests for comparing mark-specific hazards and cumulative incidence functions. Lifetime Data Anal. 2004;10:5–28. doi: 10.1023/b:lida.0000019253.69537.91. MR2058572. [DOI] [PubMed] [Google Scholar]

[R6] 6.Gilbert PB, McKeague IW, Sun Y. The two-sample problem for failure rates depending on a continuous mark: An application to vaccine efficacy. Biostatistics. 2007 doi: 10.1093/biostatistics/kxm028. To appear. [DOI] [PubMed] [Google Scholar]

[R7] 7.Graham BS. Clinical trials of HIV vaccines. Annual Review of Medicine. 2002;53:207–221. doi: 10.1146/annurev.med.53.082901.104035. [DOI] [PubMed] [Google Scholar]

[R8] 8.Huang Y, Louis TA. Nonparametric estimation of the joint distribution of survival time and mark variables. Biometrika. 1998;85:785–798. MR1666750. [Google Scholar]

[R9] 9.Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. New York: Wiley; 1980. MR0570114. [Google Scholar]

[R10] 10.Lin DY, Wei LJ, Ying Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika. 1993;80:557–572. MR1248021. [Google Scholar]

[R11] 11.Martinussen T, Scheike TH. Dynamic Regression Models for Survival Data. New York: Springer; 2006. MR2214443. [Google Scholar]

[R12] 12.Nabel GJ. Challenges and opportunities for development of an AIDS vaccine. Nature. 2001;410:1002–1007. doi: 10.1038/35073500. [DOI] [PubMed] [Google Scholar]

[R13] 13.Nickle DC, Heath L, Jensen MA, Gilbert PB, Kosakovsky Pond SLK, Mullins JI. Amino acid substitution matrices for HIV-1 subtype B. Technical report. Univ. Washington; 2005. [Google Scholar]

[R14] 14.Olschewski M, Schumacher M. Statistical analysis of quality of life in cancer clinical trials. Statistics in Medicine. 1990;9:749–763. doi: 10.1002/sim.4780090705. [DOI] [PubMed] [Google Scholar]

[R15] 15.Prentice RL, Kalbfleisch JD, Peterson AV, Fluornoy N, Farewell VT, Breslow NE. The analysis of failure time in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]

[R16] 16.Schumacher M. Two-sample tests of Cramér–von Mises and Kolmogorov–Smirnov type for randomly censored data. Internat. Statist. Rev. 1984;52:263–281. MR0867175. [Google Scholar]

[R17] 17.Shorack GR, Wellner JA. Empirical Processes with Applications to Statistics. New York: Wiley; 1986. MR0838963. [Google Scholar]

[R18] 18.Sun Y, Wu H. Semiparametric time-varying coefficients regression model for longitudinal data. Scand. J. Statist. 2005;32:21–47. MR2136800. [Google Scholar]

[R19] 19.Tsiatis AA. A nonidentifiability aspect of the problem of competing risks. Proc. Natl. Acad. Sci. USA. 1975;72:20–22. doi: 10.1073/pnas.72.1.20. MR0356425. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Van Der Vaart AW. Asymptotic Statistics. Cambridge Univ. Press; 1998. MR1652247. [Google Scholar]

[R21] 21.UNAIDS. Joint United Nations Programme for HIV/AIDS. 2004 AIDS Epidemic Update. [PubMed]

[R22] 22.Wyatt R, Kwong PD, Desjardins E, Sweet RW, Robinson J, Hendrickson WA, Sodroski JG. The antigenic structure of the HIV gp120 envelope glycoprotein. Nature. 1998;393:705–711. doi: 10.1038/31514. [DOI] [PubMed] [Google Scholar]

PERMALINK

PROPORTIONAL HAZARDS MODELS WITH CONTINUOUS MARKS

Yanqing Sun

Peter B Gilbert

Ian W McKeague

Abstract

1. Introduction

2. Mark-specific proportional hazards model

2.1. Local partial likelihood

2.2. Asymptotic results

CONDITION A

THEOREM 1

THEOREM 2

THEOREM 3

THEOREM 4

2.3. Confidence bands for vaccine efficacy

2.4. Testing vaccine efficacy

3. Simulation study

TABLE 1.

TABLE 2.

FIG. 1.

FIG. 2.

FIG. 3.

TABLE 3.

4. Application

FIG. 4.

FIG. 5.

FIG. 6.

5. Discussion

Acknowledgments

APPENDIX

LEMMA A.1

PROOF

LEMMA A.2

PROOF

PROOF OF THEOREM 1

PROOF OF THEOREM 2

PROOF OF THEOREM 3

PROOF OF THEOREM 4

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases